A friendly place for programming greenhorns!
Big Moose Saloon
Register / Login
Java in General
Reading contents from microsoft word document
Joined: Sep 28, 2006
Dec 21, 2006 22:21:00
my need is to read the microsoft word document
and print it in the console while doing that
i faced a problem . iam getting some ascii characters that are
not present in the document. when i do the same thing with
text (*.txt) file things are fine
Joined: Sep 26, 2003
Dec 22, 2006 00:08:00
I think you should have a look at the POI (apache) framework.
Joined: Mar 22, 2005
Dec 22, 2006 00:50:00
s contain many characters that are not part of the actual text (e.g., layout information and such). If you just want the text, use POI as suggested.
explains how it can be used for text extraction.
I agree. Here's the link:
subject: Reading contents from microsoft word document
question on FileInputStream
Generate Microsoft Word Document using JSP
Generating a word document
Writing a Microsoft Word Doc using java
Download to word and making word non-editable
All times are in JavaRanch time: GMT-6 in summer, GMT-7 in winter
| Powered by
Copyright © 1998-2015