aspose file tools
The moose likes Java in General and the fly likes Reading contents from microsoft word document Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login


JavaRanch » Java Forums » Java » Java in General
Reply Bookmark "Reading contents from microsoft word document" Watch "Reading contents from microsoft word document" New topic
Author

Reading contents from microsoft word document

Amirtharaj Chinnaraj
Ranch Hand

Joined: Sep 28, 2006
Posts: 215
hi guys

my need is to read the microsoft word document

and print it in the console while doing that

i faced a problem . iam getting some ascii characters that are

not present in the document. when i do the same thing with

text (*.txt) file things are fine
jeroen dijkmeijer
Ranch Hand

Joined: Sep 26, 2003
Posts: 131
I think you should have a look at the POI (apache) framework.
regards,
Jeroen.
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 35429
    
    9
.doc files contain many characters that are not part of the actual text (e.g., layout information and such). If you just want the text, use POI as suggested. This page explains how it can be used for text extraction.


Android appsImageJ pluginsJava web charts
 
I agree. Here's the link: http://zeroturnaround.com/jrebel - it saves me about five hours per week
 
subject: Reading contents from microsoft word document
 
Similar Threads
Download to word and making word non-editable
Generate Microsoft Word Document using JSP
question on FileInputStream
Writing a Microsoft Word Doc using java
Generating a word document