File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes Java in General and the fly likes Reading contents from microsoft word document Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Java in General
Bookmark "Reading contents from microsoft word document" Watch "Reading contents from microsoft word document" New topic
Author

Reading contents from microsoft word document

Amirtharaj Chinnaraj
Ranch Hand

Joined: Sep 28, 2006
Posts: 236
hi guys

my need is to read the microsoft word document

and print it in the console while doing that

i faced a problem . iam getting some ascii characters that are

not present in the document. when i do the same thing with

text (*.txt) file things are fine
jeroen dijkmeijer
Ranch Hand

Joined: Sep 26, 2003
Posts: 131
I think you should have a look at the POI (apache) framework.
regards,
Jeroen.
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 41114
    
  45
.doc files contain many characters that are not part of the actual text (e.g., layout information and such). If you just want the text, use POI as suggested. This page explains how it can be used for text extraction.


Ping & DNS - my free Android networking tools app
 
 
subject: Reading contents from microsoft word document
 
Similar Threads
Generating a word document
Writing a Microsoft Word Doc using java
Download to word and making word non-editable
Generate Microsoft Word Document using JSP
question on FileInputStream