File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Java in General and the fly likes Read a word document Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Java in General
Bookmark "Read a word document" Watch "Read a word document" New topic

Read a word document

Ram Kas
Ranch Hand

Joined: Jul 26, 2006
Posts: 83

I want t o read a word document and print the contents to console. But, when I do it as if I were doing it with text files, it displays some weird characters. Can anyone throw some light on how I should proceed?

Thanks in advance.

Dinakar Kasturi.
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42965
DOC is a binary file format; you can't treat it like you would treat text files. An API that can extract the text from a doc file is Jakarta POI; you can find some usage examples here.
I agree. Here's the link:
subject: Read a word document
It's not a secret anymore!