This week's book giveaway is in the OO, Patterns, UML and Refactoring forum.
We're giving away four copies of Refactoring for Software Design Smells: Managing Technical Debt and have Girish Suryanarayana, Ganesh Samarthyam & Tushar Sharma on-line!
See this thread for details.
The moose likes Java in General and the fly likes Read a word document Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

JavaRanch » Java Forums » Java » Java in General
Bookmark "Read a word document" Watch "Read a word document" New topic

Read a word document

Ram Kas
Ranch Hand

Joined: Jul 26, 2006
Posts: 83

I want t o read a word document and print the contents to console. But, when I do it as if I were doing it with text files, it displays some weird characters. Can anyone throw some light on how I should proceed?

Thanks in advance.

Dinakar Kasturi.
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42958
DOC is a binary file format; you can't treat it like you would treat text files. An API that can extract the text from a doc file is Jakarta POI; you can find some usage examples here.
I agree. Here's the link:
subject: Read a word document
It's not a secret anymore!