File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Other JSE/JEE APIs and the fly likes Text Extraction from Word Document Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Other JSE/JEE APIs
Bookmark "Text Extraction from Word Document" Watch "Text Extraction from Word Document" New topic

Text Extraction from Word Document

ashu Suri

Joined: Oct 22, 2008
Posts: 18
I want to extract text from MS word files through Java.
I am not able to use POI properly.
Though many forums say that its easy to extract text using POI. In case anybody can help me
on this. I would be thankful.
Thanks in advance
Dawn Charangat
Ranch Hand

Joined: Apr 26, 2007
Posts: 249
Think we already had a discussion in the ranch on this..

Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42959
I am not able to use POI properly.

What does this mean? How are you using it, and what are the results?

Note that the POI HWPF Quick Guide explicitly mentions how to extract text from doc files.
I agree. Here's the link:
subject: Text Extraction from Word Document
It's not a secret anymore!