This week's book giveaway is in the OCAJP 8 forum.
We're giving away four copies of OCA Java SE 8 Programmer I Study Guide and have Edward Finegan & Robert Liguori on-line!
See this thread for details.
The moose likes Beginning Java and the fly likes guidance Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

Win a copy of OCA Java SE 8 Programmer I Study Guide this week in the OCAJP 8 forum!
JavaRanch » Java Forums » Java » Beginning Java
Bookmark "guidance" Watch "guidance" New topic


rishi reddy
Ranch Hand

Joined: Jun 06, 2006
Posts: 30

i have information in the pdf files as well as word document. now i need to convert this pdf file/word document information into full text file.

could any one let me know how to do this?

Scott Selikoff
Saloon Keeper

Joined: Oct 23, 2005
Posts: 3749

You need to use a library jar to convert it. There are some free ones available like iText and others that are commercial. Google would be the best, just search "java pdf library"

[OCA 8 Book] [Blog]
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42958
The AccessingFileFormats wiki page has a bunch of information on the subject. Look into Jakarta POI for converting DOC to text, and JPedal or PDFTextStream for extracting text from PDFs.

By the way, you should make the topic of your posts more descriptive - "guidance" conveys nothing.
[ June 16, 2006: Message edited by: Ulf Dittmer ]
I agree. Here's the link:
subject: guidance
It's not a secret anymore!