File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Beginning Java and the fly likes guidance Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Beginning Java
Bookmark "guidance" Watch "guidance" New topic


rishi reddy
Ranch Hand

Joined: Jun 06, 2006
Posts: 30

i have information in the pdf files as well as word document. now i need to convert this pdf file/word document information into full text file.

could any one let me know how to do this?

Scott Selikoff
Saloon Keeper

Joined: Oct 23, 2005
Posts: 3749

You need to use a library jar to convert it. There are some free ones available like iText and others that are commercial. Google would be the best, just search "java pdf library"

[OCA 8 Book] [Blog]
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42959
The AccessingFileFormats wiki page has a bunch of information on the subject. Look into Jakarta POI for converting DOC to text, and JPedal or PDFTextStream for extracting text from PDFs.

By the way, you should make the topic of your posts more descriptive - "guidance" conveys nothing.
[ June 16, 2006: Message edited by: Ulf Dittmer ]
I agree. Here's the link:
subject: guidance
jQuery in Action, 3rd edition