This week's giveaway is in the EJB and other Java EE Technologies forum.
We're giving away four copies of EJB 3 in Action and have Debu Panda, Reza Rahman, Ryan Cuprak, and Michael Remijan on-line!
See this thread for details.
The moose likes Beginning Java and the fly likes guidance Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

Win a copy of EJB 3 in Action this week in the EJB and other Java EE Technologies forum!
JavaRanch » Java Forums » Java » Beginning Java
Bookmark "guidance" Watch "guidance" New topic


rishi reddy
Ranch Hand

Joined: Jun 06, 2006
Posts: 30

i have information in the pdf files as well as word document. now i need to convert this pdf file/word document information into full text file.

could any one let me know how to do this?

Scott Selikoff
Saloon Keeper

Joined: Oct 23, 2005
Posts: 3697

You need to use a library jar to convert it. There are some free ones available like iText and others that are commercial. Google would be the best, just search "java pdf library"

My Blog: Down Home Country Coding with Scott Selikoff
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 39544
The AccessingFileFormats wiki page has a bunch of information on the subject. Look into Jakarta POI for converting DOC to text, and JPedal or PDFTextStream for extracting text from PDFs.

By the way, you should make the topic of your posts more descriptive - "guidance" conveys nothing.
[ June 16, 2006: Message edited by: Ulf Dittmer ]

Ping & DNS - updated with new look and Ping home screen widget
I agree. Here's the link:
subject: guidance
Similar Threads
Question on JFrame
Create pdf like word merge
How to generate thumbnail of word/excel document
Need a java code for convert PDF to Word document as well as Word document to XML. Is this possible
IFrame and word document