guidance

rishi reddy
Ranch Hand

Joined: Jun 06, 2006
Posts: 30
Hi,

I have information in PDF files as well as Word documents. I now need to convert the contents of these PDF files and Word documents into plain text files.

Could anyone let me know how to do this?

rishi
Scott Selikoff
author
Saloon Keeper

Joined: Oct 23, 2005
Posts: 3740

You need a library (a jar on your classpath) to do the conversion. Some are free, such as iText, and others are commercial. A web search is the best place to start: just search for "java pdf library".


Ulf Dittmer
Rancher

Joined: Mar 22, 2005
Posts: 42958
The AccessingFileFormats wiki page has a bunch of information on the subject. Look into Jakarta POI for converting DOC to text, and JPedal or PDFTextStream for extracting text from PDFs.
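
Whichever libraries you end up with, the surrounding code tends to look the same: pick an extractor by file extension, get the text, and write it out as a .txt file. Below is a minimal sketch of that wiring using only the JDK. TextConverter, TextExtractor, register and convert are hypothetical names made up for this illustration, not part of POI, JPedal, PDFTextStream or iText; the actual extraction call would go inside a TextExtractor that wraps the library you choose.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class TextConverter {

    /** Hypothetical hook: each implementation would wrap one third-party library. */
    public interface TextExtractor {
        String extract(Path source) throws IOException;
    }

    // Maps a lower-case file extension such as "pdf" or "doc" to its extractor.
    private final Map<String, TextExtractor> extractors = new HashMap<>();

    public void register(String extension, TextExtractor extractor) {
        extractors.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    /** Returns the part after the last dot, lower-cased, or "" if there is no dot. */
    static String extensionOf(Path file) {
        String name = file.getFileName().toString();
        int dot = name.lastIndexOf('.');
        return dot < 0 ? "" : name.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Extracts the text of source and writes it next to it as a UTF-8 .txt file. */
    public Path convert(Path source) throws IOException {
        TextExtractor extractor = extractors.get(extensionOf(source));
        if (extractor == null) {
            throw new IllegalArgumentException("No extractor registered for " + source);
        }
        String text = extractor.extract(source);

        // Swap the original extension for ".txt", e.g. report.pdf -> report.txt.
        String name = source.getFileName().toString();
        int dot = name.lastIndexOf('.');
        String base = dot < 0 ? name : name.substring(0, dot);
        Path target = source.resolveSibling(base + ".txt");

        Files.write(target, text.getBytes(StandardCharsets.UTF_8));
        return target;
    }
}
```

You would register one extractor per format, for example converter.register("doc", myDocExtractor), and then call converter.convert(path) for each file. Keeping the library call behind a small interface also makes it easy to try a different PDF library later without touching the rest of the code.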

By the way, you should make the topic of your posts more descriptive - "guidance" conveys nothing.
[ June 16, 2006: Message edited by: Ulf Dittmer ]
 
I’ve looked at a lot of different solutions, and in my humble opinion Aspose is the way to go. Here’s the link: http://aspose.com
 