File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes Other Open Source Projects and the fly likes Can  the PDF file be  parsed and converted into user defined formats using Solr Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login


Win a copy of The Mikado Method this week in the Agile and other Processes forum!
JavaRanch » Java Forums » Products » Other Open Source Projects
Reply Bookmark "Can  the PDF file be  parsed and converted into user defined formats using Solr" Watch "Can  the PDF file be  parsed and converted into user defined formats using Solr" New topic
Author

Can the PDF file be parsed and converted into user defined formats using Solr

Matt Scott
Greenhorn

Joined: Dec 31, 2009
Posts: 4
Can the PDF file be parsed and converted into user defined formats using Solr
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 35241
    
    7
Solr can index PDFs and subsequently search them; it can not convert PDFs to any other format (nor can it do that for any other format - that's just not what Solr does). But PDFs in general are not amenable to format conversion.


Android appsImageJ pluginsJava web charts
Nick Phillips
Greenhorn

Joined: Mar 26, 2010
Posts: 1
Parsing & conversion of a PDF file to plain text / user defined format depends on its configuration. Solr does follow this mechanism before indexing it , try this link to for further details, http://www.lucidimagination.com/search/?q=Can++the+PDF+file+be++parsed+and+converted
Martijn Verburg
author
Bartender

Joined: Jun 24, 2003
Posts: 3268

Hi Nick and Welcome to Javaranch!


Cheers, Martijn - Blog,
Twitter, PCGen, Ikasan, My The Well-Grounded Java Developer book!,
My start-up.
 
I agree. Here's the link: http://zeroturnaround.com/jrebel - it saves me about five hours per week
 
subject: Can the PDF file be parsed and converted into user defined formats using Solr
 
Similar Threads
Read and Write PDF file using RandomAccessFile class
OpenOffice File Conversion formats
convertions to pdf
Ideas/inputs for basic design for searching topics from set of pdf books
jsp to pdf