This week's book giveaway is in the Java 8 forum.
We're giving away four copies of Java 8 in Action and have Raoul-Gabriel Urma, Mario Fusco, and Alan Mycroft on-line!
See this thread for details.
The moose likes Java in General and the fly likes converting PDF to xml using JasperReport or Apache FOP Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of Java 8 in Action this week in the Java 8 forum!
JavaRanch » Java Forums » Java » Java in General
Bookmark "converting PDF to xml using JasperReport or Apache FOP " Watch "converting PDF to xml using JasperReport or Apache FOP " New topic
Author

converting PDF to xml using JasperReport or Apache FOP

ks goh
Greenhorn

Joined: Jan 19, 2005
Posts: 13
hi all

i got a PDF/RTF template with pictures, which i wan to convert to xml
so that i can use JapserReport or Apache FOP to generate
PDF.

can anyone help me how do i convert PDF to xml?
thanks
[ October 13, 2007: Message edited by: ks goh ]
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 39578
    
  27
PDF is -for the most part- a read-only format. While some limited alterations are possible, you can't extract the layout information. It's possible to extract the text they contain using libraries such as PDFBox and JPedal.


Ping & DNS - updated with new look and Ping home screen widget
marc weber
Sheriff

Joined: Aug 31, 2004
Posts: 11343

This is a bit advanced for the beginners forum, so I'm promoting it to the intermediate forum.


"We're kind of on the level of crossword puzzle writers... And no one ever goes to them and gives them an award." ~Joe Strummer
sscce.org
Peter Chase
Ranch Hand

Joined: Oct 30, 2001
Posts: 1970
See this thread where the issue is discussed.

In short, what you want to do is probably near-impossible.


Betty Rubble? Well, I would go with Betty... but I'd be thinking of Wilma.
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: converting PDF to xml using JasperReport or Apache FOP
 
Similar Threads
Xml + Xsl to PDF
PDF Creation
How to parse PDF File
Styles in iText
want to send pdf file to client from server. pdf is generated at server