aspose file tools*
The moose likes Java in General and the fly likes converting PDF to xml using JasperReport or Apache FOP Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Java in General
Bookmark "converting PDF to xml using JasperReport or Apache FOP " Watch "converting PDF to xml using JasperReport or Apache FOP " New topic
Author

converting PDF to xml using JasperReport or Apache FOP

ks goh
Greenhorn

Joined: Jan 19, 2005
Posts: 13
hi all

i got a PDF/RTF template with pictures, which i wan to convert to xml
so that i can use JapserReport or Apache FOP to generate
PDF.

can anyone help me how do i convert PDF to xml?
thanks
[ October 13, 2007: Message edited by: ks goh ]
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 41904
    
  63
PDF is -for the most part- a read-only format. While some limited alterations are possible, you can't extract the layout information. It's possible to extract the text they contain using libraries such as PDFBox and JPedal.


Ping & DNS - my free Android networking tools app
marc weber
Sheriff

Joined: Aug 31, 2004
Posts: 11343

This is a bit advanced for the beginners forum, so I'm promoting it to the intermediate forum.


"We're kind of on the level of crossword puzzle writers... And no one ever goes to them and gives them an award." ~Joe Strummer
sscce.org
Peter Chase
Ranch Hand

Joined: Oct 30, 2001
Posts: 1970
See this thread where the issue is discussed.

In short, what you want to do is probably near-impossible.


Betty Rubble? Well, I would go with Betty... but I'd be thinking of Wilma.
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: converting PDF to xml using JasperReport or Apache FOP