• Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

converting PDF to xml using JasperReport or Apache FOP

 
ks goh
Greenhorn
Posts: 13
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi all

i got a PDF/RTF template with pictures, which i wan to convert to xml
so that i can use JapserReport or Apache FOP to generate
PDF.

can anyone help me how do i convert PDF to xml?
thanks
[ October 13, 2007: Message edited by: ks goh ]
 
Ulf Dittmer
Rancher
Pie
Posts: 42966
73
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
PDF is -for the most part- a read-only format. While some limited alterations are possible, you can't extract the layout information. It's possible to extract the text they contain using libraries such as PDFBox and JPedal.
 
marc weber
Sheriff
Posts: 11343
Java Mac Safari
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
This is a bit advanced for the beginners forum, so I'm promoting it to the intermediate forum.
 
Peter Chase
Ranch Hand
Posts: 1970
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
See this thread where the issue is discussed.

In short, what you want to do is probably near-impossible.
 
It is sorta covered in the JavaRanch Style Guide.
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic