jQuery in Action, 2nd edition
The moose likes Java in General and the fly likes Document Conversion Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login
JavaRanch » Java Forums » Java » Java in General
Reply Bookmark "Document Conversion" Watch "Document Conversion" New topic
Author

Document Conversion

Sahil Sharma
Ranch Hand

Joined: Aug 27, 2003
Posts: 152
Hi,

I have a requirement where i have to convert a document(doc,pdf) into an xml file. Are there any api's available that can help me to achieve this?

Thanks
Prafulla N. Patil
Ranch Hand

Joined: Mar 20, 2006
Posts: 106
not that much clear about your requirement, is there is xml file in Doc or PDF format and you want to create .xml file from that ??

Apache POI - HWPF - Java API to Handle Microsoft Word Files can help you with reading word files and then you can use XML generation with JAVA to create XML files.


Thanks, Prafulla Patil
SCJP, SCWCD
Sahil Sharma
Ranch Hand

Joined: Aug 27, 2003
Posts: 152
the document will contain normal contents or images but in a particular format. e.g. [Heading, sub-heading, paragraphs etc]
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 35224
    
    7
PDFs are tough; the best you will be able to do is to extract any text they contain, but structural information will be lost (unless you're prepared to invest a lot of time).


Android appsImageJ pluginsJava web charts
 
I agree. Here's the link: http://ej-technologies/jprofiler - if it wasn't for jprofiler, we would need to run our stuff on 16 servers instead of 3.
 
subject: Document Conversion
 
Similar Threads
Export document like a form
reading image in a document
null pointer exception
Validating against multiple schemas
Converting Document Object to String