aspose file tools*
The moose likes Servlets and the fly likes java code to extract EMBED pdf from doc file Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Servlets
Bookmark "java code to extract EMBED pdf from doc file" Watch "java code to extract EMBED pdf from doc file" New topic
Author

java code to extract EMBED pdf from doc file

Sunil Baboo
Greenhorn

Joined: Aug 12, 2010
Posts: 16
How can we extract embed pdf file from doc file using java.


Here's a link to one of these word files that has documents embedded into it:
http://www.seattle.gov/purchasing/docs/bids/ITBCTY11150.doc

Is there a way to get the Java code to extract these embedded files? Like extracting files from a zip file?

Any suggestion worth lot to me.
Thanks in advanced.
Lester Burnham
Rancher

Joined: Oct 14, 2008
Posts: 1337
You best bet is probably the (open source) Apache POI library, or the (commercial) Aspose stuff. Those are about the only Java libraries that have initimate knowledge of the DOC/DOCX formats.

Using OpenOffice in server mode -and accessing it from a Java client- could be another possibility, assuming it has a way to extract embedded file programmatically.
 
 
subject: java code to extract EMBED pdf from doc file