This week's book giveaways are in the Java EE and JavaScript forums.
We're giving away four copies each of The Java EE 7 Tutorial Volume 1 or Volume 2(winners choice) and jQuery UI in Action and have the authors on-line!
See this thread and this one for details.
The moose likes Servlets and the fly likes java code to extract EMBED pdf from doc file Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of The Java EE 7 Tutorial Volume 1 or Volume 2 this week in the Java EE forum
or jQuery UI in Action in the JavaScript forum!
JavaRanch » Java Forums » Java » Servlets
Bookmark "java code to extract EMBED pdf from doc file" Watch "java code to extract EMBED pdf from doc file" New topic
Author

java code to extract EMBED pdf from doc file

Sunil Baboo
Greenhorn

Joined: Aug 12, 2010
Posts: 16
How can we extract embed pdf file from doc file using java.


Here's a link to one of these Word files that has documents embedded into it:
http://www.seattle.gov/purchasing/docs/bids/ITBCTY11150.doc

Is there a way to get the Java code to extract these embedded files? Like extracting files from a zip file?

Any suggestion worth lot to me.
Thanks in advanced.
Lester Burnham
Rancher

Joined: Oct 14, 2008
Posts: 1337
You best bet is probably the (open source) Apache POI library, or the (commercial) Aspose stuff. Those are about the only Java libraries that have initimate knowledge of the DOC/DOCX formats.

Using OpenOffice in server mode -and accessing it from a Java client- could be another possibility, assuming it has a way to extract embedded file programmatically.
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: java code to extract EMBED pdf from doc file