A friendly place for programming greenhorns!
Big Moose Saloon
Register / Login
Win a copy of
Elasticsearch in Action
this week in the
java code to extract EMBED pdf from doc file
Joined: Aug 12, 2010
Aug 12, 2010 22:48:25
How can we extract embed
Here's a link to one of these word files that has documents embedded into it:
Is there a way to get the Java code to extract these embedded files? Like extracting files from a zip file?
Any suggestion worth lot to me.
Thanks in advanced.
Joined: Oct 14, 2008
Aug 12, 2010 23:53:32
You best bet is probably the (open source) Apache POI library, or the (commercial) Aspose stuff. Those are about the only Java libraries that have initimate knowledge of the DOC/DOCX formats.
Using OpenOffice in server mode -and accessing it from a Java client- could be another possibility, assuming it has a way to extract embedded file programmatically.
I agree. Here's the link:
subject: java code to extract EMBED pdf from doc file
Embeded doc in word/java api to create word file
Extracting images and figures from Word Doc
convert word documents to PDF
All times are in JavaRanch time: GMT-6 in summer, GMT-7 in winter
| Powered by
Copyright © 1998-2015