This week's book giveaway is in the Java 8 forum.
We're giving away four copies of Java 8 in Action and have Raoul-Gabriel Urma, Mario Fusco, and Alan Mycroft on-line!
See this thread for details.
The moose likes Other Open Source Projects and the fly likes POI Word 2007+ .docx Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of Java 8 in Action this week in the Java 8 forum!
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "POI Word 2007+ .docx" Watch "POI Word 2007+ .docx" New topic
Author

POI Word 2007+ .docx

Dan Evan
Greenhorn

Joined: Nov 08, 2010
Posts: 2
Hi All,

I have been trying for a day or so now to find an example of importing a .docx file for analysis of text

I have been unable to achieve it so far and have received many different errors, i fear it is something simple that is wrong but i have looked at it for so long it is all a blur now I have attached the code, any suggestions will be greatly appreciated

EDIT: POI is 3.7 with OOXML and Schemas




Thanks
Lester Burnham
Rancher

Joined: Oct 14, 2008
Posts: 1337
The XWPFWordExtractor class has no constructor that takes an InputStream; where did you get the idea that it does? You'll need to pass it an XWPFDocument (which does have such a constructor).
Dan Evan
Greenhorn

Joined: Nov 08, 2010
Posts: 2
Thanks...

I looked further in to it and have posted the example as there are few i could find online



 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: POI Word 2007+ .docx
 
Similar Threads
A small error relating with return
java.io.FileNotFoundException
Convert .doc file to .txt file
Apache POI 2007 word documents
InvalidFormatException