File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Other Open Source Projects and the fly likes POI Word 2007+ .docx Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "POI Word 2007+ .docx" Watch "POI Word 2007+ .docx" New topic

POI Word 2007+ .docx

Dan Evan

Joined: Nov 08, 2010
Posts: 2
Hi All,

I have been trying for a day or so now to find an example of importing a .docx file for analysis of text

I have been unable to achieve it so far and have received many different errors, i fear it is something simple that is wrong but i have looked at it for so long it is all a blur now I have attached the code, any suggestions will be greatly appreciated

EDIT: POI is 3.7 with OOXML and Schemas

Lester Burnham

Joined: Oct 14, 2008
Posts: 1337
The XWPFWordExtractor class has no constructor that takes an InputStream; where did you get the idea that it does? You'll need to pass it an XWPFDocument (which does have such a constructor).
Dan Evan

Joined: Nov 08, 2010
Posts: 2

I looked further in to it and have posted the example as there are few i could find online

I agree. Here's the link:
subject: POI Word 2007+ .docx
It's not a secret anymore!