Win a copy of Re-engineering Legacy Software this week in the Refactoring forum
or Docker in Action in the Cloud/Virtualization forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

org.apache.poi.poifs.filesystem.OfficeXmlFileException while reading .docx file

 
manu chandra
Greenhorn
Posts: 15
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi ,

I am trying to read a .docx format file . But i am getting an error like :

org.apache.poi.poifs.filesystem.OfficeXmlFileException: The supplied data appears to be in the Office 2007+ XML. POI only supports OLE2 Office documents

But it suggest to use xssf instead of hssf but my doubt is xssf is used for reading excel files..
I am not reading any excel here but i want to read .docx and .doc files..

Can any one please help?
 
Ulf Dittmer
Rancher
Pie
Posts: 42967
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I don't know what the "it" is that recommends to use XSSF, but the API to use for reading .docx is XWPF, whereas for .doc it is HWPF. I don't think there is a combined API that can read both formats (like there is the SS API that combines HSSF and XSSF).
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic