I have a running Spring REST service that uses the Apache Tika libraries, currently version 2.9.1.
At some point while adding other features (which may be the cause), the DOCX portion of the Tika extraction stopped working. The same basic code works fine in a standalone (non-REST) Maven project, so I'm a bit baffled.
Having spent about 8 hours on this, I thought I'd ask the community whether anyone has run into this issue with DOCX files in Tika.
The error generated in the Spring Boot REST service is:
"TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser"
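The message above looks like a wrapper around an underlying IOException, so the nested cause is probably the more informative part. A minimal sketch of a helper that prints the whole cause chain follows. The class name CauseChain and its describe method are hypothetical and not part of the original project, and the exceptions in main are stand-ins rather than real Tika output.

```java
import java.io.IOException;

// Hypothetical helper (not from the original project): walks the cause chain
// of an exception so the exception behind a wrapper message becomes visible.
final class CauseChain {
    private CauseChain() {}

    // Returns one line per exception in the chain, indented by nesting depth.
    static String describe(Throwable error) {
        StringBuilder out = new StringBuilder();
        int depth = 0;
        // Cap the depth to guard against cyclic cause chains.
        for (Throwable t = error; t != null && depth < 20; t = t.getCause(), depth++) {
            if (depth > 0) {
                out.append(System.lineSeparator());
            }
            out.append("  ".repeat(depth))
               .append(t.getClass().getName())
               .append(": ")
               .append(t.getMessage());
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Stand-in exceptions for illustration only; in the service this would
        // be the exception caught around the Tika parse call.
        Exception wrapper = new Exception("wrapper message",
                new IOException("underlying cause (stand-in)"));
        System.out.println(describe(wrapper));
    }
}
```

Calling describe on the exception caught in the Service method and logging the result should show which IOException the OOXML parser actually hit.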
Below is the test code that works in the standalone Spring project. The REST project uses the same code, spread across Spring's Controller and Service methods, but there it fails.
In the REST project where the error occurs, the data are POSTed rather than read from a disk path, but XLSX, PDF, and TXT all work fine. Only DOCX fails.
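Since the REST project receives the file as POSTed bytes rather than from a disk path, one thing that can be checked is whether the bytes reaching the parser are still an intact DOCX container. A DOCX file is a ZIP archive, so the following is a minimal sketch of such a check using only java.util.zip. The class DocxBytesCheck and its methods are hypothetical, not part of the original test code, and the sketch assumes the uploaded bytes are available as a byte[].

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical diagnostic (not from the original project): checks whether
// uploaded bytes still look like a DOCX, i.e. a ZIP archive with the usual parts.
final class DocxBytesCheck {
    private DocxBytesCheck() {}

    // A non-empty ZIP archive starts with the local file header signature "PK\3\4".
    static boolean hasZipSignature(byte[] data) {
        return data != null && data.length >= 4
                && data[0] == 'P' && data[1] == 'K' && data[2] == 3 && data[3] == 4;
    }

    // Lists the entry names found when reading the bytes as a ZIP stream.
    static List<String> entryNames(byte[] data) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(data))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                names.add(entry.getName());
            }
        }
        return names;
    }

    // True if the bytes are a readable ZIP containing the two parts a
    // Word document normally has; false if unreadable or incomplete.
    static boolean looksLikeDocx(byte[] data) {
        if (!hasZipSignature(data)) {
            return false;
        }
        try {
            List<String> names = entryNames(data);
            return names.contains("[Content_Types].xml")
                    && names.contains("word/document.xml");
        } catch (IOException e) {
            return false; // truncated or corrupted archive
        }
    }
}
```

If looksLikeDocx returns false for the bytes received in the Controller but true for the same file read from disk, the upload handling is the likely culprit. If it returns true in both cases, the difference is more likely in the libraries on the REST project's classpath.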
Thanks in advance for any suggestions.
- mike