I need to use DOCX and XML files in a translation process. Not all translation tools can read XML, but they can read DOCX. I want XML as well because the segments can be matched to each other more easily. So I want to convert the plain text from DOCX to XML and back again (from XML to DOCX). Where should I begin? Do you know how I can do this programmatically in Java?
The Apache POI library can read .docx files, and it has special classes for extracting the text of a document. Those may not be accurate enough for your purposes, though, in which case you need to fall back on the regular API.
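For the XML half of the round trip, here is a minimal sketch that uses only the JDK. It assumes you already have the paragraph texts as a list of strings, for example collected from POI's XWPFDocument.getParagraphs() by calling XWPFParagraph.getText() on each one. The class name ParagraphXml, the element names "document" and "p", and the "id" attribute are made up for illustration; use whatever layout your translation tool expects.

```java
import java.io.StringReader;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: the XML layout below is an assumption, not a standard.
final class ParagraphXml {

    // Writes each paragraph as <p id="n">text</p> under a <document> root.
    static String toXml(List<String> paragraphs) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder().newDocument();
        Element root = doc.createElement("document");
        doc.appendChild(root);
        for (int i = 0; i < paragraphs.size(); i++) {
            Element p = doc.createElement("p");
            // The id lets a translated paragraph be matched to its source.
            p.setAttribute("id", String.valueOf(i + 1));
            // The serializer escapes &, < and similar characters.
            p.setTextContent(paragraphs.get(i));
            root.appendChild(p);
        }
        Transformer t = TransformerFactory.newInstance().newTransformer();
        t.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
        StringWriter out = new StringWriter();
        t.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    // Reads the paragraph texts back in document order.
    static List<String> fromXml(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        NodeList nodes = doc.getElementsByTagName("p");
        List<String> paragraphs = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            paragraphs.add(nodes.item(i).getTextContent());
        }
        return paragraphs;
    }

    public static void main(String[] args) throws Exception {
        List<String> original = List.of("Hello, world.", "Tom & Jerry <draft>");
        String xml = toXml(original);
        System.out.println(xml);
        // Checks that the round trip gives back the same paragraphs.
        System.out.println(fromXml(xml).equals(original));
    }
}
```

Going back to .docx, one option would be to create a new XWPFDocument, add each string with createParagraph().createRun().setText(...), and save it with write(OutputStream). Note that this only carries plain text: formatting, tables, headers and so on are lost unless you handle them yourself.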