This week's book giveaway is in the OCMJEA forum. We're giving away four copies of OCM Java EE 6 Enterprise Architect Exam Guide and have Paul Allen & Joseph Bambara on-line! See this thread for details.
Hi, I need to convert a word document(which follows a standard template) into an XML document. Right now I am doing it manually by inserting the defined set of XML tags. Is there a way to automate this process of inserting XML tags based on the WORD template. Is there any tool available in market. Or we need to write a program(like in C or C#) to accomplish this. Any suggestion, welcome. I have a general XSL which will be used to all XML documents(converted from word). Our final aim is to come up with a knowledge management system, with search capabilities, discussion forums etc. Thanks, Rajan
- IBM Certified - eBusiness Solution Technologist
- IBM Certified Developer - XML
- Microsoft Certified Application Developer
- Tibco Certified Professional
- Sun Certified Java Professional
Geoff Oakham is writing a tool called doc2xml, which converts .doc files into XML. There are also some Windows/Office products/plugins for converting .doc files into XML as well. Google will help with those.
You could download the OpenOffice package of applications. OO has been able to open every Word document I have thrown at it, and it can then save the file in various formats including XML. http://www.openoffice.org/ I have also done this sort of conversion by saving the file from Word as text (with character replacement allowed to fix those accursed "smart" quotes) - If the template has been followed correctly, a Java program to reformat the text can be pretty simple. Bill
Thanks for everyone's suggestions. I have not looked into the suggested tools/websites. Will let you know the results afterwards. I tried with XML Spy. It has a facility to take a Word document as input and convert it to an XML document. The problem being, the converted XML is not WELL FORMED always. I am interested in a tool which will convert based on the XSD I have. Bit tough, but somewhere there should be one. The search continues. Rajan [ September 18, 2003: Message edited by: Rajan Thiyagarajan ]