Win a copy of Design for the Mind this week in the Design forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Word 2003 to XML via XSLT

 
Eric Pascarello
author
Rancher
Posts: 15385
6
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Has anyone here tried to do convert an Word 2003 document into XML via XSLT? I may have a requirement in the near future that would require me to grab data from a word doc and put it into a database. If it could be done with an XSLT, it would make my life easier in the future to change.

I am finding poor documentation on the process. Hopefully someone has some insight into this matter.

Eric
 
Paul Clapham
Sheriff
Pie
Posts: 20955
31
Eclipse IDE Firefox Browser MySQL Database
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I just did that a couple of days ago. First I saved the document as XML (I don't believe that the .doc format is XML itself). Then I eyeballed the XML to find the bits I wanted to extract, and messed around with the XSLT until it extracted only those bits.

Okay, that's not very professional. A quick hack, but it did what I needed. But I know Microsoft has schemas for the XML version of Word 2003. Have you seen this page yet? Looks like a good place to start.
 
Madhav Lakkapragada
Ranch Hand
Posts: 5040
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Glad to note that something is "free' from M$.

- m
 
Prabha Enjeti
Greenhorn
Posts: 2
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi,

I have a similar task to convert a MS Word document to an XML.
The word document has images and graphs.Someone suggested me to use Apache POI Framework for this task.Can some one please suggest me how to go about it?

Thanks,
Prabha
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic