If you can get the Word document saved in RTF or HTML format instead of .doc, you will probably find it easier to parse, since the .doc format is notoriously hard to work with.
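As a rough sketch of the RTF route, the JDK's own Swing text classes can pull the plain text out of an RTF stream without any extra libraries. The class name RtfText and the methods extract and extractFromString are made up for this illustration, and I am assuming you only want the text, not the formatting.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.swing.text.BadLocationException;
import javax.swing.text.Document;
import javax.swing.text.rtf.RTFEditorKit;

// Hypothetical helper class, not part of any library.
public class RtfText {

    /** Reads RTF from the stream and returns its plain text content. */
    public static String extract(InputStream in)
            throws IOException, BadLocationException {
        RTFEditorKit kit = new RTFEditorKit();
        // The kit creates a styled document that its reader can fill.
        Document doc = kit.createDefaultDocument();
        kit.read(in, doc, 0);
        return doc.getText(0, doc.getLength());
    }

    /** Convenience wrapper for RTF that is already held in a String. */
    public static String extractFromString(String rtf) {
        byte[] bytes = rtf.getBytes(StandardCharsets.ISO_8859_1);
        try (InputStream in = new ByteArrayInputStream(bytes)) {
            return extract(in);
        } catch (IOException | BadLocationException e) {
            throw new IllegalArgumentException("Could not read RTF", e);
        }
    }

    public static void main(String[] args) {
        // A tiny hand-written RTF document used only as sample input.
        String rtf = "{\\rtf1\\ansi{\\fonttbl\\f0\\fswiss Helvetica;}"
                + "\\f0\\pard Hello from RTF\\par}";
        System.out.println(extractFromString(rtf).trim());
    }
}
```

For a real file you would pass a FileInputStream to extract instead. The RTF reader in the JDK is fairly basic, so documents with heavy formatting may not come through cleanly.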
The free OpenOffice suite has been able to read all of my .doc files, and it can save in XML or other formats that are well documented and may be easier to parse.
The Apache POI toolkit offers some support for reading .doc files, but I think its best support is for Excel files.
Bill