I'm trying to parse some RSS feeds for my website, and I'm not really sure where to start. Can anyone recommend any tutorials or articles? Should I use SAX or DOM to do it?
If I am remember correctly, the 'ELEMENT' node type has broader range. So can you check the nodetype static variable again and try 'TEXTNODE' or something like that? Hope this helps you.
in the inner most loop of your code, you got to the point where you can print the node name. For example, "title" <title>Open source or no: Let the market decide</title> The node you are examining at this point has a node name "title" and node value "null". I assume you are trying to print the text "Open source....". It is represented by the child node and this child node has a node name "#text" (or null for some parsers) and the node value "Open source..." To get your program working, get the child node, check it's Node type (Node.TEXT_NODE) and then get the NodeValue.
<a href="http://www.mqtips.com" target="_blank" rel="nofollow">MQMessenger</a> - free Java app for MQSeries
you can remove a lot of lines from your code using the call getElementsByTagName("tag_name"). I use this function to get the value of a node:
the 'for' loop inside is just to be safe with all sorts of parser implementations. The text value of the node is represented, some times, in multiple child nodes.
<a href="http://www.mqtips.com" target="_blank" rel="nofollow">MQMessenger</a> - free Java app for MQSeries