I'm trying to figure out how to select some data out of HTML that looks like this:
<p> <b>aaaa</b> <i>bbbbb</i> cccccc </p>
Getting 'aaaa' and 'bbbbb' is easy, but how do I select 'cccccc' independently of the others? Somehow I want to say "grab all the text after the close of <i> up to </p>". I can't figure out how to get this sort of "free text" out with XPath.
Well, that text is in a node of its own, a TEXT_NODE. If you get the XPath expression to return a NODESET (the same as Java's NodeList) of the <p> element's children, you can iterate through it until you find the Node with type TEXT_NODE and take the value of that node. Keep in mind that the whitespace between the tags also ends up in text nodes, so skip the ones that are blank.
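Here is a rough sketch of that loop in Java, using only the JDK's built-in DOM and XPath classes. It assumes the markup is well-formed enough to parse as XML (real-world HTML often is not, and would need an HTML-tolerant parser first). The class name FreeText and the method freeText are made up for illustration.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class FreeText {
    // Hypothetical helper: returns the first non-blank text node that is a
    // direct child of the root <p> element, or null if there is none.
    // Assumes the input parses as XML.
    public static String freeText(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));

            // node() matches every child of <p>: elements and text nodes alike.
            NodeList children = (NodeList) XPathFactory.newInstance()
                    .newXPath()
                    .evaluate("/p/node()", doc, XPathConstants.NODESET);

            for (int i = 0; i < children.getLength(); i++) {
                Node child = children.item(i);
                if (child.getNodeType() == Node.TEXT_NODE) {
                    // Whitespace between tags is also a text node, so skip blanks.
                    String value = child.getNodeValue().trim();
                    if (!value.isEmpty()) {
                        return value;
                    }
                }
            }
            return null;
        } catch (Exception e) {
            // Wrap checked parser/XPath exceptions to keep the sketch short.
            throw new IllegalStateException("could not parse or query the input", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(freeText("<p> <b>aaaa</b> <i>bbbbb</i> cccccc </p>"));
    }
}
```

If you would rather let XPath do the filtering, an expression along the lines of /p/text()[normalize-space()] should select only the non-blank text children of <p>, so no loop over element nodes is needed.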