Extract the headings from an html file and an xml file
Joined: Sep 27, 2004
I have two external systems. both of these contain html files. how do i pull the html headings out of these files and compare them to see if there are any differences?
SCJP 1.4, SCWCD 1.4<br /> <br />Thanks in advance!<br />Jayashree.
Joined: Jan 23, 2002
You could 1) do an HTTP GET request 2) read the HTML until you encounter the <head> element 3) read the HTML into a StringBuffer until until you encounter the </head> element, and 4) construct an XML DOM document with just the <head> element in it 5) compare the DOM with another page's similar DOM using XMLUnit's Diff class.