Ok, one of my friend asked me this and my answer was nope, can't be done. But I wanted to double check with folks here in case anyone else has solved this use case -
The input file is a HTML file. Using the JDK 1.4.xx (or any other) API's she wanted to parse a HTML file. The problem here was that the HTML file had a SCRIPT start element tag and an script end element tag.
So, the question is - 'Am I right in saying that there's no parser available to parse such HTML files?'
BTW, the input file is not in our control to modify. I would like to know. Thanks.
I havent used it but the release news of this html parsertool says it supports script blocks.
Joined: Jun 03, 2000
Thanks for that link Balaji. I will look into this.
You see, as if the requirements I said before were not challenging enough, the thing is I can't rely on some third party software. So, my options are narrowed down to the standard API's - JDK, Xerces, Xalan, things of that nature. If I could acheive this with the standard APIs, I would like to investigate more. If push comes to shove, then non-standard third party is acceptable.
My interest here is more academic in nature, something that I want to learn and see if I am missing something.