this is HTMLParser [ September 13, 2006: Message edited by: Maha Hassan ]
Joined: Mar 22, 2005
Don't know about that one, but JTidy, NekoXNI and TagSoup seem to be more widely used.
Joined: Aug 02, 2005
I am now using JTidy I want to extract the text within the tags the thing is it does not understand things like copyright sign,"-"," " and other special characters and when i change the encoding things do not get better