This is HTMLParser. [ September 13, 2006: Message edited by: Maha Hassan ]
Joined: Mar 22, 2005
Don't know about that one, but JTidy, NekoXNI and TagSoup seem to be more widely used.
Joined: Aug 02, 2005
I am now using JTidy. I want to extract the text within the tags. The problem is that it does not understand things like the copyright sign, "-", " " and other special characters, and when I change the encoding, things do not get better.
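One possible cause of symptoms like these (an assumption, not something confirmed in this thread) is that the page bytes are decoded with a different charset than the one they were written in. The sketch below uses only the Java standard library, not JTidy, to show how a copyright sign, a non-breaking space and an en dash get garbled when UTF-8 bytes are read as ISO-8859-1. The class name `EncodingMismatchDemo`, the `decode` helper and the sample string are made up for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingMismatchDemo {

    // Hypothetical helper: turns raw bytes into a String using the given charset.
    static String decode(byte[] raw, Charset charset) {
        return new String(raw, charset);
    }

    public static void main(String[] args) {
        // Sample text with a copyright sign (U+00A9), a non-breaking space (U+00A0)
        // and an en dash (U+2013), standing in for the "special characters".
        String original = "\u00A9 2006\u00A0Example \u2013 text";

        // Pretend this is what came off the wire: the page encoded as UTF-8.
        byte[] raw = original.getBytes(StandardCharsets.UTF_8);

        // Decoding with the matching charset round-trips the text.
        String right = decode(raw, StandardCharsets.UTF_8);

        // Decoding with the wrong charset turns each multi-byte character
        // into two or three unrelated Latin-1 characters.
        String wrong = decode(raw, StandardCharsets.ISO_8859_1);

        System.out.println(right.equals(original)); // prints "true"
        System.out.println(wrong.equals(original)); // prints "false"
        System.out.println(wrong.length() - original.length()); // extra chars from the mismatch
    }
}
```

If the mismatch is the issue, the charset that matters is the one the page was actually served in; switching between arbitrary encodings would not help unless one of them matches the bytes.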