I am reading an HTML document, extracting some text from it and then I write them to a file.
I have a trouble with HTML entities like"&nbps;" is there a way to make them printed as a normal space.
Thanks
Maha
Stan James
(instanceof Sidekick)
Ranch Hand
Joined: Jan 29, 2003
Posts: 8791
posted
0
See Apache StringEscapeUtils that encodes and decodes HTML among other things. Does that do the trick?
If your string extraction is very tricky you might do well to use an HTML parser. I like the Quiotix Parser myself. [ September 05, 2006: Message edited by: Stan James ]
A good question is never answered. It is not a bolt to be tightened into place but a seed to be planted and to bear more seed toward the hope of greening the landscape of the idea. John Ciardi
Maha Hassan
Ranch Hand
Joined: Aug 02, 2005
Posts: 133
posted
0
Hi,
This is a silly question, but how to use this package ??