I am reading in an html file and converting it to xml. The problem is that when I come across special characters like < > & ... it makes the xml invalid when you try to read it with explorer. I started to write code to convert '<' to "& l t ;" (without spaces) but I keep running into new ones every time I add one to the list. Is there a
java class that I can use to convert these symbols or find a list of invalid xml characters and there appropriate alternative?