If you're going to parse HTML with an XML parser, then you have to make sure your HTML is also well-formed XML. Yours isn't, so you can't use an XML parser to parse it. So you have two options: (1) Use an HTML parser instead; (2) Run an HTML cleanup product like HTMLTidy to convert it to XHTML.
As for the error message, it looks like you have an invalid character reference; you should examine the document to find out what it is. My guess is that it's in "garant?a" because my browser and/or the forum software identifies the second-to-last character as something it can't understand.
Joined: May 06, 2009
Thank you soo much for replying, yes, I found Sax doesnt recognize that letter you mentioned and also "", special characters at all. So I got every thing between the tags using regex.