File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes XML and Related Technologies and the fly likes How to know if a parsed character using SAX parser is a new line character ? Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Engineering » XML and Related Technologies
Bookmark "How to know if a parsed character using SAX parser is a new line character ?" Watch "How to know if a parsed character using SAX parser is a new line character ?" New topic
Author

How to know if a parsed character using SAX parser is a new line character ?

Roks Amin
Greenhorn

Joined: Mar 18, 2005
Posts: 3
Hi,
I am using SAX parser to parse an XML file which has <pre> element. Now I want to display the content in this tag as it is ( which includes tabs and carriage returns) into a jsp file. But I think that the parser is not recognizing carriage returns.
eg.
<pre>NameAgeSexRoks22MaleAman44Male</pre>

The output should be:
Name Age Sex
Roks 22 Male
Aman 44 Male

I think that in the characters() method the nextline characters after Sex,Male are not read. The encoding used is iso-8859-1.

The output which I am getting is :

Name Age Sex Roks 22 Male Aman 44 Male


Thanks in advance <br />Roks
William Brogden
Author and all-around good cowpoke
Rancher

Joined: Mar 22, 2000
Posts: 12835
    
    5
The characters() method will be getting exactly the characters in the element, but note that you can't be sure that one call has the complete text. Due to the fact that SAX parsers read one buffer at a time and do NOT assemble the characters for you. There may be a second call to characters() containing the data that was truncated.
It is more likely that the HTML browser is discarding extra spaces tabs and cr for some reason.
Bill
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: How to know if a parsed character using SAX parser is a new line character ?