File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

How to read a cdata using sax parser

 
riaz rahman
Greenhorn
Posts: 17
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I am new to XML & i want to read a query from xml.
which will be in cdata tag.
please help me by providing source code for reading cdata usin sax parser.
thanx in advance
 
Bajji Pat
Ranch Hand
Posts: 50
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
You can use the following snippet- the characters() method would catch any CDATA sections within XML:


[ December 07, 2005: Message edited by: Bajji Pat ]
 
Anand Gondhiya
Ranch Hand
Posts: 155
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thanks for providing the code.
 
Jeremie Juste
Greenhorn
Posts: 2
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello,

I have still a lot to learn in programming and I'm sorry for this stupid question.
But i was wondering how to use this code ?

Can you modify it to read txt files or html?

Will anyone be kind enough to reply to this question?
 
Ulf Dittmer
Rancher
Pie
Posts: 42966
73
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
XML *is* text. But text is not normally XML, and neither is HTML. So, no, you can't use a SAX parser for reading text files in general or HTML.

But the question is: why would you want to? What are you trying to accomplish?
 
Jeremie Juste
Greenhorn
Posts: 2
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Many thanks for the help

In fact i'm doing some web scraping and want to get some data found in between the CData.

I have found a nice code doing part of this job.
http://www.java-tips.org/java-se-tips/org.xml.sax/accessing-character-data-cdata-of-xml-element.html

 
Ulf Dittmer
Rancher
Pie
Posts: 42966
73
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
HTML is not generally XML. I suspect that this code will not work for 99% of all web pages in existence (doesn't mean it won't work for any particular page you're interested in, of course, especially if that page happens to be XHTML).
 
It is sorta covered in the JavaRanch Style Guide.
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic