File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

HTML-based web page into XML

 
rani bedi
Ranch Hand
Posts: 358
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Is it possible to change an HTML-based web page into XML?
 
Madhav Lakkapragada
Ranch Hand
Posts: 5040
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

I would like to be optimistic and say yes,
it is possible to do it. But then, do really
want to, is the qstn. Generally the flow is
other way around. XML transformed into HTML.
While I donot know any tools of the top of
my head, I don't see any reason why someone
with that much time, can't generate such a tool.
My $0.02
- satya
 
Chris Stehno
Ranch Hand
Posts: 180
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Yes it is possible. You can use JTidy (http://sourceforge.net/projects/jtidy/) which is the Java port of the older Tidy HTML parser/cleaner ... in it there is a conversion tool for converting HTML into XHTML which is the valid XML version of HTML (for all intents and purposes). I have used it before and it is pretty handy.
Hope this helps
------------------
Chris Stehno (Sun Certified Programmer for the Java 2 Platform)
 
rani bedi
Ranch Hand
Posts: 358
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thanks Chris.
 
It is sorta covered in the JavaRanch Style Guide.
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic