File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes XML and Related Technologies and the fly likes HTML-based web page into XML Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Engineering » XML and Related Technologies
Bookmark "HTML-based web page into XML" Watch "HTML-based web page into XML" New topic

HTML-based web page into XML

rani bedi
Ranch Hand

Joined: Feb 06, 2001
Posts: 358
Is it possible to change an HTML-based web page into XML?

Cheers,<br />Rani<br />SCJP, SCWCD, SCBCD
Madhav Lakkapragada
Ranch Hand

Joined: Jun 03, 2000
Posts: 5040

I would like to be optimistic and say yes,
it is possible to do it. But then, do really
want to, is the qstn. Generally the flow is
other way around. XML transformed into HTML.
While I donot know any tools of the top of
my head, I don't see any reason why someone
with that much time, can't generate such a tool.
My $0.02
- satya

Take a Minute, Donate an Hour, Change a Life
Chris Stehno
Ranch Hand

Joined: Feb 26, 2001
Posts: 180
Yes it is possible. You can use JTidy ( which is the Java port of the older Tidy HTML parser/cleaner ... in it there is a conversion tool for converting HTML into XHTML which is the valid XML version of HTML (for all intents and purposes). I have used it before and it is pretty handy.
Hope this helps
Chris Stehno (Sun Certified Programmer for the Java 2 Platform)

- Chris Stehno, SCPJ
rani bedi
Ranch Hand

Joined: Feb 06, 2001
Posts: 358
Thanks Chris.
I agree. Here's the link:
subject: HTML-based web page into XML
It's not a secret anymore!