File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes XML and Related Technologies and the fly likes UTF-8, Superscripts and XML Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Engineering » XML and Related Technologies
Bookmark "UTF-8, Superscripts and XML" Watch "UTF-8, Superscripts and XML" New topic

UTF-8, Superscripts and XML

Nagendra Prasad
Ranch Hand

Joined: Jul 11, 2002
Posts: 219
Hello Again,
I have figured out that a small set of characters (normal english ones), have their superscript notation available in unicode and therefore it is possible to convert them into their HEX equivalents.
Now, I have an XML Document where there are <sup> tags. Are these to be
treated as unicode markup? or how do we derive the superscript notation
for a character that is enclosed by this tag?
And how can we determine if this <sup> tag is not a part of normal markup
I was planning on reading the XML as an input stream encoded in UTF-8.
Would I be able to differntiate the <sup> as a superscript tag or a
general markup one...
I am not sure if I have explained myself particularly well.. but if
there are question, please ask.. i shall try and be more comprehensible.

Best Regards,<br />Nagendra Prasad.
jason adam
Chicken Farmer ()
Ranch Hand

Joined: May 08, 2001
Posts: 1932
Moving this post the XML forum, misunderstood the other question, and am going to leave it in I/O as a separate issue. Sorry!!
William Brogden
Author and all-around good cowpoke

Joined: Mar 22, 2000
Posts: 13036
If I understand your question, <sup> in your document is an ordinary XML tag - nothing magic about it. Perhaps the document is intended to be used with a matching XSLT that translates the contents into the special characters, or maybe just uses the HTML <sup>.
I agree. Here's the link:
subject: UTF-8, Superscripts and XML
It's not a secret anymore!