Need help with Charset class issue ... :-)

Dominic Stengård
Ranch Hand

Joined: Feb 05, 2001
Posts: 186
Hi JavaGurus!

I am trying to convert two characters from the Big5 character set, 0xA2CF and 0xA55F, which correspond to the characters A and 北, into the equivalent characters in the Unicode character set.

The code is:



The output in the DOS prompt is:


This is not the two characters A and 北, encoded in Unicode, that I had expected. There are not even two ?? but a single ?, meaning the data is being interpreted as one character.

What am I doing wrong?
Any help is hugely appreciated.
Thanks in advance! :-)

Best regards


Dominic Stengård
Sun Certified Java 2 Programmer
Paul Clapham
Bartender

Joined: Oct 14, 2005
Posts: 18155

No, you're on the wrong track entirely. Okay, enough of that. The main point is that there's no such thing as a "Big5 character" in Java. All characters in Java are Unicode characters. It is possible to encode Java characters as a stream (or array) of bytes using Big5, and it's possible to decode those bytes back into Unicode characters using Big5 as well. Perhaps something like this:
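The snippet posted with this reply is not preserved here, so what follows is only a minimal sketch of the idea and not the original code. It assumes the running JRE ships the "Big5" charset, and the class name `Big5Decode` and the helpers `decodeBig5` and `toCodePoints` are made up for illustration. The four bytes are the two Big5 codes from the question, each split into its lead and trail byte. Note that in the common Big5 tables 0xA2CF is the fullwidth form of A rather than the ASCII letter.

```java
import java.nio.charset.Charset;

class Big5Decode {
    // Hypothetical helper: decodes raw Big5 bytes into a Java (Unicode) String.
    static String decodeBig5(byte[] big5Bytes) {
        return new String(big5Bytes, Charset.forName("Big5"));
    }

    // Hypothetical helper: formats each code point as U+XXXX, so the result
    // stays readable on a console that cannot display the characters.
    static String toCodePoints(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> sb.append(String.format("U+%04X ", cp)));
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        // 0xA2CF and 0xA55F, written as lead byte followed by trail byte.
        byte[] big5Bytes = { (byte) 0xA2, (byte) 0xCF, (byte) 0xA5, (byte) 0x5F };

        // Decoding: Big5 bytes -> Unicode characters.
        String text = decodeBig5(big5Bytes);
        System.out.println(toCodePoints(text));

        // Encoding: Unicode characters -> Big5 bytes again.
        byte[] roundTrip = text.getBytes(Charset.forName("Big5"));
        System.out.println(roundTrip.length + " bytes after re-encoding");
    }
}
```

Printing the code points instead of the characters themselves sidesteps the DOS prompt's own code page, which may well be what turns the output into a ?.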
 