I mean i use oracle which charset is "UTF-8". And i have to insert a String which is a japanese String from a xml file which encoding="Shift_JIS" , So after insert , i read this String from the oracle , and compare it with the one in XML file. they are not equal. i wonder how can i do something to make them equal. i mean do something with the String in database. Thanks very much! [ June 18, 2003: Message edited by: Javan Li ]
If the two objects have been correctly converted into Strings (which by definition use Unicode internally) then equals() will return the correct answers here. Meaning either (a) one of the two Strings was converted incorrectly, (b) the two strings really are not equal, or (c) someone lied to you about the character sets which are being used. Based on my own experience, it's usually (c). But don't overlook (b). In this case it's difficult for me to imagine how your could screw up the conversion yourself - when you read from the XML file, most parsers should take care of the character conversion for you (after reading the encoding declaration from the file itself). And if the database is set up correctly, using ResultSet's getString() method should likewise handle the conversion. Here's what I'd do: for each of the two "different" Strings you're dealing with, write the value to a small file with a .html extension. Use UTF-8 to create the OutputStreamWriter. Then look it the files with your browser - go to View -> Character Encoding to make sure it's using UTF-8. Do the files look the same, or not? Show the files to someone who understands the language and data, if you don't - does either of the files display the "correct" value? This can help guide you to where the problem is - is the XML data bad, or the XML parsing, or the Oracle data, or the Oracle data, or the Oracle configuration (wherever the charset is declared)? There are many places something could be wrong here, but the place to fix it is almost certainly before you've converted the data into a Java String.
"I'm not back." - Bill Harding, Twister
Joined: Jul 24, 2002
Thanks , Jim. And i find the problem. because the field which store the long String is long type. But when i insert this String into oracle. i didnt' use the right InputStreamReader . So even it looks the same between these two String. They are not equal. And i fix this bug. Thanks!
I’ve looked at a lot of different solutions, and in my humble opinion Aspose is the way to go. Here’s the link: http://aspose.com
subject: How can i transform a unicode String into a japanese String??