

Lily Boland

Joined: May 11, 2006
Posts: 11

Does anyone know a good way to parse a UTF-8 XML file, store it as ASCII in a DB, and export it back in UTF-8 format?

Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42965
Just to ask the obvious, is there no way to run the DB in Unicode mode, instead of ASCII?

One possibility would be to base-64 encode the string with a library like Jakarta Commons Codec.
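
A minimal sketch of the Base64 idea, for illustration only. It swaps in the JDK's own java.util.Base64 (Java 8 and later) instead of Commons Codec so that it runs without extra jars; the class and method names are hypothetical. The text is turned into UTF-8 bytes, and the Base64 form of those bytes contains only ASCII characters, so it should fit an ASCII column.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper: names are illustrative, not from any library.
final class Base64Column {

    // Unicode text -> UTF-8 bytes -> Base64 string (ASCII characters only).
    static String toAscii(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return Base64.getEncoder().encodeToString(utf8);
    }

    // Base64 string read from the DB -> UTF-8 bytes -> original text.
    static String fromAscii(String stored) {
        byte[] utf8 = Base64.getDecoder().decode(stored);
        return new String(utf8, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "<city>Z\u00fcrich</city>";
        String stored = toAscii(original);
        System.out.println(stored);
        System.out.println(fromAscii(stored).equals(original));
    }
}
```

The trade-off is that the stored value grows by about a third and can no longer be searched or read directly in the DB.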
[ August 30, 2006: Message edited by: Ulf Dittmer ]
Srinivas Kalvala
Ranch Hand

Joined: Oct 20, 2005
Posts: 257


I'm not sure, but I'm thinking along these lines:

The Reader and Writer classes convert between characters and UTF-8 bytes in both directions. First write your file into a ByteArrayOutputStream through a Writer, then store the bytes in the DB. After reading the data back from the DB, reconstruct the text with a Reader.

UTF -----> Writer(Stream); -----> DB

DB ---> Reader(Stream) ---> UTF format

Please correct me if anything is wrong.

Thank you.
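
The Writer/Reader flow above might look roughly like the sketch below. The class and method names are hypothetical. Note the assumption: this produces raw UTF-8 bytes, so it only helps if the DB column can hold binary data; a strictly ASCII text column would still need something like Base64 on top.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper: names are illustrative only.
final class Utf8Streams {

    // UTF text -> Writer -> bytes to store in the DB.
    static byte[] toUtf8Bytes(String text) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(buffer, StandardCharsets.UTF_8)) {
            writer.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    // Bytes read from the DB -> Reader -> UTF text.
    static String fromUtf8Bytes(byte[] stored) {
        StringBuilder text = new StringBuilder();
        try (Reader reader = new InputStreamReader(
                new ByteArrayInputStream(stored), StandardCharsets.UTF_8)) {
            char[] chunk = new char[1024];
            int n;
            while ((n = reader.read(chunk)) != -1) {
                text.append(chunk, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return text.toString();
    }
}
```

Naming the charset explicitly on both sides matters here, since the platform default encoding may not be UTF-8.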
Ilja Preuss

Joined: Jul 11, 2001
Posts: 14112
If you use a standard XML parser and JDBC, isn't encoding taken care of automatically?
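
As a rough illustration of the parser half of that point: the standard JAXP parser reads the encoding from the XML declaration and hands back ordinary Java Strings, and a Transformer can write the document out as UTF-8 again. The class and method names below are hypothetical, and the JDBC half is not shown; whether non-ASCII characters survive there depends on the driver and on how the column is defined.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;

// Hypothetical helper: names are illustrative only.
final class XmlEncodingRoundTrip {

    // The parser decodes the bytes according to the XML declaration.
    static Document parse(byte[] xmlBytes) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xmlBytes));
        } catch (Exception e) {
            throw new IllegalStateException("could not parse XML", e);
        }
    }

    // Serialize the document, asking explicitly for UTF-8 output.
    static byte[] serializeUtf8(Document doc) {
        try {
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toByteArray();
        } catch (Exception e) {
            throw new IllegalStateException("could not serialize XML", e);
        }
    }
}
```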

The soul is dyed the color of its thoughts. Think only on those things that are in line with your principles and can bear the light of day. The content of your character is your choice. Day by day, what you do is who you become. Your integrity is your destiny - it is the light that guides your way. - Heraclitus
Manuel Palacio
Ranch Hand

Joined: Oct 16, 2000
Posts: 45
I usually base-64 encode it, store it in the DB, and then decode it when reading it back.
Lily Boland

Joined: May 11, 2006
Posts: 11
Thank you all for the helpful info!
subject: UTF-8