aspose file tools*
The moose likes EJB and other Java EE Technologies and the fly likes Issue in character encoding Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » EJB and other Java EE Technologies
Bookmark "Issue in character encoding" Watch "Issue in character encoding" New topic
Author

Issue in character encoding

Abhishek Purwar
Ranch Hand

Joined: Dec 15, 2007
Posts: 63

Hello friends.......
I am currently working on my project and got stuck with a strange problem..........

I am working on windows machine and using eclipse and tomcat for my application.........

I am reading data from xls file in my application and storing it in database(mysql).....
in my xls sheet when i give non-english characters it got read correctly by my application and stored in the same way
in the database ........

But when i worked on linux machine and using same softwares i.e. eclipse, tomcat and mysql........
my application didn't read xls file having special charaters correctly .......
means it changes it into some other characters and store it in that manner only in database.

And when i debug my appkication in windows while reading it shows workbooksetting encoding cp1252........
but in linux it shows utf-8.......

it troubles me very much........
so please let me get out of this problem.........


Abhishek Purwar,
Bangalore.
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 41820
    
  62
Whenever you perform character I/O you need to specify which encoding you want to use; if you don't, then the platform default encoding will be used. That's MacRoman on OS X, CP-1252 on Windows, and generally either UTF-8 or ISO-8859-1 on Linux. So you need to think about which encoding you want at which place, and make sure that that gets used everywhere in the code.

Also make sure that the DB is set to use Unicode (or some other encoding that can accommodate all the characters you need to store, but not ASCII).


Ping & DNS - my free Android networking tools app
Abhishek Purwar
Ranch Hand

Joined: Dec 15, 2007
Posts: 63

Ulf Dittmer wrote:Whenever you perform character I/O you need to specify which encoding you want to use; if you don't, then the platform default encoding will be used. That's MacRoman on OS X, CP-1252 on Windows, and generally either UTF-8 or ISO-8859-1 on Linux. So you need to think about which encoding you want at which place, and make sure that that gets used everywhere in the code.

Also make sure that the DB is set to use Unicode (or some other encoding that can accommodate all the characters you need to store, but not ASCII).


Thanks for your suggestion.
Can you tell me how can i do that so that it can be platform independent???
And i am able to store non-english characters in both platform correctly in the database......
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Issue in character encoding