aspose file tools*
The moose likes Java in General and the fly likes Multilangual PDF Creation(in all indian languages) Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Java in General
Bookmark "Multilangual PDF Creation(in all indian languages) " Watch "Multilangual PDF Creation(in all indian languages) " New topic
Author

Multilangual PDF Creation(in all indian languages)

Satish Kumar Kara
Greenhorn

Joined: Feb 29, 2008
Posts: 8
We have a module to generate PDF documents, which has to be processed by another module for some security features. Currently we are using iText API to create PDF documents in case of English. But we have to implement i18n in our application ( for all Indian languages). So we need to generate the PDF document in other Indian languages like Hindi, Guajarati, Kannada etc. Following are some criteria to be considered.

1. The document may have more than one language texts with English texts.
2. The document may contain all the numeric of all languages and special characters.
3. We are storing data in database as UNICODE(UTF-8) and using the JDBC API to retrieve.

Kindly suggest us some solution with iText API. Let us know if some other PDF API is supporting all Indian languages.
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 42648
    
  65
iText and PDFs can handle Unicode. As long as the machine on which the document is viewed has fonts available that cover all characters (like Microsoft Arial Unicode), there should not be any problems. If no such font is available, you can use iText to embed the font in the document. Note that such fonts are big (MS Arial Unicode is about 20 MB in size, IIRC), and that not all font licenses permit distribution of the font.

This may help: http://www.alanwood.net/unicode/fonts.html


Ping & DNS - my free Android networking tools app
Campbell Ritchie
Sheriff

Joined: Oct 13, 2005
Posts: 39885
    
  28
Not a "beginning" question. Moving thread.
Satish Kumar Kara
Greenhorn

Joined: Feb 29, 2008
Posts: 8
In the Machine on which the document is viewed has fonts which supports the particular language and

We are embedding the font which supports Hindi (Indian language) in the document using iText as



but still pdf displays different text than expected.We got all the hindi characters, but not in proper format…

For ex:

It displays text as shown in the link provided(hinadi) below, instead of हिन्दी (hindi)

http://doc-08-8o-docs.googleusercontent.com/docs/secure/rb76hs62dof3nkvdingba5cedl9ltjag/uo3tc5pdc11g50ml7n4mbc8t56rq19j9/1270101600000/00409798775290637475/*/0ByYDS0mupIdzZjhjZTdlZTUtZjZmYy00OGJiLTllYjgtYTJjMWZmNjUxZTY5


Note : The text हिन्दी (hindi) is stored in oracle DB in UTF-8 encoding.
We are able retrieve the same in the browser but not in pdf generated using iText.
When the iText is embedding, it is getting changed.

kindly suggest how we can achieve the required.


Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 42648
    
  65
It looks like, basically, it gets it right, except for the first characters being swapped. That might be the Hindi equivalent of using ligatures vs. not using ligatures. A possible workaround is mentioned here.

I see that you also asked this elsewhere, and even got a reply; next time you do that, BeForthrightWhenCrossPostingToOtherSites.
Susan Acier
Greenhorn

Joined: Jul 25, 2014
Posts: 1
Since this thread is a few years old, I just wanted to see if anyone has additional information on how to solve the problem. I know there is no official itext solution, but wondering if anyone has come up with a workaround that's effective. Thanks!
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Multilangual PDF Creation(in all indian languages)