We have a module to generate PDF documents, which has to be processed by another module for some security features. Currently we are using iText API to create PDF documents in case of English. But we have to implement i18n in our application ( for all Indian languages). So we need to generate the PDF document in other Indian languages like Hindi, Guajarati, Kannada etc. Following are some criteria to be considered.
1. The document may have more than one language texts with English texts.
2. The document may contain all the numeric of all languages and special characters.
3. We are storing data in database as UNICODE(UTF-8) and using the JDBC API to retrieve.
Kindly suggest us some solution with iText API. Let us know if some other PDF API is supporting all Indian languages.
iText and PDFs can handle Unicode. As long as the machine on which the document is viewed has fonts available that cover all characters (like Microsoft Arial Unicode), there should not be any problems. If no such font is available, you can use iText to embed the font in the document. Note that such fonts are big (MS Arial Unicode is about 20 MB in size, IIRC), and that not all font licenses permit distribution of the font.
Note : The text हिन्दी (hindi) is stored in oracle DB in UTF-8 encoding.
We are able retrieve the same in the browser but not in pdf generated using iText.
When the iText is embedding, it is getting changed.
It looks like, basically, it gets it right, except for the first characters being swapped. That might be the Hindi equivalent of using ligatures vs. not using ligatures. A possible workaround is mentioned here.
Since this thread is a few years old, I just wanted to see if anyone has additional information on how to solve the problem. I know there is no official itext solution, but wondering if anyone has come up with a workaround that's effective. Thanks!
Die Fledermaus does not fear such a tiny ad:
free, earth-friendly heat - a kickstarter for putting coin in your pocket while saving the earth