Originally posted by Ulf Dittmer:
The console where you're printing this to most likely does not support Arabic characters (many Console don't support much beyond ISO-8859-1), so the fact that the characters don't print properly doesn't mean much.
You can use the String.charAt method to check whether the string contains the proper Arabic characters.
Originally posted by Ulf Dittmer:
Where are the characters displayed as question marks? In the console? In the editor? If the former, most consoles can't display Unicode. If the latter, make sure the editor can handle Unicode (and Arabic) in general (meaning it has fonts and selects those properly).
Originally posted by Ulf Dittmer:
No, POI has no provisions for reading RTF. But the basic structure of RTF isn't complicated. If you spend some time with the specification (such as there is) and study a few simple RTF files, you should be able to write code that parses the file contents and extracts the actual text.