Can anyone please advise me on reading PDF file. I only want to read text part of it from my application. i tried using PJX from Etymon inc, but failed. NO significant help is available online to use this Jar.
Please help me in using PJX, or suggest any other utility which has proper documentation.
Regards sameer
Ulf Dittmer
Marshal
Joined: Mar 22, 2005
Posts: 35252
7
posted
0
PDF is a hard to read format. The best you can do is extract the text from it. If that's good enough for your purposes, check out JPedal. [ February 17, 2006: Message edited by: Ulf Dittmer ]
Thanks a lot for recommendation. JPEDAL seems to be a great utility.
I need to extract only text from PDF. Can you give me some code example to how to do it using JPEDAL
Regards sameer
Ulf Dittmer
Marshal
Joined: Mar 22, 2005
Posts: 35252
7
posted
0
Hello Sameer-
I don't seem to have noticed it before, but apparently you missedthat JavaRanch has a policy on display names, and yours does not comply with it - please adjust it accordingly, which you can do right here. Thanks for your prompt attention to this matter.
As to your question, check the examples that you can download in the same spot where you can download jpedal itself. [ February 21, 2006: Message edited by: Ulf Dittmer ]
Kerry Wilson
Ranch Hand
Joined: Oct 29, 2003
Posts: 251
posted
0
Anyone know of a PDF Text Extraction library that doesn't cost $500 for a single JVM license?
If you look carefully at the information the the Jpedal home page, you'll see that the open source version does include text extraction, and it's free. It's the enterprise edition that costs $500.