aspose file tools*
The moose likes Android and the fly likes How to extract text from a pdf file in android emulator Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of EJB 3 in Action this week in the EJB and other Java EE Technologies forum!
JavaRanch » Java Forums » Mobile » Android
Bookmark "How to extract text from a pdf file in android emulator" Watch "How to extract text from a pdf file in android emulator" New topic
Author

How to extract text from a pdf file in android emulator

Mohit G Gupta
Ranch Hand

Joined: May 18, 2010
Posts: 634

i have tried the following code to extract text from pdf file on android.



it works when the file is in the file-system,but i want it to work ,when the file is in the sdcard
something similar to this:


OCPJP 6.0 93%
OCPJWCD 5.0 98%
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 39547
    
  27
For starters, PDF files are binary, not text. That means you can't use Readers to work with them, you need to use Streams.

So the actual problem is how to get an InputStream from a file on the SD card? The Android Dev Guide has a page called "Data Storage" that talks about using SD cards.


Ping & DNS - updated with new look and Ping home screen widget
Mohit G Gupta
Ranch Hand

Joined: May 18, 2010
Posts: 634

FileReader doesnot works for pdf files.
please,tell me how to use pdfbox within the android
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 39547
    
  27
FileReader doesnot works for pdf files.

Not just FileReader - all Readers and Writers. It's crucial that you understand the difference between text files and binary files.

please,tell me how to use pdfbox within the android

I have no idea whether PDFBox works on Android, but I pointed you to a resource that explains how to get an InputStream from a file on an SD card, and it seems that PDDocument.load can use an InputStream as well as a Reader. What else are you looking for?
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: How to extract text from a pdf file in android emulator
 
Similar Threads
Unable to read more than one text files stored in sdcard
PDFTextStripper returning null for all the japanese text in the PDF
JTextPane ignoring new lines when text is loaded from servlet
How to get the complete path of file from android application