How to extract text from a pdf file in android emulator

Mohit G Gupta
Ranch Hand

Joined: May 18, 2010
Posts: 634

I have tried the following code to extract text from a PDF file on Android.



It works when the file is in the file system, but I want it to work when the file is on the SD card,
something similar to this:


Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 41123
    
For starters, PDF files are binary, not text. That means you can't use Readers to work with them; you need to use Streams.

So the actual question is how to get an InputStream from a file on the SD card. The Android Dev Guide has a page called "Data Storage" that talks about using SD cards.
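
As a rough sketch of that step in plain Java: the class name SdCardStreams, the method openPdf and the file name sample.pdf are made up for illustration and do not come from the original code. The storage directory is passed in as a parameter so the snippet compiles off-device; on Android it would presumably come from Environment.getExternalStorageDirectory().

```java
import java.io.BufferedInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;

class SdCardStreams {

    // Opens a binary InputStream for a file inside the given directory.
    // Assumption: on Android, storageDir would be the external storage root,
    // e.g. the result of Environment.getExternalStorageDirectory().
    static InputStream openPdf(File storageDir, String fileName) throws IOException {
        File pdf = new File(storageDir, fileName);
        return new BufferedInputStream(new FileInputStream(pdf));
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the SD card: a temporary directory with a tiny fake file.
        File dir = new File(System.getProperty("java.io.tmpdir"));
        File sample = new File(dir, "sample.pdf"); // hypothetical file name
        try (FileOutputStream out = new FileOutputStream(sample)) {
            out.write(new byte[] {'%', 'P', 'D', 'F'});
        }

        // Read the first byte back through the stream.
        try (InputStream in = openPdf(dir, "sample.pdf")) {
            System.out.println((char) in.read());
        }
        sample.delete();
    }
}
```

The resulting stream is what you would hand to whatever PDF library you end up using. Depending on the Android version, the app may also need a storage permission declared in the manifest.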


Mohit G Gupta
Ranch Hand

Joined: May 18, 2010
Posts: 634

FileReader does not work for PDF files.
Please tell me how to use PDFBox on Android.
Ulf Dittmer
Marshal

Joined: Mar 22, 2005
Posts: 41123
    
"FileReader does not work for PDF files."

Not just FileReader - all Readers and Writers. It's crucial that you understand the difference between text files and binary files.
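
To make the difference concrete, here is a small plain-Java sketch (the class name BinaryVsText and the sample bytes are invented for illustration). It copies the same bytes once through an InputStream and once through a Reader that decodes them as UTF-8. The stream copy should come back byte for byte, while the Reader replaces byte sequences that are not valid text, so the data no longer matches.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class BinaryVsText {

    // Copies raw bytes through an InputStream; nothing is interpreted.
    static byte[] copyWithStream(byte[] data) throws IOException {
        try (InputStream in = new ByteArrayInputStream(data);
             ByteArrayOutputStream out = new ByteArrayOutputStream()) {
            byte[] buffer = new byte[4096];
            int n;
            while ((n = in.read(buffer)) != -1) {
                out.write(buffer, 0, n);
            }
            return out.toByteArray();
        }
    }

    // Decodes the bytes to characters with a Reader, then encodes them again.
    // Bytes that are not valid UTF-8 are replaced during decoding.
    static byte[] copyWithReader(byte[] data) throws IOException {
        StringBuilder text = new StringBuilder();
        try (Reader reader = new InputStreamReader(
                new ByteArrayInputStream(data), StandardCharsets.UTF_8)) {
            int c;
            while ((c = reader.read()) != -1) {
                text.append((char) c);
            }
        }
        return text.toString().getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Illustrative bytes: an ASCII prefix followed by non-text binary values.
        byte[] sample = {'%', 'P', 'D', 'F',
                (byte) 0xE2, (byte) 0xE3, (byte) 0xCF, (byte) 0xD3};

        System.out.println(Arrays.equals(sample, copyWithStream(sample))); // true
        System.out.println(Arrays.equals(sample, copyWithReader(sample))); // false
    }
}
```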

"Please tell me how to use PDFBox on Android."

I have no idea whether PDFBox works on Android, but I pointed you to a resource that explains how to get an InputStream from a file on an SD card, and it seems that PDDocument.load can use an InputStream as well as a Reader. What else are you looking for?
 