File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Other Open Source Projects and the fly likes Search in documents Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "Search in documents" Watch "Search in documents" New topic

Search in documents

chetan dhumane
Ranch Hand

Joined: Jan 07, 2009
Posts: 641

I am using lucene for searching , My jsp pages contains document links which are in doc format.I need to search in these documents also.Document should get open after search.

Author @
Moody blogger who do not like to behave like target setting machines work.
David Newton

Joined: Sep 29, 2008
Posts: 12617

Can you describe what you want a little more clearly?

It sounds like you want a basic crawler that will also index Word documents?

What do you mean "Document should get open after search"?
Ulf Dittmer

Joined: Mar 22, 2005
Posts: 42959
The Apache POI library can extract the text from a DOC or DOCX file; that should be sufficient to index it:
I agree. Here's the link:
subject: Search in documents
It's not a secret anymore!