Integration between Mahout and Lucene/Solr

Artur Nowak
Greenhorn

Joined: Dec 13, 2009
Posts: 4
Hi,

First of all, congratulations to the authors of Mahout in Action -- I think it's a big challenge to write a comprehensive guide to Mahout, which is such a moving target at the moment!

I'm curious how much coverage the integration between Mahout and Lucene/Solr got in the book. I think it's a pretty common use of the package, but the resources on the web are scattered and not very thorough (for people interested: Lucid Imagination blog, Example application from ApacheCon2010).

From the table of contents I can see that there are some use-case-oriented chapters in the book (for example chapter 12, Real-world applications of clustering), so I'd like to know whether they cover the use of Lucene/Solr along with Mahout or explore more isolated applications.

Thanks in advance!
Ted Dunning
Greenhorn

Joined: Aug 16, 2011
Posts: 11
We did not talk much about the integration of Solr and Lucene with Mahout. You are correct that this is a good source of data for processing with Mahout, but I think that in actual production cases, it isn't really a large part of the mix. Perhaps it should be, and perhaps more explanation would help with that.

On the other hand, another forthcoming Manning book, "Taming Text", definitely does talk quite a bit about this integration.
Artur Nowak
Greenhorn

Joined: Dec 13, 2009
Posts: 4
Thanks for the answer!
 
 