Granny's Programming Pearls
"inside of every large program is a small program struggling to get out"
The moose likes Other Open Source Projects and the fly likes Integration between Mahout and Lucene/Solr Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "Integration between Mahout and Lucene/Solr" Watch "Integration between Mahout and Lucene/Solr" New topic

Integration between Mahout and Lucene/Solr

Artur Nowak

Joined: Dec 13, 2009
Posts: 4

First of all congratulations to the authors of Mahout in Action -- I think that's a big challenge to write a comprehensive guide to Mahout, which is such a moving target at the moment!

I'm curious how much coverage did integration between Mahout and Lucene/Solr get in the book. I think that it's pretty common use of the package, but the resources on the web are scattered and not very thorough (for people interested: Lucid Imagination blog, Example application from ApacheCon2010).

From the table of contents I can see that there are some use-case oriented chapters in the book (like for example 12th: Real-world applications of clustering), so I'm interested if they cover use of Lucene/Solr along with Mahout or rather explore some more isolated applications?

Thanks in advance!
Ted Dunning

Joined: Aug 16, 2011
Posts: 11
We did not talk much about the integration of Solr and Lucene with Mahout. You are correct that this is a good form of data for processing with Mahout, but I think that in actual production cases, it isn't a really large part of the mix. Perhaps it should be, and perhaps more explanation would help that.

On the other hand, another forthcoming Manning book, "Taming Text" definitely does talk quite a bit about this integration.
Artur Nowak

Joined: Dec 13, 2009
Posts: 4
Thanks for the answer!
I agree. Here's the link:
subject: Integration between Mahout and Lucene/Solr
It's not a secret anymore!