wood burning stoves 2.0*
The moose likes Other Open Source Projects and the fly likes How closely coupled is Mahout to Hadoop and MapReduce? Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

Win a copy of EJB 3 in Action this week in the EJB and other Java EE Technologies forum!
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "How closely coupled is Mahout to Hadoop and MapReduce?" Watch "How closely coupled is Mahout to Hadoop and MapReduce?" New topic

How closely coupled is Mahout to Hadoop and MapReduce?

Glenn Murray
Ranch Hand

Joined: Dec 07, 2001
Posts: 74

Thanks for your answers to my "Questions about Mahout's maturity" post.

Are the algorithms implemented by Mahout all implemented in Hadoop, i.e., does using Mahout imply that the problem can be implemented in Hadoop and is therefore amenable to implementation in MapReduce? As Mahout is about scalability, I suppose this question is getting at whether all of its scalability is, in the end, based on MapReduce.

Also, is there something in the book about running on Google AppEngine (GAE) and Amazon EC2?


Glenn Murray
Author of Yo Soy Una Vaca De Hoy
Luan Cestari
Ranch Hand

Joined: Feb 07, 2010
Posts: 125

I was about to ask the same thing. About one of the wuestions that you made, I read is that the GAE would do MapReduce jogs soon.

Please, visit me for some cool tech post at www.ourdailycodes.com
Sean Owen

Joined: Nov 08, 2004
Posts: 21
Most of it is based on Hadoop / MapReduce, yes. Not all of it is though, in particular a lot of the recommender code, which also has a significant non-distributed presence.

I don't think you can run Hadoop on GAE? Or at least I have not heard that you can, nor tried. I have personally run it on EC2. The book has a few pages on running Hadoop jobs on EC2; it's generally quite straightforward if you understand what's going on when you run it locally.
Rajesh Nagaraju
Ranch Hand

Joined: Nov 27, 2003
Posts: 50
Most of mahout algorithms can be run on a in memory mode...
You need not have Map Reduce, however Map Reduce is required to run large datasets
It is sorta covered in the JavaRanch Style Guide.
subject: How closely coupled is Mahout to Hadoop and MapReduce?
Similar Threads
Specific problem domains in which which mahout is best
Mahout in Action - evolution of the library and the book
Mahout data access future
Mapreduce using Java
Mahout and Stanford Classes