File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Hadoop and the fly likes Hadoop Map Reduce Cookbook Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

Win a copy of Android Security Essentials Live Lessons this week in the Android forum!
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop Map Reduce Cookbook" Watch "Hadoop Map Reduce Cookbook" New topic

Hadoop Map Reduce Cookbook

Jon Ferguson

Joined: Sep 05, 2007
Posts: 16
There seems two key areas to address when looking at Hadoop.. First, the nuts and bolts of just getting it up and running on various hardware. Sounds like EMR might address a real need here since much of this is being done in the cloud anyway. Second, the 'why' of doing this in the first place. To this end I'm looking forward to perusing this book further. There's lot's of ways to attack a problem and Hadoop provides the framework to do this heavy lifting. However we still need to understand how to write the algorithms to get the knowledge we are looking for. In this regard - you've listed using Mahout for document classification with naive-bayes in your chapter on Text processing. Can you describe roughly how you approach this problem with Hadoop and Mahout?
subject: Hadoop Map Reduce Cookbook
Similar Threads
Do I need to be a machine-learning professional in order to use Apache Mahout?
How closely coupled is Mahout to Hadoop and MapReduce?
Questions about Mahout's maturity
Mahout data access future
Mahout in Action - evolution of the library and the book