Hadoop Map Reduce Cookbook

Jon Ferguson
Greenhorn

Joined: Sep 05, 2007
Posts: 16
There seem to be two key areas to address when looking at Hadoop. First, the nuts and bolts of just getting it up and running on various hardware. It sounds like EMR might address a real need here, since much of this is being done in the cloud anyway. Second, the 'why' of doing this in the first place. To this end I'm looking forward to perusing this book further. There are lots of ways to attack a problem, and Hadoop provides the framework to do the heavy lifting. However, we still need to understand how to write the algorithms to get the knowledge we are looking for. In this regard, you've listed using Mahout for document classification with naive Bayes in your chapter on text processing. Can you describe roughly how you approach this problem with Hadoop and Mahout?
 
 