Meaningless Drivel is fun!*
The moose likes Hadoop and the fly likes Hadoop and Statistics Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of Spring in Action this week in the Spring forum!
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop and Statistics" Watch "Hadoop and Statistics" New topic
Author

Hadoop and Statistics

Santosh U Prabhu
Greenhorn

Joined: Sep 30, 2011
Posts: 5
We are currently doing a POC for a Hadoop implementation. Most of the users submitting their resumes seem to have statistical backgrounds. Is statistics a necessity for learning Hadoop? I am just beginning to learn Hadoop and I can already see that there is a learning curve and a differing way of approach to implementing Hadoop.
Carlos Morillo
Ranch Hand

Joined: Jun 06, 2009
Posts: 221

I'd say it depends on the use case. Likely that's for the Analytics and BI aspect or consumers of the output of MapReduce Jobs.

You need UNIX/Linux skills to install and manage a Hadoop cluster.

You need Java skills to understand the framework and to write MapReduce jobs but you can also use some other programming languages as well.

You need some SQL skills to play with Hive.

You need to understand RDBMS to understand their limitations and how NoSQL Databases such as HBase (Hadoop Database) solve certain kind of problems.

At the end there has to be some consumer to get insights and make decisions and these are Analytics and BI software such as Datameer, Tableau, etc.


HTH,

Carlos.


SCSA, OCA, SCJP 5.0, SCJD, CCDH, CCAH http://www.linkedin.com/in/carlosamorillo
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Hadoop and Statistics