my dog learned polymorphism
The moose likes Hadoop and the fly likes Hadoop and Statistics Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop and Statistics" Watch "Hadoop and Statistics" New topic

Hadoop and Statistics

Santosh U Prabhu

Joined: Sep 30, 2011
Posts: 5
We are currently doing a POC for a Hadoop implementation. Most of the users submitting their resumes seem to have statistical backgrounds. Is statistics a necessity for learning Hadoop? I am just beginning to learn Hadoop and I can already see that there is a learning curve and a differing way of approach to implementing Hadoop.
Carlos Morillo
Ranch Hand

Joined: Jun 06, 2009
Posts: 221

I'd say it depends on the use case. Likely that's for the Analytics and BI aspect or consumers of the output of MapReduce Jobs.

You need UNIX/Linux skills to install and manage a Hadoop cluster.

You need Java skills to understand the framework and to write MapReduce jobs but you can also use some other programming languages as well.

You need some SQL skills to play with Hive.

You need to understand RDBMS to understand their limitations and how NoSQL Databases such as HBase (Hadoop Database) solve certain kind of problems.

At the end there has to be some consumer to get insights and make decisions and these are Analytics and BI software such as Datameer, Tableau, etc.



I agree. Here's the link:
subject: Hadoop and Statistics
It's not a secret anymore!