Hadoop and Statistics

Santosh U Prabhu
Greenhorn

Joined: Sep 30, 2011
Posts: 5
We are currently doing a POC for a Hadoop implementation. Most of the people submitting their resumes seem to have statistical backgrounds. Is statistics a necessity for learning Hadoop? I am just beginning to learn Hadoop, and I can already see that there is a learning curve and that implementing Hadoop calls for a different approach.
Carlos Morillo
Ranch Hand

Joined: Jun 06, 2009
Posts: 221

I'd say it depends on the use case. The statistical background is most likely relevant to the analytics and BI side, or to whoever consumes the output of MapReduce jobs.

You need UNIX/Linux skills to install and manage a Hadoop cluster.

You need Java skills to understand the framework and to write MapReduce jobs, although you can use other programming languages as well.

You need some SQL skills to play with Hive.

You need to understand RDBMSs and their limitations to see how NoSQL databases such as HBase (the Hadoop database) solve certain kinds of problems.

In the end there has to be some consumer that gets insights and makes decisions, and that is the job of analytics and BI software such as Datameer, Tableau, etc.
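
To give a feel for the Java side mentioned above, here is a minimal sketch of the map, shuffle and reduce phases as a word count. It is plain Java and deliberately does not use the Hadoop API (a real job would extend the Mapper and Reducer classes in org.apache.hadoop.mapreduce and run on a cluster). The class name WordCountSketch and the method names map, shuffle, reduce and run are made up for this illustration.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory illustration of the MapReduce idea; not Hadoop code.
public class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every word in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(new SimpleEntry<>(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group all emitted values by key (Hadoop does this for you).
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: collapse the list of values for one key into a single count.
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Driver: run the three phases in order over a small in-memory input.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> entry : shuffle(pairs).entrySet()) {
            counts.put(entry.getKey(), reduce(entry.getValue()));
        }
        return counts;
    }

    public static void main(String[] args) {
        // prints {big=2, data=2}
        System.out.println(run(Arrays.asList("big data big", "data")));
    }
}
```

Note that nothing here requires statistics: the programming model is about splitting work into independent map calls and per-key reduce calls. The statistics comes in when somebody decides what to compute and how to interpret the output.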


HTH,

Carlos.


SCSA, OCA, SCJP 5.0, SCJD, CCDH, CCAH http://www.linkedin.com/in/carlosamorillo
 