aspose file tools*
The moose likes Hadoop and the fly likes Hadoop - mean time to productivity Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop - mean time to productivity" Watch "Hadoop - mean time to productivity" New topic
Author

Hadoop - mean time to productivity

andrew ennamorato
Ranch Hand

Joined: Oct 03, 2007
Posts: 100
Chuck,

In your experience(s), how quickly does one go from Hadoop newbie to using it with success?

Sometimes, I feel like I've avoided trying/getting into Hadoop because there seems to be a lot to it - clusters, hdfs, etc. Could I spend a weekend fooling around and end up pretty comfortable?
David Newton
Author
Rancher

Joined: Sep 29, 2008
Posts: 12617

Depends on what you mean by "comfortable". I'd say no.
andrew ennamorato
Ranch Hand

Joined: Oct 03, 2007
Posts: 100
Good question. Comfortable enough to setup a small cluster, put some files on HDFS, run a few m/r tasks, etc. Not be an expert or anything, but enough to try out a few ideas and see how it works.
Chuck Lam
author
Greenhorn

Joined: Aug 09, 2010
Posts: 12
I've seen a number of courses in universities where students are expected to get up to speed on Hadoop in about 2-4 weeks. My memory is a bit vague on this one, but I do remember somewhere that a mid-term homework assignment was to implement PageRank over Wikipedia articles using Hadoop. I would certainly consider that a "comfortable" level.

Of course, your learning curve will vary depending on your background and available resources. The courses I referred to above almost always require "distributed systems" as a prerequisite. The classes also usually have a test cluster already set up. If you're setting one up yourself, factor in some time on learning systems administration.
andrew ennamorato
Ranch Hand

Joined: Oct 03, 2007
Posts: 100
Chuck,

Good to know. I've been wanting to pick up a copy of your book just for the M/R chapters anyway (maybe to use as a guide for CouchDB's internal M/R functionality), so if I don't win one I'll spring for it regardless.

Thanks,

andy
David Newton
Author
Rancher

Joined: Sep 29, 2008
Posts: 12617

Any of the MapReduce papers out there would be more than adequate for that, I think.
 
Gartner says :Bigdata will be most advanced analytics products by 2015 !

Time to Become Big data architect by learning Hadoop(Developer, Administration,Analyst,QA),Cassandra,MongoDb,HBase,Datascience, Mahout, Splunk,R etc) from scratch to expert level

https://intellipaat.com/course-cat/big-data/?utm_source=coderanch%20&utm_medium=text&utm_campaign=coderanchdx1
 
subject: Hadoop - mean time to productivity