Meaningless Drivel is fun!*
The moose likes Hadoop and the fly likes Hadoop typical uses Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop typical uses" Watch "Hadoop typical uses" New topic
Author

Hadoop typical uses

Will Myers
Ranch Hand

Joined: Aug 05, 2009
Posts: 319

To the authors:

Can you explain scenarios where Hadoop really shines and why it is better than the competition?

What is the learning curve for it for an experienced Java developer?
Garry Turkington
author
Greenhorn

Joined: Apr 23, 2013
Posts: 15
Regarding where it shines, it really is the classic situation that if you have large volumes of structured or semi-structured data and have analytics that need to touch a lot of that data then it's possibly a good fit.

I suspect I'll make this point multiple times this week -- I view Hadoop as one component of the data processing systems I build but I use it alongside traditional databases and data warehouses. If your use case requires you to pull specific items from a well structured data set then odds are you'll be much better off with a traditional RDBMS. Can you do it in Hadoop, sure, but pick the best tool for the job. If your queries on the RDBMS turn into table scans because of how much data you need process to generate your results then in that case I'd consider Hadoop.

I find the Java APIs in Hadoop very well designed and easy to pick up. I find the biggest learning curve is more conceptual; learning how to take a particular problem and expressing it as a series of MapReduce jobs. You can find yourself with a series of MR jobs the code for each is literally only a few lines in each map and reduce method. But put together in the MapReduce framework the processing chain can do extremely sophisticated things. This is where the real experience will need develop.

Garry
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Hadoop typical uses
 
Similar Threads
How much Hadoop knowledge should one have before diving into Pig?
Prerequisites for learning hadoop ?
new to Hadoop
requirements to install Hadoop on a notebook?
what's Hadoop ?