Hadoop and flavors

 
Ranch Hand
Posts: 35
Hibernate Chrome Java
I am still very new to Hadoop, so please pardon me if my question seems lame.

Basically, there are so many flavors of Hadoop out there (MapR, Cloudera, Hortonworks, etc.). How do I know which one is right for me, or rather for my company?
Also, if you could elaborate on the differences, that would be great.

Thanks.
 
Ranch Hand
Posts: 221
Scala Python Java
If you want an enterprise-grade distribution that a business can rely on, MapR is the way to go: it is easy to use and manage, has no single point of failure, lets you on-board data into the cluster with regular UNIX/Linux commands through NFS, and offers the best performance.

Besides NFS and no single point of failure, you get features such as volumes, snapshots and mirrors, which are critical for multitenancy and disaster recovery.
 
author
Posts: 15
The nice thing is that you can download and try a free version of each of them. The base Apache distro is good when you're just learning and getting started, but you wouldn't deploy it in production.

The real advantage of bundled distributions such as Cloudera and Hortonworks is that you don't have to juggle versions to get version x of Hive working with version y of Hadoop and version z of HBase. They also come with better tools for deployment and management in an operational environment.

MapR gives you that too but, as the previous poster mentioned, it also has a number of unique extensions, such as its NameNode-free HA architecture and the NFS integration. If you have existing legacy systems that push data to NFS, this is a very nice option.
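To make the NFS point a bit more concrete, here is a rough sketch of what "on-boarding through NFS" could look like from a legacy system's side. It assumes the cluster file system is already mounted as an ordinary directory; the function name `onboard_file` and the directory names are made up for illustration and are not part of any MapR API. The demo uses a temporary directory as a stand-in for the mount so it runs anywhere.

```python
import shutil
import tempfile
from pathlib import Path


def onboard_file(local_file: Path, cluster_dir: Path) -> Path:
    """Copy a local file into a directory that is assumed to be an NFS
    mount of the cluster. Only ordinary file operations are used; no
    Hadoop client library is involved."""
    cluster_dir.mkdir(parents=True, exist_ok=True)
    target = cluster_dir / local_file.name
    # Plain byte-for-byte copy, the same thing `cp` would do on the mount.
    shutil.copyfile(local_file, target)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Hypothetical stand-in for the NFS mount point of the cluster.
        fake_mount = Path(tmp) / "cluster_mount" / "landing"

        # A file produced by some existing (legacy) system.
        source = Path(tmp) / "events.log"
        source.write_text("id,value\n1,42\n")

        copied = onboard_file(source, fake_mount)
        print(copied.name, copied.stat().st_size, "bytes")
```

The point is only that the writing side needs nothing Hadoop-specific. With a distribution that does not expose the cluster over NFS, the same step would typically go through a Hadoop client or an ingestion tool instead.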

So it's really down to the combination of products in the bundle, the tools you need to run it in production, and how much you are willing to pay for the commercial aspects of CDH and MapR in particular. But as I said, try them all!

Garry
 