jQuery in Action, 2nd edition*
The moose likes Hadoop and the fly likes Configuring Hadoop, HBase and Hive Cluster Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Configuring Hadoop, HBase and Hive Cluster" Watch "Configuring Hadoop, HBase and Hive Cluster" New topic
Author

Configuring Hadoop, HBase and Hive Cluster

Hakan Bo
Greenhorn

Joined: Nov 12, 2012
Posts: 1
Hi,

I am a newbie to Hadoop, HBase and Hive. I installed Hadoop, HBase and Hive in pseudodistributed mode and everything works fine. Now I am planning to set up an simple Hadoop Cluster (5 nodes) with Hive, HBase and ZooKeeper. I´ve read several documentations and instructions before but i could not find a good explanation for my question. I´m not sure, where to run all the daemons. This is my consideration:

Node_1 (Master)
NameNode
JobTrakcer
HBase Master
ZooKeeper (Standalone node; managed by HBase)


Node_2 (Backup_Master)
SecondaryNameNode


Node_3 (Slave1)
DataNode1
TaskTracker1
RegionServer1


Node_4 (Slave2)
DataNode2
TaskTracker2
RegionServer2


Node_5 (Slave3)
DataNode3
TaskTracker3
RegionServer3


I know, in production it is recommended to run ZooKeeper ensemble at an odd number of nodes (seperate Cluster). But for a simple cluster, is it OK to set up a standalone ZooKeeper node which runs on the master node?
Another question is regarding Hive: I know that Hive is a Hadoop client. Should I also install Hive on the master node? Does it make sense?

Thanks for all tips and comments! Hakan

Note: I have just 5 machines to simulate a cluster.
Atul Itankar
Greenhorn

Joined: Jan 09, 2013
Posts: 13
Answering to your second question
You can install hive on master node but you should not
as it is master node, should not use it. It doesn't make sense.


Thanks
Atul Itankar

InfoCepts | www.infocepts.com
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Configuring Hadoop, HBase and Hive Cluster