Re: Configuring Hadoop, HBase and Hive Cluster

Dalia Sobhy Mon, 12 Nov 2012 16:20:58 -0800

I do advise you to use Cloudera Manager its a very simple and opensource 
cluster configuration software..


A good design is to run zookeeper on node1, node2, another node alone

Sent from my iPhone

On 2012-11-13, at 2:04 AM, "Hakan Bogay" <[email protected]> wrote:

> Hi,
> 
> I am a newbie to Hadoop, HBase and Hive. I installed Hadoop, HBase and Hive
> in pseudodistributed mode and everything works fine. Now I am planning to
> set up an simple Hadoop Cluster (5 nodes) with Hive, HBase and ZooKeeper.
> I´ve read several documentations and instructions before but i could not
> find a good explanation for my question. I´m not sure, where to run all the
> daemons. This is my consideration:
> 
> *Node_1* (Master)
> 
>   - NameNode
>   - JobTrakcer
>   - HBase Master
>   -
> 
>   ZooKeeper (Standalone node; managed by HBase)
> 
> 
> 
> *Node_2* (Backup_Master)
> 
>   -
> 
>   SecondaryNameNode
> 
> 
> 
> *Node_3* (Slave1)
> 
>   - DataNode1
>   - TaskTracker1
>   -
> 
>   RegionServer1
> 
> 
> 
> *Node_4* (Slave2)
> 
>   - DataNode2
>   - TaskTracker2
>   -
> 
>   RegionServer2
> 
> 
> 
> *Node_5* (Slave3)
> 
>   - DataNode3
>   - TaskTracker3
>   - RegionServer3
> 
> 
> I know, in production it is recommended to run ZooKeeper ensemble at an odd
> number of nodes (seperate Cluster). But for a simple cluster, is it OK to
> set up a standalone ZooKeeper node which runs on the master node?
> Another question is regarding Hive: I know that Hive is a Hadoop client.
> Should I also install Hive on the master node? Does it make sense?
> 
> Thanks for all tips and comments!
> 
> Hakan
> 
> Note: I have just 5 machines to simulate a cluster.

Re: Configuring Hadoop, HBase and Hive Cluster

Reply via email to