Hi Prabhakar
ZooKeeper. It will make no meaningful difference to Drill which nodes
you run it on since it is only used for configuration and control. I'd
probably put it on the two namenodes since it is similar in spirit and
has no need of the more serious storage that is likely to be installed
on the datanodes.
Drill. To take advantage of data locality in Hadoop you would install
Drill on the datanodes of your Hadoop cluster. Note, however, that for a
small cluster like this, and given modern LAN speeds, data locality can
often just be ignored. Nevertheless I'd personally start by installing
Drillbits on each of the three datanodes.
Regards
James
On 2023/10/26 10:12, Prabhakar Bhosale wrote:
Hi Team,
I am looking for deployment architecture of drill on production server. I
have following configurations
HAdoop cluster
2 name nodes - Faiiover
3 data nodes
I have following questions
1. should zookeeper to be installed only on name node or both datanode and
namenodes?
2. Should drill be installed on all nodes or only namenodes or only
datanodes?
Regards
Prabhakar