Dear Apache Drill Team,

I am trying to run Apache Drill in distributed mode on Google Cloud
Dataproc, but unable to start drillbit on each node in the cluster.

I have created a basic cluster (1 master, 2 worker) with GCP Dataproc
service, using the initialization scripts and instructions provided in the
Apache Drill website.

https://drill.apache.org/docs/installing-drill-in-distributed-mode-with-gcp-dataproc/


Apache Drill 1.19.0 and Apache Zookeeper 3.6.3 versions were configured in
the setup script. The cluster provisioning in Dataproc was successful and I
am able to connect with each node using SSH. When I tried to check the
status of Zookeeper using telnet localhost 2181 and entering stats, it is
showing the following

[image: zookeeper.png]

Then, I try to start drillbit service on each node using the command
bin/drillbit.sh start as mentioned here

https://drill.apache.org/docs/starting-drill-in-distributed-mode/

then it shows

Starting drillbit, logging to /opt/drill/log/drillbit.out

When I check the status of drill using bin/drillbit.sh status, it displays

/opt/drill/drillbit.pid file is present but drillbit is not running.

When I try to access Drill web UI public_ip_addr:8047 using public ip
address of any node, it gives "can’t establish a connection to the server".
So it is unclear whether drill is running or not. Note: I have opened port
8047 under firewall rules

Kindly provide help on how to resolve the issue and set up Apache Drill in
distributed mode on GCP.

Regards,
Vigneswaran S
vigneswaran....@gmail.com

Reply via email to