Hari Sekhon created DRILL-6406:
----------------------------------

             Summary: Apache Drill Add Proper Health Check to API - /status 
returns "Running" even though Drill isn't usable eg. java classpath errors for 
flexjson-*.jar when trying to execute any query
                 Key: DRILL-6406
                 URL: https://issues.apache.org/jira/browse/DRILL-6406
             Project: Apache Drill
          Issue Type: Improvement
          Components:  Server, Client - HTTP, Execution - Monitoring, Tools, 
Build & Test, Web Server
    Affects Versions: 1.13.0, 1.10.0
            Reporter: Hari Sekhon


I've come across situations on MapR 5.2 / 6.0 where Drill 1.10 / 1.13 is up and 
running but there are flexjson jar classpath errors meaning that Drill is not 
usable so queries fail even though the API gives the impression that Drill is 
available. This is bad for load balancers.

This API needs to be much more robust and terms of checking if Drill is fully 
working. I suggest internally doing a health check real query and if that 
executes successfully and returns the expected result, only then updating the 
status to OK / Running.

This internal health check query should be periodically re-executed (eg. every 
30 secs) and the status set to ERROR or something if if fails.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to