Nishant Bangarwa created HIVE-20349:
---------------------------------------

             Summary: Implement Retry Logic in HiveDruidSplit for Scan Queries
                 Key: HIVE-20349
                 URL: https://issues.apache.org/jira/browse/HIVE-20349
             Project: Hive
          Issue Type: Bug
            Reporter: Nishant Bangarwa
            Assignee: Nishant Bangarwa


while distributing druid scan query we check where the segments are loaded and 
then each HiveDruidSplit directly queries the historical node. 
There are few cases when we need to retry and refetch the segments. 

# The segment is loaded on multiple historical nodes and one of them went down. 
in this case when we do not get response from one segment, we query the next 
replica. 
# The segment was loaded onto a realtime task and was handed over, when we 
query the realtime task has already finished. In this case there is no replica. 
The Split needs to query the broker again for the location of the segment and 
then send the query to correct historical node. 

This is also the root cause of failure of druidkafkamini_basic.q test, where 
the segment handover happens before the scan query is executed.

Note: This is not a problem when we are directly querying Druid brokers as the 
broker handles the retry logic. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to