nishantmonu51 opened a new issue #11322:
URL: https://github.com/apache/druid/issues/11322


   ### Affected Version
   
   0.21.1
   
   ### Description
   
   Loaded a large dataset using the native parallel index. The load ran for 4 
hrs, during that some tasks failed and retried. 
   Noticed that the numbered shards generated for the interval were not 
continuous and were missing in between overall max(partition_num) was 2289, 
however, the total shard count was 2263. 
   On a quick look it seems that on a subtask failure, it may happen that some 
partition numbers were allocated to a task and are never pushed, on a retry, 
the new task gets newer partition numbers. 
   
   The overall result was that the ingestion completed successfully with 
retries and the coordinator loaded the data.
   The broker segment timeline was incomplete due to missing partition numbers 
and the data was never queryable. 
    


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to