didip opened a new issue #11396:
URL: https://github.com/apache/druid/issues/11396


   
   ### Affected Version
   
   Tested on 0.21.1
   
   ### Description
   
   Please include as much detailed information about the problem as possible.
   - 15 middle managers with 20 workers each.
   - We are deploying Druid inside Kubernetes.
   - Each middle manager pods has 32GB RAM and 20 CPU.
   - The configuration of the cluster is pretty basic, we don't use any 
affinity stuff.
   - The native ingestion job uses maxNumSegmentsToMerge=100. The input data is 
around 3TB per day with hundreds of parquet files.
   - To reproduce, we just keep it running for almost a day, we will see that 
index_parallel is marked failed but many of the subtasks are still running.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to