didip opened a new issue #11396: URL: https://github.com/apache/druid/issues/11396
### Affected Version Tested on 0.21.1 ### Description Please include as much detailed information about the problem as possible. - 15 middle managers with 20 workers each. - We are deploying Druid inside Kubernetes. - Each middle manager pods has 32GB RAM and 20 CPU. - The configuration of the cluster is pretty basic, we don't use any affinity stuff. - The native ingestion job uses maxNumSegmentsToMerge=100. The input data is around 3TB per day with hundreds of parquet files. - To reproduce, we just keep it running for almost a day, we will see that index_parallel is marked failed but many of the subtasks are still running. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
