kito4 opened a new issue, #17661: URL: https://github.com/apache/dolphinscheduler/issues/17661
### Search before asking - [x] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar feature requirement. ### Description As part of an educational research project at ITMO University, we aim to investigate how open-source schedulers, specifically Apache DolphinScheduler (DS), handle resource oversubscription. Oversubscription — allocating more tasks than available physical or logical resources — can increase utilization and reduce costs but often leads to performance degradation, instability, or SLA violations for critical workloads. The project will focus on identifying technical gaps in DS and proposing mechanisms to manage oversubscription safely, including metrics, scheduling policies, prioritization, and throttling strategies. ### Use case A DS cluster runs multiple concurrent workflows, temporarily exceeding available CPU, memory, or I/O resources. Without proper control, worker nodes may become overloaded, task queues grow, and critical tasks may fail or be delayed. The research project will explore potential solutions such as: Prioritizing critical workflows under oversubscription. Implementing back-pressure or throttling mechanisms. Adding observability and metrics for oversubscription states. Testing and simulating scenarios to evaluate improvements in throughput, latency, and stability. ### Related issues _No response_ ### Are you willing to submit a PR? - [x] Yes I am willing to submit a PR! ### Code of Conduct - [x] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
