Hi Celeborn community, I have written up CIP-17: Interruption Aware Slot Selection <https://docs.google.com/document/d/16Lj4KadSb6ypaXTg5tJB0QvaXG8vTLtyoj7V4umTZqw/edit?usp=sharing>. Please review and let me know if there are any comments or questions.
This is a feature we have introduced internally, given our heavy volume of interruptions. We have seen substantial decrease in task failures in both Flink and Spark jobs, and think the community would also benefit from this :) Looking forward to getting feedback from the community. Thanks, Aravind CIP 17: Interruption Aware Slot Selection <https://drive.google.com/open?id=16Lj4KadSb6ypaXTg5tJB0QvaXG8vTLtyoj7V4umTZqw>