Hi All, Operators that are not making progress are killed after TIMEOUT_WINDOW_COUNT. But this doesn't take the operator recovery into account. Operators that are recovering may need more time than what is set in TIMEOUT_WINDOW_COUNT.
The new attribute, RECOVERY_WINDOW_TIMEOUT_COUNT, will be used to decide the timeout when the operators are recovering. Thoughts?
