AutomationDev85 opened a new pull request, #58033:
URL: https://github.com/apache/airflow/pull/58033

   # Overview
   
   We are sporadically encountering "Too Many Requests" (HTTP 429) errors from 
the Kubernetes API when scaling up nodes in our Kubernetes cluster. While most 
PodManager functions already implement retries for various errors, the 
create_pod function previously only retried on HTTP 409 (Conflict) errors.
   With this change, the retry logic is extended to also handle HTTP 429 
errors, improving robustness during cluster scaling operations.
   
   We welcome your feedback on this change!
   
   # Details of change:
   
   * The create_pod function now retries on both HTTP 409 and HTTP 429 errors.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to