ecerulm opened a new pull request, #23504:
URL: https://github.com/apache/airflow/pull/23504

   The previous implementation relied on watch returning the events
   sorted by resource_version which is not guaranteed at least in EKS. 
   
   So previously you could end up with KubernetesJobWatcher retrying watch from 
a resource_version that is not valid (too old already) 
   
   ```
   2022-05-05 08:41:40,522] {kubernetes_executor.py:126} INFO - Event: and now 
my watch begins starting at resource_version: 379140622
   [2022-05-05 08:41:40,545] {kubernetes_executor.py:111} ERROR - Unknown error 
in KubernetesJobWatcher. Failing
   Traceback (most recent call last):
     File 
"/home/airflow/.local/lib/python3.8/site-packages/airflow/executors/kubernetes_executor.py",
 line 102, in run
       self.resource_version = self._run(
     File 
"/home/airflow/.local/lib/python3.8/site-packages/airflow/executors/kubernetes_executor.py",
 line 145, in _run
       for event in list_worker_pods():
     File 
"/home/airflow/.local/lib/python3.8/site-packages/kubernetes/watch/watch.py", 
line 182, in stream
       raise client.rest.ApiException(
   kubernetes.client.exceptions.ApiException: (410)
   Reason: Expired: too old resource version: 379140622 (380367990)
   ```. 
   
   
   
   <!--
   Thank you for contributing! Please make sure that your code changes
   are covered with tests. And in case of new features or big changes
   remember to adjust the documentation.
   
   Feel free to ping committers for the review!
   
   In case of existing issue, reference it using one of the following:
   
   closes: #ISSUE
   related: #ISSUE
   
   How to write a good git commit message:
   http://chris.beams.io/posts/git-commit/
   -->
   
   ---
   **^ Add meaningful description above**
   
   Read the **[Pull Request 
Guidelines](https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst#pull-request-guidelines)**
 for more information.
   In case of fundamental code change, Airflow Improvement Proposal 
([AIP](https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+Improvements+Proposals))
 is needed.
   In case of a new dependency, check compliance with the [ASF 3rd Party 
License Policy](https://www.apache.org/legal/resolved.html#category-x).
   In case of backwards incompatible changes please leave a note in a 
newsfragement file, named `{pr_number}.significant.rst`, in 
[newsfragments](https://github.com/apache/airflow/tree/main/newsfragments).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to