karenbraganz commented on PR #46257:
URL: https://github.com/apache/airflow/pull/46257#issuecomment-2710736249

   I am updating "zombie task" terminology to "task instance heartbeat timeout" 
in the Airflow configurations, source code, and documentation after receiving 
approval in [this dev list 
discussion](https://lists.apache.org/[email protected]:lte=1M:Updating%20%22zombie%20task%22%20terminology%20to%20%22task%20heartbeat%20timeout%22).
   
   Three tests from the Edge Executor keep failing on this PR with the below 
error message even though I have changed the `scheduler_zombie_task_threshold` 
config to `task_instance_heartbeat_timeout` in 
airflow/config_templates/config.yml.
   ```
   raise AirflowConfigException(f"section/key [{section}/{key}] not found in 
config")
   E   airflow.exceptions.AirflowConfigException: section/key 
[scheduler/task_instance_heartbeat_timeout] not found in config
   ```
   - I ran all tests for the Edge provider on my local branch with `breeze 
testing providers-tests --test-type "Providers[edge]"` and did not see any 
failures.
   
![image](https://github.com/user-attachments/assets/e26c3f20-5a6e-4574-8773-958b95e7135c)
   - I built and ran the image from my local branch, then opened a Python shell 
and ran the code that is failing on the PR tests, but did not observe any 
errors. This confirms the config `task_instance_heartbeat_timeout` exists in my 
branch.
   ```
   (base) karenbraganza@Karens-MacBook-Pro airflow % docker run -it 
ghcr.io/apache/airflow/main/prod/python3.9 bash
   
   airflow@6a0d443df236:/opt/airflow$ python3
   Python 3.9.21 (main, Feb 25 2025, 10:00:15) 
   [GCC 12.2.0] on linux
   Type "help", "copyright", "credits" or "license" for more information.
   >>> from airflow.configuration import conf
   >>> from datetime import timedelta
   >>> 
   >>> timedelta(seconds=conf.getint("scheduler", 
"task_instance_heartbeat_timeout") + 1)
   datetime.timedelta(seconds=301)
   ```
   - I have synced my remote branch, local branch, and main airflow branch 
multiple times, but the test keeps failing on this PR.
   
   Could someone please help me understand why these tests are failing on my 
remote branch even though they are passing on my local branch, and my local and 
remote branches are synced?
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to