raphaelauv opened a new issue, #27478:
URL: https://github.com/apache/airflow/issues/27478

   ### Apache Airflow version
   
   2.4.2
   
   ### What happened
   
   I have clear a task A that was a normal task but that is now a mapped task
   
   ```log
   [2022-11-02 23:33:20 +0000] [17] [INFO] Worker exiting (pid: 17)
   2022-11-02T23:33:20.390911528Z Traceback (most recent call last):
   2022-11-02T23:33:20.390935788Z   File "/usr/local/bin/airflow", line 8, in 
<module>
   2022-11-02T23:33:20.390939798Z     sys.exit(main())
   2022-11-02T23:33:20.390942302Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/__main__.py", line 39, in main
   2022-11-02T23:33:20.390944924Z     args.func(args)
   2022-11-02T23:33:20.390947345Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/cli/cli_parser.py", line 52, 
in command
   2022-11-02T23:33:20.390949893Z     return func(*args, **kwargs)
   2022-11-02T23:33:20.390952237Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/utils/cli.py", line 103, in 
wrapper
   2022-11-02T23:33:20.390954862Z     return f(*args, **kwargs)
   2022-11-02T23:33:20.390957163Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/cli/commands/scheduler_command.py",
 line 85, in scheduler
   2022-11-02T23:33:20.390959672Z     _run_scheduler_job(args=args)
   2022-11-02T23:33:20.390961979Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/cli/commands/scheduler_command.py",
 line 50, in _run_scheduler_job
   2022-11-02T23:33:20.390964496Z     job.run()
   2022-11-02T23:33:20.390966931Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/jobs/base_job.py", line 247, 
in run
   2022-11-02T23:33:20.390969441Z     self._execute()
   2022-11-02T23:33:20.390971778Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/jobs/scheduler_job.py", line 
746, in _execute
   2022-11-02T23:33:20.390974368Z     self._run_scheduler_loop()
   2022-11-02T23:33:20.390976612Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/jobs/scheduler_job.py", line 
866, in _run_scheduler_loop
   2022-11-02T23:33:20.390979125Z     num_queued_tis = 
self._do_scheduling(session)
   2022-11-02T23:33:20.390981458Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/jobs/scheduler_job.py", line 
946, in _do_scheduling
   2022-11-02T23:33:20.390984819Z     callback_tuples = 
self._schedule_all_dag_runs(guard, dag_runs, session)
   2022-11-02T23:33:20.390988440Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/utils/retries.py", line 78, in 
wrapped_function
   2022-11-02T23:33:20.390991893Z     for attempt in 
run_with_db_retries(max_retries=retries, logger=logger, **retry_kwargs):
   2022-11-02T23:33:20.391008515Z   File 
"/usr/local/lib/python3.10/site-packages/tenacity/__init__.py", line 384, in 
__iter__
   2022-11-02T23:33:20.391012668Z     do = self.iter(retry_state=retry_state)
   2022-11-02T23:33:20.391016220Z   File 
"/usr/local/lib/python3.10/site-packages/tenacity/__init__.py", line 351, in 
iter
   2022-11-02T23:33:20.391019633Z     return fut.result()
   2022-11-02T23:33:20.391022534Z   File 
"/usr/local/lib/python3.10/concurrent/futures/_base.py", line 451, in result
   2022-11-02T23:33:20.391025820Z     return self.__get_result()
   2022-11-02T23:33:20.391029555Z   File 
"/usr/local/lib/python3.10/concurrent/futures/_base.py", line 403, in 
__get_result
   2022-11-02T23:33:20.391033787Z     raise self._exception
   2022-11-02T23:33:20.391037611Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/utils/retries.py", line 87, in 
wrapped_function
   2022-11-02T23:33:20.391040339Z     return func(*args, **kwargs)
   2022-11-02T23:33:20.391042660Z   File 
"/usr/local/lib/python3.10/site-packages/airflow/jobs/scheduler_job.py", line 
1234, in _schedule_all_dag_runs
   2022-11-02T23:33:20.391045166Z     for dag_run in dag_runs:
   2022-11-02T23:33:20.391047413Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/query.py", line 2887, 
in __iter__
   2022-11-02T23:33:20.391049815Z     return self._iter().__iter__()
   2022-11-02T23:33:20.391052252Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/query.py", line 2894, 
in _iter
   2022-11-02T23:33:20.391054786Z     result = self.session.execute(
   2022-11-02T23:33:20.391057119Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/session.py", line 1688, 
in execute
   2022-11-02T23:33:20.391059741Z     conn = self._connection_for_bind(bind)
   2022-11-02T23:33:20.391062247Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/session.py", line 1529, 
in _connection_for_bind
   2022-11-02T23:33:20.391065901Z     return 
self._transaction._connection_for_bind(
   2022-11-02T23:33:20.391069140Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/session.py", line 721, 
in _connection_for_bind
   2022-11-02T23:33:20.391078064Z     self._assert_active()
   2022-11-02T23:33:20.391081939Z   File 
"/usr/local/lib/python3.10/site-packages/sqlalchemy/orm/session.py", line 601, 
in _assert_active
   2022-11-02T23:33:20.391085250Z     raise sa_exc.PendingRollbackError(
   2022-11-02T23:33:20.391087747Z sqlalchemy.exc.PendingRollbackError: This 
Session's transaction has been rolled back due to a previous exception during 
flush. To begin a new transaction with this Session, first issue 
Session.rollback(). Original exception was: 
(psycopg2.errors.ForeignKeyViolation) update or delete on table "task_instance" 
violates foreign key constraint "task_fail_ti_fkey" on table "task_fail"
   2022-11-02T23:33:20.391091226Z DETAIL:  Key (dag_id, task_id, run_id, 
map_index)=(kubernetes_dag, task-one, scheduled__2022-11-01T00:00:00+00:00, -1) 
is still referenced from table "task_fail".
   2022-11-02T23:33:20.391093987Z 
   2022-11-02T23:33:20.391102116Z [SQL: UPDATE task_instance SET 
map_index=%(map_index)s WHERE task_instance.dag_id = %(task_instance_dag_id)s 
AND task_instance.task_id = %(task_instance_task_id)s AND task_instance.run_id 
= %(task_instance_run_id)s AND task_instance.map_index = 
%(task_instance_map_index)s]
   2022-11-02T23:33:20.391105554Z [parameters: {'map_index': 0, 
'task_instance_dag_id': 'kubernetes_dag', 'task_instance_task_id': 'task-one', 
'task_instance_run_id': 'scheduled__2022-11-01T00:00:00+00:00', 
'task_instance_map_index': -1}]
   2022-11-02T23:33:20.391108241Z (Background on this error at: 
https://sqlalche.me/e/14/gkpj) (Background on this error at: 
https://sqlalche.me/e/14/7s2a)
   2022-11-02T23:33:20.489698500Z [2022-11-02 23:33:20 +0000] [7] [INFO] 
Shutting down: Master
   ```
   
   ### What you think should happen instead
   
   Airflow should evaluate the existing and previous runs as mapped task of 1 
task
   
   cause I can't see the logs anymore of a task that is now a mapped task
   
   
   
   ### How to reproduce
   
   dag with a normal task A
   run dag
   task A success
   edit dag to make task A a mapped task ( without changing name of task )
   clear task
   scheduler crash
   
   ### Operating System
   
   ubuntu 22.04
   
   ### Versions of Apache Airflow Providers
   
   _No response_
   
   ### Deployment
   
   Docker-Compose
   
   ### Deployment details
   
   _No response_
   
   ### Anything else
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to