[
https://issues.apache.org/jira/browse/AIRFLOW-4922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17467689#comment-17467689
]
ASF GitHub Bot commented on AIRFLOW-4922:
-----------------------------------------
derkuci commented on pull request #6722:
URL: https://github.com/apache/airflow/pull/6722#issuecomment-1003763219
@xuemengran nvm. I guess you meant the PR suggested here. I tried that, but
the log table has changed and I couldn't easily match the information: the
`execution_date` column is None when `event="cli_task_run"`, which makes
filtering impossible.
I understand why the PR was rejected. For cases where the logs exist but
the web UI couldn't locate the correct hostname, the issue is that the
"task_instance" table only stores the latest `try_number`/`hostname` for a task
run (as already indicated by @ITriangle). The PK doesn't include `try_number`.
Fixing the task_instance table would be the better, more fundamental fix, but
it would probably intimidate most "amateurs" (like me).
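
A minimal sketch of the primary-key point above, assuming a heavily simplified stand-in for the "task_instance" table. The table names `ti_latest` and `ti_per_try`, the column set, and the worker hostnames are hypothetical and are not Airflow's actual schema. It only illustrates why a key without `try_number` keeps the latest `hostname` alone, while a key that includes it keeps one row per try.

```python
import sqlite3

# Hypothetical, simplified columns; the real Airflow table has many more.
COLUMNS = (
    "dag_id TEXT, task_id TEXT, execution_date TEXT, "
    "try_number INTEGER, hostname TEXT"
)

conn = sqlite3.connect(":memory:")
# Key WITHOUT try_number: every retry overwrites the same row.
conn.execute(
    f"CREATE TABLE ti_latest ({COLUMNS}, "
    "PRIMARY KEY (dag_id, task_id, execution_date))"
)
# Key WITH try_number: each retry gets its own row.
conn.execute(
    f"CREATE TABLE ti_per_try ({COLUMNS}, "
    "PRIMARY KEY (dag_id, task_id, execution_date, try_number))"
)


def record_try(table, try_number, hostname):
    """Upsert one attempt of the same task run into the given table."""
    conn.execute(
        f"INSERT OR REPLACE INTO {table} VALUES (?, ?, ?, ?, ?)",
        ("my_dag", "my_task", "2019-07-07T09:00:00+00:00", try_number, hostname),
    )


def hostnames(table):
    """Return (try_number, hostname) pairs stored for the task run."""
    query = f"SELECT try_number, hostname FROM {table} ORDER BY try_number"
    return conn.execute(query).fetchall()


for table in ("ti_latest", "ti_per_try"):
    record_try(table, 1, "worker-a")  # first attempt ran on worker-a
    record_try(table, 2, "worker-b")  # retry ran on worker-b

latest_only = hostnames("ti_latest")
per_try = hostnames("ti_per_try")
print(latest_only)  # only the second try survives
print(per_try)      # both tries, each with its own hostname
```

With the first layout, the hostname for try 1 is gone once try 2 is recorded, so a UI looking up the log for try 1 has no way to find the worker that holds it.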
> If a task crashes, host name is not committed to the database so logs aren't
> able to be seen in the UI
> ------------------------------------------------------------------------------------------------------
>
> Key: AIRFLOW-4922
> URL: https://issues.apache.org/jira/browse/AIRFLOW-4922
> Project: Apache Airflow
> Issue Type: Bug
> Components: logging
> Affects Versions: 1.10.3
> Reporter: Andrew Harmon
> Assignee: wanghong-T
> Priority: Major
>
> Sometimes when a task fails, the log shows the following:
> {code}
> *** Log file does not exist: /usr/local/airflow/logs/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log
> *** Fetching from: http://:8793/log/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log
> *** Failed to fetch log file from worker. Invalid URL 'http://:8793/log/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log': No host supplied
> {code}
> I believe this happens because the row is not committed to the
> database until after the task finishes.
> https://github.com/apache/airflow/blob/a1f9d9a03faecbb4ab52def2735e374b2e88b2b9/airflow/models/taskinstance.py#L857
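
The "No host supplied" error in the report above is consistent with the log URL being built from an empty hostname. The sketch below reproduces only that symptom with the standard library. The helper `build_log_url` is hypothetical and is not Airflow's code; the port and path are copied from the log excerpt in the report.

```python
from urllib.parse import urlparse


def build_log_url(hostname, port, relative_path):
    """Hypothetical helper: join a worker hostname, port, and log path."""
    return f"http://{hostname}:{port}/log/{relative_path}"


# An empty hostname, as would happen if it was never committed to the database.
url = build_log_url("", 8793, "my_dag/my_task/2019-07-07T09:00:00+00:00/1.log")
parsed = urlparse(url)

print(url)              # the same malformed URL shape as in the report
print(parsed.hostname)  # None: there is no host to connect to
print(parsed.port)      # the port is still parsed
```

Checking for an empty hostname before building the URL would at least allow a clearer message than a failed request.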