narenjngr opened a new issue, #58717:
URL: https://github.com/apache/airflow/issues/58717
### Apache Airflow version
3.1.3
### If "Other Airflow 2/3 version" selected, which one?
3.1.3
### What happened?
What happened?
After upgrade to airflow 3.1.3, system started experiencing random DAG
disappearance.
The config for dag processor has this setup:
```
- name: AIRFLOW__DAG_PROCESSOR__BUNDLE_REFRESH_CHECK_INTERVAL
value: '120'
- name: AIRFLOW__DAG_PROCESSOR__REFRESH_INTERVAL
value: '120'
- name: AIRFLOW__CORE__STORE_SERIALIZED_DAGS
value: 'True'
- name: AIRFLOW__CORE__MIN_SERIALIZED_DAG_UPDATE_INTERVAL
value: '60'
- name: AIRFLOW__DAG_PROCESSOR__MIN_FILE_PROCESS_INTERVAL
value: '120'
- name: AIRFLOW__DAG_PROCESSOR__STALE_DAG_THRESHOLD
value: '86400'
```
Files found by aiflow dag -folder also reduce over the period of time.
Here is my dag processor values.yaml file section-
```
dagProcessor:
enabled: true
replicas: 1
revisionHistoryLimit: ~
command: ~
args: ["bash", "-c", "exec airflow dag-processor"]
strategy:
rollingUpdate:
maxSurge: "100%"
maxUnavailable: "50%"
livenessProbe:
initialDelaySeconds: 120
timeoutSeconds: 60
failureThreshold: 10
periodSeconds: 60
command: ~
serviceAccount:
automountServiceAccountToken: true
create: false
name: "airflow"
annotations: {}
securityContext: {}
securityContexts:
pod: {}
container: {}
containerLifecycleHooks: {}
resources:
limits:
cpu: 1
memory: 2Gi
requests:
cpu: 500m
memory: 500Mi
terminationGracePeriodSeconds: 60
safeToEvict: true
extraContainers: []
extraInitContainers: []
extraVolumes: []
extraVolumeMounts: []
# Select certain nodes for airflow dag processor pods.
nodeSelector: {}
affinity: {}
tolerations: []
topologySpreadConstraints: []
priorityClassName: ~
annotations: {}
podAnnotations: {}
logGroomerSidecar:
enabled: true
command: ~
args: ["bash", "/clean-logs"]
retentionDays: 15
frequencyMinutes: 15
resources: {}
securityContexts:
container: {}
env: []
waitForMigrations:
enabled: true
env: []
securityContexts:
container: {}
env:
- name: AIRFLOW__DAG_PROCESSOR__BUNDLE_REFRESH_CHECK_INTERVAL
value: '120'
- name: AIRFLOW__DAG_PROCESSOR__REFRESH_INTERVAL
value: '120'
- name: AIRFLOW__CORE__STORE_SERIALIZED_DAGS
value: 'True'
- name: AIRFLOW__CORE__MIN_SERIALIZED_DAG_UPDATE_INTERVAL
value: '60'
- name: AIRFLOW__DAG_PROCESSOR__MIN_FILE_PROCESS_INTERVAL
value: '120'
- name: AIRFLOW__DAG_PROCESSOR__STALE_DAG_THRESHOLD
value: '86400'
```
One important point I observed is that whatever DAG is disappeared from UI
if I check its detail on any of the airflow pod using command `airflow dags
details DAG_ID` then I can see IsStale = true
Why DAGs are being marked as stale even after high value 86400 of
AIRFLOW__DAG_PROCESSOR__STALE_DAG_THRESHOLD
### What you think should happen instead?
DAGs shouldn't disappear unless I delete these DAGs from the mounted volume.
### How to reproduce
Use same dag processor config and observer for few days. Intermittent issue.
### Operating System
Linux
### Versions of Apache Airflow Providers
eval_type_backport==0.2.2
apache-airflow-providers-databricks==6.4.0
apache-airflow-providers-mongo==4.2.1
apache-airflow-providers-git==0.0.2
apache-airflow-providers-standard==1.2.0
soda-core-spark-df==3.5.5
soda-core-spark[databricks]==3.5.5
soda-core-scientific==3.5.5
pymongo~=4.0.0
typing_extensions==4.13.2
paramiko<4
PyMuPDF~=1.26.5
### Deployment
Official Apache Airflow Helm Chart
### Deployment details
Official Apache Airflow Helm Chart
### Anything else?
_No response_
### Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
### Code of Conduct
- [x] I agree to follow this project's [Code of
Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]