GitHub user trau-sca created a discussion: Slow API server website load times

Hi, we've been battling issues for the past few weeks on our api-server pod 
being very slow to display the website information (up to a minute to load the 
dags card screen). Sometimes the api server pod even restarts due to the 
liveness probe. I don't see any errors when it happens either.

We are running on airflow 3.1.0 and the 1.18.0 helm chart in an Azure 
Kubernetes cluster with an Azure PostgreSQL database. 
No pods are going past their resource requests when this issue occurs, and the 
resources for the database are very low in usage.

I have disabled sqlalchemy pooling, and use pgbouncer in airflow as well as in 
the PostgreSQL database.
I have the following configuration for airflow:
```
pgbouncer:
    enabled: true
    maxClientConn: 500
    metadataPoolSize: 50
    resultBackendPoolSize: 10
    extraIni: |
      pool_mode = transaction
```

On PostgreSQL, I have pgbouncer's pool_mode set to session, and the 
default_pool_size is 100.

We have 16 DAGs, and have a daily job that runs a db clean for anything older 
than 90 days.

What can I do to troubleshoot this slow response time?

GitHub link: https://github.com/apache/airflow/discussions/56585

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to