GitHub user wolvery edited a discussion: Airflow Issue with OpenTelemetry

Hi!

We are experiencing an issue in Airflow 2.7.3 with the OpenTelemetry SDK. It 
seems that, while trying to send the metrics, we encounter this error when the 
OpenTelemetry Collectors are overwhelmed:
```
Transient error %s encountered while exporting metric batch, retrying in %ss.
```
This seems to impact the CPU consumption of the Scheduler, and when it happens 
with tasks, they get stuck.
Have someone experienced this?
Any suggestion to handle it?
I saw that the OpenTelemetry GRPC SDK for Python does not have this code to 
exponentially backoff the retry:
https://opentelemetry-python.readthedocs.io/en/latest/_modules/opentelemetry/exporter/otlp/proto/http/metric_exporter.html#OTLPMetricExporter

GitHub link: https://github.com/apache/airflow/discussions/43218

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to