OpenLineage failed to send DAG start event #44984
Labels
area:providers
kind:bug
This is a clearly a bug
needs-triage
label for new issues that we didn't triage yet
provider:openlineage
AIP-53
Apache Airflow Provider(s)
openlineage
Versions of Apache Airflow Providers
1.14.0
Also seeing missing start DAG events for versions <= 1.12.0. However, those versions weren't logging the exception, making it difficult to determine if this is the same issue.
Apache Airflow version
2.10.1
Operating System
Amazon Linux
Deployment
Amazon (AWS) MWAA
Deployment details
MWAA with:
apache-airflow-providers-openlineage==1.14.0
OPENLINEAGE_URL
pointing to a webserver logging all received requestsWhat happened
OpenLineage provider failed to send some DAG start events, with the following exception in the scheduler logs:
What you think should happen instead
No response
How to reproduce
The failures to send events were non-deterministic and appear to be caused by a race condition. They seem to occur more frequently when multiple DAGs are being scheduled simultaneously.
I used this code to reproduce the issue, and it failed to send at least one DAG start almost every minute.
Anything else
No response
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: