kacpermuda commented on code in PR #57809: URL: https://github.com/apache/airflow/pull/57809#discussion_r2494632603
########## providers/openlineage/docs/guides/user.rst: ########## @@ -478,6 +478,51 @@ You can enable this automation by setting ``spark_inject_transport_info`` option AIRFLOW__OPENLINEAGE__SPARK_INJECT_TRANSPORT_INFO=true +Passing parent information to Airflow DAG +^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ + +To enable full OpenLineage lineage tracking across dependent DAGs, you can pass parent and root job information +through the DAG's ``dag_run.conf``. When a DAG run configuration includes an ``_openlineage`` section with valid metadata, +this information is automatically parsed and converted into DAG's **ParentRunFacet**, from which the root information +is also propagated to all task runs. If no DAG run configuration is provided, the DAG run itself is considered the +lineage root for its tasks. + +The ``_openlineage`` dict in conf can contain the following keys: Review Comment: Yes, it's just for convenience when parent is the root, then we can sort it out. But will make it more clear in the doc, that we strongly encourage passing all of it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
