mshahid6 opened a new pull request, #19107: URL: https://github.com/apache/druid/pull/19107
### Description

Added `extensions-contrib/openlineage-emitter` as a contrib extension that uses the `RequestLogger` to transform and send lineage information to any [OpenLineage](https://openlineage.io)-compatible API.

For SQL queries, the SQL text is parsed with the Calcite parser to extract input datasources (FROM clauses, JOINs, CTEs) and output datasources (INSERT INTO). For native queries, table names are read from `DataSource.getTableNames()`. Native sub-queries spawned by a SQL execution are deduplicated against the SQL-level event.

Each event includes standard OpenLineage facets (`processing_engine`, `jobType`, `sql`, `errorMessage`) and custom Druid facets (`druid_query_context` with user identity and query metadata, `druid_query_statistics` with duration and bytes).

Transport is configurable: `CONSOLE` (default) logs JSON to the Druid log; `HTTP` POSTs to an OpenLineage endpoint such as [Marquez](https://marquezproject.ai). Can be combined with other loggers via the `composing` provider.

This PR has:

- [ ] been self-reviewed.
   - [ ] using the [concurrency checklist](https://github.com/apache/druid/blob/master/dev/code-review/concurrency.md) (Remove this item if the PR doesn't have any relation to concurrency.)
- [ ] added documentation for new or modified features or behaviors.
- [ ] a release note entry in the PR description.
- [x] added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
- [ ] added or updated version, license, or notice information in [licenses.yaml](https://github.com/apache/druid/blob/master/dev/license.md)
- [ ] added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
- [x] added unit tests or modified existing tests to cover new code paths, ensuring the threshold for [code coverage](https://github.com/apache/druid/blob/master/dev/code-review/code-coverage.md) is met.
- [ ] added integration tests.
- [ ] been tested in a test Druid cluster.
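
For readers unfamiliar with OpenLineage, the following is a minimal sketch of the kind of run event the description refers to, written in Python only for illustration. It is not the extension's code. The top-level keys follow the general shape of an OpenLineage run event, and the facet names come from the description above, but the facet contents are abbreviated, the `schemaURL`, `_producer`, and `_schemaURL` fields that the spec expects are omitted, and the keys inside the two Druid facets (`identity`, `durationMs`, `bytes`), the namespace, the job name, the producer URL, and the COMPLETE/FAIL mapping are all assumptions.

```python
import json
import uuid
from datetime import datetime, timezone


def build_run_event(sql, inputs, outputs, error=None, namespace="druid://example"):
    """Build a simplified OpenLineage-style run event as a plain dict."""
    run_facets = {
        # Standard facet name from the description; contents abbreviated.
        "processing_engine": {"name": "Apache Druid"},
        # Custom Druid facets; the inner keys and values are hypothetical.
        "druid_query_context": {"identity": "alice"},
        "druid_query_statistics": {"durationMs": 12, "bytes": 3456},
    }
    if error is not None:
        run_facets["errorMessage"] = {"message": error, "programmingLanguage": "JAVA"}

    return {
        # Assumption: a failed query maps to FAIL, anything else to COMPLETE.
        "eventType": "FAIL" if error is not None else "COMPLETE",
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "producer": "https://example.invalid/druid-openlineage-sketch",  # placeholder
        "run": {"runId": str(uuid.uuid4()), "facets": run_facets},
        "job": {
            "namespace": namespace,
            "name": "druid-sql-query",  # hypothetical job name
            "facets": {
                "jobType": {"processingType": "BATCH", "integration": "DRUID", "jobType": "QUERY"},
                "sql": {"query": sql},
            },
        },
        # Input datasources (FROM, JOIN, CTE) and output datasources (INSERT INTO).
        "inputs": [{"namespace": namespace, "name": name} for name in inputs],
        "outputs": [{"namespace": namespace, "name": name} for name in outputs],
    }


if __name__ == "__main__":
    event = build_run_event("INSERT INTO dst SELECT * FROM src", ["src"], ["dst"])
    # Roughly what a console-style transport would log: one JSON document per event.
    print(json.dumps(event, indent=2))
```

An HTTP-style transport would POST the same JSON document to the lineage endpoint instead of printing it.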
