Hi,
Following the discussion in this thread:
https://lists.apache.org/thread/vwlhllkp009r6dkxpvr5115qmfppf8v6, I’d like
to call for *lazy consensus* on adding a new provider for Apache TinkerPop
using the Gremlin client.
I’ve created a pull request for this new provider:
https://github.com/apache/airflow/pull/47446
The main changes include:
-
GremlinHook
- Implements a basic connection structure using the Client from the
*gremlinpython* package (no extras are expected with this version).
- Requires a mandatory Gremlin query argument, with options to pass
additional parameters for serializer, bindings, and request_options.
-
GremlinOperator
- Added to run Gremlin queries.
-
Testing
- Unit tests for both classes cover all methods.
- A system test using the GremlinOperator.
- Integration tests that spin up a Gremlin server using
docker-compose as part of the CI pipeline, covering:
- Adding records into the graph database.
- Querying the added records.
- Cleaning up the database.
The lazy consensus period lasts for 72 hours until Friday, March 14th at
1pm UTC. If there are no objections, I will merge the latest changes from
main and move forward with code reviews (if any).
Hope you have a great rest of the week!
Thanks,
Farhan