What Yaron is going for is really something similar to a custom data source
in Spark (
https://levelup.gitconnected.com/easy-guide-to-create-a-custom-read-data-source-in-apache-spark-3-194afdc9627a)
that lets you reuse existing Python APIs that already know how to read a data
source as a stream of record batches.
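
To make the idea concrete, here is a rough, stdlib-only sketch of what I mean.
Everything in it is made up for illustration: `read_batches`, `PythonSource`
and `scan` are hypothetical names, and a list of dicts stands in for a real
record batch type. It is not how any existing engine does it.

```python
import csv
import io
from typing import Callable, Dict, Iterator, List

# Hypothetical stand-in for a real record batch type.
RecordBatch = List[Dict[str, str]]


def read_batches(text: str, batch_size: int) -> Iterator[RecordBatch]:
    """Plays the role of an existing Python API that streams a source in batches."""
    reader = csv.DictReader(io.StringIO(text))
    batch: RecordBatch = []
    for row in reader:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # flush the final, possibly short, batch
        yield batch


class PythonSource:
    """Hypothetical custom source that wraps a batch-producing callable."""

    def __init__(self, make_stream: Callable[[], Iterator[RecordBatch]]):
        self._make_stream = make_stream

    def scan(self) -> Iterator[RecordBatch]:
        # Each scan starts a fresh stream, so the source can be read repeatedly.
        return self._make_stream()


source = PythonSource(lambda: read_batches("id,name\n1,a\n2,b\n3,c\n", batch_size=2))
batch_sizes = [len(batch) for batch in source.scan()]
```

The point of taking a callable rather than an iterator is that the engine
can restart the stream whenever it needs to scan again, while the reading
logic stays in the existing Python code.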
