What Yaron is going for is really something similar to custom data source in Spark ( https://levelup.gitconnected.com/easy-guide-to-create-a-custom-read-data-source-in-apache-spark-3-194afdc9627a) that allows utilizing existing Python APIs that knows how to read data source as a stream of record batches.
- data-source UDFs Yaron Gvili
- Re: data-source UDFs Vibhatha Abeykoon
- Re: data-source UDFs Li Jin
- Re: data-source UDFs Li Jin
- Re: data-source UDFs Li Jin
- Re: data-source UDFs Weston Pace
- Re: data-source UDFs David Li
- Re: data-source UDFs Yaron Gvili
- Re: data-source UDFs Weston Pace
- Re: data-source UDFs Yaron Gvili
- Re: data-source UDFs Weston Pace
- Re: data-source UDFs Vibhatha Abeykoon