[ https://issues.apache.org/jira/browse/SPARK-52979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Allison Wang updated SPARK-52979: --------------------------------- Description: Similar to Arrow UDFs https://issues.apache.org/jira/browse/SPARK-52214, we can introduce a vectorized API for Python UDTF: {code:java} @arrow_udtf class MyUDTF: def eval(self, tbl: pa.RecordBatch) -> Iterator[pa.Table | pa.RecordBatch] ...{code} was:Implement Arrow Python UDTF > Python Arrow UDTF > ----------------- > > Key: SPARK-52979 > URL: https://issues.apache.org/jira/browse/SPARK-52979 > Project: Spark > Issue Type: Umbrella > Components: PySpark > Affects Versions: 4.1.0 > Reporter: Allison Wang > Assignee: Allison Wang > Priority: Major > > Similar to Arrow UDFs https://issues.apache.org/jira/browse/SPARK-52214, we > can introduce a vectorized API for Python UDTF: > {code:java} > @arrow_udtf > class MyUDTF: > def eval(self, tbl: pa.RecordBatch) -> Iterator[pa.Table | pa.RecordBatch] > ...{code} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org