[ 
https://issues.apache.org/jira/browse/SPARK-55723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yicong Huang updated SPARK-55723:
---------------------------------
    Affects Version/s: 4.2.0
                           (was: 4.1.2)

> Generalize RESULT_COLUMNS_MISMATCH_FOR_ARROW_UDTF error in enforce_schema for 
> all UDF types
> -------------------------------------------------------------------------------------------
>
>                 Key: SPARK-55723
>                 URL: https://issues.apache.org/jira/browse/SPARK-55723
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark
>    Affects Versions: 4.2.0
>            Reporter: Yicong Huang
>            Priority: Minor
>
> The `enforce_schema` method in `ArrowBatchTransformer` raises 
> `RESULT_COLUMNS_MISMATCH_FOR_ARROW_UDTF` when Arrow type casting fails. This 
> error class is UDTF-specific, but `enforce_schema` is a general-purpose 
> utility that will be shared across other Arrow-based UDF types (e.g., scalar 
> Arrow UDFs).
> We should introduce a more general error class (e.g., 
> `RESULT_COLUMNS_MISMATCH_FOR_ARROW_UDF`) and use it in `enforce_schema`, so 
> the error message is appropriate regardless of the calling UDF type.
> See: https://github.com/apache/spark/pull/54296#discussion_r2861772381
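
A minimal sketch of the proposed direction, not the actual PySpark implementation. It assumes the error class can be passed as a parameter that defaults to the general name suggested above. `ArrowUDFError`, the dict-of-lists column layout, and the cast callables are hypothetical stand-ins chosen so the example runs without Arrow. The real `enforce_schema` operates on Arrow batches.

```python
# Hypothetical stand-in for an exception type that carries an error class.
class ArrowUDFError(Exception):
    def __init__(self, error_class, message_parameters):
        super().__init__(f"[{error_class}] {message_parameters}")
        self.error_class = error_class
        self.message_parameters = message_parameters


# General name suggested in the issue description.
GENERAL_ERROR_CLASS = "RESULT_COLUMNS_MISMATCH_FOR_ARROW_UDF"


def enforce_schema(columns, target_types, error_class=GENERAL_ERROR_CLASS):
    """Cast every column to its target type.

    columns:      dict mapping column name -> list of values (assumed layout)
    target_types: dict mapping column name -> cast callable (assumed layout)
    error_class:  reported on failure, so callers are not tied to a
                  UDTF-specific error
    """
    result = {}
    for name, cast in target_types.items():
        try:
            result[name] = [None if v is None else cast(v) for v in columns[name]]
        except (KeyError, TypeError, ValueError) as exc:
            raise ArrowUDFError(
                error_class, {"column": name, "reason": repr(exc)}
            ) from exc
    return result
```

Under this assumption a UDTF caller could still pass `error_class="RESULT_COLUMNS_MISMATCH_FOR_ARROW_UDTF"` explicitly, while other Arrow-based UDF types would get the general error by default.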



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

