[ https://issues.apache.org/jira/browse/FLINK-22297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dian Fu updated FLINK-22297:
----------------------------
Description:
For a Pandas UDF, each input argument is of type pandas.Series and the result
is also of type pandas.Series. In addition, the length of the result must be
the same as the length of the inputs. If it is not, the behavior is currently
unclear. We should perform an early check for this and provide a clear error
message.
See
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/PyFlink-Vectorized-UDF-throws-NullPointerException-td42952.html
for more details.
was:For Pandas UDF, the input type for each input argument is Pandas.Series
and the result type is also of type Pandas.Series. Besides, the length of the
result should be the same as the inputs. If this is not the case, currently the
behavior is unclear. We should perform early check for this and provide a clear
error message.
> Perform early check to ensure that the length of the result is the same as
> the input for Pandas UDF
> ---------------------------------------------------------------------------------------------------
>
> Key: FLINK-22297
> URL: https://issues.apache.org/jira/browse/FLINK-22297
> Project: Flink
> Issue Type: Improvement
> Components: API / Python
> Reporter: Dian Fu
> Priority: Major
>
> For a Pandas UDF, each input argument is of type pandas.Series and the
> result is also of type pandas.Series. In addition, the length of the result
> must be the same as the length of the inputs. If it is not, the behavior is
> currently unclear. We should perform an early check for this and provide a
> clear error message.
> See
> http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/PyFlink-Vectorized-UDF-throws-NullPointerException-td42952.html
> for more details.
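
To illustrate the kind of early check proposed above, here is a minimal
sketch in plain Python. It is not the actual Flink change: the decorator name
check_result_length and the wording of the error message are hypothetical,
and the sketch relies only on len(), which pandas.Series supports, so it has
no dependency on pandas or PyFlink.

```python
import functools


def check_result_length(func):
    """Hypothetical helper (not part of PyFlink): fail fast with a clear
    message when a vectorized function returns a result whose length
    differs from the length of its inputs."""

    @functools.wraps(func)
    def wrapper(*args):
        result = func(*args)
        if args:
            # Assumption: all inputs of one batch share the same length,
            # so comparing against the first argument is sufficient.
            expected = len(args[0])
            actual = len(result)
            if actual != expected:
                raise ValueError(
                    "The result of vectorized function '%s' has length %d, "
                    "but the inputs have length %d; the lengths must match."
                    % (func.__name__, actual, expected)
                )
        return result

    return wrapper


@check_result_length
def add(i, j):
    # Element-wise addition; lists stand in for pandas.Series here.
    return [a + b for a, b in zip(i, j)]


@check_result_length
def first_only(i):
    # Deliberately wrong: returns a single element for any batch size.
    return i[:1]
```

Calling add([1, 2, 3], [10, 20, 30]) passes the check, while
first_only([1, 2, 3]) raises a ValueError that names the function and both
lengths instead of failing later in an unrelated place.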