[
https://issues.apache.org/jira/browse/SPARK-11775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15008408#comment-15008408
]
Apache Spark commented on SPARK-11775:
--------------------------------------
User 'zjffdu' has created a pull request for this issue:
https://github.com/apache/spark/pull/9766
> Allow PySpark to register Java UDF
> ----------------------------------
>
> Key: SPARK-11775
> URL: https://issues.apache.org/jira/browse/SPARK-11775
> Project: Spark
> Issue Type: New Feature
> Components: PySpark, SQL
> Reporter: Jeff Zhang
>
> Currently pyspark can only call the builtin java UDF, but can not call custom
> java UDF. It would be better to allow that. 2 benefits:
> * Leverage the power of rich third party java library
> * Improve the performance. Because if we use python UDF, python daemons will
> be started on worker which will affect the performance.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]