RE: Spark SQL udf(ScalaUdf) is very slow

2015-03-23 Thread Cheng, Hao
This is a very interesting issue. The root cause of the lower performance is probably that, in a Scala UDF, Spark SQL converts the data from its internal representation to the Scala representation recursively, via Scala reflection. Can you create a Jira issue for tracking this? I can start to work on
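
To make the suspected overhead concrete, here is a minimal sketch in plain Scala of what a recursive, per-value conversion in front of a UDF call could look like. It is not Spark's actual code: the `DataType` model, `ConversionSketch`, `toScala`, `wrapUdf`, and the assumed internal encoding (strings as UTF-8 byte arrays, arrays and structs as `Seq[Any]`) are all hypothetical and exist only for illustration.

```scala
// Hypothetical, simplified type model. These are NOT Spark's real classes.
sealed trait DataType
case object IntType extends DataType
case object StringType extends DataType
final case class ArrayType(element: DataType) extends DataType
final case class StructType(fields: Seq[DataType]) extends DataType

object ConversionSketch {
  // Assumed internal form (illustration only): strings are UTF-8 byte arrays,
  // arrays and structs are Seq[Any]. The conversion walks the value recursively,
  // guided by its type, and allocates new objects at every nesting level.
  def toScala(value: Any, dataType: DataType): Any = (value, dataType) match {
    case (null, _)                           => null
    case (bytes: Array[Byte], StringType)    => new String(bytes, "UTF-8")
    case (items: Seq[_], ArrayType(element)) => items.map(toScala(_, element))
    case (values: Seq[_], StructType(fields)) =>
      values.zip(fields).map { case (v, t) => toScala(v, t) }
    case (other, _)                          => other // primitives pass through
  }

  // Wraps a user function so that every call first converts its argument.
  // This cost is paid once per row, which is where the slowdown would come from.
  def wrapUdf(f: Any => Any, argType: DataType): Any => Any =
    internal => f(toScala(internal, argType))
}
```

If the real conversion behaves roughly like this, the cost grows with the nesting depth of the argument type and is paid again for every row, so even a trivial UDF body would be dominated by the conversion.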

Re: RE: Spark SQL udf(ScalaUdf) is very slow

2015-03-23 Thread ??o0/ka????
This is a very interesting issue. The root cause of the lower performance is probably that, in a Scala UDF, Spark SQL converts the data from its internal representation to the Scala representation via Scala reflection