Vladimir Feinberg created SPARK-16504:
-----------------------------------------
Summary: UDAF should be typed
Key: SPARK-16504
URL: https://issues.apache.org/jira/browse/SPARK-16504
Project: Spark
Issue Type: Improvement
Components: SQL
Reporter: Vladimir Feinberg
Currently, UDAFs can be implemented by using a generic
{{MutableAggregationBuffer}}. This type-less class requires the user specify
the schema.
If the user wants to create vector output from a UDAF, this requires specifying
an output schema with a VectorUDT(), which is only accessible through a
DeveloperApi.
Since we would prefer not to expose VectorUDT, the only option would be to
resolve the user's inability to (legally) specify a schema containing a
VectorUDT the same way that we would do so for creating dataframes: by type
inference, just like createDataFrame does.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]