I just found that MutableAggregationBuffer.update converts the data on every update, which is very costly when the buffer holds something like a Map or an Array.
This makes it hard to implement a collect_set UDAF: because the whole collection is converted on each update, aggregating n values costs O(n^2). Any advice?
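
To make the O(n^2) concern concrete, here is a minimal sketch of the cost in plain Python. It is a toy model, not Spark code: it assumes each update converts (copies) the whole buffered collection once on the way out of the buffer and once on the way back in, and it only counts copied elements. The function names are hypothetical and chosen for illustration.

```python
# Toy cost model, not Spark code. Assumption: every update converts the whole
# buffered collection out of the buffer and back in, copying each element.

def copies_with_conversion(values):
    """Simulate a collect_set where each update round-trips the whole set."""
    stored = frozenset()   # stands in for the buffer's internal representation
    copied = 0             # total number of elements copied by conversions
    for v in values:
        working = set(stored)        # convert out of the buffer: copies len(stored) elements
        copied += len(stored)
        working.add(v)
        stored = frozenset(working)  # convert back into the buffer: copies len(working) elements
        copied += len(working)
    return set(stored), copied


def copies_in_place(values):
    """Simulate a collect_set that mutates one set and never converts it."""
    working = set()
    copied = 0             # no per-update conversion, so nothing is copied
    for v in values:
        working.add(v)
    return working, copied


if __name__ == "__main__":
    for n in (10, 100, 1000):
        _, converted = copies_with_conversion(range(n))
        _, in_place = copies_in_place(range(n))
        print(n, converted, in_place)
```

Under this model, step k copies k elements out and k + 1 back in, so n distinct values cost n^2 copied elements in total, while the in-place version copies none. Whether the real conversion cost per element matches this model is an assumption; the sketch only shows why the growth is quadratic.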