Github user viirya commented on the issue:
https://github.com/apache/spark/pull/19229
@zhengruifeng Yeah, it is better. Actually, the difference between running
multiple `withColumn` calls and one `withColumns` call is mainly in the cost of query
analysis and plan/dataset initialization. I will re-run the benchmark.
