GitHub user cloud-fan commented on the issue: https://github.com/apache/spark/pull/23131 Shall we say `union` is an alias of `unionAll`, instead of saying `unionAll` is an alias of `union`? According to the SQL spec, `unionAll` is implemented correctly in that it keeps duplicated rows, while `union` does not follow the SQL spec because it also keeps them. However, `union` is too widely used, and it's too late to change its behavior.
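
To illustrate the SQL semantics referred to above, here is a minimal sketch using Python's built-in `sqlite3` module rather than Spark. The table names `a` and `b`, the column `x`, and the helper `run_union_queries` are hypothetical and chosen only for this example. In SQL, `UNION ALL` keeps duplicated rows and plain `UNION` removes them; Spark's Dataset `union` and `unionAll` both behave like the `UNION ALL` result here.

```python
import sqlite3


def run_union_queries():
    """Return (UNION ALL rows, UNION rows) for two small example tables."""
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    # Hypothetical example tables: `a` holds 1, 2, 2 and `b` holds 2, 3.
    cur.execute("CREATE TABLE a (x INTEGER)")
    cur.execute("CREATE TABLE b (x INTEGER)")
    cur.executemany("INSERT INTO a VALUES (?)", [(1,), (2,), (2,)])
    cur.executemany("INSERT INTO b VALUES (?)", [(2,), (3,)])

    # UNION ALL concatenates the inputs and keeps every duplicated row.
    union_all = sorted(
        row[0] for row in cur.execute("SELECT x FROM a UNION ALL SELECT x FROM b")
    )
    # Plain UNION removes duplicated rows from the combined result.
    union_distinct = sorted(
        row[0] for row in cur.execute("SELECT x FROM a UNION SELECT x FROM b")
    )
    conn.close()
    return union_all, union_distinct


union_all_rows, union_rows = run_union_queries()
print(union_all_rows)  # [1, 2, 2, 2, 3]
print(union_rows)      # [1, 2, 3]
```

In Spark, the deduplicated result would correspond to calling `union` followed by `distinct`.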