GitHub user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/22469#discussion_r218877589
--- Diff: docs/sql-programming-guide.md ---
@@ -1879,6 +1879,7 @@ working with timestamps in `pandas_udf`s to get the
best performance, see
## Upgrading From Spark SQL 2.3 to 2.4
+ - Since Spark 2.4, when there is a struct field in front of the IN
operator, the inner query must contain a struct field as well. In previous
versions, instead, the fields of the struct were compared to the output of the
inner query. Eg. if `a` is a `struct(a string, b int)`, in Spark 2.4 `a in
(select (1 as a, 'a' as b) from range(1))` is a valid query, while `a in
(select 1, 'a' from range(1))` is not. In previous version it was the opposite.
--- End diff --
Please change `IN operator` to `IN operator before a subquery`; the behavior change applies only when `IN` is followed by a subquery.
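
For reference, a minimal sketch of the two query shapes the note contrasts. The temporary view `t` and its contents are hypothetical, chosen so that column `a` has type `struct<a: string, b: int>`. The comments restate the behavior described in the diff; they are not captured output.

```sql
-- Hypothetical setup: one row, one column `a` of type struct<a: string, b: int>.
CREATE OR REPLACE TEMPORARY VIEW t AS
SELECT named_struct('a', 'a', 'b', 1) AS a;

-- Subquery returns a single struct column.
-- Per the note: valid since Spark 2.4, rejected in earlier versions.
SELECT * FROM t
WHERE a IN (SELECT named_struct('a', 'a', 'b', 1) FROM range(1));

-- Subquery returns two plain columns (string, int).
-- Per the note: rejected since Spark 2.4; earlier versions compared the
-- struct's fields against these columns.
SELECT * FROM t
WHERE a IN (SELECT 'a', 1 FROM range(1));
```

Both statements use `IN` with a subquery, which is the only case the wording change is meant to cover.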