Nicolai Balbino created SPARK-51262:
---------------------------------------
Summary: exceptAll not working with drop_duplicates using subset
Key: SPARK-51262
URL: https://issues.apache.org/jira/browse/SPARK-51262
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 3.5.3, 3.5.0
Reporter: Nicolai Balbino
When using drop_duplicate with subset and after uses exceptAll method, when
calling some action (isEmpty, show, collect, count) raises a Py4J error.
Searching web, this issues is related here:
[https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-39612,] also
marked as resolved.
I tested locally with version 3.5.3 and also AWS Glue 5.0, using 3.5.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]