[
https://issues.apache.org/jira/browse/SPARK-51262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18063663#comment-18063663
]
Holden Karau commented on SPARK-51262:
--------------------------------------
Can you share a repro?
> exceptAll not working with drop_duplicates using subset
> -------------------------------------------------------
>
> Key: SPARK-51262
> URL: https://issues.apache.org/jira/browse/SPARK-51262
> Project: Spark
> Issue Type: Bug
> Components: PySpark
> Affects Versions: 3.5.0, 3.5.3
> Reporter: Nicolau Balbino
> Priority: Minor
> Labels: SQL
>
> When using drop_duplicate with subset and after use exceptAll method, when
> calling some action (isEmpty, show, collect, count) raises a Py4J error.
> Searching web, this issues is related here:
> [https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-39612,]
> also marked as resolved.
> I tested locally with version 3.5.3 and also AWS Glue 5.0, using 3.5.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]