[
https://issues.apache.org/jira/browse/SPARK-51262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicolau Balbino updated SPARK-51262:
------------------------------------
Description:
When using drop_duplicates with a subset and then calling exceptAll, any
subsequent action (isEmpty, show, collect, count) raises a Py4J error.
Searching the web, this issue appears to be related to
[https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-39612],
which is marked as resolved.
I tested locally with version 3.5.3 and also on AWS Glue 5.0, using Spark 3.5.
was:
When using drop_duplicate with subset and after uses exceptAll method, when
calling some action (isEmpty, show, collect, count) raises a Py4J error.
Searching web, this issues is related here:
[https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-39612,] also
marked as resolved.
I tested locally with version 3.5.3 and also AWS Glue 5.0, using 3.5.
> exceptAll not working with drop_duplicates using subset
> -------------------------------------------------------
>
> Key: SPARK-51262
> URL: https://issues.apache.org/jira/browse/SPARK-51262
> Project: Spark
> Issue Type: Bug
> Components: PySpark
> Affects Versions: 3.5.0, 3.5.3
> Reporter: Nicolau Balbino
> Priority: Minor
> Labels: SQL
>
> When using drop_duplicates with a subset and then calling exceptAll, any
> subsequent action (isEmpty, show, collect, count) raises a Py4J error.
> Searching the web, this issue appears to be related to
> [https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-39612],
> which is marked as resolved.
> I tested locally with version 3.5.3 and also on AWS Glue 5.0, using Spark 3.5.
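
The report does not include a reproduction, so the following is only a minimal sketch of the call sequence it describes: drop_duplicates with a subset, then exceptAll, then an action. The helper name `run_sequence`, the column names, and the sample rows are all hypothetical and are not taken from the issue. Whether this exact data triggers the error is not confirmed.

```python
from typing import Any

# Hypothetical sample data; the original report does not include its rows or schema.
COLUMNS = ["id", "name", "value"]
ROWS_LEFT = [(1, "a", 10), (1, "a", 20), (2, "b", 30)]
ROWS_RIGHT = [(2, "b", 30)]


def run_sequence(spark: Any) -> int:
    """Run drop_duplicates(subset=...), then exceptAll, then an action.

    `spark` is expected to be a pyspark.sql.SparkSession.
    """
    left = spark.createDataFrame(ROWS_LEFT, COLUMNS)
    right = spark.createDataFrame(ROWS_RIGHT, COLUMNS)

    # Deduplicate on a subset of the columns, as described in the report.
    deduped = left.drop_duplicates(subset=["id", "name"])

    # exceptAll returns the rows of `deduped` that are not in `right`,
    # preserving duplicates.
    diff = deduped.exceptAll(right)

    # Any action forces evaluation; per the report, this is the point
    # at which the Py4J error is raised.
    return diff.count()
```

To try it, create a local session, for example with `SparkSession.builder.master("local[1]").getOrCreate()`, and call `run_sequence(spark)`. The report names isEmpty, show, and collect as failing in the same way, so any of them could be swapped in for `count()`.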