[ 
https://issues.apache.org/jira/browse/SPARK-16683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Witold Jędrzejewski updated SPARK-16683:
----------------------------------------
    Description: 
When I join a dataframe, group by a field from it, then join it again by 
different field and group by field from it, second aggregation does not trigger.

Minimal example showing the problem is attached as the text to paste into 
spark-shell (code_2.0.txt).
The detailed description and minimal example, workaround and possible cause are 
in the attachment, in a form of Zeppelin notebook.

  was:
When I join a dataframe, group by a field from it, then join it again by 
different field and group by field from it, second aggregation does not trigger.

The detailed description and minimal example, workaround and possible cause are 
in the attachment, in a form of Zeppelin notebook.


> Group by does not work after multiple joins of the same dataframe
> -----------------------------------------------------------------
>
>                 Key: SPARK-16683
>                 URL: https://issues.apache.org/jira/browse/SPARK-16683
>             Project: Spark
>          Issue Type: Bug
>          Components: Optimizer
>    Affects Versions: 1.6.0, 1.6.1, 1.6.2, 2.0.0
>         Environment: local and yarn
>            Reporter: Witold Jędrzejewski
>         Attachments: Duplicates Problem Presentation.json, code_2.0.txt
>
>
> When I join a dataframe, group by a field from it, then join it again by 
> different field and group by field from it, second aggregation does not 
> trigger.
> Minimal example showing the problem is attached as the text to paste into 
> spark-shell (code_2.0.txt).
> The detailed description and minimal example, workaround and possible cause 
> are in the attachment, in a form of Zeppelin notebook.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to