[jira] [Assigned] (SPARK-26097) Show partitioning details in DAG UI

Apache Spark (JIRA) Fri, 16 Nov 2018 21:13:33 -0800


     [ 
https://issues.apache.org/jira/browse/SPARK-26097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Apache Spark reassigned SPARK-26097:
------------------------------------

    Assignee: Apache Spark

> Show partitioning details in DAG UI
> -----------------------------------
>
>                 Key: SPARK-26097
>                 URL: https://issues.apache.org/jira/browse/SPARK-26097
>             Project: Spark
>          Issue Type: Improvement
>          Components: Web UI
>    Affects Versions: 2.2.0, 2.2.1, 2.2.2, 2.3.0, 2.3.1, 2.3.2, 2.4.0
>            Reporter: Idan Zalzberg
>            Assignee: Apache Spark
>            Priority: Major
>         Attachments: image (8).png
>
>
> We run complex SQL queries using Spark SQL, often we have to tackle a join 
> skew or incorrect partition count. The problem is that while the Spark UI 
> shows the existence of the problem and what *stage* it is part of, it's hard 
> to infer back to the original SQL query that was given (e.g. what is the 
> specific join operation that is actually skewed).
> One way to resolve this is to relate the Exchange nodes in the DAG to the 
> partitioning that they represent, this is actually a trivial change in code 
> (less than one line) that we believe can greatly benefit the research of 
> performance issues.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Assigned] (SPARK-26097) Show partitioning details in DAG UI

Reply via email to