[ https://issues.apache.org/jira/browse/SPARK-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Patrick Wendell updated SPARK-2086:
-----------------------------------
    Description:
It would be nice if the toDebugString method of an RDD did a better job of explaining where shuffle boundaries occur in the lineage graph. One way to do this would be to only indent the tree at a shuffle boundary instead of indenting it for every parent.

We can determine when a shuffle boundary occurs based on the type of dependency seen in the RDD.

  was:
It would be nice if the toDebugString method of an RDD did a better job of explaining where shuffle boundaries occur in the lineage graph. One way to do this would be to only indent the tree at a shuffle boundary instead of indenting it for every parent.

> Improve output of toDebugString to make shuffle boundaries more clear
> ---------------------------------------------------------------------
>
>                 Key: SPARK-2086
>                 URL: https://issues.apache.org/jira/browse/SPARK-2086
>             Project: Spark
>          Issue Type: Improvement
>            Reporter: Patrick Wendell
>            Assignee: Gregory Owen
>            Priority: Minor
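
To illustrate the proposal, here is a minimal sketch of indenting a lineage tree only at shuffle boundaries. It is a toy model, not Spark code: the `Node` and `Dep` classes, the `shuffle` flag, and the operation names used as labels are all assumptions made for this example. In Spark itself the decision would presumably rest on the dependency type the description mentions, rather than on a boolean flag.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Dep:
    """Hypothetical stand-in for an RDD dependency (not a Spark class)."""
    parent: "Node"
    shuffle: bool = False  # True models a shuffle dependency, False a narrow one


@dataclass
class Node:
    """Hypothetical stand-in for an RDD in the lineage graph."""
    name: str
    deps: List[Dep] = field(default_factory=list)


def debug_lines(node: Node, depth: int = 0) -> List[str]:
    """Render the lineage, increasing the indent only across shuffle dependencies."""
    lines = ["  " * depth + node.name]
    for dep in node.deps:
        # A narrow dependency keeps the parent at the same indent level;
        # a shuffle dependency starts a new, deeper level.
        child_depth = depth + 1 if dep.shuffle else depth
        lines.extend(debug_lines(dep.parent, child_depth))
    return lines


def debug_string(node: Node) -> str:
    return "\n".join(debug_lines(node))


# Toy lineage: textFile -> map -> reduceByKey (shuffle) -> filter
source = Node("textFile")
mapped = Node("map", [Dep(source)])
reduced = Node("reduceByKey", [Dep(mapped, shuffle=True)])
filtered = Node("filter", [Dep(reduced)])

print(debug_string(filtered))
```

In this sketch, everything printed at the same indent level sits on the same side of a shuffle, so each change in indentation marks one shuffle boundary.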