[ https://issues.apache.org/jira/browse/SPARK-14486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
meiyoula updated SPARK-14486: ----------------------------- Description: For partition table, when partition rdds do some maps, the rdd number will multiple grow. So rdd number in dag will become thousands, and occurs oom. Can we make a improvement to reduce the rdd number in dag. show the same rdds just one time, not each partition. As the screen shot shows "HiveTableScan" cluster has thousands same rdds. was: For partition table, when partition rdds do some maps, the rdd number will multiple grow. So rdd number in dag will become thounds, and occurs oom. Can we make a improvement to reduce the rdd number in dag. show the same rdds just one time, not each partition. > For partition table, the dag occurs oom because of too many same rdds > --------------------------------------------------------------------- > > Key: SPARK-14486 > URL: https://issues.apache.org/jira/browse/SPARK-14486 > Project: Spark > Issue Type: Bug > Reporter: meiyoula > Attachments: screenshot-1.png > > > For partition table, when partition rdds do some maps, the rdd number will > multiple grow. So rdd number in dag will become thousands, and occurs oom. > Can we make a improvement to reduce the rdd number in dag. show the same rdds > just one time, not each partition. > As the screen shot shows "HiveTableScan" cluster has thousands same rdds. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org