[
https://issues.apache.org/jira/browse/SPARK-3694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228580#comment-14228580
]
Ilya Ganelin commented on SPARK-3694:
-------------------------------------
Hi Patrick - I am working on it - I am just trying to finalize a test for this.
The reason I asked about task serialization is that in the description you talk
about task serialization within the TaskSetManager, not the task serialization
within the DAGScheduler - for the DAGScheduler you only mention RDD
serialization. I wanted to confirm whether to print the task serialization for
the DAGScheduler as well as the task serialization for the TaskSetManager.
> Allow printing object graph of tasks/RDD's with a debug flag
> ------------------------------------------------------------
>
> Key: SPARK-3694
> URL: https://issues.apache.org/jira/browse/SPARK-3694
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Reporter: Patrick Wendell
> Assignee: Ilya Ganelin
> Labels: starter
>
> This would be useful for debugging extra references inside of RDD's
> Here is an example for inspiration:
> http://ehcache.org/xref/net/sf/ehcache/pool/sizeof/ObjectGraphWalker.html
> We'd want to print this trace for both the RDD serialization inside of the
> DAGScheduler and the task serialization in the TaskSetManager.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]