[ https://issues.apache.org/jira/browse/SPARK-15569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Or resolved SPARK-15569.
-------------------------------
Resolution: Fixed
Assignee: Sital Kedia
Fix Version/s: 2.0.0
Target Version/s: 2.0.0
> Executors spending significant time in DiskObjectWriter.updateBytesWritten
> function
> -----------------------------------------------------------------------------------
>
> Key: SPARK-15569
> URL: https://issues.apache.org/jira/browse/SPARK-15569
> Project: Spark
> Issue Type: Bug
> Components: Shuffle
> Reporter: Sital Kedia
> Assignee: Sital Kedia
> Fix For: 2.0.0
>
>
> While profiling a Spark job that spills a large amount of intermediate data,
> we found that a significant portion of time is spent in the
> DiskBlockObjectWriter.updateBytesWritten function. Looking at the code
> (https://github.com/sitalkedia/spark/blob/master/core/src/main/scala/org/apache/spark/storage/DiskBlockObjectWriter.scala#L206),
> we see that the function is called too frequently to update the number of
> bytes written to disk. We should reduce the call frequency to avoid this
> overhead.
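
The idea of reducing the update frequency can be illustrated with a minimal
sketch. This is not Spark's code: the class CountingWriter, its
update_interval parameter, and the value 1000 are hypothetical, chosen only to
show a bytes-written metric being refreshed every N records instead of on
every record, with one final refresh on close so the total stays exact.

```python
import io


class CountingWriter:
    """Hypothetical writer that refreshes a bytes-written metric periodically."""

    def __init__(self, stream, update_interval=1000):
        self._stream = stream
        # Number of records between metric refreshes (illustrative value).
        self._update_interval = update_interval
        self._records_written = 0
        self._reported_position = 0
        self.bytes_written_metric = 0
        self.metric_updates = 0  # counts how often the refresh ran

    def _update_bytes_written(self):
        # Stand-in for the costly call: query the stream position and
        # publish the bytes written since the last refresh.
        position = self._stream.tell()
        self.bytes_written_metric += position - self._reported_position
        self._reported_position = position
        self.metric_updates += 1

    def write(self, record):
        self._stream.write(record)
        self._records_written += 1
        # Refresh only once per update_interval records.
        if self._records_written % self._update_interval == 0:
            self._update_bytes_written()

    def close(self):
        # Final refresh so the metric covers any records written since
        # the last periodic refresh.
        self._update_bytes_written()


def demo():
    writer = CountingWriter(io.BytesIO(), update_interval=1000)
    for _ in range(10_000):
        writer.write(b"x" * 8)
    writer.close()
    return writer.bytes_written_metric, writer.metric_updates
```

In this sketch, the metric lags the true position by at most
update_interval records between refreshes, which is the trade-off for calling
the refresh far less often.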