[
https://issues.apache.org/jira/browse/SPARK-36070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17377938#comment-17377938
]
Apache Spark commented on SPARK-36070:
--------------------------------------
User 'yaooqinn' has created a pull request for this issue:
https://github.com/apache/spark/pull/33279
> Add time cost info for writing rows out and committing the task.
> ----------------------------------------------------------------
>
> Key: SPARK-36070
> URL: https://issues.apache.org/jira/browse/SPARK-36070
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 3.2.0
> Reporter: Kent Yao
> Priority: Minor
>
> We have a job with a stage that contains about 8k tasks. Most tasks take
> about 1~10 min to finish, but 3 of them run extremely slowly. They take
> about 1 hour each to finish, and so do their speculative attempts. The root
> cause is most likely delay in the storage system. On the Spark side, we can
> record the time cost in logs for better bug hunting or performance tuning.
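
The idea in the description above (time the row-writing phase and the commit
phase separately, then log both) can be sketched roughly as follows. This is
only an illustration in plain Python, not the code from the pull request and
not Spark's API: `run_write_task`, `write_row`, `commit`, and the logger name
are all hypothetical names chosen for the example.

```python
import logging
import time
from typing import Callable, Iterable, Tuple

# Hypothetical logger name, chosen for this sketch only.
logger = logging.getLogger("write_task_timing")


def run_write_task(
    task_id: str,
    rows: Iterable[str],
    write_row: Callable[[str], None],
    commit: Callable[[], None],
) -> Tuple[float, float]:
    """Write all rows, then commit; return (write_seconds, commit_seconds)."""
    # Phase 1: time how long it takes to write every row out.
    write_start = time.monotonic()
    for row in rows:
        write_row(row)
    write_seconds = time.monotonic() - write_start

    # Phase 2: time the commit separately, since a slow storage system
    # may show up here rather than in the per-row writes.
    commit_start = time.monotonic()
    commit()
    commit_seconds = time.monotonic() - commit_start

    logger.info(
        "Task %s: wrote rows in %.3f s, committed in %.3f s",
        task_id, write_seconds, commit_seconds,
    )
    return write_seconds, commit_seconds
```

With the two durations logged per task, a straggler can be attributed to
either the write phase or the commit phase by reading the task's log line.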
--
This message was sent by Atlassian Jira
(v8.3.4#803005)