pan3793 commented on code in PR #46951:
URL: https://github.com/apache/spark/pull/46951#discussion_r1637415038
##########
core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala:
##########
@@ -989,8 +989,8 @@ private[spark] class TaskSetManager(
if (ef.className ==
classOf[TaskOutputFileAlreadyExistException].getName) {
// If we can not write to output file in the task, there's no point
in trying to
// re-execute it.
- logError(log"Task ${MDC(TASK_ID, info.id)} in stage ${MDC(STAGE_ID,
taskSet.id)} " +
- log"(TID ${MDC(TID, tid)}) can not write to output file: " +
+ logError(log"Task ${MDC(TASK_INFO_ID, info.id)} in stage
${MDC(STAGE_ID, taskSet.id)} " +
Review Comment:
for cases like TASK_INFO_ID and TASK_SET_ID, I wondering if we can just
expose
- TASK_INDEX
- TASK_ATTEMP_NUM
- STAGE_ID
- STAGE_ATTEMP_ID
to the MDC?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]