Re: [PR] [SPARK-47577][SPARK-47579] Correct misleading usage of log key TASK_ID [spark]

via GitHub Wed, 12 Jun 2024 19:39:48 -0700


pan3793 commented on code in PR #46951:
URL: https://github.com/apache/spark/pull/46951#discussion_r1637415038



##########
core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala:
##########
@@ -989,8 +989,8 @@ private[spark] class TaskSetManager(
         if (ef.className == 
classOf[TaskOutputFileAlreadyExistException].getName) {
           // If we can not write to output file in the task, there's no point 
in trying to
           // re-execute it.
-          logError(log"Task ${MDC(TASK_ID, info.id)} in stage ${MDC(STAGE_ID, 
taskSet.id)} " +
-            log"(TID ${MDC(TID, tid)}) can not write to output file: " +
+          logError(log"Task ${MDC(TASK_INFO_ID, info.id)} in stage 
${MDC(STAGE_ID, taskSet.id)} " +

Review Comment:
   for cases like TASK_INFO_ID and TASK_SET_ID, I wondering if we can just 
expose
   
   - TASK_INDEX
   - TASK_ATTEMP_NUM
   - STAGE_ID
   - STAGE_ATTEMP_ID
   
   to the MDC?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] [SPARK-47577][SPARK-47579] Correct misleading usage of log key TASK_ID [spark]

Reply via email to