Ngone51 commented on a change in pull request #31876:
URL: https://github.com/apache/spark/pull/31876#discussion_r618528752



##########
File path: core/src/main/scala/org/apache/spark/TaskEndReason.scala
##########
@@ -81,7 +81,7 @@ case object Resubmitted extends TaskFailedReason {
  */
 @DeveloperApi
 case class FetchFailed(
-    bmAddress: BlockManagerId,  // Note that bmAddress can be null
+    bmAddress: Location,  // Note that bmAddress can be null

Review comment:
       I think we'll have to add interfaces to `Location` in order to support 
json serde , e.g.,
   
   ```java
   
   public JValue serializeToJson();
   
   public Location deserilaizeFromJson(json: JValue);
   
   ```
   
   > How to handle different shuffle impls being in play for the same event 
directory.
   
   Adding metadata is good idea, we can have the format like,
   
   
   ```json
   "Mapstatus Location": {
     "type":  "xx.yy.zz", // qualified class name
     "content": { // content is generate by `Location.serializeToJson`
       "aaa":"bbb"
      }
   }
   ```
   
   with the constant format, end-users and SHS are able to parse them as well.
   
   (I had an idea about SHS is to add the location type as the extension of the 
event log file. That's the way what compression does now.  But I think it 
doesn't solve the problem of REST case.)
   
   BTW, I have added https://issues.apache.org/jira/browse/SPARK-35188 for this 
support.
   
   
   
   




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to