zhengchenyu commented on PR #235:
URL: https://github.com/apache/tez/pull/235#issuecomment-1220522116

   > thanks for this patch @zhengchenyu!
   > 
   > can you include a unit test to TestTaskScheduler which confirms that a
   > TaskScheduler returns Resource(0,0) even if the RM client returned null?
   >
   > I'm not familiar with yarn federation, but defaulting to Resource(0,0)
   > makes sense in edge cases. Can you please clarify if this is specific to
   > yarn federation or can happen without yarn federation too? (it has never
   > been reported yet) Why does it return null? Does it reflect the state of a
   > specific RM or the whole cluster of RMs?
   
   It happens only with YARN federation; it never happens without it.
   In fact, YARN-8933 has already fixed it. With YARN-8933 applied, it no
   longer happens in YARN federation either.
   I am not sure whether it is worth continuing with this patch: it is not a
   problem on the latest Hadoop version, but it is still a problem on some
   popular versions (for example, hadoop-3.2.1).
   If you think it is necessary, I will add a unit test. If you think it is
   not necessary, I will close this PR.
   
   
   ### Why does it return null in YARN federation?
   That is a separate issue on the YARN side.
   The YARN router uses asynchronous threads to connect to the RMs. When all
   downstream ResourceManagers time out, the router may return null. After
   YARN-8933, it returns Resource(0,0) instead.
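
A minimal sketch of the defaulting behaviour discussed above, assuming the scheduler reads available resources from an RM client that may hand back null. The `Res` class is a hypothetical stand-in for YARN's `Resource`, and `availableOrZero` and the `Supplier` parameter are illustrative names, not the code in this PR:

```java
import java.util.function.Supplier;

public class NullSafeHeadroom {

    /** Hypothetical stand-in for YARN's Resource record (memory, vcores). */
    static final class Res {
        final long memoryMb;
        final int vcores;

        Res(long memoryMb, int vcores) {
            this.memoryMb = memoryMb;
            this.vcores = vcores;
        }

        @Override
        public String toString() {
            return "Resource(" + memoryMb + "," + vcores + ")";
        }
    }

    /** Shared zero value used whenever the RM client reports nothing. */
    static final Res ZERO = new Res(0, 0);

    /**
     * Returns what the RM client reports, or Resource(0,0) if it reports null.
     * The Supplier models the RM client call; it is an assumption made for
     * this sketch, not the real client interface.
     */
    static Res availableOrZero(Supplier<Res> rmClient) {
        Res reported = rmClient.get();
        return reported != null ? reported : ZERO;
    }

    public static void main(String[] args) {
        // Router timed out on every downstream RM: the client yields null.
        System.out.println(availableOrZero(() -> null));
        // Normal case: the reported value is passed through unchanged.
        System.out.println(availableOrZero(() -> new Res(4096, 4)));
    }
}
```

A unit test along the lines requested in the review could then stub the RM client to return null and assert that the scheduler reports zero memory and zero vcores.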


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
