Omkar Vinit Joshi created YARN-544:
--------------------------------------

             Summary: Failed resource localization might introduce a race condition.
                 Key: YARN-544
                 URL: https://issues.apache.org/jira/browse/YARN-544
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Omkar Vinit Joshi
            Assignee: Omkar Vinit Joshi


When resource localization fails (in either the public localizer or the
private LocalizerRunner), a ContainerResourceFailedEvent is sent to each
container waiting on the resource, and each container then sends a
ResourceReleaseEvent back to the failed resource. Once the LocalizedResource's
ref count drops to 0, its state changes from DOWNLOADING back to INIT.

If the resource receives a ResourceRequestEvent between the
ContainerResourceFailedEvent and the last ResourceReleaseEvent, its ref count
does not drop to 0, so the resource stays in DOWNLOADING and the container
that sent the ResourceRequestEvent keeps waiting.
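
The interleaving can be reproduced in a toy model. The sketch below is not the
NodeManager code: ToyLocalizedResource, its two states, the container IDs, and
the downloadInProgress flag are hypothetical stand-ins, and it assumes that a
download is only started when a request arrives while the resource is in INIT.

```java
import java.util.HashSet;
import java.util.Set;

// Toy model only; hypothetical names, not the real LocalizedResource class.
class ToyLocalizedResource {
    enum State { INIT, DOWNLOADING }

    private State state = State.INIT;
    // Containers currently holding a reference (the "ref count").
    private final Set<String> refs = new HashSet<>();
    private boolean downloadInProgress = false;

    // Stands in for ResourceRequestEvent.
    void request(String containerId) {
        refs.add(containerId);
        if (state == State.INIT) {
            state = State.DOWNLOADING;
            downloadInProgress = true; // assumption: downloads start only from INIT
        }
    }

    // Stands in for the localizer reporting the failure to the containers.
    // The state is not reset here; that only happens on the last release.
    void downloadFailed() {
        downloadInProgress = false;
    }

    // Stands in for ResourceReleaseEvent.
    void release(String containerId) {
        refs.remove(containerId);
        if (refs.isEmpty() && state == State.DOWNLOADING) {
            state = State.INIT;
        }
    }

    // True when a container is waiting on a download that nobody is running.
    boolean isStuck() {
        return state == State.DOWNLOADING && !downloadInProgress && !refs.isEmpty();
    }

    // Runs one of the two orderings and returns the resulting resource.
    static ToyLocalizedResource simulate(boolean requestBeforeLastRelease) {
        ToyLocalizedResource r = new ToyLocalizedResource();
        r.request("container_1");
        r.downloadFailed();
        if (requestBeforeLastRelease) {
            r.request("container_2");  // arrives before the last release
            r.release("container_1");  // ref count stays at 1, state stays DOWNLOADING
        } else {
            r.release("container_1");  // ref count hits 0, state returns to INIT
            r.request("container_2");  // starts a fresh download
        }
        return r;
    }

    public static void main(String[] args) {
        System.out.println("request before last release, stuck: "
                + simulate(true).isStuck());
        System.out.println("request after last release, stuck: "
                + simulate(false).isStuck());
    }
}
```

In this model only the first ordering leaves container_2 waiting on a resource
that is still marked DOWNLOADING with no download running.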
