Hai Lu created SAMZA-2601:
-----------------------------

             Summary: Continuous heavy logging when YARN does not allocate 
resources for a container
                 Key: SAMZA-2601
                 URL: https://issues.apache.org/jira/browse/SAMZA-2601
             Project: Samza
          Issue Type: Bug
            Reporter: Hai Lu
            Assignee: Hai Lu


If YARN does not immediately allocate resources for a container and host 
affinity is disabled for the job, then the container allocator will spam the 
logs with messages about the request, even if the request is considered 
expired. This is because the allocator loops over the pending requests to check 
if they have been allocated, but there is no delay between loop iterations, so 
it might just keep checking the same pending request over and over as fast as 
possible. A metric is also incremented in this flow, so that metric is 
extremely high, which is probably not the intention.

One case we saw is that if YARN does not have enough capacity, then it will not 
return resources, so the AM gets into this logging loop:

Handling assignment for Processor ID:

Did not find any allocated containers for running Processor ID:

...

Functionally, I did not observe impact, since YARN is out of capacity anyways, 
so the job isn't going to be deployed in any case. However, the AM basically 
goes into an infinite polling loop if YARN is out of capacity, and this will 
fill up disk with logs very quickly (3GB/s).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to