And what's the number of nodes you have? Number of concurrent running applications? ResourceManager log-level?
A 1000 event-queue size isn't really a problem, till what size does it keep on increasing? And how did you find out that ResourceManager is "freezing", meaning what are the symptoms you were observing? Thanks, +Vinod Kumar Vavilapalli Hortonworks Inc. http://hortonworks.com/ On Mar 11, 2013, at 11:28 PM, Muntasir Raihan Rahman wrote: > Hi, > > I am trying to do some experiments with hadoop yarn. I am submitting a > large number of jobs to a queue, but after some time the resource manager > freezes, and I get the following yarn log message: "Size of event-queue is > 1000", and the size keeps increasing. I tried to increase the > node-manager-heartbeat interval from 1sec to 3sec, but I still see the same > problem. > > Can anyone please give me a hint at the problem, and how to avoid resource > manager queue build up? > > Thanks > Muntasir. > > -- > Best Regards > Muntasir Raihan Rahman > Email: [email protected] > Phone: 1-217-979-9307 > Department of Computer Science, > University of Illinois Urbana Champaign, > 3111 Siebel Center, > 201 N. Goodwin Avenue, > Urbana, IL 61801
