I am running it on an 8-node Emulab cluster. We have one master and 7 slaves, each with 3 containers. The capacity scheduler web interface becomes unresponsive, and the YARN logs show that the queue size keeps increasing. The number of concurrently running applications is 2-3, but they always occupy all containers in the cluster.
The queue size increases up to 10,000 on some runs.

Muntasir.

On Tue, Mar 12, 2013 at 9:55 PM, Vinod Kumar Vavilapalli <[email protected]> wrote:

> And what's the number of nodes you have? Number of concurrent running
> applications? ResourceManager log-level?
>
> A 1000 event-queue size isn't really a problem, till what size does it
> keep on increasing? And how did you find out that ResourceManager is
> "freezing", meaning what are the symptoms you were observing?
>
> Thanks,
> +Vinod Kumar Vavilapalli
> Hortonworks Inc.
> http://hortonworks.com/
>
> On Mar 11, 2013, at 11:28 PM, Muntasir Raihan Rahman wrote:
>
> > Hi,
> >
> > I am trying to do some experiments with hadoop yarn. I am submitting a
> > large number of jobs to a queue, but after some time the resource manager
> > freezes, and I get the following yarn log message: "Size of event-queue is
> > 1000", and the size keeps increasing. I tried to increase the
> > node-manager-heartbeat interval from 1sec to 3sec, but I still see the same
> > problem.
> >
> > Can anyone please give me a hint at the problem, and how to avoid resource
> > manager queue build up?
> >
> > Thanks
> > Muntasir.
> >
> > --
> > Best Regards
> > Muntasir Raihan Rahman
> > Email: [email protected]
> > Phone: 1-217-979-9307
> > Department of Computer Science,
> > University of Illinois Urbana Champaign,
> > 3111 Siebel Center,
> > 201 N. Goodwin Avenue,
> > Urbana, IL 61801
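
To answer the "till what size does it keep on increasing" question with numbers, a minimal sketch, assuming the ResourceManager log contains lines with the text "Size of event-queue is N" as quoted above. The function names and the log path are hypothetical; only the message text comes from this thread.

```python
import re

# Matches the message quoted in this thread, e.g. "Size of event-queue is 1000".
# The rest of the log line (timestamp, level, class name) is ignored.
QUEUE_SIZE_RE = re.compile(r"Size of event-queue is (\d+)")


def event_queue_sizes(lines):
    """Return the event-queue sizes reported in the given log lines, in order."""
    sizes = []
    for line in lines:
        match = QUEUE_SIZE_RE.search(line)
        if match:
            sizes.append(int(match.group(1)))
    return sizes


def summarize(sizes):
    """Return (first, last, peak) for a non-empty list of sizes, else None."""
    if not sizes:
        return None
    return sizes[0], sizes[-1], max(sizes)


# Hypothetical usage (the path is an assumption, adjust to your setup):
#   with open("/path/to/resourcemanager.log", errors="replace") as log:
#       print(summarize(event_queue_sizes(log)))
```

A last value close to the peak suggests the queue is still growing at the end of the run, while a last value well below the peak suggests the dispatcher eventually caught up.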
