trunk got some patches with the help of Mike Dickun. However, the problem described in this thread was never reproduced by yours truly (:-P) both with 2.3.2 or the latest scheduler ......
I'm always available if something is not working ok with the scheduler.... just send along the log and the most detailed description of the issue you can come up with . On Monday, January 14, 2013 9:44:46 PM UTC+1, Adi wrote: > > Seems like we have the same situation (Version 2.3.2 (2012-12-17 15:03:30) > stable). More and more tasks are just being assigned, but not completed nor > failed. I can send you a debug log via email, if this hasn't been fixed in > trunk? > > Thanks, > Adnan > > On Wednesday, December 19, 2012 2:46:07 PM UTC-5, Niphlod wrote: >> >> yep, I started with "sorry" because I can't reproduce with the newer one >> and the older one. But let me (us) know as soon as logs are filled. >> >> On Wednesday, December 19, 2012 7:52:44 PM UTC+1, JimK wrote: >>> >>> I'll give it a try with debug mode later this afternoon and send along >>> the logs. >>> >>> I'm certain that there's a real bug here and not user error as I tested >>> it quite thoroughly from a blackbox perspective (I didn't know how to turn >>> on logging, doh!). Again, with all other criteria being equal, switching >>> out the schedule.py file out with the 2.12(?) version fixed the problem. >>> >>> Jim >>> >>> On Wednesday, December 19, 2012 1:32:09 AM UTC-8, Niphlod wrote: >>>> >>>> sorry, just tried, can't reproduce the issue. Can you post the logs of >>>> the schedulers ? >>>> Maybe start them adding *-D 0* to activate the DEBUG level >>>> What normally goes on is: >>>> - you start "dir", it's the first one so it becomes the "TICKER", but >>>> it doesn't see any "metrics" worker so it doesn't assign tasks >>>> - you start "metrics", after a while the "dir" worker sees it and >>>> starts assigning tasks to "metrics" >>>> - etc. >>>> >>>> >>>> Il giorno mercoledì 19 dicembre 2012 04:31:32 UTC+1, JimK ha scritto: >>>>> >>>>> I'm not sure what the issue was but replacing the gluon/scheduler.py >>>>> file with the older one fixed the problem. There were a lot of changes >>>>> in >>>>> 2.3.2 so it's hard to pinpoint what broke things. >>>>> >>>>> On Tuesday, December 18, 2012 6:36:29 PM UTC-8, JimK wrote: >>>>>> >>>>>> There appears to be a major bug with the scheduler workers with 2.3.2 >>>>>> in tasks are stuck in the ASSIGNED state if that worker group was not >>>>>> the >>>>>> first one started. This problem did not exist until I upgraded to 2.3.2 >>>>>> this morning. >>>>>> >>>>>> I have 2 task groups in my service (dir, metrics). >>>>>> >>>>>> I start the workers after starting the server like so (art is the >>>>>> application name): >>>>>> /usr/bin/python2.6 /var/www/web2py/web2py.py -K art:dir >>>>>> /usr/bin/python2.6 /var/www/web2py/web2py.py -K art:metrics >>>>>> /usr/bin/python2.6 /var/www/web2py/web2py.py -K art:rap >>>>>> >>>>>> If I start the "dir" worker first and the "metrics", my "dir" group >>>>>> tasks complete fine but the "metrics" group tasks are stuck in the >>>>>> ASSIGNED >>>>>> state. >>>>>> To prove my theory, I then killed all of the workers and stared the >>>>>> "metrics" worker first and then "dir" worker. As hypothesized, the >>>>>> "metrics" group tasks complete but the "dir" group tasks are stuck in >>>>>> the >>>>>> ASSIGNED state. >>>>>> >>>>>> Any ideas??? >>>>>> >>>>> --

