Hi Arnau, Am Montag, den 02.03.2015, 09:34 +0100 schrieb Arnau Bria: > Hi Andreas, > > > over the weekend we managed to provoke an identical behaviour. Jobs > > crash during the epilog phase when the job's CGroup gets removed. > > So UGE 8.2.1 + Kernel is 2.6.32-504.8.1?
Yes. > what kernel are you running in your production cluster? did you see > this problem there, too? I can' upgrade to newer kernel because many > nodes reboot and we lose many many jobs... Our production cluster is still on UGE 8.0.1 without cgroup support - so it is unaffected. In a "safe environment" you might want to consider a kernel downgrade. In our case this is clearly not an option. So this issue just moves the UGE 8.2.x production status to some unspecified time in future :-( Cheers, Andreas -- | Andreas Haupt | E-Mail: [email protected] | DESY Zeuthen | WWW: http://www-zeuthen.desy.de/~ahaupt | Platanenallee 6 | Phone: +49/33762/7-7359 | D-15738 Zeuthen | Fax: +49/33762/7-7216
