Good morning Joseph,
according to my experience the suspension of distributed(!) parallel
jobs with gridengine does not work correctly, at least not until version
6.2u5. Don't be paranoid but expect unexpected behavior of your jobs,
even if you have easily suspendable jobs:
Example: Subordinate job 1 runs on host A and B. It gets suspended by
superordinate job 2 on host A. If then superordinate job 3 starts on
host B job 1 stays suspended. But when job 1 has finished, job 2
restarts and you have both jobs 1 and 3 running and overloading host B.
You probably need some kind of cronjob to suspend and unsuspend your
parallel jobs correctly. Or does anyone have a patch for this?
Regards, Erik Soyez.
On Tue, 12 Jun 2012, Joseph Farran wrote:
Well, for our needs, we *REALLY* need Parallel Job suspension. It's
not even a choice for us.
If Torque/Maui can do it, I am sure OGE can do it without issues.
Can someone please tell me what patch I need to install to un-break / turn-on
Parallel job suspension?
If you guys are that paranoid about PE suspension, how about adding an on/off
flag for this since the code is already there and let the admin pick?
On 06/12/2012 06:52 AM, Dave Love wrote:
"Joseph A. Farran"<[email protected]> writes:
If you guys are taking requests, *please* add suspension and ignore old
Sun recommendation.
Support for suspension exists, it's just broken (per the issue Reuti
pointed to). The use of | is clearly wrong, but the other bit isn't
clear. It's one of the available patches I wanted to understand before
applying (and had forgotten about). Can anyone cast more light on it?
--
--
Vorstandsvorsitzender/Chairman of the board of management:
Gerd-Lothar Leonhart
Vorstand/Board of Management:
Dr. Bernd Finkbeiner, Michael Heinrichs,
Dr. Arno Steitz, Dr. Ingrid Zech
Vorsitzender des Aufsichtsrats/
Chairman of the Supervisory Board:
Philippe Miltin
Sitz/Registered Office: Tuebingen
Registergericht/Registration Court: Stuttgart
Registernummer/Commercial Register No.: HRB 382196
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users