Hi,

I have previously used load_sensors to disable a queue when one of my
filesystems reach 98% of occupation but I don't know how to do the
same to disable a single exec node, not a full queue. In the man I see
the "suspend_threshold" is only available for queues but not for exec
nodes.

I would like to disable single exec nodes in case the node can't acces
/home. Exactly what I am trying to achieve is to run a load_sensor in
every exec node  just doing 'ls /home/username' and if this
load_sensor returns a FALSE (can't access the filesystem) then just
disable the node so it doesn't accept more jobs until the problem is
solved.

is this possible using load_sensors or should try a different approach?

many thanks in advance for any help
Pablo.
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to