Hi, I have previously used load_sensors to disable a queue when one of my filesystems reaches 98% usage, but I don't know how to do the same to disable a single exec node rather than a full queue. In the man page I see that "suspend_threshold" is only available for queues, not for exec nodes.
I would like to disable a single exec node when that node can't access /home. Specifically, I want to run a load_sensor on every exec node that just does 'ls /home/username'; if the load_sensor returns FALSE (the filesystem can't be accessed), the node should be disabled so that it doesn't accept any more jobs until the problem is solved. Is this possible using load_sensors, or should I try a different approach?

Many thanks in advance for any help,
Pablo
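
For reference, here is a minimal sketch of the kind of load sensor I have in mind, written in Python. It follows the usual load sensor exchange (wait for a line on stdin, exit on "quit", otherwise print a begin/end block with host:name:value lines). The complex name home_ok and the path /home/username are placeholders I made up, and the host name reported may need to match whatever name Grid Engine uses for the node, so please treat this as an illustration and not a tested setup.

```python
import os
import socket
import sys

# Hypothetical names: adjust to the local setup.
COMPLEX_NAME = "home_ok"       # assumed complex, would need to be defined in the cluster
CHECK_PATH = "/home/username"  # directory whose accessibility is tested


def path_accessible(path):
    """Return True if the directory can be listed, False otherwise."""
    try:
        os.listdir(path)
        return True
    except OSError:
        return False


def format_report(host, name, value):
    """Build one load report block in host:name:value form."""
    return "begin\n{}:{}:{}\nend\n".format(host, name, value)


def main():
    host = socket.gethostname()
    while True:
        line = sys.stdin.readline()
        # Stop on end of input or when asked to quit.
        if not line or line.strip() == "quit":
            break
        value = 1 if path_accessible(CHECK_PATH) else 0
        sys.stdout.write(format_report(host, COMPLEX_NAME, value))
        sys.stdout.flush()


if __name__ == "__main__":
    main()
```

One caveat I am aware of: if /home is a hung network mount, the directory listing itself may block instead of failing, so a real sensor would probably need a timeout around the check. What I am missing is how to turn the reported value into "this node stops accepting jobs".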
