[gridengine users] Restricting / controlling the access to $TMPDIR

Txema Heredia Genestar Wed, 29 Feb 2012 09:08:14 -0800

Hello all,

I want to control the usage of the local disk of our execution nodes. Asfar as I have found, the only related option offered by SGE is theh_fsize limit. But that will not work because it just limits the maximumfile size of any created file in any filesystem, being it the local diskor the NFS shared volume.


What I came around is:

1- Create a load sensor for the usage percentage of the local disk ofeach host.

2- Add that sensor to the Suspend Threshold of all queues.

3- Create a consumable attribute "local_disk", with default value = 0KB(most jobs won't make any use of it)

4- Set the value of "local_disk" in each host

That way, whenever a job is sent, if it requests no disk space, nothinghappens. If the job explicitly requests disk space, the job will bescheduled to a host with enough free space. If that job exceeds therequested disk space, "usually" nothing will happen. But if the jobexceeds its disk space in a node with several other jobs using thatdisk, instead of filling the disk and crash the jobs due to lack ofspace, all jobs will be suspended until the problem is manually fixed.I understand that this is not a true resource limit as with h_vmem, andit requires human conflict solving.


Does anyone have a better idea?

Thanks in advance,

Txema

PS: Another possible option i thought about would be a prolog script(and the epilog cleanup equivalent) that, before the job starts:

1- Creates a group for the jobid, and assigns the group to the user.

2- Creates a group quota for the local disk with the requestedlocal_disk valueBut that would be much more complicated and could add some unwantedcomplexity to the whole system.

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

[gridengine users] Restricting / controlling the access to $TMPDIR

Reply via email to