We're a Grid Engine shop and use cgroups (m_mem_free) to control user process memory usage. In the GE exec host configuration, we reserve 4 GB for the OS (including GPFS), so jobs cannot consume all the physical memory on the system.
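
For illustration only, the arithmetic behind that reservation can be sketched in Python as below. This is not our actual GE configuration: the function names and the RESERVE_BYTES constant are made up for the example, and it assumes a Linux node where /proc/meminfo reports MemTotal in kB.

```python
# Hypothetical sketch: how much memory jobs may use after holding back a reserve.
RESERVE_BYTES = 4 * 1024**3  # 4 GB kept for the OS and GPFS (assumed value from the text)


def parse_mem_total(meminfo_text):
    """Return MemTotal in bytes from /proc/meminfo-style text."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemTotal:"):
            # Line format: "MemTotal:       67108864 kB"
            return int(line.split()[1]) * 1024
    raise ValueError("MemTotal not found")


def job_memory_limit(mem_total_bytes, reserve_bytes=RESERVE_BYTES):
    """Memory available to jobs once the reserve is subtracted (never negative)."""
    return max(mem_total_bytes - reserve_bytes, 0)
```

On a node, one could pass the contents of /proc/meminfo to parse_mem_total() and feed the result to job_memory_limit(); whatever value comes out would still have to be applied through the scheduler's own host configuration.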

On Tue, Dec 20, 2016 at 11:25:04AM -0500, Brian Marshall wrote:
> All,
>
> What is your favorite method for stopping a user process from eating up all
> the system memory and saving 1 GB (or more) for the GPFS / system
> processes? We have always kicked around the idea of cgroups but never
> moved on it.
>
> The problem: A user launches a job which uses all the memory on a node,
> which causes the node to be expelled, which causes brief filesystem
> slowness everywhere.
>
> I bet this problem has already been solved and I am just googling the wrong
> search terms.
>
>
> Thanks,
> Brian
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss

--
-- Skylar Thompson ([email protected])
-- Genome Sciences Department, System Administrator
-- Foege Building S046, (206)-685-7354
-- University of Washington School of Medicine
