We use adaptive - Moab torque right now but are thinking about going to Skyrim
Brian On Dec 20, 2016 11:38 AM, "Buterbaugh, Kevin L" < [email protected]> wrote: > Hi Brian, > > It would be helpful to know what scheduling software, if any, you use. > > We were a PBS / Moab shop for a number of years but switched to SLURM two > years ago. With both you can configure the maximum amount of memory > available to all jobs on a node. So we just simply “reserve” however much > we need for GPFS and other “system” processes. > > I can tell you that SLURM is *much* more efficient at killing processes as > soon as they exceed the amount of memory they’ve requested than PBS / Moab > ever dreamed of being. > > Kevin > > On Dec 20, 2016, at 10:27 AM, Skylar Thompson <[email protected]> > wrote: > > We're a Grid Engine shop, and use cgroups (m_mem_free) to control user > process memory > usage. In the GE exec host configuration, we reserve 4GB for the OS > (including GPFS) so jobs are not able to consume all the physical memory on > the system. > > On Tue, Dec 20, 2016 at 11:25:04AM -0500, Brian Marshall wrote: > > All, > > What is your favorite method for stopping a user process from eating up all > the system memory and saving 1 GB (or more) for the GPFS / system > processes? We have always kicked around the idea of cgroups but never > moved on it. > > The problem: A user launches a job which uses all the memory on a node, > which causes the node to be expelled, which causes brief filesystem > slowness everywhere. > > I bet this problem has already been solved and I am just googling the wrong > search terms. > > > Thanks, > Brian > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > -- > -- Skylar Thompson ([email protected]) > -- Genome Sciences Department, System Administrator > -- Foege Building S046, (206)-685-7354 <(206)%20685-7354> > -- University of Washington School of Medicine > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > — > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > [email protected] - (615)875-9633 <(615)%20875-9633> > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss
