Hi,
We are building a webapp which will utilize data stored in the slurm msyql db.
Is there anything wrong with adding a read only user that the app can use
indirectly to cache statistics? I don’t see any issue with it, but I’m curious
if it would get in the way of any normal slurm
Hi,
We are trying to limit 300 CPU usage per user in our cluster.
We have tried:
sacctmgr modify qos normal set Grpcpus=300
and
sacctmgr modify user username set GrpCPUs=300
Both seems to allow job to run which asking for 308 CPUs.
Is there other way to implement this requirement?
Hi,
I have a few nodes in the cluster which hangs every job in complete state
and will not return to idle.
I cannot find out why.
All nodes are running same OS (diskless image).
>From log I only see:
--
[2016-10-12T08:44:08.133] error: we don't have select plugin type 102
On 10/18/2016 10:07 AM, cfernanrodri . wrote:
> I am managing a small machine but I am not sysadmin or so.
>
> For the moment Slrum is working fine but jobs are runing in a FIFO
> scheduling, I would like to implement priorities per user, without a DB.
>
> Is possible to do it with txt accounting?
I am managing a small machine but I am not sysadmin or so.
For the moment Slrum is working fine but jobs are runing in a FIFO
scheduling, I would like to implement priorities per user, without a DB.
Is possible to do it with txt accounting? if not, can I set at least
fair-sharing with *only* txt
Hi!
> For the same time period, what is the reserved column say for "sreport -T
> CPU -t MinPer cluster utilization"? Meaning, not by account. If that
> Reserved is 0 for the cluster overall, then that does explain why it's also
> zero for all accounts. If there is a discrepancy, then
Reserved time in sreport is time nodes are held idle (by the backfill
scheduler) to start the job. If you aren't using backfill, or if all
job submissions request about the same quantity of hardware resources
then it may always be zero. If there were some users submitting large
jobs and some
Hi,
I have a pending job with a time limit of 2 days, it is assigned by
default to "normal" qos that has a limit of 1 day.
When I realize that it is in PENDING
Reason=QOSMaxWallDurationPerJobLimit, i move it to lowprio qos:
$] scontrol update job 2767745 qos=lowprio
Then, I check the wall
Hi,
It seems that right-now (or at least in version 15.08.9) the column Reserved
in a UserUtilizationByAccount sreport is always 0, like Idle and Down.
For example:
sreport -T CPU -t MinPer cluster UserUtilizationByAccount
Format=TresName%4,Login,Used,Reserved,Idle,Down Start=`date -d "last