Hi, Chrysovalantis.

Thanks a lot for the prompt reply.

Actually, we have many nodes and put all the logs/pids/tmpfiles in one
folder named slurm-xxx on a shared filesystem.
My boss thinks it's a good idea because we won't have to touch every node
on every update. Well, I think it's not bad either. Hah.

Symbolic links are a feasible solution, but I would have to remove the local
symbolic link whenever a node is taken out of the cluster for some other use.
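
For reference, a minimal sketch in Python of how such a per-node link could be
created and later removed. The function names and the example paths are
hypothetical, and it is only an assumption that slurmctld writes its pid file
through a symlink; I have not verified that.

```python
import os
import socket


def link_pidfile(local_path, shared_dir, hostname=None):
    """Point local_path at <shared_dir>/<hostname>.pid and return the target."""
    # Short hostname by default; pass hostname explicitly to override.
    host = hostname or socket.gethostname().split(".")[0]
    target = os.path.join(shared_dir, host + ".pid")
    # Replace a stale link, but never clobber a regular file.
    if os.path.islink(local_path):
        os.unlink(local_path)
    elif os.path.exists(local_path):
        raise FileExistsError(local_path + " exists and is not a symlink")
    os.symlink(target, local_path)
    return target


def unlink_pidfile(local_path):
    """Remove the local link when a node is repurposed; True if a link was removed."""
    if os.path.islink(local_path):
        os.unlink(local_path)
        return True
    return False
```

On master1 this could be called as
link_pidfile("/var/run/slurmctld.pid", "/sharedfs/slurm/run"), where both paths
are placeholders: the first stands for whatever pid file path slurm.conf names,
the second for the shared directory from the example quoted below.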

For now I have put the pid files in /var/run, where they are deleted when the
process is killed. The slurmctld logs of the primary and backup servers go
into one file.

Anyway, thanks a lot for your kind help.
May you have a good day.


2015-06-10 18:57 GMT+08:00 Chrysovalantis Paschoulas <
[email protected]>:

>  Hi!
>
> In our site we store the logs and the pid files of the slurmctld and slurmdbd
> daemons on local disk. That makes it easier to distinguish who is doing what,
> and since the daemons are not running when a node is down, there is no need
> to put them on a shared filesystem. For slurmctld we have the state dir and
> jobcomp on a shared active/active filesystem. For slurmdbd we have an
> active/passive filesystem between the master nodes where we store the
> mysql/mariadb database.
>
> I understand that it is your policy to use shared filesystems for those
> files (and in general it is not a bad idea), but in this case, for the Slurm
> control daemons, you should use local disks (personal opinion).
>
> Anyway, if you insist on using a shared fs then I can only think of one
> solution: use symbolic links. E.g.:
> @master1: path-of-pidfile-in-slurm.conf -> /sharedfs/slurm/run/master1.pid
> @master2: path-of-pidfile-in-slurm.conf -> /sharedfs/slurm/run/master2.pid
>
> Cheers,
> Chrysovalantis Paschoulas
>
>
>
> On 06/10/2015 04:47 AM, Qianqian Sha wrote:
>
> Hi,
>
> We store all slurm logs/pids/tmpfiles on a shared disk. The
> logs/pids/tmpfiles of different slurmds can be distinguished by
> nodename (%n) or hostname (%h), but it seems that the log/pid paths of
> slurmctld or the backup slurmctld do not support %n or %h.
>
> Any assistance or advice is appreciated.
>
> Thanks.
