[slurm-dev] Re: successful systemd service start on RHEL7?

2015-03-25 Thread David Bigagli


The slurm.spec file decides whether to install the init.d scripts or the 
systemd service files.
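
To see which of the two a given package actually installed, the file list from 'rpm -ql' can be filtered for init scripts and unit files. This is only a minimal sketch: the helper name startup_files is made up for illustration, and it assumes unit files can be recognised by their .service suffix.

```shell
#!/bin/sh
# Hypothetical helper: read a package file list (one path per line on stdin)
# and keep only SysV init scripts and systemd unit files.
startup_files() {
    # "|| true" keeps the exit status at 0 when nothing matches.
    grep -E '^/etc/(rc\.d/)?init\.d/|\.service$' || true
}

# Example (package name as used in this thread):
#   rpm -ql slurm | startup_files
```

If only /etc/init.d/slurm comes back, the package was built with the init script rather than the unit files.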


--

Thanks,
  /David/Bigagli

www.schedmd.com


[slurm-dev] Re: successful systemd service start on RHEL7?

2015-03-24 Thread Paul Edmon


I tried building Slurm 14.11.4 on CentOS7, but it never quite worked 
right.  I'm not sure whether it has been vetted for RHEL7 yet.  I didn't 
dig too deeply when I built it, though, as I just figured it wasn't ready 
for RHEL7.


-Paul Edmon-

On 03/24/2015 10:32 AM, Fred Liu wrote:

Hi,


Has anyone successfully started the Slurm systemd service on RHEL7?
Mine fails as follows:

[root@cnlnx03 system]# systemctl start slurmctld
Job for slurmctld.service failed. See 'systemctl status slurmctld.service' and 'journalctl -xn' for details.
[root@cnlnx03 system]# systemctl status slurmctld.service
slurmctld.service - Slurm controller daemon
Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled)
Active: failed (Result: timeout) since Tue 2015-03-24 22:22:46 CST; 4min 32s ago

Mar 24 22:21:05 cnlnx03 slurmctld[20561]: init_requeue_policy: kill_invalid_depend is set to 0
Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Recovered state of 0 reservations
Mar 24 22:21:05 cnlnx03 slurmctld[20561]: read_slurm_conf: backup_controller not specified.
Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Running as primary controller
Mar 24 22:22:05 cnlnx03 slurmctld[20561]: SchedulerParameters=default_queue_depth=100,max_rpc_cnt=0,max_sched_tim...pth=0
Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating.
Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received
Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state
Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon.
Mar 24 22:22:46 cnlnx03 systemd[1]: Unit slurmctld.service entered failed state.
Hint: Some lines were ellipsized, use -l to show in full.
[root@cnlnx03 system]# journalctl -xn
-- Logs begin at Wed 2015-03-11 17:23:37 CST, end at Tue 2015-03-24 22:25:02 CST. --
Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating.
Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received
Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state
Mar 24 22:22:46 cnlnx03 slurmctld[20561]: layouts: all layouts are now unloaded.
Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon.
-- Subject: Unit slurmctld.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel


[slurm-dev] Re: successful systemd service start on RHEL7?

2015-03-24 Thread Fred Liu


 -----Original Message-----
 From: David Bigagli [mailto:da...@schedmd.com]
 Sent: Wednesday, March 25, 2015 1:19
 To: slurm-dev
 Subject: [slurm-dev] Re: successful systemd service start on RHEL7?
 
 
 It should build just fine on CentOS7; we tried it several times. In the
 etc directory under the source code there are examples of service files.
 The rpm installs the service files as well.

Which rpm contains the service files? I only see an init script in the file list:

rpm -ql slurm-14.11.5-1.el7.x86_64
/etc/init.d/slurm
/etc/ld.so.conf.d
/etc/ld.so.conf.d/slurm.conf
/etc/slurm
/etc/slurm/cgroup.conf.example
/etc/slurm/cgroup.release_common.example
/etc/slurm/cgroup/release_cpuset
/etc/slurm/cgroup/release_freezer
/etc/slurm/cgroup/release_memory
/etc/slurm/cgroup_allowed_devices_file.conf.example
/etc/slurm/slurm.conf.example
/etc/slurm/slurm.epilog.clean
/usr/bin/sacct
/usr/bin/sacctmgr
/usr/bin/salloc
/usr/bin/sattach
...
...
...

And yes, I copied the sample service file from the etc folder under the 
source tree and modified it as follows:

[Unit]
Description=Slurm controller daemon
After=network.target
ConditionPathExists=/tool/slurm/14.11.5/etc/slurm.conf

[Service]
Type=forking
#EnvironmentFile=/etc/default/slurmctld
ExecStart=/tool/slurm/14.11.5/sbin/slurmctld
PIDFile=/run/slurm/slurmctld.pid

[Install]
WantedBy=multi-user.target

Thanks.

Fred


[slurm-dev] Re: successful systemd service start on RHEL7?

2015-03-24 Thread David Bigagli


It should build just fine on CentOS7; we tried it several times. In the 
etc directory under the source code there are examples of service files.

The rpm installs the service files as well.



--

Thanks,
  /David/Bigagli

www.schedmd.com


[slurm-dev] Re: successful systemd service start on RHEL7?

2015-03-24 Thread Anatoliy Kovalenko


Hi there. I use Slurm 14.03.8, compiled from source, on Fedora 20.
At first I had trouble stopping and starting the Slurm daemon through
systemctl, but after I specified the location of the PID file in
slurmctld.service, systemctl worked fine. Please show the contents of your
slurmctld.service file and the output of 'cat slurm.conf | grep pid'.
What happens if you try to execute 'slurmctld -D -vvv'?
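
One plausible cause of a start timeout with Type=forking, in line with the PID file advice above, is that PIDFile= in the unit names a different path from SlurmctldPidFile in slurm.conf, so systemd waits for a file the daemon never writes there. That is an assumption, not a confirmed diagnosis for this report. The sketch below compares the two values; get_setting and pidfiles_match are hypothetical helper names, and it assumes each setting sits on its own Key=value line with no trailing comment.

```shell
#!/bin/sh
# Hypothetical helper: print the value of the first "Key=value" line for the
# given key (matched case-insensitively); commented-out lines are skipped
# because the key must start the line.
get_setting() {
    key=$1
    file=$2
    grep -i "^[[:space:]]*${key}[[:space:]]*=" "$file" \
        | head -n 1 | cut -d= -f2- | tr -d '[:space:]'
}

# Succeed only if the unit's PIDFile is set and equals SlurmctldPidFile.
pidfiles_match() {
    unit_file=$1
    slurm_conf=$2
    unit_pid=$(get_setting PIDFile "$unit_file")
    conf_pid=$(get_setting SlurmctldPidFile "$slurm_conf")
    [ -n "$unit_pid" ] && [ "$unit_pid" = "$conf_pid" ]
}

# Example, using the paths that appear in this thread:
#   pidfiles_match /usr/lib/systemd/system/slurmctld.service \
#       /tool/slurm/14.11.5/etc/slurm.conf && echo match || echo mismatch
```

If the paths differ, change one of them so they agree, then run 'systemctl daemon-reload' before starting the service again.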
