[slurm-dev] Re: successful systemd service start on RHEL7?
The slurm.spec file decides if to install the init.d scripts or the systemd stuff. On 03/24/2015 07:24 PM, Fred Liu wrote: -Original Message- From: David Bigagli [mailto:da...@schedmd.com] Sent: 星期三, 三月 25, 2015 1:19 To: slurm-dev Subject: [slurm-dev] Re: successful systemd service start on RHEL7? It should build just fine on CentOS7 we tried it several times. In the etc directory under the source code there are examples of service files. The rpm installs the service files as well. Which rpm has the related service files? I just saw init file like following: rpm -ql slurm-14.11.5-1.el7.x86_64 /etc/init.d/slurm /etc/ld.so.conf.d /etc/ld.so.conf.d/slurm.conf /etc/slurm /etc/slurm/cgroup.conf.example /etc/slurm/cgroup.release_common.example /etc/slurm/cgroup/release_cpuset /etc/slurm/cgroup/release_freezer /etc/slurm/cgroup/release_memory /etc/slurm/cgroup_allowed_devices_file.conf.example /etc/slurm/slurm.conf.example /etc/slurm/slurm.epilog.clean /usr/bin/sacct /usr/bin/sacctmgr /usr/bin/salloc /usr/bin/sattach ... ... ... And yes, I copied and modified the sample service file from etc folder under source tree like following: [Unit] Description=Slurm controller daemon After=network.target ConditionPathExists=/tool/slurm/14.11.5/etc/slurm.conf [Service] Type=forking #EnvironmentFile=/etc/default/slurmctld ExecStart=/tool/slurm/14.11.5/sbin/slurmctld PIDFile=/run/slurm/slurmctld.pid [Install] WantedBy=multi-user.target Thanks. Fred -- Thanks, /David/Bigagli www.schedmd.com
[slurm-dev] Re: successful systemd service start on RHEL7?
I have tried building slurm 14.11.4 on CentOS7 but it never quite worked right. I'm not sure if it has been vetted for RHEL7 yet. I didn't dig too deeply though when I did build it as I just figured it wasn't ready for RHEL7. -Paul Edmon- On 03/24/2015 10:32 AM, Fred Liu wrote: Hi, Anyone successfully started systemd service on RHEL7? I failed like following: [root@cnlnx03 system]# systemctl start slurmctld Job for slurmctld.service failed. See 'systemctl status slurmctld.service' and 'journalctl -xn' for details. [root@cnlnx03 system]# systemctl status slurmctld.service slurmctld.service - Slurm controller daemon Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled) Active: failed (Result: timeout) since Tue 2015-03-24 22:22:46 CST; 4min 32s ago Mar 24 22:21:05 cnlnx03 slurmctld[20561]: init_requeue_policy: kill_invalid_depend is set to 0 Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Recovered state of 0 reservations Mar 24 22:21:05 cnlnx03 slurmctld[20561]: read_slurm_conf: backup_controller not specified. Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Running as primary controller Mar 24 22:22:05 cnlnx03 slurmctld[20561]: SchedulerParameters=default_queue_depth=100,max_rpc_cnt=0,max_sched_tim...pth=0 Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. Mar 24 22:22:46 cnlnx03 systemd[1]: Unit slurmctld.service entered failed state. Hint: Some lines were ellipsized, use -l to show in full. [root@cnlnx03 system]# journalctl -xn -- Logs begin at Wed 2015-03-11 17:23:37 CST, end at Tue 2015-03-24 22:25:02 CST. -- Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 slurmctld[20561]: layouts: all layouts are now unloaded. Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. -- Subject: Unit slurmctld.service has failed -- Defined-By: systemd -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
[slurm-dev] Re: successful systemd service start on RHEL7?
-Original Message- From: David Bigagli [mailto:da...@schedmd.com] Sent: 星期三, 三月 25, 2015 1:19 To: slurm-dev Subject: [slurm-dev] Re: successful systemd service start on RHEL7? It should build just fine on CentOS7 we tried it several times. In the etc directory under the source code there are examples of service files. The rpm installs the service files as well. Which rpm has the related service files? I just saw init file like following: rpm -ql slurm-14.11.5-1.el7.x86_64 /etc/init.d/slurm /etc/ld.so.conf.d /etc/ld.so.conf.d/slurm.conf /etc/slurm /etc/slurm/cgroup.conf.example /etc/slurm/cgroup.release_common.example /etc/slurm/cgroup/release_cpuset /etc/slurm/cgroup/release_freezer /etc/slurm/cgroup/release_memory /etc/slurm/cgroup_allowed_devices_file.conf.example /etc/slurm/slurm.conf.example /etc/slurm/slurm.epilog.clean /usr/bin/sacct /usr/bin/sacctmgr /usr/bin/salloc /usr/bin/sattach ... ... ... And yes, I copied and modified the sample service file from etc folder under source tree like following: [Unit] Description=Slurm controller daemon After=network.target ConditionPathExists=/tool/slurm/14.11.5/etc/slurm.conf [Service] Type=forking #EnvironmentFile=/etc/default/slurmctld ExecStart=/tool/slurm/14.11.5/sbin/slurmctld PIDFile=/run/slurm/slurmctld.pid [Install] WantedBy=multi-user.target Thanks. Fred
[slurm-dev] Re: successful systemd service start on RHEL7?
It should build just fine on CentOS7 we tried it several times. In the etc directory under the source code there are examples of service files. The rpm installs the service files as well. On 03/24/2015 07:31 AM, Fred Liu wrote: Hi, Anyone successfully started systemd service on RHEL7? I failed like following: [root@cnlnx03 system]# systemctl start slurmctld Job for slurmctld.service failed. See 'systemctl status slurmctld.service' and 'journalctl -xn' for details. [root@cnlnx03 system]# systemctl status slurmctld.service slurmctld.service - Slurm controller daemon Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled) Active: failed (Result: timeout) since Tue 2015-03-24 22:22:46 CST; 4min 32s ago Mar 24 22:21:05 cnlnx03 slurmctld[20561]: init_requeue_policy: kill_invalid_depend is set to 0 Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Recovered state of 0 reservations Mar 24 22:21:05 cnlnx03 slurmctld[20561]: read_slurm_conf: backup_controller not specified. Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Running as primary controller Mar 24 22:22:05 cnlnx03 slurmctld[20561]: SchedulerParameters=default_queue_depth=100,max_rpc_cnt=0,max_sched_tim...pth=0 Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. Mar 24 22:22:46 cnlnx03 systemd[1]: Unit slurmctld.service entered failed state. Hint: Some lines were ellipsized, use -l to show in full. [root@cnlnx03 system]# journalctl -xn -- Logs begin at Wed 2015-03-11 17:23:37 CST, end at Tue 2015-03-24 22:25:02 CST. -- Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 slurmctld[20561]: layouts: all layouts are now unloaded. Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. -- Subject: Unit slurmctld.service has failed -- Defined-By: systemd -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel -- Thanks, /David/Bigagli www.schedmd.com
[slurm-dev] Re: successful systemd service start on RHEL7?
Hi there. I use slurm 14.03.8, compiled from its source, on fedora 20. In the beginning I had issue with stop/start the slurm's daemon through systemctl, but after specified the location of pid.file in slurmctld.servise, systemctl works fine. Show contents of your slurmctld.servise file and 'cat slurm.conf | grep pid'. What happens, if you try execute 'slurmstld -D -vvv'? 2015-03-24 16:40 GMT+02:00 Paul Edmon ped...@cfa.harvard.edu: I have tried building slurm 14.11.4 on CentOS7 but it never quite worked right. I'm not sure if it has been vetted for RHEL7 yet. I didn't dig too deeply though when I did build it as I just figured it wasn't ready for RHEL7. -Paul Edmon- On 03/24/2015 10:32 AM, Fred Liu wrote: Hi, Anyone successfully started systemd service on RHEL7? I failed like following: [root@cnlnx03 system]# systemctl start slurmctld Job for slurmctld.service failed. See 'systemctl status slurmctld.service' and 'journalctl -xn' for details. [root@cnlnx03 system]# systemctl status slurmctld.service slurmctld.service - Slurm controller daemon Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled) Active: failed (Result: timeout) since Tue 2015-03-24 22:22:46 CST; 4min 32s ago Mar 24 22:21:05 cnlnx03 slurmctld[20561]: init_requeue_policy: kill_invalid_depend is set to 0 Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Recovered state of 0 reservations Mar 24 22:21:05 cnlnx03 slurmctld[20561]: read_slurm_conf: backup_controller not specified. Mar 24 22:21:05 cnlnx03 slurmctld[20561]: Running as primary controller Mar 24 22:22:05 cnlnx03 slurmctld[20561]: SchedulerParameters=default_ queue_depth=100,max_rpc_cnt=0,max_sched_tim...pth=0 Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. Mar 24 22:22:46 cnlnx03 systemd[1]: Unit slurmctld.service entered failed state. Hint: Some lines were ellipsized, use -l to show in full. [root@cnlnx03 system]# journalctl -xn -- Logs begin at Wed 2015-03-11 17:23:37 CST, end at Tue 2015-03-24 22:25:02 CST. -- Mar 24 22:22:45 cnlnx03 systemd[1]: slurmctld.service operation timed out. Terminating. Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Terminate signal (SIGINT or SIGTERM) received Mar 24 22:22:45 cnlnx03 slurmctld[20561]: Saving all slurm state Mar 24 22:22:46 cnlnx03 slurmctld[20561]: layouts: all layouts are now unloaded. Mar 24 22:22:46 cnlnx03 systemd[1]: Failed to start Slurm controller daemon. -- Subject: Unit slurmctld.service has failed -- Defined-By: systemd -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel