[slurm-dev] Re: slurm on NFS for a cluster - Part II
All of them should be owned by munge. Further more for security's sake I would make them all only accessible to munge, at least the etc one. -Paul Edmon- On 03/25/2015 10:29 AM, Jeff Layton wrote: I assume the same is true for /var/log/munge and /var/run/munge? How about /etc/munge? Thanks! Jeff Yea, that folder and the files inside needs to be owned by munge. -Paul Edmon- On 03/25/2015 09:54 AM, Jeff Layton wrote: Good morning, Thanks for all of the advice in regard to slurm on NFS. I've started on my slurm quest by installing munge but I'm having some trouble. I'm not sure this is the right place to ask about munge but here goes. I'm building and install munge with the following options: ./configure --exec-prefix=/share/ec2-user --prefix= \ --sysconfdir=/etc/ --sharedstatedir=/var/ This allows me to store the local state information in /var and the configuration information in /etc, but everything else is stored in /share/ec2-user/ (NFS shared directory). The build goes fine but when I try to start the munge service I get the following: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo service munge start Starting MUNGE: munged (failed). munged: Error: Failed to check logfile /var/log/munge/munged.log: Permission denied The permissions on /var/log/munge are: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo ls -lstar /var/log/munge total 8 4 drwxr-xr-x 6 root root 4096 Mar 25 13:39 .. 4 drwx-- 2 root root 4096 Mar 25 13:39 I'm not sure but should this directory be owned by user munge? TIA! Jeff
[slurm-dev] Re: slurm on NFS for a cluster - Part II
Yea, that folder and the files inside needs to be owned by munge. -Paul Edmon- On 03/25/2015 09:54 AM, Jeff Layton wrote: Good morning, Thanks for all of the advice in regard to slurm on NFS. I've started on my slurm quest by installing munge but I'm having some trouble. I'm not sure this is the right place to ask about munge but here goes. I'm building and install munge with the following options: ./configure --exec-prefix=/share/ec2-user --prefix= \ --sysconfdir=/etc/ --sharedstatedir=/var/ This allows me to store the local state information in /var and the configuration information in /etc, but everything else is stored in /share/ec2-user/ (NFS shared directory). The build goes fine but when I try to start the munge service I get the following: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo service munge start Starting MUNGE: munged (failed). munged: Error: Failed to check logfile /var/log/munge/munged.log: Permission denied The permissions on /var/log/munge are: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo ls -lstar /var/log/munge total 8 4 drwxr-xr-x 6 root root 4096 Mar 25 13:39 .. 4 drwx-- 2 root root 4096 Mar 25 13:39 I'm not sure but should this directory be owned by user munge? TIA! Jeff
[slurm-dev] Re: slurm on NFS for a cluster - Part II
I assume the same is true for /var/log/munge and /var/run/munge? How about /etc/munge? Thanks! Jeff Yea, that folder and the files inside needs to be owned by munge. -Paul Edmon- On 03/25/2015 09:54 AM, Jeff Layton wrote: Good morning, Thanks for all of the advice in regard to slurm on NFS. I've started on my slurm quest by installing munge but I'm having some trouble. I'm not sure this is the right place to ask about munge but here goes. I'm building and install munge with the following options: ./configure --exec-prefix=/share/ec2-user --prefix= \ --sysconfdir=/etc/ --sharedstatedir=/var/ This allows me to store the local state information in /var and the configuration information in /etc, but everything else is stored in /share/ec2-user/ (NFS shared directory). The build goes fine but when I try to start the munge service I get the following: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo service munge start Starting MUNGE: munged (failed). munged: Error: Failed to check logfile /var/log/munge/munged.log: Permission denied The permissions on /var/log/munge are: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo ls -lstar /var/log/munge total 8 4 drwxr-xr-x 6 root root 4096 Mar 25 13:39 .. 4 drwx-- 2 root root 4096 Mar 25 13:39 I'm not sure but should this directory be owned by user munge? TIA! Jeff
[slurm-dev] Re: slurm on NFS for a cluster - Part II
There is also good documentation that is written on installing munge which explains what permissions and ownerships should be on what directories. You can take a look at https://code.google.com/p/munge/wiki/InstallationGuide if you have not already. Thanks Jackie On Wed, Mar 25, 2015 at 7:28 AM, Jeff Layton layto...@att.net wrote: I assume the same is true for /var/log/munge and /var/run/munge? How about /etc/munge? Thanks! Jeff Yea, that folder and the files inside needs to be owned by munge. -Paul Edmon- On 03/25/2015 09:54 AM, Jeff Layton wrote: Good morning, Thanks for all of the advice in regard to slurm on NFS. I've started on my slurm quest by installing munge but I'm having some trouble. I'm not sure this is the right place to ask about munge but here goes. I'm building and install munge with the following options: ./configure --exec-prefix=/share/ec2-user --prefix= \ --sysconfdir=*/etc/* --sharedstatedir=*/var/* This allows me to store the local state information in /var and the configuration information in /etc, but everything else is stored in */share/ec2-user/* (NFS shared directory). The build goes fine but when I try to start the munge service I get the following: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo service munge start Starting MUNGE: munged (failed). munged: Error: Failed to check logfile /var/log/munge/munged.log: Permission denied The permissions on /var/log/munge are: [ec2-user@ip-10-0-1-72 munge-0.5.11]$ sudo ls -lstar /var/log/munge total 8 4 drwxr-xr-x 6 root root 4096 Mar 25 13:39 .. 4 drwx-- 2 root root 4096 Mar 25 13:39 I'm not sure but should this directory be owned by user munge? TIA! Jeff
[slurm-dev] Re: slurm on NFS for a cluster - Part II
On 26/03/15 00:55, Jeff Layton wrote: Thanks for all of the advice in regard to slurm on NFS. I've started on my slurm quest by installing munge but I'm having some trouble. I'm not sure this is the right place to ask about munge but here goes. Munge is something we just install as an RPM (from EPEL) and our xCAT takes care of ensuring the keys are correct. All the best, Chris -- Christopher SamuelSenior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci
[slurm-dev] Re: slurm on NFS for a cluster - Part II
We use it extensively as well. It has some very useful features and scales well. It¹s also nice that it open source as well. Although it is primarily developed by IBM and Lenovo now, it does work on other platforms as Chris mentioned. We¹ve used it on all our systems in the recent years. Makes our life much easier! -- Jared On 3/25/15, 9:35 PM, Christopher Samuel sam...@unimelb.edu.au wrote: On 26/03/15 14:11, Fred Liu wrote: Just curious, What is xCAT? It's a cluster management suite, we've used it on SGI as well as IBM gear: http://xcat.sourceforge.net/ All the best, Chris -- Christopher SamuelSenior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci
[slurm-dev] Re: slurm on NFS for a cluster - Part II
Munge is something we just install as an RPM (from EPEL) and our xCAT takes care of ensuring the keys are correct. Just curious, What is xCAT? Fred
[slurm-dev] Re: slurm on NFS for a cluster - Part II
Cluster admin tools - http://sourceforge.net/p/xcat/wiki/Main_Page/ I wrote a Puppet module specifically to manage Munge - https://github.com/treydock/puppet-munge - Trey = Trey Dockendorf Systems Analyst I Texas AM University Academy for Advanced Telecommunications and Learning Technologies Phone: (979)458-2396 Email: treyd...@tamu.edu Jabber: treyd...@tamu.edu On Wed, Mar 25, 2015 at 10:12 PM, Fred Liu fred_...@issi.com wrote: Munge is something we just install as an RPM (from EPEL) and our xCAT takes care of ensuring the keys are correct. Just curious, What is xCAT? Fred
[slurm-dev] Re: slurm on NFS for a cluster - Part II
On 26/03/15 14:11, Fred Liu wrote: Just curious, What is xCAT? It's a cluster management suite, we've used it on SGI as well as IBM gear: http://xcat.sourceforge.net/ All the best, Chris -- Christopher SamuelSenior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci