Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
On Tue, Nov 15, 2016 at 04:43:08PM +0530, Himanshu Joshi wrote:
> > As root:
> > source /opt/sge/default/common/settings.sh
> > qconf -ae
>
> Thanks. Please find the outputs and advise.
> [root@mbialjpj ~]# source /opt/sge/default/common/settings.sh
> SGE_ROOT=/opt/sge: Command not found.
> export: Command not found.
> SGE_ROOT: Undefined variable.
> [root@mbialjpj ~]# qconf -ae
> qconf: Command not found.

That is a little odd. The settings.sh file should work with most Bourne-like shells. Try
source /opt/sge/default/common/settings.csh
instead, and run
echo $SHELL
as root to see what shell you are running under.

> [root@mbialjpj ~]# $SGE_ROOT
> SGE_ROOT: Undefined variable.
> [root@mbialjpj ~]# which $SGE_ROOT
> SGE_ROOT: Undefined variable.

$SGE_ROOT isn't a command, just a variable that tells the various SGE commands where to find Grid Engine.

William

signature.asc
Description: Digital signature
___
SGE-discuss mailing list
SGE-discuss@liv.ac.uk
https://arc.liv.ac.uk/mailman/listinfo/sge-discuss
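The "Command not found." and "Undefined variable." messages above look like what a csh-family shell prints when it is handed Bourne syntax, although the thread does not confirm which shell root uses. A minimal sketch of choosing the settings file from the shell name follows. The helper name settings_file_for_shell and the SGE_SETTINGS_DIR override are hypothetical; the default directory is the one quoted in the thread.

```shell
#!/bin/sh
# Print the Grid Engine settings file that matches a given login shell.
# SGE_SETTINGS_DIR is a hypothetical override used only in this sketch;
# the default is the path mentioned in the thread.
settings_file_for_shell() {
    shell_name=$(basename "$1")
    dir=${SGE_SETTINGS_DIR:-/opt/sge/default/common}
    case "$shell_name" in
        csh|tcsh) echo "$dir/settings.csh" ;;   # csh-family shells
        *)        echo "$dir/settings.sh" ;;    # Bourne-like shells
    esac
}

# Show which file the current shell should source.
settings_file_for_shell "${SHELL:-/bin/sh}"
```

The printed file still has to be sourced by hand in the interactive shell, since a script cannot change the environment of the shell that called it.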
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
On Tue, Nov 15, 2016 at 4:17 PM, William Hay wrote:
> On Tue, Nov 15, 2016 at 10:44:01AM +0530, Himanshu Joshi wrote:
> > On Mon, Nov 14, 2016 at 8:41 PM, William Hay wrote:
> > > On Mon, Nov 14, 2016 at 06:03:43PM +0530, Himanshu Joshi wrote:
> > > > Thanks William
> > > > On Fri, Nov 11, 2016 at 10:31 PM, William Hay wrote:
> > > > > On Thu, Nov 10, 2016 at 02:26:35PM +0530, Himanshu Joshi wrote:
> > > > > > > I suspect you probably want to use inst_sge to configure the node as an execd as well.
> > > > > >
> > > > > > Is there any documentation available for doing that because I do not have any idea how to do it
> > > > > http://arc.liv.ac.uk/SGE/howto/commontasks.html
> > > > > If you are just making the initial qmaster into an execution host as well then changing to $SGE_ROOT and running ./install_execd should do it.
> > > >
> > > > It worked, Now Execution daemon installed successfully. But I am not sure whether the nodes are configured or not...
> > > > >
> > > > > Make sure you have SGE_ROOT set correctly first (see below)
> > > > >
> > > > > > And I tried some computations with the current setup but some of the errors were
> > > > > >
> > > > > > Error: which: no qconf in (/usr/local.. )
> > > > > > Warning SGE_ROOT environment variable is set but Grid Engine software is not found, will run locally
> > > > > If you installed Dave's packages then they install into /opt/sge by default so set the SGE_ROOT environment variable to point to that.
> > > > >
> > > > > Sourcing /opt/sge/default/common/settings.sh should set up the environment.
> > > > changes done in .bashrc file as suggested
> > > > >
> > > > > > And there is no folder gridengine in usr/share/doc
> > > > > > Thus it indicates the software is not at all installed
> > > > > Dave's packages are designed to be installed under /opt and don't stick things into /usr/share/doc.
> > > > >
> > > > > William
> > > >
> > > > Please find below the outputs of few of the configuration commands (without using sudo) in my terminal
> > > >
> > > > "qconf -sh" shows
> > > > mbialjpj
> > > >
> > > > "qconf -sel" shows
> > > > no execution host defined
> > > >
> > > > "qconf -ae" shows
> > > > denied: "JPJ" must be manager for this operation"
> > > Running qconf -ae as root so you can add a host should do it.
> >
> > If I understand this one liner correctly, you mean to say the qconf -ae newhost can add "newhost". But as a root using this command says qconf: Command not found.
> As root:
> source /opt/sge/default/common/settings.sh
> qconf -ae

Thanks. Please find the outputs and advise.

[root@mbialjpj ~]# source /opt/sge/default/common/settings.sh
SGE_ROOT=/opt/sge: Command not found.
export: Command not found.
SGE_ROOT: Undefined variable.
[root@mbialjpj ~]# qconf -ae
qconf: Command not found.
[root@mbialjpj ~]# $SGE_ROOT
SGE_ROOT: Undefined variable.
[root@mbialjpj ~]# which $SGE_ROOT
SGE_ROOT: Undefined variable.

and without sudo -i the outputs are like this

[JPJ@mbialjpj ~]$ $SGE_ROOT
bash: /opt/sge: Is a directory
[JPJ@mbialjpj ~]$ which $SGE_ROOT
/usr/bin/which: no sge in (/opt
[JPJ@mbialjpj ~]$ qconf -ae
denied: "JPJ" must be manager for this operation
[JPJ@mbialjpj ~]$ qconf -ae newhost
denied: "JPJ" must be manager for this operation

Regards

> > > William
> >
> > Kindly suggest the needful
> > --
> > Himanshu Joshi
> > M.Tech. Cognitive & Neuroscience.
> > Ph.D Scholar,
> > Department of Psychiatry
> > NIMHANS, Bangalore
> > Publications
> > Multimodal Brain Image Analysis Laboratory

Kindly advise the needful
--
Himanshu Joshi
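Typing $SGE_ROOT or "which $SGE_ROOT" at the prompt tries to execute or look up the directory as a command, so it does not say much about whether the environment is usable. A small sketch of a more direct check is given below. The function name check_sge_env is hypothetical, and it only tests that SGE_ROOT is set, that it is a directory, and that qconf is on PATH.

```shell
#!/bin/sh
# Report whether the Grid Engine environment looks usable in this shell.
# check_sge_env is a hypothetical helper name used for this sketch.
check_sge_env() {
    if [ -z "${SGE_ROOT:-}" ]; then
        echo "SGE_ROOT is not set"
        return 1
    fi
    if [ ! -d "$SGE_ROOT" ]; then
        echo "SGE_ROOT=$SGE_ROOT is not a directory"
        return 1
    fi
    if ! command -v qconf >/dev/null 2>&1; then
        echo "qconf is not on PATH"
        return 1
    fi
    echo "SGE_ROOT=$SGE_ROOT, qconf found at $(command -v qconf)"
}

# Run the check; "|| true" keeps a failed check from aborting a calling script.
check_sge_env || true
```

Running this as both the ordinary user and root should show whether root is simply missing the environment that the settings file sets up.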
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
Dear all,

Can you please edit the my_configuration.conf file and help in setting up the nodes as an execd, so that I can use the following command for the entire installation:

./inst_sge -m -auto /opt/sge/util/install_modules/my_configuration.conf

Please find attached my_configuration.conf file and do the needful

Regards
Himanshu

On Thu, Nov 10, 2016 at 2:26 PM, Himanshu Joshi wrote:
> On Wed, Nov 9, 2016 at 6:56 PM, William Hay wrote:
> > On Wed, Nov 09, 2016 at 04:59:18PM +0530, Himanshu Joshi wrote:
> > > On Wed, Nov 9, 2016 at 2:18 PM, William Hay wrote:
> > > > On Wed, Nov 09, 2016 at 11:25:42AM +0530, Himanshu Joshi wrote:
> > > > > On Tue, Nov 8, 2016 at 9:38 PM, William Hay wrote:
> > > > > > On Tue, Nov 08, 2016 at 11:30:35AM +0530, Himanshu Joshi wrote:
> > > > > > > > I'd try running the command
> > > > > > > >
> > > > > > > > /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55||echo $?
> > > > > > > >
> > > > > > > > To see if it produces any output.
> > > > > > >
> > > > > > > Yes the output for this command is
> > > > > > > 1
> > > > > > Annoyingly silent error.
> > > > >
> > > > > Ya true..
> > > > >
> > > > > > What does
> > > > > > ls -l /etc/rc.d/rc3.d/*sge*
> > > > > > output if anything?
> > > > >
> > > > > It says " no match"
> > > > > i.e. /etc/rc.d/rc3.d folder has no file with *sge*
> > > > >
> > > > > > > command "ps ax |grep sge" says
> > > > > > >
> > > > > > > 17870 pts/4    S+     0:00 grep --color=auto sge
> > > > > > > 26341 ?        S  10557:34 /bin/sh ./inst_sge -m -x
> > > > > > You have a copy of inst_sge running eating that amount of cpu time? Was that intentionally still running?
> > > > >
> > > > > I was not running it intentionally, and system monitor also does not show any process with name "inst_sge". I had tried closing all the terminals and restarted the system
> > > > >
> > > > > now the output is
> > > > > 8160 pts/0    S+     0:00 grep --color=auto sge
> > > > IIRC the installation of the init script is the last thing inst_sge does so if this is the only thing blocking the install then you just need to set the file up by hand
> > > >
> > > > Try the install_initd command by hand again now that there isn't a running inst_sge
> > >
> > > The ./install_initd says
> > If you leave out the ./ it will search the path.
> >
> > > Command not found
> > > I think this file (install_initd) is not available in /opt/sge that is why command not found
> > >
> > > > If that doesn't work try:
> > > >
> > > > chkconfig --add sgemaster.mbialjpj55
> > > > chkconfig sgemaster.mbialjpj55 on
> > > > service sgemaster.mbialjpj55 start
> > > >
> > > > Try running
> > > > /etc/init.d/sgemaster.mbialjpj55 start
> > > > by hand does it produce output?
> > >
> > > It worked and then the output of "ps ax | grep sge" is
> > > 29305 ?        Sl     0:00 /opt/sge/bin/lx-amd64/sge_qmaster
> > > 29974 pts/0    S+     0:00 grep --color=auto sge
> > >
> > > Now the below 3 commands are immaterial
> > > chkconfig --add sgemaster.mbialjpj55
> > > chkconfig sgemaster.mbialjpj55 on
> > > service sgemaster.mbialjpj55 start
> > > as these commands say
> > Well the first two make sure it will start on reboot.
> >
> > > "sge_qmaster with PID 29305 is already running"
> > >
> > > > cat /etc/init.d/sgemaster.mbialjpj55
> > >
> > > This command displays the contents of sgemaster.mbialjpj55 executable file in terminal
> > >
> > > > William
> > >
> > > Thanks William...
> > >
> > > But now also, I am not sure about installation, if it is done or not
> >
> > If you installed Dave's RPMS then it is installed. inst_sge despite the name really just does an initial config.
>
> !! Cheers Hope inst_sge really just does an initial config...
>
> > > Kindly suggest the needful
> > I suspect you probably want to use inst_sge to configure the node as an execd as well.
>
> Is there any documentation available for doing that because I do not have any idea how to do it
>
> And I tried some computations with the current setup but some of the errors were
>
> Error: which: no qconf in (/usr/local.. )
> Warning SGE_ROOT environment variable is set but Grid Engine software is not found, will run locally
>
> And there is no folder gridengine in usr/share/doc
> Thus it indicates the software is not at all installed
>
> > > --
> > > Himanshu Joshi
>
> --
> Himanshu Joshi
> -- Ph.D Scholar, Department of
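For reference, the manual init-script steps quoted above can be collected in one place. This is only a sketch: print_initd_steps is a hypothetical helper that prints the three commands from the thread for a given script name instead of running them, so nothing on the system is changed.

```shell
#!/bin/sh
# Print (do not run) the manual steps for registering the qmaster init script.
# print_initd_steps is a hypothetical helper; the commands and the script
# name are the ones quoted in the thread.
print_initd_steps() {
    script_name=$1
    echo "chkconfig --add $script_name"   # register the script with chkconfig
    echo "chkconfig $script_name on"      # enable it at boot
    echo "service $script_name start"     # start it now
}

print_initd_steps sgemaster.mbialjpj55
```

As noted in the thread, the first two commands matter even when sge_qmaster is already running, because they are what make it start again after a reboot.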
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
> I'd try running the command
>
> /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55||echo $?
>
> To see if it produces any output.

Yes the output for this command is
1

> > Command failed: /usr/lib/lsb/install_initd
> > /etc/init.d/sgemaster.mbialjpj55
> >
> > Probably a permission problem. Please check file access permissions.
> > Check root read/write permission. Check if SGE daemons are running.
> >
> > I have found the file "sgeqmaster.mbialjpj55" in the location described as /etc/init.d
> > and ls -l command gives the file permissions as
> >
> > -rwxr-xr-x. 1 root root 24883 Nov 7 17:27 sgemaster.mbialjpj55
> >
> > How to check if SGE Daemons is running because command "service --status-all" reveals
> ps ax |grep sge
>
> should reveal any sge daemons

command "ps ax |grep sge" says

17870 pts/4    S+     0:00 grep --color=auto sge
26341 ?        S  10557:34 /bin/sh ./inst_sge -m -x

> William

--
Himanshu Joshi
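A plain "ps ax | grep sge" also matches the grep process itself and anything else with "sge" in its command line, such as the stray inst_sge above. A slightly narrower filter is sketched here. The function name filter_sge_daemons is hypothetical, and it only looks for the sge_qmaster and sge_execd process names.

```shell
#!/bin/sh
# Keep only Grid Engine daemon lines from "ps ax" style output on stdin.
# filter_sge_daemons is a hypothetical helper name for this sketch.
# The pattern matches sge_qmaster and sge_execd, but not the literal
# pattern text on grep's own command line.
filter_sge_daemons() {
    grep -E 'sge_(qmaster|execd)'
}

# grep exits non-zero when nothing matches, so report that case explicitly.
ps ax | filter_sge_daemons || echo "no Grid Engine daemons found"
```

With the two outputs quoted in this thread, only the /opt/sge/bin/lx-amd64/sge_qmaster line would pass the filter.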
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
On Mon, Nov 07, 2016 at 05:41:54PM +0530, Himanshu Joshi wrote:
> On Mon, Nov 7, 2016 at 3:09 PM, William Hay wrote:
> > On Sat, Nov 05, 2016 at 10:55:38AM +0530, Himanshu Joshi wrote:
> > > Redhat enterprise Linux 7.2 with X86-64 architecture
> > > Please find the requested information with other relevant info
> > > hostnamectl status
> > >    Static hostname: mbialjpj
> > >    Pretty hostname: MBIALJPJ
> > >          Icon name: computer-desktop
> > >            Chassis: desktop
> > >         Machine ID: 431da268159243088e0e02874e8d36bf
> > >            Boot ID: 24057a4a63554a72b9c7b4b7d9e72b74
> > >   Operating System: Red Hat Enterprise Linux
> > >        CPE OS Name: cpe:/o:redhat:enterprise_linux:7.2:GA:workstation
> > >             Kernel: Linux 3.10.0-327.el7.x86_64
> > >       Architecture: x86-64
> > >
> > > I was able to initiate the installation but now stuck up in the same error reported on October 20
> > >
> > > qmaster startup script
> > > --
> > >
> > > We can install the startup script that will
> > > start qmaster at machine boot (y/n) [y] >>
> > >
> > > cp /opt/sge/default/common/sgemaster /etc/init.d/sgemaster.mbialjpj55
> > > /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55
> > >
> > > Command failed: /usr/lib/lsb/install_initd
> > > /etc/init.d/sgemaster.mbialjpj55
> > Does /usr/lib/lsb/install_initd exist?
>
> Yes it is a folder owned by root
>
> > On my RHEL7 box this is a relative symlink pointing to /sbin/chkconfig.
>
> Yes exactly because the command " ls -la /usr/lib/lsb | grep "\->" " provides the output as
>
> lrwxrwxrwx. 1 root root 23 Jun 1 2015 install_initd -> ../../../sbin/chkconfig
> lrwxrwxrwx. 1 root root 23 Jun 1 2015 remove_initd -> ../../../sbin/chkconfig
>
> > Does it exist on your machine and to what does it point?
>
> Yes it exists with file permissions and it points to /sbin/chkconfig
>
> > What are the permissions on the file to which it points?
>
> The following command "ls -l /sbin/chkconfig" says
> -rwxr-xr-x. 1 root root 41136 Apr 29 2016 /sbin/chkconfig
>
> > > Probably a permission problem. Please check file access permissions.
> > > Check root read/write permission. Check if SGE daemons are running.
>
> How to check whether SGE daemons is running?
>
> > > Looking forward to receive binary packages from Dave because I do not know how to look for the one which my distribution provides
> >
> > Dave's packages for RHEL7 are available by downloading the file at:
> > https://copr.fedorainfracloud.org/coprs/loveshack/SGE/repo/epel-7/loveshack-SGE-epel-7.repo
> > and placing it in /etc/yum.repos.d
>
> I had made a document file named "loveshack-SGE.repo" and pasted it in /etc/yum.repos.d
>
> > Then
> > yum install gridengine gridengine-qmaster gridengine-qmon gridengine-execd
>
> Then I went into /opt/sge and followed the above command
>
> This resolved many dependencies and enabled sufficient repositories
>
> > These install into /opt/sge so if you do switch to using these (which will simplify future
> > upgrades) then remove any grid engine install you have there first.
>
> Again the command "./inst_sge -m -x" reached upto the process of
> We can install the startup script that will
> start qmaster at machine boot (y/n) [y] >>
>
> but landed up in the same error i.e.
>
> cp /opt/sge/default/common/sgemaster /etc/init.d/sgemaster.mbialjpj55
> /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55

I'd try running the command

/usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55||echo $?

To see if it produces any output.

> Command failed: /usr/lib/lsb/install_initd
> /etc/init.d/sgemaster.mbialjpj55
>
> Probably a permission problem. Please check file access permissions.
> Check root read/write permission. Check if SGE daemons are running.
>
> I have found the file "sgeqmaster.mbialjpj55" in the location described as /etc/init.d
> and ls -l command gives the file permissions as
>
> -rwxr-xr-x. 1 root root 24883 Nov 7 17:27 sgemaster.mbialjpj55
>
> How to check if SGE Daemons is running because command "service --status-all" reveals

ps ax |grep sge

should reveal any sge daemons

William
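The "command || echo $?" idiom suggested above prints the exit status only when the command fails, which helps when a command fails without any message. A minimal sketch of a wrapper that always reports the status is shown below. The name run_and_report is hypothetical and not part of Grid Engine or chkconfig.

```shell
#!/bin/sh
# Run a command and always print its exit status afterwards.
# run_and_report is a hypothetical helper name for this sketch.
run_and_report() {
    status=0
    "$@" || status=$?          # capture the status without aborting under "set -e"
    echo "exit status: $status"
    return "$status"
}

run_and_report true            # a command that succeeds
run_and_report false || true   # a command that fails silently with status 1
```

In the situation above it could be called as run_and_report /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55, run as root.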
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
On Mon, Nov 7, 2016 at 3:09 PM, William Hay wrote:
> On Sat, Nov 05, 2016 at 10:55:38AM +0530, Himanshu Joshi wrote:
> > Redhat enterprise Linux 7.2 with X86-64 architecture
> > Please find the requested information with other relevant info
> > hostnamectl status
> >    Static hostname: mbialjpj
> >    Pretty hostname: MBIALJPJ
> >          Icon name: computer-desktop
> >            Chassis: desktop
> >         Machine ID: 431da268159243088e0e02874e8d36bf
> >            Boot ID: 24057a4a63554a72b9c7b4b7d9e72b74
> >   Operating System: Red Hat Enterprise Linux
> >        CPE OS Name: cpe:/o:redhat:enterprise_linux:7.2:GA:workstation
> >             Kernel: Linux 3.10.0-327.el7.x86_64
> >       Architecture: x86-64
> >
> > I was able to initiate the installation but now stuck up in the same error reported on October 20
> >
> > qmaster startup script
> > --
> >
> > We can install the startup script that will
> > start qmaster at machine boot (y/n) [y] >>
> >
> > cp /opt/sge/default/common/sgemaster /etc/init.d/sgemaster.mbialjpj55
> > /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55
> >
> > Command failed: /usr/lib/lsb/install_initd
> > /etc/init.d/sgemaster.mbialjpj55
> Does /usr/lib/lsb/install_initd exist?

Yes it is a folder owned by root

> On my RHEL7 box this is a relative symlink pointing to /sbin/chkconfig.

Yes exactly because the command " ls -la /usr/lib/lsb | grep "\->" " provides the output as

lrwxrwxrwx. 1 root root 23 Jun 1 2015 install_initd -> ../../../sbin/chkconfig
lrwxrwxrwx. 1 root root 23 Jun 1 2015 remove_initd -> ../../../sbin/chkconfig

> Does it exist on your machine and to what does it point?

Yes it exists with file permissions and it points to /sbin/chkconfig

> What are the permissions on the file to which it points?

The following command "ls -l /sbin/chkconfig" says
-rwxr-xr-x. 1 root root 41136 Apr 29 2016 /sbin/chkconfig

> > Probably a permission problem. Please check file access permissions.
> > Check root read/write permission. Check if SGE daemons are running.

How to check whether SGE daemons is running?

> > Looking forward to receive binary packages from Dave because I do not know how to look for the one which my distribution provides
>
> Dave's packages for RHEL7 are available by downloading the file at:
> https://copr.fedorainfracloud.org/coprs/loveshack/SGE/repo/epel-7/loveshack-SGE-epel-7.repo
> and placing it in /etc/yum.repos.d

I had made a document file named "loveshack-SGE.repo" and pasted it in /etc/yum.repos.d

> Then
> yum install gridengine gridengine-qmaster gridengine-qmon gridengine-execd

Then I went into /opt/sge and followed the above command

This resolved many dependencies and enabled sufficient repositories

> These install into /opt/sge so if you do switch to using these (which will simplify future
> upgrades) then remove any grid engine install you have there first.

Again the command "./inst_sge -m -x" reached upto the process of

We can install the startup script that will
start qmaster at machine boot (y/n) [y] >>

but landed up in the same error i.e.

cp /opt/sge/default/common/sgemaster /etc/init.d/sgemaster.mbialjpj55
/usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55

Command failed: /usr/lib/lsb/install_initd
/etc/init.d/sgemaster.mbialjpj55

Probably a permission problem. Please check file access permissions.
Check root read/write permission. Check if SGE daemons are running.

I have found the file "sgeqmaster.mbialjpj55" in the location described as /etc/init.d
and ls -l command gives the file permissions as

-rwxr-xr-x. 1 root root 24883 Nov 7 17:27 sgemaster.mbialjpj55

How to check if SGE Daemons is running because command "service --status-all" reveals

netconsole module not loaded
Configured devices:
lo Profile_2 enp0s25 p1p1
Currently active devices:
lo p1p1 enp0s25 virbr0
● rhnsd.service - LSB: Starts the Spacewalk Daemon
   Loaded: loaded (/etc/rc.d/init.d/rhnsd)
   Active: active (running) since Thu 2016-10-13 15:44:00 IST; 3 weeks 4 days ago
     Docs: man:systemd-sysv-generator(8)
 Main PID: 2453 (rhnsd)
   CGroup: /system.slice/rhnsd.service
           └─2453 rhnsd

Nov 06 03:13:31 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 06 07:13:31 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 06 11:13:31 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 06 15:13:31 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 06 19:13:31 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 06 23:13:16 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or is unreadable
Nov 07 03:13:16 mbialjpj rhnsd[2453]: /etc/sysconfig/rhn/systemid does not exist or
Re: [SGE-discuss] Fwd: Error at the time of Distribution staging
On Thu, Oct 20, 2016 at 07:47:38PM +0530, Himanshu Joshi wrote:
> -- Forwarded message --
> From: William Hay
> Date: Thu, Oct 20, 2016 at 6:41 PM
> Subject: Re: [SGE-discuss] Error at the time of Distribution staging
> To: Himanshu Joshi
> Cc: sge-disc...@liverpool.ac.uk
>
> On Thu, Oct 20, 2016 at 11:44:42AM +0530, Himanshu Joshi wrote:
> > Thanks William and Love,
> > Now I had downloaded gridengine-8.1.9-1.el6.src
> > and performed rpm -Uvh gridengine-8.1.9-1.el6.src in my /opt/sge folder as a super user
> >
> > warning: gridengine-8.1.9-1.el6.src.rpm: Header V3 RSA/SHA1 Signature, key ID 92258035: NOKEY
> > Updating / installing...
> >    1:gridengine-8.1.9-1.el6    # [100%]
>
> I assume you then rebuilt and installed the rpms? IIRC rpm -Uvh on a .src.rpm
> will just unpack the sources. Where possible I would start with the binary RPMs
> Dave provides. Still you seem to be getting further than before.
>
> I suppose -U option shall Upgrade if previous version is there otherwise it installs the rpm
>
> there are warning messages as well during installation of the above package as
>
>  warning: user mockbuild does not exist - using root
>  warning: group mockbuild does not exist - using root
>  warning: user mockbuild does not exist - using root
>  warning: group mockbuild does not exist - using root
>
> Is that what you were referring to with the term "rebuild and install"

No, that just looks like the user and group Dave used when prepping the src.rpm. Since they don't exist on your system rpm whinges a bit. Doesn't matter a whole lot when installing a .src.rpm.

> If that is the case, kindly opine how to build this package

I would avoid doing that if at all possible. If you can, use the binary rpms appropriate to the OS you have. See my previous message (plus Dave's correction of the version number) for where to find them. You'll need the gridengine and gridengine-qmaster rpms to install the qmaster IIRC.

If your RPM based OS is not supported by any of the existing binary RPMS then there are fairly generic instructions for building binary rpms from a .src.rpm here:
https://wiki.centos.org/HowTos/RebuildSRPM

And remember to pick a cluster name without underscores or other funny characters when running inst_sge.

William
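The advice about cluster names can be checked before running inst_sge. The sketch below is deliberately conservative and is an assumption, not the rule inst_sge itself applies: the hypothetical helper valid_cluster_name accepts only names that start with a letter and contain nothing but ASCII letters and digits.

```shell
#!/bin/sh
# Conservative cluster-name check: a letter first, then letters and digits only.
# This is an assumed rule for illustration; inst_sge's own check may differ.
# valid_cluster_name is a hypothetical helper name.
valid_cluster_name() {
    case "$1" in
        ''|[!A-Za-z]*|*[!A-Za-z0-9]*) return 1 ;;   # empty, bad first char, or odd char
        *) return 0 ;;
    esac
}

for name in mbialjpj55 my_cluster; do
    if valid_cluster_name "$name"; then
        echo "$name: ok"
    else
        echo "$name: avoid"
    fi
done
```

A name that passes this check has none of the underscores or other unusual characters warned about above.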