Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-20 Thread Himanshu Joshi
Thanks William and Love,
I have now downloaded gridengine-8.1.9-1.el6.src.rpm
and ran rpm -Uvh gridengine-8.1.9-1.el6.src.rpm in my /opt/sge folder as
superuser:

warning: gridengine-8.1.9-1.el6.src.rpm: Header V3 RSA/SHA1 Signature, key ID 92258035: NOKEY
Updating / installing...
   1:gridengine-8.1.9-1.el6   # [100%]


While installing with the ./inst_sge -m -x command, I got
the following error:

qmaster startup script
--

We can install the startup script that will
start qmaster at machine boot (y/n) [y] >>

*After hitting Return, the following error appeared:*

cp /opt/sge/default/common/sgemaster /etc/init.d/sgemaster.mbialjpj_cluster
/usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj_cluster

Command failed: /usr/lib/lsb/install_initd
/etc/init.d/sgemaster.mbialjpj_cluster

Probably a permission problem. Please check file access permissions.
Check root read/write permission. Check if SGE daemons are running.

P.S.
I selected "root" as the installing user:
Installing Grid Engine as user >root<
Hit <RETURN> to continue >>

And I gave the cluster name as "mbialjpj_cluster".

Looking forward to hearing from the experts.

Regards

On Wed, Oct 19, 2016 at 8:08 PM, Dave Love  wrote:

> Himanshu Joshi  writes:
>
> > Dear William,
> > Apologies, I am new to the setup, so I may be misinterpreting the
> > suggested solution.
> > Can you help me find the packages for RHEL 7? The link you sent
> > (http://copr.fedoraproject.org/coprs/loveshack/SGE/) does not seem to
> > have any packages or repositories for installing SGE, unlike the link
> > for RHEL5/RHEL6 or a Debianish version.
>
> It has links to the .repo files and another about enabling copr
> repositories, i.e. what to do with the .repo files so that you can do
> "yum install gridengine ...".
>



-- 
Himanshu Joshi
M.Tech. Cognitive & Neuroscience.
Ph.D Scholar,
Department of Psychiatry
NIMHANS, Bangalore
Publications

Multimodal Brain Image Analysis Laboratory

___
SGE-discuss mailing list
SGE-discuss@liv.ac.uk
https://arc.liv.ac.uk/mailman/listinfo/sge-discuss


Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-19 Thread Dave Love
William Hay  writes:

> I think the recommended process is to install the RPMs/debs Dave provides 
> then run inst_sge with the appropriate options for the sort of node you 
> are installing.  

Yes, that's what I do.

> If you are using something similar to RHEL5/RHEL6 or a Debianish version of 
> Linux then the appropriate packages can be found here:
>
> https://arc.liv.ac.uk/downloads/SGE/releases/8.1.7/

[The current version is 8.1.9.]

> If you are using RHEL7 or similar then they are available here:
> http://copr.fedoraproject.org/coprs/loveshack/SGE/

You might as well use that repository anyhow.  I don't know why I
bothered keeping the recent rpms locally.

> At UCL we install the qmaster node and have it export /opt/sge/default/common
> and its spool.
>
> New nodes have the appropriate rpms installed, mount the exported filesystems
> and run a copy of the execd startup script at boot.  Don't even need to 
> run inst_sge on the workers or submit nodes.

Same here, except the compute nodes have an NFS root.
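
For anyone wanting to script that arrangement, a minimal sketch of a boot-time check on a worker node might look like the following. It assumes the qmaster's cell directory is mounted at /opt/sge/default/common as described above; cell_common_ready is a hypothetical helper name, and starting sgeexecd straight from the shared directory (rather than from a local copy) is an assumption, not something stated in this thread.

```shell
#!/bin/sh
# Sketch only: check that a shared cell "common" directory looks usable
# before starting execd on a worker. cell_common_ready is a hypothetical
# helper, not part of Grid Engine.
cell_common_ready() {
    common=$1
    # act_qmaster names the current qmaster host; sgeexecd is the execd
    # startup script that lives alongside it.
    [ -r "$common/act_qmaster" ] && [ -x "$common/sgeexecd" ]
}

# Example use at boot (paths assumed from the thread, adjust as needed):
#   SGE_ROOT=/opt/sge SGE_CELL=default
#   if cell_common_ready "$SGE_ROOT/$SGE_CELL/common"; then
#       "$SGE_ROOT/$SGE_CELL/common/sgeexecd" start
#   fi
```

If the check fails, the usual suspect is that the NFS mount has not come up yet, so it is worth ordering the startup script after the network filesystems.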


Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-14 Thread Dave Love
Himanshu Joshi  writes:

> Using jvm library >/etc/alternatives/jre/lib/amd64/server/libjvm.so<

So it appears to be a Debian-like system, in which case why not use the
Debian packaging?  If it's some variety of system on which the packaging
doesn't work, I can fix it, given the names of the relevant packages for
building.


Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-14 Thread Dave Love
William Hay  writes:

>>It does not accept default as the cell name and asks me to change it.
>>I changed it to mbialjpj as well.
> You mean you specified default as the Cell Name and it rejected it?
> That's a little odd.

If I recall correctly, it complains if you try to install the same cell
twice.
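
A quick way to look for leftovers from an earlier attempt is sketched below. It assumes the installer's complaint is triggered by an existing cell directory, which is a guess; list_cell_leftovers is a hypothetical helper name. The two locations in the usage line are the ones that appear in this thread ($SGE_ROOT and /var/sgeCA/port6444).

```shell
#!/bin/sh
# Sketch only: print any existing <base>/<cell> entries left behind by a
# previous installation attempt. list_cell_leftovers is a hypothetical
# helper, not part of Grid Engine.
list_cell_leftovers() {
    cell=$1
    shift
    for base in "$@"; do
        if [ -e "$base/$cell" ]; then
            echo "$base/$cell"
        fi
    done
}

# Example use (locations taken from the thread):
#   list_cell_leftovers default "$SGE_ROOT" /var/sgeCA/port6444
```

Anything it prints is a candidate for inspection before rerunning the installer with the same cell name.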


Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-14 Thread William Hay
On Thu, Oct 13, 2016 at 08:23:13PM +0530, Himanshu Joshi wrote:
>Let SGE-discuss answer the question.
>As you rightly pointed out, I should mention that at the time
>of the first installation I specified "default" as the cell name. But
>"default" was never used as the qmaster host in any of my installation trials.
> 
>Subsequently I deleted the default folder from my $SGE_ROOT directory
>and again gave the cell name as default.
>The same error continues:
> 
>Data Base Updated
>Error: Cannot create keystore /var/sgeCA/port6444/default/private/keystore
Is the directory above on a NFS filesystem perhaps?

Did you specify a user other than root to install as?  It looks like you are
installing as a user called 'default' but that user doesn't exist.  Not sure
how that happened, as the existence of the user should have been checked if
you entered it by hand.

One possibility is that the installer picked up the username from a directory
imported from a network filesystem of some sort where the user is valid.  If
network filesystems are involved you'll want to ensure that usernames and
group names, uids and gids either match across the cluster or are reliably
translated.

Are there any directories that appear to be owned by user 'default'?
Does getent passwd default return anything?
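
Those two checks can be sketched as below. This is only an illustration: user_exists and show_owners are hypothetical helper names, and the paths in the usage lines are the ones mentioned in this thread. The fallback to id is there only for systems without getent.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical.

# Return 0 if the named account is known to the system.
user_exists() {
    if command -v getent >/dev/null 2>&1; then
        getent passwd "$1" >/dev/null 2>&1
    else
        id -u "$1" >/dev/null 2>&1
    fi
}

# Show owner and group of each path that exists, so that an unexpected
# owner such as 'default' stands out.
show_owners() {
    for path in "$@"; do
        if [ -e "$path" ]; then
            ls -ld "$path"
        fi
    done
}

# Example use:
#   user_exists default || echo "no such user: default"
#   show_owners "$SGE_ROOT" "$SGE_ROOT/default" /var/sgeCA/port6444
```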

William




Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-13 Thread William Hay
On Thu, Oct 13, 2016 at 03:41:27PM +0530, Himanshu Joshi wrote:
>Thanks William,
> 
>As per your suggestion I had changed the hostname to MBIALJPJ
>hostnamectl status command says
> 
>   Static hostname: mbialjpj
>   Pretty hostname: MBIALJPJ
>         Icon name: computer-desktop
>           Chassis: desktop
>        Machine ID: 431da268159243088e0e02874e8d36bf
>           Boot ID: f3bb3c227eea4390a1d306b23ba5e25b
>  Operating System: Red Hat Enterprise Linux
>       CPE OS Name: cpe:/o:redhat:enterprise_linux:7.2:GA:workstation
>            Kernel: Linux 3.10.0-327.el7.x86_64
>      Architecture: x86-64
> 
>Still
> 
>It does not accept default as the cell name and asks me to change it.
>I changed it to mbialjpj as well.
You mean you specified default as the Cell Name and it rejected it?  That's a
little odd.


> 
>But still the GUI says
I'd try the text-mode installer; it is a little easier to cut and paste the
output of any problems into an e-mail.

> 
>FAILED: Task failed.
> 
>OUTPUT:
> 
>...
>
>
> 
>Error: Cannot create keystore
>/var/sgeCA/port6444/mbialjpj/private/keystore
>Error: keystore directory does not exist:
>/var/sgeCA/port6444/mbialjpj/private
>./util/install_modules/inst_qmaster.sh: line 1159:
>/var/sgeCA/port6444/mbialjpj/private/keystore.password: No such file or
>directory
>chown: invalid user: `default'
> 
>Kindly suggest the needful
Not sure.  Did you specify default as the user to install as somewhere?
Anyway copying to sge-disc...@liverpool.ac.uk.


William




Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-13 Thread William Hay
On Thu, Oct 13, 2016 at 11:07:28AM +0530, Himanshu Joshi wrote:
> 
>The error again is
> 
>Error: Unable to access jarfile ./util/gui-installer/installer.jar

IIRC you were having issues with building with Java earlier.  I suspect the
above may be a result of that.  Possibly you just need to make sure all the
prerequisites are installed before building and then you'll get the Java
parts built.
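
A rough pre-flight check for the GUI installer is sketched here. It assumes it is run with the top of the installed tree as its argument and only looks for the jar named in the error message and for a java binary on PATH; gui_installer_ready is a hypothetical helper name, and the check says nothing about which build prerequisites were missing.

```shell
#!/bin/sh
# Sketch only: gui_installer_ready is a hypothetical helper, not part of
# Grid Engine. Pass the directory that contains util/ (default: current).
gui_installer_ready() {
    root=${1:-.}
    jar=$root/util/gui-installer/installer.jar
    if [ ! -r "$jar" ]; then
        echo "missing: $jar (Java parts probably not built)"
        return 1
    fi
    if ! command -v java >/dev/null 2>&1; then
        echo "java not found on PATH"
        return 1
    fi
    echo "GUI installer jar and java both present"
}

# Example use:
#   gui_installer_ready /opt/sge
```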

> 
>I tried through command line as well
>and ran the following command
>./inst_sge -m -x -csp
> 
>again the error was
> 
>"sed: can't read dist/util/install_modules/inst_common.sh: No such file or
>directory"

Not sure where that error message appeared relative to the messages below.
I'd suggest fixing the localhost issue and seeing if the above persists.

>with the following display
> 
>Welcome to the Grid Engine installation
>---
> 
>Grid Engine qmaster host installation
>-
> 
>Before you continue with the installation please read these hints:
> 
>   - Your terminal window should have a size of at least
> 80x24 characters
> 
>   - The INTR character is often bound to the key Ctrl-C.
> The term >Ctrl-C< is used during the installation if you
> have the possibility to abort the installation
> 
>The qmaster installation procedure will take approximately 5-10 minutes.
> 
>Hit <RETURN> to continue >>
>after hitting return
>the message appears like
> 
>"Unsupported local hostname
>--
> 
>The current hostname is resolved as follows:
> 
>Hostname: localhost
>Aliases: localhost.localdomain localhost4 localhost4.localdomain4
>localhost.localdomain localhost6 localhost6.localdomain6
>Host Address(es): 127.0.0.1 127.0.0.1
> 
>It is not supported for a Grid Engine installation that the local hostname
>contains the hostname "localhost" and/or the IP address "127.0.x.x" of the
>loopback interface.
>The "localhost" hostname should be reserved for the loopback interface
>("127.0.0.1") and the real hostname should be assigned to one of the
>physical or logical network interfaces of this machine.

Update the system hostname to something other than localhost.  Assuming this
is a Linux box, modifying /etc/hostname should change it from the next
boot.  You can use the hostname command (or hostnamectl on a systemd-based
system) to change it from now until next boot.  Other Unix-like systems should
be fairly similar.

You also need to change what IP address this hostname refers to.  Usually you
just edit the /etc/hosts file to include a line giving the IP address of a
non-loopback interface, the hostname and the FQDN of the machine.
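
To confirm the change took effect before rerunning the installer, something like the following sketch may help. It assumes a glibc system with getent and awk available; is_loopback and check_hostname are hypothetical helper names, and the check is not the one the installer itself performs.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical.

# Return 0 if the address is a loopback address (127.0.0.0/8 or ::1).
is_loopback() {
    case "$1" in
        127.*|::1) return 0 ;;
        *)         return 1 ;;
    esac
}

# Report what a hostname resolves to and flag loopback results.
check_hostname() {
    name=$1
    addr=$(getent hosts "$name" | awk '{print $1; exit}')
    if [ -z "$addr" ]; then
        echo "$name: does not resolve"
        return 2
    fi
    if is_loopback "$addr"; then
        echo "$name: resolves to loopback address $addr"
        return 1
    fi
    echo "$name: resolves to $addr"
}

# Example use:
#   check_hostname "$(hostname)"
```

A matching /etc/hosts entry would have the form "192.0.2.10  mbialjpj.example.org  mbialjpj", where the address and domain are placeholders for the machine's real ones.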


William

