>
> Random ideas:
>
> 1. try disabling the log redirects to see if anything ends up in the
> standard kickstart log?
>

OK I'll try this. Have to wait for a host to free up to try a reinstall
again.


> 2. SGE is unusually sensitive to hostname and DNS resolution. Is your
> kickstart environment giving the node the same IP address during
> provisioning as it has when running? Does your kickstart environment
> have reverse DNS lookup working so that a lookup on the IP returns the
> proper hostname?
>

I'll dump tests in the kickstart file and check.
Don't know how to check the last bit - you mean a lookup on the IP by the
execute host as it's booting?


> 3. qconf requires communication with the qmaster, it looks like you are
> defining ENV vars that point only to the bin directory rather than
> setting up the full SGE environment during the kickstart. Consider
> sourcing the SGE init scripts or at least setting SGE_ROOT and SGE_CELL
> values so that the SGE binaries can navigate to
> $SGE_ROOT/$SGE_CELL/act_qmaster so that it knows what host to be
> communicating with
>

I source /etc/profile.d/sge-binaries.sh at the begin of my code. Should I
need something else than that? In any case I'm dumping relevent env vars in
the kickstart now to check them.

Thanks

-M

Regards,
> Chris
>
>
> Michael Stauffer wrote:
> > Hi,
> >
> > I'm trying to get some resource configurations in place during
> > kickstart. I have the following in my kickstart file
> > "replace-partition.xml". The file is run during kickstart: I can see
> > output to text files when I add debugging info.
> >
> > This code runs correctly if I run it in a shell once the node is up.
> >
> > The issue seems to be that qhost and qconf aren't outputting anything
> > when they run. Is that to be expected? Here's what I have added:
> >
> > <post>
> >
> >   <....snipped the default stuff for this post...>
> >
> >         # Here's the code as I'd like it to work:
> >         # This code gets reached. I can output these env vars and the
> >         #  values are correct.
> >         export SGEBIN=$SGE_ROOT/bin/$SGE_ARCH
> >         export NODE=$(/bin/hostname -s)
> >         export MEMFREE=`$SGEBIN/qhost -F mem_total -h $NODE|tail -n
> > 1|cut -d: -f3 | cut -d= -f2`
> >         $SGEBIN/qconf -mattr exechost complex_values h_vmem=$MEMFREE
> > $NODE 2&gt;&amp;1 &gt; /root/qconf_complex_setup.log
> >         $SGEBIN/qconf -mattr exechost complex_values s_vmem=$MEMFREE
> > $NODE 2&gt;&amp;1 &gt;&gt; /root/qconf_complex_setup.log
> >
> > </post>
> >
> > Thanks!
> >
> > -M
> >
> > _______________________________________________
> > users mailing list
> > [email protected]
> > https://gridengine.org/mailman/listinfo/users
>
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to