It has been working for a year.
Then today I went to add new execution nodes to the grid. All my Grid
Engine clients and the master share a common /opt/sge directory,
and my whole grid went down with the bootstrap error.
I am asking the other group for a restore, but I need to understand
what I did and whether I can fix it myself. A restore can take them days,
and this is a production system. Arggg.
Thanks,
Dan
On 06/02/2015 11:18 AM, Skylar Thompson wrote:
Rebuilding the bootstrap file is easy, but possibly unnecessary. You should
find out why bootstrap no longer exists - did it live there before?
On Tue, Jun 02, 2015 at 11:15:32AM -0500, Dan Hyatt wrote:
Is it easier to rebuild the bootstrap file, or to restore it from tape
(assuming the other group is backing it up as requested)?
Dan
On 06/02/2015 11:05 AM, Skylar Thompson wrote:
Did your SGE_ROOT and/or SGE_CELL environment variable settings change? All
the GE binaries expect to find the bootstrap file in
${SGE_ROOT}/${SGE_CELL}/common. I suspect that the settings changed, and
your bootstrap file actually lives elsewhere.
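
A quick way to test that theory is to compute the path the binaries would derive from the current environment and compare it with whatever bootstrap files actually exist under the shared root. The sketch below is only an illustration: the function names are hypothetical, and it assumes that an unset SGE_CELL falls back to the cell name "default".

```python
import os
from pathlib import Path


def expected_bootstrap_path(env=None):
    """Return the bootstrap path implied by SGE_ROOT and SGE_CELL."""
    env = os.environ if env is None else env
    sge_root = env.get("SGE_ROOT")
    if not sge_root:
        raise ValueError("SGE_ROOT is not set")
    # Assumption: an unset or empty SGE_CELL means the cell named "default".
    sge_cell = env.get("SGE_CELL") or "default"
    return Path(sge_root) / sge_cell / "common" / "bootstrap"


def find_bootstrap_files(sge_root):
    """List every <cell>/common/bootstrap that exists under sge_root."""
    return sorted(Path(sge_root).glob("*/common/bootstrap"))


def report(env=None):
    """Return human-readable lines comparing expected and existing files."""
    env = os.environ if env is None else env
    expected = expected_bootstrap_path(env)
    lines = [f"expected: {expected} (exists: {expected.is_file()})"]
    for found in find_bootstrap_files(env["SGE_ROOT"]):
        lines.append(f"found:    {found}")
    return lines
```

Running `print("\n".join(report()))` on the master and on one of the new exec nodes should show whether the two hosts agree on the expected path, and whether a bootstrap file exists under a different cell name.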
On Tue, Jun 02, 2015 at 10:51:58AM -0500, Dan Hyatt wrote:
I was trying to add some new exec nodes, and now qstat and qsub on my
master are giving this error:
error: fopen("/opt/sge/default/common/bootstrap") failed: No such file or directory
When I go to that directory, the bootstrap file does not exist.
What did I do and how do I recover?