Since I haven't been able to find a listserv specifically for MAAS and
this listserv is close to the subject and seems to provide good
answers...

I have a MAAS server that was last installed December 18, 2014 to test
building an OpenStack cloud.  When I first built it, it would discover,
commission, and deploy physical machines just fine.  Over time it starts
loosing the ability to deploy machines.  It will reboot them and pxe
boot them with the installer, but something fails in the installer
process and they end up in one of 2 states.

1.  claim to be deployed, but the machine ends up powered down.  The
last node event is something like:

Failed to query node's BMC — Node could not be queried
node-27aac898-bec8-11e4-bb0d-180373b04ac9 (esxi06.maas) connection
timeout

or
2.  fails the deploy, but finishes the install, but never configures the
network interfaces and will not talk to the world (thus the failed
deploy).  The last node event for this one also looks like:

Failed to query node's BMC — Node could not be queried
node-4881df02-bec8-11e4-bcd7-180373b04ac9 (esxi05.maas) connection
timeout

I have restarted bind9 (fixed an earlier dns forwarder problem that
caused juju to fail in deployments) and maas-dhcp.  The machines seem to
be getting a correct IP address when booting up, they just don't keep it
to the end of their configuration process.

I suspect that both problems are related but am not sure where to look
to find the debug information on it.

Any ideas or suggestions for a better place to ask?

I have been having problem with MAAS and juju not being very reliable or
stable in repeating operations.  Any help is greatly appreciated.
-- 
Daniel Bidwell <drbidw...@gmail.com>


-- 
Juju mailing list
Juju@lists.ubuntu.com
Modify settings or unsubscribe at: 
https://lists.ubuntu.com/mailman/listinfo/juju

Reply via email to