Status: New
Owner: ----
New issue 387 by [email protected]: gnt-instance failing to sync drbd
http://code.google.com/p/ganeti/issues/detail?id=387
Using the "ganeti2" 2.5.2-1 package on Debian Wheezy, I am attempting to
create an instance using DRBD on a 2-node cluster.
The LVM 'xenvg' volume group is backed by bcache and an SSD with a
custom-compiled kernel on node1, but not on node2. Both nodes are running
the same kernel, "3.2.28+" according to uname -a. So the underlying block
device for LVM is /dev/bcache0 on node1, and /dev/sda8 on node2. I am
hoping that won't make a difference.
Using the following command,
gnt-instance add \
-n node1.novylen.net:node2.novylen.net \
-o debootstrap+default \
-t drbd \
-s 10G \
-B vcpus=4,memory=8192 \
--net 0:link=xen-br1 \
instance1
I get the following output and the attached syslog output on node1 and
node2 (filtered to show drbd).
root@node1:/var/lib/ganeti# ~/instance1.sh
Mon Feb 25 21:21:41 2013 * creating instance disks...
Mon Feb 25 21:21:46 2013 adding instance instance1.novylen.net to cluster config
Mon Feb 25 21:21:47 2013 - INFO: Waiting for instance instance1.novylen.net to sync disks.
Mon Feb 25 21:21:58 2013 - INFO: Instance instance1.novylen.net's disks are in sync.
Failure: command execution error:
There are some degraded disks for this instance
Obviously DRBD is failing to synchronize with node2, but I don't understand
why.
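For anyone trying to narrow this down, here is a minimal sketch of a check that could be run on each node to list DRBD minors that are not both connected and up to date. It assumes the DRBD 8.x layout of /proc/drbd, where each resource line carries cs: (connection state) and ds: (disk state) fields. The function name drbd_degraded is hypothetical and is not part of Ganeti or DRBD, and the check may not match exactly what Ganeti itself treats as "degraded".

```shell
#!/bin/sh
# drbd_degraded [FILE]: print every configured DRBD minor whose connection
# state is not "Connected" or whose disk state is not "UpToDate/UpToDate".
# FILE defaults to /proc/drbd. (Hypothetical helper, for illustration only.)
drbd_degraded() {
    awk '
        # Resource lines start with the minor number, e.g. " 0: cs:... ds:..."
        $1 ~ /^[0-9]+:$/ {
            cs = ""; ds = ""
            for (i = 2; i <= NF; i++) {
                if ($i ~ /^cs:/) cs = substr($i, 4)
                if ($i ~ /^ds:/) ds = substr($i, 4)
            }
            # Minors that are not configured at all are not interesting here.
            if (cs == "Unconfigured") next
            if (cs != "Connected" || ds != "UpToDate/UpToDate") {
                minor = $1
                sub(/:$/, "", minor)
                print "minor " minor " cs=" cs " ds=" ds
            }
        }
    ' "${1:-/proc/drbd}"
}
```

Calling drbd_degraded with no argument on node1 and node2 right after the failed gnt-instance add should show whether the minor is stuck waiting for its peer or whether one side's disk never reached UpToDate.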
I have re-added node2 (which did generate a PEM error; I re-generated a
custom one and the error has gone away).
/etc/drbd.d is set to the Debian defaults, that is, all sections are empty
except the first one.
Here is the result of gnt-cluster verify:
Submitted jobs 1446, 1447
Waiting for job 1446 ...
Mon Feb 25 21:25:11 2013 * Verifying cluster config
Mon Feb 25 21:25:11 2013 * Verifying cluster certificate files
Mon Feb 25 21:25:11 2013 * Verifying hypervisor parameters
Mon Feb 25 21:25:11 2013 * Verifying all nodes belong to an existing group
Waiting for job 1447 ...
Mon Feb 25 21:25:11 2013 * Verifying group 'default'
Mon Feb 25 21:25:11 2013 * Gathering data (2 nodes)
Mon Feb 25 21:25:12 2013 * Gathering disk information (2 nodes)
Mon Feb 25 21:25:12 2013 * Verifying configuration file consistency
Mon Feb 25 21:25:12 2013 * Verifying node status
Mon Feb 25 21:25:12 2013 * Verifying instance status
Mon Feb 25 21:25:12 2013 * Verifying orphan volumes
Mon Feb 25 21:25:12 2013 * Verifying N+1 Memory redundancy
Mon Feb 25 21:25:12 2013 * Other Notes
Mon Feb 25 21:25:12 2013 - NOTICE: 1 non-redundant instance(s) found.
Mon Feb 25 21:25:12 2013 * Hooks Results
Thank you for any insight you might be able to offer.
Attachments:
node1.txt 5.2 KB
node2.txt 4.8 KB