Status: New
Owner: ----

New issue 387 by [email protected]: gnt-instance failing to sync drbd
http://code.google.com/p/ganeti/issues/detail?id=387


Using the "ganeti2" 2.5.2-1 package on Debian Wheezy, I am attempting to create an instance using DRBD on a 2-node cluster.

On node1, the LVM 'xenvg' volume group is backed by bcache and an SSD, using a custom-compiled kernel; node2 does not use bcache. Both nodes run the same kernel, "3.2.28+" according to uname -a. So the underlying block device for LVM is /dev/bcache0 on node1 and /dev/sda8 on node2. I am hoping that won't make a difference.

Using the following command,

gnt-instance add \
        -n node1.novylen.net:node2.novylen.net \
        -o debootstrap+default \
        -t drbd \
        -s 10G \
        -B vcpus=4,memory=8192 \
        --net 0:link=xen-br1 \
        instance1

I get the following output and the attached syslog output on node1 and node2 (filtered to show drbd).

root@node1:/var/lib/ganeti# ~/instance1.sh
Mon Feb 25 21:21:41 2013 * creating instance disks...
Mon Feb 25 21:21:46 2013 adding instance instance1.novylen.net to cluster config
Mon Feb 25 21:21:47 2013 - INFO: Waiting for instance instance1.novylen.net to sync disks.
Mon Feb 25 21:21:58 2013 - INFO: Instance instance1.novylen.net's disks are in sync.
Failure: command execution error:
There are some degraded disks for this instance

Obviously DRBD is failing to synchronize with node2, but I don't understand why.
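In case it helps anyone reproducing this, here is a minimal sketch of how the per-device DRBD disk state could be checked on each node. It assumes the usual DRBD 8.x /proc/drbd layout, where each device line carries a "ds:local/peer" field. The function name drbd_degraded is hypothetical and is not part of Ganeti or DRBD.

```shell
# Hypothetical helper: list DRBD minors whose disk state is not
# UpToDate/UpToDate. Assumes the DRBD 8.x /proc/drbd layout, where a
# device line looks like:
#   " 0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----"
drbd_degraded() {
    # Read the file given as $1, defaulting to /proc/drbd.
    awk '/^ *[0-9]+: cs:/ {
        ds = ""
        # Find the "ds:" field and strip its three-character prefix.
        for (i = 1; i <= NF; i++) if ($i ~ /^ds:/) ds = substr($i, 4)
        # Lines without a ds: field (e.g. unconfigured minors) are skipped.
        if (ds != "" && ds != "UpToDate/UpToDate") print $1, $2, "ds:" ds
    }' "${1:-/proc/drbd}"
}
```

Running drbd_degraded with no argument on node1 and node2 would print one line per minor that is not fully up to date on both sides; empty output means every configured minor reports UpToDate/UpToDate.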

I have re-added node2 (which did generate a PEM error; I re-generated a custom one and the error has gone away).

/etc/drbd.d is set to the Debian defaults, that is, all sections are empty except the first one.

Here is the result of gnt-cluster verify:

Submitted jobs 1446, 1447
Waiting for job 1446 ...
Mon Feb 25 21:25:11 2013 * Verifying cluster config
Mon Feb 25 21:25:11 2013 * Verifying cluster certificate files
Mon Feb 25 21:25:11 2013 * Verifying hypervisor parameters
Mon Feb 25 21:25:11 2013 * Verifying all nodes belong to an existing group
Waiting for job 1447 ...
Mon Feb 25 21:25:11 2013 * Verifying group 'default'
Mon Feb 25 21:25:11 2013 * Gathering data (2 nodes)
Mon Feb 25 21:25:12 2013 * Gathering disk information (2 nodes)
Mon Feb 25 21:25:12 2013 * Verifying configuration file consistency
Mon Feb 25 21:25:12 2013 * Verifying node status
Mon Feb 25 21:25:12 2013 * Verifying instance status
Mon Feb 25 21:25:12 2013 * Verifying orphan volumes
Mon Feb 25 21:25:12 2013 * Verifying N+1 Memory redundancy
Mon Feb 25 21:25:12 2013 * Other Notes
Mon Feb 25 21:25:12 2013   - NOTICE: 1 non-redundant instance(s) found.
Mon Feb 25 21:25:12 2013 * Hooks Results

Thank you for any insight you might be able to offer.

Attachments:
        node1.txt  5.2 KB
        node2.txt  4.8 KB
