Hi,

I just received my new test hardware and went on setting up my Sheepdog
cluster again.

But right now i'm running into a different problem.

The cluster consists of 5 nodes, which can be showed by:

  Idx   Node id (FNV-1a) - Host:Port
------------------------------------------------
  0     13e9d7233684c11d - 192.168.6.215:7000
  1     27ca81e942cd0eef - 192.168.6.213:7000
* 2     4f5de28d9ad07d49 - 192.168.6.211:7000
  3     d3d995c9a4f4336a - 192.168.6.212:7000
  4     e269ca1559662fa8 - 192.168.6.214:7000

That works fine.

All 5 hosts have the directory /srv/sheepdog mounted with Btrfs, this is
a partition of 196GB:

r...@osd1:~# df -h|grep sheepdog
/dev/sda7             196G   72K  192G   1% /srv/sheepdog
r...@osd1:~# mount|grep sheepdog
/dev/sda7 on /srv/sheepdog type btrfs (rw,noatime)
r...@osd1:~#

This is the same on all the hosts, i verified that.

But now, when i try to create a image, i get:

r...@osd1:~# /usr/local/bin/qemu-img create -f sheepdog vm002 50G
Formatting 'vm002', fmt=sheepdog size=53687091200 
do_sd_create 1143: The system is still booting, vm002
qemu-img: Error while formatting
r...@osd1:~#

Now that seems odd, but when checking my cluster i got:

r...@osd1:~# shepherd info -t cluster
startup

Ctime              Epoch Nodes
r...@osd1:~# shepherd info -t sheep
Id      Size    Used    Use%

Total   0.0 MB  0.0 MB  -2147483648%, total virtual VDI Size    0.0 MB
r...@osd1:~#

As you can see the sizes of the nodes is not detected correctly..

I attached the collie.log of "osd1", which might give you some more
clues.

To me everything seems fine?

All 5 hosts are running Ubuntu 9.10 with kernel version 2.6.34 and
Sheepdog is build against the latest GIT revision.

Any idea?

-- 
Met vriendelijke groet,

Wido den Hollander
Hoofd Systeembeheer / CSO
Telefoon Support Nederland: 0900 9633 (45 cpm)
Telefoon Support Belgiƫ: 0900 70312 (45 cpm)
Telefoon Direct: (+31) (0)20 50 60 104
Fax: +31 (0)20 50 60 111
E-mail: [email protected]
Website: http://www.pcextreme.nl
Kennisbank: http://support.pcextreme.nl/
Netwerkstatus: http://nmc.pcextreme.nl


Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 set_addr(1039) addr = 192.168.6.211
Apr 09 01:07:05 main(126) Sheepdog daemon (version 52432db-1edc1f0) started
Apr 09 01:07:05 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:07:05 sd_confch(926) 5 0 1
Apr 09 01:07:05 sd_confch(930) [0] node_id: 1392945344, pid: 5554, reason: 1693280720
Apr 09 01:07:05 sd_confch(930) [1] node_id: 1409722560, pid: 1551, reason: 168
Apr 09 01:07:05 sd_confch(930) [2] node_id: 1426499776, pid: 1490, reason: 1298
Apr 09 01:07:05 sd_confch(930) [3] node_id: 1443276992, pid: 1181, reason: 0
Apr 09 01:07:05 sd_confch(930) [4] node_id: 1460054208, pid: 1298, reason: 1065042176
Apr 09 01:07:05 __sd_confch(884) 0
Apr 09 01:07:05 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:05 read_epoch(1512) failed to read epoch 0
Apr 09 01:07:05 epoch_queue_request(1541) failed, 0, 25, 3
Apr 09 01:07:05 client_handler(330) closed a connection, 11
Apr 09 01:07:05 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 update_cluster_info(528) system status = 1, epoch = 0
Apr 09 01:07:05 print_node_list(259)   nodeid: 5706a8c0, pid: 1298, ip: 192.168.6.215:7000
Apr 09 01:07:05 print_node_list(259)   nodeid: 5606a8c0, pid: 1181, ip: 192.168.6.214:7000
Apr 09 01:07:05 print_node_list(259)   nodeid: 5506a8c0, pid: 1490, ip: 192.168.6.213:7000
Apr 09 01:07:05 print_node_list(259)   nodeid: 5406a8c0, pid: 1551, ip: 192.168.6.212:7000
Apr 09 01:07:05 print_node_list(259) l nodeid: 5306a8c0, pid: 5554, ip: 192.168.6.211:7000
Apr 09 01:07:08 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:08 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:08 client_handler(330) closed a connection, 11
Apr 09 01:07:10 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:10 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:10 client_handler(330) closed a connection, 11
Apr 09 01:07:12 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:12 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:12 client_handler(330) closed a connection, 11
Apr 09 01:07:35 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:35 client_handler(330) closed a connection, 11
Apr 09 01:07:42 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:42 client_handler(330) closed a connection, 11
Apr 09 01:12:34 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:34 cluster_queue_request(175) 0x7f30ba2e0010 b1
Apr 09 01:12:34 client_handler(330) closed a connection, 11
Apr 09 01:12:37 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:37 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:12:37 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 set_addr(1039) addr = 192.168.6.211
Apr 09 01:13:58 main(126) Sheepdog daemon (version 52432db-1edc1f0) started
Apr 09 01:13:58 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:58 sd_confch(926) 1 0 1
Apr 09 01:13:58 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 1065042176
Apr 09 01:13:58 __sd_confch(884) 0
Apr 09 01:13:58 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 read_epoch(1512) failed to read epoch 0
Apr 09 01:13:58 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:58 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:59 sd_confch(926) 2 0 1
Apr 09 01:13:59 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:13:59 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 455349247
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 print_node_list(259)   nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:13:59 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:59 sd_confch(926) 3 0 1
Apr 09 01:13:59 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 8387822
Apr 09 01:13:59 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 12288
Apr 09 01:13:59 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 1065042176
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 print_node_list(259)   nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:13:59 print_node_list(259)   nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:14:00 sd_confch(926) 4 0 1
Apr 09 01:14:00 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:14:00 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 8378470
Apr 09 01:14:00 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 255
Apr 09 01:14:00 sd_confch(930) [3] node_id: 1443276992, pid: 4249, reason: 455349247
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5606a8c0, pid: 4249, ip: 192.168.6.214:7000
Apr 09 01:14:00 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:14:00 sd_confch(926) 5 0 1
Apr 09 01:14:00 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:14:00 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 0
Apr 09 01:14:00 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 8385427
Apr 09 01:14:00 sd_confch(930) [3] node_id: 1443276992, pid: 4249, reason: 12288
Apr 09 01:14:00 sd_confch(930) [4] node_id: 1460054208, pid: 4400, reason: 1065042176
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5606a8c0, pid: 4249, ip: 192.168.6.214:7000
Apr 09 01:14:00 print_node_list(259)   nodeid: 5706a8c0, pid: 4400, ip: 192.168.6.215:7000
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 cluster_queue_request(175) 0x326df00 19
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:06 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:06 cluster_queue_request(175) 0x326df00 19
Apr 09 01:14:06 client_handler(330) closed a connection, 11
Apr 09 02:13:19 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:19 cluster_queue_request(175) 0x326df00 19
Apr 09 02:13:19 client_handler(330) closed a connection, 11
Apr 09 02:13:23 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:23 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:13:23 client_handler(330) closed a connection, 11
Apr 09 02:13:51 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:51 client_handler(330) closed a connection, 11
Apr 09 02:14:56 listen_handler(369) accepted a new connection, 11
Apr 09 02:14:56 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:14:56 client_handler(330) closed a connection, 11
Apr 09 02:14:59 listen_handler(369) accepted a new connection, 11
Apr 09 02:14:59 cluster_queue_request(175) 0x326df00 19
Apr 09 02:14:59 client_handler(330) closed a connection, 11
Apr 09 02:16:24 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:24 client_handler(330) closed a connection, 11
Apr 09 02:16:52 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:52 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:16:52 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 cluster_queue_request(175) 0x326df00 19
Apr 09 02:16:54 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 client_handler(330) closed a connection, 11
-- 
sheepdog mailing list
[email protected]
http://lists.wpkg.org/mailman/listinfo/sheepdog

Reply via email to