Hi,

I just received my new test hardware and set up my Sheepdog cluster again.

But now I'm running into a different problem. The cluster consists of 5 nodes, as the node list shows:

  Idx  Node id (FNV-1a) - Host:Port
------------------------------------------------
  0    13e9d7233684c11d - 192.168.6.215:7000
  1    27ca81e942cd0eef - 192.168.6.213:7000
* 2    4f5de28d9ad07d49 - 192.168.6.211:7000
  3    d3d995c9a4f4336a - 192.168.6.212:7000
  4    e269ca1559662fa8 - 192.168.6.214:7000

That works fine.

All 5 hosts have the directory /srv/sheepdog mounted with Btrfs, on a 196GB partition:

r...@osd1:~# df -h|grep sheepdog
/dev/sda7 196G 72K 192G 1% /srv/sheepdog
r...@osd1:~# mount|grep sheepdog
/dev/sda7 on /srv/sheepdog type btrfs (rw,noatime)
r...@osd1:~#

This is the same on all the hosts; I verified that.

But now, when I try to create an image, I get:

r...@osd1:~# /usr/local/bin/qemu-img create -f sheepdog vm002 50G
Formatting 'vm002', fmt=sheepdog size=53687091200
do_sd_create 1143: The system is still booting, vm002
qemu-img: Error while formatting
r...@osd1:~#

That seems odd, and when I checked the cluster I got:

r...@osd1:~# shepherd info -t cluster
startup
Ctime Epoch Nodes
r...@osd1:~# shepherd info -t sheep
Id Size Used Use%
Total 0.0 MB 0.0 MB -2147483648%, total virtual VDI Size 0.0 MB
r...@osd1:~#

As you can see, the sizes of the nodes are not detected correctly.

I attached the collie.log of "osd1", which might give you some more clues. To me everything seems fine?

All 5 hosts are running Ubuntu 9.10 with kernel version 2.6.34, and Sheepdog is built against the latest Git revision.

Any idea?

-- 
Kind regards,

Wido den Hollander
Head of System Administration / CSO
Support phone, Netherlands: 0900 9633 (45 cpm)
Support phone, Belgium: 0900 70312 (45 cpm)
Direct phone: (+31) (0)20 50 60 104
Fax: +31 (0)20 50 60 111
E-mail: [email protected]
Website: http://www.pcextreme.nl
Knowledge base: http://support.pcextreme.nl/
Network status: http://nmc.pcextreme.nl
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 worker_routine(181) started this thread 0
Apr 09 01:07:05 set_addr(1039) addr = 192.168.6.211
Apr 09 01:07:05 main(126) Sheepdog daemon (version 52432db-1edc1f0) started
Apr 09 01:07:05 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:07:05 sd_confch(926) 5 0 1
Apr 09 01:07:05 sd_confch(930) [0] node_id: 1392945344, pid: 5554, reason: 1693280720
Apr 09 01:07:05 sd_confch(930) [1] node_id: 1409722560, pid: 1551, reason: 168
Apr 09 01:07:05 sd_confch(930) [2] node_id: 1426499776, pid: 1490, reason: 1298
Apr 09 01:07:05 sd_confch(930) [3] node_id: 1443276992, pid: 1181, reason: 0
Apr 09 01:07:05 sd_confch(930) [4] node_id: 1460054208, pid: 1298, reason: 1065042176
Apr 09 01:07:05 __sd_confch(884) 0
Apr 09 01:07:05 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:05 read_epoch(1512) failed to read epoch 0
Apr 09 01:07:05 epoch_queue_request(1541) failed, 0, 25, 3
Apr 09 01:07:05 client_handler(330) closed a connection, 11
Apr 09 01:07:05 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:07:05 update_cluster_info(528) system status = 1, epoch = 0
Apr 09 01:07:05 print_node_list(259) nodeid: 5706a8c0, pid: 1298, ip: 192.168.6.215:7000
Apr 09 01:07:05 print_node_list(259) nodeid: 5606a8c0, pid: 1181, ip: 192.168.6.214:7000
Apr 09 01:07:05 print_node_list(259) nodeid: 5506a8c0, pid: 1490, ip: 192.168.6.213:7000
Apr 09 01:07:05 print_node_list(259) nodeid: 5406a8c0, pid: 1551, ip: 192.168.6.212:7000
Apr 09 01:07:05 print_node_list(259) l nodeid: 5306a8c0, pid: 5554, ip: 192.168.6.211:7000
Apr 09 01:07:08 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:08 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:08 client_handler(330) closed a connection, 11
Apr 09 01:07:10 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:10 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:10 client_handler(330) closed a connection, 11
Apr 09 01:07:12 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:12 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:07:12 client_handler(330) closed a connection, 11
Apr 09 01:07:35 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:35 client_handler(330) closed a connection, 11
Apr 09 01:07:42 listen_handler(369) accepted a new connection, 11
Apr 09 01:07:42 client_handler(330) closed a connection, 11
Apr 09 01:12:34 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:34 cluster_queue_request(175) 0x7f30ba2e0010 b1
Apr 09 01:12:34 client_handler(330) closed a connection, 11
Apr 09 01:12:37 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:37 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:12:37 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 cluster_queue_request(175) 0x37b0f00 19
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:12:39 listen_handler(369) accepted a new connection, 11
Apr 09 01:12:39 client_handler(330) closed a connection, 11
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 worker_routine(181) started this thread 0
Apr 09 01:13:58 set_addr(1039) addr = 192.168.6.211
Apr 09 01:13:58 main(126) Sheepdog daemon (version 52432db-1edc1f0) started
Apr 09 01:13:58 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:58 sd_confch(926) 1 0 1
Apr 09 01:13:58 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 1065042176
Apr 09 01:13:58 __sd_confch(884) 0
Apr 09 01:13:58 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 read_epoch(1512) failed to read epoch 0
Apr 09 01:13:58 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:58 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.211:7000
Apr 09 01:13:58 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:59 sd_confch(926) 2 0 1
Apr 09 01:13:59 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:13:59 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 455349247
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.212:7000
Apr 09 01:13:59 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 print_node_list(259) nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:13:59 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:13:59 sd_confch(926) 3 0 1
Apr 09 01:13:59 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 8387822
Apr 09 01:13:59 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 12288
Apr 09 01:13:59 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 1065042176
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:13:59 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.213:7000
Apr 09 01:13:59 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:13:59 print_node_list(259) nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:13:59 print_node_list(259) nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:14:00 sd_confch(926) 4 0 1
Apr 09 01:14:00 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:14:00 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 8378470
Apr 09 01:14:00 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 255
Apr 09 01:14:00 sd_confch(930) [3] node_id: 1443276992, pid: 4249, reason: 455349247
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.214:7000
Apr 09 01:14:00 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5606a8c0, pid: 4249, ip: 192.168.6.214:7000
Apr 09 01:14:00 sd_confch(924) confchg nodeid 5306a8c0
Apr 09 01:14:00 sd_confch(926) 5 0 1
Apr 09 01:14:00 sd_confch(930) [0] node_id: 1392945344, pid: 5740, reason: 0
Apr 09 01:14:00 sd_confch(930) [1] node_id: 1409722560, pid: 9189, reason: 0
Apr 09 01:14:00 sd_confch(930) [2] node_id: 1426499776, pid: 4694, reason: 8385427
Apr 09 01:14:00 sd_confch(930) [3] node_id: 1443276992, pid: 4249, reason: 12288
Apr 09 01:14:00 sd_confch(930) [4] node_id: 1460054208, pid: 4400, reason: 1065042176
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 0, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 0, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 get_cluster_status(359) failed to read epoch, 3
Apr 09 01:14:00 sd_deliver(793) op: 1, done: 1, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 __sd_deliver(718) op: 1, done: 1, size: 41024, from: 192.168.6.215:7000
Apr 09 01:14:00 print_node_list(259) l nodeid: 5306a8c0, pid: 5740, ip: 192.168.6.211:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5406a8c0, pid: 9189, ip: 192.168.6.212:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5506a8c0, pid: 4694, ip: 192.168.6.213:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5606a8c0, pid: 4249, ip: 192.168.6.214:7000
Apr 09 01:14:00 print_node_list(259) nodeid: 5706a8c0, pid: 4400, ip: 192.168.6.215:7000
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 cluster_queue_request(175) 0x326df00 19
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:03 client_handler(330) closed a connection, 11
Apr 09 01:14:06 listen_handler(369) accepted a new connection, 11
Apr 09 01:14:06 cluster_queue_request(175) 0x326df00 19
Apr 09 01:14:06 client_handler(330) closed a connection, 11
Apr 09 02:13:19 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:19 cluster_queue_request(175) 0x326df00 19
Apr 09 02:13:19 client_handler(330) closed a connection, 11
Apr 09 02:13:23 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:23 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:13:23 client_handler(330) closed a connection, 11
Apr 09 02:13:51 listen_handler(369) accepted a new connection, 11
Apr 09 02:13:51 client_handler(330) closed a connection, 11
Apr 09 02:14:56 listen_handler(369) accepted a new connection, 11
Apr 09 02:14:56 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:14:56 client_handler(330) closed a connection, 11
Apr 09 02:14:59 listen_handler(369) accepted a new connection, 11
Apr 09 02:14:59 cluster_queue_request(175) 0x326df00 19
Apr 09 02:14:59 client_handler(330) closed a connection, 11
Apr 09 02:16:24 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:24 client_handler(330) closed a connection, 11
Apr 09 02:16:52 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:52 cluster_queue_request(175) 0x326df00 b1
Apr 09 02:16:52 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 cluster_queue_request(175) 0x326df00 19
Apr 09 02:16:54 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 client_handler(330) closed a connection, 11
Apr 09 02:16:54 listen_handler(369) accepted a new connection, 11
Apr 09 02:16:54 client_handler(330) closed a connection, 11
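
For anyone skimming a log like the one above, here is a minimal Python sketch that counts the entries reporting a failure, grouped by the function that logged them. The entry layout (timestamp, function name with a source line number in parentheses, then the message) is only inferred from the entries in this log, and the helper name failures_by_function is made up for illustration; it is not part of Sheepdog.

```python
import re
from collections import Counter

# Assumed entry layout, inferred from the log in this mail, e.g.:
#   "Apr 09 01:07:05 read_epoch(1512) failed to read epoch 0"
ENTRY = re.compile(
    r"^(?P<stamp>\w{3} \d{2} \d{2}:\d{2}:\d{2}) "  # "Apr 09 01:07:05"
    r"(?P<func>\w+)\((?P<line>\d+)\) "             # "read_epoch(1512)"
    r"(?P<msg>.*)$"                                # rest of the entry
)


def failures_by_function(lines):
    """Count entries whose message contains 'failed', keyed by function name.

    Hypothetical helper for illustration; lines that do not match the
    assumed layout are skipped.
    """
    counts = Counter()
    for raw in lines:
        match = ENTRY.match(raw.strip())
        if match and "failed" in match.group("msg"):
            counts[match.group("func")] += 1
    return counts


if __name__ == "__main__":
    # A few entries copied from the log above.
    sample = [
        "Apr 09 01:07:05 read_epoch(1512) failed to read epoch 0",
        "Apr 09 01:07:05 epoch_queue_request(1541) failed, 0, 25, 3",
        "Apr 09 01:13:58 get_cluster_status(359) failed to read epoch, 3",
        "Apr 09 01:14:03 listen_handler(369) accepted a new connection, 11",
    ]
    for func, count in failures_by_function(sample).items():
        print(func, count)
```

To run it over a whole file, pass the open file object instead of the sample list, since the function only iterates over lines.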
