Hi, I have a corosync problem on my production cluster, and I don't known how to debug.
Cluster is a 12 nodes cluster, multicast is working fine on this cluster, 2 nodes show "corosync [TOTEM ] Retransmit List" All nodes are show ------------------ # cman_tool nodes Node Sts Inc Joined Name 1 M 76636 2014-09-08 12:23:04 kvm6 2 M 76636 2014-09-08 12:23:04 kvm4 3 M 76636 2014-09-08 12:23:04 kvm3 4 M 76636 2014-09-08 12:23:04 kvm2 5 M 76636 2014-09-08 12:23:04 kvm5 6 M 76672 2014-09-12 16:52:08 kvm1 7 M 76636 2014-09-08 12:23:04 kvm8 8 M 76636 2014-09-08 12:23:04 kvm7 8 M 76636 2014-09-08 12:23:04 kvm9 10 M 76636 2014-09-08 12:23:04 kvm10 11 M 76944 2014-09-14 08:08:18 kvm11 12 M 4 2014-09-03 06:57:27 kvm12 I have quorum -------------- #cman_tool status Version: 6.2.0 Config Version: 12 Cluster Name: odiso Cluster Id: 3337 Cluster Member: Yes Cluster Generation: 76944 Membership state: Cluster-Member Nodes: 12 Expected votes: 12 Total votes: 12 Node votes: 1 Quorum: 7 Active subsystems: 5 Flags: Ports Bound: 0 Node name: kvm12 Node ID: 12 Multicast addresses: 239.192.13.22 Node addresses: 10.3.94.59 But I can't write anything in pmxcfs on any node (read is ok) with a lot erros like this kvm1 pmxcfs[65403]: [dcdb] notice: cpg_join retry 32310 kvm1 pmxcfs[65403]: [dcdb] notice: cpg_join retry 32320 kvm1 pmxcfs[65403]: [dcdb] notice: cpg_join retry 32330 Any idea ? I would like to try to change corosync window_size, but how can I do it online ? (and /etc/init.d/cman stop is hanging) _______________________________________________ pve-devel mailing list pve-devel@pve.proxmox.com http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel