Thanks for your prompt response. Given that my mon.c /var/lib/ceph/mon/ceph-c is currently populated, should I delete it's contents after removing the monitor and before re-adding it?
Darryl On 06/26/13 12:50, Mike Dawson wrote:
Darryl, I've seen this issue a few times recently. I believe Joao was looking into it at one point, but I don't know if it has been resolved (Any news Joao?). Others have run into it too. Look closely at: http://tracker.ceph.com/issues/4999 http://irclogs.ceph.widodh.nl/index.php?date=2013-06-07 http://irclogs.ceph.widodh.nl/index.php?date=2013-05-27 http://irclogs.ceph.widodh.nl/index.php?date=2013-05-25 http://irclogs.ceph.widodh.nl/index.php?date=2013-05-21 http://irclogs.ceph.widodh.nl/index.php?date=2013-05-15 I'd recommend you submit this as a bug on the tracker. It sounds like you have reliable quorum between a and b, that's good. The workaround that has worked for me is to remove mon.c, then re-add it. Assuming your monitor leveldb stores aren't too large, the process is rather quick. Follow the instructions at: http://ceph.com/docs/next/rados/operations/add-or-rm-mons/#removing-monitors then http://ceph.com/docs/next/rados/operations/add-or-rm-mons/#adding-monitors - Mike On 6/25/2013 10:34 PM, Darryl Bond wrote:Upgrading a cluster from 6.1.3 to 6.1.4 with 3 monitors. Cluster had been successfully upgraded from bobtail to cuttlefish and then from 6.1.2 to 6.1.3. There have been no changes to ceph.conf. Node mon.a upgrade, a,b,c monitors OK after upgrade Node mon.b upgrade a,b monitors OK after upgrade (note that c was not available, even though I hadn't touched it) Node mon.c very slow to install the upgrade, RAM was tight for some reason and mon process was using half the RAM Node mon.c shutdown mon.c Node mon.c performed the upgrade Node mon.c restart ceph - mon.c will not start service ceph start mon.c === mon.c === Starting Ceph mon.c on ceph3... [23992]: (33) Numerical argument out of domain failed: 'ulimit -n 8192; /usr/bin/ceph-mon -i c --pid-file /var/run/ceph/mon.c.pid -c /etc/ceph/ceph.conf ' Starting ceph-create-keys on ceph3... health HEALTH_WARN 1 mons down, quorum 0,1 a,b monmap e1: 3 mons at {a=192.168.6.101:6789/0,b=192.168.6.102:6789/0,c=192.168.6.103:6789/0}, election epoch 14224, quorum 0,1 a,b osdmap e1342: 18 osds: 18 up, 18 in pgmap v4058788: 5448 pgs: 5447 active+clean, 1 active+clean+scrubbing+deep; 5820 GB data, 11673 GB used, 35464 GB / 47137 GB avail; 813B/s rd, 643KB/s wr, 69op/s mdsmap e1: 0/0/1 up Set debug mon = 20 Nothing going into logs other than assertion--- begin dump of recent events --- 0> 2013-06-26 12:20:36.383430 7fd5e81b57c0 -1 *** Caught signal (Aborted) ** in thread 7fd5e81b57c0 ceph version 0.61.4 (1669132fcfc27d0c0b5e5bb93ade59d147e23404) 1: /usr/bin/ceph-mon() [0x596fe2] 2: (()+0xf000) [0x7fd5e7820000] 3: (gsignal()+0x35) [0x7fd5e619fba5] 4: (abort()+0x148) [0x7fd5e61a1358] 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7fd5e6a99e1d] 6: (()+0x5eeb6) [0x7fd5e6a97eb6] 7: (()+0x5eee3) [0x7fd5e6a97ee3] 8: (()+0x5f10e) [0x7fd5e6a9810e] 9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x40a) [0x64a6aa] 10: /usr/bin/ceph-mon() [0x65f916] 11: /usr/bin/ceph-mon() [0x6960e9] 12: (pick_addresses(CephContext*)+0x8d) [0x69624d] 13: (main()+0x1a8a) [0x49786a] 14: (__libc_start_main()+0xf5) [0x7fd5e618ba05] 15: /usr/bin/ceph-mon() [0x499a69] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
The contents of this electronic message and any attachments are intended only for the addressee and may contain legally privileged, personal, sensitive or confidential information. If you are not the intended addressee, and have received this email, any transmission, distribution, downloading, printing or photocopying of the contents of this message or attachments is strictly prohibited. Any legal privilege or confidentiality attached to this message and attachments is not waived, lost or destroyed by reason of delivery to any person other than intended addressee. If you have received this message and are not the intended addressee you should notify the sender by return email and destroy all copies of the message and any attachments. Unless expressly attributed, the views expressed in this email do not necessarily represent the views of the company. _______________________________________________ ceph-users mailing list [email protected] http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
