Hi,
My kernels are running:
3.8.11-200.fc18.x86_64 and 3.8.9-200.fc18.x86_64
My cephx settings are below
auth cluster required = cephx
auth service required = cephx
auth client required = cephx
I will be working on my test cluster later this week and will try to reproduce
and will file a bug then.
Joe
----- Original Message -----
From: "Sage Weil" <[email protected]>
To: "Joe Ryner" <[email protected]>
Cc: [email protected]
Sent: Friday, May 17, 2013 4:01:35 PM
Subject: Re: [ceph-users] Kernel panic on rbd map when cluster is out of
monitor quorum
On Fri, 17 May 2013, Joe Ryner wrote:
> Hi All,
>
> I have had an issue recently while working on my ceph clusters. The
> following issue seems to be true on bobtail and cuttlefish. I have two
> production clusters in two different data centers and a test cluster. We are
> using ceph to run virtual machines. I use rbd as block devices for sanlock.
>
> I am running Fedora 18.
>
> I have been moving monitors around and in the process I got the cluster
> out of quorum, so ceph stopped responding. During this time I decided
> to reboot a ceph node that performs an rbd map during startup. The
> system boots ok but the service script that is performing the rbd map
> doesn't finish and eventually the system will OOPS and then finally
> panic. I was able to disable the rbd map during boot and finally got
> the cluster back in quorum and everything settled down nicely.
What kernel version? Are you using cephx authentication? If you could
open a bug at tracker.ceph.com that would be most helpful!
> Question, has anyone seen this behavior of crashing/panic? I have seen this
> happen on both of my production clusters.
> Secondly, the ceph command hangs when the cluster is out of quorum, is there
> a timeout available?
Not currently. You can do this yourself with 'timeout 120 ...' with any
recent coreutils.
Thanks-
sage
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com