Hi Martin-
On Sat, 14 Jan 2012, Sage Weil wrote:
> Hi Martin-
>
> On Sat, 14 Jan 2012, Martin Mailand wrote:
>
> > Hi
> > one of four OSD died during the update to v0.40 with an Assertion
> > os/FileStore.cc: 2438: FAILED assert(0 == "unexpected error")
> > Even after a complete shutdown of the cluster an a new start with all OSD at
> > the same version, this osd did not start.
> >
> > The OSD Log it attached.
>
> It's trying to replay a transaction that appears to be invalid because the
> .2 clone is smaller than it thinks. Is this the first time the OSD
> crashed, or did it crash once, and you cranked up logs and generated
> this one? If you have the previous log, that would be helpful... it
> should have a similar tranasction dump but a different stack trace.
I pushed a wip-osd-dump-journal branch to git that will make
ceph-osd -i <whatever> --dump-journal > /tmp/foo.txt
dump the contents of your entire osd journal (sans data) to a text file.
Do you mind sending that along as well? I'd like to see what is in the
journal _after_ the event that is failing (if anything).
Thanks!
sage
>
> Also, are any of the 6 patches on top of 0.40 related to the filestore or
> osd?
>
> Thanks!
> sage
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html