I have the same question. I know that, years ago, Ceph was built on a simple
filesystem of its own.
I'd like to hear some more details.


------------------ Original ------------------
From:  "James Harper"<[email protected]>;
Date:  Sat, Jun 15, 2013 11:07 AM
To:  "Gregory Farnum"<[email protected]>; "Craig 
Lewis"<[email protected]>; 
Cc:  "[email protected]"<[email protected]>; 
Subject:  [ceph-users] Why does ceph need a filesystem (was Simulating DiskFailure)



> 
> Yeah. You've picked up on some warty bits of Ceph's error handling here for
> sure, but it's exacerbated by the fact that you're not simulating what you
> think. In a real disk error situation the filesystem would be returning EIO or
> something, but here it's returning ENOENT. Since the OSD is authoritative for
> that key space and the filesystem says there is no such object, presto! It
> doesn't exist.
> If you restart the OSD it does a scan of the PGs on-disk as well as what it
> should have, and can pick up on the data not being there and recover. But
> "correctly" handling data that has been (from the local FS' perspective)
> properly deleted under a running process would require huge and expensive
> contortions on the part of the daemon (in any distributed system that I can
> think of).
> -Greg
> 
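
To make the EIO vs. ENOENT distinction above concrete, here is a minimal
sketch in Python. It is not Ceph code: the function names and the category
labels are made up for illustration, and it only shows how a process reading
from a local filesystem could tell "the file is gone" apart from "the medium
returned an error".

```python
import errno


def classify_read_failure(exc: OSError) -> str:
    """Map an OSError from a local read to a coarse category.

    The category names are hypothetical labels for this sketch only.
    """
    if exc.errno == errno.ENOENT:
        # The filesystem says the file does not exist (e.g. it was deleted).
        return "missing"
    if exc.errno == errno.EIO:
        # The filesystem reports a low-level I/O error (e.g. a bad sector).
        return "media-error"
    return "other"


def read_object(path: str):
    """Try to read a file; return (status, data)."""
    try:
        with open(path, "rb") as f:
            return "ok", f.read()
    except OSError as exc:
        return classify_read_failure(exc), None
```

Deleting a file under a running process yields the "missing" case, which is
why it does not look like a disk failure to the reader.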

Why was the decision made for Ceph to require an underlying filesystem, rather 
than direct access to the disk (as DRBD has)?

All of my recent disk failures have been unrecoverable read errors (pending 
sectors in the SMART stats), which are easy enough to repair in the short term 
just by rewriting the sector with a known good copy of the data (assuming that 
there isn't some other underlying cause and this was just a 
power-off-at-the-wrong-moment error). Unfortunately, because of the disconnect 
between Ceph and the LBA, this can't be done by Ceph.
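
As a rough sketch of the "rewrite with a known good copy" repair, something
like the following Python would do it for a byte range. The function name and
the 512-byte sector size are assumptions for illustration. On a real disk the
path would be the block device and the offset would be LBA * sector size; here
it works on any writable file.

```python
import os

SECTOR_SIZE = 512  # assumed logical sector size; real drives may differ


def rewrite_range(path: str, offset: int, good_data: bytes) -> int:
    """Overwrite len(good_data) bytes at `offset` with a known good copy.

    Hypothetical helper for illustration; returns the number of bytes written.
    """
    fd = os.open(path, os.O_WRONLY)
    try:
        written = 0
        # os.pwrite may write fewer bytes than requested, so loop until done.
        while written < len(good_data):
            written += os.pwrite(fd, good_data[written:], offset + written)
        # Push the data out of the page cache towards the device.
        os.fsync(fd)
    finally:
        os.close(fd)
    return written
```

For a pending sector at LBA n, the call would be something like
rewrite_range(device, n * SECTOR_SIZE, good_sector), given a good copy of that
sector from elsewhere.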

Just curious...

Thanks

James
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
