Hi Ceph, In the context of the ceph puppet module ( https://wiki.openstack.org/wiki/Puppet-openstack/ceph-blueprint ) I tried to think about what should be provided to deal with disks / OSDs when they are removed or moved around.
Here is a possible scenario:
* Machine A dies and contains OSD 42
* ceph osd rm 42 is done to get rid of the OSD
* ceph-prepare is called on a new disk and gets OSD id 42
ceph/src/mon/OSDMonitor.cc
// allocate a new id
for (i=0; i < osdmap.get_max_osd(); i++) {
if (!osdmap.exists(i) &&
pending_inc.new_up_client.count(i) == 0 &&
(pending_inc.new_state.count(i) == 0 ||
(pending_inc.new_state[i] & CEPH_OSD_EXISTS) == 0))
goto done;
}
* The disk of machine A is still good and is plugged into machine C
* the udev logic sees it has the ceph magic uuid and contains a well formed osd
file system and runs the osd daemon on it. The osd daemon fails and dies
because its key does not match. It will try again and fail in the same way when
the machine reboots.
If the osd id was not reused, the disk would find its way back in the cluster
and be reused without manual intervention. Since ceph-disk uses the osd uuid to
create the disk, it does not matter that it has been removed :
https://github.com/ceph/ceph/blob/master/src/ceph-disk#L458 . I'm not sure to
understand how the key that was previously registered is re-imported. If I
understand correctly it is created with ceph-osd --mkkey
https://github.com/ceph/ceph/blob/master/src/ceph-disk#L1301 and stored in the
osd tree at the location specified by --keyring
https://github.com/ceph/ceph/blob/master/src/ceph-disk#L1307 .
At this point my understanding is that (as long as OSD ids are not reused),
removing a disk or moving it from machine to machine, even over a long period
of time does not require any action. The OSD id is an int which is probably
large enough for any kind of cluster in the near future. The OSD ids that are
not used and not removed could be cleaned from time to time to garbage collect
the space they use.
Please let me know if I've missed something :-)
--
Loïc Dachary, Artisan Logiciel Libre
All that is necessary for the triumph of evil is that good people do nothing.
signature.asc
Description: OpenPGP digital signature
