Our cluster has a little over 100 RBDs. Each RBD is snapshotted with a typical
"frequently", hourly, daily, monthly type of schedule.
A while back a 4th monitor was temporarily added to the cluster that took hours
to synchronize with the other 3.
While trying to figure out why that addition took so long, we discovered that
our monitors have what seems like a really large number of osd_snap keys:
​ceph-monstore-tool /var/lib/ceph/mon/xxxxxx dump-keys |awk '{print $1}'|uniq -c
153 auth
2 config
10 health
1441 logm
3 mdsmap
313 mgr
1 mgr_command_descs
3 mgr_metadata
163 mgrstat
1 mkfs
323 mon_config_key
1 mon_sync
6 monitor
1 monitor_store
32 monmap
120 osd_metadata
1 osd_pg_creating
5818618 osd_snap
41338 osdmap
754 paxos
A few questions:
Could this be the cause of the slow addition/synchronization?
Is what looks like an unbounded number of osd_snaps expected?
If trimming/compacting them would help, how would one do that?
Thanks,
Brian
________________________________
The materials in this message are private and may contain Protected Healthcare
Information or other information of a sensitive nature. If you are not the
intended recipient, be advised that any unauthorized use, disclosure, copying
or the taking of any action in reliance on the contents of this information is
strictly prohibited. If you have received this email in error, please
immediately notify the sender via telephone or return mail.
_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]