Re: [ceph-users] How to remove /var/lib/ceph/osd/ceph-2?

Craig Lewis Mon, 24 Jun 2013 14:24:15 -0700

I also have problems keeping time in sync on VMware virtual machines. The problems occur mostly when the VM host is oversubscribed, or when I'm doing stress tests. I ended up disabling ntpd in the guests and enabling Host Time Sync using the VMware Guest Tools. All of my VMware hosts run ntpd, using the same NTP servers.

That's my development cluster. For production, I'm using ntpd on real servers.




*Craig Lewis*
Senior Systems Engineer
Office +1.714.602.1309
Email [email protected] <mailto:[email protected]>



On 6/18/13 05:41 , Da Chun wrote:

Thanks! Craig.
umount works.
About the time skew, I saw the log said the time difference should be less than 50 ms. I set up one of my nodes as the time server, and the others sync their time with it. I don't know why the system time still changes frequently, especially after a reboot. Maybe it's because all my nodes are VMware virtual machines. The soft clock is not accurate enough.
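To make the 50 ms figure concrete, here is a minimal sketch of a threshold check. The function name `skew_ok` and the sample offsets are hypothetical; in practice the offset would come from whatever reports it on your nodes, such as the offset column of `ntpq -p`, which is given in milliseconds.

```shell
#!/bin/sh
# Hypothetical helper: succeed if an offset (in milliseconds, possibly
# negative or fractional) is strictly below a limit. The 50 ms default is
# the figure quoted from the log in this thread.
skew_ok() {
    offset_ms=$1
    limit_ms=${2:-50}
    # awk handles the floating-point comparison; the exit status is 0
    # when |offset| < limit and 1 otherwise.
    awk -v o="$offset_ms" -v l="$limit_ms" 'BEGIN {
        if (o < 0) o = -o
        exit !(o < l)
    }'
}

# Sample offsets, made up for illustration only.
for sample in 3.2 -12.5 75; do
    if skew_ok "$sample"; then
        echo "$sample ms: within limit"
    else
        echo "$sample ms: too much skew"
    fi
done
```

A second argument overrides the limit, e.g. `skew_ok 75 100`.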
------------------ Original ------------------
*From: * "Craig Lewis"<[email protected]>;
*Date: * Tue, Jun 18, 2013 05:34 AM
*To: * "ceph-users"<[email protected]>;
*Subject: * Re: [ceph-users] How to remove /var/lib/ceph/osd/ceph-2?
If you followed the standard setup, each OSD is its own disk + filesystem. /var/lib/ceph/osd/ceph-2 is in use, as the mount point for the OSD.2 filesystem. Double check by examining the output of the `mount` command.
I get the same error when I try to rename a directory that's used as a mount point.
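As a rough sketch of that double check, the following looks the directory up in the kernel's mount table instead of reading the `mount` output by eye. The function name `is_mount_point` is hypothetical, and the second argument exists only so the function can be tried against a sample file; by default it reads /proc/mounts, which assumes Linux.

```shell
#!/bin/sh
# Hypothetical helper: succeed if DIR is listed as a mount point in a
# mounts table. TABLE defaults to /proc/mounts (Linux).
is_mount_point() {
    dir=$1
    table=${2:-/proc/mounts}
    [ -r "$table" ] || return 2
    # Field 2 of each line in the mounts table is the mount point.
    awk -v d="$dir" '$2 == d { found = 1 } END { exit !found }' "$table"
}

OSD_DIR=/var/lib/ceph/osd/ceph-2
if is_mount_point "$OSD_DIR"; then
    echo "$OSD_DIR is a mount point"
else
    echo "$OSD_DIR is not a mount point (or the mounts table is unreadable)"
fi
```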
Try `umount /var/lib/ceph/osd/ceph-2` instead of the mv and rm. The fuser command is telling you that the kernel has a filesystem mounted in that directory. Nothing else appears to be using it, so the umount should complete successfully.
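Putting the pieces together, here is a dry-run sketch that only prints the commands from this thread for a given OSD id, with the umount placed before the directory is removed. The function name `osd_removal_steps` is hypothetical, and the ordering is an assumption based on the steps quoted below; nothing here touches the cluster.

```shell
#!/bin/sh
# Print, without running, the removal commands used in this thread for
# one OSD id. The umount comes after the daemon is stopped and before
# the now-empty mount point directory is removed.
osd_removal_steps() {
    id=$1
    dir=/var/lib/ceph/osd/ceph-$id
    cat <<EOF
ceph osd out osd.$id
stop ceph-osd id=$id
ceph osd crush remove osd.$id
ceph auth del osd.$id
ceph osd rm $id
umount $dir
rm -r $dir
EOF
}

osd_removal_steps 2
```

Review the printed list and run the commands by hand, as root, on the node that hosts the OSD.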
Also, you should fix that time skew on mon.ceph-node5. The mailing list archives have several posts with good answers.
On 6/15/2013 2:14 AM, Da Chun wrote:
Hi all,
On Ubuntu 13.04 with ceph 0.61.3.
I want to remove osd.2 from my cluster. The following steps were performed:
root@ceph-node6:~# ceph osd out osd.2
marked out osd.2.
root@ceph-node6:~# ceph -w
   health HEALTH_WARN clock skew detected on mon.ceph-node5
   monmap e1: 3 mons at {ceph-node4=172.18.46.34:6789/0,ceph-node5=172.18.46.35:6789/0,ceph-node6=172.18.46.36:6789/0}, election epoch 124, quorum 0,1,2 ceph-node4,ceph-node5,ceph-node6
   osdmap e414: 6 osds: 5 up, 5 in
   pgmap v10540: 456 pgs: 456 active+clean; 12171 MB data, 24325 MB used, 50360 MB / 74685 MB avail
   mdsmap e102: 1/1/1 up {0=ceph-node4=up:active}
2013-06-15 16:55:22.096059 mon.0 [INF] pgmap v10540: 456 pgs: 456 active+clean; 12171 MB data, 24325 MB used, 50360 MB / 74685 MB avail
^C
root@ceph-node6:~# stop ceph-osd id=2
ceph-osd stop/waiting
root@ceph-node6:~# ceph osd crush remove osd.2
removed item id 2 name 'osd.2' from crush map
root@ceph-node6:~# ceph auth del osd.2
updated
root@ceph-node6:~# ceph osd rm 2
removed osd.2
root@ceph-node6:~# mv /var/lib/ceph/osd/ceph-2 /var/lib/ceph/osd/ceph-2.bak
mv: cannot move ‘/var/lib/ceph/osd/ceph-2’ to ‘/var/lib/ceph/osd/ceph-2.bak’: Device or resource busy
Everything was working OK until the last step to remove the osd.2 directory /var/lib/ceph/osd/ceph-2.
root@ceph-node6:~# fuser -v /var/lib/ceph/osd/ceph-2
                     USER  PID ACCESS COMMAND
/var/lib/ceph/osd/ceph-2:
                     root kernel mount /var/lib/ceph/osd/ceph-2    // What does this mean?
root@ceph-node6:~# lsof +D /var/lib/ceph/osd/ceph-2
root@ceph-node6:~#
I restarted the system, and found that the osd.2 daemon was still running:
root@ceph-node6:~# ps aux | grep osd
root 1264 1.4 12.3 550940 125732 ? Ssl 16:41 0:20 /usr/bin/ceph-osd --cluster=ceph -i 2 -f
root 2876 0.0 0.0 4440 628 ? Ss 16:44 0:00 /bin/sh -e -c /usr/bin/ceph-osd --cluster="${cluster:-ceph}" -i "$id" -f /bin/sh
root 2877 4.9 18.2 613780 185676 ? Sl 16:44 1:04 /usr/bin/ceph-osd --cluster=ceph -i 5 -f
I had to use this workaround:
root@ceph-node6:~# rm -rf /var/lib/ceph/osd/ceph-2
rm: cannot remove ‘/var/lib/ceph/osd/ceph-2’: Device or resource busy
root@ceph-node6:~# ls /var/lib/ceph/osd/ceph-2
root@ceph-node6:~# shutdown -r now
....
root@ceph-node6:~# ps aux | grep osd
root 1416 0.0 0.0 4440 628 ? Ss 17:10 0:00 /bin/sh -e -c /usr/bin/ceph-osd --cluster="${cluster:-ceph}" -i "$id" -f /bin/sh
root 1417 8.9 5.8 468052 59868 ? Sl 17:10 0:02 /usr/bin/ceph-osd --cluster=ceph -i 5 -f
root@ceph-node6:~# rm -r /var/lib/ceph/osd/ceph-2
root@ceph-node6:~#

Any idea? HELP!



_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

