I like the idea to leave ping in cluster network because it can help us detect 
switch/nic failure.

What confuse me is I keep pinging every ceph node's cluster ip,it is OK during 
the whole run with less than 1 ms latency,why the heartbeat still suffer? TOP 
show my cpu not 100% utilized(with >30% io wait).Enabling jumbo frame **seems** 
make things worth.(just feeling.no data supports)

发自我的 iPhone

在 2013-5-14,23:36,"Mark Nelson" <[email protected]> 写道:

> On 05/14/2013 10:30 AM, Sage Weil wrote:
>> On Tue, 14 May 2013, Chen, Xiaoxi wrote:
>>> 
>>> Hi
>>> 
>>>   We are suffering our OSD flipping between up and down ( OSD X be voted to
>>> down due to 3 missing ping, and after a while it tells the monitor ?map xxx
>>> wrongly mark me down? ). Because we are running sequential write performance
>>> test on top of RBDs, and the cluster network nics is really in high
>>> utilization (8Gb/s+ for a 10Gb network).
>>> 
>>>          Is this a expected behavior ? or how can I prevent this happen?
>> 
>> You an increase the heartbeat grace period.  The pings are handled by a
>> separate thread on the backside interface (if there is one).  If you are
>> missing pings then the network or scheduler is preventing those (small)
>> messages from being processed (there is almost no lock contention in that
>> path).  Which means it really is taking ~20 seconds or wahtever to handle
>> those messages.  It's really a questin of how unresponsive you want to
>> permit the OSDs to be before you consider it a failure..
>> 
>> sage
>> 
>> 
> 
> It might be worth testing out how long pings or other network traffic are 
> taking during these tests.  There may be some tcp tunning you can do here, or 
> even consider using a separate network for the mons.
> 
> Mark
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to