I guess I finally got it ;-) We only get slow response for a few objects where 
we
received an epoch mismatch – most request are directed to a living node which
already have the object?

That still means that IO on the KVM side is extremely slow during (2h) recovery?

From: Liu Yuan 
[mailto:namei.u...@gmail.com]<mailto:[mailto:namei.u...@gmail.com]>
Sent: Freitag, 20. Juli 2012 16:18
To: Dietmar Maurer
Cc: sheepdog@lists.wpkg.org<mailto:sheepdog@lists.wpkg.org>; Chris Webb
Subject: RE: [sheepdog] [PATCH] sheep: add a kill node operation


nope, at most dozens of sec as I observed
在 2012-7-20 PM10:04,"Dietmar Maurer" 
<diet...@proxmox.com<mailto:diet...@proxmox.com>>写道:
> > Let's assume a complete recovery takes about 2 hours. Does that mean
> > my VMs are blocked for 2 hours (instead of continue operation on other
> nodes)?
> >
>
> This is actually why we spend lots of lines in recovery and IO patch, there 
> are
> some mechanism, such as request retry, oid scheduling that tries to complete
> any request in a very short period because IOs from VM are timeouted by
> guest kernel, for e.g,
> 120 seconds for Linux kernel.

So if a VM access such object, it would block for 2 hours (confused)?
-- 
sheepdog mailing list
sheepdog@lists.wpkg.org
http://lists.wpkg.org/mailman/listinfo/sheepdog

Reply via email to