On Fri, Oct 14, 2016 at 07:07:55AM +, Eric Robinson wrote:
> > > > Oct 12 06:56:11 ha14a kernel: block drbd1: Began resync as SyncTarget (will sync 0 KB [0 bits set]).
> > > > Oct 12 06:56:12 ha14a kernel: d-con ha02_mysql: PingAck did not arrive in time.
> > > > Oct 12
> -----Original Message-----
> From: Viktor Villafuerte [mailto:viktor.villafue...@optusnet.com.au]
> Sent: Wednesday, October 12, 2016 3:19 PM
> To: Eric Robinson <eric.robin...@psmnv.com>
> Cc: drbd-user@lists.linbit.com
> Subject: Re: [DRBD-user] DRBD constantly re
> -----Original Message-----
> From: drbd-user-boun...@lists.linbit.com [mailto:drbd-user-boun...@lists.linbit.com] On Behalf Of Lars Ellenberg
> Sent: Wednesday, October 12, 2016 11:49 PM
> To: drbd-user@lists.linbit.com
> Subject: Re: [DRBD-user] DRBD constantly re-synci
On Wed, Oct 12, 2016 at 04:35:58PM +0200, Jan Schermer wrote:
> Shot in the dark - are the drives (or their controller if you're
> using RAID) using any form of caching? It is conceivable that when
> resync is finished it tries flushing the data to the device, and if
> this takes way too long
Hi Eric,
I've had the pleasure to deal with this exact issue, and in prod too :O
On Wed 12 Oct 2016 14:04:48, Eric Robinson wrote:
> This morning we are seeing an issue where drbd is repeatedly resyncing,
> getting to 100%, and starting over, and never getting to an UpToDate/UpToDate
> state.
Shot in the dark - are the drives (or their controller if you're using RAID)
using any form of caching? It is conceivable that when the resync finishes it
tries to flush the data to the device, and if this takes way too long it
could lead to a timeout of the drbd kernel thread.
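
One rough way to check whether the resyncs line up with timeouts is to count both kinds of events in the kernel log. The following is only a minimal sketch in Python: the function name count_drbd_events is made up for illustration, the two message fragments are copied from the log excerpt quoted in this thread, and other DRBD versions may word these messages differently.

```python
import re
from collections import Counter

# Message fragments copied from the kernel log excerpt quoted in this thread.
# Assumption: other DRBD versions may word these messages differently.
PATTERNS = {
    "resync_started": re.compile(r"Began resync as SyncTarget"),
    "pingack_timeout": re.compile(r"PingAck did not arrive in time"),
}


def count_drbd_events(lines):
    """Count resync starts and PingAck timeouts in an iterable of log lines.

    Hypothetical helper for illustration; not part of any DRBD tooling.
    """
    counts = Counter()
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


# The two log lines quoted earlier in the thread, used as sample input.
sample = [
    "Oct 12 06:56:11 ha14a kernel: block drbd1: Began resync as SyncTarget "
    "(will sync 0 KB [0 bits set]).",
    "Oct 12 06:56:12 ha14a kernel: d-con ha02_mysql: PingAck did not arrive in time.",
]

counts = count_drbd_events(sample)
print(counts["resync_started"], counts["pingack_timeout"])
```

Feeding it the lines of the real kernel log instead of the sample would show whether the number of resync starts roughly tracks the number of PingAck timeouts. That would fit the timeout guess, but would not prove it.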
Is IO happening