Hello,

On 12/20/2011 02:47 PM, Ulrich Windl wrote:
> Hi!
> 
> I have a dual-primary DRBD that is not working well: It was working, then I 
> shut it down and restarted it. DRBD complained about split brain and fenced 
> the other node. When coming up, the other node fenced this node. IMHO no node 
> should have fenced each other.
> 

no config from drbd, no cluster config, partial/filtered logs ...
fragments ... you have _all_ information and can't find the problem ...
sorry, but I can't see how anyone can help you based on that information.

I personally think it is part of the free community support deal to
share as much information as possible if one wants help for free.

Regards,
Andreas

-- 
Need help with Pacemaker or DRBD?
http://www.hastexo.com/now

> Here are the logs from both nodes, restricted to DRBD:
> 
> Dec 20 14:22:01 h06 kernel: [339936.743323] block drbd0: Starting worker 
> thread (from cqueue [13353])
> Dec 20 14:22:01 h06 kernel: [339936.743452] block drbd0: disk( Diskless -> 
> Attaching )
> Dec 20 14:22:01 h06 kernel: [339936.767174] block drbd0: Found 4 transactions 
> (6 active extents) in activity log.
> Dec 20 14:22:01 h06 kernel: [339936.767178] block drbd0: Method to ensure 
> write ordering: barrier
> Dec 20 14:22:01 h06 kernel: [339936.767185] block drbd0: drbd_bm_resize 
> called with capacity == 1048472
> Dec 20 14:22:01 h06 kernel: [339936.767194] block drbd0: resync bitmap: 
> bits=131059 words=2048 pages=4
> Dec 20 14:22:01 h06 kernel: [339936.767197] block drbd0: size = 512 MB 
> (524236 KB)
> Dec 20 14:22:01 h06 kernel: [339936.773015] block drbd0: bitmap READ of 4 
> pages took 2 jiffies
> Dec 20 14:22:01 h06 kernel: [339936.773032] block drbd0: recounting of set 
> bits took additional 0 jiffies
> Dec 20 14:22:01 h06 kernel: [339936.773035] block drbd0: 0 KB (0 bits) marked 
> out-of-sync by on disk bit-map.
> Dec 20 14:22:01 h06 kernel: [339936.773041] block drbd0: disk( Attaching -> 
> UpToDate )
> Dec 20 14:22:01 h06 kernel: [339936.773045] block drbd0: attached to UUIDs 
> 8344B9D0C389D2DC:0000000000000000:902F198E803AB8E3:902E198E803AB8E3
> Dec 20 14:22:01 h06 kernel: [339936.795343] block drbd0: conn( StandAlone -> 
> Unconnected )
> Dec 20 14:22:01 h06 kernel: [339936.795395] block drbd0: Starting receiver 
> thread (from drbd0_worker [10322])
> Dec 20 14:22:01 h06 kernel: [339936.795452] block drbd0: receiver (re)started
> Dec 20 14:22:01 h06 kernel: [339936.795458] block drbd0: conn( Unconnected -> 
> WFConnection )
> Dec 20 14:22:02 h06 kernel: [339937.490329] block drbd0: role( Secondary -> 
> Primary )
> Dec 20 14:22:02 h06 kernel: [339937.490583] block drbd0: new current UUID 
> B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3
> Dec 20 14:22:02 h06 multipathd: drbd0: update path write_protect to '0' 
> (uevent)
> Dec 20 14:22:02 h06 kernel: [339937.537270] block drbd0: Handshake 
> successful: Agreed network protocol version 96
> Dec 20 14:22:02 h06 kernel: [339937.537278] block drbd0: conn( WFConnection 
> -> WFReportParams )
> Dec 20 14:22:02 h06 kernel: [339937.537335] block drbd0: Starting asender 
> thread (from drbd0_receiver [10344])
> Dec 20 14:22:02 h06 kernel: [339937.537725] block drbd0: data-integrity-alg: 
> <not-used>
> Dec 20 14:22:02 h06 kernel: [339937.543391] block drbd0: drbd_sync_handshake:
> Dec 20 14:22:02 h06 kernel: [339937.543394] block drbd0: self 
> B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 
> flags:
> 0
> Dec 20 14:22:02 h06 kernel: [339937.543397] block drbd0: peer 
> 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:0 
> flags:
> 0
> Dec 20 14:22:02 h06 kernel: [339937.543399] block drbd0: uuid_compare()=100 
> by rule 90
> Dec 20 14:22:02 h06 kernel: [339937.543403] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0
> Dec 20 14:22:02 h06 kernel: [339937.546011] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:22:02 h06 kernel: [339937.546015] block drbd0: Split-Brain detected 
> but unresolved, dropping connection!
> Dec 20 14:22:02 h06 kernel: [339937.546018] block drbd0: helper command: 
> /sbin/drbdadm split-brain minor-0
> Dec 20 14:22:02 h06 kernel: [339937.551050] block drbd0: meta connection shut 
> down by peer.
> Dec 20 14:22:02 h06 kernel: [339937.551056] block drbd0: conn( WFReportParams 
> -> NetworkFailure )
> Dec 20 14:22:02 h06 kernel: [339937.551065] block drbd0: asender terminated
> Dec 20 14:22:02 h06 kernel: [339937.551067] block drbd0: Terminating asender 
> thread
> Dec 20 14:22:02 h06 kernel: [339937.586136] block drbd0: helper command: 
> /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:22:02 h06 kernel: [339937.586146] block drbd0: conn( NetworkFailure 
> -> Disconnecting )
> Dec 20 14:22:02 h06 kernel: [339937.586152] block drbd0: error receiving 
> ReportState, l: 4!
> Dec 20 14:22:02 h06 kernel: [339937.586211] block drbd0: Connection closed
> Dec 20 14:22:02 h06 kernel: [339937.586217] block drbd0: conn( Disconnecting 
> -> StandAlone )
> Dec 20 14:22:02 h06 kernel: [339937.586379] block drbd0: receiver terminated
> Dec 20 14:22:02 h06 kernel: [339937.586381] block drbd0: Terminating receiver 
> thread
> 
> ### Node was fenced by other node ####
> 
> Dec 20 14:32:29 h06 kernel: [   82.661133] drbd: initialized. Version: 8.3.11 
> (api:88/proto:86-96)
> Dec 20 14:32:29 h06 kernel: [   82.661137] drbd: GIT-hash: 
> 0de839cee13a4160eed6037c4bddd066645e23c5 build by phil@fat-tyre, 2011-06-29 
> 11:37:11
> Dec 20 14:32:29 h06 kernel: [   82.661140] drbd: registered as block device 
> major 147
> Dec 20 14:32:29 h06 kernel: [   82.661142] drbd: minor_table @ 
> 0xffff881fe93aec80
> Dec 20 14:32:29 h06 multipathd: drbd0: add path (uevent)
> Dec 20 14:32:29 h06 multipathd: drbd0: failed to get path uid
> Dec 20 14:32:30 h06 kernel: [   83.726463] block drbd0: Starting worker 
> thread (from cqueue [13371])
> Dec 20 14:32:30 h06 kernel: [   83.726763] block drbd0: disk( Diskless -> 
> Attaching )
> Dec 20 14:32:30 h06 kernel: [   83.743970] block drbd0: Found 4 transactions 
> (6 active extents) in activity log.
> Dec 20 14:32:30 h06 kernel: [   83.743974] block drbd0: Method to ensure 
> write ordering: barrier
> Dec 20 14:32:30 h06 kernel: [   83.743977] block drbd0: max BIO size = 131072
> Dec 20 14:32:30 h06 kernel: [   83.743983] block drbd0: drbd_bm_resize called 
> with capacity == 1048472
> Dec 20 14:32:30 h06 kernel: [   83.743992] block drbd0: resync bitmap: 
> bits=131059 words=2048 pages=4
> Dec 20 14:32:30 h06 kernel: [   83.743995] block drbd0: size = 512 MB (524236 
> KB)
> Dec 20 14:32:30 h06 kernel: [   83.749884] block drbd0: bitmap READ of 4 
> pages took 2 jiffies
> Dec 20 14:32:30 h06 kernel: [   83.749902] block drbd0: recounting of set 
> bits took additional 0 jiffies
> Dec 20 14:32:30 h06 kernel: [   83.749905] block drbd0: 0 KB (0 bits) marked 
> out-of-sync by on disk bit-map.
> Dec 20 14:32:30 h06 kernel: [   83.749910] block drbd0: Marked additional 16 
> MB as out-of-sync based on AL.
> Dec 20 14:32:30 h06 kernel: [   83.749913] block drbd0: bitmap WRITE of 0 
> pages took 0 jiffies
> Dec 20 14:32:30 h06 kernel: [   83.750053] block drbd0: 16 MB (4096 bits) 
> marked out-of-sync by on disk bit-map.
> Dec 20 14:32:30 h06 kernel: [   83.750060] block drbd0: disk( Attaching -> 
> UpToDate )
> Dec 20 14:32:30 h06 kernel: [   83.750063] block drbd0: attached to UUIDs 
> B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3
> Dec 20 14:32:30 h06 kernel: [   83.768968] block drbd0: conn( StandAlone -> 
> Unconnected )
> Dec 20 14:32:30 h06 kernel: [   83.768987] block drbd0: Starting receiver 
> thread (from drbd0_worker [13662])
> Dec 20 14:32:30 h06 kernel: [   83.769144] block drbd0: receiver (re)started
> Dec 20 14:32:30 h06 kernel: [   83.769154] block drbd0: conn( Unconnected -> 
> WFConnection )
> Dec 20 14:32:31 h06 kernel: [   84.268286] block drbd0: Handshake successful: 
> Agreed network protocol version 96
> Dec 20 14:32:31 h06 kernel: [   84.268297] block drbd0: conn( WFConnection -> 
> WFReportParams )
> Dec 20 14:32:31 h06 kernel: [   84.268425] block drbd0: Starting asender 
> thread (from drbd0_receiver [13681])
> Dec 20 14:32:31 h06 kernel: [   84.268636] block drbd0: data-integrity-alg: 
> <not-used>
> Dec 20 14:32:31 h06 kernel: [   84.268828] block drbd0: drbd_sync_handshake:
> Dec 20 14:32:31 h06 kernel: [   84.268833] block drbd0: self 
> B95131C56A7C2934:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:4096 
> flag
> s:0
> Dec 20 14:32:31 h06 kernel: [   84.268837] block drbd0: peer 
> 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:8 
> flags:0
> Dec 20 14:32:31 h06 kernel: [   84.268841] block drbd0: uuid_compare()=100 by 
> rule 90
> Dec 20 14:32:31 h06 kernel: [   84.268846] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0
> Dec 20 14:32:31 h06 kernel: [   84.271076] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:32:31 h06 kernel: [   84.271081] block drbd0: Split-Brain detected, 
> 1 primaries, automatically solved. Sync from peer node
> Dec 20 14:32:31 h06 kernel: [   84.271089] block drbd0: peer( Unknown -> 
> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) 
> pdsk( DUnknown -> UpToDate)
> 
> ### other node was fenced ####
> 
> Dec 20 14:32:41 h06 kernel: [   94.768068] block drbd0: PingAck did not 
> arrive in time.
> Dec 20 14:32:41 h06 kernel: [   94.768078] block drbd0: peer( Primary -> 
> Unknown ) conn( WFBitMapT -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
> Dec 20 14:32:41 h06 kernel: [   94.768161] block drbd0: bitmap WRITE of 0 
> pages took 0 jiffies
> Dec 20 14:32:41 h06 kernel: [   94.768379] block drbd0: asender terminated
> Dec 20 14:32:41 h06 kernel: [   94.768384] block drbd0: 16 MB (4096 bits) 
> marked out-of-sync by on disk bit-map.
> Dec 20 14:32:41 h06 kernel: [   94.768394] block drbd0: Terminating asender 
> thread
> Dec 20 14:32:41 h06 kernel: [   94.768616] block drbd0: Connection closed
> 
> 
> The other node:
> Dec 20 14:22:01 h02 kernel: [340315.613659] block drbd0: Starting worker 
> thread (from cqueue [13627])
> Dec 20 14:22:01 h02 kernel: [340315.614016] block drbd0: disk( Diskless -> 
> Attaching )
> Dec 20 14:22:01 h02 kernel: [340315.681269] block drbd0: Found 4 transactions 
> (55 active extents) in activity log.
> Dec 20 14:22:01 h02 kernel: [340315.681273] block drbd0: Method to ensure 
> write ordering: barrier
> Dec 20 14:22:01 h02 kernel: [340315.681281] block drbd0: drbd_bm_resize 
> called with capacity == 1048472
> Dec 20 14:22:01 h02 kernel: [340315.681289] block drbd0: resync bitmap: 
> bits=131059 words=2048 pages=4
> Dec 20 14:22:01 h02 kernel: [340315.681292] block drbd0: size = 512 MB 
> (524236 KB)
> Dec 20 14:22:01 h02 kernel: [340315.687209] block drbd0: bitmap READ of 4 
> pages took 1 jiffies
> Dec 20 14:22:01 h02 kernel: [340315.687227] block drbd0: recounting of set 
> bits took additional 0 jiffies
> Dec 20 14:22:01 h02 kernel: [340315.687230] block drbd0: 0 KB (0 bits) marked 
> out-of-sync by on disk bit-map.
> Dec 20 14:22:01 h02 kernel: [340315.687235] block drbd0: disk( Attaching -> 
> UpToDate )
> Dec 20 14:22:01 h02 kernel: [340315.687239] block drbd0: attached to UUIDs 
> 8344B9D0C389D2DC:0000000000000000:902F198E803AB8E2:902E198E803AB8E3
> Dec 20 14:22:01 h02 kernel: [340315.703404] block drbd0: conn( StandAlone -> 
> Unconnected )
> Dec 20 14:22:01 h02 kernel: [340315.703448] block drbd0: Starting receiver 
> thread (from drbd0_worker [27349])
> Dec 20 14:22:01 h02 kernel: [340315.703535] block drbd0: receiver (re)started
> Dec 20 14:22:01 h02 kernel: [340315.703543] block drbd0: conn( Unconnected -> 
> WFConnection )
> Dec 20 14:22:02 h02 kernel: [340316.360240] block drbd0: role( Secondary -> 
> Primary )
> Dec 20 14:22:02 h02 kernel: [340316.360532] block drbd0: new current UUID 
> 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3
> Dec 20 14:22:02 h02 kernel: [340316.405852] block drbd0: Handshake 
> successful: Agreed network protocol version 96
> Dec 20 14:22:02 h02 kernel: [340316.405859] block drbd0: conn( WFConnection 
> -> WFReportParams )
> Dec 20 14:22:02 h02 kernel: [340316.406014] block drbd0: Starting asender 
> thread (from drbd0_receiver [27371])
> Dec 20 14:22:02 h02 kernel: [340316.406153] block drbd0: data-integrity-alg: 
> <not-used>
> Dec 20 14:22:02 h02 kernel: [340316.409149] block drbd0: drbd_sync_handshake:
> Dec 20 14:22:02 h02 kernel: [340316.409153] block drbd0: self 
> 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:0 
> flags:0
> Dec 20 14:22:02 h02 kernel: [340316.409157] block drbd0: peer 
> B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 
> flags:0
> Dec 20 14:22:02 h02 kernel: [340316.409161] block drbd0: uuid_compare()=100 
> by rule 90
> Dec 20 14:22:02 h02 kernel: [340316.409166] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0
> Dec 20 14:22:02 h02 kernel: [340316.411198] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:22:02 h02 kernel: [340316.411202] block drbd0: Split-Brain detected 
> but unresolved, dropping connection!
> Dec 20 14:22:02 h02 kernel: [340316.411206] block drbd0: helper command: 
> /sbin/drbdadm split-brain minor-0
> Dec 20 14:22:02 h02 multipathd: drbd0: update path write_protect to '0' 
> (uevent)
> Dec 20 14:22:02 h02 kernel: [340316.419274] block drbd0: helper command: 
> /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:22:02 h02 kernel: [340316.419280] block drbd0: conn( WFReportParams 
> -> Disconnecting )
> 
> 
> ### other node was fenced ###
> 
> Dec 20 14:22:02 h02 kernel: [340316.419287] block drbd0: error receiving 
> ReportState, l: 4!
> Dec 20 14:22:02 h02 kernel: [340316.419390] block drbd0: asender terminated
> Dec 20 14:22:02 h02 kernel: [340316.419396] block drbd0: Terminating asender 
> thread
> Dec 20 14:22:02 h02 kernel: [340316.419487] block drbd0: Connection closed
> Dec 20 14:22:02 h02 kernel: [340316.419493] block drbd0: conn( Disconnecting 
> -> StandAlone )
> Dec 20 14:22:02 h02 kernel: [340316.419510] block drbd0: receiver terminated
> Dec 20 14:22:02 h02 kernel: [340316.419512] block drbd0: Terminating receiver 
> thread
> Dec 20 14:32:21 h02 kernel: [340935.071481] block drbd0: conn( StandAlone -> 
> Unconnected )
> Dec 20 14:32:21 h02 kernel: [340935.071526] block drbd0: Starting receiver 
> thread (from drbd0_worker [27349])
> Dec 20 14:32:21 h02 kernel: [340935.071679] block drbd0: receiver (re)started
> Dec 20 14:32:21 h02 kernel: [340935.071687] block drbd0: conn( Unconnected -> 
> WFConnection )
> 
> ### this node was fenced ###
> 
> Dec 20 14:38:51 h02 kernel: [   81.985090] drbd: initialized. Version: 8.3.11 
> (api:88/proto:86-96)
> Dec 20 14:38:51 h02 kernel: [   81.985094] drbd: GIT-hash: 
> 0de839cee13a4160eed6037c4bddd066645e23c5 build by phil@fat-tyre, 2011-06-29 
> 11:37:11
> Dec 20 14:38:51 h02 kernel: [   81.985096] drbd: registered as block device 
> major 147
> Dec 20 14:38:51 h02 kernel: [   81.985098] drbd: minor_table @ 
> 0xffff881fe87af180
> Dec 20 14:38:51 h02 multipathd: drbd0: add path (uevent)
> Dec 20 14:38:51 h02 multipathd: drbd0: failed to get path uid
> Dec 20 14:38:52 h02 kernel: [   83.045961] block drbd0: Starting worker 
> thread (from cqueue [13576])
> Dec 20 14:38:52 h02 kernel: [   83.046260] block drbd0: disk( Diskless -> 
> Attaching )
> Dec 20 14:38:52 h02 kernel: [   83.064193] block drbd0: Found 4 transactions 
> (55 active extents) in activity log.
> Dec 20 14:38:52 h02 kernel: [   83.064197] block drbd0: Method to ensure 
> write ordering: barrier
> Dec 20 14:38:52 h02 kernel: [   83.064200] block drbd0: max BIO size = 131072
> Dec 20 14:38:52 h02 kernel: [   83.064206] block drbd0: drbd_bm_resize called 
> with capacity == 1048472
> Dec 20 14:38:52 h02 kernel: [   83.064215] block drbd0: resync bitmap: 
> bits=131059 words=2048 pages=4
> Dec 20 14:38:52 h02 kernel: [   83.064218] block drbd0: size = 512 MB (524236 
> KB)
> Dec 20 14:38:52 h02 kernel: [   83.070075] block drbd0: bitmap READ of 4 
> pages took 1 jiffies
> Dec 20 14:38:52 h02 kernel: [   83.070093] block drbd0: recounting of set 
> bits took additional 0 jiffies
> Dec 20 14:38:52 h02 kernel: [   83.070096] block drbd0: 0 KB (0 bits) marked 
> out-of-sync by on disk bit-map.
> Dec 20 14:38:52 h02 kernel: [   83.070109] block drbd0: Marked additional 192 
> MB as out-of-sync based on AL.
> Dec 20 14:38:52 h02 kernel: [   83.070112] block drbd0: bitmap WRITE of 0 
> pages took 0 jiffies
> Dec 20 14:38:52 h02 kernel: [   83.070239] block drbd0: 192 MB (49139 bits) 
> marked out-of-sync by on disk bit-map.
> Dec 20 14:38:52 h02 kernel: [   83.070245] block drbd0: disk( Attaching -> 
> UpToDate )
> Dec 20 14:38:52 h02 kernel: [   83.070249] block drbd0: attached to UUIDs 
> 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3
> Dec 20 14:38:52 h02 kernel: [   83.088979] block drbd0: conn( StandAlone -> 
> Unconnected )
> Dec 20 14:38:52 h02 kernel: [   83.088995] block drbd0: Starting receiver 
> thread (from drbd0_worker [13878])
> Dec 20 14:38:52 h02 kernel: [   83.089044] block drbd0: receiver (re)started
> Dec 20 14:38:52 h02 kernel: [   83.089053] block drbd0: conn( Unconnected -> 
> WFConnection )
> Dec 20 14:38:53 h02 kernel: [   83.588830] block drbd0: Handshake successful: 
> Agreed network protocol version 96
> Dec 20 14:38:53 h02 kernel: [   83.588842] block drbd0: conn( WFConnection -> 
> WFReportParams )
> Dec 20 14:38:53 h02 kernel: [   83.588972] block drbd0: Starting asender 
> thread (from drbd0_receiver [13901])
> Dec 20 14:38:53 h02 kernel: [   83.589219] block drbd0: data-integrity-alg: 
> <not-used>
> Dec 20 14:38:53 h02 kernel: [   83.589447] block drbd0: drbd_sync_handshake:
> Dec 20 14:38:53 h02 kernel: [   83.589452] block drbd0: self 
> 3778E40F06BD4778:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 
> bits:49139 flags:0
> Dec 20 14:38:53 h02 kernel: [   83.589456] block drbd0: peer 
> B95131C56A7C2934:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 
> flags:2
> Dec 20 14:38:53 h02 kernel: [   83.589459] block drbd0: uuid_compare()=100 by 
> rule 90
> Dec 20 14:38:53 h02 kernel: [   83.589465] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0
> Dec 20 14:38:53 h02 kernel: [   83.591700] block drbd0: helper command: 
> /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0)
> Dec 20 14:38:53 h02 kernel: [   83.591706] block drbd0: Split-Brain detected, 
> 0 primaries, automatically solved. Sync from this node
> Dec 20 14:38:53 h02 kernel: [   83.591714] block drbd0: peer( Unknown -> 
> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent )
> Dec 20 14:38:53 h02 kernel: [   83.593414] block drbd0: helper command: 
> /sbin/drbdadm before-resync-source minor-0
> Dec 20 14:38:53 h02 kernel: [   83.595099] block drbd0: helper command: 
> /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0)
> Dec 20 14:38:53 h02 kernel: [   83.595106] block drbd0: conn( WFBitMapS -> 
> SyncSource ) pdsk( Consistent -> Inconsistent )
> Dec 20 14:38:53 h02 kernel: [   83.595113] block drbd0: Began resync as 
> SyncSource (will sync 196556 KB [49139 bits set]).
> Dec 20 14:38:53 h02 kernel: [   83.595222] block drbd0: updated sync UUID 
> 3778E40F06BD4778:8345B9D0C389D2DC:8344B9D0C389D2DC:902F198E803AB8E2
> Dec 20 14:39:14 h02 kernel: [  104.848088] block drbd0: role( Secondary -> 
> Primary )
> Dec 20 14:39:14 h02 multipathd: drbd0: update path write_protect to '0' 
> (uevent)
> Dec 20 14:39:18 h02 kernel: [  109.167582] block drbd0: Resync done (total 25 
> sec; paused 0 sec; 7860 K/sec)
> Dec 20 14:39:18 h02 kernel: [  109.167588] block drbd0: updated UUIDs 
> 3778E40F06BD4779:0000000000000000:8345B9D0C389D2DC:8344B9D0C389D2DC
> Dec 20 14:39:18 h02 kernel: [  109.167596] block drbd0: conn( SyncSource -> 
> Connected ) pdsk( Inconsistent -> UpToDate )
> Dec 20 14:39:18 h02 kernel: [  109.167722] block drbd0: bitmap WRITE of 0 
> pages took 0 jiffies
> Dec 20 14:39:18 h02 kernel: [  109.167738] block drbd0: 0 KB (0 bits) marked 
> out-of-sync by on disk bit-map.
> Dec 20 14:39:44 h02 kernel: [  135.213231] block drbd0: peer( Secondary -> 
> Primary )
> 
> I doubt that those "split brains were caused by a network connection problem 
> at that times. What might be wrong? I always thought DRBD had production 
> quality. Ist it a problem with the Cluster RAs?
> 
> The version used is this (SLES11 SP1):
> # rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
> drbd-pacemaker-8.3.11-0.3.1
> 
> 
> Regards,
> Ulrich
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems


Attachment: signature.asc
Description: OpenPGP digital signature

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to