Hello, On 12/20/2011 02:47 PM, Ulrich Windl wrote: > Hi! > > I have a dual-primary DRBD that is not working well: It was working, then I > shut it down and restarted it. DRBD complained about split brain and fenced > the other node. When coming up, the other node fenced this node. IMHO no node > should have fenced each other. >
no config from drbd, no cluster config, partial/filtered logs ... fragments ... you have _all_ information and can't find the problem ... sorry, but I can't see how anyone can help you based on that information. I personally think it is part of the free community support deal to share as much information as possible if one wants help for free. Regards, Andreas -- Need help with Pacemaker or DRBD? http://www.hastexo.com/now > Here are the logs from both nodes, restricted to DRBD: > > Dec 20 14:22:01 h06 kernel: [339936.743323] block drbd0: Starting worker > thread (from cqueue [13353]) > Dec 20 14:22:01 h06 kernel: [339936.743452] block drbd0: disk( Diskless -> > Attaching ) > Dec 20 14:22:01 h06 kernel: [339936.767174] block drbd0: Found 4 transactions > (6 active extents) in activity log. > Dec 20 14:22:01 h06 kernel: [339936.767178] block drbd0: Method to ensure > write ordering: barrier > Dec 20 14:22:01 h06 kernel: [339936.767185] block drbd0: drbd_bm_resize > called with capacity == 1048472 > Dec 20 14:22:01 h06 kernel: [339936.767194] block drbd0: resync bitmap: > bits=131059 words=2048 pages=4 > Dec 20 14:22:01 h06 kernel: [339936.767197] block drbd0: size = 512 MB > (524236 KB) > Dec 20 14:22:01 h06 kernel: [339936.773015] block drbd0: bitmap READ of 4 > pages took 2 jiffies > Dec 20 14:22:01 h06 kernel: [339936.773032] block drbd0: recounting of set > bits took additional 0 jiffies > Dec 20 14:22:01 h06 kernel: [339936.773035] block drbd0: 0 KB (0 bits) marked > out-of-sync by on disk bit-map. > Dec 20 14:22:01 h06 kernel: [339936.773041] block drbd0: disk( Attaching -> > UpToDate ) > Dec 20 14:22:01 h06 kernel: [339936.773045] block drbd0: attached to UUIDs > 8344B9D0C389D2DC:0000000000000000:902F198E803AB8E3:902E198E803AB8E3 > Dec 20 14:22:01 h06 kernel: [339936.795343] block drbd0: conn( StandAlone -> > Unconnected ) > Dec 20 14:22:01 h06 kernel: [339936.795395] block drbd0: Starting receiver > thread (from drbd0_worker [10322]) > Dec 20 14:22:01 h06 kernel: [339936.795452] block drbd0: receiver (re)started > Dec 20 14:22:01 h06 kernel: [339936.795458] block drbd0: conn( Unconnected -> > WFConnection ) > Dec 20 14:22:02 h06 kernel: [339937.490329] block drbd0: role( Secondary -> > Primary ) > Dec 20 14:22:02 h06 kernel: [339937.490583] block drbd0: new current UUID > B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 > Dec 20 14:22:02 h06 multipathd: drbd0: update path write_protect to '0' > (uevent) > Dec 20 14:22:02 h06 kernel: [339937.537270] block drbd0: Handshake > successful: Agreed network protocol version 96 > Dec 20 14:22:02 h06 kernel: [339937.537278] block drbd0: conn( WFConnection > -> WFReportParams ) > Dec 20 14:22:02 h06 kernel: [339937.537335] block drbd0: Starting asender > thread (from drbd0_receiver [10344]) > Dec 20 14:22:02 h06 kernel: [339937.537725] block drbd0: data-integrity-alg: > <not-used> > Dec 20 14:22:02 h06 kernel: [339937.543391] block drbd0: drbd_sync_handshake: > Dec 20 14:22:02 h06 kernel: [339937.543394] block drbd0: self > B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 > flags: > 0 > Dec 20 14:22:02 h06 kernel: [339937.543397] block drbd0: peer > 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:0 > flags: > 0 > Dec 20 14:22:02 h06 kernel: [339937.543399] block drbd0: uuid_compare()=100 > by rule 90 > Dec 20 14:22:02 h06 kernel: [339937.543403] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 > Dec 20 14:22:02 h06 kernel: [339937.546011] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0) > Dec 20 14:22:02 h06 kernel: [339937.546015] block drbd0: Split-Brain detected > but unresolved, dropping connection! > Dec 20 14:22:02 h06 kernel: [339937.546018] block drbd0: helper command: > /sbin/drbdadm split-brain minor-0 > Dec 20 14:22:02 h06 kernel: [339937.551050] block drbd0: meta connection shut > down by peer. > Dec 20 14:22:02 h06 kernel: [339937.551056] block drbd0: conn( WFReportParams > -> NetworkFailure ) > Dec 20 14:22:02 h06 kernel: [339937.551065] block drbd0: asender terminated > Dec 20 14:22:02 h06 kernel: [339937.551067] block drbd0: Terminating asender > thread > Dec 20 14:22:02 h06 kernel: [339937.586136] block drbd0: helper command: > /sbin/drbdadm split-brain minor-0 exit code 0 (0x0) > Dec 20 14:22:02 h06 kernel: [339937.586146] block drbd0: conn( NetworkFailure > -> Disconnecting ) > Dec 20 14:22:02 h06 kernel: [339937.586152] block drbd0: error receiving > ReportState, l: 4! > Dec 20 14:22:02 h06 kernel: [339937.586211] block drbd0: Connection closed > Dec 20 14:22:02 h06 kernel: [339937.586217] block drbd0: conn( Disconnecting > -> StandAlone ) > Dec 20 14:22:02 h06 kernel: [339937.586379] block drbd0: receiver terminated > Dec 20 14:22:02 h06 kernel: [339937.586381] block drbd0: Terminating receiver > thread > > ### Node was fenced by other node #### > > Dec 20 14:32:29 h06 kernel: [ 82.661133] drbd: initialized. Version: 8.3.11 > (api:88/proto:86-96) > Dec 20 14:32:29 h06 kernel: [ 82.661137] drbd: GIT-hash: > 0de839cee13a4160eed6037c4bddd066645e23c5 build by phil@fat-tyre, 2011-06-29 > 11:37:11 > Dec 20 14:32:29 h06 kernel: [ 82.661140] drbd: registered as block device > major 147 > Dec 20 14:32:29 h06 kernel: [ 82.661142] drbd: minor_table @ > 0xffff881fe93aec80 > Dec 20 14:32:29 h06 multipathd: drbd0: add path (uevent) > Dec 20 14:32:29 h06 multipathd: drbd0: failed to get path uid > Dec 20 14:32:30 h06 kernel: [ 83.726463] block drbd0: Starting worker > thread (from cqueue [13371]) > Dec 20 14:32:30 h06 kernel: [ 83.726763] block drbd0: disk( Diskless -> > Attaching ) > Dec 20 14:32:30 h06 kernel: [ 83.743970] block drbd0: Found 4 transactions > (6 active extents) in activity log. > Dec 20 14:32:30 h06 kernel: [ 83.743974] block drbd0: Method to ensure > write ordering: barrier > Dec 20 14:32:30 h06 kernel: [ 83.743977] block drbd0: max BIO size = 131072 > Dec 20 14:32:30 h06 kernel: [ 83.743983] block drbd0: drbd_bm_resize called > with capacity == 1048472 > Dec 20 14:32:30 h06 kernel: [ 83.743992] block drbd0: resync bitmap: > bits=131059 words=2048 pages=4 > Dec 20 14:32:30 h06 kernel: [ 83.743995] block drbd0: size = 512 MB (524236 > KB) > Dec 20 14:32:30 h06 kernel: [ 83.749884] block drbd0: bitmap READ of 4 > pages took 2 jiffies > Dec 20 14:32:30 h06 kernel: [ 83.749902] block drbd0: recounting of set > bits took additional 0 jiffies > Dec 20 14:32:30 h06 kernel: [ 83.749905] block drbd0: 0 KB (0 bits) marked > out-of-sync by on disk bit-map. > Dec 20 14:32:30 h06 kernel: [ 83.749910] block drbd0: Marked additional 16 > MB as out-of-sync based on AL. > Dec 20 14:32:30 h06 kernel: [ 83.749913] block drbd0: bitmap WRITE of 0 > pages took 0 jiffies > Dec 20 14:32:30 h06 kernel: [ 83.750053] block drbd0: 16 MB (4096 bits) > marked out-of-sync by on disk bit-map. > Dec 20 14:32:30 h06 kernel: [ 83.750060] block drbd0: disk( Attaching -> > UpToDate ) > Dec 20 14:32:30 h06 kernel: [ 83.750063] block drbd0: attached to UUIDs > B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 > Dec 20 14:32:30 h06 kernel: [ 83.768968] block drbd0: conn( StandAlone -> > Unconnected ) > Dec 20 14:32:30 h06 kernel: [ 83.768987] block drbd0: Starting receiver > thread (from drbd0_worker [13662]) > Dec 20 14:32:30 h06 kernel: [ 83.769144] block drbd0: receiver (re)started > Dec 20 14:32:30 h06 kernel: [ 83.769154] block drbd0: conn( Unconnected -> > WFConnection ) > Dec 20 14:32:31 h06 kernel: [ 84.268286] block drbd0: Handshake successful: > Agreed network protocol version 96 > Dec 20 14:32:31 h06 kernel: [ 84.268297] block drbd0: conn( WFConnection -> > WFReportParams ) > Dec 20 14:32:31 h06 kernel: [ 84.268425] block drbd0: Starting asender > thread (from drbd0_receiver [13681]) > Dec 20 14:32:31 h06 kernel: [ 84.268636] block drbd0: data-integrity-alg: > <not-used> > Dec 20 14:32:31 h06 kernel: [ 84.268828] block drbd0: drbd_sync_handshake: > Dec 20 14:32:31 h06 kernel: [ 84.268833] block drbd0: self > B95131C56A7C2934:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:4096 > flag > s:0 > Dec 20 14:32:31 h06 kernel: [ 84.268837] block drbd0: peer > 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:8 > flags:0 > Dec 20 14:32:31 h06 kernel: [ 84.268841] block drbd0: uuid_compare()=100 by > rule 90 > Dec 20 14:32:31 h06 kernel: [ 84.268846] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 > Dec 20 14:32:31 h06 kernel: [ 84.271076] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0) > Dec 20 14:32:31 h06 kernel: [ 84.271081] block drbd0: Split-Brain detected, > 1 primaries, automatically solved. Sync from peer node > Dec 20 14:32:31 h06 kernel: [ 84.271089] block drbd0: peer( Unknown -> > Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) > pdsk( DUnknown -> UpToDate) > > ### other node was fenced #### > > Dec 20 14:32:41 h06 kernel: [ 94.768068] block drbd0: PingAck did not > arrive in time. > Dec 20 14:32:41 h06 kernel: [ 94.768078] block drbd0: peer( Primary -> > Unknown ) conn( WFBitMapT -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) > Dec 20 14:32:41 h06 kernel: [ 94.768161] block drbd0: bitmap WRITE of 0 > pages took 0 jiffies > Dec 20 14:32:41 h06 kernel: [ 94.768379] block drbd0: asender terminated > Dec 20 14:32:41 h06 kernel: [ 94.768384] block drbd0: 16 MB (4096 bits) > marked out-of-sync by on disk bit-map. > Dec 20 14:32:41 h06 kernel: [ 94.768394] block drbd0: Terminating asender > thread > Dec 20 14:32:41 h06 kernel: [ 94.768616] block drbd0: Connection closed > > > The other node: > Dec 20 14:22:01 h02 kernel: [340315.613659] block drbd0: Starting worker > thread (from cqueue [13627]) > Dec 20 14:22:01 h02 kernel: [340315.614016] block drbd0: disk( Diskless -> > Attaching ) > Dec 20 14:22:01 h02 kernel: [340315.681269] block drbd0: Found 4 transactions > (55 active extents) in activity log. > Dec 20 14:22:01 h02 kernel: [340315.681273] block drbd0: Method to ensure > write ordering: barrier > Dec 20 14:22:01 h02 kernel: [340315.681281] block drbd0: drbd_bm_resize > called with capacity == 1048472 > Dec 20 14:22:01 h02 kernel: [340315.681289] block drbd0: resync bitmap: > bits=131059 words=2048 pages=4 > Dec 20 14:22:01 h02 kernel: [340315.681292] block drbd0: size = 512 MB > (524236 KB) > Dec 20 14:22:01 h02 kernel: [340315.687209] block drbd0: bitmap READ of 4 > pages took 1 jiffies > Dec 20 14:22:01 h02 kernel: [340315.687227] block drbd0: recounting of set > bits took additional 0 jiffies > Dec 20 14:22:01 h02 kernel: [340315.687230] block drbd0: 0 KB (0 bits) marked > out-of-sync by on disk bit-map. > Dec 20 14:22:01 h02 kernel: [340315.687235] block drbd0: disk( Attaching -> > UpToDate ) > Dec 20 14:22:01 h02 kernel: [340315.687239] block drbd0: attached to UUIDs > 8344B9D0C389D2DC:0000000000000000:902F198E803AB8E2:902E198E803AB8E3 > Dec 20 14:22:01 h02 kernel: [340315.703404] block drbd0: conn( StandAlone -> > Unconnected ) > Dec 20 14:22:01 h02 kernel: [340315.703448] block drbd0: Starting receiver > thread (from drbd0_worker [27349]) > Dec 20 14:22:01 h02 kernel: [340315.703535] block drbd0: receiver (re)started > Dec 20 14:22:01 h02 kernel: [340315.703543] block drbd0: conn( Unconnected -> > WFConnection ) > Dec 20 14:22:02 h02 kernel: [340316.360240] block drbd0: role( Secondary -> > Primary ) > Dec 20 14:22:02 h02 kernel: [340316.360532] block drbd0: new current UUID > 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 > Dec 20 14:22:02 h02 kernel: [340316.405852] block drbd0: Handshake > successful: Agreed network protocol version 96 > Dec 20 14:22:02 h02 kernel: [340316.405859] block drbd0: conn( WFConnection > -> WFReportParams ) > Dec 20 14:22:02 h02 kernel: [340316.406014] block drbd0: Starting asender > thread (from drbd0_receiver [27371]) > Dec 20 14:22:02 h02 kernel: [340316.406153] block drbd0: data-integrity-alg: > <not-used> > Dec 20 14:22:02 h02 kernel: [340316.409149] block drbd0: drbd_sync_handshake: > Dec 20 14:22:02 h02 kernel: [340316.409153] block drbd0: self > 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 bits:0 > flags:0 > Dec 20 14:22:02 h02 kernel: [340316.409157] block drbd0: peer > B95131C56A7C2935:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 > flags:0 > Dec 20 14:22:02 h02 kernel: [340316.409161] block drbd0: uuid_compare()=100 > by rule 90 > Dec 20 14:22:02 h02 kernel: [340316.409166] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 > Dec 20 14:22:02 h02 kernel: [340316.411198] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0) > Dec 20 14:22:02 h02 kernel: [340316.411202] block drbd0: Split-Brain detected > but unresolved, dropping connection! > Dec 20 14:22:02 h02 kernel: [340316.411206] block drbd0: helper command: > /sbin/drbdadm split-brain minor-0 > Dec 20 14:22:02 h02 multipathd: drbd0: update path write_protect to '0' > (uevent) > Dec 20 14:22:02 h02 kernel: [340316.419274] block drbd0: helper command: > /sbin/drbdadm split-brain minor-0 exit code 0 (0x0) > Dec 20 14:22:02 h02 kernel: [340316.419280] block drbd0: conn( WFReportParams > -> Disconnecting ) > > > ### other node was fenced ### > > Dec 20 14:22:02 h02 kernel: [340316.419287] block drbd0: error receiving > ReportState, l: 4! > Dec 20 14:22:02 h02 kernel: [340316.419390] block drbd0: asender terminated > Dec 20 14:22:02 h02 kernel: [340316.419396] block drbd0: Terminating asender > thread > Dec 20 14:22:02 h02 kernel: [340316.419487] block drbd0: Connection closed > Dec 20 14:22:02 h02 kernel: [340316.419493] block drbd0: conn( Disconnecting > -> StandAlone ) > Dec 20 14:22:02 h02 kernel: [340316.419510] block drbd0: receiver terminated > Dec 20 14:22:02 h02 kernel: [340316.419512] block drbd0: Terminating receiver > thread > Dec 20 14:32:21 h02 kernel: [340935.071481] block drbd0: conn( StandAlone -> > Unconnected ) > Dec 20 14:32:21 h02 kernel: [340935.071526] block drbd0: Starting receiver > thread (from drbd0_worker [27349]) > Dec 20 14:32:21 h02 kernel: [340935.071679] block drbd0: receiver (re)started > Dec 20 14:32:21 h02 kernel: [340935.071687] block drbd0: conn( Unconnected -> > WFConnection ) > > ### this node was fenced ### > > Dec 20 14:38:51 h02 kernel: [ 81.985090] drbd: initialized. Version: 8.3.11 > (api:88/proto:86-96) > Dec 20 14:38:51 h02 kernel: [ 81.985094] drbd: GIT-hash: > 0de839cee13a4160eed6037c4bddd066645e23c5 build by phil@fat-tyre, 2011-06-29 > 11:37:11 > Dec 20 14:38:51 h02 kernel: [ 81.985096] drbd: registered as block device > major 147 > Dec 20 14:38:51 h02 kernel: [ 81.985098] drbd: minor_table @ > 0xffff881fe87af180 > Dec 20 14:38:51 h02 multipathd: drbd0: add path (uevent) > Dec 20 14:38:51 h02 multipathd: drbd0: failed to get path uid > Dec 20 14:38:52 h02 kernel: [ 83.045961] block drbd0: Starting worker > thread (from cqueue [13576]) > Dec 20 14:38:52 h02 kernel: [ 83.046260] block drbd0: disk( Diskless -> > Attaching ) > Dec 20 14:38:52 h02 kernel: [ 83.064193] block drbd0: Found 4 transactions > (55 active extents) in activity log. > Dec 20 14:38:52 h02 kernel: [ 83.064197] block drbd0: Method to ensure > write ordering: barrier > Dec 20 14:38:52 h02 kernel: [ 83.064200] block drbd0: max BIO size = 131072 > Dec 20 14:38:52 h02 kernel: [ 83.064206] block drbd0: drbd_bm_resize called > with capacity == 1048472 > Dec 20 14:38:52 h02 kernel: [ 83.064215] block drbd0: resync bitmap: > bits=131059 words=2048 pages=4 > Dec 20 14:38:52 h02 kernel: [ 83.064218] block drbd0: size = 512 MB (524236 > KB) > Dec 20 14:38:52 h02 kernel: [ 83.070075] block drbd0: bitmap READ of 4 > pages took 1 jiffies > Dec 20 14:38:52 h02 kernel: [ 83.070093] block drbd0: recounting of set > bits took additional 0 jiffies > Dec 20 14:38:52 h02 kernel: [ 83.070096] block drbd0: 0 KB (0 bits) marked > out-of-sync by on disk bit-map. > Dec 20 14:38:52 h02 kernel: [ 83.070109] block drbd0: Marked additional 192 > MB as out-of-sync based on AL. > Dec 20 14:38:52 h02 kernel: [ 83.070112] block drbd0: bitmap WRITE of 0 > pages took 0 jiffies > Dec 20 14:38:52 h02 kernel: [ 83.070239] block drbd0: 192 MB (49139 bits) > marked out-of-sync by on disk bit-map. > Dec 20 14:38:52 h02 kernel: [ 83.070245] block drbd0: disk( Attaching -> > UpToDate ) > Dec 20 14:38:52 h02 kernel: [ 83.070249] block drbd0: attached to UUIDs > 3778E40F06BD4779:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 > Dec 20 14:38:52 h02 kernel: [ 83.088979] block drbd0: conn( StandAlone -> > Unconnected ) > Dec 20 14:38:52 h02 kernel: [ 83.088995] block drbd0: Starting receiver > thread (from drbd0_worker [13878]) > Dec 20 14:38:52 h02 kernel: [ 83.089044] block drbd0: receiver (re)started > Dec 20 14:38:52 h02 kernel: [ 83.089053] block drbd0: conn( Unconnected -> > WFConnection ) > Dec 20 14:38:53 h02 kernel: [ 83.588830] block drbd0: Handshake successful: > Agreed network protocol version 96 > Dec 20 14:38:53 h02 kernel: [ 83.588842] block drbd0: conn( WFConnection -> > WFReportParams ) > Dec 20 14:38:53 h02 kernel: [ 83.588972] block drbd0: Starting asender > thread (from drbd0_receiver [13901]) > Dec 20 14:38:53 h02 kernel: [ 83.589219] block drbd0: data-integrity-alg: > <not-used> > Dec 20 14:38:53 h02 kernel: [ 83.589447] block drbd0: drbd_sync_handshake: > Dec 20 14:38:53 h02 kernel: [ 83.589452] block drbd0: self > 3778E40F06BD4778:8344B9D0C389D2DC:902F198E803AB8E2:902E198E803AB8E3 > bits:49139 flags:0 > Dec 20 14:38:53 h02 kernel: [ 83.589456] block drbd0: peer > B95131C56A7C2934:8344B9D0C389D2DC:902F198E803AB8E3:902E198E803AB8E3 bits:0 > flags:2 > Dec 20 14:38:53 h02 kernel: [ 83.589459] block drbd0: uuid_compare()=100 by > rule 90 > Dec 20 14:38:53 h02 kernel: [ 83.589465] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 > Dec 20 14:38:53 h02 kernel: [ 83.591700] block drbd0: helper command: > /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0) > Dec 20 14:38:53 h02 kernel: [ 83.591706] block drbd0: Split-Brain detected, > 0 primaries, automatically solved. Sync from this node > Dec 20 14:38:53 h02 kernel: [ 83.591714] block drbd0: peer( Unknown -> > Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent ) > Dec 20 14:38:53 h02 kernel: [ 83.593414] block drbd0: helper command: > /sbin/drbdadm before-resync-source minor-0 > Dec 20 14:38:53 h02 kernel: [ 83.595099] block drbd0: helper command: > /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0) > Dec 20 14:38:53 h02 kernel: [ 83.595106] block drbd0: conn( WFBitMapS -> > SyncSource ) pdsk( Consistent -> Inconsistent ) > Dec 20 14:38:53 h02 kernel: [ 83.595113] block drbd0: Began resync as > SyncSource (will sync 196556 KB [49139 bits set]). > Dec 20 14:38:53 h02 kernel: [ 83.595222] block drbd0: updated sync UUID > 3778E40F06BD4778:8345B9D0C389D2DC:8344B9D0C389D2DC:902F198E803AB8E2 > Dec 20 14:39:14 h02 kernel: [ 104.848088] block drbd0: role( Secondary -> > Primary ) > Dec 20 14:39:14 h02 multipathd: drbd0: update path write_protect to '0' > (uevent) > Dec 20 14:39:18 h02 kernel: [ 109.167582] block drbd0: Resync done (total 25 > sec; paused 0 sec; 7860 K/sec) > Dec 20 14:39:18 h02 kernel: [ 109.167588] block drbd0: updated UUIDs > 3778E40F06BD4779:0000000000000000:8345B9D0C389D2DC:8344B9D0C389D2DC > Dec 20 14:39:18 h02 kernel: [ 109.167596] block drbd0: conn( SyncSource -> > Connected ) pdsk( Inconsistent -> UpToDate ) > Dec 20 14:39:18 h02 kernel: [ 109.167722] block drbd0: bitmap WRITE of 0 > pages took 0 jiffies > Dec 20 14:39:18 h02 kernel: [ 109.167738] block drbd0: 0 KB (0 bits) marked > out-of-sync by on disk bit-map. > Dec 20 14:39:44 h02 kernel: [ 135.213231] block drbd0: peer( Secondary -> > Primary ) > > I doubt that those "split brains were caused by a network connection problem > at that times. What might be wrong? I always thought DRBD had production > quality. Ist it a problem with the Cluster RAs? > > The version used is this (SLES11 SP1): > # rpm -qf /usr/lib/ocf/resource.d/linbit/drbd > drbd-pacemaker-8.3.11-0.3.1 > > > Regards, > Ulrich > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems
signature.asc
Description: OpenPGP digital signature
_______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
