sorry, accidentally posted to wrong list, please ignore.  (unless you know
the answers :P )

On 17 September 2015 at 13:56, Lee Musgrave <l...@sclinternet.co.uk> wrote:

>
> Hi,
>
> i'm a little confused by this, and i don't want to do anything with these
> systems until i've got some clarity.
>
> i'm using drbd 8.3.13 on ubuntu 12.04.3,  ( i know i should upgrade, but
> that isn't an option right now).
>
> the metadata partition is on /dev/sda2, the os is installed on /dev/sda1
>
>
> on node1  cat /proc/drbd shows:
>
>
>  0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
>     ns:40 nr:0 dw:12 dr:1104 al:2 bm:3 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
>  1: cs:Connected ro:Primary/Secondary ds:Diskless/UpToDate C r-----
>     ns:2015462262 nr:516042356 dw:1353605609 dr:1812380379 al:67433064
> bm:3 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
>  2: cs:Connected ro:Primary/Secondary ds:Diskless/UpToDate C r-----
>     ns:536493384 nr:80 dw:502334308 dr:1646088 al:28554 bm:0 lo:0 pe:0
> ua:0 ap:0 ep:1 wo:f oos:0
>  3: cs:Connected ro:Primary/Secondary ds:Diskless/UpToDate C r-----
>     ns:201451152 nr:0 dw:131716884 dr:84892 al:211 bm:0 lo:0 pe:0 ua:0
> ap:0 ep:1 wo:f oos:0
>  4: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
>     ns:3202744 nr:0 dw:3202744 dr:349700 al:57 bm:0 lo:0 pe:0 ua:0 ap:0
> ep:1 wo:f oos:0
>
>
> on node2 :
>
>  0: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
>     ns:0 nr:40 dw:40 dr:0 al:0 bm:3 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
>  1: cs:Connected ro:Secondary/Primary ds:UpToDate/Diskless C r-----
>     ns:516043328 nr:2015478830 dw:2015478830 dr:516043328 al:33272078
> bm:3 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:289286616
>  2: cs:Connected ro:Secondary/Primary ds:UpToDate/Diskless C r-----
>     ns:80 nr:536493908 dw:536493908 dr:80 al:536 bm:0 lo:0 pe:0 ua:0 ap:0
> ep:1 wo:f oos:1825092
>  3: cs:Connected ro:Secondary/Primary ds:UpToDate/Diskless C r-----
>     ns:0 nr:201451968 dw:201451968 dr:0 al:298 bm:0 lo:0 pe:0 ua:0 ap:0
> ep:1 wo:f oos:91228
>  4: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
>     ns:0 nr:3202744 dw:3202744 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1
> wo:f oos:0
>
>
> 0 is on /dev/sdb1, 1 is on /dev/sdb2, 2 is on /dev/sdc1, 3 is on
> /dev/sdc2, 4 is on /dev/sdb3
>
> /dev/sdb and /dev/sdc are 2 separate raid 10 arrays, each of 4 disks.
>
>
> i don't believe there is any problem with the raid arrays, or their
> composite disks. on node 1, the filesystem is currently mounted readonly,
> so i believe the problem is the os disk, which also has the metadata
> partition on it.
>
> does this seem the most likely to you?
>
> what's confusing me, is it's the node 2 partitions showing as out-of-sync,
> how? surely it's the diskless partitions that are oos? is it keeping
> everything in memory? are changes actually getting written to the disks on
> node2?
>
> as i said, i believe the problem to be access to the metadata, since all 5
> drbd partitions share the same metadata partition, what would be the
> recommended recovery method, i don't even want to try remounting / as rw
> until i've got a bit more information, right now things are still working,
> although in a degraded state, and it's not live, but i want to treat it as
> live so i know i can recover from the same situation when it is in
> production, so downtime or data loss needs to be avoided if at all possible.
>
>
> thanks
> lee.
>

Reply via email to