Hi folks,

We recently upgraded two of our filesystems to Lustre 2.10.5 (from
IEEL 3.1, Lustre 2.7.x), and I've noticed that since then robinhood
doesn't seem to be reading or clearing changelogs on the one with DNE.

(I'd only been doing manual scans since the upgrade; today I enabled
changelogs via changelog_register on each MDT.)

 rbh-report -a
Changelog stats for MDT0000:
    last_read record: rec_id=25431, rec_time=2018/12/14 08:40:49.587248, step_time=2018/12/14 08:41:39.443366
    last_pushed record: rec_id=25407, rec_time=2018/12/14 08:40:44.699018, step_time=2018/12/14 08:41:39.442628
    last_committed record: rec_id=28414, rec_time=2018/12/14 08:50:31.019852, step_time=2018/12/14 08:51:26.367623
    last_cleared record: rec_id=25407, rec_time=2018/12/14 08:40:44.699018, step_time=2018/12/14 08:41:39.450362
    Changelog stats per type (MDT0000):
         type            total (diff) (rate)
        CREAT:             771
        MKDIR:             157
        UNLNK:             768
        RMDIR:             157
        CLOSE:           12414 (+2309) (2.57/sec)
        TRUNC:           11173 (+2308) (2.56/sec)

Changelog stats for MDT0001:
    last_read record: rec_id=15480, rec_time=2018/12/14 08:23:16.923499, step_time=2018/12/14 08:24:07.599741

and that's all it has for astrofs-MDT0001.
Both are set up with the same mask:

[root@hpc-xcat1 ~]# xdsh astrofs-mds[1,2],pgfs-mds2 lctl get_param mdd.*.changelog_mask | xdshbak -c
HOSTS -------------------------------------------------------------------------
astrofs-mds1
-------------------------------------------------------------------------------
mdd.astrofs-MDT0000.changelog_mask=
MARK CREAT MKDIR HLINK SLINK MKNOD UNLNK RMDIR RENME RNMTO OPEN CLOSE
LYOUT TRUNC SATTR XATTR HSM MTIME CTIME MIGRT

HOSTS -------------------------------------------------------------------------
astrofs-mds2
-------------------------------------------------------------------------------
mdd.astrofs-MDT0001.changelog_mask=
MARK CREAT MKDIR HLINK SLINK MKNOD UNLNK RMDIR RENME RNMTO OPEN CLOSE
LYOUT TRUNC SATTR XATTR HSM MTIME CTIME MIGRT

HOSTS -------------------------------------------------------------------------
pgfs-mds2
-------------------------------------------------------------------------------
mdd.pgfs-MDT0000.changelog_mask=
MARK CREAT MKDIR HLINK SLINK MKNOD UNLNK RMDIR RENME RNMTO OPEN CLOSE
LYOUT TRUNC SATTR XATTR HSM MTIME CTIME MIGRT

and the changelogs seem to be racking up:

[root@hpc-xcat1 ~]# xdsh astrofs-mds[1,2] cat /proc/fs/lustre/mdd/*/changelog_users
astrofs-mds2: current index: 23220
astrofs-mds2: ID    index
astrofs-mds2: cl1   0
astrofs-mds1: current index: 31839
astrofs-mds1: ID    index
astrofs-mds1: cl1   31839
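
To put a number on how far each reader is behind, the gap between
"current index" and the reader's own index can be computed from that
output. A minimal Python sketch, assuming the changelog_users layout
shown above (an xdsh "host: " prefix, one "current index" line, then
one row per reader); the function name changelog_backlog is
hypothetical and not part of Lustre or robinhood:

```python
import re
from collections import defaultdict


def changelog_backlog(text):
    """Return {(host, reader_id): records_behind}.

    Assumes xdsh-prefixed changelog_users output as pasted above:
    "host: current index: N" followed by "host: clX  M" rows.
    """
    current = {}                  # host -> current changelog index
    readers = defaultdict(dict)   # host -> {reader_id: last cleared index}
    for line in text.splitlines():
        host, sep, rest = line.partition(": ")
        if not sep:
            continue              # no "host: " prefix, skip
        rest = rest.strip()
        m = re.match(r"current index:\s*(\d+)$", rest)
        if m:
            current[host] = int(m.group(1))
            continue
        m = re.match(r"(cl\d+)\s+(\d+)\b", rest)
        if m:
            readers[host][m.group(1)] = int(m.group(2))
    return {
        (host, reader): current[host] - index
        for host, by_reader in readers.items()
        if host in current
        for reader, index in by_reader.items()
    }
```

For the output above this gives a backlog of 23220 for cl1 on
astrofs-mds2 (nothing has been cleared on MDT0001) and 0 for cl1 on
astrofs-mds1.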


A quick read of the astrofs-MDT0001 changelog shows it is full of
CREAT, CLOSE and UNLNK records, as expected:

magnus-1:~ # lfs changelog astrofs-MDT0001 | head
1 01CREAT 23:49:09.486538354 2018.12.13 0x0 t=[0xa4000dd40:0xb144:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
2 11CLOSE 23:49:09.487007753 2018.12.13 0xc3 t=[0xa4000dd40:0xb144:0x0]
3 06UNLNK 23:49:09.487335599 2018.12.13 0x1 t=[0xa4000dd40:0xb144:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
4 01CREAT 23:49:09.487814447 2018.12.13 0x0 t=[0xa4000dd40:0xb145:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
5 11CLOSE 23:49:09.487986846 2018.12.13 0xc3 t=[0xa4000dd40:0xb145:0x0]
6 06UNLNK 23:49:09.488221218 2018.12.13 0x1 t=[0xa4000dd40:0xb145:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
7 01CREAT 23:49:09.488591157 2018.12.13 0x0 t=[0xa4000dd40:0xb146:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
8 11CLOSE 23:49:09.488745392 2018.12.13 0xc3 t=[0xa4000dd40:0xb146:0x0]
9 06UNLNK 23:49:09.488970060 2018.12.13 0x1 t=[0xa4000dd40:0xb146:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
10 01CREAT 23:49:09.489334721 2018.12.13 0x0 t=[0xa4000dd40:0xb147:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
magnus-1:~ # lfs changelog astrofs-MDT0001 | tail
23211 06UNLNK 00:51:54.340464800 2018.12.14 0x1 t=[0xa4000dd40:0xcf7c:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23212 01CREAT 00:51:54.341178127 2018.12.14 0x0 t=[0xa4000dd40:0xcf7d:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23213 11CLOSE 00:51:54.341328738 2018.12.14 0xc3 t=[0xa4000dd40:0xcf7d:0x0]
23214 06UNLNK 00:51:54.341530488 2018.12.14 0x1 t=[0xa4000dd40:0xcf7d:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23215 01CREAT 00:51:54.341881777 2018.12.14 0x0 t=[0xa4000dd40:0xcf7e:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23216 11CLOSE 00:51:54.342020836 2018.12.14 0xc3 t=[0xa4000dd40:0xcf7e:0x0]
23217 06UNLNK 00:51:54.342211583 2018.12.14 0x1 t=[0xa4000dd40:0xcf7e:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23218 01CREAT 00:51:54.342805330 2018.12.14 0x0 t=[0xa4000dd40:0xcf7f:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
23219 11CLOSE 00:51:54.342941539 2018.12.14 0xc3 t=[0xa4000dd40:0xcf7f:0x0]
23220 06UNLNK 00:51:54.343132968 2018.12.14 0x1 t=[0xa4000dd40:0xcf7f:0x0] p=[0xa4000d488:0x6a8b:0x0] AAVS1_SKALA2_embedded_element_02_rev0_100_100.bof.lock
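
Rather than eyeballing head and tail, the record types in the whole
changelog can be tallied. A rough Python sketch, assuming each record
starts with "<index> <2-digit type code><TYPE> " as in the lfs
changelog output above; the function name count_record_types is
hypothetical, and lines that do not start a record (for example
wrapped continuation lines) are ignored:

```python
import re
from collections import Counter

# "<record index> <two-digit type code><TYPE name> ..." at start of line
RECORD = re.compile(r"^(\d+) (\d{2})([A-Z_]+) ")


def count_record_types(lines):
    """Count changelog records per type name (CREAT, CLOSE, UNLNK, ...)."""
    counts = Counter()
    for line in lines:
        m = RECORD.match(line)
        if m:
            counts[m.group(3)] += 1
    return counts
```

It accepts any iterable of lines, so the output of
"lfs changelog astrofs-MDT0001" can be piped into a small script that
passes sys.stdin to it.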


My configuration is as follows:
ChangeLog {
    MDT {
        mdt_name = "MDT0000";
        reader_id = "cl1";
    }
    MDT {
        mdt_name = "MDT0001";
        reader_id = "cl1";
    }
}


Anyone have any ideas?

Andrew

