aries wrote:
The lustre topology is simple , each ost has the same size of 1G
/- ost1(sdb1)
client - mds(mgs,mdt) - ost2(sdb1)
\- ost3(sdb1)
Output of "lctl dl" on mds
[EMAIL PROTECTED] ~]# lctl dl
1 UP mgc [EMAIL PROTECTED] c603fed3-3f71-89a3-ab80-84563a5190f5 5
2 UP mdt MDS MDS_uuid 3
3 UP lov pogo-mdtlov pogo-mdtlov_UUID 4
4 UP mds pogo-MDT0000 pogo-MDT0000_UUID 5
5 UP osc pogo-OST0000-osc pogo-mdtlov_UUID 5
6 UP osc pogo-OST0001-osc pogo-mdtlov_UUID 5
7 UP osc pogo-OST0002-osc pogo-mdtlov_UUID 5
Output of "df -h /test" on client
[EMAIL PROTECTED] ~]# df -h /test
Filesystem Size Used Avail Use% Mounted on
[EMAIL PROTECTED]:/pogo 2.8G 153M 2.5G 6% /test
Now I add a failover pair of sdb2(ost1) and sdb1(ost2).
ost1: mkfs.lustre --fsname=pogo --ost --failnode=ost2
[EMAIL PROTECTED] /dev/sdb2
after i add the failover pair
Output of "lctl dl" on mds:
[EMAIL PROTECTED] ~]# lctl dl
1 UP mgc [EMAIL PROTECTED] c603fed3-3f71-89a3-ab80-84563a5190f5 5
2 UP mdt MDS MDS_uuid 3
3 UP lov pogo-mdtlov pogo-mdtlov_UUID 4
4 UP mds pogo-MDT0000 pogo-MDT0000_UUID 5
5 UP osc pogo-OST0000-osc pogo-mdtlov_UUID 5
6 UP osc pogo-OST0001-osc pogo-mdtlov_UUID 5
7 UP osc pogo-OST0002-osc pogo-mdtlov_UUID 5
8 UP osc pogo-OST0003-osc pogo-mdtlov_UUID 5
Output of "df -h /test" on client:
[EMAIL PROTECTED] ~]# df -h /test
Filesystem Size Used Avail Use% Mounted on
[EMAIL PROTECTED]:/pogo 3.7G 170M 3.3G 5% /test
So the question: Does it count 1 ost size of a failover pair into lov
or 2 ost size?
A failover pair is a pair of separate nodes that both have access to a
shared disk. In your case, sdb2 on ost1 and sdb1 on ost2 need to map
to the same physical device. So a failover pair counts as 1 ost in all
cases; it just happens to have a failover address to be able to access
the data.
Then I umount sdb1 on ost2 ,the /test direcotry on client kust hanged.
You must mount it then on ost1. Note that you should not have it
mounted on both ost1 and ost2 at the same time.
The following message appear on ost2
Lustre: 3417:0:(lib-move.c:1644:lnet_parse_put()) Dropping PUT from
[EMAIL PROTECTED] portal 7 match 667 offset 0 length 128: 2
Lustre: 3416:0:(lib-move.c:1644:lnet_parse_put()) Dropping PUT from
[EMAIL PROTECTED] portal 28 match 670 offset 0 length 128: 2
This just says that 1.210 and 1.200 are still looking for something on
ost2 that is not handled by anything (i.e. OST0003, which you unmounted.)
I just follow the doc of mountconf. Plz let me know which step is
wrong , thanks in advance.
------------------------------------------------------------------------
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss