hmm.. This definitely is not expected. As long as ost 1 is down, it should
be returned immediately from OSC layer and tries to read the 2nd mirror
that is located on ost 7. For the following blocks, it should not even try
ost1 but go to 7 directly.

Would you please collect Lustre log and send it to me? You can collect logs
on client side as follows:
0. create mirrored file
1. lctl set_param debug=-1 && lctl clear
2. lctl mark "======= start ========"
3. read the file
4. lctl dk > log.txt

and send me the log.txt file. If you can reproduce this problem
consistently, please use a small file so that it would be easier to check
the log.

Jinshan

On Mon, May 20, 2019 at 6:20 AM John Doe <[email protected]> wrote:

> It turns out that the read eventually finished and was 1/10th of the
> performance that I was expecting.
>
> As ost idx 1 is unavailable, the client read has to timeout on ost idx 1
> and then will read from ost idx 7. This happens for each 1MB block, as I am
> using that as the block size.
>
> Is there a tunable to avoid this issue?
>
> lfs check osts also takes about 30 seconds as it times out on the
> unavailable OST.
>
> Due to this issue, I am virtually unable to use the mirroring feature.
>
> I
>
> On Sun, May 19, 2019 at 4:27 PM John Doe <[email protected]> wrote:
>
>> After mirroring a file , when one mirror is down, any reads from a client
>> just hangs. Both server and client are running latest 2.12.1-1. Client
>> waits for ost idx 1 to come back online.  I am only unmounting ost idx1 not
>> ost idx 7.
>>
>> Has anyone tried this feature?
>>
>> Thanks,
>> John.
>>
>> lfs getstripe mirror10
>> mirror10
>>   lcm_layout_gen:    5
>>   lcm_mirror_count:  2
>>   lcm_entry_count:   2
>>     lcme_id:             65537
>>     lcme_mirror_id:      1
>>     lcme_flags:          init
>>     lcme_extent.e_start: 0
>>     lcme_extent.e_end:   EOF
>>       lmm_stripe_count:  1
>>       lmm_stripe_size:   1048576
>>       lmm_pattern:       raid0
>>       lmm_layout_gen:    0
>>       lmm_stripe_offset: 1
>>       lmm_pool:          01
>>       lmm_objects:
>>       - 0: { l_ost_idx: 1, l_fid: [0x100010000:0x280a8:0x0] }
>>
>>     lcme_id:             131074
>>     lcme_mirror_id:      2
>>     lcme_flags:          init
>>     lcme_extent.e_start: 0
>>     lcme_extent.e_end:   EOF
>>       lmm_stripe_count:  1
>>       lmm_stripe_size:   1048576
>>       lmm_pattern:       raid0
>>       lmm_layout_gen:    0
>>       lmm_stripe_offset: 7
>>       lmm_pool:          02
>>       lmm_objects:
>>       - 0: { l_ost_idx: 7, l_fid: [0x100070000:0x28066:0x0] }
>>
> _______________________________________________
> lustre-discuss mailing list
> [email protected]
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
>
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to