On 1/26/19 9:18 AM, jianchao.wang wrote:
>
>
> On 1/26/19 6:10 AM, Florian Stecker wrote:
>>
>>
>> On 1/25/19 10:05 AM, jianchao.wang wrote:
>>>
>>>
>>> On 1/25/19 4:56 PM, Florian Stecker wrote:
>>>> On 1/25/19 4:49 AM, jianchao.wang wrote:
>>>>>
>>>>> It sounds like not so easy to trigger.
>>>>>
>>>>> blk_mq_dispatch_rq_list
>>>>> scsi_queue_rq
>>>>> if (atomic_read(&sdev->device_busy) ||
>>>>> scsi_device_blocked(sdev))
>>>>> ret = BLK_STS_DEV_RESOURCE; scsi_end_request
>>>>> __blk_mq_end_request
>>>>> blk_mq_sched_restart
>>>>> // clear RESTART
>>>>> blk_mq_run_hw_queue
>>>>> blk_mq_run_hw_queues
>>>>> list_splice_init(list, &hctx->dispatch)
>>>>> needs_restart = blk_mq_sched_needs_restart(hctx)
>>>>>
>>>>> The 'needs_restart' will be false, so the queue would be rerun.
>>>>>
>>>>> Thanks
>>>>> Jianchao
>>>>
>>>> Good point. So the RESTART flag is supposed to protect against this? Now I
>>>> see, this is also sort of what the lengthy comment in
>>>> blk_mq_dispatch_rq_list is saying.
>>>>
>>>> May I complain that this is very unintuitive (the queue gets rerun when
>>>> the RESTART flag is _not_ set) and also unreliable, as not every caller of
>>>> blk_mq_dispatch_rq_list seems to set the flag, and also it does not always
>>>> get cleared in __blk_mq_end_request?
>>>>
>>>> __blk_mq_end_request does the following:
>>>>
>>>> if (rq->end_io) {
>>>> rq_qos_done(rq->q, rq);
>>>> rq->end_io(rq, error);
>>>> } else {
>>>> if (unlikely(blk_bidi_rq(rq)))
>>>> blk_mq_free_request(rq->next_rq);
>>>> blk_mq_free_request(rq);
>>>> }
>>>>
>>>> and blk_mq_free_request then calls blk_mq_sched_restart, which clears the
>>>> flag. But in my case, rq->end_io != 0, so blk_mq_free_request is never
>>>> called.
>>>
>>> So what is your rq->end_io ?
>>> flush_end_io ? or mq_flush_data_end_io ? or other ?
>>> In normal case, the blk_mq_end_request should be finally invoked.
>>>
>>> Did you ever try the bfq io scheduler instead of mq-deadline ?
>>>
>>> Can you share your dmesg and config file here ?
>>
>> Sure.
>>
>> dmesg:
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__florianstecker.de_dmesg-2D2019-2D01-2D25.txt&d=DwIDaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=7WdAxUBeiTUTCy8v-7zXyr4qk7sx26ATvfo6QSTvZyQ&m=t_udGGsH8ivtjXHin4oQ4LxELuPoXbl_ARP7oSxiQ6k&s=DC2fhRw2g4haGhhHxOYyX4zD6FWQQ5YoERdfptVRHM4&e=
>> Note that no I/O scheduler is in use here, I deactivated them in a udev rule
>> because I still want to be able to use my laptop. When I test this stuff I
>> just reactivate mq-deadline manually.
>>
>> Config is the default in Arch Linux:
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__git.archlinux.org_svntogit_packages.git_tree_trunk_config-3Fh-3Dpackages_linux&d=DwIDaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=7WdAxUBeiTUTCy8v-7zXyr4qk7sx26ATvfo6QSTvZyQ&m=t_udGGsH8ivtjXHin4oQ4LxELuPoXbl_ARP7oSxiQ6k&s=GjV_fyXaKZ8za8zdB_gqmKW7yBlbla2OHRaAT0iGZkc&e=
>>
>> The problem also appears with BFQ.
>>
>> And rq->end_io is set to mq_flush_data_end_io when this happens. The only
>> point I see where this function could invoke blk_mq_end_request is via
>> blk_flush_complete_seq, but it does so only if seq == REQ_FSEQ_DONE.
>> However, seq is REQ_FSEQ_POSTFLUSH for me (i.e. it just transitioned from
>> REQ_FSEQ_DATA to REQ_FSEQ_POSTFLUSH).
>
>
> Could it be this scenario ?
>
> data + post flush
>
> blk_flush_complete_seq a flush
> case REQ_FSEQ_DATA
> blk_flush_queue_rq
>
> finally issued to driver blk_mq_dispatch_rq_list
> try to issue a flush req
> failed due to NON-NCQ command
> due to RESTART is set, do nothing
> req complete
> req->end_io() // doesn't check RESTART
> mq_flush_data_end_io
> blk_flush_complete_seq
> case REQ_FSEQ_POSTFLUSH
> blk_kick_flush
> do nothing because previous flush
> has not been completed, so
> flush_pending_idx != flush_running_idx
>
If it is right, maybe we could try following
diff --git a/block/blk-mq.c b/block/blk-mq.c
index 3ba37b9..3411b6a 100644
--- a/block/blk-mq.c
+++ b/block/blk-mq.c
@@ -549,6 +549,7 @@ inline void __blk_mq_end_request(struct request *rq,
blk_status_t error)
if (rq->end_io) {
rq_qos_done(rq->q, rq);
rq->end_io(rq, error);
+ blk_mq_sched_restart(hctx);
} else {
if (unlikely(blk_bidi_rq(rq)))
blk_mq_free_request(rq->next_rq);
>
>
>
>>>
>>