On 6/16/2026 10:53 AM, Peter Oberparleiter wrote:

> On 16.06.2026 11:16, Cornelia Huck wrote:
>> On Tue, Jun 16 2026, Peter Oberparleiter <[email protected]> wrote:
>>
>>> On 15.06.2026 23:42, Halil Pasic wrote:
>>>> On Mon, 15 Jun 2026 16:01:55 -0400
>>>> William Bezenah <[email protected]> wrote:
>>>>
>>>>> On 6/15/2026 10:58 AM, Cornelia Huck wrote:
>>>>>> On Mon, Jun 15 2026, Halil Pasic <[email protected]> wrote:
>>>>>>  
>>>>>>> On Fri, 12 Jun 2026 17:54:07 +0200
>>>>>>> William Bezenah <[email protected]> wrote:
>>>>>>>  
>>>>>>>> Since commit 8c58a229688c ("s390/cio: Do not unregister the
>>>>>>>> subchannel based on DNV"), subchannel behavior following a device
>>>>>>>> detach has been updated and results in -EINVAL being propagated
>>>>>>>> rather than -ENODEV, originating from ccw_device_start_timeout_key()
>>>>>>>> in cio/device_ops. In the end, the virtio driver has no ability to
>>>>>>>> react to the difference between device and subchannel states here,
>>>>>>>> and during detach, both -ENODEV and -EINVAL indicate the device
>>>>>>>> cannot be used and should not be treated as errors requiring
>>>>>>>> attention. Update error handling in virtio_ccw_del_vq() and
>>>>>>>> virtio_ccw_drop_indicator() to suppress -EINVAL in addition to
>>>>>>>> -ENODEV.  
>>>>>>> Hi William!
>>>>>>>
>>>>>>> Are you saying that ccw_device_start() started returning -EINVAL
>>>>>>> since 8c58a229688c ("s390/cio: Do not unregister the subchannel based on
>>>>>>> DNV")? Or did I somehow read the paragraph wrong?
>>>>>>>
>>>>>>> The funcition ccw_device_start is documented to return:
>>>>>>>  * Returns:                                                             
>>>>>>>         
>>>>>>>  *  %0, if the operation was successful;                                
>>>>>>>         
>>>>>>>  *  -%EBUSY, if the device is busy, or status pending;                  
>>>>>>>         
>>>>>>>  *  -%EACCES, if no path specified in @lpm is operational;              
>>>>>>>         
>>>>>>>  *  -%ENODEV, if the device is not operational. 
>>>>>>> and the commit message does not say a thing about introducing -EINVAL to
>>>>>>> the mix.  
>>>>>> The function may return -EINVAL for non-enabled subchannels
>>>>>> (i.e. pmcw.ena == 0), maybe we get an all-zeroes schib with dnv == 0?
>>>>>> I'd expect it not to be enabled in that case anyway.  
>>>>> Yep, that's at least how I've come to understand what changed. The
>>>>> function ccw_device_start_timeout_key() has always returned -EINVAL
>>>>> for non-enabled subchannels (pmcw.ena == 0), though it's not
>>>>> documented in the header.
>>>> Wasn't his -EINVAL actually introduced by commit:
>>>> 823d494ac111 ("[S390] pm: ccw bus power management callbacks")?
>>> In the context of virtio-ccw added in 2012, an EINVAL return code
>>> introduced in 2009 might be considered "always" :)
>> :)
>>
>> I'm wondering whether we should still expect to hit the "ssch with
>> ena==0" situation, given that pm support has been removed again in the
>> meanwhile. (Well, other than in situations like this, where it is a
>> follow-up to other problems.) IOW, can callers expect not to see
>> -EINVAL, unless they are doing something really stupid?
> As Halil also pointed out, this would be a programming error, either on
> the side of the driver that starts I/O without setting the device
> properly online, or in the common I/O layer (hopefully not, but you
> never know). Having a dedicated return code to identify this situation
> is definitely useful, and we'll also consider documenting it accordingly
> in the function comment.
>
>>>>> What changed with commit 8c58a229688c is that cio_update_schib() now
>>>>> updates the schib even when DNV=0, rather than returning early as it
>>>>> did previously. Somehow this update results in pmcw.ena == 0 in
>>>>> ccw_device_start_timeout_key(). Previously, it saw pmcw.ena == 1 and
>>>>> moved to the condition (cdev->private->state == DEV_STATE_NOT_OPER)
>>>>> where it returned -ENODEV.
>>>> Sounds fishy to me. As far as I understand the DNV takes precedence over
>>>> all other pieces of PMCW.
>>> And you're right about that! The Principles of Operation states (p. 15-4
>>> in SA22-7832-14 [1]) that the contents of all other fields in the PMCW
>>> are unpredictable when DNV is 0, therefore 8c58a229688c is in error.
>>>
>>> I'll work with Vineeth to determine how to fix this issue, potentially
>>> via manually clearing some relevant SCHIB fields instead of copying the
>>> unpredictable results of the STSCH instruction.
>> Can't you zero the whole SCHIB, or do you still need some of the
>> measurement block things for cleanup?
> I faintly remember that there WAS a reason to use the remainder of the
> SCHIB contents because of some unwanted effect that occurred if we
> didn't, but I don't recall the details. We'll need to dig up the
> associated bug report to understand it and determine if we can simply
> clear all of the SCHIB, or need to keep some of the information intact.
>
>>>>> So the commit didn't introduce -EINVAL as a new return value, rather,
>>>>> it changed the subchannel lifecycle such that existing paths now
>>>>> propagate -EINVAL rather than -ENODEV during the device detach
>>>>> scenario.
>>>> I'm not convinced returning -EINVAL in the given situation is the
>>>> right thing to do. Peter, would you mind to chime in?
>>> I tend to agree that an attempt to start I/O for a subchannel that has
>>> DNV 0 should result in ENODEV rather than EINVAL, though the latter is
>>> still valid when a driver tries to start I/O on a subchannel that is not
>>> enabled for I/O.
>>>
>>> We'll make sure to design the fix for 8c58a229688c in away that ENODEV
>>> will be returned when DNV is 0. Assuming that this is the only situation
>>> where virtio-ccw's ccw_io_helper() receives -EINVAL from
>>> ccw_device_start__timeout_key() during detach, the subject patch should
>>> no longer be necessary.
>> I agree, I'd not expect to get -EINVAL in ccw_io_helper().
> Yeah, this was definitely an unexpected side effect of the DNV commit.
>
FWIW, during my testing, that was indeed the only situation in which
I saw -EINVAL during the detach process. So assuming the right fix
would correctly return ENODEV, I agree this patch is likely
unnecessary.

Thanks all.


Reply via email to