Re: [PATCH v4 00/25] block layer: split block APIs in global state and I/O

Hanna Reitz Mon, 15 Nov 2021 08:04:32 -0800

On 25.10.21 12:17, Emanuele Giuseppe Esposito wrote:

> Currently, block layer APIs like block-backend.h contain a mix of
> functions that are either running in the main loop and under the
> BQL, or are thread-safe functions and run in iothreads performing I/O.
> The functions running under BQL also take care of modifying the
> block graph, by using drain and/or aio_context_acquire/release.
> This makes it very confusing to understand where each function
> runs, and what assumptions it provided with regards to thread
> safety.
>
> We call the functions running under BQL "global state (GS) API", and
> distinguish them from the thread-safe "I/O API".
>
> The aim of this series is to split the relevant block headers in
> global state and I/O sub-headers.

Despite leaving quite some comments, the series and the split seem reasonable to me overall. (This is a pretty big series, after all, so those “some comments” stack up against a majority of changes that seem OK to me. :))

One thing I noticed while reviewing is that it’s really hard to verify that no I/O function calls a GS function. What would be wonderful is some function marker like coroutine_fn that marks GS functions (or I/O functions), so that we could then verify the call paths. But AFAIU we’ve always wanted precisely that for coroutine_fn and still don’t have it, so this seems like extremely wishful thinking... :(
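To make the marker idea concrete, here is a minimal standalone sketch, not anything from the series. Everything in it is hypothetical: global_state_fn is an empty annotation in the style of coroutine_fn, and the toy_* names and the per-thread flag are made up for illustration. An empty marker alone verifies nothing, so the sketch pairs it with a runtime assertion, which only catches violations on paths that are actually exercised.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical annotation, empty like coroutine_fn: documents intent only. */
#define global_state_fn

/* Hypothetical per-thread flag; the main loop thread would set it at startup. */
static _Thread_local bool toy_in_main_thread;

static void toy_mark_main_thread(void)
{
    toy_in_main_thread = true;
}

/* True only in a thread that has declared itself the main loop thread. */
static bool toy_gs_call_allowed(void)
{
    return toy_in_main_thread;
}

/* Counts how often the toy GS function below has run. */
static int toy_perm_updates;

/* Stand-in for a GS function such as a permission update. */
static global_state_fn void toy_set_perm(void)
{
    /* Runtime check: aborts if reached from a thread that is not marked. */
    assert(toy_gs_call_allowed());
    toy_perm_updates++;
}
```

A thread that never calls toy_mark_main_thread() would trip the assertion in toy_set_perm(). That is weaker than a static call-path check, but it needs no compiler support.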

I think most of the issues I found can be fixed (or are even irrelevant), the only thing that really worries me are the two places that are clearly I/O paths that call permission functions: Namely first block_crypto_amend_options_generic_luks() (part of the luks block driver’s .bdrv_co_amend implementation), which calls bdrv_child_refresh_perms(); and second fuse_do_truncate(), which calls blk_set_perm().

In the first case, we need this call so that we don’t permanently hog the WRITE permission for the luks file, which used to be a problem, I believe. We want to unshare the WRITE permission (and apparently also CONSISTENT_READ) during the key update, so we need some way to temporarily update the permissions.


I only really see four solutions for this:

(1) We somehow make the amend job run in the main context under the BQL and have it prevent all concurrent I/O access (seems bad)
(2) We can make the permission functions part of the I/O path (seems wrong and probably impossible?)
(3) We can drop the permissions update and permanently require the permissions that we need when updating keys (I think this might break existing use cases)
(4) We can acquire the BQL around the permission update call and perhaps that works?

I don’t know how (4) would work but it’s basically the only reasonable solution I can come up with. Would this be a way to call a BQL function from an I/O function?
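Just to illustrate the shape of (4), here is a toy sketch under loud assumptions: a plain pthread mutex stands in for the BQL, and every toy_* name is hypothetical rather than a QEMU function. It only shows the pattern of taking the global lock around each permission update from an I/O path; it says nothing about whether that is safe in QEMU.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>

#define TOY_PERM_WRITE 0x1u

/* Stand-in for the BQL (assumption: a plain, non-recursive mutex). */
static pthread_mutex_t toy_bql = PTHREAD_MUTEX_INITIALIZER;
/* Per-thread record of whether this thread currently holds toy_bql. */
static _Thread_local bool toy_bql_held;

static unsigned toy_perm;    /* permission bits currently held */
static int toy_updates;      /* number of permission updates performed */

static void toy_bql_lock(void)
{
    pthread_mutex_lock(&toy_bql);
    toy_bql_held = true;
}

static void toy_bql_unlock(void)
{
    toy_bql_held = false;
    pthread_mutex_unlock(&toy_bql);
}

/* Toy GS function: must only run with the global lock held. */
static void toy_refresh_perms(unsigned perm)
{
    assert(toy_bql_held);
    toy_perm = perm;
    toy_updates++;
}

/* Toy I/O path: takes the lock only around each permission update. */
static void *toy_io_amend(void *opaque)
{
    (void)opaque;

    toy_bql_lock();
    toy_refresh_perms(TOY_PERM_WRITE);   /* temporarily take WRITE */
    toy_bql_unlock();

    /* ... the key update itself would run here, without the lock ... */

    toy_bql_lock();
    toy_refresh_perms(0);                /* give the permission back */
    toy_bql_unlock();
    return NULL;
}
```

The obvious risk with this pattern is a deadlock: if the main thread holds the lock while waiting for this I/O path to finish (during a drain, say), the lock call in the I/O path never returns. The sketch does not address that.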

As for the second case, the same applies as above, with the differences that we have no jobs, so this code must always run in the block device’s AioContext (I think), which rules out (1); but (3) would become easier (i.e. require the RESIZE permission all the time), although that too might have an impact on existing users (don’t think so, though). In any case, if we could do (4), that would solve the problem here, too.

And finally, another notable thing I noticed is that the way create-related functions are handled is inconsistent. I believe they should all be GS functions; qmp_blockdev_create() seems to agree with me on this, but we currently seem to have some bugs there. It’s possible to invoke blockdev-create on a block device that’s in an I/O thread, and then qemu crashes. Oops. (The comment in qmp_blockdev_create() says that the block drivers’ implementations should prevent this, but apparently they don’t...?) In any case, that’s a pre-existing bug, of course, that doesn’t concern this series (other than that it suggests that “create” functions should be classified as GS).


Hanna
