Re: [networking-discuss] GLDv3 locking concerns...

Garrett D'Amore Thu, 03 May 2007 11:43:18 -0700

Eric Cheng wrote:

Garrett D'Amore wrote:
So, given this, is it safe for me to just call mac_{link,tx}_notify() while holding driver locks?
I would recommend that you not do this even though the code appears to be safe for now. the reason is that you do not know what the notify callback is going to do. it may somehow loopback down to the driver to acquire the same lock you're holding.

I realize that is a concern, which is why I'm asking here. If all the notify does (and ever will do) is fire off a taskq, then it's going to be safe. But if it loops back (or ever will), then it's not.
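
The loopback hazard described above can be reproduced in user space. The following is only a sketch using POSIX threads, not GLDv3 code: every demo_* name is hypothetical, and an error-checking mutex stands in for the driver lock so that the re-entrant acquire returns EDEADLK instead of hanging.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <pthread.h>

/* Hypothetical driver state; none of these names come from GLDv3. */
typedef struct demo_drv {
    pthread_mutex_t lock;                  /* stand-in for the driver lock */
    void (*notify_cb)(struct demo_drv *);  /* stand-in for the upper layer's notify */
    int reentry_err;                       /* result of the callback's lock attempt */
} demo_drv_t;

static void demo_drv_init(demo_drv_t *d, void (*cb)(demo_drv_t *))
{
    pthread_mutexattr_t attr;
    int rc;

    rc = pthread_mutexattr_init(&attr);
    assert(rc == 0);
    /* ERRORCHECK: relocking by the owner fails with EDEADLK rather than hanging. */
    rc = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ERRORCHECK);
    assert(rc == 0);
    rc = pthread_mutex_init(&d->lock, &attr);
    assert(rc == 0);
    (void)rc; /* keeps builds with NDEBUG free of unused-variable warnings */
    pthread_mutexattr_destroy(&attr);
    d->notify_cb = cb;
    d->reentry_err = 0;
}

/* A callback that loops back into the driver and wants the driver lock. */
static void demo_loopback_cb(demo_drv_t *d)
{
    d->reentry_err = pthread_mutex_lock(&d->lock);
    if (d->reentry_err == 0)
        pthread_mutex_unlock(&d->lock);
}

/* Risky: the notify runs while the driver lock is held. */
static void demo_notify_locked(demo_drv_t *d)
{
    pthread_mutex_lock(&d->lock);
    d->notify_cb(d);
    pthread_mutex_unlock(&d->lock);
}

/* Conservative: the lock is dropped before the notify runs. */
static void demo_notify_unlocked(demo_drv_t *d)
{
    pthread_mutex_lock(&d->lock);
    /* ... driver state would be updated here ... */
    pthread_mutex_unlock(&d->lock);
    d->notify_cb(d);
}
```

With demo_loopback_cb installed, demo_notify_locked() leaves EDEADLK in reentry_err, while demo_notify_unlocked() leaves 0. With an ordinary kernel mutex the first case would simply hang. Build with something like cc -pthread.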

Or must I go to contortions (such as adding my own driver taskq, or a soft interrupt) to ensure that I never call this while holding a driver-specific lock?
what's the problem with releasing the lock, then calling mac_link_notify()?

You can't always just "release" the lock in the middle of your code. There are often considerations about whether it is actually _safe_ to drop the lock. Assumptions that you made, and that were protected by the lock, may no longer hold after the lock is dropped. (Of course, if you're doing the notify at nearly the same place as you drop the lock anyway, it's easy. But if the notify is deep inside nested code, it's much harder to do safely.)

For example, on the tx hot path, I might find that I'm out of descriptors. If that happens, I need to attempt to reclaim them. If I do reclaim some, and if the mac layer was previously suspended due to flow control, then I need to notify the mac layer.

All this happens in the hot path, and dropping and reacquiring the lock here would be a significant pain in the butt.

I can try to track whether I've reclaimed resources or not, and do the notify later after the lock is released, but that significantly complicates the code, separates the action of notification far from the point when I've discovered that I should do the notification, and is thus likely to be a source of errors, particularly when the code is maintained by someone else who may not see the bigger picture.
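
For concreteness, the deferred approach described above might look roughly like the following user-space sketch. All demo_* names and fields are hypothetical, the descriptor ring is reduced to two counters, and demo_tx_update() merely stands in for the real mac_tx_update() call.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical transmit state; a real driver would track a descriptor ring. */
typedef struct demo_tx {
    pthread_mutex_t lock;
    int free_desc;     /* descriptors currently available */
    int reclaimable;   /* descriptors the (simulated) hardware has finished with */
    bool stalled;      /* true once the upper layer has been told to back off */
    int updates_sent;  /* demo-only counter, deliberately not lock protected */
} demo_tx_t;

static void demo_tx_init(demo_tx_t *t, int ndesc)
{
    pthread_mutex_init(&t->lock, NULL);
    t->free_desc = ndesc;
    t->reclaimable = 0;
    t->stalled = false;
    t->updates_sent = 0;
}

/* Stand-in for the upper-layer notification; called with no locks held. */
static void demo_tx_update(demo_tx_t *t)
{
    t->updates_sent++;
}

/* Returns true if a descriptor was consumed for the packet. */
static bool demo_tx_send(demo_tx_t *t)
{
    bool need_update = false;
    bool sent = false;

    pthread_mutex_lock(&t->lock);
    if (t->free_desc == 0 && t->reclaimable > 0) {
        /* Out of descriptors: reclaim whatever has completed. */
        t->free_desc += t->reclaimable;
        t->reclaimable = 0;
        if (t->stalled) {
            t->stalled = false;
            need_update = true;  /* remember the notify, but do not call it yet */
        }
    }
    if (t->free_desc > 0) {
        t->free_desc--;
        sent = true;
    } else {
        t->stalled = true;       /* upper layer must wait for an update */
    }
    pthread_mutex_unlock(&t->lock);

    /* The notification happens here, away from where the need was discovered. */
    if (need_update)
        demo_tx_update(t);
    return sent;
}
```

Even in this toy version the flag has to be carried from the reclaim branch to the end of the function; in a driver where the reclaim sits several calls deep, that flag would have to be threaded back up through every caller.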

It's much simpler if I have just one call to mac_tx_update, at the point at which I've discovered that I need to do it, rather than deferring it to some other point after I've dropped the lock.

Will this guarantee hold true going forward? Obviously, since it does a lot to simplify driver design, and eliminate extra context switches, I'd really prefer to be able to rely on this behavior.
we're redesigning nemo's locking as part of the crossbow project. you will hear more about this in several weeks time. for now, I'd suggest that you try not to hold locks across any mac_* interface. if there are issues with doing this, let us know and we'll take your needs into account in our new design.

See the above. I'd really, really like to have mac_xxx_update() continue to use the asynchronous taskq notification and therefore be safe to call with locks held, if at all possible. Obviously the other mac functions (e.g. mac_rx, etc.) really do need to be called without any locks held.
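
A rough illustration of why an asynchronous notification is safe to request with locks held: the request only records that work is pending, and the callback runs later from another context. This is a user-space sketch, not the actual mac layer; all demo_* names are hypothetical, and demo_notifier_drain() stands in for whatever a taskq thread would execute.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical driver soft state with a callback that takes the driver lock. */
typedef struct demo_softc {
    pthread_mutex_t lock;
    int cb_runs;  /* callback got the driver lock */
    int cb_busy;  /* callback found the driver lock already held */
} demo_softc_t;

/* Hypothetical notifier; qlock is never held while the callback runs. */
typedef struct demo_notifier {
    pthread_mutex_t qlock;
    bool pending;
    void (*cb)(void *);
    void *cb_arg;
} demo_notifier_t;

static void demo_softc_init(demo_softc_t *sc)
{
    pthread_mutex_init(&sc->lock, NULL);
    sc->cb_runs = 0;
    sc->cb_busy = 0;
}

static void demo_notifier_init(demo_notifier_t *n, void (*cb)(void *), void *arg)
{
    pthread_mutex_init(&n->qlock, NULL);
    n->pending = false;
    n->cb = cb;
    n->cb_arg = arg;
}

/* Callback that loops back and wants the driver lock. */
static void demo_link_cb(void *arg)
{
    demo_softc_t *sc = arg;

    if (pthread_mutex_trylock(&sc->lock) == 0) {
        sc->cb_runs++;
        pthread_mutex_unlock(&sc->lock);
    } else {
        sc->cb_busy++;
    }
}

/* Safe with driver locks held: it only marks work pending and never calls cb. */
static void demo_update_async(demo_notifier_t *n)
{
    pthread_mutex_lock(&n->qlock);
    n->pending = true;
    pthread_mutex_unlock(&n->qlock);
}

/* Runs later, from a context holding no driver locks; true if cb was called. */
static bool demo_notifier_drain(demo_notifier_t *n)
{
    bool run;

    pthread_mutex_lock(&n->qlock);
    run = n->pending;
    n->pending = false;
    pthread_mutex_unlock(&n->qlock);

    if (run)
        n->cb(n->cb_arg);
    return run;
}
```

Calling demo_update_async() while holding sc.lock never invokes the callback; the callback only runs when demo_notifier_drain() is called after the lock has been dropped. Whether the real mac_xxx_update() routines keep this property is exactly the open question in this thread.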



   -- Garrett
