On 11/12/2010 01:29 PM, Mathieu Desnoyers wrote:
* Paolo Bonzini ([email protected]) wrote:
On 11/12/2010 01:19 PM, Mathieu Desnoyers wrote:
* Paolo Bonzini ([email protected]) wrote:
Return nothing can be useful because it can be optimized on x86
as "lock orl (mem), reg/imm".  However, there are no other
return-nothing atomic ops in uatomic_*.h so I decided not to
provide this.


How about we start by implementing uatomic_and/uatomic_or that
return void, and if we ever need
uatomic_return_and/uatomic_return_or (see my other mail), we add
them ?

I am using uatomic_or's return value in my call_rcu-with-futex, but
I don't need atomic-or-and-exchange really, I can even change it to
a load followed by an atomic or (it introduces a race but it is
benign).

If the race does not matter, why do we need the new atomic op in the
first place ? ;)

Say the "correct" code would be

   atomic x = *p, *p |= FLAG1;   # uatomic_xchg_or
   if (x & ~FLAG2)
       futex_wake (...)

I may try to rewrite it like this:

   x = *p;                       # 1
   atomic *p |= FLAG1;           # 2, uatomic_or
   if (x & ~FLAG2)
       futex_wake (...)

This is racy because something can set or reset FLAG2 between 1 and 2. Setting it may cause spurious wakeups, which are okay. If I know that nothing can reset FLAG2 between 1 and 2, correctness is preserved.

Instead if I remove atomicity, I get races on read-modify-write operations, and that is much more obviously incorrect.

But I really do not like being tricky (futexes are tricky enough on their own), and rather than spending a lot of time proving this correct I'd probably just use cmpxchg here.

Paolo

_______________________________________________
ltt-dev mailing list
[email protected]
http://lists.casi.polymtl.ca/cgi-bin/mailman/listinfo/ltt-dev

Reply via email to