Re: [Cocci] Out of tree instrumentation with coccinelle demo

Luis R. Rodriguez Mon, 09 May 2016 14:11:13 -0700

On Sun, May 08, 2016 at 09:51:43PM +0200, Julia Lawall wrote:
> On Sun, 8 May 2016, Luis R. Rodriguez wrote:
> > @ find_drv_name @
> > identifier drv;
> > identifier mutex;
> > @@
> > 
> > pthread_mutex_protects_3(&drv->mutex, ...);
> > 
> > @ find_hint @
> > type T;
> > T *drv;
> > identifier mutex, item_1, item_2, item_3;
> > @@
> > 
> > pthread_mutex_protects_3(&drv->mutex, drv->item_1, drv->item_2, 
> > drv->item_3);
> 
> Why do you need both of the above rules?  I think that the second one 
> should be fine.


The second one infers the type but does not care for the location of
drv, the first one is used to get the name of the global variable.
We cannot share *drv from the second rule on further rules.

> > @ find_pthread_fn depends on find_hint @
> > identifier fn, ret;
> > expression thread, attr, val;
> > @@
> > 
> > ret = pthread_create(thread, attr, fn, val);
> 
> The connection to find_hint is rather weak here.  Is there any way that 
> they could be better tied together?

The hint would be very application and domain specific, in the pthread case we
could for instance extend the hint support to be pthread related and also
require the hint to require the callback.

To make emphasis of this as a desired change I'll go ahead and do that
but also document how this is application specific.

> > @ check_fn_access @
> > identifier find_pthread_fn.fn;
> > type find_hint.T;
> > T *drv;
> > identifier find_hint.item_1;
> > identifier find_hint.item_2;
> > identifier find_hint.item_3;
> > @@
> > 
> > fn (...)
> > {
> >     <+...
> > (
> >     drv->item_1
> > |
> >     drv->item_2
> > |
> >     drv->item_3
> > )
> >         ...+>
> > }
> >
> > @ check_helpers depends on find_pthread_fn @
> > identifier helper;
> > identifier find_pthread_fn.fn;
> > identifier find_hint.mutex;
> > type find_hint.T;
> > T *drv;
> > @@
> > 
> > fn (...)
> > {
> >     <+... when != pthread_mutex_lock(&drv->mutex);
> >     helper(...);
> >     ...+>
> > }
> 
> The depends on should not be needed in the above two rules, because they 
> can only match if fn has a value, which can only happen if find_pthread_fn 
> matched.

Amended, thanks!

> In each case, fn is the callback function of the thread_create.  I guess 
> it is the function to be run by the thread.

Right, each thread.

> The first rule looks for a 
> direct reference to one of the protected fields.  The second rule looks 
> for a call to another function, in the case where the protecting lock is 
> not taken anywhere in the function.

Right. Its the same routine though, and now I understand why you mentioned
that the hint was rather loose with respect to fn.


> I'm surprised that there is no consideration of the lock in the first 
> rule.  The rules seem to be doing sort of the same thing, but don't have 
> the same constraints.

Right, so since fn is the same (but could have been any as you noted, but
constrained in the iteration I posted to pthread_create()) I split this up to
enable to query if fn made reference to any of the protected fields, separately
from querying if fn had the lock but also had helpers.  Its certainly possible
to combine them but I cared more for knowing the lock was not held during each
helper's use. Since fn is the same for both (this is made stronger with the
suggestion you made of making a stringer link, which I'm now implementing
by requiring the hint to be explicit about the call that we'd start
caring about the critical areas for), I could later ask simply if both
cases were true, and likewise a mismatch of helpers with / without locks
existed, then the right thing to do would be move to move the lock to
the initial fn call.

Technically this would miss the case where fn() accesses the critical sections
with a lock but does not hold the lock for the helpers. Will fix this then
by joining the rules.

> But in general, I think that there are two cases that are wanted:
> 
> ... when != lock()
> XXX
> ... when any
> 
> and
> 
> ... when any
> unlock()
> ... when != lock()
> XXX
> ... when any
> 
> That is, in the first case there has been no lock yet and in the second 
> case there was an unlock and there was no subsequent lock.

Yes, for simplicity, given this is just an example, can we just
consense this as follows:

fn (...)
{
        <+... when != 
\(pthread_mutex_lock(&drv->mutex)\|pthread_mutex_lock(&drv->mutex)\)
(
        drv->item_1
|
        drv->item_2
|
        drv->item_3
|
        helper(...)
)
        ...+>
}

There may be more complex fn() where things get intertwined, for instance a
lock here and there, but some code without a lock, but since this is an example,
I think simply ensuring no lock is used at all is fair. Thoughts ?

> > @ helper_accessing_with_lock exists @
> > identifier check_helpers.helper;
> > type find_hint.T;
> > T *drv;
> > identifier find_hint.mutex;
> > identifier find_hint.item_1;
> > identifier find_hint.item_2;
> > identifier find_hint.item_3;
> > position p1, p2;
> > @@
> > 
> > helper(...)
> > {
> >     ...
> >     pthread_mutex_lock@p1(&drv->mutex);
> >     <+...
> > (
> >     drv->item_1
> > |
> >     drv->item_2
> > |
> >     drv->item_3
> > )
> >         ...+>
> >     pthread_mutex_unlock@p2(&drv->mutex);
> > }
> 
> This requires the unlock to be at the end of the function.  The whole 
> function body should be surrounded in <+... ...+>

Indeed, thanks.

> This is finding that everything is OK for the helper, because the helper 
> has a lock of its own.

Correct.

> > @ helper_accessing_without_lock exists @
> > identifier check_helpers.helper;
> > type find_hint.T;
> > T *drv;
> > identifier find_hint.mutex;
> > identifier find_hint.item_1;
> > identifier find_hint.item_2;
> > identifier find_hint.item_3;
> > @@
> > 
> > helper(...)
> > {
> >     <+... when != pthread_mutex_lock(&drv->mutex);
> > (
> >     drv->item_1
> > |
> >     drv->item_2
> > |
> >     drv->item_3
> > )
> >         ...+>
> > }
> 
> This one is finding that the helper is not correct, because it doesn't 
> have a lock either, and it does have references.  See the above two 
> patterns for a better way to match the case where the references are not 
> in a lock.

Thanks.

> > @ remove_helper_lock @
> > identifier check_helpers.helper;
> > type find_hint.T;
> > T *drv;
> > identifier find_hint.mutex;
> > identifier find_hint.item_1;
> > identifier find_hint.item_2;
> > identifier find_hint.item_3;
> > position helper_accessing_with_lock.p1, helper_accessing_with_lock.p2;
> > @@
> > 
> > helper(...)
> > {
> >     ...
> > -   pthread_mutex_lock@p1(&drv->mutex);
> >     ...
> > -   pthread_mutex_unlock@p2(&drv->mutex);
> > }
> > 
> > 
> > @ fix_top_level @
> > identifier find_pthread_fn.fn;
> > identifier find_hint.mutex;
> > identifier find_drv_name.drv;
> > @@
> > 
> > fn (...)
> > {
> > +   pthread_mutex_lock(&drv->mutex);
> >     ...
> > +   pthread_mutex_unlock(&drv->mutex);
> > }
> 
> Hmm, not sure to understand the goal of the above two rules.  Why do you 
> want to put the lock around the entire body of the callback function?

Great question.

That's because the state machine used is a bit fragile, it requires
that the thread state (party->tid_status[tid]) must be in sync with
an action, in this case eat() or cleanup(). So there are two fixes
to the locking of this program with the current sample implementation:

0. Although the program has fed us hints about the critical section a
   mutex protects it has not provided any hints about the state machine
   used. When that's true, one can only assume all variables that are
   critical could be part of the state machine as it is difficult
   to infer what defines the state machine. If a state machine is used
   and relied as part of the critical sections protected by a lock
   its access must be held serially over full access to all other
   critical variables. To see this one can try a simple fix by locking
   only whenever any critical section is accessed, and see how it has
   issues:

diff --git a/main.c b/main.c
index 2fe9a0f9919f..3685d27425d3 100644
--- a/main.c
+++ b/main.c
@@ -65,6 +65,7 @@ void cleanup(long tid)
        bool clean_follow = true;
        long t;
 
+       pthread_mutex_lock(&party->mutex);
        for (t=0; t < NUM_THREADS; t++) {
                if (t == tid)
                        continue;
@@ -75,13 +76,16 @@ void cleanup(long tid)
 
        if (clean_follow)
                party->trash[tid] = 1;
+       pthread_mutex_unlock(&party->mutex);
 }
 
 void *thread_party(void *t)
 {
        long tid = (long)t;
 
+       pthread_mutex_lock(&party->mutex);
        party->tid_status[tid] = THREAD_AT_PARTY;
+       pthread_mutex_unlock(&party->mutex);
 
        eat(tid);
 
@@ -90,7 +94,9 @@ void *thread_party(void *t)
 
        cleanup(tid);
 
+       pthread_mutex_lock(&party->mutex);
        party->tid_status[tid] = THREAD_DONE;
+       pthread_mutex_unlock(&party->mutex);
 
        pthread_exit(NULL);
 }

   With this in place all critical sections are now locked, however
   there are still issues seen at run time. In lack of any information
   about the state machine implementation one can only assume the worst,
   and lock serially, as follows:

diff --git a/main.c b/main.c
index 2fe9a0f9919f..1bd89b9186c0 100644
--- a/main.c
+++ b/main.c
@@ -42,7 +42,6 @@ void eat(long tid)
        bool eat_in_order = true;
        long t;
 
-       pthread_mutex_lock(&party->mutex);
        /* If anyone ate out of place lets take notice of that... */
        for (t=0; t < NUM_THREADS; t++) {
                if (t == tid)
@@ -57,7 +56,6 @@ void eat(long tid)
        if (party->food[tid])
                if (eat_in_order)
                        party->food[tid] = 0;
-       pthread_mutex_unlock(&party->mutex);
 }
 
 void cleanup(long tid)
@@ -81,6 +79,7 @@ void *thread_party(void *t)
 {
        long tid = (long)t;
 
+       pthread_mutex_lock(&party->mutex);
        party->tid_status[tid] = THREAD_AT_PARTY;
 
        eat(tid);
@@ -91,6 +90,7 @@ void *thread_party(void *t)
        cleanup(tid);
 
        party->tid_status[tid] = THREAD_DONE;
+       pthread_mutex_unlock(&party->mutex);
 
        pthread_exit(NULL);
 }


   This is the strategy the current SmPL patch tries to address.

1. With access to state machine hints Coccinelle may be able
   to do more, that would be step 2 in optimization for a solution
   here, the outlook of the hint and SmPL patch should be considered
   as a secondary step in evolving this instrumentation, but the
   results are as follows:

diff --git a/main.c b/main.c
index 2fe9a0f9919f..518b8c5c843b 100644
--- a/main.c
+++ b/main.c
@@ -42,7 +42,6 @@ void eat(long tid)
        bool eat_in_order = true;
        long t;
 
-       pthread_mutex_lock(&party->mutex);
        /* If anyone ate out of place lets take notice of that... */
        for (t=0; t < NUM_THREADS; t++) {
                if (t == tid)
@@ -57,7 +56,6 @@ void eat(long tid)
        if (party->food[tid])
                if (eat_in_order)
                        party->food[tid] = 0;
-       pthread_mutex_unlock(&party->mutex);
 }
 
 void cleanup(long tid)
@@ -81,16 +79,20 @@ void *thread_party(void *t)
 {
        long tid = (long)t;
 
+       pthread_mutex_lock(&party->mutex);
        party->tid_status[tid] = THREAD_AT_PARTY;
 
        eat(tid);
+       pthread_mutex_unlock(&party->mutex);
 
        if (tid % 2 == 1)
                sleep_random();
 
+       pthread_mutex_lock(&party->mutex);
        cleanup(tid);
 
        party->tid_status[tid] = THREAD_DONE;
+       pthread_mutex_unlock(&party->mutex);
 
        pthread_exit(NULL);
 }

   In this case the state machine, party->tid_status is only
   locked serially against access helpers to the critical
   sections.

> The weakness seems to be that helper has to be called directly from the 
> callback function.

In this case yes, that is true.

> One would have to use iteration to trace the calls  down more deeply.

I see, thanks!

> Also, the whole set of rules would be needed for 1, 2, 3, ...  With 
> iteration, one could schedule the work on the protected fields one by one, 
> if the protection declaration could take any number of arguments.

Interesting, thanks. Its not clear how we could devise a strategy in C
to enable overloading, if that's what you mean, otherwise I was not able
to follow what you hinted at here.

  Luis
_______________________________________________
Cocci mailing list
[email protected]
https://systeme.lip6.fr/mailman/listinfo/cocci

Re: [Cocci] Out of tree instrumentation with coccinelle demo

Reply via email to