On 9/28/20 4:17 PM, Andrew Stubbs wrote:
> On 28/09/2020 15:02, Tom de Vries wrote:
>>> This patch simply skips barriers when they would "wait" for only one
>>> thread (the current thread). This means that teams nested inside other
>>> teams now run independently, instead of strictly in lock-step, and is
>>> only valid as long as inner teams are limited to one thread each
>>> (currently the case).
>>
>> Is this inner-team-one-thread-limit coded or documented somewhere?
> 
> In libgomp/parallel.c, gomp_resolve_num_threads we have:
> 
>   else if (thr->ts.active_level >= 1 && !icv->nest_var)
>     return 1;
> 
>> If so, it might be good to add a comment there referring to the code
>> this patch adds.
> 
>   /* Accelerators with fixed thread counts require this to return 1 for
>      nested parallel regions.  */
> 
> WDYT?

Yep, looks good, thanks.
- Tom

Reply via email to