Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-16 Thread Steven Rostedt
On Wed, 16 Aug 2017 23:04:14 +0900
Byungchul Park  wrote:

> On Wed, Aug 16, 2017 at 09:32:44AM -0400, Steven Rostedt wrote:
> > On Wed, 16 Aug 2017 11:17:36 +0900
> > Byungchul Park  wrote:
> > 
> >   
> > > Yes, that's what I intended. IOW:
> > > 
> > > If (we found a proper sd, not having SD_PREFER_SIBLING?)
> > >   use the sd;
> > > else if (we found a proper sd, having SD_PREFER_SIBLING?)
> > >   use the smallest sd among SD_PREFER_SIBLING sds;  
> > 
> > BTW, what do you mean by "smallest sd"?  
> 
> There might be more than one SD_PREFER_SIBLING domain in its hierachy.
> In that case, we have to choose one of them. Imagine the following
> example, in case that the source cpu is cpu 0:
> 
> [Domain hierachy for cpu 0]
> 
> cpu 0 -+ domain 1 -+
>| SD_PREFER_SIBLING flaged  |
> cpu 1 -+   +- domain 2
>|  SD_PREFER_SIBLING flaged
> cpu 2 -+---+
>|
> cpu 3 -+
> 
> In this case, we have to choose domain 1 than 2, because cpus in domain 1
> are closer to the source cpu, cpu 0. That's what I meant.

Then you mean "closest sd", at least that makes more sense in the
context.

-- Steve


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-16 Thread Byungchul Park
On Wed, Aug 16, 2017 at 09:32:44AM -0400, Steven Rostedt wrote:
> On Wed, 16 Aug 2017 11:17:36 +0900
> Byungchul Park  wrote:
> 
> 
> > Yes, that's what I intended. IOW:
> > 
> > If (we found a proper sd, not having SD_PREFER_SIBLING?)
> > use the sd;
> > else if (we found a proper sd, having SD_PREFER_SIBLING?)
> > use the smallest sd among SD_PREFER_SIBLING sds;
> 
> BTW, what do you mean by "smallest sd"?

There might be more than one SD_PREFER_SIBLING domain in its hierachy.
In that case, we have to choose one of them. Imagine the following
example, in case that the source cpu is cpu 0:

[Domain hierachy for cpu 0]

cpu 0 -+ domain 1 -+
   | SD_PREFER_SIBLING flaged  |
cpu 1 -+   +- domain 2
   |  SD_PREFER_SIBLING flaged
cpu 2 -+---+
   |
cpu 3 -+

In this case, we have to choose domain 1 than 2, because cpus in domain 1
are closer to the source cpu, cpu 0. That's what I meant.


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-16 Thread Steven Rostedt
On Wed, 16 Aug 2017 11:17:36 +0900
Byungchul Park  wrote:


> Yes, that's what I intended. IOW:
> 
> If (we found a proper sd, not having SD_PREFER_SIBLING?)
>   use the sd;
> else if (we found a proper sd, having SD_PREFER_SIBLING?)
>   use the smallest sd among SD_PREFER_SIBLING sds;

BTW, what do you mean by "smallest sd"?

-- Steve


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-15 Thread Byungchul Park
On Tue, Aug 15, 2017 at 09:42:01PM -0400, Steven Rostedt wrote:
> > > > @@ -1385,6 +1407,17 @@ static int find_later_rq(struct task_struct 
> > > > *task)
> > > >  * already under consideration through 
> > > > later_mask.
> > > >  */
> > > > if (best_cpu < nr_cpu_ids) {
> > > > +   /*
> > > > +* If current domain is 
> > > > SD_PREFER_SIBLING
> > > > +* flaged, we have to get more chances 
> > > > to
> > > > +* check other siblings.
> 
> BTW, "we have to get more chances" doesn't really make sense. Do you
> mean "we need to try other domains"?

Yes, we need to try other domains first if current domain is
SD_PREFER_SIBLING flaged.

> > > > +*/
> > > > +   if (sd->flags & SD_PREFER_SIBLING) {
> > > > +   prefer = sd;  
> > > 
> > > Is this how the SD_PREFER_SIBLING works? According to this, the
> > > preferred sd is the next sd in for_each_domain(). Not to mention, the
> > > prefer variable stays set if the next domain has no available CPUs. Is
> > > that what we want?  
> > 
> > Maybe I don't understand what you want to say. The variable, prefer, is
> > used to pick up the smallest sched domain among SD_PREFER_SIBLING
> > domains, if more than one SD_PREFER_SIBLING domain exist in the visit.
> > 
> > The prefer variable alway points to the previous SD_PREFER_SIBLING domain.
> > And that must stay set to be used as a fallback choise if the next domain
> > has no available CPUs.
> > 
> > Could you explain what I mis-understand?
> >
> 
> I may be the one confused here ;-)
> 
> I think I misread the patch. So, the SD_PREFER_SIBLING means to try to
> find a CPU in another sd instead? Thus, we try to find a CPU in a sd
> that does not have SD_PREFER_SIBLING set. And if there is none, we use
> the preferred sd as a fallback. Is that correct?

Yes, that's what I intended. IOW:

If (we found a proper sd, not having SD_PREFER_SIBLING?)
use the sd;
else if (we found a proper sd, having SD_PREFER_SIBLING?)
use the smallest sd among SD_PREFER_SIBLING sds;

Thanks,
Byungchul


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-15 Thread Steven Rostedt
On Wed, 16 Aug 2017 09:38:11 +0900
Byungchul Park  wrote:

> On Tue, Aug 15, 2017 at 11:19:40AM -0400, Steven Rostedt wrote:
> > > @@ -1385,6 +1407,17 @@ static int find_later_rq(struct task_struct *task)
> > >* already under consideration through later_mask.
> > >*/
> > >   if (best_cpu < nr_cpu_ids) {
> > > + /*
> > > +  * If current domain is SD_PREFER_SIBLING
> > > +  * flaged, we have to get more chances to
> > > +  * check other siblings.

BTW, "we have to get more chances" doesn't really make sense. Do you
mean "we need to try other domains"?

> > > +  */
> > > + if (sd->flags & SD_PREFER_SIBLING) {
> > > + prefer = sd;  
> > 
> > Is this how the SD_PREFER_SIBLING works? According to this, the
> > preferred sd is the next sd in for_each_domain(). Not to mention, the
> > prefer variable stays set if the next domain has no available CPUs. Is
> > that what we want?  
> 
> Maybe I don't understand what you want to say. The variable, prefer, is
> used to pick up the smallest sched domain among SD_PREFER_SIBLING
> domains, if more than one SD_PREFER_SIBLING domain exist in the visit.
> 
> The prefer variable alway points to the previous SD_PREFER_SIBLING domain.
> And that must stay set to be used as a fallback choise if the next domain
> has no available CPUs.
> 
> Could you explain what I mis-understand?
>

I may be the one confused here ;-)

I think I misread the patch. So, the SD_PREFER_SIBLING means to try to
find a CPU in another sd instead? Thus, we try to find a CPU in a sd
that does not have SD_PREFER_SIBLING set. And if there is none, we use
the preferred sd as a fallback. Is that correct?

I'm not familiar with the SD_PREFER_SIBLING flag, the only
documentation I can find about it is the comment that states:

  /* Prefer to place tasks in a sibling domain */

And the very informative git commit change log:

commit b5d978e0c7e79a7ff842e895c85a86b38c71f1cd
Date:   Tue Sep 1 10:34:33 2009 +0200

sched: Add SD_PREFER_SIBLING

Do the placement thing using SD flags.

 ;-)

-- Steve


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-15 Thread Byungchul Park
On Tue, Aug 15, 2017 at 11:19:40AM -0400, Steven Rostedt wrote:
> > @@ -1385,6 +1407,17 @@ static int find_later_rq(struct task_struct *task)
> >  * already under consideration through later_mask.
> >  */
> > if (best_cpu < nr_cpu_ids) {
> > +   /*
> > +* If current domain is SD_PREFER_SIBLING
> > +* flaged, we have to get more chances to
> > +* check other siblings.
> > +*/
> > +   if (sd->flags & SD_PREFER_SIBLING) {
> > +   prefer = sd;
> 
> Is this how the SD_PREFER_SIBLING works? According to this, the
> preferred sd is the next sd in for_each_domain(). Not to mention, the
> prefer variable stays set if the next domain has no available CPUs. Is
> that what we want?

Maybe I don't understand what you want to say. The variable, prefer, is
used to pick up the smallest sched domain among SD_PREFER_SIBLING
domains, if more than one SD_PREFER_SIBLING domain exist in the visit.

The prefer variable alway points to the previous SD_PREFER_SIBLING domain.
And that must stay set to be used as a fallback choise if the next domain
has no available CPUs.

Could you explain what I mis-understand?

Thanks,
Byungchul

> -- Steve
> 
> 
> > +   if (fallback_cpu == -1)
> > +   fallback_cpu = best_cpu;
> > +   continue;
> > +   }
> > rcu_read_unlock();
> > return best_cpu;
> > }
> > @@ -1393,6 +1426,13 @@ static int find_later_rq(struct task_struct *task)
> > rcu_read_unlock();
> >  
> > /*
> > +* If fallback_cpu is valid, all our guesses failed *except* for
> > +* SD_PREFER_SIBLING domain. Now, we can return the fallback cpu.
> > +*/
> > +   if (fallback_cpu != -1)
> > +   return fallback_cpu;
> > +
> > +   /*
> >  * At this point, all our guesses failed, we just return
> >  * 'something', and let the caller sort the things out.
> >  */


Re: [PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-15 Thread Steven Rostedt
On Mon,  7 Aug 2017 12:50:33 +0900
Byungchul Park  wrote:

> It would be better to avoid pushing tasks to other cpu within
> a SD_PREFER_SIBLING domain, instead, get more chances to check other
> siblings.
> 
> Signed-off-by: Byungchul Park 
> ---
>  kernel/sched/deadline.c | 46 +++---
>  1 file changed, 43 insertions(+), 3 deletions(-)
> 
> diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
> index 0223694..2fd1591 100644
> --- a/kernel/sched/deadline.c
> +++ b/kernel/sched/deadline.c
> @@ -1319,12 +1319,35 @@ static struct task_struct 
> *pick_earliest_pushable_dl_task(struct rq *rq, int cpu
>  
>  static DEFINE_PER_CPU(cpumask_var_t, local_cpu_mask_dl);
>  
> +/*
> + * Find the first cpu in: mask & sd & ~prefer
> + */
> +static int find_cpu(const struct cpumask *mask,
> + const struct sched_domain *sd,
> + const struct sched_domain *prefer)
> +{
> + const struct cpumask *sds = sched_domain_span(sd);
> + const struct cpumask *ps  = prefer ? sched_domain_span(prefer) : NULL;
> + int cpu = -1;
> +
> + while ((cpu = cpumask_next(cpu, mask)) < nr_cpu_ids) {
> + if (!cpumask_test_cpu(cpu, sds))
> + continue;
> + if (ps && cpumask_test_cpu(cpu, ps))
> + continue;
> + break;
> + }
> +
> + return cpu;
> +}
> +
>  static int find_later_rq(struct task_struct *task)
>  {
> - struct sched_domain *sd;
> + struct sched_domain *sd, *prefer = NULL;
>   struct cpumask *later_mask = 
> this_cpu_cpumask_var_ptr(local_cpu_mask_dl);
>   int this_cpu = smp_processor_id();
>   int cpu = task_cpu(task);
> + int fallback_cpu = -1;
>  
>   /* Make sure the mask is initialized first */
>   if (unlikely(!later_mask))
> @@ -1376,8 +1399,7 @@ static int find_later_rq(struct task_struct *task)
>   return this_cpu;
>   }
>  
> - best_cpu = cpumask_first_and(later_mask,
> - sched_domain_span(sd));
> + best_cpu = find_cpu(later_mask, sd, prefer);
>   /*
>* Last chance: if a cpu being in both later_mask
>* and current sd span is valid, that becomes our
> @@ -1385,6 +1407,17 @@ static int find_later_rq(struct task_struct *task)
>* already under consideration through later_mask.
>*/
>   if (best_cpu < nr_cpu_ids) {
> + /*
> +  * If current domain is SD_PREFER_SIBLING
> +  * flaged, we have to get more chances to
> +  * check other siblings.
> +  */
> + if (sd->flags & SD_PREFER_SIBLING) {
> + prefer = sd;

Is this how the SD_PREFER_SIBLING works? According to this, the
preferred sd is the next sd in for_each_domain(). Not to mention, the
prefer variable stays set if the next domain has no available CPUs. Is
that what we want?

-- Steve


> + if (fallback_cpu == -1)
> + fallback_cpu = best_cpu;
> + continue;
> + }
>   rcu_read_unlock();
>   return best_cpu;
>   }
> @@ -1393,6 +1426,13 @@ static int find_later_rq(struct task_struct *task)
>   rcu_read_unlock();
>  
>   /*
> +  * If fallback_cpu is valid, all our guesses failed *except* for
> +  * SD_PREFER_SIBLING domain. Now, we can return the fallback cpu.
> +  */
> + if (fallback_cpu != -1)
> + return fallback_cpu;
> +
> + /*
>* At this point, all our guesses failed, we just return
>* 'something', and let the caller sort the things out.
>*/



[PATCH v6 1/2] sched/deadline: Add support for SD_PREFER_SIBLING on find_later_rq()

2017-08-06 Thread Byungchul Park
It would be better to avoid pushing tasks to other cpu within
a SD_PREFER_SIBLING domain, instead, get more chances to check other
siblings.

Signed-off-by: Byungchul Park 
---
 kernel/sched/deadline.c | 46 +++---
 1 file changed, 43 insertions(+), 3 deletions(-)

diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
index 0223694..2fd1591 100644
--- a/kernel/sched/deadline.c
+++ b/kernel/sched/deadline.c
@@ -1319,12 +1319,35 @@ static struct task_struct 
*pick_earliest_pushable_dl_task(struct rq *rq, int cpu
 
 static DEFINE_PER_CPU(cpumask_var_t, local_cpu_mask_dl);
 
+/*
+ * Find the first cpu in: mask & sd & ~prefer
+ */
+static int find_cpu(const struct cpumask *mask,
+   const struct sched_domain *sd,
+   const struct sched_domain *prefer)
+{
+   const struct cpumask *sds = sched_domain_span(sd);
+   const struct cpumask *ps  = prefer ? sched_domain_span(prefer) : NULL;
+   int cpu = -1;
+
+   while ((cpu = cpumask_next(cpu, mask)) < nr_cpu_ids) {
+   if (!cpumask_test_cpu(cpu, sds))
+   continue;
+   if (ps && cpumask_test_cpu(cpu, ps))
+   continue;
+   break;
+   }
+
+   return cpu;
+}
+
 static int find_later_rq(struct task_struct *task)
 {
-   struct sched_domain *sd;
+   struct sched_domain *sd, *prefer = NULL;
struct cpumask *later_mask = 
this_cpu_cpumask_var_ptr(local_cpu_mask_dl);
int this_cpu = smp_processor_id();
int cpu = task_cpu(task);
+   int fallback_cpu = -1;
 
/* Make sure the mask is initialized first */
if (unlikely(!later_mask))
@@ -1376,8 +1399,7 @@ static int find_later_rq(struct task_struct *task)
return this_cpu;
}
 
-   best_cpu = cpumask_first_and(later_mask,
-   sched_domain_span(sd));
+   best_cpu = find_cpu(later_mask, sd, prefer);
/*
 * Last chance: if a cpu being in both later_mask
 * and current sd span is valid, that becomes our
@@ -1385,6 +1407,17 @@ static int find_later_rq(struct task_struct *task)
 * already under consideration through later_mask.
 */
if (best_cpu < nr_cpu_ids) {
+   /*
+* If current domain is SD_PREFER_SIBLING
+* flaged, we have to get more chances to
+* check other siblings.
+*/
+   if (sd->flags & SD_PREFER_SIBLING) {
+   prefer = sd;
+   if (fallback_cpu == -1)
+   fallback_cpu = best_cpu;
+   continue;
+   }
rcu_read_unlock();
return best_cpu;
}
@@ -1393,6 +1426,13 @@ static int find_later_rq(struct task_struct *task)
rcu_read_unlock();
 
/*
+* If fallback_cpu is valid, all our guesses failed *except* for
+* SD_PREFER_SIBLING domain. Now, we can return the fallback cpu.
+*/
+   if (fallback_cpu != -1)
+   return fallback_cpu;
+
+   /*
 * At this point, all our guesses failed, we just return
 * 'something', and let the caller sort the things out.
 */
-- 
1.9.1