[tip: sched/core] sched/fair: Reduce long-tail newly idle balance cost

2021-03-23 Thread tip-bot2 for Aubrey Li
The following commit has been merged into the sched/core branch of tip: Commit-ID: acb4decc1e900468d51b33c5f1ee445278e716a7 Gitweb: https://git.kernel.org/tip/acb4decc1e900468d51b33c5f1ee445278e716a7 Author:Aubrey Li AuthorDate:Wed, 24 Feb 2021 16:15:49 +08:00 Committer

[PATCH v10] sched/fair: select idle cpu from idle cpumask for task wakeup

2021-03-15 Thread Aubrey Li
From: Aubrey Li Add idle cpumask to track idle cpus in sched domain. Every time a CPU enters idle, the CPU is set in idle cpumask to be a wakeup target. And if the CPU is not in idle, the CPU is cleared in idle cpumask during scheduler tick to ratelimit idle cpumask update. When a task wakes up

[PATCH v9 1/2] sched/fair: select idle cpu from idle cpumask for task wakeup

2021-03-09 Thread Aubrey Li
From: Aubrey Li Add idle cpumask to track idle cpus in sched domain. Every time a CPU enters idle, the CPU is set in idle cpumask to be a wakeup target. And if the CPU is not in idle, the CPU is cleared in idle cpumask during scheduler tick to ratelimit idle cpumask update. When a task wakes up

[PATCH v9 2/2] sched/fair: Remove SIS_PROP

2021-03-09 Thread Aubrey Li
From: Aubrey Li Scanning idle cpu from the idle cpumask avoid superfluous scans of the LLC domain, as the first bit in the idle cpumask is the target. Considering the selected target could become busy, the idle check is reserved, but SIS_PROP feature becomes meaningless, so remove avg_scan_cost

[PATCH v2] sched/fair: reduce long-tail newly idle balance cost

2021-02-24 Thread Aubrey Li
: Srinivas Pandruvada Cc: Rafael J. Wysocki Signed-off-by: Aubrey Li --- kernel/sched/fair.c | 9 + 1 file changed, 9 insertions(+) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 04a3ce2..5c67804 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -7675,6

[RFC PATCH v1] sched/fair: limit load balance redo times at the same sched_domain level

2021-01-24 Thread Aubrey Li
. Cc: Andi Kleen Cc: Tim Chen Cc: Srinivas Pandruvada Cc: Rafael J. Wysocki Signed-off-by: Aubrey Li --- kernel/sched/fair.c | 7 ++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index ae7ceba..b59f371 100644 --- a/kernel/sched

[PATCH] cpuset: fix typos in comments

2021-01-12 Thread Aubrey Li
Change hierachy to hierarchy and congifured to configured, no functionality changed. Signed-off-by: Aubrey Li --- kernel/cgroup/cpuset.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/kernel/cgroup/cpuset.c b/kernel/cgroup/cpuset.c index 57b5b5d..15f4300 100644

[RFC PATCH v8] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-12-09 Thread Aubrey Li
ter Zijlstra Cc: Mel Gorman Cc: Vincent Guittot Cc: Qais Yousef Cc: Valentin Schneider Cc: Jiang Biao Cc: Tim Chen Signed-off-by: Aubrey Li --- include/linux/sched/topology.h | 13 ++ kernel/sched/core.c| 2 ++ kernel/sched/fair.c| 45 +

[RFC PATCH v7] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-12-08 Thread Aubrey Li
v1->v2: - idle cpumask is updated in the nohz routines, by initializing idle cpumask with sched_domain_span(sd), nohz=off case remains the original behavior. Cc: Peter Zijlstra Cc: Mel Gorman Cc: Vincent Guittot Cc: Qais Yousef Cc: Valentin Schneider Cc: Jiang Biao Cc: Tim Chen Signed

[RFC PATCH v6] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-12-07 Thread Aubrey Li
e nohz routines, by initializing idle cpumask with sched_domain_span(sd), nohz=off case remains the original behavior. Cc: Mel Gorman Cc: Vincent Guittot Cc: Qais Yousef Cc: Valentin Schneider Cc: Jiang Biao Cc: Tim Chen Signed-off-by: Aubrey Li --- include/linux/sched/topology.h | 13 + kern

[RFC PATCH v5] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-11-19 Thread Aubrey Li
d), nohz=off case remains the original behavior. Cc: Mel Gorman Cc: Vincent Guittot Cc: Qais Yousef Cc: Valentin Schneider Cc: Jiang Biao Cc: Tim Chen Signed-off-by: Aubrey Li --- include/linux/sched/topology.h | 13 + kernel/sched/core.c| 2 ++ kernel/sched/fair.c

[RFC PATCH v4] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-11-17 Thread Aubrey Li
From: Aubrey Li Add idle cpumask to track idle cpus in sched domain. When a CPU enters idle, if the idle driver indicates to stop tick, this CPU is set in the idle cpumask to be a wakeup target. And if the CPU is not in idle, the CPU is cleared in idle cpumask during scheduler tick to ratelimit

[PATCH v1] coresched/proc: add forceidle report with coresched enabled

2020-10-29 Thread Aubrey Li
% idle% cpu53 25.24 0.000.00% 3.88% 70.87% Signed-off-by: Aubrey Li --- fs/proc/stat.c | 48 + include/linux/kernel_stat.h | 1 + include/linux/tick.h| 2 ++ kernel/time/tick-sched.c| 48

[RFC PATCH v3] sched/fair: select idle cpu from idle cpumask for task wakeup

2020-10-21 Thread Aubrey Li
From: Aubrey Li Added idle cpumask to track idle cpus in sched domain. When a CPU enters idle, its corresponding bit in the idle cpumask will be set, and when the CPU exits idle, its bit will be cleared. When a task wakes up to select an idle cpu, scanning idle cpumask has low cost than

[RFC PATCH v2] sched/fair: select idle cpu from idle cpumask in sched domain

2020-09-15 Thread Aubrey Li
ask is updated in the nohz routines, by initializing idle cpumask with sched_domain_span(sd), nohz=off case remains the original behavior. Cc: Qais Yousef Cc: Valentin Schneider Cc: Jiang Biao Cc: Tim Chen Signed-off-by: Aubrey Li --- include/linux/sched/topology.h | 13 + ker

[RFC PATCH v1 0/1] select idle cpu from idle cpumask in sched domain

2020-09-11 Thread Aubrey Li
1 Any suggestions are highly appreciated! Thanks, -Aubrey Aubrey Li (1): sched/fair: select idle cpu from idle cpumask in sched domain include/linux/sched/topology.h | 13 + kernel/sched/fair.c| 4 +++- kernel/sched/topology.c| 2 +- 3 files changed, 17

[RFC PATCH v1 1/1] sched/fair: select idle cpu from idle cpumask in sched domain

2020-09-11 Thread Aubrey Li
in last level cache domain, especially when the system is heavily loaded. Signed-off-by: Aubrey Li --- include/linux/sched/topology.h | 13 + kernel/sched/fair.c| 4 +++- kernel/sched/topology.c| 2 +- 3 files changed, 17 insertions(+), 2 deletions(-) diff --git

Re: [RFC PATCH 11/16] sched: migration changes for core scheduling(Internet mail)

2020-07-23 Thread Aubrey Li
;>>>>> Hi, > >>>>>>>> > >>>>>>>>> On Jul 22, 2020, at 8:13 PM, Li, Aubrey > >>>>>>>>> wrote: > >>>>>>>>> > >>>>>>>>> On 2020/7/22 16:

Re: [RFC PATCH 14/16] irq: Add support for core-wide protection of IRQ and softirq

2020-07-10 Thread Aubrey Li
On Fri, Jul 10, 2020 at 9:36 PM Vineeth Remanan Pillai wrote: > > Hi Aubrey, > > On Fri, Jul 10, 2020 at 8:19 AM Li, Aubrey wrote: > > > > Hi Joel/Vineeth, > > [...] > > The problem is gone when we reverted this patch. We are running multiple > > uperf threads(equal to cpu number) in a cgroup

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-09-25 Thread Aubrey Li
On Thu, Sep 26, 2019 at 1:24 AM Tim Chen wrote: > > On 9/24/19 7:40 PM, Aubrey Li wrote: > > On Sat, Sep 7, 2019 at 2:30 AM Tim Chen wrote: > >> +static inline s64 core_sched_imbalance_delta(int src_cpu, int dst_cpu, > >> + in

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-09-24 Thread Aubrey Li
On Sat, Sep 7, 2019 at 2:30 AM Tim Chen wrote: > +static inline s64 core_sched_imbalance_delta(int src_cpu, int dst_cpu, > + int src_sibling, int dst_sibling, > + struct task_group *tg, u64 task_load) > +{ > + struct sched_entity *se, *se_sibling,

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-09-18 Thread Aubrey Li
On Thu, Sep 19, 2019 at 4:41 AM Tim Chen wrote: > > On 9/17/19 6:33 PM, Aubrey Li wrote: > > On Sun, Sep 15, 2019 at 10:14 PM Aaron Lu > > wrote: > > >> > >> And I have pushed Tim's branch to: > >> https://github.com/aaronlu/linux coresch

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-09-17 Thread Aubrey Li
On Sun, Sep 15, 2019 at 10:14 PM Aaron Lu wrote: > > On Fri, Sep 13, 2019 at 07:12:52AM +0800, Aubrey Li wrote: > > On Thu, Sep 12, 2019 at 8:04 PM Aaron Lu wrote: > > > > > > On Wed, Sep 11, 2019 at 09:19:02AM -0700, Tim Chen wrote: > > > > On 9/11/1

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-09-12 Thread Aubrey Li
On Thu, Sep 12, 2019 at 8:04 PM Aaron Lu wrote: > > On Wed, Sep 11, 2019 at 09:19:02AM -0700, Tim Chen wrote: > > On 9/11/19 7:02 AM, Aaron Lu wrote: > > I think Julien's result show that my patches did not do as well as > > your patches for fairness. Aubrey did some other testing with the same >

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-08-27 Thread Aubrey Li
On Wed, Aug 28, 2019 at 5:14 AM Matthew Garrett wrote: > > Apple have provided a sysctl that allows applications to indicate that > specific threads should make use of core isolation while allowing > the rest of the system to make use of SMT, and browsers (Safari, Firefox > and Chrome, at least)

Re: [PATCH] x86/apic: Handle missing global clockevent gracefully

2019-08-12 Thread Aubrey Li
On Mon, Aug 12, 2019 at 8:25 PM Thomas Gleixner wrote: > > On Mon, 12 Aug 2019, Li, Aubrey wrote: > > On 2019/8/9 20:54, Thomas Gleixner wrote: > > > + local_irq_disable(); > > > /* > > > * Setup the APIC counter to maximum. There is no way the lapic > > > * can underflow in the

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-08-06 Thread Aubrey Li
On Tue, Aug 6, 2019 at 11:24 AM Aaron Lu wrote: > > On Mon, Aug 05, 2019 at 08:55:28AM -0700, Tim Chen wrote: > > On 8/2/19 8:37 AM, Julien Desfossez wrote: > > > We tested both Aaron's and Tim's patches and here are our results. > > > > > > Test setup: > > > - 2 1-thread sysbench, one running

Re: setup_boot_APIC_clock() NULL dereference during early boot on reduced hardware platforms

2019-08-01 Thread Aubrey Li
On Thu, Aug 1, 2019 at 3:35 PM Thomas Gleixner wrote: > > On Thu, 1 Aug 2019, Aubrey Li wrote: > > On Thu, Aug 1, 2019 at 2:26 PM Daniel Drake wrote: > > > global_clock_event is NULL here. This is a "reduced hardware" ACPI > > > platform so acpi_gene

Re: setup_boot_APIC_clock() NULL dereference during early boot on reduced hardware platforms

2019-08-01 Thread Aubrey Li
On Thu, Aug 1, 2019 at 2:26 PM Daniel Drake wrote: > > Hi, > > Working with a new consumer laptop based on AMD R7-3700U, we are > seeing a kernel panic during early boot (before the display > initializes). It's a new product and there is no previous known > working kernel version (tested 5.0, 5.2

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-07-22 Thread Aubrey Li
On Mon, Jul 22, 2019 at 6:43 PM Aaron Lu wrote: > > On 2019/7/22 18:26, Aubrey Li wrote: > > The granularity period of util_avg seems too large to decide task priority > > during pick_task(), at least it is in my case, cfs_prio_less() always picked > > core max task, s

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-07-22 Thread Aubrey Li
On Thu, Jul 18, 2019 at 6:07 PM Aaron Lu wrote: > > On Wed, Jun 19, 2019 at 02:33:02PM -0400, Julien Desfossez wrote: > > On 17-Jun-2019 10:51:27 AM, Aubrey Li wrote: > > > The result looks still unfair, and particularly, the variance is too high, > > > > I ju

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-07-19 Thread Aubrey Li
On Fri, Jul 19, 2019 at 1:53 PM Aaron Lu wrote: > > On Thu, Jul 18, 2019 at 04:27:19PM -0700, Tim Chen wrote: > > > > > > On 7/18/19 3:07 AM, Aaron Lu wrote: > > > On Wed, Jun 19, 2019 at 02:33:02PM -0400, Julien Desfossez wrote: > > > > > > > > With the below patch on top of v3 that makes use of

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-06-16 Thread Aubrey Li
On Thu, Jun 13, 2019 at 11:22 AM Julien Desfossez wrote: > > On 12-Jun-2019 05:03:08 PM, Subhra Mazumdar wrote: > > > > On 6/12/19 9:33 AM, Julien Desfossez wrote: > > >After reading more traces and trying to understand why only untagged > > >tasks are starving when there are cpu-intensive tasks

[tip:x86/core] Documentation/filesystems/proc.txt: Add arch_status file

2019-06-12 Thread tip-bot for Aubrey Li
Commit-ID: 711486fd18596315d42cebaac3dba8c408f60a3d Gitweb: https://git.kernel.org/tip/711486fd18596315d42cebaac3dba8c408f60a3d Author: Aubrey Li AuthorDate: Thu, 6 Jun 2019 09:22:36 +0800 Committer: Thomas Gleixner CommitDate: Wed, 12 Jun 2019 11:42:13 +0200 Documentation/filesystems

[tip:x86/core] x86/process: Add AVX-512 usage elapsed time to /proc/pid/arch_status

2019-06-12 Thread tip-bot for Aubrey Li
Commit-ID: 0c608dad2a771c0a11b6d12148d1a8b975e015d4 Gitweb: https://git.kernel.org/tip/0c608dad2a771c0a11b6d12148d1a8b975e015d4 Author: Aubrey Li AuthorDate: Thu, 6 Jun 2019 09:22:35 +0800 Committer: Thomas Gleixner CommitDate: Wed, 12 Jun 2019 11:42:13 +0200 x86/process: Add AVX-512

[tip:x86/core] proc: Add /proc//arch_status

2019-06-12 Thread tip-bot for Aubrey Li
Commit-ID: 68bc30bb9f33fc8d11e3d110d29e06490896a999 Gitweb: https://git.kernel.org/tip/68bc30bb9f33fc8d11e3d110d29e06490896a999 Author: Aubrey Li AuthorDate: Thu, 6 Jun 2019 09:22:34 +0800 Committer: Thomas Gleixner CommitDate: Wed, 12 Jun 2019 11:42:13 +0200 proc: Add /proc

[PATCH v19 1/3] proc: add /proc//arch_status

2019-06-05 Thread Aubrey Li
into its own file /proc//arch_status and make sure that everything in it is namespaced. Signed-off-by: Aubrey Li Acked-by: Andrew Morton Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc: Andrew Morton Cc: Andy

[PATCH v19 3/3] Documentation/filesystems/proc.txt: add arch_status file

2019-06-05 Thread Aubrey Li
Added /proc//arch_status file, and added AVX512_elapsed_ms in /proc//arch_status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc: Andrew

[PATCH v19 2/3] x86,/proc/pid/arch_status: Add AVX-512 usage elapsed time

2019-06-05 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-05-31 Thread Aubrey Li
On Fri, May 31, 2019 at 3:45 PM Aaron Lu wrote: > > On Fri, May 31, 2019 at 02:53:21PM +0800, Aubrey Li wrote: > > On Fri, May 31, 2019 at 2:09 PM Aaron Lu wrote: > > > > > > On 2019/5/31 13:12, Aubrey Li wrote: > > > > On Fri, May 31,

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-05-31 Thread Aubrey Li
On Fri, May 31, 2019 at 2:09 PM Aaron Lu wrote: > > On 2019/5/31 13:12, Aubrey Li wrote: > > On Fri, May 31, 2019 at 11:01 AM Aaron Lu > > wrote: > >> > >> This feels like "date" failed to schedule on some CPU > >> on time. > &g

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-05-30 Thread Aubrey Li
On Fri, May 31, 2019 at 11:01 AM Aaron Lu wrote: > > This feels like "date" failed to schedule on some CPU > on time. > > My first reaction is: when shell wakes up from sleep, it will > fork date. If the script is untagged and those workloads are > tagged and all available cores are already

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-05-30 Thread Aubrey Li
On Thu, May 30, 2019 at 10:17 PM Julien Desfossez wrote: > > Interesting, could you detail a bit more your test setup (commands used, > type of machine, any cgroup/pinning configuration, etc) ? I would like > to reproduce it and investigate. Let me see if I can simply my test to reproduce it.

Re: [RFC PATCH v3 00/16] Core scheduling v3

2019-05-30 Thread Aubrey Li
On Thu, May 30, 2019 at 4:36 AM Vineeth Remanan Pillai wrote: > > Third iteration of the Core-Scheduling feature. > > This version fixes mostly correctness related issues in v2 and > addresses performance issues. Also, addressed some crashes related > to cgroups and cpu hotplugging. > > We have

Re: [RFC PATCH v2 13/17] sched: Add core wide task selection and scheduling.

2019-05-21 Thread Aubrey Li
On Mon, May 20, 2019 at 10:04 PM Vineeth Pillai wrote: > > > > The following patch improved my test cases. > > > Welcome any comments. > > > > > > > This is certainly better than violating the point of the core scheduler :) > > > > If I'm understanding this right what will happen in this case is

Re: [RFC PATCH v2 13/17] sched: Add core wide task selection and scheduling.

2019-05-18 Thread Aubrey Li
On Wed, Apr 24, 2019 at 12:18 AM Vineeth Remanan Pillai wrote: > > From: Peter Zijlstra (Intel) > > Instead of only selecting a local task, select a task for all SMT > siblings for every reschedule on the core (irrespective which logical > CPU does the reschedule). > > NOTE: there is still

Re: [RFC PATCH v2 17/17] sched: Debug bits...

2019-05-17 Thread Aubrey Li
On Wed, Apr 24, 2019 at 12:18 AM Vineeth Remanan Pillai wrote: > > From: Peter Zijlstra (Intel) > > Not-Signed-off-by: Peter Zijlstra (Intel) > --- > kernel/sched/core.c | 38 +- > 1 file changed, 37 insertions(+), 1 deletion(-) > > diff --git

Re: [RFC PATCH v2 11/17] sched: Basic tracking of matching tasks

2019-05-09 Thread Aubrey Li
On Thu, May 9, 2019 at 10:14 AM Subhra Mazumdar wrote: > > > On 5/8/19 6:38 PM, Aubrey Li wrote: > > On Thu, May 9, 2019 at 8:29 AM Subhra Mazumdar > > wrote: > >> > >> On 5/8/19 5:01 PM, Aubrey Li wrote: > >>> On Thu, May 9, 2019 at 2:41 AM S

Re: [RFC PATCH v2 11/17] sched: Basic tracking of matching tasks

2019-05-08 Thread Aubrey Li
On Thu, May 9, 2019 at 8:29 AM Subhra Mazumdar wrote: > > > On 5/8/19 5:01 PM, Aubrey Li wrote: > > On Thu, May 9, 2019 at 2:41 AM Subhra Mazumdar > > wrote: > >> > >> On 5/8/19 11:19 AM, Subhra Mazumdar wrote: > >>> On 5/8/19 8:49 AM, Aubrey L

Re: [RFC PATCH v2 11/17] sched: Basic tracking of matching tasks

2019-05-08 Thread Aubrey Li
On Thu, May 9, 2019 at 2:41 AM Subhra Mazumdar wrote: > > > On 5/8/19 11:19 AM, Subhra Mazumdar wrote: > > > > On 5/8/19 8:49 AM, Aubrey Li wrote: > >>> Pawan ran an experiment setting up 2 VMs, with one VM doing a > >>> parallel kernel build and o

Re: [RFC PATCH v2 11/17] sched: Basic tracking of matching tasks

2019-05-08 Thread Aubrey Li
On Fri, May 3, 2019 at 8:06 AM Tim Chen wrote: > > On 5/1/19 4:27 PM, Tim Chen wrote: > > On 4/28/19 11:15 PM, Aaron Lu wrote: > >> On Tue, Apr 23, 2019 at 04:18:16PM +, Vineeth Remanan Pillai wrote: > >>> +/* > >>> + * Find left-most (aka, highest priority) task matching @cookie. > >>> + */

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-29 Thread Aubrey Li
On Tue, Apr 30, 2019 at 12:01 AM Ingo Molnar wrote: > * Li, Aubrey wrote: > > > > I.e. showing the approximate CPU thread-load figure column would be > > > very useful too, where '50%' shows half-loaded, '100%' fully-loaded, > > > '200%' over-saturated, etc. - for each row? > > > > See below,

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-29 Thread Aubrey Li
On Mon, Apr 29, 2019 at 11:39 PM Phil Auld wrote: > > On Mon, Apr 29, 2019 at 09:25:35PM +0800 Li, Aubrey wrote: > > .--. > > |NA/AVX vanilla-SMT [std% / sem%]

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-28 Thread Aubrey Li
On Sun, Apr 28, 2019 at 5:33 PM Ingo Molnar wrote: > So because I'm a big fan of presenting data in a readable fashion, here > are your results, tabulated: I thought I tried my best to make it readable, but this one looks much better, thanks, ;-) > > # > # Sysbench throughput comparison of 3

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-27 Thread Aubrey Li
On Sat, Apr 27, 2019 at 10:21 PM Ingo Molnar wrote: > > * Aubrey Li wrote: > > > On Sat, Apr 27, 2019 at 5:17 PM Ingo Molnar wrote: > > > > > > > > > * Aubrey Li wrote: > > > > > > > I have the same environment setup above, for nos

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-27 Thread Aubrey Li
On Sat, Apr 27, 2019 at 5:17 PM Ingo Molnar wrote: > > > * Aubrey Li wrote: > > > I have the same environment setup above, for nosmt cases, I used > > /sys interface Thomas mentioned, below is the result: > > > > NA/AVX baseline(std%) coresched(std%) +/-

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-26 Thread Aubrey Li
On Thu, Apr 25, 2019 at 5:55 PM Ingo Molnar wrote: > * Aubrey Li wrote: > > On Wed, Apr 24, 2019 at 10:00 PM Julien Desfossez > > wrote: > > > > > > On 24-Apr-2019 09:13:10 PM, Aubrey Li wrote: > > > > On Wed, Apr 24, 2019 at 1

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-25 Thread Aubrey Li
On Thu, Apr 25, 2019 at 5:55 PM Ingo Molnar wrote: > > > * Aubrey Li wrote: > > > On Wed, Apr 24, 2019 at 10:00 PM Julien Desfossez > > wrote: > > > > > > On 24-Apr-2019 09:13:10 PM, Aubrey Li wrote: > > > > On Wed, Apr 24,

[PATCH v18 2/3] x86,/proc/pid/arch_status: Add AVX-512 usage elapsed time

2019-04-25 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc

[PATCH v18 1/3] proc: add /proc//arch_status

2019-04-25 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add /proc//arch_status interface support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen

[PATCH v18 3/3] Documentation/filesystems/proc.txt: add arch_status file

2019-04-25 Thread Aubrey Li
Added /proc//arch_status file, and added AVX512_elapsed_ms in /proc//arch_status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc: Andrew

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-24 Thread Aubrey Li
On Wed, Apr 24, 2019 at 10:00 PM Julien Desfossez wrote: > > On 24-Apr-2019 09:13:10 PM, Aubrey Li wrote: > > On Wed, Apr 24, 2019 at 12:18 AM Vineeth Remanan Pillai > > wrote: > > > > > > Second iteration of the core-scheduling feature. > >

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-24 Thread Aubrey Li
On Wed, Apr 24, 2019 at 12:18 AM Vineeth Remanan Pillai wrote: > > Second iteration of the core-scheduling feature. > > This version fixes apparent bugs and performance issues in v1. This > doesn't fully address the issue of core sharing between processes > with different tags. Core sharing still

Re: [RFC PATCH v2 15/17] sched: Trivial forced-newidle balancer

2019-04-23 Thread Aubrey Li
On Wed, Apr 24, 2019 at 12:18 AM Vineeth Remanan Pillai wrote: > > From: Peter Zijlstra (Intel) > > When a sibling is forced-idle to match the core-cookie; search for > matching tasks to fill the core. > > Signed-off-by: Peter Zijlstra (Intel) > --- > include/linux/sched.h | 1 + >

Re: [RFC PATCH v2 00/17] Core scheduling v2

2019-04-23 Thread Aubrey Li
stra > - Aaron Lu > - Fixes a hard lockup in rq locking > - Vineeth Pillai > - Julien Desfossez > - Fixes a performance issue seen on IO heavy workloads > - Vineeth Pillai > - Julien Desfossez > - Fix for 32bit build > - Aubrey Li > > Issues >

[PATCH v17 1/3] proc: add /proc//arch_status

2019-04-21 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add /proc//arch_status interface support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen

[PATCH v17 2/3] /proc/pid/arch_status: Add AVX-512 usage elapsed time

2019-04-21 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc

[PATCH v17 3/3] Documentation/filesystems/proc.txt: add arch_status file

2019-04-21 Thread Aubrey Li
Added /proc//arch_status file, and added AVX512_elapsed_ms in /proc//arch_status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Thomas Gleixner Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Alexey Dobriyan Cc: Andrew

[PATCH v16 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-04-17 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew Morton --- Documentation/filesystems

[PATCH v16 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-04-17 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew

[PATCH v16 1/3] /proc/pid/status: Add support for architecture specific output

2019-04-17 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux

[PATCH v15 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-04-16 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew Morton --- Documentation/filesystems

[PATCH v15 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-04-16 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew

[PATCH v15 1/3] /proc/pid/status: Add support for architecture specific output

2019-04-16 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux

Re: [RFC][PATCH 13/16] sched: Add core wide task selection and scheduling.

2019-04-10 Thread Aubrey Li
On Wed, Apr 10, 2019 at 12:36 PM Aaron Lu wrote: > > On Tue, Apr 09, 2019 at 11:09:45AM -0700, Tim Chen wrote: > > Now that we have accumulated quite a number of different fixes to your > > orginal > > posted patches. Would you like to post a v2 of the core scheduler with the > > fixes? > >

[PATCH v14 1/3] /proc/pid/status: Add support for architecture specific output

2019-04-09 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux

[PATCH v14 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-04-09 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew Morton --- Documentation/filesystems

[PATCH v14 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-04-09 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven Cc: Linux API Cc: Alexey Dobriyan Cc: Andrew

Re: [RFC][PATCH 15/16] sched: Trivial forced-newidle balancer

2019-04-05 Thread Aubrey Li
On Thu, Apr 4, 2019 at 4:31 PM Aubrey Li wrote: > > On Fri, Feb 22, 2019 at 12:42 AM Peter Zijlstra wrote: > > > > On Thu, Feb 21, 2019 at 04:19:46PM +, Valentin Schneider wrote: > > > Hi, > > > > > > On 18/02/2019 16:56, Peter Zij

Re: [RFC][PATCH 15/16] sched: Trivial forced-newidle balancer

2019-04-04 Thread Aubrey Li
On Fri, Feb 22, 2019 at 12:42 AM Peter Zijlstra wrote: > > On Thu, Feb 21, 2019 at 04:19:46PM +, Valentin Schneider wrote: > > Hi, > > > > On 18/02/2019 16:56, Peter Zijlstra wrote: > > [...] > > > +static bool try_steal_cookie(int this, int that) > > > +{ > > > + struct rq *dst =

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-03-18 Thread Aubrey Li
On Tue, Mar 12, 2019 at 7:36 AM Subhra Mazumdar wrote: > > > On 3/11/19 11:34 AM, Subhra Mazumdar wrote: > > > > On 3/10/19 9:23 PM, Aubrey Li wrote: > >> On Sat, Mar 9, 2019 at 3:50 AM Subhra Mazumdar > >> wrote: > >>> expected. Most of

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-03-13 Thread Aubrey Li
On Thu, Mar 14, 2019 at 8:35 AM Tim Chen wrote: > >> > >> One more NULL pointer dereference: > >> > >> Mar 12 02:24:46 aubrey-ivb kernel: [ 201.916741] core sched enabled > >> [ 201.950203] BUG: unable to handle kernel NULL pointer dereference > >> at 0008 > >> [ 201.950254]

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-03-12 Thread Aubrey Li
On Tue, Mar 12, 2019 at 3:45 PM Aubrey Li wrote: > > On Tue, Mar 12, 2019 at 7:36 AM Subhra Mazumdar > wrote: > > > > > > On 3/11/19 11:34 AM, Subhra Mazumdar wrote: > > > > > > On 3/10/19 9:23 PM, Aubrey Li wrote: > > >> On

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-03-12 Thread Aubrey Li
On Tue, Mar 12, 2019 at 7:36 AM Subhra Mazumdar wrote: > > > On 3/11/19 11:34 AM, Subhra Mazumdar wrote: > > > > On 3/10/19 9:23 PM, Aubrey Li wrote: > >> On Sat, Mar 9, 2019 at 3:50 AM Subhra Mazumdar > >> wrote: > >>> expected. Most of

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-03-10 Thread Aubrey Li
On Sat, Mar 9, 2019 at 3:50 AM Subhra Mazumdar wrote: > > expected. Most of the performance recovery happens in patch 15 which, > unfortunately, is also the one that introduces the hard lockup. > After applied Subhra's patch, the following is triggered by enabling core sched when a cgroup is

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-02-26 Thread Aubrey Li
On Tue, Feb 26, 2019 at 4:26 PM Aubrey Li wrote: > > On Sat, Feb 23, 2019 at 3:27 AM Tim Chen wrote: > > > > On 2/22/19 6:20 AM, Peter Zijlstra wrote: > > > On Fri, Feb 22, 2019 at 01:17:01PM +0100, Paolo Bonzini wrote: > > >> On 18/02/19 21:40, Peter Zijls

Re: [RFC][PATCH 00/16] sched: Core scheduling

2019-02-26 Thread Aubrey Li
On Sat, Feb 23, 2019 at 3:27 AM Tim Chen wrote: > > On 2/22/19 6:20 AM, Peter Zijlstra wrote: > > On Fri, Feb 22, 2019 at 01:17:01PM +0100, Paolo Bonzini wrote: > >> On 18/02/19 21:40, Peter Zijlstra wrote: > >>> On Mon, Feb 18, 2019 at 09:49:10AM -0800, Linus Torvalds wrote: > On Mon, Feb

[PATCH v13 1/3] /proc/pid/status: Add support for architecture specific output

2019-02-23 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- fs

[PATCH v13 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-02-23 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- Documentation/filesystems/proc.txt | 29 - 1 file changed

[PATCH v13 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-02-23 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- arch/x86/kernel/fpu/xstate.c | 42

[PATCH v12 1/3] /proc/pid/status: Add support for architecture specific output

2019-02-20 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- fs

[PATCH v12 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-02-20 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- Documentation/filesystems/proc.txt | 28 +++- 1 file changed

[PATCH v12 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-02-20 Thread Aubrey Li
core_power.lvl2_turbo_license 1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- arch/x86/kernel/fpu/xstate.c | 42

[PATCH v11 1/3] /proc/pid/status: Add support for architecture specific output

2019-02-12 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- fs

[PATCH v11 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-02-12 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- Documentation/filesystems/proc.txt | 4 +++- 1 file changed, 3 insertions(+), 1

[PATCH v11 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-02-12 Thread Aubrey Li
1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- arch/x86/kernel/fpu/xstate.c | 42 1 file

[PATCH v10 2/3] x86,/proc/pid/status: Add AVX-512 usage elapsed time

2019-02-12 Thread Aubrey Li
1.004031387 seconds time elapsed Non-zero counter value confirms that the task causes frequency drop. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- arch/x86/kernel/fpu/xstate.c | 41 1 file

[PATCH v10 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-02-12 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- Documentation/filesystems/proc.txt | 4 +++- 1 file changed, 3 insertions(+), 1

[PATCH v10 1/3] /proc/pid/status: Add support for architecture specific output

2019-02-12 Thread Aubrey Li
The architecture specific information of the running processes could be useful to the userland. Add support to examine process architecture specific information externally. Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- fs

[PATCH v9 3/3] Documentation/filesystems/proc.txt: add AVX512_elapsed_ms

2019-02-11 Thread Aubrey Li
Added AVX512_elapsed_ms in /proc//status. Report it in Documentation/filesystems/proc.txt Signed-off-by: Aubrey Li Cc: Peter Zijlstra Cc: Andi Kleen Cc: Tim Chen Cc: Dave Hansen Cc: Arjan van de Ven --- Documentation/filesystems/proc.txt | 4 +++- 1 file changed, 3 insertions(+), 1

  1   2   3   >