Re: 4.3 group scheduling regression

2015-10-13 Thread Yuyang Du
On Tue, Oct 13, 2015 at 10:10:23AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > > I.e., it is

Re: 4.3 group scheduling regression

2015-10-13 Thread Yuyang Du
On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > I.e., it is after migrate, the source cfs_rq does not decrease its > > contribution >

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > I.e., it is after migrate, the source cfs_rq does not decrease its > > contribution >

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 03:32:47AM +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 01:47:23PM +0200, Peter Zijlstra wrote: > > > > Also, should we do the below? At this point se->on_rq is still 0 so > > reweight_entity() will not update (dequeue/enqueue) the accounting, but > > we'll have just

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > I think maybe the real disease is the tg->load_avg is not updated in time. > I.e., it is after migrate, the source cfs_rq does not decrease its > contribution > to the parent's tg->load_avg fast enough. No, using the load_avg for

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > I think maybe the real disease is the tg->load_avg is not updated in time. > I.e., it is after migrate, the source cfs_rq does not decrease its > contribution > to the parent's tg->load_avg fast enough. No, using the load_avg for

Re: 4.3 group scheduling regression

2015-10-13 Thread Yuyang Du
On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > I.e., it is after migrate, the source cfs_rq does not decrease its > > contribution >

Re: 4.3 group scheduling regression

2015-10-13 Thread Yuyang Du
On Tue, Oct 13, 2015 at 10:10:23AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > > I.e., it is

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 10:06:48AM +0200, Peter Zijlstra wrote: > On Tue, Oct 13, 2015 at 03:55:17AM +0800, Yuyang Du wrote: > > > I think maybe the real disease is the tg->load_avg is not updated in time. > > I.e., it is after migrate, the source cfs_rq does not decrease its > > contribution >

Re: 4.3 group scheduling regression

2015-10-13 Thread Peter Zijlstra
On Tue, Oct 13, 2015 at 03:32:47AM +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 01:47:23PM +0200, Peter Zijlstra wrote: > > > > Also, should we do the below? At this point se->on_rq is still 0 so > > reweight_entity() will not update (dequeue/enqueue) the accounting, but > > we'll have just

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Tue, Oct 13, 2015 at 06:08:34AM +0200, Mike Galbraith wrote: > It sounded like you wanted me to run the below alone. If so, it's a nogo. Yes, thanks. Then it is the sad fact that after migrate and removed_load_avg is added in migrate_task_rq_fair(), we don't get a chance to update the tg

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Tue, 2015-10-13 at 03:55 +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 12:23:31PM +0200, Mike Galbraith wrote: > > On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > > > > > I am guessing it is in calc_tg_weight(), and naughty boys do make them > > > more > > > favored, what a

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 12:23:31PM +0200, Mike Galbraith wrote: > On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > > > I am guessing it is in calc_tg_weight(), and naughty boys do make them more > > favored, what a reality... > > > > Mike, beg you test the following? > > Wow, that was

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 01:47:23PM +0200, Peter Zijlstra wrote: > > Also, should we do the below? At this point se->on_rq is still 0 so > reweight_entity() will not update (dequeue/enqueue) the accounting, but > we'll have just accounted the 'old' load.weight. > > Doing it this way around we'll

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 13:47 +0200, Peter Zijlstra wrote: > Also, should we do the below? Ew. Box said "Either you quilt pop/burn, or I boot windows." ;-) -Mike -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 10:12:31AM +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 11:12:06AM +0200, Peter Zijlstra wrote: > > So in the old code we had 'magic' to deal with the case where a cgroup > > was consuming less than 1 cpu's worth of runtime. For example, a single > > task running in

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > I am guessing it is in calc_tg_weight(), and naughty boys do make them more > favored, what a reality... > > Mike, beg you test the following? Wow, that was quick. Dinky patch made it all better.

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 11:12:06AM +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 08:53:51AM +0800, Yuyang Du wrote: > > Good morning, Peter. > > > > On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > > > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > >

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 08:53:51AM +0800, Yuyang Du wrote: > Good morning, Peter. > > On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > > > It's odd to me that things look pretty much the same good/bad tree

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 10:04 +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > It's odd to me that things look pretty much the same good/bad tree with > > hogs vs hogs or hogs vs tbench (with top anyway, just adding up times). > > Seems

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
Good morning, Peter. On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > It's odd to me that things look pretty much the same good/bad tree with > > hogs vs hogs or hogs vs tbench (with top anyway, just adding up

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > It's odd to me that things look pretty much the same good/bad tree with > hogs vs hogs or hogs vs tbench (with top anyway, just adding up times). > Seems Xorg+mplayer more or less playing cross group ping-pong must be > the

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 09:23 +0200, Peter Zijlstra wrote: > On Sun, Oct 11, 2015 at 07:42:01PM +0200, Mike Galbraith wrote: > > (change subject, CCs) > > > > On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > > > > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Sun, Oct 11, 2015 at 07:42:01PM +0200, Mike Galbraith wrote: > (change subject, CCs) > > On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before the > > > load tracking rewrite from Yuyang)? > > It is the rewrite,

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Sun, Oct 11, 2015 at 07:42:01PM +0200, Mike Galbraith wrote: > (change subject, CCs) > > On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before the > > > load tracking rewrite from Yuyang)? > > It is the rewrite,

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > It's odd to me that things look pretty much the same good/bad tree with > hogs vs hogs or hogs vs tbench (with top anyway, just adding up times). > Seems Xorg+mplayer more or less playing cross group ping-pong must be > the

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 09:23 +0200, Peter Zijlstra wrote: > On Sun, Oct 11, 2015 at 07:42:01PM +0200, Mike Galbraith wrote: > > (change subject, CCs) > > > > On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > > > > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > I am guessing it is in calc_tg_weight(), and naughty boys do make them more > favored, what a reality... > > Mike, beg you test the following? Wow, that was quick. Dinky patch made it all better.

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
Good morning, Peter. On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > It's odd to me that things look pretty much the same good/bad tree with > > hogs vs hogs or hogs vs tbench (with top anyway, just adding up

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 10:04 +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > It's odd to me that things look pretty much the same good/bad tree with > > hogs vs hogs or hogs vs tbench (with top anyway, just adding up times). > > Seems

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 11:12:06AM +0200, Peter Zijlstra wrote: > On Mon, Oct 12, 2015 at 08:53:51AM +0800, Yuyang Du wrote: > > Good morning, Peter. > > > > On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > > > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > >

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 08:53:51AM +0800, Yuyang Du wrote: > Good morning, Peter. > > On Mon, Oct 12, 2015 at 10:04:07AM +0200, Peter Zijlstra wrote: > > On Mon, Oct 12, 2015 at 09:44:57AM +0200, Mike Galbraith wrote: > > > > > It's odd to me that things look pretty much the same good/bad tree

Re: 4.3 group scheduling regression

2015-10-12 Thread Peter Zijlstra
On Mon, Oct 12, 2015 at 10:12:31AM +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 11:12:06AM +0200, Peter Zijlstra wrote: > > So in the old code we had 'magic' to deal with the case where a cgroup > > was consuming less than 1 cpu's worth of runtime. For example, a single > > task running in

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 01:47:23PM +0200, Peter Zijlstra wrote: > > Also, should we do the below? At this point se->on_rq is still 0 so > reweight_entity() will not update (dequeue/enqueue) the accounting, but > we'll have just accounted the 'old' load.weight. > > Doing it this way around we'll

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Mon, Oct 12, 2015 at 12:23:31PM +0200, Mike Galbraith wrote: > On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > > > I am guessing it is in calc_tg_weight(), and naughty boys do make them more > > favored, what a reality... > > > > Mike, beg you test the following? > > Wow, that was

Re: 4.3 group scheduling regression

2015-10-12 Thread Yuyang Du
On Tue, Oct 13, 2015 at 06:08:34AM +0200, Mike Galbraith wrote: > It sounded like you wanted me to run the below alone. If so, it's a nogo. Yes, thanks. Then it is the sad fact that after migrate and removed_load_avg is added in migrate_task_rq_fair(), we don't get a chance to update the tg

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Tue, 2015-10-13 at 03:55 +0800, Yuyang Du wrote: > On Mon, Oct 12, 2015 at 12:23:31PM +0200, Mike Galbraith wrote: > > On Mon, 2015-10-12 at 10:12 +0800, Yuyang Du wrote: > > > > > I am guessing it is in calc_tg_weight(), and naughty boys do make them > > > more > > > favored, what a

Re: 4.3 group scheduling regression

2015-10-12 Thread Mike Galbraith
On Mon, 2015-10-12 at 13:47 +0200, Peter Zijlstra wrote: > Also, should we do the below? Ew. Box said "Either you quilt pop/burn, or I boot windows." ;-) -Mike -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to

4.3 group scheduling regression

2015-10-11 Thread Mike Galbraith
(change subject, CCs) On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before the > > load tracking rewrite from Yuyang)? It is the rewrite, 9d89c257dfb9c51a532d69397f6eed75e5168c35. Watching 8 single hog groups vs 1

4.3 group scheduling regression

2015-10-11 Thread Mike Galbraith
(change subject, CCs) On Sun, 2015-10-11 at 04:25 +0200, Mike Galbraith wrote: > > Is the interactivity the same (horrible) at fe32d3cd5e8e (ie, before the > > load tracking rewrite from Yuyang)? It is the rewrite, 9d89c257dfb9c51a532d69397f6eed75e5168c35. Watching 8 single hog groups vs 1