Re: [RFC 2/2] x86_64: expand kernel stack to 16K

2014-05-30 Thread Linus Torvalds
On Fri, May 30, 2014 at 2:48 AM, Richard Weinberger wrote: > > If we raise the stack size on x86_64 to 16k, what about i386? > Beside of the fact that most of you consider 32bits as dead and must die... ;) x86-32 doesn't have nearly the same issue, since a large portion of stack content tends to

Re: [PATCH v3 2/2] ARM : change fixmap mapping region to support 32 CPUs

2014-05-30 Thread Rob Herring
On Tue, Apr 15, 2014 at 6:06 AM, Liu Hua wrote: > In 32-bit ARM systems, the fixmap mapping region can support > no more than 14 CPUs(total: 896k; one CPU: 64K). And we can > configure NR_CPUS up to 32. So there is a mismatch. > > This patch moves fixmapping region downwards to region >
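
The mismatch described in the message above is plain arithmetic. A minimal sketch using only the figures quoted there (896K total, 64K per CPU, NR_CPUS up to 32); the function names are illustrative and do not come from the patch:

```python
# Figures quoted in the message; the names below are hypothetical.
FIXMAP_TOTAL_KB = 896   # current size of the fixmap mapping region
PER_CPU_KB = 64         # fixmap space consumed by one CPU
NR_CPUS_MAX = 32        # largest configurable NR_CPUS


def max_cpus(total_kb, per_cpu_kb):
    """Number of CPUs that fit in a region of total_kb."""
    return total_kb // per_cpu_kb


def required_kb(nr_cpus, per_cpu_kb):
    """Region size needed to give every CPU its per-CPU share."""
    return nr_cpus * per_cpu_kb
```

With these inputs the current region holds 14 CPUs, while 32 CPUs would need 2048K, which is why the region has to move or grow.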

Re: [PATCH RFC RESEND 00/14] New version of the BFQ I/O Scheduler

2014-05-30 Thread Vivek Goyal
On Tue, May 27, 2014 at 02:42:24PM +0200, paolo wrote: [..] > Strong fairness guarantees (already provided by BFQ-v0) > > As for long-term guarantees, BFQ distributes the device throughput > (and not just the device time) as desired to I/O-bound applications, > with any workload and regardless

Re: fs/dcache.c - BUG: soft lockup - CPU#5 stuck for 22s! [systemd-udevd:1667]

2014-05-30 Thread Linus Torvalds
On Fri, May 30, 2014 at 8:21 AM, Al Viro wrote: > > Linus, how would you prefer it to be handled? I'll just have to do an rc8. I really hoped to avoid it, because we're going on our family vacation when school is out in two weeks, and it causes problems for the merge window, but it's not like

Re: [PATCH 6/7 V2] perf: Add support to dynamically get cacheline size

2014-05-30 Thread Arnaldo Carvalho de Melo
On Fri, May 30, 2014 at 10:50:25AM -0400, Don Zickus wrote: > Different arches may have different cacheline sizes. Look it up and set > a global variable for reference. [acme@zoo linux]$ strings `which getconf` | grep LINESIZE LEVEL1_ICACHE_LINESIZE LEVEL1_DCACHE_LINESIZE

Re: [RFC 2/2] x86_64: expand kernel stack to 16K

2014-05-30 Thread H. Peter Anvin
On 05/29/2014 06:34 PM, Dave Chinner wrote: >> ... >> "kworker/u24:1 (94) used greatest stack depth: 8K bytes left, it means >> there is some horrible stack hogger in your kernel. Please report it >> the LKML and enable stacktrace to investigate who is culprit" > > That, however, presumes that a

Re: [PATCH] block: virtio_blk: don't hold spin lock during world switch

2014-05-30 Thread Michael S. Tsirkin
On Fri, May 30, 2014 at 10:49:29AM +0800, Ming Lei wrote: > Firstly, it isn't necessary to hold lock of vblk->vq_lock > when notifying hypervisor about queued I/O. > > Secondly, virtqueue_notify() will cause world switch and > it may take long time on some hypervisors(such as, qemu-arm), > so it

Re: [PATCH 3/4] perf tools: add hardware breakpoint bp_len test cases

2014-05-30 Thread Frederic Weisbecker
On Fri, May 30, 2014 at 09:49:25AM -0500, Suravee Suthikulanit wrote: > On 5/29/2014 10:26 AM, Frederic Weisbecker wrote: > >@@ -1389,6 +1432,21 @@ static struct evlist_test test__events[] = { > > .check = test__pinned_group, > > .id= 41, > > }, > >+{ > >+

Re: [RFC v2] Bluetooth: Keep master role when SCO or eSCO is active

2014-05-30 Thread Sergei Shtylyov
Hello. On 05/30/2014 10:24 AM, Kiran Kumar Raparthy wrote: From: "hyungseoung.yoo" Preserve the master role when SCO or eSCO is active as this improves compatibility with lots of headset and chipset combinations. This is one of a number of patches from the Android AOSP common.git

Re: fs/dcache.c - BUG: soft lockup - CPU#5 stuck for 22s! [systemd-udevd:1667]

2014-05-30 Thread Al Viro
On Fri, May 30, 2014 at 11:12:38AM +0300, Mika Westerberg wrote: > Tested your latest #for-linus from here: > > https://git.kernel.org/cgit/linux/kernel/git/viro/vfs.git/log/?h=for-linus > > and the livelock is gone, > > Tested-by: Mika Westerberg > > Thanks again! OK... I've just pushed

[tip:x86/xsave] x86/cpufeature.h: Reformat x86 feature macros

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 446fd806f5408b623fa51f3aa084e56844563779 Gitweb: http://git.kernel.org/tip/446fd806f5408b623fa51f3aa084e56844563779 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:29 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 12:37:10 -0700 x86/cpufeature.h:

[tip:x86/xsave] Define kernel API to get address of each state in xsave area

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 7496d6458fe3219d63848ce4a9afbd86245cab22 Gitweb: http://git.kernel.org/tip/7496d6458fe3219d63848ce4a9afbd86245cab22 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:44 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:09 -0700 Define kernel API to get

[tip:x86/xsave] x86/xsaves: Enable xsaves/xrstors

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 7e7ce87f6ad4e1730364e5e76628b43c5759b700 Gitweb: http://git.kernel.org/tip/7e7ce87f6ad4e1730364e5e76628b43c5759b700 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:43 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:07 -0700 x86/xsaves: Enable

[tip:x86/xsave] x86/xsaves: Clear reserved bits in xsave header

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 21e726c4a3625a1038e97795b7aad97109ba7e19 Gitweb: http://git.kernel.org/tip/21e726c4a3625a1038e97795b7aad97109ba7e19 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:39 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:00 -0700 x86/xsaves: Clear

[tip:x86/xsave] x86/xsaves: Call booting time xsaves and xrstors in setup_init_fpu_buf

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 47c2f292cc8669f70644a949cadd5fa5ee0e0e07 Gitweb: http://git.kernel.org/tip/47c2f292cc8669f70644a949cadd5fa5ee0e0e07 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:42 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:06 -0700 x86/xsaves: Call booting

[tip:x86/xsave] x86/xsaves: Save xstate to task's xsave area in __save_fpu during booting time

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: f41d830fa890044cb60f6bb39fc8f6493ffebb47 Gitweb: http://git.kernel.org/tip/f41d830fa890044cb60f6bb39fc8f6493ffebb47 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:41 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:04 -0700 x86/xsaves: Save xstate

[tip:x86/xsave] x86/xsaves: Add xsaves and xrstors support for booting time

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: adb9d526e98268b647a74726346e1c40e6a37d2e Gitweb: http://git.kernel.org/tip/adb9d526e98268b647a74726346e1c40e6a37d2e Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:40 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:33:02 -0700 x86/xsaves: Add xsaves

[tip:x86/xsave] x86/xsaves: Use xsave/xrstor for saving and restoring user space context

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: facbf4d91ae64f84ef93a00e4037135cd9f4b2ab Gitweb: http://git.kernel.org/tip/facbf4d91ae64f84ef93a00e4037135cd9f4b2ab Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:38 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:32:57 -0700 x86/xsaves: Use

Re: Bad DMA from Marvell 9230

2014-05-30 Thread Jérôme Carretero
On Fri, 30 May 2014 09:13:43 -0500 Roger Heflin wrote: > I had a 9230... > [...] > Supplier support "claimed" it to be a Linux AHCI bug as the "claim" > that their board correctly supports AHCI, even though all other AHCI > boards work right in this exact same use case in the exact same >

[tip:x86/xsave] x86/xsaves: Use xsaves/xrstors to save and restore xsave area

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: f31a9f7c71691569359fa7fb8b0acaa44bce0324 Gitweb: http://git.kernel.org/tip/f31a9f7c71691569359fa7fb8b0acaa44bce0324 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:36 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:31:21 -0700 x86/xsaves: Use

[tip:x86/xsave] x86/xsaves: Use xsaves/xrstors for context switch

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: f9de314b340f4816671f037e79ed01f685ac9787 Gitweb: http://git.kernel.org/tip/f9de314b340f4816671f037e79ed01f685ac9787 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:37 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:31:25 -0700 x86/xsaves: Use

[tip:x86/xsave] x86/xsaves: Define a macro for handling xsave/xrstor instruction fault

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: b84e70552e5aad71a1c14536e6ffcfe7934b73e4 Gitweb: http://git.kernel.org/tip/b84e70552e5aad71a1c14536e6ffcfe7934b73e4 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:35 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:31:18 -0700 x86/xsaves: Define a

[tip:x86/xsave] x86/xsaves: Change compacted format xsave area header

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 0b29643a58439dc9a8b0c0cacad0e7cb608c8199 Gitweb: http://git.kernel.org/tip/0b29643a58439dc9a8b0c0cacad0e7cb608c8199 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:33 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:31:10 -0700 x86/xsaves: Change

[tip:x86/xsave] x86/xsaves: Define macros for xsave instructions

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 200b08a970b2ae764b670a326088ab8bc0a989cc Gitweb: http://git.kernel.org/tip/200b08a970b2ae764b670a326088ab8bc0a989cc Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:34 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:31:16 -0700 x86/xsaves: Define macros

[tip:x86/xsave] x86/xsaves: Detect xsaves/xrstors feature

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 6229ad278ca74acdbc8bd3a3d469322a3de91039 Gitweb: http://git.kernel.org/tip/6229ad278ca74acdbc8bd3a3d469322a3de91039 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:30 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:24:28 -0700 x86/xsaves: Detect

[tip:x86/xsave] x86/xsaves: Add a kernel parameter noxsaves to disable xsaves/xrstors

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: b6f42a4a3c886bd18baf319d433a841ac9942c02 Gitweb: http://git.kernel.org/tip/b6f42a4a3c886bd18baf319d433a841ac9942c02 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:31 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:24:52 -0700 x86/xsaves: Add a kernel

[tip:x86/xsave] x86/alternative: Add alternative_input_2 to support alternative with two features and input

2014-05-30 Thread tip-bot for Fenghua Yu
Commit-ID: 5b3e83f46a2a7e8625258dbf84a26e7f4032bfa8 Gitweb: http://git.kernel.org/tip/5b3e83f46a2a7e8625258dbf84a26e7f4032bfa8 Author: Fenghua Yu AuthorDate: Thu, 29 May 2014 11:12:32 -0700 Committer: H. Peter Anvin CommitDate: Thu, 29 May 2014 14:24:53 -0700 x86/alternative: Add

[PATCH RESEND] clk: flatten clk tree in debugfs

2014-05-30 Thread Peter De Schrijver
This patch flattens the clk tree in CCF debugfs. Instead of representing the clocks and their hierarchy as a directory structure under /sys/kernel/debug/clk, each clock gets a single directory directly under /sys/kernel/debug/clk. The orphans directory is replaced by a file called

[PATCH] clk: flatten clk tree in debugfs

2014-05-30 Thread Peter De Schrijver
This patch flattens the clk tree in CCF debugfs. Instead of representing the clocks and their hierarchy as a directory structure under /sys/kernel/debug/clk, each clock gets a single directory directly under /sys/kernel/debug/clk. The orphans directory is replaced by a file called

Re: [PATCH -mm 8/8] slab: reap dead memcg caches aggressively

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > There is no use in keeping free objects/slabs on dead memcg caches, > because they will never be allocated. So let's make cache_reap() shrink > as many free objects from such caches as possible. > > Note the difference between SLAB and SLUB handling

Re: [PATCH v4] NVMe: basic conversion to blk-mq

2014-05-30 Thread Matthew Wilcox
On Thu, May 29, 2014 at 11:51:25PM +0200, Matias Bjørling wrote: > -static int nvme_map_bio(struct nvme_queue *nvmeq, struct nvme_iod *iod, > - struct bio *bio, enum dma_data_direction dma_dir, int psegs) > +static int nvme_map_rq(struct nvme_queue *nvmeq, struct nvme_iod *iod, > +

Re: MIGRATE_RESERVE pages in show_mem function problems

2014-05-30 Thread Vlastimil Babka
On 05/28/2014 04:24 AM, Wang, Yalin wrote: Hi I find the show_mem function show page MIGRATE types result is not correct for MIGRATE_RESERVE pages : Normal: 1582*4kB (UEMC) 1317*8kB (UEMC) 1020*16kB (UEMC) 450*32kB (UEMC) 206*64kB (UEMC) 40*128kB (UM) 10*256kB (UM) 10*512kB (UM) 1*1024kB (M)

Re: [PATCH] console: Add persistent scrollback buffers for all VGA consoles

2014-05-30 Thread Manuel Schoelling
Hi Geert, On Fri, 2014-05-30 at 16:28 +0200, Geert Uytterhoeven wrote: Hi Manuel, > > On Fri, May 30, 2014 at 3:27 PM, Manuel Schölling > wrote: > > Add a scrollback buffers for each VGA console. The benefit is that > > the scrollback history is not flushed when switching between consoles > >

Re: [PATCH -mm 7/8] slub: make dead caches discard free slabs immediately

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > (3) is a bit more difficult, because slabs are added to per-cpu partial > lists lock-less. Fortunately, we only have to handle the __slab_free > case, because, as there shouldn't be any allocation requests dispatched > to a dead memcg cache,

Re: [PATCH v4] NVMe: basic conversion to blk-mq

2014-05-30 Thread Matthew Wilcox
On Thu, May 29, 2014 at 11:51:25PM +0200, Matias Bjørling wrote: > @@ -816,9 +671,7 @@ static irqreturn_t nvme_irq(int irq, void *data) > irqreturn_t result; > struct nvme_queue *nvmeq = data; > spin_lock(&nvmeq->q_lock); > - nvme_process_cq(nvmeq); > - result = nvmeq->cqe_seen

[PATCH 6/7 V2] perf: Add support to dynamically get cacheline size

2014-05-30 Thread Don Zickus
Different arches may have different cacheline sizes. Look it up and set a global variable for reference. Signed-off-by: Don Zickus --- V3: remove unneeded cpumap.h (Namhyung Kim) V2: change to be global and setup in perf.c use filename__read_int for setup --- tools/perf/perf.c| 5

[PATCH 2/7 V2] Revert "perf: Disable PERF_RECORD_MMAP2 support"

2014-05-30 Thread Don Zickus
This reverts commit 3090ffb5a2515990182f3f55b0688a7817325488. Re-enable the mmap2 interface as we will have a user soon. Since things have changed since perf disabled mmap2, small tweaks to the revert had to be done: o commit 9d4ecc88 forced (n!=8) to become (n<7) o a new libunwind test needed

[RFC PATCH 02/16 v3] Define and initialize CPU ConCurrency in struct rq

2014-05-30 Thread Yuyang Du
This struct is in CPU's rq and updated with rq->lock held. Signed-off-by: Yuyang Du --- kernel/sched/core.c |1 + kernel/sched/fair.c | 22 ++ kernel/sched/sched.h | 18 ++ 3 files changed, 41 insertions(+) diff --git a/kernel/sched/core.c

Re: [PATCH 3/4] perf tools: add hardware breakpoint bp_len test cases

2014-05-30 Thread Suravee Suthikulanit
On 5/29/2014 10:26 AM, Frederic Weisbecker wrote: @@ -1389,6 +1432,21 @@ static struct evlist_test test__events[] = { .check = test__pinned_group, .id= 41, }, + { + .name = "mem:0/1", + .check =

Re: [PATCH -mm 5/8] slab: remove kmem_cache_shrink retval

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > First, nobody uses it. Second, it differs across the implementations: > for SLUB it always returns 0, for SLAB it returns 0 if the cache appears > to be empty. So let's get rid of it. Well slub returns an error code if it fails. I am all in favor of

[RFC PATCH 07/16 v3] Init Workload Consolidation flags in sched_domain

2014-05-30 Thread Yuyang Du
Workload Consolidation can be enabled/disabled on the fly. This patchset enables MC and CPU domain WC by default. To enable CPU WC (SD_WORKLOAD_CONSOLIDATION=0x8000): sysctl -w kernel.sched_domain.cpuX.domainY.flags += 0x8000 To disable CPU WC: sysctl -w kernel.sched_domain.cpuX.domainY.flags
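
`sysctl -w` expects `name=value`, so the `+= 0x8000` in the message above reads as shorthand for setting that bit in the current flags value. A minimal sketch of the bit manipulation this implies, assuming only the flag value quoted in the message; the helper names are hypothetical:

```python
# Flag value as quoted in the message; helper names are illustrative only.
SD_WORKLOAD_CONSOLIDATION = 0x8000


def enable_wc(flags):
    """Return flags with the Workload Consolidation bit set."""
    return flags | SD_WORKLOAD_CONSOLIDATION


def disable_wc(flags):
    """Return flags with the Workload Consolidation bit cleared."""
    return flags & ~SD_WORKLOAD_CONSOLIDATION


def wc_enabled(flags):
    """True if the Workload Consolidation bit is set in flags."""
    return bool(flags & SD_WORKLOAD_CONSOLIDATION)
```

Using OR and AND-NOT rather than literal addition and subtraction keeps both operations idempotent: enabling twice, or disabling a domain where the bit is already clear, leaves the other flag bits untouched.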

Re: [PATCH v2 3/3] CMA: always treat free cma pages as non-free on watermark checking

2014-05-30 Thread Joonsoo Kim
2014-05-30 19:40 GMT+09:00 Ritesh Harjani : > Hi Joonsoo, > > I think you will be losing the benefit of the patch below with your changes. > I am no expert here so please bear with me. I tried explaining in the > inline comments, let me know if I am wrong. > > commit

Re: [PATCH -mm 4/8] slub: never fail kmem_cache_shrink

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > SLUB's kmem_cache_shrink not only removes empty slabs from the cache, > but also sorts slabs by the number of objects in-use to cope with > fragmentation. To achieve that, it tries to allocate a temporary array. > If it fails, it will abort the whole

Re: [PATCH v2 04/11] sched: Allow all archs to set the power_orig

2014-05-30 Thread Peter Zijlstra
On Fri, May 30, 2014 at 03:04:32PM +0100, Dietmar Eggemann wrote: > On 23/05/14 16:52, Vincent Guittot wrote: > > power_orig is only changed for system with a SMT sched_domain level in > > order to > > reflect the lower capacity of CPUs. Heterogenous system also have to > > reflect an > >

[RFC PATCH 05/16 v3] Update CPU CC in fair

2014-05-30 Thread Yuyang Du
CC can only be modified when enqueue and dequeue the CPU rq. We also update it in scheduler tick, load balancing, and idle enter/exit in case we may not have enqueue and dequeue for a long time. Therefore, we update/track CC in and only in these points: we update cpu concurrency at: 1) enqueue

[RFC PATCH 11/16 v3] Make wakeup bias threshold changeable via sysctl

2014-05-30 Thread Yuyang Du
In wakeup balance, we bias wakee and waker (in this order) if it is capable of handling the wakee task. sysctl_sched_cc_wakeup_threshold is the threshold to see whether the CPU is capable, and can be changed by sysctl tool Signed-off-by: Yuyang Du --- include/linux/sched/sysctl.h |1 +

Re: [PATCH] ARM: exynos4212: hotplug: Fix CPU idle clock down after CPU1 off

2014-05-30 Thread Krzysztof Kozlowski
On Fri, 2014-05-30 at 14:56 +0200, Tomasz Figa wrote: > On 30.05.2014 14:46, Krzysztof Kozlowski wrote: > > On Fri, 2014-05-30 at 13:50 +0200, Tomasz Figa wrote: > >> Hi Krzysztof, > >> > >> On 13.05.2014 16:12, Krzysztof Kozlowski wrote: > >>> On Exynos4212 USE_DELAYED_RESET_ASSERTION must be set

[RFC PATCH 08/16 v3] Write CPU topology info for Workload Consolidation fields in sched_domain

2014-05-30 Thread Yuyang Du
Write additional CPU topology info in sched_domain for our use in cpu_attach_domain() Signed-off-by: Yuyang Du --- kernel/sched/core.c | 27 +++ 1 file changed, 27 insertions(+) diff --git a/kernel/sched/core.c b/kernel/sched/core.c index 1cb7402..9df01d5 100644 ---

[RFC PATCH 12/16 v3] Bias select wakee than waker in WAKE_AFFINE

2014-05-30 Thread Yuyang Du
In WAKE_AFFINE, we do not simply select idle, but bias wakee than waker if the cc of the wakee and waker (in this order) is capable of handling the wakee task Signed-off-by: Yuyang Du --- kernel/sched/fair.c | 13 - 1 file changed, 12 insertions(+), 1 deletion(-) diff --git

[RFC PATCH 06/16 v3] Add Workload Consolidation fields in struct sched_domain

2014-05-30 Thread Yuyang Du
Workload Consolidation is completely CPU topology and policy driven. To do so, we define SD_WORKLOAD_CONSOLIDATION, and add some fields in sched_domain struct: 1) total_groups is the group number in total in this domain 2) group_number is this CPU's group sequence number 3) consolidating_coeff is

[RFC PATCH 16/16 v3] Intercept periodic load balancing

2014-05-30 Thread Yuyang Du
We intercept load balancing to contain the load and load balancing in the consolidated CPUs according to our consolidating mechanism. In periodic load balancing, we do two things: 1) Skip pulling task to the non-consolidated CPUs. 2) In addition, for consolidated Idle CPU, we aggressively pull

[RFC PATCH 04/16 v3] CPU CC update period is changeable via sysctl

2014-05-30 Thread Yuyang Du
sysctl_sched_cc_sum_period is the CC update period. Make it changeable via the sysctl tool. In general, the longer this period, the more stable CC is and the slower it responds to task concurrency changes on this CPU. Signed-off-by: Yuyang Du --- include/linux/sched/sysctl.h |4 kernel/sysctl.c

[RFC PATCH 14/16 v3] Intercept idle balancing

2014-05-30 Thread Yuyang Du
We intercept load balancing to contain the load and load balancing in the consolidated CPUs according to our consolidating mechanism. In idle balancing, we do two things: 1) Skip pulling task to the idle non-consolidated CPUs. 2) In addition, for consolidated Idle CPU, we aggressively pull

[RFC PATCH 15/16 v3] Intercept periodic nohz idle balancing

2014-05-30 Thread Yuyang Du
We intercept load balancing to contain the load and load balancing in the consolidated CPUs according to our consolidating mechanism. In periodic nohz idle balance, we skip the idle but non-consolidated CPUs from load balancing. Signed-off-by: Yuyang Du --- kernel/sched/fair.c | 57

[RFC PATCH 09/16 v3] Define and allocate a per CPU local cpumask for Workload Consolidation

2014-05-30 Thread Yuyang Du
We need these cpumasks to aid in consolidated load balancing. Signed-off-by: Yuyang Du --- kernel/sched/fair.c |8 1 file changed, 8 insertions(+) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 96d6f64..5755746 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c

[RFC PATCH 13/16 v3] Intercept wakeup/fork/exec load balancing

2014-05-30 Thread Yuyang Du
We intercept load balancing to contain the load and load balancing in the consolidated CPUs according to our consolidating mechanism. In wakeup/fork/exec load balancing, when finding the idlest sched_group, we first try to find the consolidated group. Signed-off-by: Yuyang Du ---

[RFC PATCH 10/16 v3] Workload Consolidation APIs

2014-05-30 Thread Yuyang Du
Currently, CPU CC is per CPU. To consolidate, the formula is based on a heuristic. Suppose we have 2 CPUs, their task concurrency over time is ('-' means no task, 'x' having tasks): 1) CPU0: ----- (CC[0]) CPU1: - (CC[1]) 2) CPU0: ----- (CC[0]) CPU1:

[RFC PATCH 00/16 v3] A new CPU load metric for power-efficient scheduler: CPU ConCurrency

2014-05-30 Thread Yuyang Du
Hi Ingo, PeterZ, Rafael, and others, The current scheduler’s load balancing is completely work-conserving. In some workload, generally low CPU utilization but immersed with CPU bursts of transient tasks, migrating task to engage all available CPUs for work-conserving can lead to significant

[RFC PATCH 01/16 v3] Remove update_rq_runnable_avg

2014-05-30 Thread Yuyang Du
Since rq->avg is not made use of anywhere (I really can't find it), and the code is in fair scheduler's critical path, remove it. Sorry if anybody wants to use it, just at least temporarily remove it as of now. Signed-off-by: Yuyang Du --- kernel/sched/debug.c |8

[RFC PATCH 03/16 v3] How CC accrues with run queue change and time

2014-05-30 Thread Yuyang Du
It is natural to use task concurrency (running tasks in the rq) as load indicator. We calculate CC for task concurrency by two steps: 1) Divide continuous time into periods of time, and average task concurrency in period, for tolerating the transient bursts: a = sum(concurrency * time) / period
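
Step 1 above, `a = sum(concurrency * time) / period`, is a time-weighted average. A minimal sketch of that one formula, assuming the period is described as a list of `(concurrency, duration)` segments; the segment representation and function name are illustrative and not taken from the patch:

```python
def average_concurrency(segments, period):
    """Time-weighted average of task concurrency over one period.

    segments: iterable of (concurrency, duration) pairs, where concurrency
              is the number of running tasks in the rq during that stretch
              and duration is in the same time unit as period.
    """
    if period <= 0:
        raise ValueError("period must be positive")
    return sum(concurrency * duration for concurrency, duration in segments) / period


# Example period of 10 time units: 2 tasks for 3 units, idle for 5, 1 task for 2.
example = [(2, 3), (0, 5), (1, 2)]
```

A short burst of two tasks in an otherwise idle period therefore contributes well under 2 to the average, which is the tolerance to transient bursts that the message describes.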

[PATCH 1/1] md: Do only necessary operations when adding device to RO array

2014-05-30 Thread Baldysiak, Pawel
Commit 8313b8e57f55b15e5b7f7fc5d1630bbf686a9a97 changed the way a device is added to a read-only array. The routine it uses, md_reap_sync_thread(), also triggers finish_reshape(), which can break a reshape process if it was restarted. This part of the function should do only the necessary operations, that is: 1) call

Re: [PATCHv2 net-next 3/4] bridge: add export of multicast database adjacent to net_dev

2014-05-30 Thread Stephen Hemminger
On Sun, 25 May 2014 07:03:24 +0200 Linus Lüssing wrote: > With this new, exported function br_multicast_list_adjacent(net_dev) a > list of IPv4/6 addresses is returned. This list contains all multicast > addresses sensed by the bridge multicast snooping feature on all bridge > ports of the

Re: [PATCHv2 net-next 0/4] bridge: multicast snooping patches / exports

2014-05-30 Thread Stephen Hemminger
On Thu, 29 May 2014 15:36:22 -0700 (PDT) David Miller wrote: > From: Linus Lüssing > Date: Sun, 25 May 2014 07:03:21 +0200 > > > Changes in v2: > > I really need a bridging expert to review this series, thank you. I am fine with these, but would like original author of IGMP snooping to weigh

Re: [PATCH] watchdog: imx2_wdt: adds big endianness support.

2014-05-30 Thread Guenter Roeck
On 05/29/2014 11:10 PM, Xiubo Li wrote: This watchdog driver will be working on IMX2+, Vybrid, LS1, LS2+ platforms, and will be in different endianness mode in those SoCs: SoCs CPU endian mode WDT endian mode IMX2+LE

Re: [PATCH v3 12/15] pci: Add pci_fixup_suspend_late quirk pass.

2014-05-30 Thread Andreas Noever
On Thu, May 29, 2014 at 12:36 AM, Bjorn Helgaas wrote: > On Mon, May 26, 2014 at 05:18:09PM +0200, Andreas Noever wrote: > > Please adjust the subject to match the drivers/pci convention, i.e., change > > pci: Add pci_fixup_suspend_late quirk pass. > > to > > PCI: Add pci_fixup_suspend_late

Current ACPI/PM commit queue for 3.16

2014-05-30 Thread Rafael J. Wysocki
Hello Everyone, Below is the current ACPI/PM queue for 3.16, including some things still being run through the build robot and a couple of things still under discussion. I'm still considering a couple of additional patchsets already sent that require some fixups, but apart from those, I'm not

Re: [PATCH -mm 3/8] memcg: mark caches that belong to offline memcgs as dead

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > This will be used by the next patches. Acked-by: Christoph Lameter

Re: [PATCH -mm 2/8] memcg: destroy kmem caches when last slab is freed

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > When the memcg_cache_params->refcnt goes to 0, schedule the worker that > will unregister the cache. To prevent this from happening when the owner > memcg is alive, keep the refcnt incremented during memcg lifetime. Acked-by: Christoph Lameter

Re: [PATCH v2 2/3] CMA: aggressively allocate the pages on cma reserved memory when not used

2014-05-30 Thread Joonsoo Kim
2014-05-30 16:53 GMT+09:00 Gioh Kim : > Joonsoo, > > I'm attaching a patch for combination of __rmqueue and __rmqueue_cma. > I didn't test fully but my board is turned on and working well if no frequent > memory allocations. > > I'm sorry to send not-tested code. > I just want to report this

Re: [PATCH net] bridge: notify user space after fdb update

2014-05-30 Thread Stephen Hemminger
On Thu, 29 May 2014 17:27:16 +1000 Jon Maxwell wrote: > There have been a number of incidents recently where customers running KVM have > reported that VM hosts on different Hypervisors are unreachable. Based on > pcap traces we found that the bridge was broadcasting the ARP request out > onto the

Re: [PATCH -mm 1/8] memcg: cleanup memcg_cache_params refcnt usage

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Vladimir Davydov wrote: > Currently, we count the number of pages allocated to a per memcg cache > in memcg_cache_params->nr_pages. We only use this counter to find out if > the cache is empty and can be destroyed. So let's rename it to refcnt > and make it count not pages,

Re: [PATCH v2] pinctrl: Enable "power-source" to be extracted from DT files

2014-05-30 Thread Heiko Stübner
On Friday, 30 May 2014, 10:41:31, Ivan T. Ivanov wrote: > On Wed, 2014-05-28 at 10:48 +0200, Linus Walleij wrote: > > On Wed, May 28, 2014 at 10:36 AM, Heiko Stübner wrote: > > > On Wednesday, 28 May 2014, 10:15:43, Linus Walleij wrote: > > >> On Tue, May 27, 2014 at 3:09 PM, Heiko Stübner

Re: [PATCH] console: Add persistent scrollback buffers for all VGA consoles

2014-05-30 Thread Geert Uytterhoeven
Hi Manuel, On Fri, May 30, 2014 at 3:27 PM, Manuel Schölling wrote: > Add a scrollback buffers for each VGA console. The benefit is that > the scrollback history is not flushed when switching between consoles > but is persistent. > The buffers are allocated on demand when a new console is

RE: [PATCH v2 03/18] PCI: designware: Configuration space should be specified in 'reg'

2014-05-30 Thread Karicheri, Muralidharan
>-Original Message- >From: Murali Karicheri [mailto:m-kariche...@ti.com] >Sent: Thursday, May 29, 2014 12:32 PM >To: ABRAHAM, KISHON VIJAY >Cc: devicet...@vger.kernel.org; linux-...@vger.kernel.org; linux-arm- >ker...@lists.infradead.org; linux-o...@vger.kernel.org;

Re: Bad DMA from Marvell 9230

2014-05-30 Thread Roger Heflin
I had a 9230...on older kernels it worked "ok" so long as you did not do any smart commands, I removed it and went to something that works. Marvell appears to be hit and miss with some cards/chips working right and some not... Do enough smartcmds and the entire board (all 4 ports) locked up

Re: [patch]blk-mq: blk_mq_tag_to_rq should handle flush request

2014-05-30 Thread Jens Axboe
On 2014-05-09 22:00, Shaohua Li wrote: On Fri, May 09, 2014 at 08:00:18AM -0700, Christoph Hellwig wrote: On Fri, May 09, 2014 at 08:07:33PM +0800, Shaohua Li wrote: flush request is special, which borrows tag from other request. Need a special handling to get it from tag. Thanks, we

Re: console: lockup on boot

2014-05-30 Thread Jan Kara
On Fri 30-05-14 09:58:14, Peter Hurley wrote: > On 05/30/2014 09:11 AM, Sasha Levin wrote: > >Hi all, > > > >I sometime see lockups when booting my KVM guest with the latest -next > >kernel, > >it basically hangs right when it should start 'init', and after a while I get > >the following spew: >

Re: [PATCH] vmstat: on demand updates from differentials V7

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Frederic Weisbecker wrote: > On Thu, May 29, 2014 at 02:56:15PM -0500, Christoph Lameter wrote: > > -static void start_cpu_timer(int cpu) > > +static void __init start_shepherd_timer(void) > > { > > - struct delayed_work *work = &per_cpu(vmstat_work, cpu); > > + int cpu; > >

Re: [PATCH v2 2/5] clk: bcm281xx: implement prerequisite clocks

2014-05-30 Thread Alex Elder
On 05/29/2014 10:20 PM, Alex Elder wrote: > On 05/23/2014 07:53 PM, Mike Turquette wrote: >> Quoting Alex Elder (2014-05-20 05:52:39) >>> @@ -743,11 +746,16 @@ struct clk *kona_clk_setup(struct kona_clk *bcm_clk) >>> clk = clk_register(NULL, &bcm_clk->hw); >>> if (IS_ERR(clk)) { >>>

Re: [PATCH v2 04/11] sched: Allow all archs to set the power_orig

2014-05-30 Thread Dietmar Eggemann
On 23/05/14 16:52, Vincent Guittot wrote: > power_orig is only changed for system with a SMT sched_domain level in order > to > reflect the lower capacity of CPUs. Heterogenous system also have to reflect > an > original capacity that is different from the default value. > > Create a more

Re: [PATCH V2 2/2] ftrace: Introduce nr_saved_cmdlines I/F

2014-05-30 Thread Steven Rostedt
On Thu, 20 Feb 2014 17:44:33 +0900 Yoshihiro YUNOMAE wrote: > Introduce nr_saved_cmdlines I/F for changing the number of pid-comm list. What's an I/F? > saved_cmdlines can store 128 command names using SAVED_CMDLINES now, but > 'no-existing processes' names are often lost in saved_cmdlines

Re: [PATCH] vmstat: on demand updates from differentials V7

2014-05-30 Thread Christoph Lameter
On Fri, 30 May 2014, Frederic Weisbecker wrote: > > + cpu_stat_off = kmalloc(cpumask_size(), GFP_KERNEL); > > + cpumask_copy(cpu_stat_off, cpu_online_mask); > > Actually looks like you can as well remove that cpumask and use > cpu_online_mask directly. That would mean I would offline cpus

Re: [PATCH] clk: exynos5420: Keep aclk66_peric enabled during boot

2014-05-30 Thread Javier Martinez Canillas
Hello Doug, On 05/29/2014 11:21 PM, Doug Anderson wrote: > Right now if you've got earlyprintk enabled on exynos5420-peach-pit > then you'll get a hang on boot. Here's why: > > 1. The i2c-s3c2410 driver will probe at subsys_initcall. It will >enable its clock and disable it. This is the

Re: console: lockup on boot

2014-05-30 Thread Peter Hurley
On 05/30/2014 09:11 AM, Sasha Levin wrote: Hi all, I sometime see lockups when booting my KVM guest with the latest -next kernel, it basically hangs right when it should start 'init', and after a while I get the following spew: [ 30.790833] BUG: spinlock lockup suspected on CPU#1,

Re: Bad DMA from Marvell 9230

2014-05-30 Thread Jérôme Carretero
On Fri, 30 May 2014 20:37:58 +1000 Benjamin Herrenschmidt wrote: > We've switched to a 9235 instead which seems to work fine. Weird (I hadn't seen that you reported the 9235 working...), I have IOMMU problems with a 9235... What system are you running it on (when you say "power box", is it a

Re: [RFC PATCH 04/11] coresight: add CoreSight ETB driver

2014-05-30 Thread Russell King - ARM Linux
NAK for all the reasons I mentioned in the previous submission in 2012. You partially did the right thing - you read through the previous submission, and you said in your cover message that you had addressed some of the comments from that submission. What I find extremely distasteful is that you

[PATCH -mm 2/8] memcg: destroy kmem caches when last slab is freed

2014-05-30 Thread Vladimir Davydov
When the memcg_cache_params->refcnt goes to 0, schedule the worker that will unregister the cache. To prevent this from happening when the owner memcg is alive, keep the refcnt incremented during memcg lifetime. Note, this doesn't guarantee that the cache that belongs to a dead memcg will go away

[PATCH -mm 4/8] slub: never fail kmem_cache_shrink

2014-05-30 Thread Vladimir Davydov
SLUB's kmem_cache_shrink not only removes empty slabs from the cache, but also sorts slabs by the number of objects in-use to cope with fragmentation. To achieve that, it tries to allocate a temporary array. If it fails, it will abort the whole procedure. This is unacceptable for kmemcg, where we

[PATCH -mm 8/8] slab: reap dead memcg caches aggressively

2014-05-30 Thread Vladimir Davydov
There is no use in keeping free objects/slabs on dead memcg caches, because they will never be allocated. So let's make cache_reap() shrink as many free objects from such caches as possible. Note the difference between SLAB and SLUB handling of dead memcg caches. For SLUB, dead cache destruction

[PATCH -mm 7/8] slub: make dead caches discard free slabs immediately

2014-05-30 Thread Vladimir Davydov
To speed up further allocations, SLUB may keep some empty slabs on per cpu/node partial lists. If the cache is dead, i.e. belongs to a memcg that was turned offline, there is no need in that, because dead caches are never allocated from. What is worse, keeping empty slabs on the list will prevent

[PATCH -mm 3/8] memcg: mark caches that belong to offline memcgs as dead

2014-05-30 Thread Vladimir Davydov
This will be used by the next patches. Signed-off-by: Vladimir Davydov --- include/linux/slab.h | 2 ++ mm/memcontrol.c | 1 + mm/slab.h | 10 ++ 3 files changed, 13 insertions(+) diff --git a/include/linux/slab.h b/include/linux/slab.h index

Re: [PATCH] block: virtio_blk: don't hold spin lock during world switch

2014-05-30 Thread Jens Axboe
On 2014-05-30 00:10, Rusty Russell wrote: Jens Axboe writes: If Rusty agrees, I'd like to add it for 3.16 with a stable marker. Really stable? It improves performance, which is nice. But every patch which goes into the kernel fixes a bug, improves clarity, improves performance or adds a

[PATCH -mm 6/8] slub: do not use cmpxchg for adding cpu partials when irqs disabled

2014-05-30 Thread Vladimir Davydov
We add slabs to per cpu partial lists on both objects allocation (see get_partial_node) and free (see __slab_free). We use the same function, put_cpu_partial, in both cases. Since __slab_free can be executed with preempt/irqs enabled, we have to use cmpxchg for adding a new element to a partial

[PATCH -mm 5/8] slab: remove kmem_cache_shrink retval

2014-05-30 Thread Vladimir Davydov
First, nobody uses it. Second, it differs across the implementations: for SLUB it always returns 0, for SLAB it returns 0 if the cache appears to be empty. So let's get rid of it. Signed-off-by: Vladimir Davydov --- include/linux/slab.h | 2 +- mm/slab.c | 11 ---

[PATCH -mm 1/8] memcg: cleanup memcg_cache_params refcnt usage

2014-05-30 Thread Vladimir Davydov
Currently, we count the number of pages allocated to a per memcg cache in memcg_cache_params->nr_pages. We only use this counter to find out if the cache is empty and can be destroyed. So let's rename it to refcnt and make it count not pages, but slabs so that we can use atomic_inc/dec instead of
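
The counting scheme described above can be sketched in userspace roughly as follows. This is only an illustration, assuming C11 atomics as a stand-in for the kernel's atomic_inc/atomic_dec; struct cache_params and the two helper names are hypothetical and are not taken from the patch.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for memcg_cache_params. Only the
 * refcnt field name comes from the patch description. */
struct cache_params {
	atomic_int refcnt;	/* counts slabs, not pages */
};

/* Called when a slab is allocated to the cache. */
static void cache_slab_get(struct cache_params *p)
{
	atomic_fetch_add(&p->refcnt, 1);
}

/* Called when a slab is freed. Returns true when the last slab is
 * gone, i.e. the cache is empty and could be destroyed. */
static bool cache_slab_put(struct cache_params *p)
{
	int old = atomic_fetch_sub(&p->refcnt, 1);

	assert(old > 0);	/* a put without a matching get is a bug */
	return old == 1;
}
```

Because every slab contributes exactly one unit, a plain increment or decrement is enough regardless of how many pages the slab spans.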

[PATCH -mm 0/8] memcg/slab: reintroduce dead cache self-destruction

2014-05-30 Thread Vladimir Davydov
Hi, When a memcg is turned offline, some of its kmem caches can still have active objects and therefore cannot be destroyed immediately. Currently, we simply leak such caches along with the owner memcg, which is bad and should be resolved. It would be perfect if we could move all slab pages of

Re: [RFC PATCH 11/11] ARM: moving support for etb/etm to the "drivers" directory

2014-05-30 Thread Russell King - ARM Linux
On Fri, May 30, 2014 at 07:43:11AM -0600, mathieu.poir...@linaro.org wrote: > From: Mathieu Poirier > > Removing minimal support for etb/etm to favour an implementation > that is more flexible, extensible and capable of handling more > platforms. > > Also removing the only client of the old

Re: [PATCH] page_alloc: skip cpuset enforcement for lower zone allocations (v5)

2014-05-30 Thread Christoph Lameter
On Thu, 29 May 2014, David Rientjes wrote: > When I said that my point about mempolicies needs more thought, I wasn't > expecting that there would be no discussion -- at least _something_ that > would say why we don't care about the mempolicy case. Lets get Andi involved here too. > The

Re: [PATCH] block: per-cpu counters for in-flight IO accounting

2014-05-30 Thread Jens Axboe
On 2014-05-30 06:11, Shaohua Li wrote: On Fri, May 09, 2014 at 10:41:27AM -0600, Jens Axboe wrote: On 05/09/2014 08:12 AM, Jens Axboe wrote: On 05/09/2014 03:17 AM, Matias Bjørling wrote: With multi-million IOPS and multi-node workloads, the atomic_t in_flight tracking becomes a bottleneck.
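
The general per-CPU counter technique named in the subject can be sketched in userspace as below. This is not the code from the patch: NR_SLOTS, the 64-byte padding and all identifiers are assumptions made for illustration, and the sketch relies on each slot being updated only by its owning CPU (in the kernel that would mean running with preemption disabled).

```c
#include <assert.h>
#include <stddef.h>

#define NR_SLOTS 4	/* hypothetical number of CPUs */

/* One counter per CPU, padded so that two CPUs never write to the same
 * cache line (64 bytes is an assumed line size). */
struct inflight_slot {
	long count;
	char pad[64 - sizeof(long)];
};

struct inflight {
	struct inflight_slot slot[NR_SLOTS];
};

/* Account a request issued on the given CPU. */
static void inflight_inc(struct inflight *f, size_t cpu)
{
	assert(cpu < NR_SLOTS);
	f->slot[cpu].count++;
}

/* Account a completion. It may run on a different CPU than the issue,
 * so an individual slot is allowed to go negative. */
static void inflight_dec(struct inflight *f, size_t cpu)
{
	assert(cpu < NR_SLOTS);
	f->slot[cpu].count--;
}

/* Readers pay the cost: the in-flight total is the sum of all slots. */
static long inflight_sum(const struct inflight *f)
{
	long sum = 0;

	for (size_t i = 0; i < NR_SLOTS; i++)
		sum += f->slot[i].count;
	return sum;
}
```

The trade-off is that updates touch only CPU-local memory, while reading the total becomes a loop over all CPUs and is only approximate while updates are in progress.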

Re: [PATCH] page_alloc: skip cpuset enforcement for lower zone allocations (v4)

2014-05-30 Thread Christoph Lameter
On Thu, 29 May 2014, Andrew Morton wrote: > > > > if (!nodemask && gfp_zone(gfp_mask) < policy_zone) > > nodemask = &node_states[N_ONLINE]; > > OK, thanks, I made the patch go away for now. > And another issue is that the policy_zone may be highmem on 32 bit platforms which will
