Re: [PATCH v2] make dm and dm-crypt forward cgroup context

2013-04-11 Thread Milan Broz
On 12.4.2013 2:22, Tejun Heo wrote: > On Thu, Apr 11, 2013 at 08:06:10PM -0400, Mikulas Patocka wrote: >> All that I can tell you is that adding an empty atomic operation >> "cmpxchg(&bio->bi_css->refcnt, bio->bi_css->refcnt, bio->bi_css->refcnt);" >> to bio_clone_context and bio_disassociate_task

[RESEND][PATCH 3/3] PM / devfreq: exynos4_bus: Fix table entry size for exynos4x12_clkdiv_dmc1

2013-04-11 Thread Axel Lin
exynos4x12_clkdiv_dmc1 contains { G2DACP, DIVC2C, DIVC2C_ACLK }, thus set the size to 3 rather than 6. Signed-off-by: Axel Lin --- drivers/devfreq/exynos4_bus.c |2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/devfreq/exynos4_bus.c b/drivers/devfreq/exynos4_bus.c
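The sizing problem described in this patch can be sketched in userspace C. This is only an illustration under stated assumptions, not the driver's code: the array names, the row values, and the ARRAY_SIZE macro defined here are hypothetical stand-ins. It shows that a row declared with six columns but initialized with three fields carries three implicit zero entries, so a loop bounded by the declared width would visit slots that correspond to no divider.

```c
#include <stddef.h>

/* Userspace stand-in for the kernel's ARRAY_SIZE() helper. */
#define ARRAY_SIZE(a) (sizeof(a) / sizeof((a)[0]))

/*
 * Hypothetical row layout: { G2DACP, DIVC2C, DIVC2C_ACLK }.
 * The numeric values are made up for illustration.
 */
static const unsigned int clkdiv_dmc1_oversized[][6] = {
	{ 3, 1, 1 },	/* three real fields, three implicit zeros */
	{ 7, 1, 1 },
};

static const unsigned int clkdiv_dmc1_exact[][3] = {
	{ 3, 1, 1 },
	{ 7, 1, 1 },
};

/* Number of columns a loop over one oversized row would visit. */
static size_t oversized_columns(void)
{
	return ARRAY_SIZE(clkdiv_dmc1_oversized[0]);
}

/* Number of columns when the width matches the field count. */
static size_t exact_columns(void)
{
	return ARRAY_SIZE(clkdiv_dmc1_exact[0]);
}
```

With the width matched to the field count, the table's type itself documents how many dividers each row carries.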

[RESEND][PATCH 2/3] PM / devfreq: exynos4_bus: Constify clock divider table

2013-04-11 Thread Axel Lin
These tables are never modified, make them const. Signed-off-by: Axel Lin --- drivers/devfreq/exynos4_bus.c | 16 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/drivers/devfreq/exynos4_bus.c b/drivers/devfreq/exynos4_bus.c index 3f37f3b..45d00d1 100644 ---

[RESEND][PATCH 1/3] PM / devfreq: exynos4_bus: Fix missing mutex_unlock if opp_find_freq_floor fails

2013-04-11 Thread Axel Lin
We need to call mutex_unlock() in the error path. Signed-off-by: Axel Lin --- drivers/devfreq/exynos4_bus.c |3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/devfreq/exynos4_bus.c b/drivers/devfreq/exynos4_bus.c index 1deee09..54b9615 100644 ---
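The class of bug fixed here (returning from an error path with a lock still held) can be sketched with a single exit label. The code below is a userspace illustration only: toy_mutex, find_freq_floor and set_target are hypothetical names, and the toy lock merely records whether it is held; it is not the kernel's struct mutex or the OPP API.

```c
#include <errno.h>
#include <stddef.h>

/* Toy lock that only records whether it is held; stands in for a mutex. */
struct toy_mutex {
	int held;
};

static void toy_lock(struct toy_mutex *m)   { m->held = 1; }
static void toy_unlock(struct toy_mutex *m) { m->held = 0; }

/* Hypothetical lookup: returns NULL when no entry is at or below freq. */
static const unsigned long *find_freq_floor(const unsigned long *table,
					    size_t n, unsigned long freq)
{
	const unsigned long *best = NULL;
	size_t i;

	for (i = 0; i < n; i++)
		if (table[i] <= freq && (!best || table[i] > *best))
			best = &table[i];
	return best;
}

/* Every exit after toy_lock() passes through "out", so the lock is always dropped. */
static int set_target(struct toy_mutex *lock, const unsigned long *table,
		      size_t n, unsigned long freq, unsigned long *chosen)
{
	const unsigned long *opp;
	int err = 0;

	toy_lock(lock);
	opp = find_freq_floor(table, n, freq);
	if (!opp) {
		err = -EINVAL;	/* error path: must not return with the lock held */
		goto out;
	}
	*chosen = *opp;
out:
	toy_unlock(lock);
	return err;
}
```

Routing both the success and failure paths through one label keeps the unlock in a single place, which is the usual way to avoid this kind of leak.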

Re: [PATCH v4 3/4] x86, kdump: Change crashkernel_high/low= to crashkernel=,high/low

2013-04-11 Thread HATAYAMA Daisuke
(2013/04/11 4:39), Yinghai Lu wrote: > Index: linux-2.6/kernel/kexec.c > === > --- linux-2.6.orig/kernel/kexec.c > +++ linux-2.6/kernel/kexec.c > @@ -1368,35 +1368,108 @@ static int __init parse_crashkernel_simp > return 0; >

Re: [PATCH 01/10] mm: vmscan: Limit the number of pages kswapd reclaims at each priority

2013-04-11 Thread Ric Mason
Ping Rik, I also want to know the answer. ;-) On 04/11/2013 01:58 PM, Will Huck wrote: Hi Rik, On 03/22/2013 11:52 AM, Rik van Riel wrote: On 03/21/2013 08:05 PM, Will Huck wrote: One offline question, how to understand this in function balance_pgdat: /* * Do some background aging of the

Re: [PATCHSET v2] arch: unify task dump debug info

2013-04-11 Thread Ingo Molnar
* Tejun Heo wrote: > On Mon, Apr 08, 2013 at 08:31:07AM -0700, Tejun Heo wrote: > > Andrew, ping? > > Ping #2. Workqueue conversion of writeback in the block tree needs > these patches to avoid losing debug information over the conversion, > so it'd be great if this can be scheduled for 3.10.

Re: Bulk CPU Hotplug (Was Re: [PATCH] Do not force shutdown/reboot to boot cpu.)

2013-04-11 Thread Ingo Molnar
* Robin Holt wrote: > For the v3.9 release, can we consider my awful patch? How about trying what I suggested, to make reboot affine to the boot CPU explicitly, not by shutting down all the other CPUs, but by set_cpus_allowed() or so? That should solve the regression, without the ugly

Re: [PATCH] x86, FPU: Fix FPU initialization

2013-04-11 Thread H. Peter Anvin
I used to have one of these but gave it away when cleaning out my study... no space. Ingo Molnar wrote: > >* Borislav Petkov wrote: > >> On Thu, Apr 11, 2013 at 12:26:09PM -0700, H. Peter Anvin wrote: >> > What host is this? >> >> Judging by the DMI string in the oops: >> >> > [

sw_perf_event_destroy() oops while fuzzing

2013-04-11 Thread Tommi Rantala
Hello, Saw these oopses while fuzzing with trinity. I have some local modifications to trinity that might explain why Dave and others have not hit this before. Tommi [91911.171328] warning: process `trinity-child7' used the deprecated sysctl system call with 1029078728.32609.1029078728.32609.

Re: [PATCH] x86, FPU: Fix FPU initialization

2013-04-11 Thread Ingo Molnar
* Borislav Petkov wrote: > On Thu, Apr 11, 2013 at 12:26:09PM -0700, H. Peter Anvin wrote: > > What host is this? > > Judging by the DMI string in the oops: > > > [ 15.921486] Pid: 73, comm: hwclock Tainted: G W 3.9.0-rc6+ > > #222032 System manufacturer System Product Name/A8N-E

Re: [tip:x86/urgent] x86/mm/cpa: Convert noop to functional fix

2013-04-11 Thread Ingo Molnar
* Andrea Arcangeli wrote: > Hi, > > On Thu, Apr 11, 2013 at 02:29:18PM +0200, Ingo Molnar wrote: > > > > > > * tip-bot for Andrea Arcangeli wrote: > > > > > Commit-ID: f76cfa3c2496c462b5bc01bd0c9340c2715b73ca > > > Gitweb: > > >

[PATCH] kernel: audit_watch: resource management: better reset to NULL.

2013-04-11 Thread Chen Gang
It is better to set krule->watch = NULL. This may not be a real issue, but it makes the code clearer and so helps readers analyse other issues. Signed-off-by: Chen Gang --- kernel/audit_watch.c |1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/kernel/audit_watch.c

Re: Excessive stall times on ext4 in 3.9-rc2

2013-04-11 Thread Dave Chinner
On Thu, Apr 11, 2013 at 10:57:08PM -0400, Theodore Ts'o wrote: > On Thu, Apr 11, 2013 at 11:33:35PM +0200, Jan Kara wrote: > > I think it might be more enlightening if Mel traced which process in > > which function is holding the buffer lock. I suspect we'll find out that > > the flusher thread

[PATCH] mips: Fix typo in cavium-octeon

2013-04-11 Thread EUNBONG SONG
I think "CUI2" should be changed to "CIU2", because CIU means Central Interrupt Unit. Signed-off-by: EunBong Song --- arch/mips/cavium-octeon/octeon-irq.c |2 +- arch/mips/include/asm/mach-cavium-octeon/irq.h |2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git

[PATCH] kernel: audit_tree: resource management: need put_tree and goto Err when failure occures

2013-04-11 Thread Chen Gang
Since "normally audit_add_tree_rule() will free it on failure", we need to free it completely when a failure occurs. An additional put_tree is needed before returning, since get_tree was called. Always go to the error-processing area so that list_del_init runs. Signed-off-by: Chen Gang ---

RE: [PATCH -next] audit: remove duplicated include from audit.c

2013-04-11 Thread yongjun_...@trendmicro.com.cn
Hi Eric, Sorry for replying on top. From the source code in the linux-next.git tree, line 55~64: #include #include *** #include #ifdef CONFIG_SECURITY #include #endif #include *** #include #include #include net/netlink.h is included twice, and linux/netlink.h is

Re: [PATCH -next] f2fs: fix error return code in f2fs_fill_super()

2013-04-11 Thread Namjae Jeon
2013/4/12, Wei Yongjun : > From: Wei Yongjun > > Fix to return a negative error code from the error handling > case instead of 0, as returned elsewhere in this function. > Introduced by commit c0d39e(f2fs: fix return values from validate > superblock) > > Signed-off-by: Wei Yongjun Acked-by:

Re: [PATCH] sched: wake-affine throttle

2013-04-11 Thread Michael Wang
On 04/10/2013 04:51 PM, Peter Zijlstra wrote: > On Wed, 2013-04-10 at 11:30 +0800, Michael Wang wrote: >> | 15 GB | 32 | 35918 | | 37632 | +4.77% | 47923 | +33.42% | >> 52241 | +45.45% > > So I don't get this... is wake_affine() once every millisecond _that_ > expensive? > > Seeing we

[PATCH -next] perf: fix error return code

2013-04-11 Thread Wei Yongjun
From: Wei Yongjun Fix to return a negative error code from the error handling case instead of 0, as returned elsewhere in this function. Signed-off-by: Wei Yongjun --- kernel/events/core.c | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/kernel/events/core.c
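The pattern behind this kind of fix can be sketched as follows. The function name and the simulated allocation failure are hypothetical and chosen only for illustration; the actual code in kernel/events/core.c is not reproduced here. The point is that an error path which jumps to the common exit without first assigning a negative errno would return the initial value 0 and so report success.

```c
#include <errno.h>
#include <stddef.h>

/*
 * Hypothetical setup routine. fail_alloc simulates an allocation
 * failure so both paths can be exercised deterministically.
 */
static int setup_sketch(int fail_alloc)
{
	static char storage[64];
	char *buf;
	int err = 0;

	buf = fail_alloc ? NULL : storage;
	if (!buf) {
		err = -ENOMEM;	/* without this, the function would return 0 */
		goto out;
	}
	buf[0] = 1;		/* placeholder for the real work */
out:
	return err;
}
```

A caller that checks for a negative return value would otherwise continue as if setup had succeeded.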

Re: Excessive stall times on ext4 in 3.9-rc2

2013-04-11 Thread Theodore Ts'o
On Thu, Apr 11, 2013 at 11:33:35PM +0200, Jan Kara wrote: > I think it might be more enlightening if Mel traced which process in > which function is holding the buffer lock. I suspect we'll find out that > the flusher thread has submitted the buffer for IO as an async write and > thus it takes a

Re: [PATCH 10/10] mm: vmscan: Move logic from balance_pgdat() to kswapd_shrink_zone()

2013-04-11 Thread Rik van Riel
On 04/09/2013 07:07 AM, Mel Gorman wrote: balance_pgdat() is very long and some of the logic can and should be internal to kswapd_shrink_zone(). Move it so the flow of balance_pgdat() is marginally easier to follow. Signed-off-by: Mel Gorman Acked-by: Rik van Riel -- All rights reversed

Re: [PATCH 07/10] mm: vmscan: Block kswapd if it is encountering pages under writeback

2013-04-11 Thread Rik van Riel
On 04/09/2013 07:07 AM, Mel Gorman wrote: Historically, kswapd used to congestion_wait() at higher priorities if it was not making forward progress. This made no sense as the failure to make progress could be completely independent of IO. It was later replaced by wait_iff_congested() and removed

Re: [PATCH 2/3] kernel/SRCU: provide a static initializer

2013-04-11 Thread Lai Jiangshan
On 04/12/2013 01:04 AM, Sebastian Andrzej Siewior wrote: > * Lai Jiangshan | 2013-04-09 09:09:56 [+0800]: > >> If the percpu array can be defined in __SRCU_STRUCT_INIT(), >> I'm happy to expose it. but it is not currently. > > I have no idea how to achieve this. > >> Why crypto can't use boot

Re: [PATCH 06/10] mm: vmscan: Have kswapd writeback pages based on dirty pages encountered, not priority

2013-04-11 Thread Rik van Riel
On 04/09/2013 07:07 AM, Mel Gorman wrote: Currently kswapd queues dirty pages for writeback if scanning at an elevated priority but the priority kswapd scans at is not related to the number of unqueued dirty encountered. Since commit "mm: vmscan: Flatten kswapd priority loop", the priority is

Re: [PATCH 04/10] mm: vmscan: Decide whether to compact the pgdat based on reclaim progress

2013-04-11 Thread Rik van Riel
On 04/09/2013 07:06 AM, Mel Gorman wrote: In the past, kswapd makes a decision on whether to compact memory after the pgdat was considered balanced. This more or less worked but it is late to make such a decision and does not fit well now that kswapd makes a decision whether to exit the zone

Re: [PATCH 03/10] mm: vmscan: Flatten kswapd priority loop

2013-04-11 Thread Rik van Riel
On 04/09/2013 07:06 AM, Mel Gorman wrote: kswapd stops raising the scanning priority when at least SWAP_CLUSTER_MAX pages have been reclaimed or the pgdat is considered balanced. It then rechecks if it needs to restart at DEF_PRIORITY and whether high-order reclaim needs to be reset. This is not

[PATCH v4 update 21/22] x86, mm: Make init_mem_mapping be able to be called several times

2013-04-11 Thread Yinghai Lu
Prepare to put page table on local nodes. Move calling of init_mem_mapping to early_initmem_init. Rework alloc_low_pages to alloc page table in following order: BRK, local node, low range update: remove two lines in changelog about xen. Signed-off-by: Yinghai Lu Cc: Pekka Enberg Cc:

[GIT PULL] slave-dmaengine fixes

2013-04-11 Thread Vinod Koul
Hi Linus, Please pull the following two patches to receive the fixes for slave-dmaengine The first one fixes issue in pl330 to check for DT compatible and second one fixes omap-dma to start without delay The following changes since commit 07961ac7c0ee8b546658717034fe692fd12eefa9: are available

[PATCH -next] f2fs: fix error return code in f2fs_fill_super()

2013-04-11 Thread Wei Yongjun
From: Wei Yongjun Fix to return a negative error code from the error handling case instead of 0, as returned elsewhere in this function. Introduced by commit c0d39e(f2fs: fix return values from validate superblock) Signed-off-by: Wei Yongjun --- fs/f2fs/super.c | 3 ++- 1 file changed, 2

Re: [PATCH v4 21/22] x86, mm: Make init_mem_mapping be able to be called several times

2013-04-11 Thread H. Peter Anvin
Please send a replacement patch. Yinghai Lu wrote: >On Thu, Apr 11, 2013 at 6:05 PM, Konrad Wilk >wrote: >> >> - ying...@kernel.org wrote: >> >>> Prepare to put page table on local nodes. >>> >>> Move calling of init_mem_mapping to early_initmem_init. >>> >>> Rework alloc_low_pages to

Re: linux-next: Tree for Apr 11 [ crypto: aesni-intel ]

2013-04-11 Thread Herbert Xu
On Thu, Apr 11, 2013 at 07:14:31PM +0200, Sedat Dilek wrote: > On Thu, Apr 11, 2013 at 10:30 AM, Stephen Rothwell > wrote: > > Hi all, > > > > Changes since 20130410: > > > > The tip tree gained conflicts against the net-next and pm trees. > > > > The driver-core tree gained a conflict against

Re: [PATCH v4 21/22] x86, mm: Make init_mem_mapping be able to be called several times

2013-04-11 Thread Yinghai Lu
On Thu, Apr 11, 2013 at 6:05 PM, Konrad Wilk wrote: > > - ying...@kernel.org wrote: > >> Prepare to put page table on local nodes. >> >> Move calling of init_mem_mapping to early_initmem_init. >> >> Rework alloc_low_pages to alloc page table in following order: >> BRK, local node, low

Re: [PATCH] Thermal: Don't resolve THERMAL_NO_LIMIT to max_state.

2013-04-11 Thread Zhang Rui
On Tue, 2013-03-26 at 21:29 +0800, Yuxuan Shui wrote: > max_state may change at runtime, for example, when loading/unloading > cpufreq policy. > this seems to be a problem that we have not covered yet. when loading/unloading the cpufreq policy, the cpufreq_frequency_table will be changed as

[PATCH PART2 v2 4/7] staging: ramster: Add incremental accessory counters

2013-04-11 Thread Wanpeng Li
Add incremental accessory counters that are going to be used for debug fs entries. Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/ramster/debug.h | 67 ++ drivers/staging/zcache/ramster/ramster.c | 32 +++--- 2 files

[PATCH PART2 v2 5/7] staging: ramster/debug: Add RAMSTER_DEBUG Kconfig entry

2013-04-11 Thread Wanpeng Li
Add RAMSTER_DEBUG Kconfig entry. Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/Kconfig |8 drivers/staging/zcache/Makefile|2 +- drivers/staging/zcache/ramster/debug.h |2 +- 3 files changed, 10 insertions(+), 2 deletions(-)

[PATCH PART2 v2 2/7] staging: ramster: Move debugfs code out of ramster.c file

2013-04-11 Thread Wanpeng Li
Note that at this point there is no CONFIG_RAMSTER_DEBUG option in the Kconfig. So in effect all of the counters are nop until that option gets re-introduced in: zcache/ramster/debug: Add RAMSTER_DEBUG Kconfig entry Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li ---

[PATCH PART2 v2 3/7] staging: ramster/debug: Use an array to initialize/use debugfs attributes

2013-04-11 Thread Wanpeng Li
Use an array to initialize/use debugfs attributes, it makes them neater as zcache/debug.c does. Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/ramster/debug.c | 68 +++- 1 file changed, 32 insertions(+), 36 deletions(-) diff --git

[PATCH PART2 v2 1/7] staging: ramster: decrease foregin pers pages when count < 0

2013-04-11 Thread Wanpeng Li
commit 9a5c59687ad ("staging: ramster: Provide accessory functions for counter decrease") forgot to decrease foreign pers pages; this patch fixes it. Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/ramster/ramster.c |1 + 1 file changed, 1 insertion(+) diff

[PATCH PART2 v2 7/7] staging: ramster: add how-to for ramster

2013-04-11 Thread Wanpeng Li
Add how-to for ramster. Acked-by: Dan Magenheimer Signed-off-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/ramster/HOWTO.txt | 257 ++ 1 file changed, 257 insertions(+) create mode 100644 drivers/staging/zcache/ramster/HOWTO.txt diff

[PATCH PART2 v2 0/7] staging: zcache/ramster: fix and ramster/debugfs improvement

2013-04-11 Thread Wanpeng Li
Changelog: v1 -> v2: * fix bisect issue * fix issue in patch staging: ramster: Provide accessory functions for counter decrease * drop patch staging: zcache: remove zcache_freeze * add Dan's Acked-by Fix bugs in zcache, rip the debug counters out of ramster.c, and stick them in

[PATCH PART2 v2 6/7] staging: zcache/debug: fix coding style

2013-04-11 Thread Wanpeng Li
Fix coding style issues: ERROR: space prohibited before that '++' (ctx:WxO) and line over 80 characters. Acked-by: Dan Magenheimer Signed-off-by: Wanpeng Li --- drivers/staging/zcache/debug.h | 95 1 file changed, 76 insertions(+), 19 deletions(-)

[RFC PATCH v2 00/25] Dynamic NUMA: Runtime NUMA memory layout reconfiguration

2013-04-11 Thread Cody P Schafer
These patches allow the NUMA memory layout (meaning which node each physical page belongs to, the mapping from physical pages to NUMA nodes) to be changed at runtime in place (without hotplugging). Depends on "mm: avoid duplication of setup_nr_node_ids()",

[RFC PATCH v2 02/25] rbtree: add rbtree_postorder_for_each_entry_safe() helper.

2013-04-11 Thread Cody P Schafer
Signed-off-by: Cody P Schafer --- include/linux/rbtree.h | 8 1 file changed, 8 insertions(+) diff --git a/include/linux/rbtree.h b/include/linux/rbtree.h index 2879e96..1b239ca 100644 --- a/include/linux/rbtree.h +++ b/include/linux/rbtree.h @@ -85,4 +85,12 @@ static inline void

[RFC PATCH v2 05/25] mm/memory_hotplug: use {pgdat,zone}_is_empty() when resizing zones & pgdats

2013-04-11 Thread Cody P Schafer
Use the *_is_empty() helpers to be more clear about what we're actually checking for. Signed-off-by: Cody P Schafer --- mm/memory_hotplug.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index df04c36..deea8c2 100644 ---

[RFC PATCH v2 03/25] mm/memory_hotplug: factor out zone+pgdat growth.

2013-04-11 Thread Cody P Schafer
Create a new function grow_pgdat_and_zone() which handles locking + growth of a zone & the pgdat which it is associated with. Signed-off-by: Cody P Schafer --- include/linux/memory_hotplug.h | 3 +++ mm/memory_hotplug.c| 17 +++-- 2 files changed, 14 insertions(+), 6

[RFC PATCH v2 04/25] memory_hotplug: export ensure_zone_is_initialized() in mm/internal.h

2013-04-11 Thread Cody P Schafer
Export ensure_zone_is_initialized() so that it can be used to initialize new zones within the dynamic numa code. Signed-off-by: Cody P Schafer --- mm/internal.h | 8 mm/memory_hotplug.c | 2 +- 2 files changed, 9 insertions(+), 1 deletion(-) diff --git a/mm/internal.h

[RFC PATCH v2 01/25] rbtree: add postorder iteration functions.

2013-04-11 Thread Cody P Schafer
Add postorder iteration functions for rbtree. These are useful for safely freeing an entire rbtree without modifying the tree at all. Signed-off-by: Cody P Schafer --- include/linux/rbtree.h | 4 lib/rbtree.c | 40 2 files changed, 44
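Why postorder iteration makes it safe to free a whole tree without rebalancing can be sketched on a plain binary tree with parent pointers. This is a userspace illustration under stated assumptions: struct node, first_postorder, next_postorder, add_node and free_tree are hypothetical names, and this is not the kernel's struct rb_node or the functions the patch adds. Because every child is visited before its parent, the successor of a node can be computed from the node and its still-allocated parent, and the node can then be freed without unlinking it.

```c
#include <stddef.h>
#include <stdlib.h>

/* Plain binary tree node with a parent pointer; not the kernel's rb_node. */
struct node {
	int key;
	struct node *parent, *left, *right;
};

/* Descend to a leaf, preferring left children. */
static struct node *deepest(struct node *n)
{
	for (;;) {
		if (n->left)
			n = n->left;
		else if (n->right)
			n = n->right;
		else
			return n;
	}
}

static struct node *first_postorder(struct node *root)
{
	return root ? deepest(root) : NULL;
}

/* Children are always visited before their parent. */
static struct node *next_postorder(struct node *n)
{
	struct node *p = n->parent;

	if (p && n == p->left && p->right)
		return deepest(p->right);	/* right subtree still pending */
	return p;				/* subtrees done, or n was the root */
}

/* Attach a new node under parent; returns NULL on allocation failure. */
static struct node *add_node(struct node *parent, int key, int as_left)
{
	struct node *n = calloc(1, sizeof(*n));

	if (!n)
		return NULL;
	n->key = key;
	n->parent = parent;
	if (parent) {
		if (as_left)
			parent->left = n;
		else
			parent->right = n;
	}
	return n;
}

/* Record keys in visit order and free each node after fetching its successor. */
static size_t free_tree(struct node *root, int *keys, size_t max)
{
	struct node *n = first_postorder(root), *next;
	size_t count = 0;

	for (; n; n = next) {
		next = next_postorder(n);	/* n and its parent are still allocated */
		if (count < max)
			keys[count] = n->key;
		count++;
		free(n);
	}
	return count;
}
```

Fetching the successor before freeing the current node is what a "safe" iteration variant has to do; the tree's links are never modified during the walk.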

[RFC PATCH v2 07/25] page_alloc: add return_pages_to_zone() when DYNAMIC_NUMA is enabled.

2013-04-11 Thread Cody P Schafer
Add return_pages_to_zone(), which uses return_page_to_zone(). It is a minimized version of __free_pages_ok() which handles adding pages which have been removed from another zone into a new zone. Signed-off-by: Cody P Schafer --- mm/internal.h | 5 - mm/page_alloc.c | 17 +

[RFC PATCH v2 10/25] page-flags dnuma: reserve a pageflag for determining if a page needs a node lookup.

2013-04-11 Thread Cody P Schafer
Add a pageflag called "lookup_node"/ PG_lookup_node / Page*LookupNode(). Used by dynamic numa to indicate when a page has a new node assignment waiting for it. FIXME: This also exempts PG_lookup_node from PAGE_FLAGS_CHECK_AT_PREP due to the asynchronous usage of PG_lookup_node, which needs to be

[RFC PATCH v2 06/25] mm: add nid_zone() helper

2013-04-11 Thread Cody P Schafer
Add nid_zone(), which returns the zone corresponding to a given nid & zonenum. Signed-off-by: Cody P Schafer --- include/linux/mm.h | 7 ++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/include/linux/mm.h b/include/linux/mm.h index 9ddae00..1b6abae 100644 ---

[RFC PATCH v2 09/25] page_alloc: when dynamic numa is enabled, don't check that all pages in a block belong to the same zone

2013-04-11 Thread Cody P Schafer
When dynamic numa is enabled, the last or first page in a pageblock may have been transplanted to a new zone (or may not yet be transplanted to a new zone). Disable a BUG_ON() which checks that the start_page and end_page are in the same zone, if they are not in the proper zone they will simply

[RFC PATCH v2 15/25] page_alloc: use dnuma to transplant newly freed pages in free_hot_cold_page()

2013-04-11 Thread Cody P Schafer
free_hot_cold_page() is used for order == 0 pages, and is where the page's zone is decided. In the normal case, these pages are freed to the per-cpu lists. When a page needs transplanting (ie: the actual node it belongs to has changed, and it needs to be moved to another zone), the pcp lists are

[RFC PATCH v2 13/25] mm: memlayout+dnuma: add debugfs interface

2013-04-11 Thread Cody P Schafer
Add a debugfs interface to dnuma/memlayout. It keeps track of a variable backlog of memory layouts, provides some statistics on dnuma moved pages & cache performance, and allows the setting of a new global memlayout. TODO: split out statistics, backlog, & write interfaces from each other.

[RFC PATCH v2 17/25] x86: memlayout: add a arch specific inital memlayout setter.

2013-04-11 Thread Cody P Schafer
On x86, we have numa_info specifically to track the numa layout, which is precisely the data memlayout needs, so use it to create an initial memlayout. Signed-off-by: Cody P Schafer --- arch/x86/mm/numa.c | 28 1 file changed, 28 insertions(+) diff --git

[RFC PATCH v2 19/25] dnuma: memlayout: add memory_add_physaddr_to_nid() for memory_hotplug

2013-04-11 Thread Cody P Schafer
Signed-off-by: Cody P Schafer --- mm/memlayout.c | 16 1 file changed, 16 insertions(+) diff --git a/mm/memlayout.c b/mm/memlayout.c index 45e7df6..4dc6706 100644 --- a/mm/memlayout.c +++ b/mm/memlayout.c @@ -247,3 +247,19 @@ void memlayout_global_init(void)

[RFC PATCH v2 16/25] page_alloc: transplant pages that are being flushed from the per-cpu lists

2013-04-11 Thread Cody P Schafer
In free_pcppages_bulk(), check if a page needs to be moved to a new node/zone & then perform the transplant (in a slightly deferred manner). Signed-off-by: Cody P Schafer --- mm/page_alloc.c | 36 +++- 1 file changed, 35 insertions(+), 1 deletion(-) diff --git

[RFC PATCH v2 14/25] page_alloc: use dnuma to transplant newly freed pages in __free_pages_ok()

2013-04-11 Thread Cody P Schafer
__free_pages_ok() handles higher order (order != 0) pages. Transplant hook is added here as this is where the struct zone to free to is decided. Signed-off-by: Cody P Schafer --- mm/page_alloc.c | 14 +- 1 file changed, 13 insertions(+), 1 deletion(-) diff --git a/mm/page_alloc.c

[RFC PATCH v2 20/25] x86/mm/numa: when dnuma is enabled, use memlayout to handle memory hotplug's physaddr_to_nid.

2013-04-11 Thread Cody P Schafer
When a memlayout is tracked (ie: CONFIG_DYNAMIC_NUMA is enabled), rather than iterate over numa_meminfo, a lookup can be done using memlayout. Signed-off-by: Cody P Schafer --- arch/x86/mm/numa.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/arch/x86/mm/numa.c

[RFC PATCH v2 23/25] mm/page_alloc: make pr_err() in page_outside_zone_boundaries() more useful

2013-04-11 Thread Cody P Schafer
Signed-off-by: Cody P Schafer --- mm/page_alloc.c | 7 +-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index a54baa9..20304cb 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -253,8 +253,11 @@ static int

[RFC PATCH v2 21/25] mm/memory_hotplug: VM_BUG if nid is too large.

2013-04-11 Thread Cody P Schafer
Signed-off-by: Cody P Schafer --- mm/memory_hotplug.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index f5ea9b7..5fcd29e 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1063,6 +1063,8 @@ int __mem_online_node(int nid)

[RFC PATCH v2 24/25] mm/page_alloc: use manage_pages instead of present pages when calculating default_zonelist_order()

2013-04-11 Thread Cody P Schafer
Signed-off-by: Cody P Schafer --- mm/page_alloc.c | 8 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index 20304cb..686d8f8 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -3488,8 +3488,8 @@ static int default_zonelist_order(void)

[RFC PATCH v2 18/25] init/main: call memlayout_global_init() in start_kernel().

2013-04-11 Thread Cody P Schafer
memlayout_global_init() initializes the first memlayout, which is assumed to match the initial page-flag nid settings. This is done in start_kernel() as the initdata used to populate the memlayout is purged from memory early in the boot process (XXX: When?). Signed-off-by: Cody P Schafer ---

[RFC PATCH v2 25/25] mm: add a early_param "extra_nr_node_ids" to increase nr_node_ids above the minimum by a percentage.

2013-04-11 Thread Cody P Schafer
For dynamic numa, sometimes the hypervisor we're running under will want to split a single NUMA node into multiple NUMA nodes. If the number of numa nodes is limited to the number available when the system booted (as it is on x86), we may not be able to fully adopt the new memory layout provided

[RFC PATCH v2 12/25] mm: add memlayout & dnuma to track pfn->nid & transplant pages between nodes

2013-04-11 Thread Cody P Schafer
On some systems, the hypervisor can (and will) relocate physical addresses as seen in a VM between real NUMA nodes. For example, IBM Power systems which are using particular revisions of PHYP (IBM's proprietary hypervisor). This change set introduces the infrastructure for tracking & dynamically

[RFC PATCH v2 22/25] mm/page_alloc: in page_outside_zone_boundaries(), avoid premature decisions.

2013-04-11 Thread Cody P Schafer
With some code that expands the zone boundaries, VM_BUG_ON(bad_range()) was being triggered. Previously, page_outside_zone_boundaries() decided that once it detected a page outside the boundaries, it was certainly outside even if the seqlock indicated the data was invalid & needed to be reread.

[RFC PATCH v2 11/25] memory_hotplug: factor out locks in mem_online_cpu()

2013-04-11 Thread Cody P Schafer
In dynamic numa, when onlining nodes, lock_memory_hotplug() is already held when mem_online_node()'s functionality is needed. Factor out the locking and create a new function __mem_online_node() to allow reuse. Signed-off-by: Cody P Schafer --- include/linux/memory_hotplug.h | 1 +
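The refactoring described here follows a common pattern: split a function into an unlocked core for callers that already hold the lock, plus a thin wrapper that takes the lock. The sketch below is a userspace illustration with hypothetical names (lock_hotplug_sketch, online_node_locked, online_node_sketch) and a toy non-recursive lock; it is not the memory-hotplug code itself.

```c
#include <assert.h>

/* Toy non-recursive lock: depth must never exceed 1. */
static int hotplug_lock_depth;

static void lock_hotplug_sketch(void)
{
	assert(hotplug_lock_depth == 0);	/* a second acquire would deadlock a real mutex */
	hotplug_lock_depth++;
}

static void unlock_hotplug_sketch(void)
{
	hotplug_lock_depth--;
}

/* Bitmask of nodes brought online by the sketch. */
static int nodes_online;

/* Core operation: the caller must already hold the lock. */
static int online_node_locked(int nid)
{
	assert(hotplug_lock_depth == 1);
	nodes_online |= 1 << nid;
	return 0;
}

/* Locked wrapper for callers that do not hold the lock. */
static int online_node_sketch(int nid)
{
	int ret;

	lock_hotplug_sketch();
	ret = online_node_locked(nid);
	unlock_hotplug_sketch();
	return ret;
}
```

A caller that already holds the lock uses the core function directly, which avoids trying to take a non-recursive lock twice.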

[RFC PATCH v2 08/25] page_alloc: in move_freepages(), skip pages instead of VM_BUG on node differences.

2013-04-11 Thread Cody P Schafer
With dynamic numa, pages are going to be gradually moved from one node to another, causing the page ranges that move_freepages() examines to contain pages that actually belong to another node. When dynamic numa is enabled, we skip these pages instead of VM_BUGing out on them. This additionally

Re: [PATCH V2 1/3] Thermal: rename thermal_sys.c to thermal_core.c

2013-04-11 Thread Zhang Rui
On Thu, 2013-04-04 at 16:24 -0400, Eduardo Valentin wrote: > On 29-03-2013 10:26, Zhang Rui wrote: > > this is the preparation work to build all the thermal core framework > > source file, like governors, cpu cooling, etc, into one module. > > > > No functional change in this patch. > > > >

[PATCH] mm/vmstat: add note on safety of drain_zonestat

2013-04-11 Thread Cody P Schafer
--- mm/vmstat.c | 4 1 file changed, 4 insertions(+) diff --git a/mm/vmstat.c b/mm/vmstat.c index e1d8ed1..2b93877 100644 --- a/mm/vmstat.c +++ b/mm/vmstat.c @@ -495,6 +495,10 @@ void refresh_cpu_vm_stats(int cpu) atomic_long_add(global_diff[i], &vm_stat[i]); } +/* +

Re: [PATCH 0/2] mm: vmemmap: add vmemmap_verify check for hot-add node/memory case

2013-04-11 Thread Tang Chen
On 04/11/2013 11:10 PM, Yinghai Lu wrote: On Thu, Apr 11, 2013 at 12:41 AM, Tang Chen wrote: 3. If we add flag to memblock, we can mark different memory. And I remember you mentioned before that we can use memblock to reserve local node data for node-life-cycle data, like vmemmap,

Re: [PATCH v4 21/22] x86, mm: Make init_mem_mapping be able to be called several times

2013-04-11 Thread Konrad Wilk
- ying...@kernel.org wrote: > Prepare to put page table on local nodes. > > Move calling of init_mem_mapping to early_initmem_init. > > Rework alloc_low_pages to alloc page table in following order: > BRK, local node, low range > > Still only load_cr3 one time, otherwise we would

Re: [PATCH v8 10/13] PCI/acpiphp: do not use ACPI PCI subdriver mechanism

2013-04-11 Thread Yijing Wang
On 2013/4/12 1:29, Bjorn Helgaas wrote: > On Wed, Apr 10, 2013 at 7:50 PM, Yijing Wang wrote: Hi Bjorn, Thanks for review. > My goal is that a user should never have to specify a kernel boot > parameter or edit a modules.conf file, but the user did previously >

Re: [PATCH v2 0/5] ioatdma: Intel S1200 support patches

2013-04-11 Thread Dan Williams
On Wed, Apr 10, 2013 at 4:44 PM, Dave Jiang wrote: > These are the updated patches from first submission series and rebased against > vinod's slave-dma git tree for-linus branch. > > Patches 1 & 4 have been updated after discussion with Dan. Patches 1-4 acked. > Patch 5 was acked by > Dan but

[PATCH v4 11/22] x86, mm, numa: Call numa_meminfo_cover_memory() checking early

2013-04-11 Thread Yinghai Lu
For the separation, we need to set memblock nid later, as it could change the memblock array and possibly double the memblock.memory array, which would need to allocate a buffer. We do not need to use nid in memblock to find out absent pages. So we can move that numa_meminfo_cover_memory() early. Also could

[PATCH v4 12/22] x86, mm, numa: Move node_map_pfn alignment() to x86

2013-04-11 Thread Yinghai Lu
Move node_map_pfn_alignment() to arch/x86/mm as no other user for it. Will update it to use numa_meminfo instead of memblock. Signed-off-by: Yinghai Lu --- arch/x86/mm/numa.c | 50 ++ include/linux/mm.h | 1 - mm/page_alloc.c| 50

[PATCH v4 00/22] x86, ACPI, numa: Parse numa info early

2013-04-11 Thread Yinghai Lu
One commit that tried to parse SRAT early get reverted before v3.9-rc1. | commit e8d1955258091e4c92d5a975ebd7fd8a98f5d30f | Author: Tang Chen | Date: Fri Feb 22 16:33:44 2013 -0800 | |acpi, memory-hotplug: parse SRAT before memblock is ready It broke several things, like acpi override and

[PATCH v4 01/22] x86: Change get_ramdisk_image() to global

2013-04-11 Thread Yinghai Lu
Need to use get_ramdisk_image() with early microcode_updating in other file. Change it to global. Also make it to take boot_params pointer, as head_32.S need to access it via phys address during 32bit flat mode. Signed-off-by: Yinghai Lu Acked-by: Tejun Heo Tested-by: Thomas Renninger ---

[PATCH v4 02/22] x86, microcode: Use common get_ramdisk_image()

2013-04-11 Thread Yinghai Lu
Use common get_ramdisk_image() to get ramdisk start phys address. We need this to get correct ramdisk address for 64bit bzImage that initrd can be loaded above 4G by kexec-tools. -v2: fix one typo that is found by Tang Chen Signed-off-by: Yinghai Lu Cc: Fenghua Yu Acked-by: Tejun Heo

[PATCH v4 10/22] x86, mm, numa: Move two functions calling on successful path later

2013-04-11 Thread Yinghai Lu
We need to have numa info ready before init_mem_mapping, so we can call init_mem_mapping per node and also trim the node mem range to a big alignment. Current numa parsing needs to allocate some buffer and needs to be called after init_mem_mapping. So try to split parsing numa info to two stages, and

[PATCH v4 22/22] x86, mm, numa: Put pagetable on local node ram for 64bit

2013-04-11 Thread Yinghai Lu
If a node with ram is hotpluggable, the memory for its page table and vmemmap should be on that node's ram. This patch is a kind of refresh of | commit 1411e0ec3123ae4c4ead6bfc9fe3ee5a3ae5c327 | Date: Mon Dec 27 16:48:17 2010 -0800 | |x86-64, numa: Put pgtable to local node memory That

[PATCH v4 03/22] x86, ACPI, mm: Kill max_low_pfn_mapped

2013-04-11 Thread Yinghai Lu
Now we have arch_pfn_mapped array, and max_low_pfn_mapped should not be used anymore. User should use arch_pfn_mapped or just 1UL<<(32-PAGE_SHIFT) instead. Only user is ACPI_INITRD_TABLE_OVERRIDE, and it should not use that, as later accessing is using early_ioremap(). We could change to use

[PATCH v4 18/22] x86, mm, numa: Add early_initmem_init() stub

2013-04-11 Thread Yinghai Lu
early_initmem_init() calls early_x86_numa_init() to parse numa info early. Later it will call init_mem_mapping for nodes in it. Signed-off-by: Yinghai Lu Cc: Pekka Enberg Cc: Jacob Shin --- arch/x86/include/asm/page_types.h | 1 + arch/x86/kernel/setup.c | 1 + arch/x86/mm/init.c

[PATCH v4 21/22] x86, mm: Make init_mem_mapping be able to be called several times

2013-04-11 Thread Yinghai Lu
Prepare to put page table on local nodes. Move calling of init_mem_mapping to early_initmem_init. Rework alloc_low_pages to alloc page table in following order: BRK, local node, low range Still only load_cr3 one time, otherwise we would break xen 64bit again. Signed-off-by: Yinghai Lu

[PATCH v4 17/22] x86, ACPI, numa, ia64: split SLIT handling out

2013-04-11 Thread Yinghai Lu
We need to handle slit later, as it need to allocate buffer for distance matrix. Also we do not need SLIT info before init_mem_mapping. So move SLIT parsing later. x86_acpi_numa_init become x86_acpi_numa_init_srat/x86_acpi_numa_init_slit. It should not break ia64 by replacing acpi_numa_init

[PATCH v4 13/22] x86, mm, numa: Use numa_meminfo to check node_map_pfn alignment

2013-04-11 Thread Yinghai Lu
We could use numa_meminfo directly instead of memblock nid. So we could move down set memblock nid and only do it one time for successful path. -v2: according to tj, separate moving to another patch. Signed-off-by: Yinghai Lu --- arch/x86/mm/numa.c | 30 +++--- 1 file

[PATCH v4 20/22] x86, mm: Add comments for step_size shift

2013-04-11 Thread Yinghai Lu
As requested by hpa, add comments explaining why we choose 5 for the step size shift. Signed-off-by: Yinghai Lu --- arch/x86/mm/init.c | 21 ++--- 1 file changed, 18 insertions(+), 3 deletions(-) diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c index 28b294f..2754e45 100644 ---
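A rough illustration of what a step size shift of 5 means in practice: every round, the step grows by a factor of 32. This sketch assumes the initial step is 2 MiB, which the snippet above does not state. The macro names and step_after() are hypothetical and only serve the example; this is not the code from the patch:

```c
#include <stdint.h>

#define STEP_SIZE_SHIFT 5          /* the shift value discussed in the patch */
#define INITIAL_STEP (2ULL << 20)  /* assumption: start with a 2 MiB step */

/* Step size after the given number of rounds, each round shifting left by 5. */
static uint64_t step_after(unsigned int rounds)
{
	uint64_t step = INITIAL_STEP;

	while (rounds--)
		step <<= STEP_SIZE_SHIFT;
	return step;
}
```

Under these assumptions the sequence is 2 MiB, 64 MiB, 2 GiB, 64 GiB, so a few rounds already cover a large range.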

[PATCH v4 14/22] x86, mm, numa: Set memblock nid later

2013-04-11 Thread Yinghai Lu
For the separation, we need to set memblock nid later, as it could change the memblock array, and possibly double the memblock.memory array, which will need to allocate a buffer. Only set memblock nid one time for successful path. Also rename numa_register_memblks to numa_check_memblks() after move out code

[PATCH v4 09/22] x86, ACPI: Find acpi tables in initrd early from head_32.S/head64.c

2013-04-11 Thread Yinghai Lu
head64.c could use #PF handler set page table to access initrd before init mem mapping and initrd relocating. head_32.S could use 32bit flat mode to access initrd before init mem mapping initrd relocating. That make 32bit and 64 bit more consistent. -v2: use inline function in header file

[PATCH v4 19/22] x86, mm: Parse numa info early

2013-04-11 Thread Yinghai Lu
Parsing numa info has been separated into two functions now. early_initmem_init() only parses info in numa_meminfo and nodes_parsed. Still keep the numaq, acpi_numa, amd_numa, dummy fall back sequence working. SLIT and numa emulation handling are still left in initmem_init(). Call early_initmem_init

[PATCH v4 08/22] x86, ACPI: Make acpi_initrd_override_find work with 32bit flat mode

2013-04-11 Thread Yinghai Lu
For finding with 32bit, it would be easy to access initrd in 32bit flat mode, as we don't need to set page table. That is from head_32.S, and microcode updating already use this trick. Need to change acpi_initrd_override_find to use phys to access global variables. Pass is_phys in the function,

[PATCH v4 05/22] x86, ACPI: Increase override tables number limit

2013-04-11 Thread Yinghai Lu
Current acpi tables in initrd is limited to 10, that is too small. 64 should be good enough as we have 35 sigs and could have several SSDT. Two problems in current code prevent us from increasing limit: 1. that cpio file info array is put in stack, as every element is 32 bytes, could run out
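The stack concern in the snippet above comes down to simple arithmetic. A minimal sketch using only the numbers given there (32 bytes per element, a limit of 10 today and 64 proposed); table_bytes() is a hypothetical helper for this illustration, not something from the patch:

```c
#include <stddef.h>

/* Total size in bytes of an array with the given entry count and entry size. */
static size_t table_bytes(size_t entries, size_t entry_size)
{
	return entries * entry_size;
}
```

With 32-byte elements, 10 entries take 320 bytes, while 64 entries take 2048 bytes, which is a lot to keep on an early-boot stack.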

[PATCH v4 07/22] x86, ACPI: Store override acpi tables phys addr in cpio files info array

2013-04-11 Thread Yinghai Lu
In 32bit we will find table with phys address during 32bit flat mode in head_32.S, because at that time we don't need set page table to access initrd. For copying we could use early_ioremap() with phys directly before mem mapping is set. To keep 32bit and 64bit consistent, use phys_addr for all.

[PATCH v4 06/22] x86, ACPI: Split acpi_initrd_override to find/copy two functions

2013-04-11 Thread Yinghai Lu
To parse srat early, we need to move acpi table probing early. acpi_initrd_table_override is before acpi table probing. So we need to move it early too. Current code acpi_initrd_table_override is after init_mem_mapping and relocate_initrd(), so it can scan initrd and copy acpi tables with kernel

[PATCH v4 16/22] x86, mm, numa: Move emulation handling down.

2013-04-11 Thread Yinghai Lu
It needs to allocate buffer for new numa_meminfo and distance matrix, so move it down. Also we change the behavior: before this patch, if the user inputs wrong data on the command line, it will fall back to next numa probing or disabling numa. After this patch, if the user inputs wrong data on the command line, it

[PATCH v4 15/22] x86, mm, numa: Move node_possible_map setting later

2013-04-11 Thread Yinghai Lu
Move node_possible_map handling out of numa_check_memblks to avoid side changing in numa_check_memblks(). Only set once for successful path instead of resetting in numa_init() every time. Suggested-by: Tejun Heo Signed-off-by: Yinghai Lu --- arch/x86/mm/numa.c | 11 +++ 1 file

[PATCH v4 04/22] x86, ACPI: Search buffer above 4G in second try for acpi override tables

2013-04-11 Thread Yinghai Lu
Now we only search buffer for override acpi table under 4G. In some case, like user use memmap to exclude all low ram, we may not find range for it under 4G. Do second try to search above 4G. Signed-off-by: Yinghai Lu Cc: "Rafael J. Wysocki" Cc: linux-a...@vger.kernel.org Tested-by: Thomas

Re: [PATCH v2] make dm and dm-crypt forward cgroup context (was: dm-crypt parallelization patches)

2013-04-11 Thread Tejun Heo
On Thu, Apr 11, 2013 at 08:06:10PM -0400, Mikulas Patocka wrote: > All that I can tell you is that adding an empty atomic operation > "cmpxchg(&bio->bi_css->refcnt, bio->bi_css->refcnt, bio->bi_css->refcnt);" > to bio_clone_context and bio_disassociate_task increases the time to run a > benchmark

Re: [PATCH] USB: ehci-omap: Select USB_PHY

2013-04-11 Thread Alexander Holler
Am 11.04.2013 20:29, schrieb Felipe Balbi: > and who said OMAP USB depends on CONFIG_USB_PHY ? Some platforms need to > control a PHY and some don't. I've read that so. > Go check out kernel 2.6.39 (maybe even 3.1 and 3.2) and you'll see that > we're much better off today where we can actually

Re: [PATCH v3 5/9] ARM: dts: Add a 64 bits version of the skeleton device tree

2013-04-11 Thread Jason Cooper
On Thu, Apr 11, 2013 at 10:48:27PM +0200, Gregory CLEMENT wrote: > Hi Jason, > > On 04/11/2013 08:12 PM, Jason Cooper wrote: > > On Tue, Apr 09, 2013 at 12:52:13AM +0200, Gregory CLEMENT wrote: > >> From: Lior Amsalem > >> > >> In order to be able to use more than 4GB address-cells and
