Quoting Serge E. Hallyn ([EMAIL PROTECTED]): > Quoting Lee Schermerhorn ([EMAIL PROTECTED]): > > On Tue, 2007-08-14 at 13:03 -0500, Serge E. Hallyn wrote: > > > Quoting Lee Schermerhorn ([EMAIL PROTECTED]): > > > > On Mon, 2007-08-13 at 15:12 -0500, Serge E. Hallyn wrote: > > > > > Quoting Dhaval Giani ([EMAIL PROTECTED]): > > > > > > Hi, > > > > > > > > > > > > On mounting cpusets using containers, I have been hitting the > > > > > > following > > > > > > bug. > > > > > > > > > > > > > > > > > > -----------[ cut here ]------------ > > > > > > kernel BUG at kernel/cpuset.c:331! > > > > <snip> > > > > > > CONFIG_HIGHMEM=y > > > > > > CONFIG_X86_PAE=y > > > > > > # CONFIG_NUMA is not set > > > > > > CONFIG_ARCH_POPULATES_NODE_MAP=y > > > > > > CONFIG_SELECT_MEMORY_MODEL=y > > > > > > CONFIG_FLATMEM_MANUAL=y > > > > > > # CONFIG_DISCONTIGMEM_MANUAL is not set > > > > > > # CONFIG_SPARSEMEM_MANUAL is not set > > > > > > CONFIG_FLATMEM=y > > > > > > CONFIG_FLAT_NODE_MEM_MAP=y > > > > > > # CONFIG_SPARSEMEM_STATIC is not set > > > > > > CONFIG_SPLIT_PTLOCK_CPUS=4 > > > > > > CONFIG_RESOURCES_64BIT=y > > > > > > CONFIG_ZONE_DMA_FLAG=1 > > > > <snip> > > > > > > > > > > Yeah, I'm seeing the same thing. Oddly, my > > > > > node_states[N_NORMAL_MEMORY] > > > > > and node_states[N_HIGH_MEMORY] are empty, while node_states[N_ONLINE] > > > > > contains my single cpu (on i386 kvm image). > > > > > > > > > > -serge > > > > > > > > Yes, you'll definitely hit that BUG if the N_HIGH_MEMORY mask is empty. > > > > So far, I can't see how this could be, tho'. __build_all_zonelists() > > > > should be called for non-NUMA as well as NUMA. It iterates over "all > > > > > > Yup, and it is, and some debug statements insist that > > > node_set_state(nid, N_HIGH_MEMORY) is being called for cpu 0, > > > and the state is in fact being correctly set. > > > > > > So it must get cleared later... > > > > OK. That helps. I'll see what I can find... > > Well apparently I lied? I don't think it's getting cleared in node_states. > I'm doing: > > static void guarantee_online_mems(const struct cpuset *cs, nodemask_t *pmask) > { > int nid=0; > printk(KERN_NOTICE "%s: at top, node %d is %d\n", > __FUNCTION__, nid, node_state(nid, N_HIGH_MEMORY)); > while (cs && !nodes_intersects(cs->mems_allowed, > node_states[N_HIGH_MEMORY])) { > printk(KERN_NOTICE "%s: in loop\n", __FUNCTION__); > cs = cs->parent; > } > if (cs) { > printk(KERN_NOTICE "%s: cs was true\n", __FUNCTION__); > nodes_and(*pmask, cs->mems_allowed, > node_states[N_HIGH_MEMORY]); > } > else > { > printk(KERN_NOTICE "%s: cs was false\n", __FUNCTION__); > *pmask = node_states[N_HIGH_MEMORY]; > } > printk(KERN_NOTICE "%s: at bottom, node %d is %d\n", > __FUNCTION__, nid, node_state(nid, N_HIGH_MEMORY)); > printk(KERN_NOTICE "%s: at bottom, in pmask node %d is %d\n", > __FUNCTION__, nid, node_isset(nid, *pmask)); > BUG_ON(!nodes_intersects(*pmask, node_states[N_HIGH_MEMORY])); > } > > and getting: > > guarantee_online_mems: in loop > guarantee_online_mems: cs was false > guarantee_online_mems: at bottom, node 0 is 1 > guarantee_online_mems: at bottom, in pmask node 0 is 0 > > > So I'm guess that > *pmask = node_states[N_HIGH_MEMORY]; > isn't doing what it was intended to do? That or I've got node_state() > wrong...
Argh. CONFIG_NODES_SHIFT was unset, so MAX_NUMNODES=1, so node_state() and node_set_state() are dummies. So __build_all_zonelists is using the dummy node_state() and not actually setting node_states[N_HIGH_MEMORY], and my debug statements were using the dumy node_state() which returns 1 if n==0 :) Here is a patch which fixes the bug on my system. As noted in the patch description, there are other calls to node_set_state() in mm/page_alloc.c which might also need to be switched to node_set(), if this is deemed the right solution. thanks, -serge >From 8cffdf9bc60af3894858bcc9412aa80938064435 Mon Sep 17 00:00:00 2001 From: [EMAIL PROTECTED] <[EMAIL PROTECTED](none)> Date: Tue, 14 Aug 2007 13:47:51 -0700 Subject: [PATCH 1/1] memoryless: actually set node_states[N_HIGH_MEMORY] If CONFIG_NODES_SHIFT is unset, then MAX_NUMNODES=1, and node_set_state() does nothing. This means that __build_all_zonelists() does not actually set node_states[N_HIGH_MEMORY]. cpusets needs node_states[N_HIGH_MEMORY] to be correctly set, or it will raise a bug when a new cpuset is created. Switch to using node_set() in __build_all_zonelists() to make sure it is actually set. Note there are other calls to node_set_state() in mm/page_alloc.c, which perhaps should also be modified. Since this area is completely foreign to me I thought I'd first make sure this is the right approach. Signed-off-by: Serge E. Hallyn <[EMAIL PROTECTED]> Signed-off-by: Dhaval Giani <[EMAIL PROTECTED]> --- mm/page_alloc.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index 1a40099..111e311 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -2443,7 +2443,7 @@ static int __build_all_zonelists(void *dummy) /* Any memory on that node */ if (pgdat->node_present_pages) - node_set_state(nid, N_HIGH_MEMORY); + node_set(nid, node_states[N_HIGH_MEMORY]); check_for_regular_memory(pgdat); } return 0; -- 1.5.1 - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/