Re: [PATCH] SLUB The unqueued slab allocator V3
From: Christoph Lameter <[EMAIL PROTECTED]> Date: Wed, 28 Feb 2007 17:06:19 -0800 (PST) > On Wed, 28 Feb 2007, David Miller wrote: > > > Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should > > not be set here, but SLUBs change in semantics in this area > > could cause similar grief in other areas, an audit is probably > > in order. > > > > The above example was from sparc64, but x86 does the same thing > > as probably do other platforms which use SLAB for pagetables. > > Maybe this will address these concerns? > > Index: linux-2.6.21-rc2/mm/slub.c > === > --- linux-2.6.21-rc2.orig/mm/slub.c 2007-02-28 16:54:23.0 -0800 > +++ linux-2.6.21-rc2/mm/slub.c2007-02-28 17:03:54.0 -0800 > @@ -1229,8 +1229,10 @@ static int calculate_order(int size) > static unsigned long calculate_alignment(unsigned long flags, > unsigned long align) > { > - if (flags & (SLAB_MUST_HWCACHE_ALIGN|SLAB_HWCACHE_ALIGN)) > + if (flags & SLAB_HWCACHE_ALIGN) > return L1_CACHE_BYTES; > + if (flags & SLAB_MUST_HWCACHE_ALIGN) > + return max(align, (unsigned long)L1_CACHE_BYTES); > > if (align < ARCH_SLAB_MINALIGN) > return ARCH_SLAB_MINALIGN; It would achiever parity with existing SLAB behavior, sure. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
On Wed, 28 Feb 2007, David Miller wrote: > Maybe if you managed your individual changes in GIT or similar > this could be debugged very quickly. :-) I think once things calm down and the changes become smaller its going to be easier. Likely the case with after V4. > Meanwhile I noticed that your alignment algorithm is different > than SLAB's. And I think this is important for the page table > SLABs that some platforms use. Ok. > No matter what flags are specified, SLAB gives at least the > passed in alignment specified in kmem_cache_create(). That > logic in slab is here: > > /* 3) caller mandated alignment */ > if (ralign < align) { > ralign = align; > } Hmmm... Right. > Whereas SLUB uses the CPU cacheline size when the MUSTALIGN > flag is set. Architectures do things like: > > pgtable_cache = kmem_cache_create("pgtable_cache", > PAGE_SIZE, PAGE_SIZE, > SLAB_HWCACHE_ALIGN | > SLAB_MUST_HWCACHE_ALIGN, > zero_ctor, > NULL); > > to get a PAGE_SIZE aligned slab, SLUB doesn't give the same > behavior SLAB does in this case. SLUB only supports this by passing through allocations to the page allocator since it does not maintain queues. So the above will cause the pgtable_cache to use the caches of the page allocator. The queueing effect that you get from SLAB is not present in SLUB since it does not provide them. If SLUB is to be used this way then we need to have higher order page sizes and allocate chunks from the higher order page for the pgtable_cache. There are other ways of doing it. IA64 f.e. uses a linked list to accomplish the same avoiding SLAB overhead. > Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should > not be set here, but SLUBs change in semantics in this area > could cause similar grief in other areas, an audit is probably > in order. > > The above example was from sparc64, but x86 does the same thing > as probably do other platforms which use SLAB for pagetables. Maybe this will address these concerns? Index: linux-2.6.21-rc2/mm/slub.c === --- linux-2.6.21-rc2.orig/mm/slub.c 2007-02-28 16:54:23.0 -0800 +++ linux-2.6.21-rc2/mm/slub.c 2007-02-28 17:03:54.0 -0800 @@ -1229,8 +1229,10 @@ static int calculate_order(int size) static unsigned long calculate_alignment(unsigned long flags, unsigned long align) { - if (flags & (SLAB_MUST_HWCACHE_ALIGN|SLAB_HWCACHE_ALIGN)) + if (flags & SLAB_HWCACHE_ALIGN) return L1_CACHE_BYTES; + if (flags & SLAB_MUST_HWCACHE_ALIGN) + return max(align, (unsigned long)L1_CACHE_BYTES); if (align < ARCH_SLAB_MINALIGN) return ARCH_SLAB_MINALIGN; - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
From: David Miller <[EMAIL PROTECTED]> Date: Wed, 28 Feb 2007 14:00:22 -0800 (PST) > V3 doesn't boot successfully on sparc64 False alarm! This crash was actually due to an unrelated problem in the parport_pc driver on my machine. Slub v3 boots up and seems to work fine so far on sparc64. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
From: Christoph Lameter <[EMAIL PROTECTED]> Date: Wed, 28 Feb 2007 11:20:44 -0800 (PST) > V2->V3 > - Debugging and diagnostic support. This is runtime enabled and not compile > time enabled. Runtime debugging can be controlled via kernel boot options > on an individual slab cache basis or globally. > - Slab Trace support (For individual slab caches). > - Resiliency support: If basic sanity checks are enabled (via F f.e.) > (boot option) then SLUB will do the best to perform diagnostics and > then continue (i.e. mark corrupted objects as used). > - Fix up numerous issues including clash of SLUBs use of page > flags with i386 arch use for pmd and pgds (which are managed > as slab caches, sigh). > - Dynamic per CPU array sizing. > - Explain SLUB slabcache flags V3 doesn't boot successfully on sparc64, sorry I don't have the ability to track this down at the moment since it resets the machine right as the video device is initialized and after diffing V2 to V3 there is way too much stuff changing for me to try and "bisect" between V2 to V3 to find the guilty sub-change. Maybe if you managed your individual changes in GIT or similar this could be debugged very quickly. :-) Meanwhile I noticed that your alignment algorithm is different than SLAB's. And I think this is important for the page table SLABs that some platforms use. No matter what flags are specified, SLAB gives at least the passed in alignment specified in kmem_cache_create(). That logic in slab is here: /* 3) caller mandated alignment */ if (ralign < align) { ralign = align; } Whereas SLUB uses the CPU cacheline size when the MUSTALIGN flag is set. Architectures do things like: pgtable_cache = kmem_cache_create("pgtable_cache", PAGE_SIZE, PAGE_SIZE, SLAB_HWCACHE_ALIGN | SLAB_MUST_HWCACHE_ALIGN, zero_ctor, NULL); to get a PAGE_SIZE aligned slab, SLUB doesn't give the same behavior SLAB does in this case. Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should not be set here, but SLUBs change in semantics in this area could cause similar grief in other areas, an audit is probably in order. The above example was from sparc64, but x86 does the same thing as probably do other platforms which use SLAB for pagetables. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
From: Christoph Lameter [EMAIL PROTECTED] Date: Wed, 28 Feb 2007 11:20:44 -0800 (PST) V2-V3 - Debugging and diagnostic support. This is runtime enabled and not compile time enabled. Runtime debugging can be controlled via kernel boot options on an individual slab cache basis or globally. - Slab Trace support (For individual slab caches). - Resiliency support: If basic sanity checks are enabled (via F f.e.) (boot option) then SLUB will do the best to perform diagnostics and then continue (i.e. mark corrupted objects as used). - Fix up numerous issues including clash of SLUBs use of page flags with i386 arch use for pmd and pgds (which are managed as slab caches, sigh). - Dynamic per CPU array sizing. - Explain SLUB slabcache flags V3 doesn't boot successfully on sparc64, sorry I don't have the ability to track this down at the moment since it resets the machine right as the video device is initialized and after diffing V2 to V3 there is way too much stuff changing for me to try and bisect between V2 to V3 to find the guilty sub-change. Maybe if you managed your individual changes in GIT or similar this could be debugged very quickly. :-) Meanwhile I noticed that your alignment algorithm is different than SLAB's. And I think this is important for the page table SLABs that some platforms use. No matter what flags are specified, SLAB gives at least the passed in alignment specified in kmem_cache_create(). That logic in slab is here: /* 3) caller mandated alignment */ if (ralign align) { ralign = align; } Whereas SLUB uses the CPU cacheline size when the MUSTALIGN flag is set. Architectures do things like: pgtable_cache = kmem_cache_create(pgtable_cache, PAGE_SIZE, PAGE_SIZE, SLAB_HWCACHE_ALIGN | SLAB_MUST_HWCACHE_ALIGN, zero_ctor, NULL); to get a PAGE_SIZE aligned slab, SLUB doesn't give the same behavior SLAB does in this case. Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should not be set here, but SLUBs change in semantics in this area could cause similar grief in other areas, an audit is probably in order. The above example was from sparc64, but x86 does the same thing as probably do other platforms which use SLAB for pagetables. - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
From: David Miller [EMAIL PROTECTED] Date: Wed, 28 Feb 2007 14:00:22 -0800 (PST) V3 doesn't boot successfully on sparc64 False alarm! This crash was actually due to an unrelated problem in the parport_pc driver on my machine. Slub v3 boots up and seems to work fine so far on sparc64. - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
On Wed, 28 Feb 2007, David Miller wrote: Maybe if you managed your individual changes in GIT or similar this could be debugged very quickly. :-) I think once things calm down and the changes become smaller its going to be easier. Likely the case with after V4. Meanwhile I noticed that your alignment algorithm is different than SLAB's. And I think this is important for the page table SLABs that some platforms use. Ok. No matter what flags are specified, SLAB gives at least the passed in alignment specified in kmem_cache_create(). That logic in slab is here: /* 3) caller mandated alignment */ if (ralign align) { ralign = align; } Hmmm... Right. Whereas SLUB uses the CPU cacheline size when the MUSTALIGN flag is set. Architectures do things like: pgtable_cache = kmem_cache_create(pgtable_cache, PAGE_SIZE, PAGE_SIZE, SLAB_HWCACHE_ALIGN | SLAB_MUST_HWCACHE_ALIGN, zero_ctor, NULL); to get a PAGE_SIZE aligned slab, SLUB doesn't give the same behavior SLAB does in this case. SLUB only supports this by passing through allocations to the page allocator since it does not maintain queues. So the above will cause the pgtable_cache to use the caches of the page allocator. The queueing effect that you get from SLAB is not present in SLUB since it does not provide them. If SLUB is to be used this way then we need to have higher order page sizes and allocate chunks from the higher order page for the pgtable_cache. There are other ways of doing it. IA64 f.e. uses a linked list to accomplish the same avoiding SLAB overhead. Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should not be set here, but SLUBs change in semantics in this area could cause similar grief in other areas, an audit is probably in order. The above example was from sparc64, but x86 does the same thing as probably do other platforms which use SLAB for pagetables. Maybe this will address these concerns? Index: linux-2.6.21-rc2/mm/slub.c === --- linux-2.6.21-rc2.orig/mm/slub.c 2007-02-28 16:54:23.0 -0800 +++ linux-2.6.21-rc2/mm/slub.c 2007-02-28 17:03:54.0 -0800 @@ -1229,8 +1229,10 @@ static int calculate_order(int size) static unsigned long calculate_alignment(unsigned long flags, unsigned long align) { - if (flags (SLAB_MUST_HWCACHE_ALIGN|SLAB_HWCACHE_ALIGN)) + if (flags SLAB_HWCACHE_ALIGN) return L1_CACHE_BYTES; + if (flags SLAB_MUST_HWCACHE_ALIGN) + return max(align, (unsigned long)L1_CACHE_BYTES); if (align ARCH_SLAB_MINALIGN) return ARCH_SLAB_MINALIGN; - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: [PATCH] SLUB The unqueued slab allocator V3
From: Christoph Lameter [EMAIL PROTECTED] Date: Wed, 28 Feb 2007 17:06:19 -0800 (PST) On Wed, 28 Feb 2007, David Miller wrote: Arguably SLAB_HWCACHE_ALIGN and SLAB_MUST_HWCACHE_ALIGN should not be set here, but SLUBs change in semantics in this area could cause similar grief in other areas, an audit is probably in order. The above example was from sparc64, but x86 does the same thing as probably do other platforms which use SLAB for pagetables. Maybe this will address these concerns? Index: linux-2.6.21-rc2/mm/slub.c === --- linux-2.6.21-rc2.orig/mm/slub.c 2007-02-28 16:54:23.0 -0800 +++ linux-2.6.21-rc2/mm/slub.c2007-02-28 17:03:54.0 -0800 @@ -1229,8 +1229,10 @@ static int calculate_order(int size) static unsigned long calculate_alignment(unsigned long flags, unsigned long align) { - if (flags (SLAB_MUST_HWCACHE_ALIGN|SLAB_HWCACHE_ALIGN)) + if (flags SLAB_HWCACHE_ALIGN) return L1_CACHE_BYTES; + if (flags SLAB_MUST_HWCACHE_ALIGN) + return max(align, (unsigned long)L1_CACHE_BYTES); if (align ARCH_SLAB_MINALIGN) return ARCH_SLAB_MINALIGN; It would achiever parity with existing SLAB behavior, sure. - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/