Re: Optimum size for malloc() allocation?

Jeffrey Kegler Sun, 05 Jan 2014 07:13:14 -0800

Thanks for gathering these references, which I read or re-read. Readingthem and thinking, I was reminded to have another look at LinusTorvald's git, which has its own very interesting memory allocator,which Linus calls "slabs". Linus's allocator gets its memory frommalloc() in big slabs, which he sizes as follows:

/* allocate ~512kB at once, allowing for malloc overhead */
#ifndef COMMIT_SLAB_SIZE
#define COMMIT_SLAB_SIZE (512*1024-32)
#endif

So Linus's answer seems to be allocate a good-sized power of two, less32 btyes to allow for overhead. It sounds reasonable, and Linus tendsto know on these matters.

So, in the Marpa case, 4096-32 is probably the best choice. (I wastempted to make it 4096-42, but I suspect Linus picked a power of twobecause it is most efficient in a buddy-system allocator.)


-- jeffrey

On 01/05/2014 02:59 AM, Ruslan Shvedov wrote:

From what the obstacks authors say it looks like if _range checking_is _off_ /[1]/ you can still use 4K as the best size (16 bytes extralooks like a safe bet otherwise /[2]/) and "less sensitive to the sizeof the request" as applies to "the new" GNU C malloc seems to meanthat it does not align block sizes to powers of 2 /[4]/ (as obstackdoc says /[3]/) and just uses 4K fixed block size /[5]/, well, most ofthe time /[6]/.
Overall, the absolutely best way for an application to help malloc()seems to be determine the pagesize /[7] /and align to it, e.g. bysetting obstack default chunk size to it. Otherwise, 4K would well bethe optimal size.
Not exactly what's asked, but hopefully helpful.

References:
/
/
/[1] /"12 is sizeof (mhead) and 4 is EXTRA from GNU malloc.
Use the values for range checking, because if range checking is off,
the extra bytes won't be missed terribly, but _if range checking is on_
and we used a larger request, a whole extra 4096 bytes would be
allocated."
"These number are irrelevant to the new GNU malloc. I suspect it is
less sensitive to the size of the request."
—https://github.com/jeffreykegler/Marpa--R2/blob/master/cpan/libmarpa/obs/marpa_obs.c
/[2]/ Allocated memory contains an 8 or 16 byte overhead for the sizeof the chunk and usage flags.
— http://en.wikipedia.org/wiki/C_dynamic_memory_allocation#dlmalloc
/[3]/ If you allocate chunks with malloc, the chunk size should be apower of 2.
— http://gcc.gnu.org/onlinedocs/libiberty/Obstack-Chunks.html
/[4] /the malloc in the GNU C Library _does not round up block sizesto powers of two_, neither for large nor for small sizes—http://www.gnu.org/software/libc/manual/html_node/Efficiency-and-Malloc.html#Efficiency-and-Malloc
/[5] /The /new GNU malloc/ improves on things by allocating largeobjects in chunks of 4096 bytes _rather than in ever larger powers oftwo_, which results in ever larger wastage.
— http://www.sxemacs.org/docs/internals/Low_002dlevel-allocation.html
/[6]/ To the extent possible, _this malloc manages memory from thesystem in page-size units_.
#    define malloc_getpagesize (4096) /* just guess */

malloc_getpagesize   (default: derived from system #includes)
—http://web.mit.edu/course/13/13.715/build/lam-7.1.3/share/memory/ptmalloc/ptmalloc.c
/[7] /
Most operating systems allow programs to discover the page size atruntime. This allows programs to use memory _more efficiently byaligning allocations to this size_ and reducing overall internalfragmentation of pages.—http://en.wikipedia.org/wiki/Page_size#Determining_the_page_size_in_a_program
On Sun, Jan 5, 2014 at 7:49 AM, Jeffrey Kegler<[email protected]<mailto:[email protected]>> wrote:
    This is a bit of a long story, but it's a question which I've
    tried to research without must progress, and perhaps someone on
    the list can help.  I've finished up my rework of Marpa's memory
    allocator.  It's based on GNU's obstacks, but very heavily hacked
    at this point.  It's extremely efficient, but efficiency is *not*
    its most important advantage.  Marpa has lots of complex data
    structures which can be very hard to destroy, particularly under
    error conditions -- they may still be under construction.  If you
    look at Ben Pfaff's AVL code, his most complex logic is not for
    the AVL data structures themselves, but the code that makes sure
    his AVL's don't leak memory.

    Obstacks eliminate my having to worry about this.  Each object
    (recognizer, grammar, etc.) has an obstack and almost everything
    is allocated from that obstack.  When the object is destroyed,
    there's no need to follow all the links, make sure everything is
    destroyed in the right order, etc., etc. -- I just delete the
    obstack in which everything was allocated.

    Marpa's obstacks have the disadvantage that I cannot resize
    allocations.  I also cannot free any of the memory without
    destroying the entire obstack.  In Marpa, I rarely need to do
    either of these things.  For the few cases where obstacks fall
    short of what I need, I just fall back on malloc().
Marpa's obstacks get their memory in big blocks from malloc().For obstacks, I have a good deal of freedom in deciding how much
    memory to request.  My question is, what is the best size?

    I am currently followig the strategy used by GNU's obstacks.  They
    allocated, by default, 4K less a complex calculation based on the
    size of various malloc() headers, etc.  Their idea was that if
    they ask for exactly 4K, GNU's malloc(), because of the extra few
bytes needed for headers, would need to allocate an extra block.That suggests that doing many allocations, all of exactly 4K, is
    an almost pessimal way to use malloc().  But the GNU obstack
    authors append a comment noting that they think their analysis, as
    of the latest GNU malloc, may be obsolete -- that lots of
    allocations of exactly 4K are no longer worse than any other strategy.

    Most helpful for me would be pointers to the writeups where this
    issue is addressed.  I've tried to research this, but the papers
    that I find focus on how well they deal with difficult mixes of
    allocation sizes, and not on how an application with a lot of
    flexibility can help malloc() out.

    Thanks, jeffrey
--You received this message because you are subscribed to the Google
    Groups "marpa parser" group.
    To unsubscribe from this group and stop receiving emails from it,
    send an email to [email protected]
    <mailto:marpa-parser%[email protected]>.
    For more options, visit https://groups.google.com/groups/opt_out.


--
You received this message because you are subscribed to the GoogleGroups "marpa parser" group.To unsubscribe from this group and stop receiving emails from it, sendan email to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


--
You received this message because you are subscribed to the Google Groups "marpa 
parser" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Re: Optimum size for malloc() allocation?

Reply via email to