On 06/02/2017 02:38 AM, Antoine Pitrou wrote:
> I hope those are not the actual numbers you're intending to use ;-)
> I still think that allocating more than 1 or 2MB at once would be
> foolish.  Remember this is data that's going to be carved up into
> (tens of) thousands of small objects.  Large objects eschew the small
> object allocator (not to mention that third-party libraries like Numpy
> may be using different allocation routines when they allocate very
> large data).

Honestly, I'm well aware of what obmalloc does and how it works. I bet I've spent more time crawling around in it in the last year than anybody else on the planet. Mainly because it works so well for CPython, nobody else has needed to bother!

I'm also aware, for example, that if your process grows to consume gigabytes of memory, you're going to have tens of thousands of allocated arenas. The idea that on systems with gigabytes of memory--90%+? of current systems running CPython--we should allocate memory forever in 256kb chunks is faintly ridiculous. I agree that we should start small and ramp up slowly, so Python continues to run well on small computers and doesn't allocate tons of memory for small programs. But I also think we should ramp up *eventually*, for programs that use tens or hundreds of megabytes.

Also note that if we don't touch the allocated memory, smart modern OSes won't actually commit any resources to it. All that happens when your process allocates 1GB is that the OS changes some integers around; it doesn't actually commit any memory to your process until you attempt to write to that memory, at which point it gets mapped in, one local-page-size chunk at a time (4k? 8k? something in that neighborhood and power-of-2 sized). So if we allocate 32mb and only touch the first 1mb, the other 31mb doesn't consume any real resources. I was planning to make the multi-arena code touch memory only when it actually needs to, similar to the way obmalloc lazily consumes memory inside an allocated pool (see the nextoffset field in pool_header), to take advantage of this ubiquitous behavior.
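
Here's a quick, Linux-only sketch of that behavior using the mmap module (just an illustration, not anything destined for obmalloc); the 32mb reservation and the 1mb touch are arbitrary, and the exact RSS delta will vary a bit:

    import mmap

    def rss_kb():
        # Linux-specific: read this process's resident set size from the kernel.
        with open("/proc/self/status") as f:
            for line in f:
                if line.startswith("VmRSS:"):
                    return int(line.split()[1])

    reserved = mmap.mmap(-1, 32 * 1024 * 1024)      # "allocate" 32mb, anonymously
    before = rss_kb()
    for offset in range(0, 1024 * 1024, mmap.PAGESIZE):
        reserved[offset:offset + 1] = b"\x01"       # touch one byte per page, first 1mb only
    print(f"RSS grew by ~{rss_kb() - before} kb")   # roughly 1024 kb, not 32768 kb

The OS hands over physical pages only for the addresses we actually write to; the untouched 31mb stays as mere bookkeeping in the page tables.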


If I write this multi-arena code, which I might, I was thinking I'd try this approach:

 * leave arenas themselves at 256k
 * start with a 1MB multi-arena size
 * every time I allocate a new multi-arena, multiply the size of the
   next multi-arena by 1.5 (rounding up to 256k each time)
 * every time I free a multi-arena, divide the size of the next
   multi-arena by 2 (rounding up to 256k each time)
 * if allocation of a multi-arena fails, use a binary search algorithm
   to allocate the largest multi-arena possible (rounding up to 256k at
   each step)
 * cap the size of multi-arenas at, let's say, 32mb

So multi-arenas would be 1mb, 1.5mb, 2.25mb, 3.5mb (round up!), etc.
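
For concreteness, here's a throwaway sketch of that growth schedule (the names are mine, nothing from obmalloc, and it only models the grow-by-1.5x path, not the shrink-on-free or the binary-search fallback):

    import itertools

    ARENA = 256 * 1024        # leave arenas themselves at 256k
    CAP = 32 * 1024 * 1024    # cap multi-arenas at 32mb

    def round_up_to_arena(nbytes):
        # Round up to the next multiple of the 256k arena size.
        return -(-nbytes // ARENA) * ARENA

    def multi_arena_sizes(start=1024 * 1024, factor=1.5):
        size = start
        while True:
            yield size
            size = min(round_up_to_arena(int(size * factor)), CAP)

    for size in itertools.islice(multi_arena_sizes(), 6):
        print(f"{size / 2**20:g}mb ({size // ARENA} arenas)")
    # 1mb (4 arenas), 1.5mb (6), 2.25mb (9), 3.5mb (14), 5.25mb (21), 8mb (32)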


Fun fact: Python allocates 16 arenas at the start of the program, just to initialize obmalloc. That consumes 4mb of memory. With the above multi-arena approach, that'd allocate the first three multi-arenas, pre-allocating 19 arenas, leaving 3 unused. It's *mildly* tempting to make the first multi-arena be 4mb, just so this is exactly right-sized, but... naah.
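
To check my arithmetic on the 19, a scratch calculation:

    ARENA = 256 * 1024
    first_three = [2**20, int(1.5 * 2**20), int(2.25 * 2**20)]   # 1mb, 1.5mb, 2.25mb
    arenas = [size // ARENA for size in first_three]
    print(arenas, sum(arenas))   # [4, 6, 9] -> 19 arenas; 16 needed, 3 left over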


//arry/