Thanks Jens!

On Mon, 22 Apr 2019, Jens Alfke wrote:

> But yeah, I agree with you that it seems odd to have a compiled-in restriction on the maximum memory-map size.

I looked a bit into the history, and it appears the just-under-2GB limit was specifically put there (by drh) so that the value would fit into a signed 32-bit integer:

        https://www.sqlite.org/src/info/460752b857532016

Maybe this was so the value would fit into an ssize_t on 32-bit platforms? (Just a guess, since mmap() takes a size_t length.)

If that's the reason, would drh & the sqlite team consider changing the default SQLITE_MAX_MMAP_SIZE definition from 0x7fff0000 to SSIZE_MAX (defined in limits.h)?


...


Somewhat of an aside:

> Most current OSs have a universal buffer cache, wherein filesystem caches and VM pages use the same RAM buffers. A page-fault and a file read will incur similar amounts of work. The big benefit is that the memory-mapped pages can be evicted from RAM when needed for other stuff, whereas a malloc-ed page cache is considered dirty and has to be swapped out before the RAM page can be reused.

Right, I think we're agreeing here. I just mean that if a db file is much larger than the available RAM, and all of it is used by a query, then whether the bytes come in via mmap+memcpy() or a regular file read(), the kernel will fault the missing pages into its page cache, and (at its discretion) it will eventually have to evict those pages to make room for others. (I didn't mean to refer to disk swap or to sqlite's user-space page cache.) The point is that the kernel's page loading/eviction work is the same either way, but mmap also saves the overhead of the seek/read system calls.

And I say this from my personal experience with sqlite: when running a long join on a db that did not fit into RAM, with the timer on, I observed that in the regular non-mmap mode the system time for the query was about equal to the user time. But when I mmap'ed the whole file, the system time for the query was < 1% of the user time, and the user time was also less than it had been in non-mmap mode. I can only guess that the huge difference in system time was mainly the kernel handling all the seek/read system calls, versus a plain memcpy when the whole db file is mmap'ed.

The bottom line, in any case, is that I saw a substantial speedup for long queries when mmap'ing a large db, even when it did not fit into RAM.


Thanks..!

Carl
_______________________________________________
sqlite-users mailing list
[email protected]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users
