Re: [HACKERS] Postgres vs. intel ccNUMA on Linux

Greg Smith Thu, 30 Sep 2010 18:33:08 -0700

James Robinson wrote:

Any tips / conventional wisdom regarding running postgres onlarge-ish memory ccNUMA intel machines, such as a 32G dual-quad-core,showing two NUMA nodes of 16G each? I expect each postgres backend'snon-shared memory usage to remain nice and reasonably sized, hopefullystaying within the confines of its processor's local memory region,but how will accesses to shared memory and / or buffer cache play out?Do people tune their backends via 'numactl' ?

My gut feel here is that the odds this particular area will turn intoyour bottleneck is so slim that worrying about it in advance ispremature optimization. If you somehow end up in the unexpectedsituation where processor time that might be altered via such finecontrol is your bottleneck, as opposed to disks, buffer cachecontention, the ProcArray contention Robert mentioned, WAL contention,or something else like that--all things you can't segment usefullyhere--well maybe at that point I'd start chasing after numactl. As forhow likely that is, all I can say is I've never gotten there beforefinding a much more obvious bottleneck first.

However, I recently wrote a little utility to test memory speeds asincreasing numbers of clients do things on a system, and it may provideyou some insight into how your system responds as different numbers ofthem do things: http://github.com/gregs1104/stream-scaling

I've gotten results submitted to me where you can see memory speedsfluctuate on servers where threads bounce between processors and theirassociated memory, stuff that goes away if you then lock the testprogram to specific cores. If you want to discuss results from tryingthat on your system and how that might impact real-world serverbehavior, I'd recommend posting about that to the pgsql-performance listrather than this one. pgsql-hackers is more focused on code-levelissues with PostgreSQL. There really aren't any of those in the areayou're asking about, as the database is blind to what the OS is doingunderneath of it here.

Furthermore, if one had more than one database being served by themachine, would it be advisable to do this via multiple clustersinstead of a single cluster, tweaking the processor affinity of eachpostmaster accordingly, trying to ensure each cluster's shared memorysegments and buffer cache pools remain local for the resulting backends?

If you have a database that basically fits in memory, that mightactually work. Note however that the typical useful tuning forPostgreSQL puts more cache into the operating system side of things thanwhat's dedicated to the database, and that may end up mixed across"banks" as it were. I'd still place my money on running into anotherlimitation first, but the idea is much more sound . What I would trydoing here is running the SELECT-only version of pgbench against bothclusters at once, and see if you really can get more total oomph out ofthe system than a single cluster of twice the size. The minute disksstart entering the picture though, you're likely to end up back to whereprocessor/memory affinity is the least of your concerns.


--
Greg Smith, 2ndQuadrant US [email protected] Baltimore, MD
PostgreSQL Training, Services and Support  www.2ndQuadrant.us
Author, "PostgreSQL 9.0 High Performance"    Pre-ordering at:
https://www.packtpub.com/postgresql-9-0-high-performance/book


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Postgres vs. intel ccNUMA on Linux

Reply via email to