I've been looking at the performance of SPI calls within plpython.
There's roughly a 1.5x slowdown versus equivalent Python code just in
pulling data out of the SPI tuplestore. Some of that is due to an
inefficiency in how plpython creates result dictionaries, but fixing
that is ultimately a dead end: if you're dealing with a lot of results
in Python, you want a tuple of arrays, not an array of tuples.
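To make the shape concrete, here's a rough sketch (the body of a
PL/Python function, so plpy is in scope; the query and table are made
up) of transposing the row-major result plpython returns today into
the column-major form I mean:

    rv = plpy.execute("SELECT a, b FROM some_table")

    # What plpython effectively hands back today: one dict per row.
    rows = [dict(r) for r in rv]

    # What bulk processing in Python actually wants: one list per
    # column, transposed here with yet another pass over the rows.
    cols = {name: [r[name] for r in rv] for name in rv.colnames()}

Note that the transpose itself is an extra pass and an extra copy on
top of everything SPI already did, which is why building the arrays
directly would be the real win.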
While we could just brute-force a tuple of arrays by plowing through the
SPI tuplestore (this is what PL/R does), there's still a lot of extra
work involved in doing that. AFAICT there are at least two copies
between the executor producing a tuple and that tuple landing in the
tuplestore, plus the tuplestore consumes a potentially very large
amount of memory for a very short period of time before all the data
gets duplicated (again) into Python objects.
It would be a lot more efficient if we could grab datums directly from
the executor and make a single copy into plpython (or R), letting the
PL handle all of the memory management.
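To be concrete about the PL-facing shape, something like this purely
hypothetical generator is what I have in mind (plpy.stream() does not
exist; it's only to illustrate one-tuple-at-a-time consumption):

    # HYPOTHETICAL: plpy.stream() is not a real API. The idea is a
    # generator that copies each tuple out of the executor exactly
    # once and never materializes a tuplestore.
    total = 0
    for row in plpy.stream("SELECT a FROM some_big_table"):
        total += row["a"]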
I briefly looked at using SPI cursors to do just that, but that looks
even worse: every fetch is executed in a subtransaction, and every fetch
creates an entire tuplestore even if it's just going to return a single
value. (But hey, we never claimed cursors were fast...)
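For reference, the cursor pattern I tried looks something like this (a
sketch; the table is made up), and it pays the
subtransaction-plus-tuplestore cost on every fetch() call:

    cur = plpy.cursor("SELECT a FROM some_big_table")
    while True:
        # Each fetch: a subtransaction and a fresh tuplestore.
        chunk = cur.fetch(1000)
        if len(chunk) == 0:
            break
        for row in chunk:
            pass  # process row["a"]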
Is there any way to avoid all of this? I'm guessing one issue might be
that we don't want to call an external interpreter while potentially
holding page pins, but even then couldn't we just copy a single tuple at
a time and save a huge amount of palloc overhead?
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com