Re: [HACKERS] GIN improvements part2: fast scan

Heikki Linnakangas Wed, 12 Mar 2014 09:30:00 -0700

On 03/12/2014 12:09 AM, Tomas Vondra wrote:

Hi all,


a quick question that just occurred to me - do you plan to tweak the cost
estimation for GIN indexes, in this patch?

IMHO it would be appropriate, given the improvements and gains, but it
seems to me gincostestimate() was not touched by this patch.

Good point. We have made two major changes to GIN in this release cycle: changed the data page format and made it possible to skip items without fetching all the keys ("fast scan"). gincostestimate doesn't know about either change.

Adjusting gincostestimate for the more compact data page format seems easy. When I hacked on that, I assumed all along that gincostestimate doesn't need to be changed as the index will just be smaller, which will be taken into account automatically. But now that I look at gincostestimate, it assumes that the size of one item on a posting tree page is a constant 6 bytes (SizeOfIptrData), which is no longer true. I'll go fix that.
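
To illustrate why a fixed per-item size matters, here is a rough back-of-the-envelope sketch in Python. It is not the gincostestimate code: the function name, the 2-byte average for the compact format, and the choice to ignore page headers are all assumptions made for illustration. Only the 6-byte figure and the 8192-byte default block size are taken as given.

```python
import math

PAGE_SIZE = 8192          # default PostgreSQL block size; page headers ignored (assumption)
OLD_BYTES_PER_ITEM = 6    # the constant item size the estimate currently assumes


def estimated_data_pages(n_items: int, bytes_per_item: float,
                         page_size: int = PAGE_SIZE) -> int:
    """Hypothetical helper: pages needed to hold n_items of a given average size."""
    return math.ceil(n_items * bytes_per_item / page_size)


n_items = 1_000_000

# Estimate using the fixed 6-byte assumption.
pages_fixed = estimated_data_pages(n_items, OLD_BYTES_PER_ITEM)

# Estimate using a made-up 2-byte average for the compact format.
pages_compact = estimated_data_pages(n_items, 2.0)

print(pages_fixed, pages_compact)  # 733 245
```

With these made-up numbers, the fixed-size assumption predicts about three times as many posting tree pages as the smaller average item size would.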

Adjusting for the effects of skipping is harder. gincostestimate needs to do the same preparation steps as startScanKey: sort the query keys by frequency, and call the consistent function to split the keys into "required" and "additional" sets. And then model that the "additional" entries only need to be fetched when the other keys match. That's doable in principle, but requires a bunch of extra code.
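
The steps described above could be sketched roughly as follows, in Python for brevity. This is a toy model under stated assumptions, not the startScanKey or gincostestimate code: the function names, the tri-state convention (False = no match, None = maybe), the rarest-first sort order, and the crude "candidates" bound are all hypothetical choices made for illustration.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Tri-state input for one key: False = known not to match, None = maybe.
Tri = Optional[bool]
Consistent = Callable[[Dict[str, Tri]], Tri]


def consistent_and(state: Dict[str, Tri]) -> Tri:
    """Toy consistent function for 'k1 AND k2 AND ...'."""
    return False if any(v is False for v in state.values()) else None


def consistent_or(state: Dict[str, Tri]) -> Tri:
    """Toy consistent function for 'k1 OR k2 OR ...'."""
    return False if all(v is False for v in state.values()) else None


def split_keys(freq: Dict[str, int],
               consistent: Consistent) -> Tuple[List[str], List[str]]:
    """Split keys into (required, additional) sets.

    Keys are sorted by estimated frequency, rarest first (assumption).
    The required set is the shortest prefix such that, if none of its
    keys match, the consistent function says the query cannot match.
    """
    order = sorted(freq, key=lambda k: freq[k])
    for i in range(len(order) - 1):
        state = {k: (False if j <= i else None) for j, k in enumerate(order)}
        if consistent(state) is False:
            return order[:i + 1], order[i + 1:]
    return order, []


def estimate_items_fetched(freq: Dict[str, int], consistent: Consistent) -> int:
    """Crude model: required entries are read in full; each additional
    entry is probed at most once per candidate from the required set."""
    required, additional = split_keys(freq, consistent)
    candidates = sum(freq[k] for k in required)
    return candidates + sum(min(freq[k], candidates) for k in additional)


freq = {"foo": 10, "bar": 1000}
print(split_keys(freq, consistent_and))              # (['foo'], ['bar'])
print(estimate_items_fetched(freq, consistent_and))  # 20
print(estimate_items_fetched(freq, consistent_or))   # 1010
```

In this toy model an AND query over a rare and a frequent key costs about as much as two scans of the rare key, while an OR query puts every key in the required set and sees no benefit from skipping.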

Alexander, any thoughts on that? It's getting awfully late to add new code for that, but it sure would be nice to somehow take fast scan into account.


- Heikki


