Re: POC: GROUP BY optimization

Andrei Lepikhov Thu, 11 Apr 2024 22:05:37 -0700

On 4/12/24 06:44, Tom Lane wrote:

If this patch were producing better results I'd be more excited
about putting more work into it.  But on the basis of what I'm
seeing right now, I think maybe we ought to give up on it.

First, thanks for the deep review - sometimes, only a commit gives us achance to get such observation :))).On a broader note, introducing automatic group-by-order choosing is astep towards training the optimiser to handle poorly tuned incomingqueries. While it's true that this may initially impact performance,it's crucial to weigh the potential benefits. So, beforehand, we shouldagree: Is it worth it?If yes, I would say I see how often hashing doesn't work in grouping.Sometimes because of estimation errors, sometimes because groupingalready has sorted input, sometimes in analytical queries when plannerdoesn't have enough memory for hashing. In analytical cases, the onlyway to speed up queries sometimes is to be smart with features likeIncrementalSort and this one.About low efficiency. Remember the previous version of the GROUP-BYoptimisation - we disagreed on operator costs and the cost model ingeneral. In the current version, we went the opposite - adding smallfeatures step-by-step. The current commit contains an integral part ofthe feature and is designed for safely testing the approach and addingmore profitable parts like choosing group-by-order according to distinctvalues or unique indexes on grouping columns.I have passed through the code being steered by the issues explained indetail. I see seven issues. Two of them definitely should be scrutinisedright now, and I'm ready to do that.


--
regards,
Andrei Lepikhov
Postgres Professional

Re: POC: GROUP BY optimization

Reply via email to