Re: Propose a new hook for mutating the query bounds

Tomas Vondra Wed, 17 Nov 2021 10:47:48 -0800

On 11/17/21 16:39, Xiaozhe Yao wrote:

Hi Tom,
Thanks for your feedback. I completely agree with you that ahigher-level hook is better suited for this case. I have adjusted thePoC patch to this email.
Now it is located in the clauselist_selectivity_ext function, where wefirst check if the hook is defined. If so, we let the hook estimate theselectivity and return the result. With this one, I can also developextensions to better estimate the selectivity.

I think clauselist_selectivity is the right level, because this ispretty similar to what extended statistics are doing. I'm not sure ifthe hook should be called in clauselist_selectivity_ext or in the plainclauselist_selectivity. But it should be in clauselist_selectivity_ortoo, probably.

The way the hook is used seems pretty inconvenient, though. I mean, ifyou do this


    if (clauselist_selectivity_hook)
        return clauselist_selectivity_hook(...);

then what will happen when the ML model has no information applicable toa query? This is called for all relations, all conditions, etc. andyou've short-circuited all the regular code, so the hook will have tocopy all of that. Seems pretty silly and fragile.

IMO the right approach is what statext_clauselist_selectivity is doing,i.e. estimate clauses, mark them as estimated in a bitmap, and let therest of the existing code take care of the remaining clauses. So moresomething like


    if (clauselist_selectivity_hook)
        s1 *= clauselist_selectivity_hook(..., &estimatedclauses);


regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Re: Propose a new hook for mutating the query bounds

Reply via email to