On 9/22/25 15:57, Andrei Lepikhov wrote:
On 22/9/2025 15:37, Frédéric Yhuel wrote:
I wonder if this is an argument in favour of decoupling the sample size and the precision of the statistics. Here, we basically want the sample size to be as big as the table in order to include the few (NULL, WARNING) values.
I have also seen repeated ANALYZE runs on the same database drastically change query plans ;(. It seems to me that with massive samples, many of the ANALYZE algorithms should be rewritten. In principle, the statistics hooks already exist, so it is possible to write an independent table analyser that scans the whole table to gather precise statistics.


Interesting! I wonder how difficult it would be.
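(For context, the only knob we have today couples the two: raising the per-column statistics target enlarges the sample -- about 300 rows per unit of target, if I recall correctly -- but it also enlarges the MCV list and the histogram. With a placeholder table name, something like:

    ALTER TABLE alerts ALTER COLUMN ackid SET STATISTICS 10000;
    ANALYZE alerts;

caps out at a ~3M-row sample, still far from a full scan on a big table.)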

However, in this specific case, I realised that it wouldn't solve the issue: if ANALYZE happens to be triggered at a moment when there are zero rows with (ackid, crit) = (NULL, WARNING), even a full-table scan produces statistics that say such rows don't exist.
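To make it concrete (names are just placeholders): right after such an ANALYZE,

    SELECT null_frac, most_common_vals
    FROM pg_stats
    WHERE tablename = 'alerts' AND attname = 'ackid';

reflects the whole table at that instant, with no trace of the (NULL, WARNING) combination, so the size of the sample is not the limiting factor here.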

Partitioning would still work in this case, though, because ackid's null_frac would be zero for the partition containing the 'WARNING' value.
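Roughly, with placeholder names:

    CREATE TABLE alerts (
        id    bigint,
        ackid bigint,
        crit  text
    ) PARTITION BY LIST (crit);

    CREATE TABLE alerts_warning PARTITION OF alerts
        FOR VALUES IN ('WARNING');
    CREATE TABLE alerts_other PARTITION OF alerts DEFAULT;

ANALYZE keeps per-partition statistics, so pg_stats for alerts_warning would only reflect the 'WARNING' rows.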

I wonder if we could devise another kind of extended statistic that would provide these "partitioned statistics" without actually partitioning.
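The closest existing thing is probably a multi-column MCV list (again with placeholder names):

    CREATE STATISTICS alerts_ackid_crit (mcv) ON ackid, crit FROM alerts;
    ANALYZE alerts;

but it only records combinations that actually show up in the sample, which is exactly what fails here.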


