Re: Bitmap scan is undercosted?

Vitaliy Garnashevich Sun, 03 Dec 2017 13:15:42 -0800

On 02/12/2017 23:17, Jeff Janes wrote:

Right, so there is a cpu costing problem (which could only be fixed byhacking postgresql and recompiling it), but it is much smaller of aproblem than the IO cost not being accurate due to the high hit rate.Fixing the CPU costing problem is unlikely to make a difference toyour real query. If you set the page costs to zero, what happens toyour real query?

I can't reproduce the exact issue on the real database any more. Thequery started to use the slow bitmap scan recently, and had been doingso for some time lately, but now it's switched back to use the indexscan. The table involved in the query gets modified a lot. It hashundreds of millions of rows. Lots of new rows are appended to it everyday, the oldest rows are sometimes removed. The table is analyzed atleast daily. It's possible that statistics was updated and that causedthe query to run differently. But I still would like to understand whythat issue happened, and how to properly fix it, in case the issue returns.

    But I doubt that the settings seq_page_cost = random_page_cost =
    0.0 should actually be used.
Why not? If your production server really has everything in memoryduring normal operation, that is the correct course of action. If youever restart the server, then you could have some unpleasant timegetting it back up to speed again, but pg_prewarm could help with that.

In the real database, not everything is in memory. There are 200GB+ ofRAM, but DB is 500GB+. The table involved in the query itself is 60GB+of data and 100GB+ of indexes. I'm running the test case in a way whereall reads are done from RAM, only to make it easier to reproduce and toavoid unrelated effects.

As far as know, costs in Postgres were designed to be relative toseq_page_cost, which for that reason is usually defined as 1.0. Even ifeverything would be in RAM, accesses to the pages would still not havezero cost. Setting 0.0 just seems too extreme, as all other non-zerocosts would become infinitely bigger.

If you really want to target the plan with the BitmapAnd, you shouldincrease cpu_index_tuple_cost and/or cpu_operator_cost but notincrease cpu_tuple_cost. That is because the unselective bitmapindex scan does not incur any cpu_tuple_cost, but does incurindex_tuple and operator costs. Unfortunately all other index scansin the system will also be skewed by such a change if you make thechange system-wide.

Exactly. I'd like to understand why the worse plan is being chosen, and1) if it's fixable by tuning costs, to figure out the right settingswhich could be used in production, 2) if there is a bug in Postgresoptimizer, then to bring some attention to it, so that it's eventuallyfixed in one of future releases, 3) if Postgres is supposed to work thisway, then at least I (and people who ever read this thread) wouldunderstand it better.


Regards,
Vitaliy

Re: Bitmap scan is undercosted?

Reply via email to