[PERFORM] OLAP/reporting queries fall into nested loops over seq scans or other horrible planner choices

Gunther Wed, 01 Nov 2017 17:29:59 -0700

Hi, this is Gunther, have been with PgSQL for decades, on an off thislist. Haven't been on for a long time making my way just fine. But thereis one thing that keeps bothering me both with Oracle and PgSQL. Andthat is the preference for Nested Loops.

Over the years the archives have questions about Nested Loops beingchosen over Hash Joins. But the responses seem too specific to thepeople's queries, ask many questions, make them post the query plans,and often end up frustrating with suggestions to change the data modelor to add an index and stuff like that.


One should not have to go into that personal detail.

There are some clear boundaries that a smart database should just nevercross.

Especially with OLAP queries. Think a database that is fine for OLTP,has indexes and the index based accesses for a few records joined with adozen other tables all with indexes is no problem. If you fall into aSeq Scan scenario or unwanted Hash Join, you usually forgot to add anindex or forgot to put index columns into your join or otherconstraints. Such are novice questions and we should be beyond that.

But the issue is bulk searches, reports, and any analytic queriesscenarios. In those queries Nested Loops are almost always a bad choice,even if there is an index. In over 20 years of working with RDBMs thishas been my unfailing heuristics. A report runs slow? Look at plan, isthere a Nested Loop? Yes? Squash it! And the report runs 10x fasterinstantaneously.

So, all the more troublesome is if any database system (here PgSQL)would ever fall into a Nested Loop trap with CPU spinning at 100% forseveral minutes, with a Nested Loop body of anything from a Seq Scan orworse with a cardinality of anything over 10 or 100. Nested Loops ofNested Loops or Nested Loops of other complex query plan fragmentsshould be a no-no and chosen only as an absolute last resort when thesystem cannot find enough memory, even then disk based merge sort shouldbe better, i.e., Nested Loops should never be chosen. Period.

If you can set enable_nestloop off and the Hash Join is chosen and theperformance goes from 1 hour of 100% CPU to 10 seconds completion time,then something is deadly wrong. And it doesn't matter to me if I shouldhave re-written my query in some funny ways or tweaked my data model,these are all unacceptable options when you have a complex system withhybrid OLTP/OLAP uses. Don't tell me to de-normalize. I know I canmaterialize joins in tables which I can then use again in joins to savetime. But that is not the point here.

And I don't think tweaking optimizer statistics is the solution either.Because optimizer statistics quickly become worthless when your criteriaget more complex.

The point is that Nested Loops should never be chosen except in indexlookup situations or may be memory constraints.

How can I prevent it on a query by query scope? I cannot setenable_nestloop = off because one query will be for a full report, wileanother one might have indexed constraints running in the same session,and I don't want to manage side effects and remember to setenable_nestloop parameter on and off.

There must be a way to tell the optimizer to penalize nested loops tomake them the last resort. In Oracle there are those infamous hints, butthey don't always work either (or it is easy to make mistakes that youget no feedback about).

Is there any chance PgSQL can get something like a hint feature? Or isthere a way to use postgresql.conf to penalize nested loops so that theywould only ever be chosen in the most straight-forward situations aswith query parameters that are indexed? I know I need to have sufficientwork_mem, but if you can set enable_nestloop = off and you get thedesired Hash Join, there is obviously sufficient work_mem, so that isn'tthe answer either.


Thanks for listening to my rant.

regards,
-Gunther



--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

[PERFORM] OLAP/reporting queries fall into nested loops over seq scans or other horrible planner choices

Reply via email to