Re: Improving estimates for TPC-H Q2

2020-05-08 Thread Tomas Vondra
On Fri, May 08, 2020 at 07:58:39AM -0400, Matt Daw wrote:
> Hi Tomas, there’s an interesting related paper in the April 2020 PVLDB, “Quantifying TPC-H Choke Points and Their Optimizations”: http://www.vldb.org/pvldb/vol13/p1206-dreseler.pdf.

Thanks. Seems like an interesting and new paper, alth

Re: Improving estimates for TPC-H Q2

2020-05-08 Thread Matt Daw
Hi Tomas, there’s an interesting related paper in the April 2020 PVLDB, “Quantifying TPC-H Choke Points and Their Optimizations”: http://www.vldb.org/pvldb/vol13/p1206-dreseler.pdf.

Matt

Improving estimates for TPC-H Q2

2020-05-07 Thread Tomas Vondra
Hi, I've been re-running the TPC-H benchmark to remind myself of the common issues with OLAP workloads, and one of the most annoying problems seems to be the misestimates in Q2. The query is not particularly complex, although it does have a correlated subquery with an aggregate, but it's one of the