[HACKERS] DISTINCT vs. GROUP BY

Hans-Jürgen Schönig Mon, 19 Sep 2005 07:33:11 -0700

I was wondering whether it is possible to teach the planner to handle DISTINCT in a more efficient way:


em=# explain select distinct lastname from import.testtest;
                                   QUERY PLAN
--------------------------------------------------------------------------------
 Unique  (cost=2647377.45..2709467.70 rows=1 width=7)
   ->  Sort  (cost=2647377.45..2678422.58 rows=12418051 width=7)
         Sort Key: lastname
         ->  Seq Scan on testtest  (cost=0.00..370082.51 rows=12418051 width=7)
(4 rows)



Isn't it possible to perform the same operation using a HashAggregate?

We have seen that a GROUP BY workaround is usually a lot faster than Sort -> Unique - at least when work_mem is large enough.
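
The GROUP BY workaround can be sketched as follows. This is only an illustration: the work_mem value (given in kB) is an assumed number, not a recommendation, and whether the planner actually chooses a HashAggregate depends on its estimate of the number of groups and on the memory available.

```sql
-- Illustrative setting only; the value (in kB) is an assumption.
SET work_mem = 262144;

-- Returns the same set of lastname values as SELECT DISTINCT lastname,
-- but written as a GROUP BY, which the planner may execute with a
-- HashAggregate instead of Sort -> Unique.
EXPLAIN SELECT lastname FROM import.testtest GROUP BY lastname;
```

Comparing this plan with the one above shows which strategy was chosen for each form of the query.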


        best regards,

                hans


--
Cybertec Geschwinde & Schönig GmbH
Schöngrabern 134; A-2020 Hollabrunn
Tel: +43/1/205 10 35 / 340
www.postgresql.at, www.cybertec.at
