I really thought this would have caused some interest, since

- this item is in the TODO list
- the improvement for CLUSTER in some scenarios is 800%,
and maybe more (if I didn't do anything wrong, of course...)

Could at least the message:
http://archives.postgresql.org/pgsql-hackers/2010-02/msg00766.php
be added to the TODO page, under 
"Improve CLUSTER performance by sorting to reduce
random I/O" ?
It would be sad if the patch got lost... 


Leonardo


> Attached the updated patch (should solve a bug) and a script.
> The sql scripts generates a 2M rows table ("orig"); then the
> table is copied and the copy clustered using seq + sort (since 
> "set enable_seqscan=false;").
> Then the table "orig" is copied again, and the copy clustered
> using regular index scan (set enable_indexscan=true; set 
> enable_seqscan=false).
> Then the same thing is done on a 5M rows table, and on a 10M
> rows table.
> 
> On my system (Sol10 on a dual Opteron 2.8) single disc:
> 
> 
> 2M:  seq+sort 11secs; regular index scan: 33secs
> 5M:  seq+sort 39secs; regular index scan: 105secs
> 10M:seq+sort 83secs; regular index scan: 646secs
> 
> 
> Maybe someone could suggest a better/different test?
> 
> 
> Leonardo





-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Reply via email to