Re: [GENERAL] which Update quicker

Steve Crawford Tue, 23 Sep 2014 14:06:03 -0700

On 09/23/2014 12:35 PM, Emi Lu wrote:

Hello list,

For a big table with more than 1,000,000 records, may I know which update is quicker please?


(1) update t1
       set c1 = a.c1
      from a
     where pk and
           t1.c1 <> a.c1;
    ......
    update t1
       set c_N = a.c_N
      from a
     where pk and
           t1.c_N <> a.c_N;


(2) update t1
       set c1  = a.c1,
           c2  = a.c2,
           ...
           c_N = a.c_N
      from a
     where pk AND
           (t1.c1 <> a.c1 OR t1.c2 <> a.c2 ..... t1.c_N <> a.c_N)


....

We don't have any info about table structures, index availability and usage for query optimization, whether or not the updated columns are part of an index, amount of memory available, disk speed, portion of t1 that will be updated, PostgreSQL settings, etc., so it's really anyone's guess. A million rows is pretty modest, so I was able to try a couple of variants of "update...from..." on million-row tables on my aging desktop without coming close to the 60-second mark.
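Since so much depends on the actual tables, one way to replace guessing with numbers is to time each variant on a copy of the data. The following is only a sketch: the key column name pk is hypothetical, because the original post does not show the real join condition.

```sql
-- Sketch only: assumes t1 and a share a key column named pk
-- (hypothetical; the original post elides the join condition).
BEGIN;

EXPLAIN (ANALYZE, BUFFERS)
UPDATE t1
   SET c1 = a.c1
  FROM a
 WHERE t1.pk = a.pk
   AND t1.c1 <> a.c1;

-- EXPLAIN ANALYZE really executes the update, so undo it
-- before timing the next variant against the same data.
ROLLBACK;
```

Repeating this for each statement of interest shows the plan chosen, the actual run time, and the buffer activity for that specific schema and hardware.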

*Usually* putting statements into a single transaction is better (as would happen automatically in case 2). Also, to the extent that a given tuple would have multiple columns updated, you will have less bloat and I/O using the query that updates the tuple once rather than multiple times. But a lot will depend on the efficiency of looking up the appropriate data in "a."
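To make the two points above concrete, here is a rough sketch of both approaches spelled out for two columns. The key column pk is an assumption (the original post does not show the join condition), and c1 and c2 merely stand in for c1 through c_N. The two variants are alternatives, not steps to run one after the other.

```sql
-- Sketch only: pk is a hypothetical key column shared by t1 and a,
-- and two columns stand in for c1 .. c_N.

-- Variant (2): a single statement, so it is a single transaction,
-- and each matching row is rewritten at most once.
UPDATE t1
   SET c1 = a.c1,
       c2 = a.c2
  FROM a
 WHERE t1.pk = a.pk
   AND (t1.c1 <> a.c1 OR t1.c2 <> a.c2);

-- Variant (1), wrapped in one explicit transaction: a row that
-- differs in both columns is rewritten twice, which leaves an
-- extra dead row version behind.
BEGIN;
UPDATE t1 SET c1 = a.c1 FROM a WHERE t1.pk = a.pk AND t1.c1 <> a.c1;
UPDATE t1 SET c2 = a.c2 FROM a WHERE t1.pk = a.pk AND t1.c2 <> a.c2;
COMMIT;
```

One caveat if the columns are nullable: `<>` never evaluates to true when either side is NULL, so such rows would be skipped. `IS DISTINCT FROM` is the NULL-safe comparison in PostgreSQL.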


Cheers,
Steve
