Yves Vindevogel wrote:
>>> So, I must use a function that will check against u1 and u2, and then
>>> insert if it is ok.
>>> I know that such a function is way slower than my insert query.

So - you have a table, called something like "upload" with 20,000 rows and you'd like to know whether it is safe to insert them. Well, it's easy enough to identify which ones are duplicates.

SELECT * FROM upload JOIN main_table ON u1=f1 AND u2=f2 AND u3=f3;
SELECT * FROM upload JOIN main_table ON u1=f1 AND u2=f2 AND u3=f4;
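To make the idea concrete, here is a minimal sketch of the first duplicate-finding join, using SQLite from Python as a stand-in for PostgreSQL. The schemas are assumptions (the thread never shows them): `main_table(f1, f2, f3, f4)` with unique indexes on (f1, f2, f3) and (f1, f2, f4), and a staging table `upload(u1, u2, u3, u4)`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical schemas mirroring the thread: two unique indexes on
# main_table, and an "upload" staging table with matching columns.
cur.execute("CREATE TABLE main_table (f1 INT, f2 INT, f3 INT, f4 INT)")
cur.execute("CREATE UNIQUE INDEX idx1 ON main_table (f1, f2, f3)")
cur.execute("CREATE UNIQUE INDEX idx2 ON main_table (f1, f2, f4)")
cur.execute("CREATE TABLE upload (u1 INT, u2 INT, u3 INT, u4 INT)")

cur.execute("INSERT INTO main_table VALUES (1, 1, 1, 10), (2, 2, 2, 20)")
# The second upload row collides with main_table's (f1, f2, f3) = (1, 1, 1).
cur.execute("INSERT INTO upload VALUES (3, 3, 3, 30), (1, 1, 1, 99)")

# The inner join returns exactly the upload rows that would violate idx1.
dupes = cur.execute(
    "SELECT u1, u2, u3, u4 FROM upload "
    "JOIN main_table ON u1 = f1 AND u2 = f2 AND u3 = f3"
).fetchall()
print(dupes)  # [(1, 1, 1, 99)]
```

The same query with f4 in place of f3 finds the rows that would violate the second index.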

> That is a good idea. I can delete the ones that would fail my first unique
> index this way, then delete the ones that would fail my second unique index,
> and then upload them.
> Hmm, why did I not think of that myself?
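That delete-then-insert plan can be sketched end to end; again this is a hypothetical example using SQLite from Python (SQLite has no `DELETE ... USING`, so the join condition goes into a `WHERE EXISTS`), with the thread's table and column names assumed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical schemas for the thread's tables (column names assumed).
cur.execute("CREATE TABLE main_table (f1 INT, f2 INT, f3 INT, f4 INT)")
cur.execute("CREATE TABLE upload (u1 INT, u2 INT, u3 INT, u4 INT)")
cur.execute("INSERT INTO main_table VALUES (1, 1, 1, 10)")
cur.executemany("INSERT INTO upload VALUES (?, ?, ?, ?)",
                [(1, 1, 1, 99), (5, 5, 5, 5)])

# Delete the upload rows that would collide on (f1, f2, f3); a second
# DELETE with f4 in place of f3 would handle the other unique index.
cur.execute(
    "DELETE FROM upload WHERE EXISTS ("
    "  SELECT 1 FROM main_table"
    "  WHERE u1 = f1 AND u2 = f2 AND u3 = f3)"
)

# Whatever survives the deletes is safe to insert in one go.
cur.execute("INSERT INTO main_table SELECT * FROM upload")
rows = cur.execute(
    "SELECT f1, f2, f3, f4 FROM main_table ORDER BY f1"
).fetchall()
print(rows)  # [(1, 1, 1, 10), (5, 5, 5, 5)]
```

Both deletes are set operations over the whole staging table, which is what makes this so much faster than checking each record one by one.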

I've spent a lot of time moving data from one system to another, usually having to clean it in the process. At 9pm on a Friday, you decide that on the next job you'll find an efficient way to do it :-)

Are you saying that deleting these rows and then inserting takes too long?

> This goes very fast, but not with a function that checks each record one by one.

You could get away with one query if you converted them to left joins:
INSERT INTO ...
SELECT * FROM upload LEFT JOIN ... LEFT JOIN ...
WHERE f3 IS NULL AND f4 IS NULL

Note that the two anti-join conditions have to hold together: a UNION of the two left-join SELECTs wouldn't be safe, because a row that violates only one of the indexes would pass the other check and still abort the insert. The double join might also turn out to be slower than two separate queries.
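A minimal sketch of the single-query version, again using SQLite from Python with assumed schemas. It keeps only the upload rows that match neither unique index, since a row violating either index would abort the whole insert.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical schemas based on the thread: two unique indexes on
# main_table, upload is the staging table to be merged in.
cur.execute("CREATE TABLE main_table (f1 INT, f2 INT, f3 INT, f4 INT)")
cur.execute("CREATE UNIQUE INDEX idx1 ON main_table (f1, f2, f3)")
cur.execute("CREATE UNIQUE INDEX idx2 ON main_table (f1, f2, f4)")
cur.execute("CREATE TABLE upload (u1 INT, u2 INT, u3 INT, u4 INT)")

cur.execute("INSERT INTO main_table VALUES (1, 1, 1, 10)")
cur.executemany(
    "INSERT INTO upload VALUES (?, ?, ?, ?)",
    [
        (1, 1, 1, 99),  # collides with idx1 (f1, f2, f3)
        (1, 1, 7, 10),  # collides with idx2 (f1, f2, f4)
        (5, 5, 5, 5),   # safe under both indexes
    ],
)

# One INSERT ... SELECT with an anti-join against each unique index;
# both IS NULL conditions must hold for a row to be inserted.
cur.execute(
    "INSERT INTO main_table "
    "SELECT u1, u2, u3, u4 FROM upload "
    "LEFT JOIN main_table m1 ON u1 = m1.f1 AND u2 = m1.f2 AND u3 = m1.f3 "
    "LEFT JOIN main_table m2 ON u1 = m2.f1 AND u2 = m2.f2 AND u4 = m2.f4 "
    "WHERE m1.f3 IS NULL AND m2.f4 IS NULL"
)
merged = cur.execute(
    "SELECT f1, f2, f3, f4 FROM main_table ORDER BY f1"
).fetchall()
print(merged)  # [(1, 1, 1, 10), (5, 5, 5, 5)]
```

Only the safe row makes it in; the two colliding rows are filtered out by their respective anti-joins.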

--
  Richard Huxton
  Archonet Ltd

