Re: [PATCHES] A little COPY speedup

Andrew Dunstan Thu, 01 Mar 2007 09:29:56 -0800

Heikki Linnakangas wrote:

One complaint we've heard from clients trying out EDB or PostgreSQL is that loading data is slower than on other DBMSs.
I ran oprofile on a COPY FROM to get an overview of where the CPU time is spent. To my amazement, the function at the top of the list was PageAddItem with 16% of samples.
On every row, PageAddItem will scan all the line pointers on the target page, just to see that they're all in use, and create a new line pointer. That adds up, especially with narrow tuples like what I used in the test.
Attached is a fix for that. It adds a flag to each heap page that indicates that "there isn't any free line pointers on this page, so don't bother trying". Heap pages haven't had any heap-specific per-page data before, so this patch adds a HeapPageOpaqueData-struct that's stored in the special space.
My simple test case of a COPY FROM of 10000000 tuples took 19.6 s without the patch, and 17.7 s with the patch applied. Your mileage may vary.
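
For readers without the patch at hand, the idea described above can be sketched with a toy model. This is not the patch's code and not PostgreSQL's real page layout: ToyPage, toy_page_add_item, toy_page_delete_item, MAX_LINE_POINTERS and the slots_scanned counter are all hypothetical names made up for illustration. The sketch only shows why a per-page "no free line pointers" hint removes the per-row scan when a page is being filled by appends.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_LINE_POINTERS 256   /* hypothetical capacity, not PostgreSQL's limit */

/* Toy stand-in for a heap page; the real page header looks different. */
typedef struct ToyPage {
    bool   used[MAX_LINE_POINTERS]; /* true if the line pointer is in use */
    size_t nline;                   /* line pointers allocated so far */
    bool   no_free_lp;              /* hint: no unused pointer below nline */
    size_t slots_scanned;           /* instrumentation: pointers inspected */
} ToyPage;

/*
 * Add an item and return its slot, or -1 if the page is full.
 * With use_hint set, the scan for a reusable line pointer is skipped
 * whenever the page is already known to have none.
 */
static int toy_page_add_item(ToyPage *page, bool use_hint)
{
    assert(page != NULL);

    if (!(use_hint && page->no_free_lp)) {
        for (size_t i = 0; i < page->nline; i++) {
            page->slots_scanned++;
            if (!page->used[i]) {
                page->used[i] = true;   /* reuse a freed line pointer */
                return (int) i;
            }
        }
        page->no_free_lp = true;        /* scan found nothing: remember that */
    }

    if (page->nline >= MAX_LINE_POINTERS)
        return -1;

    page->used[page->nline] = true;     /* create a new line pointer */
    return (int) page->nline++;
}

/* Freeing a slot invalidates the hint, so the next add scans again. */
static void toy_page_delete_item(ToyPage *page, size_t slot)
{
    assert(page != NULL && slot < page->nline);
    page->used[slot] = false;
    page->no_free_lp = false;
}
```

In this model, filling a page with n items without the hint inspects 0 + 1 + ... + (n - 1) line pointers in total, while with the hint it inspects none until something is deleted. How the real flag is set and cleared is up to the attached patch; the delete path here is only an assumption needed to keep the toy consistent.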

What is the speedup with less narrow tuples? 10% improvement is good but not stellar.


cheers

andrew
