
Re: [PATCHES] WIP: avoiding tuple construction/deconstruction overhead

2005-03-18 Thread a_ogawa


Tom Lane wrote:
> a_ogawa [EMAIL PROTECTED] writes:
> > (1)We can improve compare_heap() by using TableTupleSlot instead of
> > HeapTuple. Please see attached patch.
>
> Did you measure any performance improvement from that?  I considered it
> but thought it would likely be a wash or a loss, because in most cases
> only one attribute will be pulled from a tuple during comparetup_heap.
> slot_getattr cannot improve on heap_getattr in that case, and is quite
> likely to be slower.

I measured the performance of heap_getattr and slot_getattr in
comparetup_heap.

I created a table with ten varchar attributes and loaded test data
into it.  (The attached file includes the SQL that does this.)
I then ran the following tests.

(case 1)
 test1: select * from sort_test order by v1 limit 100;
 test2: select * from sort_test order by v1, v2 limit 100;
 test3: select * from sort_test order by v1, v2, v3 limit 100;
 test4: select * from sort_test order by v1, v2, v3, v4 limit 100;
 test5: select * from sort_test order by v1, v2, v3, v4, v5 limit 100;

 result:       test1    test2    test3    test4    test5
 -------------------------------------------------------
 heap_getattr  2.149s   2.602s   3.204s   3.830s   4.159s
 slot_getattr  2.523s   3.422s   3.977s   4.453s   4.721s

(case 2)
 test1: select * from sort_test order by v10 limit 100;
 test2: select * from sort_test order by v10, v9 limit 100;
 test3: select * from sort_test order by v10, v9, v8 limit 100;
 test4: select * from sort_test order by v10, v9, v8, v7 limit 100;
 test5: select * from sort_test order by v10, v9, v8, v7, v6 limit 100;

 result:       test1    test2    test3    test4    test5
 -------------------------------------------------------
 heap_getattr  3.654s   5.549s   6.575s   7.367s   7.870s
 slot_getattr  4.027s   4.930s   5.249s   5.555s   5.756s

(case 3)
 test1: select * from sort_test order by v5 limit 100;
 test2: select * from sort_test order by v5, v6 limit 100;
 test3: select * from sort_test order by v5, v6, v7 limit 100;
 test4: select * from sort_test order by v5, v6, v7, v8 limit 100;
 test5: select * from sort_test order by v5, v6, v7, v8, v9 limit 100;

 result:       test1    test2    test3    test4    test5
 -------------------------------------------------------
 heap_getattr  2.657s   4.207s   5.194s   6.179s   6.662s
 slot_getattr  3.126s   4.233s   4.806s   5.271s   5.557s

In most cases, heap_getattr is faster.
slot_getattr is faster when the following conditions hold:
 (1)The tuple has varlen attributes.
 (2)The sort key has more than two attributes.
 (3)The sort key columns are far from the head of the tuple.
 (4)The sort key data contains many repeated values.
In practice, it will be rare for these conditions to occur.

Judging from these results, I think that we had better continue using
heap_getattr in comparetup_heap.
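
The four conditions above can be illustrated with a toy cost model. This is a minimal sketch and not PostgreSQL code: ToyTuple, ToySlot and every toy_* function are hypothetical names invented here. It assumes that every attribute is variable-length, so attribute k can only be found by stepping over attributes 1..k-1, and it counts how many attribute headers are visited when the sort keys of one tuple are fetched with and without cached offsets.

```c
#include <assert.h>
#include <stddef.h>

#define TOY_NATTS 10

/* Toy tuple (hypothetical layout): each attribute is a 1-byte length
 * followed by that many payload bytes. */
typedef struct {
    unsigned char data[TOY_NATTS * 16];
} ToyTuple;

/* Toy slot: remembers the offsets of the first nvalid attributes. */
typedef struct {
    const ToyTuple *tuple;
    int nvalid;
    size_t offsets[TOY_NATTS];
} ToySlot;

/* Fill a tuple with TOY_NATTS attributes, each with payload length len. */
static void toy_fill(ToyTuple *tup, unsigned char len)
{
    size_t off = 0;
    assert(len < 16);
    for (int i = 0; i < TOY_NATTS; i++) {
        tup->data[off] = len;
        for (int j = 0; j < len; j++)
            tup->data[off + 1 + j] = (unsigned char) ('a' + i);
        off += 1 + (size_t) len;
    }
}

/* Uncached lookup: walk from the head of the tuple on every call. */
static size_t toy_walk_getattr(const ToyTuple *tup, int attnum, long *visited)
{
    size_t off = 0;
    assert(attnum >= 1 && attnum <= TOY_NATTS);
    for (int i = 1; i < attnum; i++) {
        off += 1 + (size_t) tup->data[off];   /* step over attribute i */
        (*visited)++;
    }
    (*visited)++;                             /* the attribute itself */
    return off;
}

/* Cached lookup: decode only attributes not seen before, then reuse. */
static size_t toy_slot_getattr(ToySlot *slot, int attnum, long *visited)
{
    assert(attnum >= 1 && attnum <= TOY_NATTS);
    while (slot->nvalid < attnum) {
        size_t off = 0;
        if (slot->nvalid > 0) {
            size_t prev = slot->offsets[slot->nvalid - 1];
            off = prev + 1 + (size_t) slot->tuple->data[prev];
        }
        slot->offsets[slot->nvalid++] = off;
        (*visited)++;
    }
    return slot->offsets[attnum - 1];
}

/* Headers visited when fetching all sort keys of one tuple, uncached. */
static long toy_cost_walk(const int *keys, int nkeys)
{
    ToyTuple tup;
    long visited = 0;
    toy_fill(&tup, 5);
    for (int i = 0; i < nkeys; i++)
        (void) toy_walk_getattr(&tup, keys[i], &visited);
    return visited;
}

/* Same, but through a slot that caches offsets between fetches. */
static long toy_cost_slot(const int *keys, int nkeys)
{
    ToyTuple tup;
    ToySlot slot = { &tup, 0, {0} };
    long visited = 0;
    toy_fill(&tup, 5);
    for (int i = 0; i < nkeys; i++)
        (void) toy_slot_getattr(&slot, keys[i], &visited);
    return visited;
}
```

In this model, fetching the keys v10, v9, v8 of one tuple visits 27 attribute headers with the plain walk and 10 through the slot, while a single key on v1 visits 1 header either way. The model leaves out the fixed cost of setting up the slot, which the test1 columns above suggest is not negligible.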

regards,

--- Atsushi Ogawa

make_test_data.sql
Description: Binary data

---(end of broadcast)---
TIP 4: Don't 'kill -9' the postmaster

Re: [PATCHES] WIP: avoiding tuple construction/deconstruction overhead

2005-03-18 Thread Bruce Momjian


I am very excited there has been so much reduction in tuple processing
overhead in the past few weeks.  This has always been an area I thought
needed improvement, and it's great to see it.

We will certainly have some big performance improvements in 8.1 because
we already have several (e.g. SMP) and we have many more months to go.

-- 
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073


Re: [PATCHES] WIP: avoiding tuple construction/deconstruction overhead

2005-03-17 Thread a_ogawa


Tom Lane wrote:
> Attached is the current state of a patch to reduce the overhead of
> passing tuple data up through many levels of plan nodes.

It is a good idea.
I think that this patch improves the performance of the whole executor.

I have three comments.

(1)We can improve compare_heap() by using TableTupleSlot instead of
HeapTuple. Please see attached patch.

(2)In ExecStoreTuple(), we can omit the initialization of slot->tts_nvalid.
If slot->tts_isempty is false, tts_nvalid is initialized by
ExecClearTuple(). If it is true, tts_nvalid is always zero.

(3)There is a description of slot->val in a comment in execTuple.c.
It should be revised.
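
The reasoning in comment (2) can be sketched as a small invariant check. This is a simplified model and not the actual executor code: ModelSlot and the model_* functions are hypothetical stand-ins that keep only the two fields discussed above, and the behavior of the store and clear steps is assumed from the description in comment (2).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a tuple slot; only the fields under discussion. */
typedef struct {
    bool        tts_isempty;   /* true when no tuple is stored */
    int         tts_nvalid;    /* number of attributes already extracted */
    const void *tuple;
} ModelSlot;

/* A new slot starts empty with nothing extracted. */
static void model_init(ModelSlot *slot)
{
    slot->tts_isempty = true;
    slot->tts_nvalid = 0;
    slot->tuple = NULL;
}

/* Clearing a slot resets tts_nvalid, as comment (2) describes. */
static void model_clear(ModelSlot *slot)
{
    slot->tuple = NULL;
    slot->tts_isempty = true;
    slot->tts_nvalid = 0;
}

/* Storing a tuple: a non-empty slot is cleared first, so in both branches
 * tts_nvalid is already zero here and no extra assignment is needed. */
static void model_store(ModelSlot *slot, const void *tuple)
{
    if (!slot->tts_isempty)
        model_clear(slot);
    assert(slot->tts_nvalid == 0);   /* the invariant from comment (2) */
    slot->tuple = tuple;
    slot->tts_isempty = false;
}

/* Extracting attributes raises tts_nvalid on a non-empty slot only. */
static void model_extract(ModelSlot *slot, int natts)
{
    assert(!slot->tts_isempty);
    if (natts > slot->tts_nvalid)
        slot->tts_nvalid = natts;
}
```

The assertion in model_store holds on every path because tts_nvalid only becomes nonzero while the slot is non-empty, and every path back to the empty state goes through model_clear.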

> Finally, I have made some progress towards making the tuple access
> routines consistently use "bool isNull" arrays as null markers, instead
> of the char 'n' or ' ' convention that was previously used in some but
> not all contexts.

I agree. I think that this change improves readability.
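
To make the two null-marker conventions mentioned above concrete, here is a minimal sketch. The function names are hypothetical and are not taken from the patch; the only detail assumed from the quoted text is that the older convention uses the char 'n' for NULL and ' ' for non-NULL, while the newer one uses a bool array.

```c
#include <assert.h>
#include <stdbool.h>

/* Older convention: one char per attribute, 'n' = NULL, ' ' = not NULL. */
static int count_nulls_char(const char *nulls, int natts)
{
    int count = 0;
    for (int i = 0; i < natts; i++)
        if (nulls[i] == 'n')
            count++;
    return count;
}

/* Newer convention: one bool per attribute, true = NULL. */
static int count_nulls_bool(const bool *isnull, int natts)
{
    int count = 0;
    for (int i = 0; i < natts; i++)
        if (isnull[i])
            count++;
    return count;
}

/* Translate the char markers into a bool array of the same length. */
static void nulls_char_to_bool(const char *nulls, bool *isnull, int natts)
{
    for (int i = 0; i < natts; i++)
        isnull[i] = (nulls[i] == 'n');
}
```

With the bool form the test for NULL is simply the array element itself, with no magic character to remember at each call site.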

regards,

--- Atsushi Ogawa

compare_heap.patch
Description: Binary data


Re: [PATCHES] WIP: avoiding tuple construction/deconstruction overhead

2005-03-17 Thread Tom Lane

a_ogawa [EMAIL PROTECTED] writes:
> (1)We can improve compare_heap() by using TableTupleSlot instead of
> HeapTuple. Please see attached patch.

Did you measure any performance improvement from that?  I considered it
but thought it would likely be a wash or a loss, because in most cases
only one attribute will be pulled from a tuple during comparetup_heap.
slot_getattr cannot improve on heap_getattr in that case, and is quite
likely to be slower.

> (3)There is a description of slot->val in a comment in execTuple.c.
> It should be revised.

Drat, how'd I miss that?  Thanks.

regards, tom lane

