Csaba Ringhofer created IMPALA-13209:
----------------------------------------

             Summary: ExchangeNode's ConvertRowBatchTime can be high
                 Key: IMPALA-13209
                 URL: https://issues.apache.org/jira/browse/IMPALA-13209
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
            Reporter: Csaba Ringhofer


ConvertRowBatchTime can be surprisingly high - the only thing done during this 
timer is copying tuple pointers from one RowBatch to another.
https://github.com/apache/impala/blob/c53987480726b114e0c3537c71297df2834a4962/be/src/exec/exchange-node.cc#L217

{code}
set mt_dop=8;
select straight_join count(*) from tpcds_parquet.store_sales s1 join 
/*+broadcast*/ tpcds_parquet.store_sales16 s2 on s1.ss_customer_sk = 
s2.ss_customer_sk;

ConvertRowBatchTime dominates the busy exchange node's exec time in the profile:
           - ConvertRowBatchTime: 640.072ms
           - InactiveTotalTime: 243.783ms
           - PeakMemoryUsage: 12.53 MB (13142368)
           - RowsReturned: 46.09M (46086464)
           - RowsReturnedRate: 46.93 M/sec
           - TotalTime: 981.968ms
{code}




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to