Csaba Ringhofer created IMPALA-13209:
----------------------------------------
Summary: ExchangeNode's ConvertRowBatchTime can be high
Key: IMPALA-13209
URL: https://issues.apache.org/jira/browse/IMPALA-13209
Project: IMPALA
Issue Type: Improvement
Components: Backend
Reporter: Csaba Ringhofer
ConvertRowBatchTime can be surprisingly high - the only thing done during this
timer is copying tuple pointers from one RowBatch to another.
https://github.com/apache/impala/blob/c53987480726b114e0c3537c71297df2834a4962/be/src/exec/exchange-node.cc#L217
{code}
set mt_dop=8;
select straight_join count(*) from tpcds_parquet.store_sales s1 join
/*+broadcast*/ tpcds_parquet.store_sales16 s2 on s1.ss_customer_sk =
s2.ss_customer_sk;
ConvertRowBatchTime dominates the busy exchange node's exec time in the profile:
- ConvertRowBatchTime: 640.072ms
- InactiveTotalTime: 243.783ms
- PeakMemoryUsage: 12.53 MB (13142368)
- RowsReturned: 46.09M (46086464)
- RowsReturnedRate: 46.93 M/sec
- TotalTime: 981.968ms
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]