Quanlong Huang created IMPALA-11185:
---------------------------------------
Summary: Reuse orc::ColumnVectorBatch in the scanner life-cycle
Key: IMPALA-11185
URL: https://issues.apache.org/jira/browse/IMPALA-11185
Project: IMPALA
Issue Type: Improvement
Components: Backend
Reporter: Quanlong Huang
Assignee: Quanlong Huang
In HdfsOrcScanner::AssembleRows(), we always re-create a
orc::ColumnVectorBatch. The ideal pattern is reusing the batch and only
destroyed it when the scanner is closed.
In the flame graph of TPC-H Q1 collected by [~drorke] , the createRowBatch and
destructors occupies almost half of the scanner time.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)