Quanlong Huang created IMPALA-11185:
---------------------------------------

             Summary: Reuse orc::ColumnVectorBatch in the scanner life-cycle
                 Key: IMPALA-11185
                 URL: https://issues.apache.org/jira/browse/IMPALA-11185
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
            Reporter: Quanlong Huang
            Assignee: Quanlong Huang


In HdfsOrcScanner::AssembleRows(), we always re-create a 
orc::ColumnVectorBatch. The ideal pattern is reusing the batch and only 
destroyed it when the scanner is closed.

In the flame graph of TPC-H Q1 collected by [~drorke] , the createRowBatch and 
destructors occupies almost half of the scanner time.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to