Todd Lipcon created KUDU-2866:
---------------------------------
Summary: CFileSet::Iterator::FinishBatch takes a lot of CPU for selective wide table scans
Key: KUDU-2866
URL: https://issues.apache.org/jira/browse/KUDU-2866
Project: Kudu
Issue Type: Improvement
Components: perf, tablet
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Scanning a wide table with a predicate that never matches showed ~10% of CPU
time spent in CFileSet::Iterator::FinishBatch. Judging from the assembly, the
cost seems to be in iterating over the vector<bool> that records which columns
have been prepared. With a selective predicate, only one of the 200+ columns
was prepared, yet the iteration still covered every column and was quite slow.
Instead of using the bitmap, we can just keep a list of the prepared column
iterators.
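
A minimal sketch of the proposed approach, not the actual Kudu code: the
ColumnIter and BatchState types and their members are hypothetical stand-ins
for the real column iterators and CFileSet::Iterator state. The idea is that
FinishBatch visits only the iterators that were prepared, so its cost follows
the number of prepared columns rather than the table width.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a per-column iterator (not Kudu's real class).
struct ColumnIter {
  bool prepared = false;
  int finish_calls = 0;

  void Prepare() { prepared = true; }
  void FinishBatch() {
    prepared = false;
    ++finish_calls;
  }
};

// Hypothetical batch state: remembers prepared iterators in a list instead
// of scanning a vector<bool> with one entry per column.
class BatchState {
 public:
  explicit BatchState(std::size_t num_cols) : cols_(num_cols) {}

  // Lazily prepare column `idx`; record it at most once per batch.
  void PrepareColumn(std::size_t idx) {
    assert(idx < cols_.size());
    ColumnIter& col = cols_[idx];
    if (!col.prepared) {
      col.Prepare();
      prepared_.push_back(&col);
    }
  }

  // Finish only the prepared iterators. Returns how many were visited.
  std::size_t FinishBatch() {
    const std::size_t visited = prepared_.size();
    for (ColumnIter* col : prepared_) {
      col->FinishBatch();
    }
    prepared_.clear();
    return visited;
  }

  const ColumnIter& column(std::size_t idx) const { return cols_[idx]; }

 private:
  // Sized once in the constructor, so pointers into it stay valid.
  std::vector<ColumnIter> cols_;
  std::vector<ColumnIter*> prepared_;
};
```

In this sketch, a 200-column scan that prepares only the predicate column
touches a single iterator in FinishBatch instead of testing 200 bits.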