[ https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13801529#comment-13801529 ]
Lars Hofhansl commented on HBASE-9769: -------------------------------------- We should test end-to-end, not a microbenchmark of StoreScanner. Note that you cannot exercise the seeking code in checkVersion without returning data to the client, in which case network IO will dominate. If you filter KVs out with a filter before that checkVersion is never called, if the filter returns INCLUDE it'll call checkVersion and incur a seek. Only with a coprocessor would it be possible to exercise checkVersion and avoid the network IO. Also note that in your filter case you'd still get the SEEK_NEXT_ROW/SEEK_NEXT_COL in ScanWildcardColumnTracker.checkVersion for each column that you included. When you get a chance, could you check out the last patch on HBASE-9778? Maybe you could run it through your micro StoreScanner test, I'd be curious how it compares. Generally, if that the column tracker code is not efficient we should fix that rather than circumventing it completely with a filter. > Improve performance of a Scanner with explicit column list when rows are > small/medium size > ------------------------------------------------------------------------------------------ > > Key: HBASE-9769 > URL: https://issues.apache.org/jira/browse/HBASE-9769 > Project: HBase > Issue Type: Improvement > Components: Scanners > Affects Versions: 0.98.0, 0.94.12, 0.96.0 > Reporter: Vladimir Rodionov > Assignee: Vladimir Rodionov > Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, > 9769-0.94-sample.txt, 9769-94.txt, 9769-94-v2.txt, 9769-trunk-v1.txt, > 9769-trunk-v2.txt, 9769-trunk-v3.txt, 9769-trunk-v4.txt > > -- This message was sent by Atlassian JIRA (v6.1#6144)