Riza Suminto has uploaded this change for review. ( http://gerrit.cloudera.org:8080/18220 )
Change subject: IMPALA-11107: Allow specifying footer size in IssueFooterRanges
......................................................................

IMPALA-11107: Allow specifying footer size in IssueFooterRanges

FOOTER_SIZE was a constant of 100KB in HdfsScanner::IssueFooterRanges,
an estimate based on the Parquet format. Other scanner subclasses, such
as HdfsOrcScanner, expect a much smaller footer of 16KB
(orc::DIRECTORY_SIZE_GUESS in the ORC lib).

This patch adds footer_size_estimate as a parameter so that different
file formats can ask for different footer range sizes. Having a more
precise initial range can also help reduce waste in the data cache.

Testing:
- Pass core tests.
- Manually verified that 'IoReadSkippedBytes' is reduced after this
  patch.

Change-Id: Ib0a5aab48324bf9e78fb0bb0bf6f649d87e89dfc
---
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-scanner.h
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M be/src/exec/parquet/hdfs-parquet-scanner.h
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
7 files changed, 39 insertions(+), 21 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/20/18220/1
--
To view, visit http://gerrit.cloudera.org:8080/18220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ib0a5aab48324bf9e78fb0bb0bf6f649d87e89dfc
Gerrit-Change-Number: 18220
Gerrit-PatchSet: 1
Gerrit-Owner: Riza Suminto <[email protected]>
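
For illustration only, here is a minimal C++ sketch of the idea in the commit message: the initial footer read covers the last footer_size_estimate bytes of the file, clamped to the file length. This is not the code from the patch. FooterRange, ComputeFooterRange, and the two constant names are hypothetical; only the 100KB and 16KB figures come from the message above, and the clamping behavior is an assumption.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical stand-in for a scan range; not Impala's actual type.
struct FooterRange {
  int64_t offset;  // byte offset where the footer read starts
  int64_t len;     // number of bytes to read
};

// Per-format estimates quoted in the commit message (names are hypothetical).
constexpr int64_t kParquetFooterSizeEstimate = 100 * 1024;  // old FOOTER_SIZE
constexpr int64_t kOrcFooterSizeEstimate = 16 * 1024;       // ORC's guess

// Returns the range covering the last footer_size_estimate bytes of the file.
// Assumption: the estimate is clamped so a small file is read in full.
inline FooterRange ComputeFooterRange(int64_t file_len,
                                      int64_t footer_size_estimate) {
  const int64_t len = std::min(file_len, footer_size_estimate);
  return FooterRange{file_len - len, len};
}
```

Under these assumptions, a 1MB ORC file gets an initial range of 16KB instead of 100KB, so fewer bytes are read and then skipped, which is the effect the 'IoReadSkippedBytes' check in the Testing section is looking for.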
