Pooja Nilangekar has posted comments on this change. ( http://gerrit.cloudera.org:8080/11517 )
Change subject: [WIP] IMPALA-6932: Speed up scans for sequence datasets with many files
......................................................................

Patch Set 3:

This can't be tested on HDFS since there are no "remote" blocks in the minicluster, so all the scan ranges of a file are added to the appropriate local disk queue once the header is processed.

> Patch Set 3: -Code-Review
>
> Adding a targeted test that uses some profile counters seems like a great
> idea. Maybe it's possible to do this already on HDFS with the right query and
> options. E.g. run a query with limit=1 with num_nodes=1 and either
> num_scanner_threads=1 or mt_dop=1 and confirm that only one file is opened
> from the profile.

-- 
To view, visit http://gerrit.cloudera.org:8080/11517
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I211e2511ea3bb5edea29f1bd63e6b1fa4c4b1965
Gerrit-Change-Number: 11517
Gerrit-PatchSet: 3
Gerrit-Owner: Pooja Nilangekar <pooja.nilange...@cloudera.com>
Gerrit-Reviewer: Bikramjeet Vig <bikramjeet....@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Pooja Nilangekar <pooja.nilange...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Comment-Date: Tue, 30 Oct 2018 00:09:16 +0000
Gerrit-HasComments: No