Huaisi Xu has submitted this change and it was merged.

Change subject: CDH-41243: Parquet scanner regression on wide tables
......................................................................

CDH-41243: Parquet scanner regression on wide tables

IMPALA-2473 introduced a check that prevents row batches from growing
beyond 8MB, but it has a corner case: when an empty row batch is already
larger than 8MB, the scanner returns the batch immediately after
materializing one row, essentially setting batch_size=1.

Revert "IMPALA-2473: reduce scanner memory usage"

This reverts commit 1635c0a8738daef1b283cb457fbd3bca227aa0b1.

Change-Id: If6728ed8facd305682d7dfd58f1210fa294bb232
Reviewed-on: http://gerrit.cloudera.org:8080/3484
Reviewed-by: Huaisi Xu <[email protected]>
Tested-by: Huaisi Xu <[email protected]>
---
M be/src/exec/data-source-scan-node.cc
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/runtime/row-batch.h
M testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch.test
7 files changed, 26 insertions(+), 81 deletions(-)

Approvals:
  Huaisi Xu: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/3484
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: If6728ed8facd305682d7dfd58f1210fa294bb232
Gerrit-PatchSet: 2
Gerrit-Project: Impala
Gerrit-Branch: cdh5-2.3.0_5.5.x
Gerrit-Owner: Huaisi Xu <[email protected]>
Gerrit-Reviewer: Huaisi Xu <[email protected]>
Gerrit-Reviewer: Tim Armstrong <[email protected]>
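
Illustrative note on the corner case named in the commit message: the sketch below is a toy model, not Impala's actual RowBatch code. The names ToyRowBatch, ReservedBytes, AtCapacity and RowsPerBatch are hypothetical, and it assumes the fixed-length memory for every row of a batch is reserved up front, so a wide-table batch can exceed the 8MB limit before it holds any rows.

```cpp
#include <cassert>
#include <cstdint>

// The 8MB limit mentioned in the commit message.
constexpr int64_t kMaxBatchBytes = 8 * 1024 * 1024;

// Hypothetical, simplified stand-in for a row batch.
struct ToyRowBatch {
  int capacity_rows;   // configured batch_size
  int64_t row_bytes;   // fixed-length bytes per row (large for wide tables)
  int num_rows;        // rows materialized so far

  // Assumption: space for all capacity_rows rows is reserved up front,
  // so even an empty batch accounts for capacity_rows * row_bytes.
  int64_t ReservedBytes() const { return capacity_rows * row_bytes; }

  // Size check in the spirit of the reverted change: the batch is handed
  // off once it is full, or once it holds a row and is over the limit.
  bool AtCapacity() const {
    return num_rows == capacity_rows ||
           (num_rows > 0 && ReservedBytes() >= kMaxBatchBytes);
  }
};

// Returns how many rows are materialized before the batch is returned.
int RowsPerBatch(int capacity_rows, int64_t row_bytes) {
  assert(capacity_rows > 0 && row_bytes > 0);
  ToyRowBatch batch{capacity_rows, row_bytes, 0};
  while (!batch.AtCapacity()) ++batch.num_rows;
  return batch.num_rows;
}
```

Under these assumptions, RowsPerBatch(1024, 100) fills the batch to 1024 rows, while RowsPerBatch(1024, 16 * 1024) returns after a single row because the empty batch already reserves 16MB. That is the effective batch_size=1 behavior the revert removes.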
