Tim Armstrong has uploaded a new patch set (#5). Change subject: IMPALA-3729: batch_size=1 coverage for avro scanner ......................................................................
IMPALA-3729: batch_size=1 coverage for avro scanner Also fix a stale comment in the avro scanner header. The main work here is to fix the handling of empty result sets in the test result verifier. This is a problem because we wanted to verify that the results in the test file were a superset of the rows returned, and this was thrown off by superflous '' rows in the expected and actual result sets. The basic problem is that the way test file sections was parsed conflated an empty result section with non-empty result section that had a single empty string. I.e.: ---- RESULTS ==== vs ---- RESULTS ==== both got resolved to ['']. Change-Id: Ia007e558d92c7e4ce30be90446fdbb1f50a0ebc4 --- M be/src/exec/hdfs-avro-scanner.h M bin/load-data.py M testdata/bin/generate-schema-statements.py M testdata/workloads/functional-query/queries/DataErrorsTest/avro-errors.test M testdata/workloads/functional-query/queries/QueryTest/load.test M testdata/workloads/functional-query/queries/QueryTest/test-unmatched-schema.test M tests/beeswax/impala_beeswax.py M tests/common/test_result_verifier.py M tests/data_errors/test_data_errors.py M tests/unittests/test_file_parser.py M tests/util/test_file_parser.py 11 files changed, 85 insertions(+), 42 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/13/3413/5 -- To view, visit http://gerrit.cloudera.org:8080/3413 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: newpatchset Gerrit-Change-Id: Ia007e558d92c7e4ce30be90446fdbb1f50a0ebc4 Gerrit-PatchSet: 5 Gerrit-Project: Impala Gerrit-Branch: cdh5-trunk Gerrit-Owner: Tim Armstrong <[email protected]> Gerrit-Reviewer: Taras Bobrovytsky <[email protected]> Gerrit-Reviewer: Tim Armstrong <[email protected]>
