Alex Behm has uploaded a new patch set (#2).

Change subject: IMPALA-3845: Split up hdfs-parquet-scanner.cc into more files/components.
......................................................................

IMPALA-3845: Split up hdfs-parquet-scanner.cc into more files/components.

This patch refactors hdfs-parquet-scanner.cc into several files.
The new responsibilities of each file/component are roughly as follows:

hdfs-parquet-scanner.cc
- Creates column readers and uses them to materialize row batches.
- Evaluates runtime filters and conjuncts, and populates the row batch queue.

parquet-schema-resolver.h/cc
- Parses the schema of a Parquet file into our internal schema
  representation.
- Resolves SchemaPaths (e.g. from a table descriptor) against the
  internal representation of the Parquet file schema.

parquet-column-readers.h
- Contains the per-column data reading, parsing and value
  materialization logic.

Testing: A private core/hdfs run passed.

Change-Id: I4c5fd46f9c1a0ff2a4c30ea5a712fbae17c68f92
---
M be/src/exec/CMakeLists.txt
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-parquet-scanner.h
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-scanner.h
A be/src/exec/parquet-column-readers.h
A be/src/exec/parquet-schema-resolver.cc
A be/src/exec/parquet-schema-resolver.h
M be/src/util/debug-util.cc
M be/src/util/debug-util.h
10 files changed, 2,199 insertions(+), 2,067 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/96/3596/2
--
To view, visit http://gerrit.cloudera.org:8080/3596
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4c5fd46f9c1a0ff2a4c30ea5a712fbae17c68f92
Gerrit-PatchSet: 2
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Alex Behm <[email protected]>
