[ https://issues.apache.org/jira/browse/HIVE-13873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15561956#comment-15561956 ]
Hive QA commented on HIVE-13873:
--------------------------------
Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12832433/HIVE-13873.1.patch
{color:green}SUCCESS:{color} +1 due to 5 test(s) being added or modified.
{color:red}ERROR:{color} -1 due to 68 failed/errored test(s), 10668 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[constant_prop]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[groupby_nullvalues]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_map_emptynullvals]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_null_element]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_multi_field_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_optional_elements]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_required_elements]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_single_field_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_structs]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_array_of_unannotated_groups]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_avro_array_of_single_field_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_columnar]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_create]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ctas]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_decimal1]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_decimal]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_external_time]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_join2]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_join]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_map_null]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_map_of_arrays_of_ints]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_map_of_maps]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_mixed_case]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_nested_complex]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_nested_field_pruning]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_partitioned]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_boolean]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_char]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_date]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_decimal]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_partition]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_timestamp]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ppd_varchar]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_predicate_pushdown]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_read_backward_compatible_files]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_schema_evolution]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_table_with_subschema]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_thrift_array_of_single_field_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_type_promotion]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_types]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_write_correct_definition_levels]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[struct_in_view]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[udf_named_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[udf_struct]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vectorized_parquet]
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vectorized_parquet_types]
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vectorized_parquet]
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vectorized_parquet_types]
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2]
org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[parquet_join]
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testAmbiguousSingleFieldGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testAvroSingleFieldGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testHiveRequiredGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testMultiFieldGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testNewOptionalGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testNewRequiredGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testThriftSingleFieldGroupInList
org.apache.hadoop.hive.ql.io.parquet.TestArrayCompatibility.testUnannotatedListOfGroups
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testDoubleMapWithStructValue
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testMapWithComplexKey
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testNestedMap
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testStringMapOfOptionalArray
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testStringMapOfOptionalIntArray
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testStringMapOptionalPrimitive
org.apache.hadoop.hive.ql.io.parquet.TestMapStructures.testStringMapRequiredPrimitive
org.apache.hadoop.hive.ql.io.parquet.TestParquetRowGroupFilter.testRowGroupFilterTakeEffect
org.apache.hive.jdbc.TestJdbcWithMiniHS2.testAddJarConstructorUnCaching
{noformat}
Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/1454/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/1454/console
Test logs:
http://ec2-204-236-174-241.us-west-1.compute.amazonaws.com/logs/PreCommit-HIVE-Build-1454/
Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 68 tests failed
{noformat}
This message is automatically generated.
ATTACHMENT ID: 12832433 - PreCommit-HIVE-Build
> Column pruning for nested fields
> --------------------------------
>
> Key: HIVE-13873
> URL: https://issues.apache.org/jira/browse/HIVE-13873
> Project: Hive
> Issue Type: New Feature
> Components: Logical Optimizer
> Reporter: Xuefu Zhang
> Assignee: Ferdinand Xu
> Attachments: HIVE-13873.1.patch, HIVE-13873.patch,
> HIVE-13873.wip.patch
>
>
> Some columnar file formats, such as Parquet, also store the fields of a
> struct column by column, using the encoding described in the Google Dremel
> paper. In big data it is very common for data to be stored in structs while
> queries need only a subset of the fields in those structs. However, Hive
> currently reads the whole struct regardless of whether all of its fields are
> selected. Pruning unwanted sub-fields in structs or nested fields at file
> read time would therefore be a big performance boost for such scenarios.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)