[ https://issues.apache.org/jira/browse/HIVE-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139766#comment-15139766 ]
Matt McCline commented on HIVE-12878:
-------------------------------------
I just went through the 1253 test failures to filter out the expected
differences ("Execution mode: vectorized", statistics differences, etc.).
Here are the tests with wrong query results and the tests that failed. A rather stunning number.
{code}
TestCliDriver
o Wrong Results:
• add_part_multiple
• alter_partition_coltype
• alter_varchar2
• analyze_tbl_part
• auto_join18
• auto_join18_multi_distinct
• avro_schema_evolution_native
• avro_timestamp
• bucket_groupby
• cbo_const
• cbo_rp_lineage2
• cbo_rp_union
• cbo_rp_views
• cbo_rp_windowing
• cbo_union
• cbo_views
• cbo_windowing
• complex_alias
• constprog_type
• correlationoptimizer14
• correlationoptimizer2
• correlationoptimizer8
• ctas_colname
• custom_input_output_format
• date_1
• date_3
• date_udf
• decimal_1
• decimal_2
• empty_join
• filter_join_breaktask2
• groupby_duplicate_key
• groupby_grouping_window
• groupby_sort_10
• insert_into1
• interval_arithmetic
• join18
• join18_multi_distinct
• lineage2
• mapjoin_test_outer
• metadata_only_queries
• metadata_only_queries_with_filters
• non_ascii_literal
• orc_dictionary_threshold
• orc_diff_part_cols
• orc_empty_strings
• orc_file_dump
• orc_int_type_promotion
• orc_predicate_pushdown
• offset_limit_global_optimizer
• parquet_ppd_decimal
• parquet_predicate_pushdown
• partcols1
• partition_date
• partition_date2
• partition_multilevels
• partition_timestamp
• partition_timestamp2
• partition_varchar1
• partition_wise_fileformat2
• ppr_pushdown2
• rcfile_null_value
• selectDistinctStar
• special_characters_in_tabnames_1
• stats1
• str_to_map
• temp_table_windowing_expressions
• test_boolean_whereclause
• timestamp_3
• timestamp_lazy
• timestamp_udf
• truncate_column
• truncate_column_merge
• udf_context_aware
• udf_get_json_object
• udf_length
• udf_printf
• udf_round_2
• udtf_json_tuple
• union6
• union34
• unionDistinct_1
• vector_binary_join_groupby
• vector_data_types
• vector_decimal_1
• vector_decimal_2
• vector_orderby_5
• windowing_distinct
• windowing_expressions
• windowing_multipartitioning
• windowing_navfn
• windowing_rank
o Failures:
• auto_join_reordering_values
• auto_sortmerge_join_1
• auto_sortmerge_join_14
• auto_sortmerge_join_2
• auto_sortmerge_join_3
• auto_sortmerge_join_4
• auto_sortmerge_join_5
• auto_sortmerge_join_6
• auto_sortmerge_join_7
• auto_sortmerge_join_9
• bucketsortoptimize_insert_2
• bucketsortoptimize_insert_4
• bucketsortoptimize_insert_5
• join42
• join_filters
• mapjoin1
• orc_min_max
• partition_wise_fileformat16
• ppd_union_view
• skewjoin
• vector_elt
TestContribNegativeCliDriver
o Wrong Results:
o Failures:
• case_with_row_sequence
TestHBaseCliDriver
o Wrong Results:
• hbase_single_sourced_multi_insert
o Failures:
TestMiniLlapCliDriver
o Wrong Results:
• hybridgrace_hashjoin_1
• hybridgrace_hashjoin_2
• tez_join_tests
• tez_union_decimal
o Failures:
• bucket_map_join_tez1
• tez_bmj_schema_evolution
• tez_smb_main
TestMiniSparkOnYarnCliDriver
o Wrong Results:
• schemeAuthority2
• vector_outer_join1
• vector_outer_join2
• vector_outer_join3
• vector_outer_join4
o Failures:
• bucketmapjoin7
TestMiniTezCliDriver
o Wrong Results:
• cbo_simple_select
• cbo_union
• cbo_views
• cbo_windowing
• custom_input_output_format
• empty_join
• filter_join_breaktask2
• hybridgrace_hashjoin_1
• hybridgrace_hashjoin_2
• insert_into1
• mergejoin
• metadata_only_queries
• metadata_only_queries_with_filters
• selectDistinctStar
• select_dummy_source
• tez_join_tests
• tez_union_decimal
• union6
• unionDistinct_1
• vector_binary_join_groupby
• vector_data_types
• vector_decimal_1
• vector_decimal_2
• vector_outer_join1
• vector_outer_join2
• vector_outer_join3
• vector_outer_join4
• vector_orderby_5
• vector_when_case_null
• vectorized_date_funcs
o Failures:
• (Various setup failures)
• bucket_map_join_tez1
• schema_evol_orc_acidvec_mapwork_part
• tez_bmj_schema_evolution
• tez_union
• vector_elt
TestMinimrCliDriver
o Wrong Results:
• parallel_orderby
o Failures:
• (Skipping this one)
• orc_merge_diff_fs
• SchemeAuthority
• scriptfile1_win
• Shutdown?
TestNegativeCliDriver
o udf_test_error
TestPerfCliDriver
o (None)
TestSparkCliDriver
o Wrong Results:
• auto_join18
• bucket_map_join_1
• bucket_map_join_2
• custom_input_output_format
• filter_join_breaktask2
• join18
• join25
• join27
• join30
• join34
• join37
• join39
• join40
• mapjoin1
• metadata_only_queries_with_filters
• semijoin
• tables_access_keys_stats
• timestamp_3
• union6
o Failures:
• annotate_stats_join
{code}
> Support Vectorization for TEXTFILE and other formats
> ----------------------------------------------------
>
> Key: HIVE-12878
> URL: https://issues.apache.org/jira/browse/HIVE-12878
> Project: Hive
> Issue Type: New Feature
> Components: Hive
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
> Attachments: HIVE-12878.01.patch
>
>
> Support vectorizing when the input format is TEXTFILE and other formats for
> better Map Vertex performance.