[ 
https://issues.apache.org/jira/browse/HIVE-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139766#comment-15139766
 ] 

Matt McCline commented on HIVE-12878:
-------------------------------------

I just went through the 1253 test failures to filter out the expected 
"Execution mode: vectorized", statistics differences, etc.

Here are the query wrong results and test failures.  A rather stunning amount.

{code}
TestCliDriver
o       Wrong Results:
•       add_part_multiple
•       alter_partition_coltype
•       alter_varchar2
•       analyze_tbl_part
•       auto_join18
•       auto_join18_multi_distinct
•       avro_schema_evolution_native
•       avro_timestamp
•       bucket_groupby
•       cbo_const
•       cbo_rp_lineage2
•       cbo_rp_union
•       cbo_rp_views
•       cbo_rp_windowing
•       cbo_union
•       cbo_views
•       cbo_windowing
•       complex_alias
•       constprog_type
•       correlationoptimizer14
•       correlationoptimizer2
•       correlationoptimizer8
•       ctas_colname
•       custom_input_output_format
•       date_1
•       date_3
•       date_udf
•       decimal_1
•       decimal_2
•       empty_join
•       filter_join_breaktask2
•       groupby_duplicate_key
•       groupby_grouping_window
•       groupby_sort_10
•       insert_into1
•       interval_arithmetic
•       join18
•       join18_multi_distinct
•       lineage2
•       mapjoin_test_outer
•       metadata_only_queries
•       metadata_only_queries_with_filters
•       non_ascii_literal
•       orc_dictionary_threshold
•       orc_diff_part_cols
•       orc_empty_strings
•       orc_file_dump
•       orc_int_type_promotion
•       orc_predicate_pushdown
•       offset_limit_global_optimizer
•       parquet_ppd_decimal
•       parquet_predicate_pushdown
•       partcols1
•       partition_date
•       partition_date2
•       partition_multilevels
•       partition_timestamp
•       partition_timestamp2
•       partition_varchar1
•       partition_wise_fileformat2
•       ppr_pushdown2
•       rcfile_null_value
•       selectDistinctStar
•       special_characters_in_tabnames_1
•       stats1
•       str_to_map
•       temp_table_windowing_expressions
•       test_boolean_whereclause
•       timestamp_3
•       timestamp_lazy
•       timestamp_udf
•       truncate_column
•       truncate_column_merge
•       udf_context_aware
•       udf_get_json_object
•       udf_length
•       udf_printf
•       udf_round_2
•       udtf_json_tuple
•       union6
•       union34
•       unionDistinct_1
•       vector_binary_join_groupby
•       vector_data_types
•       vector_decimal_1
•       vector_decimal_2
•       vector_orderby_5
•       windowing_distinct
•       windowing_expressions
•       windowing_multipartitioning
•       windowing_navfn
•       windowing_rank
o       Failures:
•       auto_join_reordering_values
•       auto_sortmerge_join_1
•       auto_sortmerge_join_14
•       auto_sortmerge_join_2
•       auto_sortmerge_join_3
•       auto_sortmerge_join_4
•       auto_sortmerge_join_5
•       auto_sortmerge_join_6
•       auto_sortmerge_join_7
•       auto_sortmerge_join_9
•       bucketsortoptimize_insert_2
•       bucketsortoptimize_insert_4
•       bucketsortoptimize_insert_5
•       join42
•       join_filters
•       mapjoin1
•       orc_min_max
•       partition_wise_fileformat16
•       ppd_union_view
•       skewjoin
•       vector_elt

TestContribNegativeCliDriver
o       Wrong Results:
o       Failures:
•       case_with_row_sequence

TestHBaseCliDriver
o       Wrong Results:
•       hbase_single_sourced_multi_insert
o       Failures:

TestMiniLlapCliDriver
o       Wrong Results:
•       hybridgrace_hashjoin_1
•       hybridgrace_hashjoin_2
•       tez_join_tests
•       tez_union_decimal
o       Failures:
•       bucket_map_join_tez1
•       tez_bmj_schema_evolution
•       tez_smb_main
•       TestMiniSparkOnYarnCliDriver
o       Wrong Results:
•       schemaAuthority2
•       vector_outer_join1
•       vector_outer_join2
•       vector_outer_join3
•       vector_outer_join4
o       Failures:
•       bucketmapjoin7

TestMiniTezCliDriver
o       Wrong Results:
•       cbo_simple_select
•       cbo_union
•       cbo_views
•       cbo_windowing
•       custom_input_output_format
•       empty_join
•       filter_join_breaktask2
•       hybridgrace_hashjoin_1
•       hybridgrace_hashjoin_2
•       insert_into1
•       mergejoin
•       metadata_queries_only
•       metadata_queries_only_with_filters
•       selectDistinctStar
•       select_dummy_source
•       tez_join_tests
•       tez_union_decimal
•       union6
•       unionDistinct_1
•       vector_binary_join_groupby
•       vector_data_types
•       vector_decimal_1
•       vector_decimal_2
•       vector_outer_join1
•       vector_outer_join2
•       vector_outer_join3
•       vector_outer_join4
•       vector_orderby_5
•       vector_when_case_null
•       vectorized_date_funcs
o       Failures:
•       (Various setup failures)
•       bucket_map_join_tez1
•       schema_evol_orc_acidvec_mapwork_part
•       tez_bmj_schema_evolution
•       tez_union
•       vector_elt

TestMinimrCliDriver
o       Wrong Results:
•       parallel_orderby
o       Failures:
•       (Skipping this one)
•       orc_merge_diff_fs
•       SchemeAuthority
•       scriptfile1_win
•       Shutdown?

TestNegativeCliDriver
o       udf_test_error

TestPerfCliDriver
o       (None)

TestSparkCliDriver
o       Wrong Results:
•       autojoin_18
•       bucket_map_join_1
•       bucket_map_join_2
•       custom_input_output_format
•       filter_join_breaktask2
•       join18
•       join25
•       join27
•       join30
•       join34
•       join37
•       join39
•       join40
•       mapjoin1
•       metadata_only_queries_with_filters
•       semijoin
•       tables_access_keys_stats
•       timestamp_3
•       union6
o       Failures:
•       annotate_stats_join
{code}

> Support Vectorization for TEXTFILE and other formats
> ----------------------------------------------------
>
>                 Key: HIVE-12878
>                 URL: https://issues.apache.org/jira/browse/HIVE-12878
>             Project: Hive
>          Issue Type: New Feature
>          Components: Hive
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>         Attachments: HIVE-12878.01.patch
>
>
> Support vectorizing when the input format is TEXTFILE and other formats for 
> better Map Vertex performance.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to