[
https://issues.apache.org/jira/browse/HIVE-24709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vignesh Ilangovan updated HIVE-24709:
-------------------------------------
Summary: Hive Compute Statistics for columns failing after new columns
added to AVRO tables (was: Hive Stats compute on columns failing after new
columns added to AVRO tables)
> Hive Compute Statistics for columns failing after new columns added to AVRO
> tables
> ----------------------------------------------------------------------------------
>
> Key: HIVE-24709
> URL: https://issues.apache.org/jira/browse/HIVE-24709
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 1.2.1
> Reporter: Vignesh Ilangovan
> Priority: Major
>
> ANALYZE TABLE <<table>> COMPUTE STATISTICS FOR COLUMNS;
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.ColumnStatsTask
>
> When AVRO non-partition table updated with new columns in AVSC files then
> compute statistics on table working fine but compute statistics on columns
> failing with above error. Temporarily dropped and recreated the hive avro
> table since it is external table there is no much impact. But each time
> recreating the DDL is not an right option
> Note:
> MR job succeeded on compute statistics for columns but at the final step it
> returns code 1.
> Hope it is some bug.
>
> Steps to reproduce:
> #1 Create table with avro table as external table pointing to avsc file
> #2 Update the new columns in avsc file
> #3 Run 'ANALYZE TABLE <<table>> COMPUTE STATISTICS FOR COLUMNS; '
> #4 MR will succeed but it will fail with return code 1
> from org.apache.hadoop.hive.ql.exec.ColumnStatsTask
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)