[ https://issues.apache.org/jira/browse/TRAFODION-3316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
David Wayne Birdsall resolved TRAFODION-3316. --------------------------------------------- Resolution: Fixed Fix Version/s: 2.4 > Fix some issues with incremental UPDATE STATISTICS > -------------------------------------------------- > > Key: TRAFODION-3316 > URL: https://issues.apache.org/jira/browse/TRAFODION-3316 > Project: Apache Trafodion > Issue Type: Improvement > Reporter: David Wayne Birdsall > Assignee: David Wayne Birdsall > Priority: Major > Fix For: 2.4 > > Time Spent: 1h 20m > Remaining Estimate: 0h > > Fix some issues with incremental UPDATE STATISTICS. > # The Hive TIMESTAMP datatype has precision 9. When processing a Hive > TIMESTAMP column, UPDATE STATISTICS abends. > # After fixing that, when testing a small example, the sampling ratio for > the sample table is made incorrectly small, resulting in 9207 errors (sample > is empty). > # Sometimes, when populating the sample table, the UPSERT fails because > garbage is used for the table name. This is because the table name is taken > from an NAString that has gone out of scope. > The following test script can be used to test these conditions. In Hive, do > the following: > {quote}create database if not exists h_increUpdatestats; > use h_increUpdatestats; > drop table if exists hive_sequencefile ; > create table hive_sequencefile(C_CUSTKEY int,C_NAME char(20),C_ADDRESS > varchar(20), C_NATIONKEY int, C_TIMESTAMP timestamp,C_PHONE char(20), > C_ACCTBAL string, > C_MKTSEGMENT char(20), C_DATE date) stored as sequencefile; > insert into hive_sequencefile values > (1,'x','xx',1,'2018-08-22 12:00:00.123','xxx','xxxx','xxxxx','2018-08-22'), > (2,'x','xx',2,'2018-08-23 12:00:00.123','xxx','xxxx','xxxxx','2018-08-23'), > (3,'x','xx',3,'2018-08-24 12:00:00.123','xxx','xxxx','xxxxx','2018-08-24'), > (4,'x','xx',4,'2018-08-25 12:00:00.123','xxx','xxxx','xxxxx','2018-08-25'), > (5,'x','xx',5,'2018-08-26 12:00:00.123','xxx','xxxx','xxxxx','2018-08-26'); > {quote} > Then in Trafodion do the following: > {quote}update statistics for table hive.h_increUpdatestats.hive_sequencefile > remove sample; > update statistics for table hive.h_increUpdatestats.hive_sequencefile create > sample random 100 percent; > update statistics for table hive.h_increUpdatestats.hive_sequencefile on > every column; > update statistics for table hive.h_increUpdatestats.hive_sequencefile on > existing columns incremental where C_CUSTKEY>=0; > {quote} > Without any fixes, the third UPDATE STATISTICS command abends. > With a fix for the first problem, the last UPDATE STATISTICS command > sometimes fails with error 2109, and always fails with error 9207. -- This message was sent by Atlassian JIRA (v7.6.3#76005)