David Wayne Birdsall created TRAFODION-3151:
-----------------------------------------------

             Summary: Improve CQDs in UPDATE STATISTICS for using a backing 
sample table
                 Key: TRAFODION-3151
                 URL: https://issues.apache.org/jira/browse/TRAFODION-3151
             Project: Apache Trafodion
          Issue Type: Improvement
          Components: sql-cmp
            Reporter: David Wayne Birdsall
            Assignee: David Wayne Birdsall


In UPDATE STATISTICS, it may sometimes be desirable to manually create a sample 
table and ask UPDATE STATISTICS to use that when generating statistics for a 
given base table. This desire might arise from the need to work around some bug 
in UPDATE STATISTICS in its own automatic sample table logic. Or the desire may 
come from the need to manually manipulate the sample data itself.

There are two CQDs presently for this purpose.

CQD USTAT_SAMPLE_TABLE_NAME if set provides a table name which will be assumed 
to be a user-created sample table.

CQD USTAT_USE_BACKING_SAMPLE if set to 'ON' indicates that the sample table is 
a Hive table.

We could simplify this by getting rid of the second CQD, and relying on catalog 
and schema qualifiers in the first CQD to indicate whether the table is Hive or 
not using the usual rules (that is, if the catalog name is 'HIVE', then it is a 
Hive table).

There is other logic associated with these CQDs that tries to infer sampling 
ratio. This logic is inconsistent and appears incorrect. So, this aspect should 
be re-engineered as needed and then appropriately documented.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to