Prevent automatic indexing from creating worse queries
------------------------------------------------------

                 Key: HIVE-2081
                 URL: https://issues.apache.org/jira/browse/HIVE-2081
             Project: Hive
          Issue Type: Improvement
          Components: Indexing
    Affects Versions: 0.8.0
            Reporter: Russell Melick


We want to make sure that automatically using indexes doesn't make the query 
worse.  For example, after scanning the index table, it might still need to 
scan the whole base table.  In this case, we would much rather just kill the 
index job and go back and scan the whole base table.

This can be done by adding a conditional task and a backup task. You can detect 
whether the index is good or not by monitoring the index job's number of input 
records and number of output records, and comparing them. As an initial 
example, if the ratio is >50, do not use the index, and go back to scanning the 
whole base table.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to