Prevent automatic indexing from creating worse queries
------------------------------------------------------
Key: HIVE-2081
URL: https://issues.apache.org/jira/browse/HIVE-2081
Project: Hive
Issue Type: Improvement
Components: Indexing
Affects Versions: 0.8.0
Reporter: Russell Melick
We want to make sure that automatically using indexes doesn't make a query
slower. For example, after scanning the index table, the query might still need
to scan the whole base table. In that case, we would much rather kill the
index job and fall back to scanning the whole base table.
This can be done by adding a conditional task and a backup task. Whether the
index is worth using can be detected by monitoring the index job's number of
input records and number of output records, and comparing them. As an initial
heuristic, if the ratio is >50, do not use the index, and fall back to scanning
the whole base table.
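
The check described above could look roughly like the following sketch. It is
not Hive code: the function name, parameter names, and the handling of an empty
index job are hypothetical. It also assumes that "ratio >50" means output
records as a percentage of input records; the description does not say which
way the ratio is taken, so treat that reading as a guess.

```python
def should_fall_back_to_full_scan(input_records: int,
                                  output_records: int,
                                  max_output_percent: float = 50.0) -> bool:
    """Decide whether to abandon the index job and scan the base table.

    ASSUMPTION: the ratio is output records as a percentage of input
    records. The issue text only says "ratio > 50", so this is one
    possible interpretation, not the confirmed one.
    """
    if input_records < 0 or output_records < 0:
        raise ValueError("record counts must be non-negative")
    if input_records == 0:
        # ASSUMPTION: with no input there is nothing to compare, so the
        # index is kept. A real implementation might choose otherwise.
        return False
    output_percent = 100.0 * output_records / input_records
    # Strictly greater than the threshold, matching ">50" in the text.
    return output_percent > max_output_percent
```

For example, an index job that read 1000 records and emitted 900 would trigger
the fallback under this reading, while one that emitted 100 would not. In the
proposed design, the conditional task would evaluate a test of this kind and
start the backup task (the full base table scan) when it returns true.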