[
https://issues.apache.org/jira/browse/PHOENIX-1560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277396#comment-14277396
]
James Taylor edited comment on PHOENIX-1560 at 1/14/15 6:48 PM:
----------------------------------------------------------------
I see your point for the no where clause case. With an order by and no where
clause, we'd be choosing between:
- re-ordering the data table versus
- a scan over the index table plus a skip scan per row for the data table
Not sure how to compare those, but I know that the re-order is expensive, as it
essentially causes the data table to be re-written. I guess I'd lean toward
using the optimization, as that's what the hint is directing us to do. If perf
is bad, the user can always remove the hint.
was (Author: jamestaylor):
I see your point for the no where clause case. With an order by and no where
clause, we'd be choosing between:
- re-ordering the data table versus
- a scan over the index table plus a skip scan per row for the data table
Not sure how to compare those, but I know that the re-order is expensive, as it
essentially causes the data table to be re-written. I guess I'd lean toward
using the optimization, as that's what the hint is directing us to do. If perf
is bad, the user can always remove the hint.
> Join between global index and data table if INDEX hint used
> -----------------------------------------------------------
>
> Key: PHOENIX-1560
> URL: https://issues.apache.org/jira/browse/PHOENIX-1560
> Project: Phoenix
> Issue Type: Bug
> Reporter: James Taylor
> Assignee: Maryann Xue
> Attachments: 1560.patch
>
>
> We already have an INDEX hint, and we already have a mechanism to collect
> referenced columns in the data table that are not in the index table (used
> only for local indexes currently). Instead of not using the global index when
> a referenced data column is not found in the index, we should rewrite the
> query to join back to the data table when the INDEX hint is present. This is
> always possible, as we always have the data PK columns in the index table,
> and our join optimization would kick in as well.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)