[ https://issues.apache.org/jira/browse/PHOENIX-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15103028#comment-15103028 ]
James Taylor commented on PHOENIX-2446:
---------------------------------------
Thanks, [~lhofhansl]. That's exactly what we were looking for.
We can't predict when an index will be added to a table, so we'd like it to
work reliably. Some potential scenarios:
- A user may have a smallish table (100M rows or fewer) and decide to add an
index at that point.
- A user may be adding an index to a view which isn't yet a multi-billion row
table (perhaps the physical table is, though).
- A user may have orthogonal smaller tables being managed by Phoenix because
their bigger ones are there too (it's not easy to use an RDBMS and a
Hadoop-based store seamlessly).
> Immutable index - Index vs base table row count does not match when index is
> created during data load
> -----------------------------------------------------------------------------------------------------
>
> Key: PHOENIX-2446
> URL: https://issues.apache.org/jira/browse/PHOENIX-2446
> Project: Phoenix
> Issue Type: Bug
> Affects Versions: 4.6.0
> Reporter: Mujtaba Chohan
> Assignee: Thomas D'Silva
> Fix For: 4.7.0
>
> Attachments: PHOENIX-2446-wip.patch, PHOENIX-2446.patch, server.log
>
>
> I'll add more details later, but here's the scenario that consistently
> produces a wrong row count for the index table vs. the base table for an
> immutable async index.
> 1. Start data upsert
> 2. Create async index
> 3. Trigger M/R index build
> 4. Keep data upsert going in the background during steps 2 and 3 and for a
> while after the M/R index build finishes.
> 5. End data upsert.
> Now the count with the index enabled vs. the count with a hint to not use
> the index is off by a large factor. Will get a cleaner repro for this issue
> soon.