[
https://issues.apache.org/jira/browse/PHOENIX-2582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Thomas D'Silva updated PHOENIX-2582:
------------------------------------
Description:
If we create an index while we are upserting rows to the table its possible we
can miss writing corresponding rows to the index table.
If a region server is writing a batch of rows and we create an index just
before the batch is written we will miss writing that batch to the index table.
This is because we run the inital UPSERT SELECT to populate the index with an
SCN that we get from the server which will be before the timestamp the batch of
rows is written.
We need to figure out if there is a way to determine that are pending batches
have been written before running the UPSERT SELECT to do the initial index
population.
was:
If we create an index while we are upserting rows to the table its possible we
can miss writing corresponding rows to the index table.
If a region server is writing a batch of rows and we create an index just
before the batch is written we will miss writing that batch to the index table.
This is because we run the inital UPSERT SELECT to populate the index with an
SCN that we get from the server which will be before the timestamp the batch of
rows is written.
We need to figure out if there is a way to determine that are pending batches
have been written before running the UPSERT SELECT to do the initial index
population.
[~jamestaylor]
> Creating an index while a batch of rows is being written leads to missing
> rows in the index table
> -------------------------------------------------------------------------------------------------
>
> Key: PHOENIX-2582
> URL: https://issues.apache.org/jira/browse/PHOENIX-2582
> Project: Phoenix
> Issue Type: Bug
> Reporter: Thomas D'Silva
>
> If we create an index while we are upserting rows to the table its possible
> we can miss writing corresponding rows to the index table.
> If a region server is writing a batch of rows and we create an index just
> before the batch is written we will miss writing that batch to the index
> table. This is because we run the inital UPSERT SELECT to populate the index
> with an SCN that we get from the server which will be before the timestamp
> the batch of rows is written.
> We need to figure out if there is a way to determine that are pending batches
> have been written before running the UPSERT SELECT to do the initial index
> population.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)