[jira] [Commented] (DERBY-5367) Stale data retrieved when using new collation=TERRITORY_BASED:PRIMARY feature

Mike Matrigali (JIRA) Thu, 08 Sep 2011 12:42:33 -0700

    [ 
https://issues.apache.org/jira/browse/DERBY-5367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100632#comment-13100632
 ]


Mike Matrigali commented on DERBY-5367:
---------------------------------------

I think the problem is that updateAtSlot interface can not be used in btree's 
in general.   It only supports physical undo which means that the row must be 
in exactly the same spot on the page as it was originally.  It is used in a 
couple of places in the btree code on the control row (ie. the row that is 
always in slot 0).  But otherwise rows move around in btrees and that is the 
purpose of the logical undo logic.  Updates of rows in btrees are always done 
as deletes followed by inserts.  But that does not help in this special case.  
The updateFieldAtSlot interface does support logical undo.  

Off hand seems like we could:
1) write a updateAtSlot that does logical undo.  Before this it was not needed 
because an update of btree row would never "know" that it was going back to the 
same slot, so it was always cleaner to just do the delete and insert.  I would 
be willing to help with this, let me know.  

2) a little ugly but might be interesting as a quick fix to see if it works is 
that you could do updateFieldAtSlot for each field in the row.

I will think about this over night and see if I can come up with anything 
simpler.  

> Stale data retrieved when using new collation=TERRITORY_BASED:PRIMARY feature
> -----------------------------------------------------------------------------
>
>                 Key: DERBY-5367
>                 URL: https://issues.apache.org/jira/browse/DERBY-5367
>             Project: Derby
>          Issue Type: Bug
>          Components: Store
>    Affects Versions: 10.8.1.2, 10.9.0.0
>         Environment: Mac OS X, Windows
>            Reporter: Brett Wooldridge
>            Assignee: Kristian Waagan
>            Priority: Critical
>         Attachments: derby-5367-1a-update_row_fully.diff, 
> derby-5367-1b-update_row_fully.diff, derby-5367-1b-update_row_fully.stat, 
> derby-5367-2a-minimal_fix.diff
>
>
> Our product recently upgraded to version 10.8.1.2 in order to take advantage 
> of the new 'case-insensitive' mode offered by Derby in the form of the 
> "collation=TERRITORY_BASED:PRIMARY" connection parameter.
> Unfortunately, we have run into an issue whereby stale data appears to be 
> retrieved from an index, even though the data in the table itself has changed.
> You can see this issue in the IJ session below.  The database in question was 
> created using this Java parameter when invoking IJ:
> -Dij.database=jdbc:derby:test;create=true;collation=TERRITORY_BASED:PRIMARY
> Here is the IJ session:
> CONNECTION0* -        jdbc:derby:test
> * = current connection
> ij> CREATE TABLE tag (
>     tag_id INTEGER GENERATED BY DEFAULT AS IDENTITY NOT NULL,
>     tag VARCHAR(255) NOT NULL,
>     CONSTRAINT tag_pk PRIMARY KEY (tag_id),
>     CONSTRAINT tag_tag_unique UNIQUE (tag)
> );
> 0 rows inserted/updated/deleted
> ij> -- first insert a value 'Test', note the upper-case 'T' in 'Test'
> ij> INSERT INTO tag (tag) VALUES ('Test');
> 1 row inserted/updated/deleted
> ij> SELECT * FROM tag;
> TAG_ID     |TAG                                                               
>                                                               
> --------------------------------------------------------------------------------------------------------------------------------------------
> 1          |Test                                                              
>                                                               
> 1 row selected
> ij> -- Now delete the row
> ij> DELETE FROM tag WHERE tag='Test';
> 1 row inserted/updated/deleted
> ij> -- You could run another SELECT here to verify it is gone, but it is.
> ij> -- Now insert a new value 'test', note the lower-case 't' in 'test'
> ij> INSERT INTO tag (tag) VALUES ('test');
> 1 row inserted/updated/deleted
> ij> -- Now verify that the table contains only the lower-case version: 'test'
> ij> SELECT * FROM tag;
> TAG_ID     |TAG                                                               
>                                                               
> --------------------------------------------------------------------------------------------------------------------------------------------
> 2          |test                                                              
>                                                               
> 1 row selected
> ij> -- Now, here is the bug.
> ij> SELECT tag FROM tag;
> TAG                                                                           
>                                                   
> --------------------------------------------------------------------------------------------------------------------------------
> Test                                                                          
>                                                   
> 1 row selected
> ij> 
> Note in the last SELECT we specify the 'tag' column specifically.  When we 
> 'SELECT *', Derby performs a table-scan and the result is correct.  However, 
> when we 'SELECT tag', Derby appears to use the index created for the 
> 'tag_tag_unique' unique constraint.  As an optimization Derby, like many 
> databases, will use values directly from the index in the case where the 
> index covers all requested columns.
> The bigger question is, why doesn't the DELETE action cause the entry in the 
> tag_tag_unique index to be deleted?  Is this a further optimization?  If so, 
> it is imperative that the index at least be updated when the new value is 
> inserted.
> This is rather a severe bug for us that causes stale data to be returned.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (DERBY-5367) Stale data retrieved when using new collation=TERRITORY_BASED:PRIMARY feature

Reply via email to