[
https://issues.apache.org/jira/browse/DERBY-2991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mike Matrigali updated DERBY-2991:
----------------------------------
After reviewing the current patch, this does not look like a change appropriate
for a backport. It is already 5000 lines of diffs and changes a basic
concurrency building block of the btree code. Getting rid of the scan lock if
it can be done with little or no performance penality does look like a good
feature for a major release. It will simplify a lot of code, but would like to
see a lot of testing before it gets into an official derby release.
I do agree this is a good direction. Some of the code that originally depended
on the scan protection lock no longer needs to as part of work that was later
done to support the read uncommitted isolation level. Off the top of my head
my areas of concern would be that the following operations work correctly in
all combinations without the concurrency protection provided by the scan
protection lock (I think they are ok as they should be protected by latches,
but just get worried removing locking from these operations):
split
merge
reclaim deleted rows
Another area that might be worth thinking about is to make sure the code that
get's previous key locks for serializable is still right without the scan
protection lock. Need to make sure an intervening split which previously was
not possible does not break this code.
This stuff is hard to test for, I will think about these operations and see if
there is any hidden dependency or if they only got the scan protection lock to
enable the scan optimization.
I definitely do worry about having to copy around the full key every time one
gives up the latch. Given the current store interface we can't keep references
so we have to allocate
objects. This in the worst case can lead to allocation/copies for every index
reference in a query and can quickly add up which was why the additional
complicaiton of the scan lock was added in the first place. It would be
interesting to understand the performance overhead of the copy vs. the extra
search. As I understand it the proposed optimization would in the "usual" case
as long as the page remained in memory eliminate the scan but would not
eliminate the copy. Probably the worst case is long keys (maybe multi-part)
and maybe datatypes that require object allocations every time they are copied
(like decimal).
> Index split deadlock
> --------------------
>
> Key: DERBY-2991
> URL: https://issues.apache.org/jira/browse/DERBY-2991
> Project: Derby
> Issue Type: Bug
> Components: Store
> Affects Versions: 10.2.2.0, 10.3.1.4
> Environment: Windows XP, Java 6
> Reporter: Bogdan Calmac
> Assignee: Knut Anders Hatlen
> Attachments: d2991-preview-1a.diff, d2991-preview-1a.stat,
> d2991-preview-1b.diff, d2991-preview-1b.stat, d2991-preview-1c.diff,
> d2991-preview-1c.stat, derby.log, InsertSelectDeadlock.java, Repro2991.java,
> stacktraces_during_deadlock.txt
>
>
> After doing dome research on the mailing list, it appears that the index
> split deadlock is a known behaviour, so I will start by describing the
> theoretical problem first and then follow with the details of my test case.
> If you have concurrent select and insert transactions on the same table, the
> observed locking behaviour is as follows:
> - the select transaction acquires an S lock on the root block of the index
> and then waits for an S lock on some uncommitted row of the insert transaction
> - the insert transaction acquires X locks on the inserted records and if it
> needs to do an index split creates a sub-transaction that tries to acquire an
> X lock on the root block of the index
> In summary: INDEX LOCK followed by ROW LOCK + ROW LOCK followed by INDEX LOCK
> = deadlock
> In the case of my project this is an important issue (lack of concurrency
> after being forced to use table level locking) and I would like to contribute
> to the project and fix this issue (if possible). I was wondering if someone
> that knows the code can give me a few pointers on the implications of this
> issue:
> - Is this a limitation of the top-down algorithm used?
> - Would fixing it require to use a bottom up algorithm for better
> concurrency (which is certainly non trivial)?
> - Trying to break the circular locking above, I would first question why
> does the select transaction need to acquire (and hold) a lock on the root
> block of the index. Would it be possible to ensure the consistency of the
> select without locking the index?
> -----
> The attached test (InsertSelectDeadlock.java) tries to simulate a typical
> data collection application, it consists of:
> - an insert thread that inserts records in batch
> - a select thread that 'processes' the records inserted by the other thread:
> 'select * from table where id > ?'
> The derby log provides detail about the deadlock trace and
> stacktraces_during_deadlock.txt shows that the inser thread is doing an index
> split.
> The test was run on 10.2.2.0 and 10.3.1.4 with identical behaviour.
> Thanks,
> Bogdan Calmac.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.