I have noticed one oddity in TDB that I want to investigate.

There is an oddity - in fact, it's a design flaw. This message is a "full disclosure" message.

It's very hard to construct a test case that reproduces these conditions reliably; this was found by code analysis and a sense of paranoia gained during time on a fault-tolerant systems project in the telecoms sector a while ago.

== Analysis

When preparing a transaction commit, the transactional node table updates the master node-hash-to-node-id index, and that update risks damaging the on-disk structure.
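As a rough sketch of the structure involved (the class and method names here are illustrative, not TDB's actual API): the node table maps a hash of a node's content to a node id, and at prepare time the entries accumulated inside the transaction are pushed into the shared master index, which in TDB is B+Tree-backed on disk.

```python
# Hypothetical sketch, not TDB code: a node table that defers updates to
# the master hash -> node-id index until commit prepare.
import hashlib

class NodeTable:
    def __init__(self):
        self.master = {}    # hash -> node id (a B+Tree on disk in TDB)
        self.pending = {}   # entries added inside the current transaction
        self.next_id = 0

    def intern(self, node: str) -> int:
        """Return the id for a node, allocating one in the transaction if new."""
        h = hashlib.sha1(node.encode()).hexdigest()
        for table in (self.master, self.pending):
            if h in table:
                return table[h]
        self.pending[h] = self.next_id
        self.next_id += 1
        return self.pending[h]

    def prepare(self):
        """Commit prepare: push pending entries into the master index.
        In TDB this is the step that updates the on-disk B+Tree, and
        so the step at which a partial block-split write can do damage."""
        self.master.update(self.pending)
        self.pending.clear()
```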

Entries are only added to this structure during prepare. On-disk structures can only be corrupted when a B+Tree update causes a block split and only part of the split is written back; updating a single block only risks inaccessible junk in that one block. Block splits occur roughly once in every 100 updates.

If a block split is only partially written back, the on-disk tree is broken. That can happen if write caching flushes one block but not the other (even though they have the same last-write time, so they are adjacent in the LRU queue) and there is then a system crash, or if a filesystem sync writes one block but not the other due to a power failure.
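To make the failure mode concrete, here is a toy model (my own sketch, not TDB code) of a one-level split: the overfull block is divided into two, and the parent gains a reference to the newly allocated block. If a crash means the new block never reaches disk while the parent update does, the parent points at junk and part of the tree becomes unreadable.

```python
# Toy model of why a half-written block split breaks the tree.
# "disk" is a dict of block id -> sorted keys; "parent" lists child block ids.

def split_and_write(disk, parent, child_id, new_id, lose_new_block=False):
    """Split the overfull child into two blocks and update the parent.
    With lose_new_block=True, simulate a crash that persisted the parent
    update and the rewritten left block, but not the new right block."""
    keys = disk[child_id]
    mid = len(keys) // 2
    left, right = keys[:mid], keys[mid:]
    disk[child_id] = left            # rewrite of the existing block
    if not lose_new_block:
        disk[new_id] = right         # write of the newly allocated block
    parent.append(new_id)            # parent update survives either way

def lookup(disk, parent, key):
    """Find the block holding key; raises KeyError on a dangling reference."""
    for child_id in parent:
        if key in disk[child_id]:
            return child_id
    return None
```

With `lose_new_block=True`, looking up any key that moved to the new block raises `KeyError`: that is the "inaccessible junk" described above, except that in the real structure it is persistent on-disk damage rather than an exception.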

The same problem could arise in the non-transactional system, although B+Tree structural problems have not been reported there.

This applies to both direct and mapped modes, although with different probabilities of damaging on-disk data.

== Plan

It's a design flaw and the fix is to reimplement the transactional node table.

== 2.7.1 Release

On balance, I don't think this should hold the release up; better to release now and get another release out relatively soon.

1/ There are fixes elsewhere we want to get out.

2/ There are fixes in TDB. 0.9.1 is at least better than 0.9.0 (the design flaw is already in 0.9.0), so holding up the TDB part of the release would actually withhold other fixes from users.

3/ The reimplementation needs thorough testing; releasing lightly tested code poses more risk to data than going with what there is.

        Andy
