"Lizard" is a clustered SPARQL system - it's been my August project. The
target is providing fault-tolerant operation; if any one machine fails
(e.g. hardware or software problem), then Lizard is able to continue
providing the SPARQL service.
Lizard assumes machine fail in a "fail-stop" manner -- the machine fails
and stops on an error, and does not generate malicious information, nor
attempt to come alive again without operations intervention. No
Byzantine fault tolerance.
The goal of work in August has been to prototype the overall design. By
putting something together that is the potentially right design, I can
gather experience of using it to learn what does, and does not, work.
* The code is messy, insufficiently tested and inefficient
Some components are written for convenience of
debugging and fast implementation rather than
efficiency.
* Configuration is very hard (magic required)
It is based on TDB and the rest of Jena, but changes the index design to
make it more suitable for clustering,
There is a related new query engine called 'quack'. It uses merge and
hash joins because the indexed join style of single-machine TDB does not
work on a cluster.
https://github.com/afs/proto-lizard
It is not ready for real use.
Andy