Hi Andy,

thank you, that was very helpful for getting the whole picture.

Some time ago, I told you that at my workplace we implemented an in-memory SPARQL server based on Jena's Delta <https://jena.apache.org/documentation/javadoc/jena/org.apache.jena.core/org/apache/jena/graph/compose/Delta.html>. We started a few years ago, before RDF Patch <https://jena.apache.org/documentation/rdf-patch/>, based on the "difference model" <https://lists.w3.org/Archives/Public/www-rdf-interest/2001Mar/0216.html>, which has become part of the CGMES standard.

For our server, we strictly follow the CQRS-with-event-sourcing pattern <https://learn.microsoft.com/en-us/azure/architecture/patterns/cqrs>. Each transaction is recorded as an event with a list of triples added and a list of triples removed. The events are stored in an RDBMS (Oracle or PostgreSQL). For query execution, the relevant data needs to fit into memory, but all data and versions are also persisted. To be able to store and load graphs very fast, we use RDF Thrift with LZ4 compression and store the results in blobs. All queries are executed on datasets projected for the requested version (any previous version) of the data and the requested named graphs. Thanks to the versioning, we fully support MR+SW. We even support multiple writers, with a git-like branching-and-merging approach and optimistic locking. To prevent the chain of deltas from becoming too long, we have several strategies for writing full graphs into snapshots.
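Roughly, the store/load part looks like the sketch below. It is a simplified illustration, not our actual code: it assumes the lz4-java library (net.jpountz) for the compression, the class and method names are invented, and error handling is omitted.

    import java.io.ByteArrayInputStream;
    import java.io.ByteArrayOutputStream;
    import java.io.IOException;

    import net.jpountz.lz4.LZ4BlockInputStream;
    import net.jpountz.lz4.LZ4BlockOutputStream;

    import org.apache.jena.graph.Graph;
    import org.apache.jena.riot.Lang;
    import org.apache.jena.riot.RDFDataMgr;
    import org.apache.jena.riot.RDFFormat;
    import org.apache.jena.sparql.graph.GraphFactory;

    /** Hypothetical helper: (de)serialize a graph as LZ4-compressed RDF Thrift. */
    public class GraphBlobCodec {

        /** Serialize a graph to an LZ4-compressed RDF Thrift byte array (the blob payload). */
        public static byte[] toBlob(Graph graph) throws IOException {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (LZ4BlockOutputStream lz4 = new LZ4BlockOutputStream(bytes)) {
                RDFDataMgr.write(lz4, graph, RDFFormat.RDF_THRIFT);
            }
            return bytes.toByteArray();
        }

        /** Load a graph back from the compressed blob payload. */
        public static Graph fromBlob(byte[] blob) {
            Graph graph = GraphFactory.createGraphMem();
            RDFDataMgr.read(graph, new LZ4BlockInputStream(new ByteArrayInputStream(blob)), Lang.RDFTHRIFT);
            return graph;
        }
    }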
DatasetGraphInMemory seems to be 5-7 times slower than GraphMem, at least when adding triples. So it might be worth trying to meet your constraints while leaving the poor performance behind. My basic idea (a rough code sketch follows after the list) is:

- have two GraphMem instances, a "writer graph" and a "reader graph", and switch between them. Unfortunately, that would mean doubling the memory (but DatasetGraphInMemory seems to use approx. 3 times the memory of GraphMem).
- implement a dataset that performs the write transactions on the "writer graph" and records them in deltas.
- meanwhile, any number of readers can read the previous version from the "reader graph".
- when the writer transaction finishes and there are no readers using the "reader graph", it can be swapped with the "writer graph", and the former "reader graph" can be brought up to date by applying the deltas.
- when the next writer starts, it again writes into the "writer graph".
- meanwhile, any number of readers can read previous version(s) from the "reader graph" [+ deltas].
- when the last reader of the oldest version closes its transaction and there is no writer using the "writer graph", the "writer graph" can be rolled back by applying the deltas in reverse order. Then the "reader graph" can be swapped with the "writer graph" for the next read transaction, and the former "reader graph" is updated using all deltas.
- only if there is never a point in time without overlapping read and write transactions might an occasional short pause be needed to clear the deltas that pile up - so it is not lock-free.
- there would be no background tasks or scheduling involved, "only" each graph held twice, and all #add and #remove operations would have to be done 3-4 times.
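To make the swap mechanics a bit more concrete, here is a rough sketch for a single graph pair. All names are invented, and reader counting, version tracking and synchronization are deliberately left out - it only shows the forward path (commit, swap, delta replay), not the reverse replay.

    import java.util.ArrayList;
    import java.util.List;

    import org.apache.jena.graph.Graph;
    import org.apache.jena.graph.Triple;
    import org.apache.jena.sparql.graph.GraphFactory;

    /** Hypothetical sketch: two GraphMem instances with delta replay on swap. */
    public class DualGraph {

        /** One committed write transaction: the triples added and removed. */
        record Delta(List<Triple> added, List<Triple> removed) {}

        private Graph writerGraph = GraphFactory.createGraphMem();
        private Graph readerGraph = GraphFactory.createGraphMem();
        private final List<Delta> pendingDeltas = new ArrayList<>();

        /** Readers always see the stable reader graph. */
        public Graph readerView() {
            return readerGraph;
        }

        /** A writer applies its changes to the writer graph and records a delta. */
        public void commit(List<Triple> added, List<Triple> removed) {
            added.forEach(writerGraph::add);
            removed.forEach(writerGraph::delete);
            pendingDeltas.add(new Delta(added, removed));
        }

        /** Called once no reader still holds the reader graph: swap, then catch up. */
        public void swapAndCatchUp() {
            Graph formerReader = readerGraph;
            readerGraph = writerGraph;    // readers now see the latest committed state
            writerGraph = formerReader;   // the former reader graph becomes the writer graph
            // Replay the recorded deltas so the new writer graph catches up -
            // this is where each #add/#remove is paid for a second time.
            for (Delta d : pendingDeltas) {
                d.added().forEach(writerGraph::add);
                d.removed().forEach(writerGraph::delete);
            }
            pendingDeltas.clear();
        }
    }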
The idea is still a bit blurry in my head. What do you think?

Arne

On Sat, 20 May 2023 at 15:19, Andy Seaborne <a...@apache.org> wrote:

> Hi Arne,
>
> On 19/05/2023 21:21, Arne Bernhardt wrote:
> > Hi,
> > in a recent response
> > <https://github.com/apache/jena/issues/1867#issuecomment-1546931793>
> > to an issue it was said that "Fuseki - uses DatasetGraphInMemory mostly".
> > For my PR <https://github.com/apache/jena/pull/1865>, I added a JMH
> > benchmark suite to the project. So it was easy for me to compare the
> > performance of GraphMem with
> > "DatasetGraphFactory.createTxnMem().getDefaultGraph()".
> > DatasetGraphInMemory is much slower in every discipline tested (#add,
> > #delete, #contains, #find, #stream).
> > Maybe my approach is too naive?
> > I understand very well that the underlying Dexx Collections Framework,
> > with its immutable persistent data structures, makes threading and
> > transaction handling easy
>
> DatasetGraphInMemory (TIM = Transactions In Memory) has one big advantage.
>
> It supports multiple readers and a single writer (MR+SW) at the same
> time - truly concurrent. So does TDB2 (TDB1 is sort of hybrid).
>
> MR+SW has a cost, which is a copy-on-write overhead - a reader-centric
> design choice allowing the readers to run latch-free.
>
> You can't directly use a regular hash map with concurrent updates. (And
> no, ConcurrentHashMap does not solve all problems, even for a single
> data structure. A dataset needs to coordinate changes to multiple data
> structures into a single transactional unit.)
>
> GraphMem cannot do MR+SW - for all storage datasets/graphs that do not
> have built-in support for MR+SW, the best that can be done is MRSW -
> multiple readers or a single writer.
>
> For MRSW, when a writer starts, the system has to hold up subsequent
> readers, let existing ones finish, then let the writer run, then release
> any readers held up. (Variations are possible - whether readers or
> writers get priority.)
>
> This is bad in a general concurrent environment, e.g. Fuseki.
> One writer can "accidentally" lock out the dataset.
>
> Maybe the application isn't doing updates, in which case a memory
> dataset focused on read throughput is better, especially with better
> triple density in memory.
>
> Maybe the application is single-threaded or can control threads itself
> (non-Fuseki).
>
> > and that there are no issues with consuming iterators or
> > streams even after a read transaction has closed.
>
> Continuing to use an iterator after the end of a transaction should not
> be allowed.
>
> > Is it currently supported for consumers to use iterators and streams
> > after a transaction has been closed?
>
> Consumers that want this must copy the iterator - it's an explicit
> opt-in.
>
> Does this happen with Dexx? It may do, because Dexx relies on the
> garbage collector, so some things just happen.
>
> > If so, I don't currently see an easy way to
> > replace DatasetGraphInMemory with a faster implementation. (although
> > transaction-aware iterators that copy the remaining elements into
> > lists could be an option).
>
> Copy-iterators are going to be expensive in RAM - a denial-of-service
> issue - and in speed (a lesser issue, possibly).
>
> > Are there other reasons why DatasetGraphInMemory is the preferred
> > dataset implementation for Fuseki?
>
> MR+SW in an environment where there is no other information about
> requirements is the safe choice.
>
> If an app wants to trade the issues of MRSW for better performance, it
> is a choice it needs to make. One case for Fuseki is publishing
> relatively static data - e.g. reference data, with changes from a known,
> well-behaved application.
>
> Both a general-purpose TIM and a higher-density, faster dataset have
> their places.
>
>     Andy
>
> > Cheers,
> > Arne