Re: TDB1/TDB2 disk space with and without named graphs

Andy Seaborne Thu, 16 Nov 2017 13:38:21 -0800


On 16/11/17 20:36, Osma Suominen wrote:

Andy Seaborne wrote on 16.11.2017 at 22:04:
TDB1 or HDT.

TDB2 has no benefits for you at 40M triples with occasional updates.
Compaction would be a benefit, if it could be automated. But apparently not in the current state (see today's dev@ thread).
TDB2 goals are to address the scale limitations on transactions, the write-back queue overload problems, a better architecture (e.g. fully integrate in jena-text transactions), and no quirks about models across transactions. TDB2 is experimental at this stage.
Understood.
(You could use DatasetGraphSwitchable in TDB2 to make a switchable HDT-backed database.)
Thanks for the tip!
I think there's a lot of potential in HDT, it's just hampered by implementation bugs and lack of resources on the hdt-java side. For my use case it would be almost perfect, but the hdt-java implementation doesn't support union default graph functionality [1]. It could be added of course, just hasn't been.


Fuseki (well, ARQ) supports union graph on all datasets these days.

It will be a loop over graphs if necessary, and suppressing duplicates is expensive in the general case. Putting graphs one by one into a general purpose RDF Dataset (DatasetImpl) means a loop.
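
To illustrate why the general case is expensive, here is a minimal, language-neutral sketch in Python. It is not ARQ's implementation; the names `union_graph`, `dataset` and the `urn:example:` graph names are made up for illustration. It shows the two costs mentioned above: a loop over every named graph, and a "seen" set that has to remember every distinct triple in order to suppress duplicates.

```python
from typing import Dict, Iterator, Set, Tuple

# A triple is modelled as a plain (subject, predicate, object) tuple.
Triple = Tuple[str, str, str]


def union_graph(named_graphs: Dict[str, Set[Triple]]) -> Iterator[Triple]:
    """Yield every triple found in any named graph, each one only once."""
    # The 'seen' set grows with the number of distinct triples, which is
    # why duplicate suppression is costly when nothing is known about
    # how the graphs are stored.
    seen: Set[Triple] = set()
    for _graph_name, triples in named_graphs.items():  # the loop over graphs
        for triple in triples:
            if triple not in seen:
                seen.add(triple)
                yield triple


# Hypothetical dataset: one triple appears in both graphs.
dataset = {
    "urn:example:g1": {("s1", "p", "o1"), ("s2", "p", "o2")},
    "urn:example:g2": {("s2", "p", "o2"), ("s3", "p", "o3")},
}

result = list(union_graph(dataset))
```

A store that keeps quads in sorted indexes can often avoid the separate "seen" set; the sketch only covers the case where the graphs are opaque collections.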

(they use dataset in the general sense of "collection of data", not RDF Dataset)


    Andy


-Osma

[1] https://github.com/rdfhdt/hdt-java/issues/3
