Re: TDB1/TDB2 disk space with and without named graphs

Andy Seaborne Fri, 17 Nov 2017 00:10:18 -0800


On 16/11/17 21:37, Andy Seaborne wrote:

On 16/11/17 20:36, Osma Suominen wrote:
Andy Seaborne wrote on 16.11.2017 at 22:04:
TDB1 or HDT.

TDB2 has no benefits for you at 40M triples with occasional updates.
Compaction would be a benefit, if it could be automated. But apparently not in the current state (see today's dev@ thread).
TDB2's goals are to address the scale limitations on transactions and the write-back queue overload problems, to provide a better architecture (e.g. fully integrating jena-text into transactions), and to remove the quirks about models across transactions. TDB2 is experimental at this stage.
Understood.
(You could use DatasetGraphSwitchable in TDB2 to make a switchable HDT-backed database.)
Thanks for the tip!
I think there's a lot of potential in HDT, it's just hampered by implementation bugs and lack of resources on the hdt-java side. For my use case it would be almost perfect, but the hdt-java implementation doesn't support union default graph functionality [1]. It could be added of course, it just hasn't been.
Fuseki (well, ARQ) supports union graph on all datasets these days.
It will be a loop over graphs if necessary, and suppressing duplicates is expensive in the general case. Putting graphs one by one into a general purpose RDF Dataset (DatasetImpl) means a loop.
(they use dataset in the general sense of "collection of data", not RDF Dataset)
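
To illustrate why the general case is expensive, here is a minimal Python sketch of a union graph computed as a loop over named graphs. It is not how ARQ implements it; the function name, the dict-of-lists dataset layout, and the example URNs are all assumptions made for illustration. The point is that duplicate suppression needs a set of every triple seen so far.

```python
from typing import Dict, Iterator, List, Set, Tuple

Triple = Tuple[str, str, str]


def union_graph(named_graphs: Dict[str, List[Triple]]) -> Iterator[Triple]:
    """Yield each distinct triple found in any named graph (hypothetical helper)."""
    seen: Set[Triple] = set()  # grows with the number of distinct triples
    for triples in named_graphs.values():  # the "loop over graphs"
        for triple in triples:
            if triple not in seen:  # duplicate suppression
                seen.add(triple)
                yield triple


# Illustrative data only: one triple appears in both graphs.
dataset = {
    "urn:example:g1": [
        ("urn:example:s", "urn:example:p", "urn:example:o1"),
        ("urn:example:s", "urn:example:p", "urn:example:o2"),
    ],
    "urn:example:g2": [
        ("urn:example:s", "urn:example:p", "urn:example:o2"),
        ("urn:example:s", "urn:example:p", "urn:example:o3"),
    ],
}

result = list(union_graph(dataset))
```

A store that keeps quads in a suitable index order can avoid the `seen` set; a generic dataset that has no such index cannot.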
     Andy
-Osma

[1] https://github.com/rdfhdt/hdt-java/issues/3


... isn't about union graphs.

It is about whether FROM/FROM NAMED pick graphs from the service dataset, which, in Fuseki1, they don't.


It is reported that

    select * where {graph ?g {?s ?p ?o}}

works, so named graphs are working, which has nothing to do with HDT.

If they upgrade to a current Fuseki2, that should fix it, and then
GRAPH <urn:x-arq:UnionGraph> should work.
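
As a rough sketch of what such a request could look like from a client, the Python below builds a SPARQL protocol GET URL for a union-graph query. The endpoint URL and the function name are hypothetical (adjust to the actual service name); only the GRAPH <urn:x-arq:UnionGraph> pattern comes from the discussion above.

```python
from urllib.parse import urlencode

# Hypothetical Fuseki2 query endpoint; the host, port and dataset name are assumptions.
ENDPOINT = "http://localhost:3030/ds/query"

# Query the union of all named graphs via ARQ's special graph name.
UNION_QUERY = """
SELECT * WHERE {
  GRAPH <urn:x-arq:UnionGraph> { ?s ?p ?o }
}
LIMIT 10
""".strip()


def build_query_url(endpoint: str, query: str) -> str:
    """Return a SPARQL protocol GET URL with the query percent-encoded."""
    return endpoint + "?" + urlencode({"query": query})


url = build_query_url(ENDPOINT, UNION_QUERY)
print(url)
```

The resulting URL can be fetched with any HTTP client, asking for results with an Accept header such as application/sparql-results+json.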

    Andy
