Hi
Loading a TTL document with text indexing took hours (in TDB 3.15.0).
The TTL document is Taxrefld_taxonomy_classes.ttl (2,676,428 triples),
in the zip taxref12-core.zip:
<https://github.com/frmichel/taxref-ld/blob/master/dataset/12.0/taxref12-core.zip>
This method in DatasetGraph is called:
public void add(Node g, Node s, Node p, Node o) ;
With logging at debug level, it appeared that most of the elapsed time was
taken by removing the graph, one entity at a time.
In fact I explicitly call removeGraph() beforehand, because the data is
stored in provenance-specific graphs in this database.
Is there a way to speed this up?
I wondered whether wrapping the removeGraph() operation in a transaction is
mandatory or useful. At runtime Jena does not complain about that ...
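To make the transaction question concrete, here is a minimal sketch of wrapping removeGraph() and the reload of one named graph in a single write transaction, using Jena's Txn helper. It is only an illustration under assumptions: the class ReplaceGraphSketch, the methods replaceGraph and graphSize, and the example.org URIs are hypothetical names, and an in-memory transactional dataset stands in for the text-indexed TDB dataset so that the sketch runs on its own. Whether this changes the loading time is not shown here.

```java
import org.apache.jena.graph.Node;
import org.apache.jena.graph.NodeFactory;
import org.apache.jena.query.Dataset;
import org.apache.jena.query.DatasetFactory;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.system.Txn;
import org.apache.jena.vocabulary.RDFS;

public class ReplaceGraphSketch {

    /** Remove the named graph, then add newData to it, in one write transaction. */
    static void replaceGraph(Dataset dataset, String graphUri, Model newData) {
        Node graphNode = NodeFactory.createURI(graphUri);
        // Both steps are committed together when the lambda returns normally,
        // or aborted together if it throws.
        Txn.executeWrite(dataset, () -> {
            dataset.asDatasetGraph().removeGraph(graphNode);
            dataset.getNamedModel(graphUri).add(newData);
        });
    }

    /** Count the triples of a named graph inside a read transaction. */
    static long graphSize(Dataset dataset, String graphUri) {
        return Txn.calculateRead(dataset, () -> dataset.getNamedModel(graphUri).size());
    }

    public static void main(String[] args) {
        // Assumption: in-memory stand-in for the real text-indexed TDB dataset.
        Dataset dataset = DatasetFactory.createTxnMem();
        String graphUri = "http://example.org/graph/taxref";   // hypothetical graph name

        Model data = ModelFactory.createDefaultModel();
        data.createResource("http://taxref.mnhn.fr/lod/taxon/629656/12.0")
            .addProperty(RDFS.label, "Eranthemum pulchellum");

        replaceGraph(dataset, graphUri, data);   // first load
        replaceGraph(dataset, graphUri, data);   // reload: old content removed first
        System.out.println("triples in graph: " + graphSize(dataset, graphUri));
    }
}
```

With the real database, the Dataset passed to replaceGraph would be the text-indexed one instead of the in-memory stand-in, and newData would be the model read from the TTL file.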
A typical block in the data (note the rdfs:label line):
<http://taxref.mnhn.fr/lod/taxon/629656/12.0>
    a owl:Class ;
    rdfs:isDefinedBy <http://taxref.mnhn.fr/lod/taxref-ld/12.0> ;
    rdfs:label "Eranthemum pulchellum" ;
    rdfs:subClassOf <http://taxref.mnhn.fr/lod/taxon/452421/12.0> ;
    schema:mainEntityOfPage <https://inpn.mnhn.fr/espece/cd_nom/629656?lg=en> ;
    taxrefprop:habitat taxrefhab:FreshWater , taxrefhab:Terrestrial ;
    taxrefprop:hasRank taxrefrk:Species ;
    taxrefprop:hasReferenceName <http://taxref.mnhn.fr/lod/name/629656> ;
    taxrefprop:hasSynonym <http://taxref.mnhn.fr/lod/name/633029> ,
        <http://taxref.mnhn.fr/lod/name/637984> ,
        <http://taxref.mnhn.fr/lod/name/634312> ;
    foaf:homepage <https://inpn.mnhn.fr/espece/cd_nom/629656?lg=en> .
Jean-Marc Vanel
<http://semantic-forms.cc:9112/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me>
+33 (0)6 89 16 29 52
Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui
Chroniques jardin
<http://semantic-forms.cc:1952/history?uri=http%3A%2F%2Fdbpedia.org%2Fresource%2FChronicle>