Hi Andy,

Thanks for investigating this. Using tdb2.tdbloader for all files, as you suggested, is fine. GND was a large dataset ten years ago, but apparently doesn't qualify as such any more :)

Cheers,
Joachim

> -----Original Message-----
> From: Andy Seaborne <a...@apache.org>
> Sent: Sunday, 27 February 2022 13:28
> To: users@jena.apache.org
> Subject: Re: WG: Broken GND dataset after loading with tdb2.xloader+tdb2.tdbloader
>
> Hi Joachim,
>
> Yes, there is a bug in xloader. I have managed to create an example test case
> using a small amount of data.
>
> It is the xloader. Running the load-then-load test case with all other loaders
> hasn't shown a problem so it isn't the second data load.
>
> I'm not sure what the cause is yet but I haven't seen query go wrong if all the
> files are loaded once by xloader.
>
> Andy
>
> JENA-2294
>
> On 22/02/2022 06:40, Neubert, Joachim wrote:
> > This mail of yesterday didn't get through - here again.
> >
> > The data of the broken load is temporarily linked from http://134.245.93.72/beta/tmp.
> >
> > I've now invoked
> >
> > /opt/jena/bin/tdb2.tdbloader --loader=parallel
> > --loc=/zbw/var/lib/fuseki/databases/temp
> > ../var/gnd/2021-11/src/GND.utf8.ttl.gz
> > ../var/gnd/2021-11/src/gnd-sc.ttl
> > ../var/gnd/2021-11/src/gnd-sc_notation.ttl
> > ../var/gnd/2021-11/src/gndo.ttl
> >
> > and got a steadily decreasing rate (see below). On the other hand, the total
> > load time is nice. tdbstats ran correctly afterwards, and the query for
> > gndo:DifferentiatedPerson works as expected.
> >
> > Cheers - Joachim