and it's not all about size, Joachim. :) Some of my smaller datasets give me the greatest leverage and in combination with linked data they really start to shine.
Marco On Thu, Mar 3, 2022 at 9:51 AM Neubert, Joachim <[email protected]> wrote: > Hi Andy, > > Thanks for investigating this. Using tdb2.tdbloader for all files, as you > suggested, is fine. GND was a large dataset ten years ago, but apparently > doesn't qualify as such any more :) > > Cheers, Joachim > > > -----Ursprüngliche Nachricht----- > > Von: Andy Seaborne <[email protected]> > > Gesendet: Sonntag, 27. Februar 2022 13:28 > > An: [email protected] > > Betreff: Re: WG: Broken GND dataset after loading with > > tdb2.xloader+tdb2.tdbloader > > > > Hi Joachim, > > > > Yes, there is a bug in xloader. I have managed to create an example test > case > > using a small amount of data. > > > > It is the xloader. Running the load-then-load test case with all other > loaders > > hasn't shown a problem so it isn't the second data load. > > > > I'm not sure what the cause is yet but I haven't seen query go wrong if > all the > > files are loaded once by xloader. > > > > Andy > > > > JENA-2294 > > > > On 22/02/2022 06:40, Neubert, Joachim wrote: > > > This mail of yesterday didn't get through - here again. > > > > > > The data of the broken load is temporarily linked from > > http://134.245.93.72/beta/tmp. > > > > > > I've now invoked > > > > > > /opt/jena/bin/tdb2.tdbloader --loader=parallel > > > --loc=/zbw/var/lib/fuseki/databases/temp > > > ../var/gnd/2021-11/src/GND.utf8.ttl.gz > > > ../var/gnd/2021-11/src/gnd-sc.ttl > > > ../var/gnd/2021-11/src/gnd-sc_notation.ttl > > > ../var/gnd/2021-11/src/gndo.ttl > > > > > > and got a steadily decreasing rate (see below). On the other hand, the > total > > load time is nice. tdbstats ran correctly afterwards, and the query for > > gndo:DifferentiatedPerson works as expected. > > > > > > Cheers - Joachim > -- --- Marco Neumann KONA
