Hi Andy,

Thanks for investigating this. Using tdb2.tdbloader for all files, as you 
suggested, is fine. GND was a large dataset ten years ago, but apparently 
doesn't qualify as such any more :)

Cheers, Joachim

> -----Original Message-----
> From: Andy Seaborne <a...@apache.org>
> Sent: Sunday, 27 February 2022 13:28
> To: users@jena.apache.org
> Subject: Re: WG: Broken GND dataset after loading with
> tdb2.xloader+tdb2.tdbloader
> 
> Hi Joachim,
> 
> Yes, there is a bug in xloader. I have managed to create an example test case
> using a small amount of data.
> 
> It is the xloader. Running the load-then-load test case with all other loaders
> hasn't shown a problem so it isn't the second data load.
> 
> I'm not sure what the cause is yet, but I haven't seen a query go wrong if
> all the files are loaded once by xloader.
> 
>      Andy
> 
> JENA-2294
> 
> On 22/02/2022 06:40, Neubert, Joachim wrote:
> > This mail of yesterday didn't get through - here again.
> >
> > The data of the broken load is temporarily linked from
> > http://134.245.93.72/beta/tmp.
> >
> > I've now invoked
> >
> > /opt/jena/bin/tdb2.tdbloader --loader=parallel
> > --loc=/zbw/var/lib/fuseki/databases/temp
> > ../var/gnd/2021-11/src/GND.utf8.ttl.gz
> > ../var/gnd/2021-11/src/gnd-sc.ttl
> > ../var/gnd/2021-11/src/gnd-sc_notation.ttl
> > ../var/gnd/2021-11/src/gndo.ttl
> >
> > and got a steadily decreasing rate (see below). On the other hand, the
> > total load time is nice. tdbstats ran correctly afterwards, and the query
> > for gndo:DifferentiatedPerson works as expected.
> >
> > Cheers - Joachim
