and it's not all about size, Joachim. :)

Some of my smaller datasets give me the greatest leverage and in
combination with linked data they really start to shine.

Marco

On Thu, Mar 3, 2022 at 9:51 AM Neubert, Joachim <[email protected]> wrote:

> Hi Andy,
>
> Thanks for investigating this. Using tdb2.tdbloader for all files, as you
> suggested, is fine. GND was a large dataset ten years ago, but apparently
> doesn't qualify as such any more :)
>
> Cheers, Joachim
>
> > -----Ursprüngliche Nachricht-----
> > Von: Andy Seaborne <[email protected]>
> > Gesendet: Sonntag, 27. Februar 2022 13:28
> > An: [email protected]
> > Betreff: Re: WG: Broken GND dataset after loading with
> > tdb2.xloader+tdb2.tdbloader
> >
> > Hi Joachim,
> >
> > Yes, there is a bug in xloader. I have managed to create an example test
> case
> > using a small amount of data.
> >
> > It is the xloader. Running the load-then-load test case with all other
> loaders
> > hasn't shown a problem so it isn't the second data load.
> >
> > I'm not sure what the cause is yet but I haven't seen query go wrong if
> all the
> > files are loaded once by xloader.
> >
> >      Andy
> >
> > JENA-2294
> >
> > On 22/02/2022 06:40, Neubert, Joachim wrote:
> > > This mail of yesterday didn't get through - here again.
> > >
> > > The data of the broken load is temporarily linked from
> > http://134.245.93.72/beta/tmp.
> > >
> > > I've now invoked
> > >
> > > /opt/jena/bin/tdb2.tdbloader --loader=parallel
> > > --loc=/zbw/var/lib/fuseki/databases/temp
> > > ../var/gnd/2021-11/src/GND.utf8.ttl.gz
> > > ../var/gnd/2021-11/src/gnd-sc.ttl
> > > ../var/gnd/2021-11/src/gnd-sc_notation.ttl
> > > ../var/gnd/2021-11/src/gndo.ttl
> > >
> > > and got a steadily decreasing rate (see below). On the other hand, the
> total
> > load time is nice. tdbstats ran correctly afterwards, and the query for
> > gndo:DifferentiatedPerson works as expected.
> > >
> > > Cheers - Joachim
>


-- 


---
Marco Neumann
KONA

Reply via email to