Hi Brian - I was using tdbloader for both November and April imports - I have tested it before and for freebase data set it works better than tdbloader2. tdbloader2 had faster data importing phase but much slower the indexing phase hence it makes the total import time longer than tdbloader for my case.
2014-05-14 10:00 GMT+01:00 bwm-epimorphics <[email protected]>: > How did you load the TDB store? Is it possible you used tdbloader2 for > the first load and tdbloader for the second? > > Brian > > > On 13/05/14 14:13, Ewa Szwed wrote: > >> I have the following problem with my Jena TDB instance. >> Last year in November I have loaded freebase dump to Jena TDB and I was >> able to work with it reasonably good and got quite good performance for >> most of my queries. >> Recently I have updated my Jena TDB store with a dump from April. >> Here are some numbers to show the difference between these 2 instances. >> >> >> >> *November 2013* >> >> *April 2014* >> >> >> Full time of import >> >> 262,052 sec /3,03 days >> >> 716,121 sec / 8,29 days >> >> Number of triples >> >> 1,826,551,456 >> >> 2,489,221,915 >> >> Index size (whole dir) >> >> 174 GB >> >> 333 GB >> >> >> My problem is that my new instance in not performing at all. >> The queries that previously run for a couple of minutes take a couple of >> hours now and it is not acceptable for my business. :( >> So I would like to ask if there is a practical index limit size for Jena >> TDB. Is there anything I can do to improve the performance of it. >> Is this significant drop in performance sth expected or maybe I have sth >> fundamentally wrong in my set up - which I would need to track and fix. >> Please advise. >> Regards, >> Ewa Szwed >> >> > -- > Epimorphics Ltd (http://www.epimorphics.com) > > Epimorphics Ltd. is a limited company registered in England (number > 7016688) > Registered address: Court Lodge, 105 High Street, Portishead, Bristol BS20 > 6PT, UK > >
