Re: TDB2 and bulk loading

Andy Seaborne Wed, 21 Mar 2018 03:23:34 -0700

Bulkloading (TDB1) is for working from an empty dataset. The tricks ituses do not work when there is already data in dataset. For TDB1, Oneof the bulkloaders simply loads triples/qwuads, the other refuses to load.

For TDB2, which has no limits on the size of transactions, a batch sizeof 20K, or even 200M, should work. The larger the batch size, the morethe transaction overheads are amortized.


    Andy

On 19/03/18 15:50, Davide wrote:

I've about 20000 triples to load each time. I load data into models with
Jena API, and write data inside a StreamWriter. When the buffer has a
certain size, I load data in the dataset with the Bulkloader. But now I'm
trying to use TDB2 with Loader.Bulkload method to see if there are
improvements, but I've a problem. I retrieve the dataset with
"TDB2Factory.connectDataset(location), and pass it in the Bulkload
function. But I've a ClassCastException in runtime:
"org.apache.jena.tdb2.store.DatasetGraphSwitchable cannot be cast to
org.apache.jena.tdb2.store.DatasetGraphTDB". How can I resolve this?

Re: TDB2 and bulk loading

Reply via email to