Re: About fuseki2 load performance by java API

Andy Seaborne Thu, 18 Jul 2019 03:42:18 -0700

That's quite slow. I get maybe 50-70K triples for a 100m load via the Fuseki UI.
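
For a rough sense of the gap, the arithmetic can be sketched in Java. The triple count and elapsed time are taken from the Fuseki log quoted below. Reading the 50-70K figure as triples per second is an assumption, and the class and method names are invented for illustration only.

```java
import java.time.Duration;

// Back-of-the-envelope throughput comparison; not part of Jena.
final class LoadThroughput {

    // Average load rate in triples per second.
    static double triplesPerSecond(long triples, double seconds) {
        if (seconds <= 0) {
            throw new IllegalArgumentException("seconds must be positive");
        }
        return triples / seconds;
    }

    // Estimated wall-clock time to load `triples` at a steady rate.
    static Duration timeToLoad(long triples, double triplesPerSecond) {
        if (triplesPerSecond <= 0) {
            throw new IllegalArgumentException("rate must be positive");
        }
        return Duration.ofSeconds(Math.round(triples / triplesPerSecond));
    }

    public static void main(String[] args) {
        // Figures from the quoted log: 3,257,309 triples in 3,302.546 s.
        double observed = triplesPerSecond(3_257_309L, 3_302.546);
        long total = 300_000_000L; // size of the queue mentioned in the question

        System.out.printf("observed rate: %.0f triples/s%n", observed);
        System.out.println("300M at observed rate: " + timeToLoad(total, observed));
        // Assumption: 50K triples/s, the low end of the figure above.
        System.out.println("300M at 50K triples/s: " + timeToLoad(total, 50_000));
    }
}
```

Under those assumptions the observed rate is a little under 1K triples/s, so the whole queue would take days at that pace, against a couple of hours at 50K triples/s.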

The fastest way is to use the bulk loader directly to set up the database, then add it to Fuseki.

The hardware of the server makes a big difference. What's the server setup? Disk/SSD? Local or remote storage?


    Andy

You don't need the begin/commit in the client - the transaction is in the backend server.


On 18/07/2019 09:02, Scarlet Remilia wrote:

Hello everyone,
I want to load a hundred million triples into a TDB2-backed Fuseki2 via the Java API.
I used the code below:

Model model = ModelFactory.createDefaultModel();
model.add(model.asStatement(triple));
RDFConnectionRemoteBuilder builder = RDFConnectionFuseki.create()
        .destination(FusekiURL);
RDFConnection conn = builder.build();
conn.begin(ReadWrite.WRITE);
try {
    conn.load(model);
    conn.commit();
} finally {
    conn.end();
}

The code actually works, but the performance is not good enough.

[2019-07-18 23:29:25] Fuseki     INFO  [46] POST http://192.168.204.244:2222/fuseki2?default
[2019-07-18 23:30:45] Fuseki     INFO  [15] Body: Content-Length=-1, Content-Type=application/rdf+thrift, Charset=null => RDF-THRIFT : Count=3257309 Triples=3257309 Quads=0
[2019-07-18 23:31:12] Fuseki     INFO  [15] 200 OK (3,302.546 s)

Every 3 million triples take 3,302.546 seconds, and there are 300 million triples in the queue in total. (A single in-memory Model cannot hold that many triples.)

Is there a faster way to load them?

Thanks!

