On 25/12/17 06:41, Laura Morales wrote:
 From what I can tell, and from my little experience, you should not see such 
long waiting/idling times. But I've never used Windows (and I'm confident 
you'll get a better environment if you just switched to gnu/linux).
Anyway, you could try to merge all your files into a single .nt (using RIOT) 
and load this file only.

Yes - a single call of tdbloader on an empty database can load more efficienlty than if there are already triples.

"the pause time" suggest the database already has contents (it prints progress otherwise)..

It only has to be a single call, it can be multiple files (subject to shell limits).

   tdbloader --loc DB file1 file2 ...

tdbloader will in effect do the file merge as it loads.

What is more, it gets bNode labels across files right (so using _:a in two files is two bNodes).

    Andy

Sent: Monday, December 25, 2017 at 5:51 AM
From: "Shengyu Li" <[email protected]>
To: [email protected]
Subject: Is There Any Way to Shorten The Waiting Time After Upload Triples in 
Jena?

Hello,
I am uploading my .ttl data to my database, there are totally about 10,000 files and each file is about 4M. My new data is totally about 40GB. My origional db is also about 40GB. The server is in my local computer. I use tdbloader.bat --loc to upload data. After the Finish quads load, it will pause at this status for a long time (about half an hr for one file (4M), but if for 200 files one time(200*4M), the pause time will be 2 hrs). After the pause, the work will go back to the cmd. I guess the pause means the db is doing the organization about the data I uploaded just now, so won't return for a long time, am I right? Is there any way to shorten the waiting time? Thank you very much! Jena is really a useful thing! Best,
Shengyu

Reply via email to