Half an hour for a 4MB Turtle file?! Even without knowing the number of
triples per file, I'd say, something is wrong here.

Are you calling

tdbloader --loc

for each file separately?


As Laura already said:

If you'd use Linux, I'd suggest to use command line tools to convert the
files to N-Triples (e.g. using rapper), then concat (cat), then load the
single file. In your case, maybe create a few files instead of a single
file, or even compression might also be suitable. But ok, you're using
Windows ...

What I'm missing from your email is the description of your system and
TDB setup. "Local computer" can be anything. File sizes are indeed
interesting, number of triples might be also relevant.


Cheers,

Lorenz


On 25.12.2017 05:51, Shengyu Li wrote:
> Hello,
>
> I am uploading my .ttl data to my database, there are totally about
> 10,000 files and each file is about 4M. My new data is totally about
> 40GB. My origional db is also about 40GB. The server is in my local
> computer.
>
> I use tdbloader.bat --loc to upload data. After the Finish quads load,
> it will pause at this status for a long time (about half an hr for one
> file (4M), but if for 200 files one time(200*4M), the pause time will
> be 2 hrs). After the pause, the work will go back to the cmd.
> Inline image 1
>
> I guess the pause means the db is doing the organization about the
> data I uploaded just now, so won't return for a long time, am I right?
> Is there any way to shorten the waiting time?
>
> Thank you very much! Jena is really a useful thing! 
>
> Best,
> Shengyu

Reply via email to