My Setup:
I’m running a few fuseki servers via Docker containers. I need the storage to 
be persistent across container restarts, so I’m using TDB2 for my storage. The 
TDB2 database are stored on a volume that is mounted to the Docker containers. 
This volume is part of our S3 instance. The Fuseki servers’ individual DBs are 
kept in sync using RDF-Delta. The dataset in question is using full text search 
using jena-text (lucene) with two properties being indexed (though, they occur 
often in the dataset). The reasoner being used is `TransitiveReasoner`. I have 
only one default graph and no other graphs.

My Problem:
To upload ~10 MB of data (in a ttl file format), it is taking sometimes more 
than 3 hours to complete! We tried turning off full text search and it cut the 
time in ~half. But still 1.5 hours for only 10MB of triple data is waaaay too 
long. Does anyone have any ideas of how we could fix this issue (except the 
obvious to not use a network connected disk)?

Thanks.

No PHI in Email: PointClickCare and Collective Medical, A PointClickCare 
Company, policies prohibit sending protected health information (PHI) by email, 
which may violate regulatory requirements. If sending PHI is necessary, please 
contact the sender for secure delivery instructions.

Confidentiality Notice: This email message, including any attachments, is for 
the sole use of the intended recipient(s) and may contain confidential and 
privileged information. Any unauthorized review, use, disclosure or 
distribution is prohibited. If you are not the intended recipient, please 
contact the sender by reply email and destroy all copies of the original 
message.

Reply via email to