My Setup: I’m running a few fuseki servers via Docker containers. I need the storage to be persistent across container restarts, so I’m using TDB2 for my storage. The TDB2 database are stored on a volume that is mounted to the Docker containers. This volume is part of our S3 instance. The Fuseki servers’ individual DBs are kept in sync using RDF-Delta. The dataset in question is using full text search using jena-text (lucene) with two properties being indexed (though, they occur often in the dataset). The reasoner being used is `TransitiveReasoner`. I have only one default graph and no other graphs.
My Problem: To upload ~10 MB of data (in a ttl file format), it is taking sometimes more than 3 hours to complete! We tried turning off full text search and it cut the time in ~half. But still 1.5 hours for only 10MB of triple data is waaaay too long. Does anyone have any ideas of how we could fix this issue (except the obvious to not use a network connected disk)? Thanks. No PHI in Email: PointClickCare and Collective Medical, A PointClickCare Company, policies prohibit sending protected health information (PHI) by email, which may violate regulatory requirements. If sending PHI is necessary, please contact the sender for secure delivery instructions. Confidentiality Notice: This email message, including any attachments, is for the sole use of the intended recipient(s) and may contain confidential and privileged information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message.
