MathiasVDA opened a new issue, #3736: URL: https://github.com/apache/jena/issues/3736
### Version 5.6 ### Question Hello, My use case involves automated processes that prepare a new dataset offline using tdb2.tdbloader and tdb2.tdbupdate. I need to hot swap an existing dataset with the newly prepared files. ### Options that I tried that didn't work - Replacing the target dataset datafiles online obviously creates lock errors - Clearing and reloading the data over HTTP takes too much time (clearing takes more than 30 minutes in our case with 8.6M triples). - Clearing and reloading the data over CLI creates lock issues. - Editing the target dataset configuration to point to new dataset files only works when restarting the server. But I cannot automatically restart the Fuseki server after making the change. ### Working solution but maybe not long supported? What does work is to: 1. delete the target dataset using HTTP DELETE /$/datasets/target_name 2. use a POST on /$/datasets with sending the new assembler configuration file. This creates a minimal downtime of a few seconds. That only works after activating `allowConfigFileProperty `as documented here: https://github.com/apache/jena/issues/3346 But I'm a bit worried about this comment: > Be warned: the plan is still to disallowing arbitrary configuration upload with the same access levels as creating a dataset in a controlled manner. But the work on that hasn't started. That might indicate that this process will no longer be supported in the future? ### Alternative: templates I know that there is a template system and that there are templates that support setting the dataset file location, but when I check the source code, it seems that these templates are not used? How can I specify the target tdb2 location using the template system? ### Documentation I am not sure about the state of the documentation here: https://jena.apache.org/documentation/fuseki2/fuseki-server-protocol.html Especially that there is still the mention of POST an assembler config without specifying the required parameter. Or how to use the templates. I would like to contribute but I don't know what the actual situation is nor how to do a PR on the documentation. Feel free to let me know. ### Our environment The processes that prepare the dataset and use the http administration of Fuseki are running in Airflow. Both Airflow and Jena-Fuseki are deployed in a kubernetes cluster. Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
