MathiasVDA opened a new issue, #3736:
URL: https://github.com/apache/jena/issues/3736

   ### Version
   
   5.6
   
   ### Question
   
   Hello,
   
   My use case involves automated processes that prepare a new dataset offline 
using tdb2.tdbloader and tdb2.tdbupdate. I need to hot swap an existing dataset 
with the newly prepared files. 
   
   ### Options that I tried that didn't work
   
   - Replacing the target dataset's data files while the server is running 
creates lock errors, as expected.
   - Clearing and reloading the data over HTTP takes too much time (clearing 
takes more than 30 minutes in our case with 8.6M triples). 
   - Clearing and reloading the data via the CLI creates lock issues.
   - Editing the target dataset configuration to point to the new dataset files 
only takes effect after restarting the server, and I cannot automatically 
restart the Fuseki server after making the change.
   
   ### Working solution, but maybe not supported long-term?
   
   What does work is to:
   1. delete the target dataset using HTTP DELETE /$/datasets/target_name 
   2. POST the new assembler configuration file to /$/datasets 
   
   This creates a minimal downtime of a few seconds.
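
The two steps above can be sketched roughly as follows. This is a minimal illustration and not a tested deployment script. The helper names (`build_delete_request`, `build_create_request`, `hot_swap`) are hypothetical. The `text/turtle` content type for the assembler body is an assumption. Authentication for the admin endpoints and the `allowConfigFileProperty` setting are not covered here.

```python
import urllib.parse
import urllib.request


def build_delete_request(base_url: str, name: str) -> urllib.request.Request:
    """Build the DELETE /$/datasets/{name} request (step 1)."""
    quoted = urllib.parse.quote(name, safe="")
    url = f"{base_url.rstrip('/')}/$/datasets/{quoted}"
    return urllib.request.Request(url, method="DELETE")


def build_create_request(base_url: str, assembler_ttl: str) -> urllib.request.Request:
    """Build the POST /$/datasets request carrying the assembler file (step 2)."""
    url = f"{base_url.rstrip('/')}/$/datasets"
    return urllib.request.Request(
        url,
        data=assembler_ttl.encode("utf-8"),
        # Assumption: the assembler configuration is sent as Turtle.
        headers={"Content-Type": "text/turtle; charset=utf-8"},
        method="POST",
    )


def hot_swap(base_url, name, assembler_ttl, opener=urllib.request.urlopen):
    """Delete the dataset, then recreate it from the new assembler config.

    With the default opener, urlopen raises urllib.error.HTTPError on a
    4xx/5xx response, so a failed DELETE stops the swap before the POST.
    """
    for request in (
        build_delete_request(base_url, name),
        build_create_request(base_url, assembler_ttl),
    ):
        with opener(request):
            pass  # the response body is not needed for this sketch
```

An Airflow task could call `hot_swap(fuseki_url, "target_name", config_text)` once the offline load has finished, where `fuseki_url` and `config_text` are placeholders for the server address and the contents of the new assembler file. The dataset is unavailable between the two requests, which matches the few seconds of downtime described above.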
   
   That only works after activating `allowConfigFileProperty`, as documented 
here:
   https://github.com/apache/jena/issues/3346
   
   But I'm a bit worried about this comment:
   > Be warned: the plan is still to disallowing arbitrary configuration upload 
with the same access levels as creating a dataset in a controlled manner. But 
the work on that hasn't started.
   
   That might indicate that this process will no longer be supported in the 
future?
   
   ### Alternative: templates
   
   I know that there is a template system and that there are templates that 
support setting the dataset file location, but when I check the source code, it 
seems that these templates are not used? How can I specify the target tdb2 
location using the template system?
   
   ### Documentation
   
   I am not sure about the state of the documentation here: 
https://jena.apache.org/documentation/fuseki2/fuseki-server-protocol.html
   
   In particular, it still describes POSTing an assembler config without 
mentioning the required parameter, and it does not explain how to use the 
templates. I would like to contribute, but I don't know what the actual 
situation is, nor how to open a PR against the documentation. Feel free to let 
me know. 
   
   ### Our environment
   
   The processes that prepare the dataset and use the HTTP administration API 
of Fuseki run in Airflow. Both Airflow and Jena Fuseki are deployed in a 
Kubernetes cluster.
   
   Thanks!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

