Hi,

I'm attempting to run a backup of a fuseki database. I've done this successfully many times on other systems, which I have configured identically (AFAIK; it is all done by hand, so an inconsistency is possible).

Based on the logs, fuseki appears to run through all the motions, but the file that's written is a 20-byte .gz file (i.e., an empty file that got gzip'd). Logs and info below.

I was running 3.6.0 standalone. I tried upgrading to 3.7.0 standalone, which didn't fix it.

Am i doing something wrong, or is this a bug?



In the spirit of full disclosure...I think it's probably fine, but It's possible that there is some corruption in the database.

In general queries work fine. We have one triple with a null value that we can't figure out how to delete with sparql. Our two experts are unavailable today and the rest of us are sparql-incompetent. I don't need help with that, we will figure it out when one of them returns and/or by reading some more docs.

This led me to the ham-fisted idea of backing up, deleting the offending line, and restoring. I agree with any suggestion that it's a stupid solution to this problem - but I still want to be able to back up this database, it's inconvenient to recreate it.




database has about 20 million triples.

Java:

log/archonnex/fuseki@acxcore-awstest0% java -version
openjdk version "1.8.0_171"
OpenJDK Runtime Environment (build 1.8.0_171-b10)
OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)

fuseki command line:

fuseki 16548 1.1 0.6 12341352 209636 ? Sl 13:26 00:00:04 /etc/alternatives/java_sdk_1.8.0/bin/java -Xmx8G -Dlog4j.configuration=file:/etc/archonnex/fuseki/log4j.properties -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintGCTimeStamps -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=10 -XX:GCLogFileSize=100m -XX:+PrintTenuringDistribution -Xloggc:/var/log/archonnex/fuseki/gc-%t.log -jar /usr/local/apache-jena-fuseki-3.7.0/fuseki-server.jar --config=/etc/archonnex/fuseki/fcrepo.ttl

ttl config:

# cat fcrepo.ttl
@prefix :      <http://base/#> .
@prefix tdb: <http://jena.hpl.hp.com/2008/tdb#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix ja: <http://jena.hpl.hp.com/2005/11/Assembler#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix fuseki: <http://jena.apache.org/fuseki#> .

:service_tdb_all  a fuseki:Service ;
        rdfs:label "TDB fcrepo" ;
        fuseki:dataset :tdb_dataset_readwrite ;
        fuseki:name "fcrepo" ;
        fuseki:serviceQuery "query" , "sparql" ;
        fuseki:serviceReadGraphStore "get" ;
fuseki:serviceReadWriteGraphStore
                "data" ;
        fuseki:serviceUpdate "update" ;
        fuseki:serviceUpload "upload" .

:tdb_dataset_readwrite
        a             tdb:DatasetTDB ;
        tdb:location "/var/archonnex/fuseki/data/fcrepo" .


log file:

[2018-05-15 14:18:35] Admin INFO [1] POST http://127.0.0.1:3030/$/backup/fcrepo
[2018-05-15 14:18:35] Admin      INFO [1] Backup dataset /fcrepo
[2018-05-15 14:18:35] Server     INFO Task : 1 : backup
[2018-05-15 14:18:35] Server     INFO [Task 1] starts : backup
[2018-05-15 14:18:35] Backup INFO [1] >>>> Start backup /fcrepo -> /var/archonnex/fuseki/backups/fcrepo_2018-05-15_14-18-35
[2018-05-15 14:18:35] TDB        DEBUG Txn[2]/R: begin$
[2018-05-15 14:18:35] info       DEBUG Triple table: SPO :: SPO,POS,OSP
[2018-05-15 14:18:35] info DEBUG Quad table: GSPO :: GSPO,GPOS,GOSP,POSG,OSPG,SPOG
[2018-05-15 14:18:35] info       DEBUG Prefixes: GPU :: GPU
[2018-05-15 14:18:35] LockMRSW   DEBUG Lock : pool-2-thread-1
[2018-05-15 14:18:35] TDB        DEBUG Txn[2]/R: start
[2018-05-15 14:18:35] LockMRSW DEBUG pool-2-thread-1 >> enterCS: Thread R/W: 0/0 :: Model R/W: 0/0 (thread: pool-2-thread-1) [2018-05-15 14:18:35] LockMRSW DEBUG pool-2-thread-1 << enterCS: Thread R/W: 1/0 :: Model R/W: 1/0 (thread: pool-2-thread-1)
[2018-05-15 14:18:35] Admin      INFO [1] 200 OK (8 ms)
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] BlockAccessMapped DEBUG Segment: 0
[2018-05-15 14:18:35] TDB        DEBUG Txn[2]/R: finish
[2018-05-15 14:18:35] LockMRSW DEBUG pool-2-thread-1 >> leaveCS: Thread R/W: 1/0 :: Model R/W: 1/0 (thread: pool-2-thread-1) [2018-05-15 14:18:35] LockMRSW DEBUG pool-2-thread-1 << leaveCS: Thread R/W: 0/0 :: Model R/W: 0/0 (thread: pool-2-thread-1) [2018-05-15 14:18:35] Backup INFO [1] <<<< Finish backup /fcrepo -> /var/archonnex/fuseki/backups/fcrepo_2018-05-15_14-18-35
[2018-05-15 14:18:35] Server     INFO [Task 1] finishes : backup

log/archonnex/fuseki@acxcore-awstest0% ls -l /var/archonnex/fuseki/backups/fcrepo_2018-05-15_14-18-35* -rw-rw-r--. 1 fuseki fuseki 20 May 15 14:18 /var/archonnex/fuseki/backups/fcrepo_2018-05-15_14-18-35.nq.gz



--
Dan Pritts
ICPSR Computing & Network Services
University of Michigan
<https://www.postbox-inc.com>

Reply via email to