JENA-1499 may have knock on effects.

Adding a quad and deleting a quad and still listing the graph name would be OK but "contains graph" returns false in TIM and true in general. That might make a difference - I haven't traced that part of the code.

As for the :3030, ":" is one of the nasty characters in URIs but the Fuseki log looks like it is %-ed correctly.

https://gist.github.com/afs/c5623911293dd19da7f6cc1b2cf64c17

is an updated script to use curl and not Jena tools.

There is variable to set the graph name.

Does this script illustrate the effect?

    Andy


On 07/03/18 14:14, Dave Reynolds wrote:
On 07/03/18 12:59, Andy Seaborne wrote:


On 06/03/18 22:43, Dave Reynolds wrote:
Hi Andy,

Thanks for confirming you seeing something similar, glad I wasn't hallucinating!

Not sure any more that I am :-|

There are ghost graphs in a TIM dataset after deletion, JENA-1499, but I'm not seeing an empty store and do see the added triples.

OK so this just got weirder. I just tried the script from your gist and that works for me. Whereas my own was still failing. The only difference is that in your initial s-put you have a slightly different graph name (includes the port).

So for me on 3.4.0 and 3.6.0 if I run a trimmed version of your test [1], with the same update.ttl [2] and U1.ru [3] scripts. I see correct results:

== Step 1a
----
== Step 1b
<http://localhost:3030/graph5> {
     <http://localhost/test/r4>
             a       <http://localhost/test/Resource> ;
             <http://www.w3.org/2000/01/rdf-schema#label>
                     "r 4" .
}
----
== Step 2a
----
<http://localhost/test/r4> {
     <http://localhost/test/r4>
             a       <http://localhost/test/Resource> ;
             <http://www.w3.org/2000/01/rdf-schema#label>
                     "r 4 modified" .
}

<http://localhost:3030/graph5> {
}
----
== Step 3a
-------------------------------------------------------------------------------------------------------------------------------------------------- | s                          | p         | o                                | G                          | ================================================================================================================================================== | <http://localhost/test/r4> | <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> | <http://localhost/test/Resource> | <http://localhost/test/r4> | | <http://localhost/test/r4> | <http://www.w3.org/2000/01/rdf-schema#label>      | "r 4 modified"             | <http://localhost/test/r4> | --------------------------------------------------------------------------------------------------------------------------------------------------
----

If I run the same script but without the ":3030" in the graph name [4] I see incorrect results from the SPARQL query but correct TRIG:

== Step 1a
----
== Step 1b
<http://localhost/graph5> {
     <http://localhost/test/r4>
             a       <http://localhost/test/Resource> ;
             <http://www.w3.org/2000/01/rdf-schema#label>
                     "r 4" .
}
----
== Step 2a
----
<http://localhost/graph5> {
}

<http://localhost/test/r4> {
     <http://localhost/test/r4>
             a       <http://localhost/test/Resource> ;
             <http://www.w3.org/2000/01/rdf-schema#label>
                     "r 4 modified" .
}
----
== Step 3a
-----------------
| s | p | o | G |
=================
-----------------
----

The only significant difference I can see is that the change in graph names means the graphs list in a different order in the TRIG, which suggests they hash in a different order in the store. Why that should matter though ... ?

This at least explains why it was so hard to isolate the problem and that it seemed non-deterministic. Any attempt to simplify the names made the problem go away.

Dave

[1] https://gist.github.com/der/d47602ffbd294b2ed572cbccb02c7778
[2] https://gist.github.com/der/26a2147d19ad495a925374e56428f7e3
[3] https://gist.github.com/der/704363855a7cca35b3f123dff136c56d
[4] https://gist.github.com/der/7e1e711c6341df417e9ff18416bd1a5c



     Andy


I've tried with '' instead of "" in the shell script version of the test with identical results.

Confirmed that using --memTDB the test passes for me.

Dave

On 06/03/18 17:29, Andy Seaborne wrote:
Weird.

I ran this script with Fuseki v3.6.0 "--mem" and also "--memTDB" for steps up to and including 6.

https://gist.github.com/afs/cd6953b06985dde37a9581134ec13165

There something going on with TIM because I'm seeing empty graph5 with TIM but not with TDB.

I may have seen no results once and then I changed:

 > 6. Check the contents of the store:
 >
 >      rsparql --service http://localhost:3030/ds/query "SELECT * WHERE
 > {Graph ?G {?s ?p ?o}} ORDER BY ?G"

There are "" quotes around a "*"  and it's a script.  Could you try ''-quotes please?

I'll try step 7.

     Andy

On 05/03/18 23:56, Dave Reynolds wrote:
I've been trying to debug some weird behaviour in my test cases and think they are due to a bug in memory-backed fuseki stores.

However, the behaviour is so odd and hard to reproduce I'd like some confirmation that someone else sees the same effect before opening a JIRA.

# Steps to reproduce

[Sorry this is convoluted but all my attempts to simplify fail to show the suspected bug.]

1. Fresh download of fuseki 3.6.0.

2. Start an in memory server in one shell:

     fuseki-server --mem --update /ds

3. Create test data with two statements about one resource:

     echo 'prefix rdfs:  <http://www.w3.org/2000/01/rdf-schema#>
prefix eg:    <http://localhost/test/>
eg:r4 a eg:Resource; rdfs:label "r 4" .' > update.ttl

4. Use the graph REST API to put the data into a named graph:

     s-put http://localhost:3030/ds/data http://localhost/graph5 update.ttl

5. Run a sparql update which will delete the original statements from the graph and add some replacement statements to a new graph:

     rupdate --service=http://localhost:3030/ds/update '
     DELETE {GRAPH ?G {<http://localhost/test/r4> ?p ?o}} WHERE {GRAPH ?G {<http://localhost/test/r4> ?p ?o}};
     INSERT DATA { GRAPH <http://localhost/test/r4> {
         <http://localhost/test/r4> <http://www.w3.org/2000/01/rdf-schema#label> "r 4 modified" .          <http://localhost/test/r4> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://localhost/test/Resource> .
     } }'

6. Check the contents of the store:

     rsparql --service http://localhost:3030/ds/query "SELECT * WHERE {Graph ?G {?s ?p ?o}} ORDER BY ?G"

At this point the store *should* contain two statements in graph http://localhost/test/r4. With a TDB-backed fuseki that's what I see. With the memory backed fuseki I see an apparently empty store.

If other named graphs are populated with other unrelated data they will seem to have disappeared as well.

7. Now reinsert the original data, running step 4 again and check by running step 6 again. At this point both the "missing" statements from graph http://localhost/test/r4 reappear, as does the reinserted original statements in http://localhost/graph5.

# Simplifying the test case

So far ...

   - I've failed to reproduce this outside of fuseki.

   - I've failed to reproduce this without mixing graph operations and update operations.

   - If I reduce the inserted/updated data to a single statement instead of a pair of statements it passes.

   - If I try with at empty TDB store it passes.

   - I get the same behaviour from 3.4.0 as from 3.6.0.

Any of these failures may be user error on my part, it has been so hard turning an apparently non-deterministic error into something reproducible that I'm no longer sure of anything :(

Am I going mad or does anyone else see the same behaviour?

Dave



Reply via email to