Re: Txn code not handling type of transaction

Andy Seaborne Fri, 29 Dec 2017 09:30:18 -0800

I can't see that variable "dataset" is a TDB dataset, not a bunch ofgraphs pulled together at the start of the request. If it's a compositewrapper containing TDB graphs, and if it is different on request, thatwould be consistent with what has been presented. Not the onlypossibility though.


        Andy


On 29/12/17 14:28, ajs6f wrote:

Just to add a bit of context to Andy's point:

It needs a dataset so it makes one and puts the model in as the default graph.


This is (more or less) because SPARQL is defined as acting against datasets, 
not individual graphs.

https://www.w3.org/TR/sparql11-query/#rdfDataset:

"A SPARQL query is executed against an RDF Dataset which represents a collection of 
graphs."

And if (as Andy pointed out is possible) the individual graph in question is a 
view over several datasets, that doesn't work either. There has to be a single 
dataset that is the basic context for SPARQL. (Even while using 
federation/SERVICE, there is still a single dataset against which the query is 
being executed.)

ajs6f

On Dec 29, 2017, at 9:21 AM, Andy Seaborne <[email protected]> wrote:



On 29/12/17 13:58, George News wrote:

On 2017-12-29 14:46, Andy Seaborne wrote:



On 29/12/17 09:51, George News wrote:

On 2017-12-28 18:02, dandh988 wrote:

Can you give a complete example? How are you calling MultiUnion? Are
you calling txn read on the dataset then building the union, then
calling txn write to update the graph?


Let's describe a use case.


The stack trace does not agree with the details here (and details
matter!) especially what dataset is because if that is a general dataset
built out of TDB graphs, this whole thing will not work.  A complete,
minimal example is needed.

A few general observations:


1) I have a dataset that includes 4 named graphs: G1, G2, G3, G4
2.a) Someone initiates a sparql request and the internal procedure
followed is:
    return Txn.calculateRead(dataset, () -> {
      // Create multiunion of 3 namegraphs
      MultiUnion union = new MultiUnion();
      union.addGraph(dataset.getNamedModel("G1").getGraph());
      union.addGraph(dataset.getNamedModel("G2").getGraph());
      union.addGraph(dataset.getNamedModel("G4").getGraph());


This can be done in SPARQL:

either UNION or

SELECT
FROM <G1>
FROM <G2>
FROM <G4>
WHERE { ... }

(copy the query object and modify it).

and TDB will handle it better.


      Model m = ModelFactory.createModelForGraph(union);


You are not querying the TDB dataset if you pass a model to
QueryExecutionFactory.create.  It has to create a dataset (a general
purpose one) for the query.

But if the model is created from graphs/models on a dataset, isn't it
the same? At the end the models are a reference to ones in the dataset.


Not the same and the model is not a reference into a dataset.

QueryExecutionFactory.create is given a model, here a composite one, could be a 
model without a datasets, could be a model over many datasets.  It can't know 
what the structure is, model is an interface.

It needs a dataset so it makes one and puts the model in as the default graph.

      // Launch Sparql query on it
      try (QueryExecution qExec = QueryExecutionFactory.create(query,
m)) {
        return ResultSetFactory.copyResults(qExec.execSelect())
      }
    });

2.b) Someone initiates a sparql request and the internal procedure
followed is:
    return Txn.calculateRead(dataset, () -> {
      // Create multiunion of 2 namegraphs
      // This code in in a function
      MultiUnion union = new MultiUnion();
      union.addGraph(dataset.getNamedModel("G1").getGraph());
      union.addGraph(dataset.getNamedModel("G2").getGraph());
      Model m1 = ModelFactory.createModelForGraph(union);

      // Retrieve namegraph G3
      // This code in in a function
      Model m2 = dataset.getNamedModel("G3");

      Model m = ModelFactory.createUnion(m2, m1);

      // Launch Sparql query on it
      try (QueryExecution qExec = QueryExecutionFactory.create(query,
m)) {
        return ResultSetFactory.copyResults(qExec.execSelect())
      }
    });


3) Someone initiaties a write in G2
    // m stores the new entity model
    Model m;
    Txn.executeWrite(dataset, () -> {
      dataset.getNamedModel("G2").add(m);
    });



If either version of 2), and 3) are not done in parallel there is no
problem and everything is executed correctly.

The problem arise when 2.a) or 2.b) is run, and before ending someone
tries to perform 3). Then I get the FileException("In the middle of an
alloc-write").

Do you have an idea on how to avoid this? How can I handle
transactions in this model?


I was thinking on creating a global mutex so if any action is being
performed over the dataset, then the rest would be blocked. The
problem here is that the code is part of a webservice and then if the
read/write operation lasts long, I will get a timeout that will close
the connection.

The other option is to disable writing while someone is reading. The
main problem here is how to properly reschedule writings not to have a
big queue.

Any help is more than welcome. I don't know what else to do to solve
this is issue, and the problem is making the service unusable :(

Thanks a lot for the great help you are all offering.
Jorge

Re: Txn code not handling type of transaction

Reply via email to