Re: InsertPic_(12-07(12-07-21-26-31)

2018-12-16 Thread Vincent Ventresque
Are you sure that named graphs have better performance? I'm not a specialist, and I'd like to know other users' opinion about that question. I thinks it depends both on the structure of your data and the queries you run. My use case consisted in using a dataset with +/- 168 M triples,

Re: sparql 1.4 billion triples

2018-12-16 Thread Dick Murray
Be very careful using vmtouch especially if you call -dl as you could very easily and quickly kill a system. I've used this tool on cloud VM's to mitigate cycle times, think DBAN due to public nature of hardware. It's a fast way to an irked OS thrashing around. Dick On Sun, 16 Dec 2018 19:57

Re: sparql 1.4 billion triples

2018-12-16 Thread Siddhesh Rane
I'll be happy to document this. I think FAQ would be a good place. I actually looked further into this and found that the vmtouch functionality is provided in the jdk itself. java.nio.MappedByteBuffer#load method will bring file pages in memory [1]. The way it works is similar to vmtouch, i.e.

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Jean-Marc Vanel
Sorry , I sent a bad link on the list, my assembler file is really this: https://github.com/jmvanel/semantic_forms/blob/master/scala/jena.spatial%2Btext.assembler.ttl Le dim. 16 déc. 2018 à 13:40, Marco Neumann a écrit : > you are missing the text index in the assembler. > > On Sun, Dec 16,

Re: sparql 1.4 billion triples

2018-12-16 Thread ajs6f
This seems to be a Linux-only technique that relies on installing and maintaining vmtouch, correct? It doesn't seem that we could support that as a general solution, but would you be interested in writing something that gives the essentials up for someplace in the Jena docs? I'll admit I'm not

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Marco Neumann
you are missing the text index in the assembler. On Sun, Dec 16, 2018 at 12:08 PM Jean-Marc Vanel wrote: > Yes indeed, > exactly with this assembler file: > > https://github.com/jmvanel/semantic_forms/blob/master/scala/jena.spatial.assembler.ttl > > And, when the Jena based application is

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Jean-Marc Vanel
Yes indeed, exactly with this assembler file: https://github.com/jmvanel/semantic_forms/blob/master/scala/jena.spatial.assembler.ttl And, when the Jena based application is started with this same assembler file, the spatial queries work , as said above in this thread. Le dim. 16 déc. 2018 à

Re: sparql 1.4 billion triples

2018-12-16 Thread Jean-Marc Vanel
yphu, you didn't share your query. Maybe the query have questionable features. Did you try a simple but useful query, like getting the first 10 foaf:Person ? Le ven. 7 déc. 2018 à 15:53, y...@zju.edu.cn a écrit : > Dear jena, > I have built a graph with 1.4 billion triples and store it as a

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Marco Neumann
did you create the text index with jena.textindexer --desc=//config.ttl? On Sun, Dec 16, 2018 at 8:48 AM Jean-Marc Vanel wrote: > I fixed a stupid error in text index URI, resulting from pasting, in new > file jena.spatial+text.assembler.ttl: > >

Re: sparql 1.4 billion triples

2018-12-16 Thread Siddhesh Rane
In-memory database has following limitations : 1) Time to create the database. Not a problem if you have a dedicated machine which runs 24/7 where you load data once and the process never exits. But a huge waste of time if you get hardware during certain time slots and you have to load data from

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Jean-Marc Vanel
I fixed a stupid error in text index URI, resulting from pasting, in new file jena.spatial+text.assembler.ttl: https://github.com/jmvanel/semantic_forms/blob/master/scala/jena.spatial%2Btext.assembler.ttl Now, it reads: :spatial_dataset rdf:type spatial:SpatialDataset ; rdf:type

Re: Build by program a Dataset that is both textual and spatial

2018-12-16 Thread Jean-Marc Vanel
So I tried a new file jena.spatial+text.assembler.ttl with separate Lucene indices for spatial and text: https://github.com/jmvanel/semantic_forms/blob/master/scala/jena.spatial%2Btext.assembler.ttl It defines a Dataset with both Dataset types and both Lucene indices: :spatial_dataset rdf:type