Request: Load TDBs at Fuseki startup

2021-09-24 Thread Brandon Sara
Currently, it seems that all cached inference (at least with the transitive reasoner) is not loaded into cache until the first query that would query data from a dataset is submitted to the Fuseki server. For very large ontologies, this loading process can take quite a while. This basically

Re: Heap space problem with insert where

2021-09-24 Thread Harri Kiiskinen
Thanks a lot for the explanations! Harri On 24.9.2021 12.15, Andy Seaborne wrote: Inline... On 24/09/2021 09:13, Harri Kiiskinen wrote: Hi all, and thanks for the support! I did manage to resolve the problem by modifying the query, detailed comments below. Harri K. On 23.9.2021 22.47,

Re: Heap space problem with insert where

2021-09-24 Thread Marco Neumann
sure and if its fits your your use case even better On Fri, Sep 24, 2021 at 9:41 AM Harri Kiiskinen wrote: > Perhaps so; but as a tool, Jena, and SPARQL in general, is very suitable > for managing and processing data so that the processes can be described > and repeated. For example in this

Re: Heap space problem with insert where

2021-09-24 Thread Andy Seaborne
Inline... On 24/09/2021 09:13, Harri Kiiskinen wrote: Hi all, and thanks for the support! I did manage to resolve the problem by modifying the query, detailed comments below. Harri K. On 23.9.2021 22.47, Andy Seaborne wrote: I guess you are using TDB2 if you have -Xmx2G. TDB1 wil use even

Re: Support of streams in Fuseki

2021-09-24 Thread Andy Seaborne
Hi Henrik, Requests to Fuseki are HTTP requests. You could simply copy the HTTP response body stream into a file if you can get the body as a stream of bytes from the local HTTP client library. Whether it streams or whether the HTTP client library in your application collects the whole

Re: Heap space problem with insert where

2021-09-24 Thread Harri Kiiskinen
Perhaps so; but as a tool, Jena, and SPARQL in general, is very suitable for managing and processing data so that the processes can be described and repeated. For example in this case, processing the results of the OCR is very quick compared to the actual OCR process, so I prefer to store the

Re: Heap space problem with insert where

2021-09-24 Thread Marco Neumann
All that said, I would think you'd be best advised to run this type of operation outside of Jena during preprocessing with CLI tools such as grep, sed, awk or ack. On Fri, Sep 24, 2021 at 9:14 AM Harri Kiiskinen wrote: > Hi all, > > and thanks for the support! I did manage to resolve the

Re: Heap space problem with insert where

2021-09-24 Thread Harri Kiiskinen
Hi all, and thanks for the support! I did manage to resolve the problem by modifying the query, detailed comments below. Harri K. On 23.9.2021 22.47, Andy Seaborne wrote: I guess you are using TDB2 if you have -Xmx2G. TDB1 wil use even more heap space. Yes, TDB2. All those named

Support of streams in Fuseki

2021-09-24 Thread Dibowski Henrik (CR/ADT3)
Dear Jena team, in our current project we are exploring a way how to extract triples from a Jena Fuseki dataset via SPARQL Construct queries. We want to write (i.e. stream) the returned statements directly on-the-fly into a file on our server, without the need of loading all the statements