Is it possible for you to make a copy of the database to query offline?
That can be expensive in storage, but it's really the simplest thing to do
in many ways.

Adam

On Mon, Jan 3, 2022, 1:09 PM Andy Seaborne <[email protected]> wrote:

>
>
> On 03/01/2022 17:44, [email protected] wrote:
> >
> >
> > Hi,
> >
> > you are right, I was not clear in the request. I try to
> > explain myself better.
> > I have a knowledge base of over a billion
> > triples.
> > I am testing a query that returns about 2 million results (in
> > the future I will have many queries that will return a lot of data)
> > On
> > the client side I have to allow the download of the results in CSV
> > format (on asynchronous request, not through batch).
>
> How long does it take?
>
> > But, with these
> > volumes of data, we can have 2 types of errors:
> > - OutOfMemory on the
> > Result (I can increase the heap size....)
>
> How are you making the query? (what software?)
>
> Fuseki will stream results back and with the Jena client code, can
> provide a end-to-end streaming solution.
>
> The fastest results for is the binary Thrift encoding.
>
> RDFConnectionFuseki will use this.
>
> Some queries don't stream.
>
> > - Connection timeout on Fuseki
> > (can I increase the configuration timeout?)
>
> What is timing it out? Some intermediate?
>
> Fuseki by default does not have timeouts. Your configuration may set
> them but the default is unbounded.
>
> If you have set timeouts, you can create another service to the same
> database with different settings. It shares the TDB database safely.
>
> > For this reason I was
> > thinking of using the tdbquery command (takes 3 minutes to run with
> > tdbquery). But I can't stop fuseki to perform the download operation.
> > Fuseki must remain active at all times to answer all other
> > questions.
>
> You can't use tdbquery this way.
>
> It should cause an error saying "already in use" or some such message.
> There is locking on the file system to detect dual use.
>
> With virtualized setups it may be possible to not get the error because
> filing systems are weird, but all that has happened is the the locking
> is not seeing the duplicate use, not finding it is possible.
>
> You will corrupt the database.
>
> Corrupt = permanently damage, not recoverable.
>
>      Andy
>
> >
> > Il 03.01.2022 17:25 Rinor Sefa ha scritto:
> >
> >> I think if
> > you describe your use case in more detail, it would be easier to get
> > help.
> >>
> >> For example, can you clarify
> >> - a query? What kind of query
> >
> >> - "many results", any number?
> >> - What do you consider slow and
> > inefficient and what are would you consider ideal?
> >>
> >> Also, why do
> > you think that the HTTP call is the bottleneck? I think that this is a
> > wrong assumption. Try to run a simple query and you will see that the
> > HTTP call is not the bottleneck.
> >>
> >> -----Original Message-----
> >> From:
> > [email protected] [1]
> >> Sent: Monday, 3 January 2022 12:59
> >> To:
> > [email protected] [3]
> >> Subject: Use command tdbquery
> >>
> >> Hi,
> >>
> >>
> > i am using a fuseki server and need to run a query which returns a lot
> > of results. The use of the HTTP call (http: // localhost: 3030 / ds /
> > query = myQuery) is very slow and inefficient. I thought about using the
> > tdbquery command. But I don't want to stop fuseki. Is there any way to
> > do this?
> >>
> >> Con Tiscali Mobile Smart 70 hai 70 GB in 4G, minuti
> > illimitati e 100 SMS a soli 7,99EUR al mese http://tisca.li/Smart70 [4]
> >
> >
> >
> >
> > Con Tiscali Mobile Smart 70 hai 70 GB in 4G, minuti illimitati e 100 SMS
> a soli 7,99€ al mese http://tisca.li/Smart70
> >
> >
>

Reply via email to