ESWC 2023 - SPARQL Jena related papers

LB Wed, 31 May 2023 01:12:49 -0700

Hi all,

slightly off-topic, but given the ongoing ESWC 2023 conference, I want to share two papers that might be of interest to some of you:


1.  Join Ordering of SPARQL Property Path Queries

SPARQL property path queries provide a succinct way to write complex navigational queries over RDF knowledge graphs. However, their evaluation remains difficult as they may involve the execution of transitive closures. As a result, many property path queries just timeout when executed on public online RDF knowledge graphs. One solution to speed up their execution is to find optimal join orders. Although the join ordering problem has been extensively studied for traditional SPARQL queries, the presence of property path patterns biases existing approaches. In this paper we focus on C2RP QUF queries (conjunctive SPARQL property path queries with UNION and FILTER), and we present a query optimizer that is able to capture the cost of C2RP QUF queries using an appropriate cost model and a sampling-based cardinality estimator. On the latest Wikidata Query Benchmark, we empirically demonstrate that our approach finds significantly better join orders than Virtuoso and BlazeGraph.

Paper: https://2023.eswc-conferences.org/wp-content/uploads/2023/05/paper_Aimonier-Davat_2023_Join.pdf


Not directly related to Jena, but interesting anyway.

2.  Evaluation of a Representative Selection of SPARQL Query Engines using Wikidata

In this paper, we present an evaluation of the performance of five representative RDF triplestores, including GraphDB, Jena Fuseki, Neptune, RDFox, and Stardog, and one experimental SPARQL query engine, QLever. We compare importing time, loading time, and exporting time using a complete version of the knowledge graph Wikidata, and we also evaluate query performances using 328 queries defined by Wikidata users. To put this evaluation into context with respect to previous evaluations, we also analyze the query performances of these systems using a prominent synthetic benchmark: SP2Bench. We observed that most of the systems we considered for the evaluation were able to complete the execution of almost all the queries defined by Wikidata users before the timeout we established. We noticed, however, that the time needed by most systems to import and export Wikidata might be longer than required in some industrial and academic projects, where information is represented, enriched, and stored using different representation means.

Paper: https://2023.eswc-conferences.org/wp-content/uploads/2023/05/paper_Lam_2023_Evaluation.pdf


In the second paper, Jena TDB2 (v4.4.0) was used for the benchmark.


Cheers,

Lorenz
