Re: SHACL

2024-04-06 Thread Paul Tyson
This is old, but might be useful. It has a chapter on SHACL. https://book.validatingrdf.com/ Regards, --Paul On 4/5/24 04:58, Hashim Khan wrote: Hi, I am interested in working with SHACL shapes for validation. I would like to know if someone points out a nice resource. Best,

Re: Database Migrations in Fuseki

2024-02-10 Thread Paul Tyson
I don't know if my experience is helpful, but I went a different way to solve these sorts of problems. I avoid adding business logic while generating the RDF from the source data systems. It is almost entirely a simple transliteration from one format to another (I use an R2RML mapping.) The

sparql query performance jena v2, 3, 4

2023-02-27 Thread Paul Tyson
I maintain an old jena/fuseki application that has happily been using jena v2.13 and tdb v1.1.2 for several years. It loads 1b+ triples into a tdb database, and runs a couple dozen queries, some not so trivial, on the tdb. Now it is time to update things. I first went to 3.17, to stay on

jena version java compatibility

2022-08-15 Thread Paul Tyson
Apologies for not paying close enough attention to the release history, but I need to find the latest jena version that runs in java 1.8. I know support for java8 was dropped somewhere in the 4.x series, but can't find where. My application must remain at java8 for the time being. Thanks in

Re: where is the RDF shapes action?

2022-07-02 Thread Paul Tyson
Hi Florian, Long ago I learned some about SPIN, which became SHACL. I thought back then it had quite a lot of potential to aid in RDF processing, but didn't have any use cases for it at the time More below. On 6/27/22 14:17, Florian Kleedorfer wrote: Am 2022-06-27 18:51, schrieb Paul Tyson

Re: where is the RDF shapes action?

2022-07-02 Thread Paul Tyson
). Also Jena feature emerging: If you could look at PR 1256 and see if that can be used for your "informative" validation if not covered by validation reports. https://github.com/apache/jena/pull/1256  (Florian Kleedorfer)     Andy On 27/06/2022 17:51, Paul Tyson wrote: Can an

Re: where is the RDF shapes action?

2022-06-27 Thread Paul Tyson
27, 2022, at 11:52, Paul Tyson wrote: > > Can anyone point to websites, mailing lists, or other forum where users are > discussing questions, use cases, and solutions involving RDF shapes (ShEx or > SHACL)? There is plenty of technical information about both languages, but > not

where is the RDF shapes action?

2022-06-27 Thread Paul Tyson
Can anyone point to websites, mailing lists, or other forum where users are discussing questions, use cases, and solutions involving RDF shapes (ShEx or SHACL)? There is plenty of technical information about both languages, but not so much real-world practical discussion. I recently started

Re: JSON-LD: 1.0 or 1.1

2022-04-23 Thread Paul Tyson
> On Apr 23, 2022, at 12:16, Andy Seaborne wrote: > > What should the default settings be JSON-LD 1.0 or 1.1? > 1.1 would better meet my use cases. Thanks, —Paul

Re: Java APIs

2022-02-06 Thread Paul Tyson
Hans-Jürgen, that sounds like an interesting and useful project. Can you post a website where we can get more information and follow the project? Thanks, --Paul On 2/5/22 18:34, Hans-Juergen Rennau wrote: Hello, I am interested in the integration of SPARQL into XQuery. The essential

Re: Ontology

2021-08-19 Thread Paul Tyson
Yes, off-topic, better forums would be ontolog-forum (http://ontologforum.org/info/) and semantic-...@w3.org. But, briefly: I have yet to see a good use case that would justify the expense and trouble of making a formal ontology. The most you will probably ever need is an RDFS schema, and

Re: Scalability

2021-07-23 Thread Paul Tyson
On 7/23/21 12:32 PM, Matt Whitby wrote: A little bit of a vague question, and perhaps a silly one. How well does Jena scale? Would it tap out after a given number of triples? There are way too many variables to give a simple answer. I curate a dataset of 1 billion triples that is

Re: [Apache Fuseki] Limits of Apache Fuseki Triple Store

2021-01-25 Thread Paul Tyson
Regarding first question: I maintain a dataset of 1 billion triples in a very old fuseki tdb version. I would like to know if anyone’s working in trillion triple range , perhaps with RDF/HDT. Regards, —Paul > On Jan 25, 2021, at 01:02, Marco Franke wrote: > >  > Dear developers, > > we

Re: Visualising Jena model saved in RDF/XML in the browser

2020-03-09 Thread Paul Tyson
at, > non-nested output. > RDF/XML is significant as a bridge format to the XML stack. > > Do you have an example of "useful schema-specific information presentations"? > > > Martynas > > On Mon, Mar 9, 2020 at 3:56 AM Paul Tyson wrote: > > > &g

Re: Visualising Jena model saved in RDF/XML in the browser

2020-03-08 Thread Paul Tyson
On Sun, 2020-03-08 at 12:06 +0530, Diptendu Dutta wrote: > I have used Jena to generate RDF/XML of the model: > > xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#; > xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#;> > > > TheConsignor > >

Re: programmatically construct UpdateDeleteInsert

2019-12-05 Thread Paul Tyson
Never mind. I missed: QuadAcc deletes = upd.getDeleteAcc(); deletes.addTriple(/* triple pattern to delete */); Regards, --Paul On Thu, 2019-12-05 at 09:12 -0600, Paul Tyson wrote: > I'm trying to construct a SPARQL update like: > > DELETE {?r ?p ?o. ?s ?p1 ?r.} > WHERE {?r

programmatically construct UpdateDeleteInsert

2019-12-05 Thread Paul Tyson
I'm trying to construct a SPARQL update like: DELETE {?r ?p ?o. ?s ?p1 ?r.} WHERE {?r ex:foo "123"; ex:bar "456"; ?p ?o. OPTIONAL {?s ?p1 ?r} } In other words, delete all triples with subject or object resource that has certain ex:foo and ex:bar values. I can't see how to set or modify the

Re: Delete all nested triples

2019-02-20 Thread Paul Tyson
On Wed, 2019-02-20 at 17:27 -0700, ganesh chandra wrote: > Hello All, > My data looks something like this: > a something:Entity ; > something:privateData [ a something:PrivateData ; > something:jsonContent "{\"fileType\":

Re: Example code

2018-03-19 Thread Paul Tyson
On Mon, 2018-03-19 at 20:01 +1000, David Moss wrote: > I agree, the technical documentation is not the place to keep basic how-to > examples. But with Jena the basic how-to examples seem to be missing entirely. > I have written GUI applications using the available examples from MYSQL. > MYSQL

Re: [JENA-DEV] SPARQL - Way to concat several property values

2018-02-20 Thread Paul Tyson
Data and query samples would help. (I could not see the image.) But from your problem description, you might try GROUP BY and the GROUP_CONCAT aggregate function. This would put (for example) all the properties of a subject in one result field, separated by the delimiter of your choice. See the

Re: there is a tutorial on the use of fuseki on jsp (JavaServer Pages), to be able to consult about the web platform

2017-11-21 Thread Paul Tyson
> On Nov 21, 2017, at 12:21, Manuel Quintero Fonseca wrote: > > Yes, I want to make a query with sparql in fuseki, to show the result in a > web. what has been researched, happens to use java for web pages JSP is > used in tomcat. You might have a good reason (or

Re: Return nested JSON results

2017-05-16 Thread Paul Tyson
On May 16, 2017, at 15:51, Laura Morales wrote: >> That's ok, but i wanted to know what kind of app is querying your Fuseki >> database? Only a hint.. I know it sounds 'fundamentally' not essential, >> but for 'me' it is the practice of developement. > > > Well yeah, I

Re: Return nested JSON results

2017-05-09 Thread Paul Tyson
On Tue, 2017-05-09 at 18:32 +, Dimov, Stefan wrote: > I’m also interested in having nested JSON results … > S. > > On 5/8/17, 11:06 PM, "Laura Morales" wrote: > > > I may have time to test if the change in this pull request [1] could > create such response. Feel free

Re: How to make complex SPARQL queries reusable?

2017-04-24 Thread Paul Tyson
Another option is to express the query logic in standard rule notation, such as RIF, and translate to sparql. This approach is especially indicated if the sparql queries represent actual business rules. Regards, --Paul > On Apr 24, 2017, at 06:22, Andy Seaborne wrote: > >

Re: Predicates with no vocabulary

2017-04-12 Thread Paul Tyson
Part of the fun (and ease) of RDF is being able to make stuff up as you go along. But, as others have said, when you move past the stage of learning and experimentation, and make your work persistent or reusable, you'll want to be more formal. The linked data patterns book [1] has several

Re: SPARQL query

2017-03-01 Thread Paul Tyson
Maybe something like: Select ?s ?p ?o Where { ?s ?p ?o. Filter (?p in (:A, :B, :C)) Minus {?s ?p2 ?o2. Filter (!(?p2 in (:A, :B, :C)))} Untested. Regards, --Paul > On Mar 1, 2017, at 14:12, Claude Warren wrote: > > I have a graph where resources have a number of controlled

Re: Benefits of Semantic web

2017-02-10 Thread Paul Tyson
You might find this related discussion interesting: https://groups.google.com/forum/m/#!topic/ontolog-forum/AUzkFVhGrok Regards, --Paul > On Feb 10, 2017, at 11:02, David Jordan wrote: > > I agree that have some discussion about this is very useful. Many of us > have

Re: JSON-LD questions

2017-01-11 Thread Paul Tyson
d sequence (e.g., XML). Regards, --Paul > >> On Wed, Jan 11, 2017 at 3:18 PM, Paul Tyson <phty...@sbcglobal.net> wrote: >> >> >> On Jan 10, 2017, at 18:06, Grahame Grieve >> <grah...@healthintersections.com.au> wrote: >> >>>> >

Re: sum property values

2016-10-13 Thread Paul Tyson
There have been 2 different techniques mentioned that solve different problems. The SUM aggregate function can be used to reduce a result set as illustrated by the following minimal CSV snippets: BEFORE: ?v1,?v2 "A",1 "A",1 "A",2 "B",3 "B",1 AFTER: ?v1,?v3 "A",4 "B",4 The above results could

Re: Applications/Projects Using Jena

2016-10-06 Thread Paul Tyson
As a contractor, I cannot disclose specific company information, but I have implemented a couple of large Jena projects. One is in the PLM (Product Lifecycle Management) domain. I extract part master and product structure from 4 enterprise data systems and load to TDB, using RDB2RDF (R2RML)

Re: Construct query

2016-10-01 Thread Paul Tyson
RUCT {?s ex:hasResearch "C-SPARQL"} WHERE {?s rdf:type ex:Student; ex:hasCourse/rdfs:label "Intro to SPARQL"} Regards, --Paul > > On Fri, Sep 30, 2016 at 8:52 PM, Paul Tyson <phty...@sbcglobal.net> wrote: > > > On Thu, 2016-09-29 at 13:44 -0700, tina s

Re: Construct query

2016-09-30 Thread Paul Tyson
On Thu, 2016-09-29 at 13:44 -0700, tina sani wrote: > I want to know about the Construct query. > How it differs from Select query One way to think about it, if you have any background in relational databases, is that SELECT returns a highly denormalized table (at least, from any non-trivial

Re: sparql algebra differences jena 2.13.0/3.n

2016-09-18 Thread Paul Tyson
} } I have noticed other cases where order of triples and bgps makes quite a difference in execution time, but I can't figure out any science to it. Are there any guidelines for ordering the components of a complex query (including UNION and OPTIONAL clauses) to optimize performance? Can

Re: sparql algebra differences jena 2.13.0/3.n

2016-09-16 Thread Paul Tyson
"str3" ) ) || > regex(?var4, "pat1") ) ) >} > { { ?var1 :p3 ?var4 } > UNION >{ ?var1 :p4 ?var4 } >} > } > > >Andy > > >> On 14/09/16 13:15, Paul Tyson wrote: >>> On Wed, 2016-09-14

Re: sparql algebra differences jena 2.13.0/3.n

2016-09-14 Thread Paul Tyson
uot;Or(x,y)". The surface syntax doesn't make any difference in the algebra, does it? Regards, --Paul > On 14/09/16 02:01, Paul Tyson wrote: > > I have some queries that worked fine in jena-2.13.0 but not in > > jena-3.1.0, using the same data. > > > > For a long ti

sparql algebra differences jena 2.13.0/3.n

2016-09-13 Thread Paul Tyson
I have some queries that worked fine in jena-2.13.0 but not in jena-3.1.0, using the same data. For a long time I've been running a couple dozen queries regularly over a large (900M triples) TDB, using jena-2.13.0. When I recently upgraded to jena-3.1.0, I found that 5 of these queries would not

Re: Relationship between similar columns from multiple databases

2016-09-07 Thread Paul Tyson
t; What are you using for R2RML? > > Ontop looks promising: http://ontop.inf.unibz.it/ > >> On Wed, Sep 7, 2016 at 9:11 PM, Paul Tyson <phty...@sbcglobal.net> wrote: >> Yes, I am using R2RML to convert 4 big PLM DBs into RDF, load in Jena TDB >> and serve via fuseki

Re: Relationship between similar columns from multiple databases

2016-09-07 Thread Paul Tyson
Yes, I am using R2RML to convert 4 big PLM DBs into RDF, load in Jena TDB and serve via fuseki for data mashups and inconsistency reports. Works very well. Best, --Paul > On Sep 7, 2016, at 13:39, Martynas Jusevičius wrote: > > I think R2RML and GRDDL could be of

Re: Optimising path to root concept SPARQL query

2016-02-01 Thread Paul Tyson
I don't know that you can get such results from sparql directly. I would get flat list of subclass relations in xml (.srx) or Json and then process with xslt or JavaScript to write out class hierarchy. Regards, --Paul > On Feb 1, 2016, at 07:05, Joël Kuiper wrote: > >

Re: optimizing serialization of results from fuseki

2016-01-07 Thread Paul Tyson
g filters, and got much better results using Lucene add-on for that. Regards, --Paul > Håvard > > > > > On 07/01/16 03:51, "Paul Tyson" <phty...@sbcglobal.net> wrote: > > >On Wed, 2016-01-06 at 18:52 +, Andy Seaborne wrote: >

Re: optimizing serialization of results from fuseki

2016-01-07 Thread Paul Tyson
dy > > On 07/01/16 08:48, Håvard Mikkelsen Ottestad wrote: > > Hi, > > > > Reordering the filters might help. > > > > Also, maybe a stats file would reorder your query to be faster. I dunno how > > often (or if) fuseki generates a stats file. You can try to gener

optimizing serialization of results from fuseki

2016-01-06 Thread Paul Tyson
I have a modest (17M triple) dataset, fairly flat graph. I run some queries selecting nodes with anywhere from 12-20 different property values. Result set counts are anywhere from 10,000 to 30,000 nodes. Total execution time measured at client are in the 30-40 second range. The web request

Re: optimizing serialization of results from fuseki

2016-01-06 Thread Paul Tyson
+ filter), and it seems to be the more complicated filters that start to slow things down, as might be expected. Thanks for your comments and interest. The performance we're seeing is unacceptable for our application requirements, so I wanted to see if there were any other performance factors I had mi

how to set fuseki options under tomcat

2016-01-05 Thread Paul Tyson
On another thread Andy mentioned fuseki options arq:optIndexJoinStrategy and arq:optMergeBGPs, with example using "--set" command line option. How do you set these when running under tomcat? I could not find instructions or examples in the documentation. Thanks, --Paul

Re: delete/insert problem with tdbupdate

2015-11-02 Thread Paul Tyson
On 01/11/15 23:00, Paul Tyson wrote: >>> On Sun, 2015-11-01 at 10:32 +, Andy Seaborne wrote: >>> Paul, >>> >>> Could you simplify the example please? >> >> I will work on that, but ran into some puzzling differences between >> windows and L

Re: delete/insert problem with tdbupdate

2015-11-01 Thread Paul Tyson
records yet in the dataset, as well as those that have been populated with the INSERT triples. The ex:gAlt property on ex:Thing1 instances is optional. If the outer OPTIONAL were removed we would need different patterns based on whether the dataset already held the target triples. > > On

Re: fuseki 2.3.0 startup failed in tomcat

2015-08-15 Thread Paul Tyson
Never mind, this was an unrelated problem in server.xml file. Regards, --Paul On Sat, 2015-08-15 at 12:06 -0500, Paul Tyson wrote: I dropped fuseki.war in webapps directory of tomcat. The server.xml has been configured to support another big webapp, but as far as I can tell should

fuseki 2.3.0 startup failed in tomcat

2015-08-15 Thread Paul Tyson
I dropped fuseki.war in webapps directory of tomcat. The server.xml has been configured to support another big webapp, but as far as I can tell should not exclude other webapps from running. fuseki fails to start and leaves these messages in log. Can anyone give a clue where to look for root

Re: fuseki ontmodel capacity

2015-07-01 Thread Paul Tyson
Hi Andy, further questions below. On Fri, 2015-06-26 at 18:47 +0100, Andy Seaborne wrote: On 25/06/15 17:35, Paul Tyson wrote: Hi Andy, no joy yet. On Wed, 2015-06-24 at 22:36 +0100, Andy Seaborne wrote: On 24/06/15 21:37, Paul Tyson wrote: Before working through the configuration

Re: fuseki2 ontmodel config

2015-06-25 Thread Paul Tyson
Thanks Andy. Simple fix, noted below. However, it turns out I have OWL2 constructs (datatype restrictions) that apparently are not handled by the Jena OWL reasoners. On Thu, 2015-06-25 at 12:56 +0100, Andy Seaborne wrote: Hi Paul, On 25/06/15 03:42, Paul Tyson wrote: I cannot piece

Re: fuseki ontmodel capacity

2015-06-25 Thread Paul Tyson
Hi Andy, no joy yet. On Wed, 2015-06-24 at 22:36 +0100, Andy Seaborne wrote: On 24/06/15 21:37, Paul Tyson wrote: Before working through the configuration of an ontology model in fuseki2, I wanted to ask if anyone has experience with large models. I estimate there will be 250K class

fuseki ontmodel capacity

2015-06-24 Thread Paul Tyson
Before working through the configuration of an ontology model in fuseki2, I wanted to ask if anyone has experience with large models. I estimate there will be 250K class definitions, about 40M triples. My queries will be for instance checking: select ?class where { _:a rdf:type ?class; ex:p1

fuseki2 ontmodel config

2015-06-24 Thread Paul Tyson
I cannot piece together a workable configuration for fuseki2 with an OntModel. Combining the config-tdb-dir template with info from http://jena.markmail.org/message/wr3f6gy5orxbszyd I get the config shown below. When I put this as file tdb-owl.ttl in the FUSEKI_BASE/configuration directory,

named graph impact on query performance

2015-04-11 Thread Paul Tyson
Hi, Any theoretical reasons or evidence that lots of named graphs in a TDB repository will adversely affect query performance? For example, 50,000 named graphs containing total of 11 million triples. Some queries will be for specific graphs, but most will be union queries over the entire

Re: named graph impact on query performance

2015-04-11 Thread Paul Tyson
if performance sucks that would jeopardize its viability, with TDB anyway. Regards, --Paul On Sun, Apr 12, 2015 at 3:10 AM, Paul Tyson phty...@sbcglobal.net wrote: Hi, Any theoretical reasons or evidence that lots of named graphs in a TDB repository will adversely affect query performance

Re: hot swap tdb behind fuseki

2015-02-13 Thread Paul Tyson
On Fri, 2015-02-13 at 16:49 +, Andy Seaborne wrote: On 12/02/15 22:09, Paul Tyson wrote: On Wed, 2015-02-11 at 10:53 +, Andy Seaborne wrote: Paul, You can add a new, pre-built database to a running Fuseki2 server with a new name but you can't hot swap an existing name

Re: hot swap tdb behind fuseki

2015-02-12 Thread Paul Tyson
others. On 11/02/15 00:07, Paul Tyson wrote: On Tue, 2015-02-10 at 23:38 +, Stian Soiland-Reyes wrote: Are you using the tdb to swap just for reading, or would you need to synchronize transactions? Below I'll assume you mean 'reading', and that you want to swap because you have

hot swap tdb behind fuseki

2015-02-10 Thread Paul Tyson
I've looked through the user documentation but did not find a clue to this problem. I have not dug too deeply into the code. The problem is to safely re-initialize a running fuseki server to read a new tdb location. I've thought of using 2 (or more) jetty or tomcat workers in a load-balancing

Re: hot swap tdb behind fuseki

2015-02-10 Thread Paul Tyson
probably hide/simplify that name from the URI with a simple Apache httpd ProxyPass or RewriteRule Thanks for the pointers and warning. I'll see if I can work it out. Regards, --Paul On 10 February 2015 at 19:19, Paul Tyson phty...@sbcglobal.net wrote: I've looked through the user

Re: Is this a good way to get started?

2014-12-11 Thread Paul Tyson
Hi Nate, I don't know if your questions were about Rob's particular application, or just in general, but I'll jump in with a few generic responses in areas I'm familiar with. On Thu, 2014-12-11 at 17:48 -0500, Nate Marks wrote: This is great feedback. Thanks for taking the time. If you

Re: XXXX-Large TDB (suggestion wanted)

2014-11-14 Thread Paul Tyson
Jacek, Sorry, I wasn't paying close attention to this thread, but saw your last comment and wanted to chime in. Of course jena/fuseki (nor any RDF system) can't compete with SQL in the things SQL is good at. I load 760M triples in about 8 hours on a Linux VM on what is by now probably a

text query analyzer problem

2014-09-18 Thread Paul Tyson
I've been using the configurable text query analyzer in jena 2.11.2 (fuseki 1.0.2) since it was provided by JENA-654. I use the KeywordAnalyzer to index a field that contains part numbers, which are mostly composed of digits with dashes, but a fair amount of alphabetic characters. I just noticed

Re: Configuring Jena TDB for a benchmark

2014-04-19 Thread Paul Tyson
On Sat, 2014-04-19 at 18:33 +0100, Saud Aljaloud wrote: Dear Jena folks, We are investigating how efficient different triple stores, including Jena TDB, handle literal strings within SPARQL. To this end, We are now working on benchmarking these triple stores against a set of specific

Re: Sparql To SQL

2014-03-30 Thread Paul Tyson
On Sun, 2014-03-30 at 09:37 +, Kamalraj Jairam wrote: Hello All, Whats the best way to convert sparql to SQL using R2RML mappings and convert resultset from DB to RDF? What are the givens? Do you have existing SPARQL text written against some RDF produced by some existing R2RML

Re: jena-text indexing fields with KeywordAnalyzer

2014-03-17 Thread Paul Tyson
On Mon, 2014-03-17 at 12:58 +, bwm-epimorphics wrote: On 14/03/14 00:51, Paul Tyson wrote: [...] Has anyone else encountered this problem? I have. I have an application that may require using either a different analyzer or the StandardAnalyzer with a different set of stop words

jena-text indexing fields with KeywordAnalyzer

2014-03-13 Thread Paul Tyson
I just tried out the jena-text indexing and query capabilities of jena 2.11. Great stuff, but the property values I indexed contain part numbers that frequently contain hyphens. Apparently Lucene's StandardAnalyzer tokenizes on hyphens, so my initial search results were quite puzzling. However,

sparql performance parameters and limitations

2013-06-01 Thread Paul Tyson
I'm seeking guidance for setting expectations for TDB sparql performance as the size and complexity of the queries grows. The dataset has about 600 million triples, around 200 million non-literal nodes, about 500 predicates. I generate sparql queries from logical rules, which as it turns out can

Re: stream turtle to TDBLoader

2013-05-16 Thread Paul Tyson
-powered machine. Regards, --Paul On Thu, 2013-05-16 at 12:50 +0100, Andy Seaborne wrote: On 16/05/13 04:21, Paul Tyson wrote: Hi, I'm trying to use the TDBLoader api to stream turtle to the bulk loader to create a new TDB repository. I suspect none of the TDBLoader.load*() methods

stream turtle to TDBLoader

2013-05-15 Thread Paul Tyson
Hi, I'm trying to use the TDBLoader api to stream turtle to the bulk loader to create a new TDB repository. I suspect none of the TDBLoader.load*() methods accept turtle input. I'm using version 2.10.1. This sort of code produces an immediate RIOT exception: InputStream is = ...; Dataset ds =