Dear Dominic, all,
Yes, I will always defend that modeling is technology independent,
limited however to the degree that science and technology should at
least provide the prospect of implementation in the near future, and
some viable approximations immediately. We definitely started the CRM
before the technology was generally available but expected. The primary
criterion is that the model reflects our insight about the scientific
discourse we target at. As such, I see the model-level discussion to be
between reasoning about "proposition sets" versus a "single binary
proposition". The technical discussion should be about best and most
effective approximations, regardless popular or not. The effectiveness
will depend on use cases and platform requirements.
Please let us know, who is interested in participating in a narrower
subgroup for creating a document analyzing the alternatives.
Best,
Martin
On 5/11/2023 8:01 PM, Dominic Oldman wrote:
Hi
Just a quick question on this. We develop the model independently of
technology. I can see that this discussion is getting technical. I
currently implement propositions sets using RDF named graphs because
we can and it works but it is not stipulated. Rob suggests that there
are tech upgrades that might suit this issue better. However, isn't it
the case that we need to be able to implement in different ways (I
don't currently know much about RDF*) depending on the systems we
have? How is RDF* implemented? - is it backwardly compatible with what
we are all using? Do we give more modelling credence to things that
everyone uses? etc., etc. But aren't these questions the reason why we
are technology independent? Given this, my question is, - have we got
to a stage when the modelling now depends on a particular technology?
Can someone provide some clarification on this? Which solution is tech
independent? Are they all independent of this tech discussion? One is
at least.
D
On Thu, 11 May 2023 at 16:18, Martin Doerr via Crm-sig
<crm-sig@ics.forth.gr> wrote:
Dear Robert,
We have just created the new issue to discuss this in detail. We
should prepare a detailed analysis, citing all pros and cons. May
be we continue this discussion better in a subgroup?
Named Graphs are not a very specific technology, if we take the
fact that all current triple stores are actually implemented as
quad stores, regardless whether they call the construct "Named
Graph" or "context". We have used and implemented this feature,
and it is very performant. It runs on BlazeGraph as well. I think
their is not a simple answer to that. Performance can become a
major issue, when you have really a lot of data.
For the attribution of artists and "style of" vs "school of" etc.
of the collection management system of the British Museum, the
ResearchSpace Project had created a set of subproperties of P14
carried out by, which could be used as input for a roles vocabulary.
I did not propose to use Dig as is, but to consider the construct.
The W3C annotation model is very interesting. We would need a
connection to the Creation Event of making an annotation, and
whose opinion it is, in order to make it CRM compatible. Why not
allowing a Named Graph as target? We should compare the segment
construct of the W3C annotation model with the METS <area> types
and extensions we used. The Dig model was used to trace provenance
of annotated area through transformations of digital objects. That
was very important for exchanging research insights on 3D models.
To be discussed!
We can extend E13 to Proposition Sets, which would be very
important to describe consistently CRMinf and generalized
observations. That would then be most effectively implementd via
Named Graphs.
Opinions?
Best,
Martin
On 5/11/2023 3:41 PM, Robert Sanderson wrote:
If the intent is that the assertion is in the discourse, and not
a syntactic workaround for .1 properties that would be
unnecessary if we had RDF* or property graphs, then I would say
E13 is exactly the right approach to use. In comparison, I
consider the PC classes to be just that - a syntactic work around
needed in RDF and not part of the discourse. In LInked Art, in a
discussion around uncertain attribution of artists and "style of"
vs "school of", we posited the need for a property on E13 for
this scenario. (Also the need for .1 on P11 for the same reason
as we have it on P14)
I would say that Dig's annotation is *not* the correct approach
for several reasons:
* Named Graphs are a very specific technology that have never
seen significant uptake and are likely (IMO) to decrease in usage
once RDF* is formalized.
* Dig needs to be updated, and Annotation is (I would hope)
likely to go away ... because ...
* ... it could just use the Web Annotation Data Model:
https://www.w3.org/TR/annotation-model/
(And reification has all the problems discussed in this thread
already)
Rob
On Thu, May 11, 2023 at 7:17 AM George Bruseker via Crm-sig
<crm-sig@ics.forth.gr> wrote:
Dear Martin,
I agree that E13 is a poor man's solution to a complicated
problem. But it is for some, the solution available. Other
solutions like Inf for documenting historical argumentation
and using named graphs is great as a possibility. Using prov
o to represent the meta discursive level of the provenance of
the dataset as such great. But my immediate interest was
simple the humble ability of E13 to be able to point to all
statements that can be made with precisely one link in CRM.
I'll keep watching the space!
Cheers,
G
On Thu, May 11, 2023 at 1:25 PM Martin Doerr
<mar...@ics.forth.gr> wrote:
Dear George,
I agree with you below about the historical aspects. The
annotation model has the same historical aspect, but is
not limited to a single link.
Let us discuss!😁
Best,
Martin
On 5/11/2023 12:41 PM, George Bruseker wrote:
Dear Francesco, Martin,
Again for the record since I seem to be being read at
cross purposes, when I mention the word 'provenance' I
do not mean it in the sense of dataset provenance (to
which prov o would apply). I mean that in the world to
be described (the real world of tables charis cats dogs
scholars ideas etc.) there are real world events in
which people attribute things to things (see my
previous email). This is content of the world to be
represented in the semantic graph (not a metagraph about
the graph). This is describable and is described in
CIDOC CRM using E13 and its friends. If you want to say
that there was a historical situation that someone in
your department said (likely in the information system)
that some attribute related two things you can do this
with E13 (or I have completely misunderstood the CIDOC
CRM). This happens all the time in art history. One
particular often arising case is an argument about who
played what role in some object. Was Davinci the painter
or was it Simon? This is just a hum drum case of needing
to apply CIDOC CRM to real cases. Since E13 is a
mechanism for so doing on all other statements, it would
be a logical continuation that it could be used also on
.1 statements. But for technical reasons it cannot, that
is why I suggested a mild technical solution that makes
the technical extension logically coherent. It is in
this sense that I mean provenance and not in the
metasense of the provenance of the data qua data, also
an exciting but other issue to my mind.
Cheers,
George
On Thu, May 11, 2023 at 12:27 PM Martin Doerr via
Crm-sig <crm-sig@ics.forth.gr> wrote:
Dear Francesco,
This is an excellent paper.
I cite: "However, reification has no formal
semantics, and leads to a high increase in the
number of triples, hence, it does not scale well. "
I agree with your proposals. Prov-O mapping is a
must for CRM-SIG.
Best,
Martin
On 5/10/2023 11:55 PM, Francesco Beretta via Crm-sig
wrote:
Dear Martin, George, All,
I would not dare to suggest some solution of this
complex issue but let me hint to a couple of useful
papers (among many others):
Sikos, Leslie F., and Dean Philp, ‘Provenance-Aware
Knowledge Representation: A Survey of Data Models
and Contextualized Knowledge Graphs’, /Data Science
and Engineering/, 5.3 (2020), 293–316
<https://doi.org/10.1007/s41019-020-00118-0>
Hernández, Daniel, Aidan Hogan, and Markus
Krötzsch, ‘Reifying RDF: What Works Well With
Wikidata?’, in /Proceedings of the 11th
International Workshop on Scalable Semantic Web
Knowledge Base Systems Co-Located with 14th
International Semantic Web Conference (ISWC 2015),
Bethlehem, PA, USA, October 11, 2015./, 2015, pp.
32–47 <http://ceur-ws.org/Vol-1457/SSWS2015_paper3.pdf>
Once again, I would like to suggest carefully
distinguishing between the CRM domain of discourse,
in which the E13 class is conceptualized, and the
issue of stating the provenance of the information
modelled in the discourse domain, including
instances of class E13 as part of the modelled domain.
For this last task (or domain of discourse), it
would seems reasonable and in line with best
practices to use the PROV model and the
corresponding PROV-O ontology, a W3C
recommendation. Or providing a specific extension
of the CRM, compatible and aligned with the PROV
model. But using PROV-O seems a good choice in
order to facilitate interoperability.
There remains the more fundamental question of
whether the current debate about RDF implementation
is not in fact indicative of a more fundamental
problem related to properties of properties and the
implicit and richer information they contain, which
cannot be adequately expressed in RDF without
conceptualising them in terms of actual classes.
Aren't these rather hybrid P(roperty)C(lasses),
especially if they should be declared as subclasses
of E1, to be considered as /de facto/ classes and
not just properties? Because if they are just
statements, then adopting one or the other form of
existing RDF reifications practices seems to be the
good way to go.
Best
Francesco
Le 10.05.23 à 18:48, Martin Doerr via Crm-sig a écrit :
Dear All,
I suggest to resolve the issue of referring to the
provenance of .1 properties more specifically:
Solution a: Add properties to E13 to specify a .1
property. This may be more effective than the
double indirection via PC class instance and 4
links of the E13 construct.
Solution b: Use RDF reification for this specific
problem via the PC class.
We need to examine in both cases the inferences we
want to maintain about the base property and its
domain and range, and what the relevant query
construct is.
Personally, I prefer solution c: Use the
annotation model of CRM Dig, which goes via Named
Graphs. This is much more performant and logically
clearer, because Named Graphs are implemented as
direct references to property identifier, and
maintain a reference count for each one. This is
an important logic in its own right. Inferences
about the .properties would work in out ouf of a
Named Graph, whereas the reification may need
additional rules.
The query languages of Quad stores support them
explicitly.
The latest version of 3M supports Named Graph
definitions. This feature should be tested.
I would rather discourage E13 in the long term as
a means to denote provenance generally and
recommend a uniform use of Named Graphs. I am
aware that not all RDF encodings support the
feature. I that case we could resort to reification.
Opinions?
Best,
Martin
On 5/9/2023 10:37 PM, Francesco Beretta via
Crm-sig wrote:
Dear Christian-Emil, All,
For the reasons I detailed in my other email, I
totally agree with your point of view and would
like to raise all possible caveats to this kind
of mixing up quick and dirty implementation
solutions and consistent conceptual modelling.
If we need more classes, even on a provisional
and experimental perspective, I would strongly
suggest to produce them and document them as
such, with stable URIs, and then refine
progressively the ontology and integrate it into
the CRM family. Of course, a nice place to do
this is ontome.net <http://ontome.net> 😉
Best
Francesco
Le 08.05.23 à 17:36, Christian-Emil Smith Ore via
Crm-sig a écrit :
Also: RDF(S) is an implementation technology. We
can assume that there exists a implmentation
function from the CRM-FOL to RDF(S), but this
may not be a 1-1 function. Strange constructs
like the PC0(?) may not have counterparts in
CRM-FOL. Changing the ontology on the bases of
special tricks used in the implementation may
not always be a good idea, but may inspire us to
make well thought out and consistent changes in
the ontology.
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
Rob Sanderson
Senior Director for Digital Cultural Heritage
Yale University
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig