Dear George et al.
I completely follow your arguments. Teachability is an important
concern. At the moment, we make a convoluted discussion.
Let us separate:
Question a) Is the logical formulation as I sent it consistent, and does
it solve the logical problem. (I believe yes).
Question a1) Would the link make any difference logically (I believe
not, and expect objective arguments in support of), and would it reduce
or increase complexity.
Question b) Does the model describe adequately reality, independent from
complexity concerns. (I believe yes, and I believe Robert's examples
show a valid point. Think of reliquary caskets etc.)
Question c) To whom can it be taught as such. (I believe we have a
serious problem here)
Question d) Which is the user community of the spatiotemporal concepts
(I believe it is distinct, archaeologists, possibly conservators)
I suggest first to finish the issue by answering a) and b).
It was an immense work to clarify all these relationships. So we should
keep this separate.
Then I suggest to reconsider where the whole bunch of spatiotemporal
topology belongs to, and how to maintain the consistency we have arrived at.
Therefore we need to identify to total of concepts making up this part.
We may separate the standard into two chapters, or we move the thing
out, tightly interlaced with CRMbase, into an extension. Then we have a
question of maintaining identifiers, and people that would maintain the
distinct docs in sync.
Dividing into two chapters may be a way to teach CRM without the STVs
for those not interested in these deductions.
Best,
Martin
On 3/12/2019 11:23 PM, George Bruseker wrote:
Dear Martin et al.,
Although we are on a well trodden path, it seems to still engender
questions so perhaps worthy meandering on for a bit longer, not
necessarily to belabour any previous decisions but at least to ponder
them some more.
I wonder about the following:
1) did STV not enter CRMbase when we still thought to include all
extension top classes and relations to base? Now that this is no
longer the case (we will specify in a separate document all top level
classes and relations, thereby alleviating Base of having to hold all
possible top level classes and properties), is there a case for
reassessing whether STVs belong in base at all? I do not of course
question their utility, but they did arise because of questions in
CRMgeo. Are they a necessary feature of base?
Secondly, the lay person will see a knowledge graph of instances, and
not a theory. A knowledge graph with a lot of trivial links in my
opinion makes the model less intuitive to *use*.
and
I believe we cannot avoid entering some complexity here in our
discussions, and resolve it giving priority to the end-user schema.
I think the first arguments should be, if the final schema is
confusing, and if the alternative is less confusing.
2) I think we all agree that that understandability and teachability
is not at all an unimportant factor, especially for something that is
supposed to be applied by domain specialists. If the concept cannot be
communicated to the community it is supposed to serve, is it serving
the community? The lay person doing mapping and modelling (using it)
will need to understand these constructs and apply them correctly. We
argue that it is this domain/lay person who should ultimately
translate or validate their data. Therefore, the concept should be
understandable and clear to them (without having to retrain in another
discipline). They should more or often than not create intended models.
3) For those people who want and know how to use an STV, the fact that
they would populate the knowledge base distinctly would not be a
disadvantage but an advantage, as you could query it distinct from the
entity of which it was an STV. For everyone else, they simply would
not instantiate STVs and so would not have to worry about how they
work or what their properties are or if they cause an extra join.
A knowledge graph with a lot of trivial links in my opinion makes the
model less intuitive to *use*. Moreover, all our RDF databases are
still very bad following links. Any additional join has high cost.
Still, most CRM implementations materialize a huge number of paths to
increase performance.
4) If we change the ontology based on the present technological
capacities, are we not violating a modelling principle?
The alternative you are advocating for is:
a) Fill the database with a very large number of necessary 1:1
links: events are some of the the most frequent items we have.
5) In practice, I have never seen a knowledge base of CRM that
automatically creates for example the STV to an E2 (or something
similar). If someone (most CH users?) isn’t interested in STVs, they
don’t use them.
I am most intrigued though about Rob’s example. It seems like a
potentially reasonable way of interpreting E18 when it is a subclass
of E92 by a user of the ontology. I also think we do not actually want
to allow or encourage this modelling pattern by a user of the
ontology. I believe we do not want to do so because an E18 and its E92
are actually distinct, and we cannot say the same thing about the E18
and an E92 (but our model leads one to believe they are the same). We
can’t say that an STV is married, but we can say Rob is married. If
this doesn’t hold (Rob’s STV can be married), then it’s not clear to
me why Rob’s modelling is wrong, although the result is of course a
bevy of weird instances. I think the point here is that Rob’s
interpretation is not out to lunch. It is one which one could assume
would be an intended model. It is not, however, and why it is not
cannot clearly be specified possibly because we have said E18 isA E92.
Best,
George
------------------------------------------------------
Dr. George Bruseker
Coordinator
Centre for Cultural Informatics
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
Science and Technology Park of Crete
Vassilika Vouton, P.O.Box 1385, GR-711 10 Heraklion, Crete, Greece
Tel.: +30 2810 391619 Fax: +30 2810 391638 E-mail:
[email protected] <mailto:[email protected]>
URL: http://www.ics.forth.gr/isl
On Mar 12, 2019, at 11:18 AM, Martin Doerr <[email protected]
<mailto:[email protected]>> wrote:
Dear Christian-Emil, All,
Thank you for your explanations. I agree with you technically completely.
I propose to have cardinality (1,1:0,1) for P4 and P160.
The scope note of P4 must be modified, as you say. We have discussed
already, that alternative opinions are questions of the knowledge
base, and not of the ontology.
I only disagree:-)with:
"The model will .... and hopefully be more intuitive for the lay
persons." with the additional link:
Firstly, I have made the argument, and got no response, that
introducing a link between E4 and E92 does*not*solve the problem of
equivalence of P4 and P160. It is still exactly the same,
only*more*complex to formulate: we have to equate a path with a
single link.
Secondly, the lay person will see a knowledge graph of instances, and
not a theory. A knowledge graph with a lot of trivial links in my
opinion makes the model less intuitive to*use*. Moreover, all our RDF
databases are still very bad following links. Any additional join has
high cost. Still, most CRM implementations materialize a huge number
of paths to increase performance.
.......
I still do not see, why we should reduce performance because we find
it difficult to explain the theory;-)
Basically, as you say, we repeat old arguments here, long before
decided, without new insight.
Best,
Martin
On 3/12/2019 5:04 PM, Christian-Emil Smith Ore wrote:
I resend the email, without the long tail of the previous emails due
to length restrictions.
Best
Christian-Emil
------------------------------------------------------------------------
*From:*Christian-Emil Smith Ore
*Sent:*12 March 2019 15:24
*To:*Martin Doerr;[email protected]; 'George Bruseker'
*Cc:*'crm-sig'
*Subject:*Re: [Crm-sig] Issue 326 Resolving inconsistencies between
E2, E4, E52 and E92
Dear all,
The issue 326 is old. I made some slides (dated 31/3/2017) which can
be found at
http://www.edd.uio.no/download/cidoc_crm/issue-326-overview-and-thoughts-HW.pptx
The exchange of emails has two topics:
1) E18 Physical Thing as a subclass of E92 Spacetime Volume
2) the properties P4 and P160
**********
1: In my opinion it is model theoretically correct that E18 Physical
Thingas a subclass ofE92 Spacetime Volume. However, it may
be confusing for persons not so interested in theory. Therefor we
could introduce a property Pxx E18 Physical Thing <-> E92
Spacetime Volumewith the cardinality (1,1:0,1)describing the the
(model theoretical) fact that a part of E18 Physical Thingis in a
1-1 correspondence with a subset of E92 Spacetime Volume.
The model will still have the same explanatory power, and hopefully
be more intuitive for the lay persons.
***********
2:
In the slides I give the following comment:
"The cardinality of P4 has time-span is (1,1:1,n), that is, two or
more instances of E2 Temporal Entity can “share” an instance of E52
Time-span. This was introduced in an early stage to model simultaneity.
This way of modeling simultaneity is considered obsolete and the
cardinality of P4 should be (1,1:1,1)-
E2 Temporal Entity and E52 Time-span in a one to one relation
E2 Temporal Entity and E92 Spacetime Volume in a one to one relation. "
Please, note that P4's cardinality states that every instance of P4
is connected to one and only one instance of E52 Time-span.
Therefore, the number of instances of E52 Time-span will be equal or
less than the number of instances of E2 Temporal Entity.
The number of instance of E92 Spacetime Volume and E2 Temporal
Entity will always be equal due to the cardinality (1,1:1,1) of P160
has temporal projection. E4 Period is a subclass of E92 Spacetime
Volume and has less than or equal number of instances. The
cardinality of P160 when lowered to
P160: E4 Period <-> E52 Time-span
must have the more strict cardinality (1,1:0,1), that is, it is an
injection of E4 Period into E52 Time-span. There may exist instances
of E52 Time-span which are not related to an instance of the
subclass E4 Period
Correspondingly:
P4: E4 Period <-> E52 Time-span
must have the cardinality constraint (1,1:0,n).
The scope note of P160:
“This property describes the temporal projection of an instance of
an E92 Spacetime Volume. The property P4 has time-span is the same
as P160 has temporal projection if it is used to document an
instance of E4 Period or any subclass of it.”
So the formulation discussed in the emails is already there.
The scope note of P4:
“This property describes the temporal confinement of an instance of
an E2 Temporal Entity. The related E52 Time-Span is understood as
the real Time-Span during which the phenomena were active, which
make up the temporal entity instance. It does not convey any other
meaning than a positioning on the “time-line” of chronology. The
Time-Span in turn is approximated by a set of dates (E61 Time
Primitive). A temporal entity can have in reality only one
Time-Span, but there may exist alternative opinions about it, which
we would express by assigning multiple Time-Spans. Related temporal
entities may share a Time-Span. Time-Spans may have completely
unknown dates but other descriptions by which we can infer knowledge.”
The formulation “A temporal entity can have in reality only one
Time-Span, but there may exist alternative opinions about it, which
we would express by assigning multiple Time-Spans.” should be
deleted. Such multiple assignment due to uncertainties or
alternative opinions is the case for many properties in CRM.
In my opinion “Related temporal entities may share a Time-Span.”
should also be deleted and the cardinality of P4 (E2 Temporal Entity
<-> E52 Time-span) made stricter to (1,1:1,1).
Best,
Christian-Emil
------------------------------------------------------------------------
*From:*Crm-sig<[email protected]>on behalf of Martin
Doerr<[email protected]>
*Sent:*12 March 2019 11:09
*To:*[email protected]; 'George Bruseker'
*Cc:*'crm-sig'
*Subject:*Re: [Crm-sig] Issue 326 Resolving inconsistencies between
E2, E4, E52 and E92
Dear Steve, George,
Your arguments well taken, I may remind you that the argument was
not only a 1:1 relation.
It contained 4 elements:
a) a 1:1 relation
b) a common identity condition: The identity of the STV depends on
the identity of the phenomenon
c) There existence conditions are identical: the one exists where
and as long as the other
d) Properties do not interfere.
The condition d) becomes more tricky with the question of the time
spans, as you have seen. Here, the question for me is not
ontological, but of the logical formalism. As I have shown, it can
be described in FOL. It is the only complication we have. We just
declare two properties to be identical downwards.
The alternative you are advocating for is:
a) Fill the database with a very large number of necessary 1:1
links: events are some of the the most frequent items we have.
b) You have not solved anything wrt P160, because P4 is still the
same as P160 in these cases, and the path of correspondence is even
more confusing.
So, we just buy in a much more confusing schema, to my opinion. The
schema is what we use on a daily base. Discussing CRM extensions is
not the end-users interest, but the task of the SIG.
I believe we cannot avoid entering some complexity here in our
discussions, and resolve it giving priority to the end-user schema.
I think the first arguments should be, if the final schema is
confusing, and if the alternative is less confusing.
I am not sure where to publish adequately the above reasoning. It
should be somewhere buried in the minutes. But we tried very hard to
make the things clear in the scope notes of E4, E18.
What do you think?
All the best,
Martin
_______________________________________________
Crm-sig mailing list
[email protected]
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:[email protected]
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
[email protected] <mailto:[email protected]>
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email: [email protected]
Web-site: http://www.ics.forth.gr/isl