Excellent!
Contributions are most welcome, in particular if they do not propose the
most simple solution, but point to problems in practice of existing
methods (such as structured encoding of personal names). Clearly, all
solutions need to be associated with a practical scope, which should not
be named "vast majority", but be a concrete application profile.
For instance, xsd:dateTime goes back I think a billion years, but not 4
or 14 billion years. We can declare that out of scope.
Looking forward to that,
Martin
On 3/8/2018 10:02 AM, Richard Light wrote:
Martin,
Thanks for updating the string part of the RDF implementation doc.
I was thinking last night that maybe we should focus our RDF efforts
on exactly this issue: the representation of the CRM primitive classes
E60, E61 and E62 in RDF. The current RDF document is becoming quite
wide-ranging in its scope, and (for example) you have questioned
whether certain sections belong in it. If we concentrate on this
single aspect of the broader RDF issue, I think we can produce
something which is of practical value relatively quickly. In
particular, I would like to devote time to this during the Lyon meeting.
It seems to me that there are three elements which need to be
considered when recommending an approach:
* the CRM's own view on what information should be expressible, and
how (in an abstract sense) it should be represented
* RDF and other W3C/ISO recommendations and standards for
representing string-type information
* the view of communities of practice on the issues involved, and
the solutions they have come up with
In particular I think it important that we should consult widely on
this issue, and be seen to take account of existing community practice.
Best wishes,
Richard
On 06/03/2018 17:54, Martin Doerr wrote:
Dear Richard,
It would be really great if you could join our next meeting!
We need your help to finish the RDF guidelines.
I have rewritten the string part in the google doc:
"Recording string values
As mentioned in point 3 above, the RDFS Schema does not implement the
CRM primitive classes E60 Number, E61 Date or E62 String. Instead it
specifies rdfs:Literal as the range of properties which would
otherwise take one of these values:
*
P3_has_note [String]
*
P57_has_number_of_parts [Number]
*
P79_beginning_is_qualified_by [String]
*
P80_end_is_qualified_by [String]
*
P81_ongoing_throughout [Time primitive] [but see Note 8 above and
section on dates below]
*
P82_at_some_time_within [Time primitive] [but see Note 8 above
and section on dates below]
*
P90_has_value [Number]
The recommended RDFS implementation of the CIDOC CRM may further
refine the range of these properties to more specific datatypes, if
not yet done.
Recording names
Apart from the seven properties listed above, there are a number of
situations where the fully-worked-out path to a string value leads to
an unduly long chain of classes and properties. For example:
E55_Type > P1_is_identified_by > E41_Appellation > P3_has_note >
E62_String
Documenting an instance of E41_Appellation with a URI of its own, is
only useful if the instance is expected to be either an object of
discourse regardless what it identifies, such as etymology or name
variants etc., or if it needs an extended content model with
meaningful parts, such as a street address.
In cases where there is nothing more to say about the
E41_Appellation, P1_is_identified_by shouldbe replaced by rdfs:label
(“rdfs:label is an instance ofrdf:Property
<https://www.w3.org/TR/rdf-schema/#ch_property>that may be used to
provide a human-readable version of a resource's name”, in: RDF
Schema 1.1)
E55_Type > rdfs:label > rdfs:Literal
<https://www.w3.org/TR/rdf-schema/#ch_literal>.
Since RDFS does not qualify the range of rds:label further, we cannot
formally make rdfs:label a subproperty of P1_is_identified_by or
vice-versa. We can only register the convention and take care that
query systems retrieve labels together with instances of
P1_is_identified_by . The fact that the same name “Martin Doerr” may
appear in different encodings is inevitable. It is recommended to use
name spelling conventions from library cataloguing rules and SKOS
properties for instances of E55_Type.
"
Please comment!
I have discussed with George that we should make several distinctions:
Only digitized content can be stored in-line in the KB as Literal.
There must be a comparable way to point to a digitized content via
URI, URL, or literal. All representations of Symbolic Objects in
electronic form are ambiguous wrt the the intended level of symbolic
interpretation: Is it the bits, or the Latin1 characters, or
characters + font make up its identity?
We must distinguish between digital content of a symbolic object, a
general "note" about an individual, and values in a mathematical/
physical space.
I recommend NOT to recommend rdf:value:
"5.4.3 rdf:value rdf:value is an instance of rdf:Property
<https://www.w3.org/TR/rdf-schema/#ch_property> that may be
used in describing structured values. rdf:value has no
meaning on its own. "
We definitely need a recommendation for names, regardless how complex
it becomes.
When we created the RDF version, there were no datatype
recommendations. Now, that there are, we should remove "rdfs:Literal
from all properties in which it is unambiguous.
I kindly ask you to check
https://www.w3.org/TR/2004/REC-rdf-mt-20040210/#dtype_interp for
compatible datatypes. This must be well-justified. E.g.,
"P57_has_number_of_parts [Number]" should have range:
"xsd:nonNegativeInteger", and not "xsd:decimal".
E60 Number could be any value from the mathematical multidimensional
spaces made of real numbers, such as RGB images. We have no
super-representation in RDFS/XSD. We can enumerate compatible datatypes:
|"xsd:decimal|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#decimal>,
|xsd:float|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#float>,
|xsd:double|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#double>,
|xsd:hexBinary|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#hexBinary>,
|xsd:base64Binary|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#base64Binary>,
|xsd:integer|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#integer>,
|xsd:nonPositiveInteger|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#nonPositiveInteger>,
|xsd:negativeInteger|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#negativeInteger>,
|xsd:long|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#long>, |xsd:int|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#int>,
|xsd:short|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#short>,
|xsd:byte|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#byte>,
|xsd:nonNegativeInteger|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#nonNegativeInteger>,
|xsd:unsignedLong|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#unsignedLong>,
|xsd:unsignedInt|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#unsignedInt>,
|xsd:unsignedShort|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#unsignedShort>,
|xsd:unsignedByte|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#unsignedByte>,
|xsd:positiveInteger",|
<http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#positiveInteger>
E61 Timeprimitive could be completely replaced by xsd:dateTime,
without causing incompatibilities if more precision/ coverage would
be needed.
"Spaceprimitive" should be a WKT string, I think.
Should E62 be xsd:string, or would that cause another outcry to be
too complex?
If someone does not convert values into xsd, is that "incompatible"?
Best,
Martin
--
--------------------------------------------------------------
Dr. Martin Doerr | Vox:+30(2810)391625 |
Research Director | Fax:+30(2810)391638 |
| Email:[email protected] |
|
Center for Cultural Informatics |
Information Systems Laboratory |
Institute of Computer Science |
Foundation for Research and Technology - Hellas (FORTH) |
|
N.Plastira 100, Vassilika Vouton, |
GR70013 Heraklion,Crete,Greece |
|
Web-site:http://www.ics.forth.gr/isl |
--------------------------------------------------------------
--
*Richard Light*
--
--------------------------------------------------------------
Dr. Martin Doerr | Vox:+30(2810)391625 |
Research Director | Fax:+30(2810)391638 |
| Email: [email protected] |
|
Center for Cultural Informatics |
Information Systems Laboratory |
Institute of Computer Science |
Foundation for Research and Technology - Hellas (FORTH) |
|
N.Plastira 100, Vassilika Vouton, |
GR70013 Heraklion,Crete,Greece |
|
Web-site: http://www.ics.forth.gr/isl |
--------------------------------------------------------------