Re: Various issues with using CURIEs in OWL

Bijan Parsia Fri, 10 Apr 2009 03:35:44 -0700

(Sean is my AC rep.)
On 10 Apr 2009, at 00:15, Shane McCarron wrote:

My (personal) comments inline:

Bijan Parsia wrote:
The OWL Working Group had intended to delegate our URI abbreviationmechanisms both for in-spec and in-concrete-syntax use. OWL has anumber of different concrete serializations (including 2 XML basedand 2 non-XML based), all of which use (or I would like to use)CURIEs.
Unfortunately, while trying to use the CURIE spec, I (and others)have found that the current CURIE spec does not meet the WG needseven putting aside concerns about the ultimate disposition of thedocument:
1) For non-XML host language: The CURIE spec provides no mechanism(although it provides permission) for excluding characters from thesyntax of the local part of CURIEs. This means that in hostlanguages which use symbols like ")" or "[" as part of theirsyntax, we run into parsing ambiguities. Note that safe CURIES donot solve this problem as the safe CURIE delimiters are common hostlanguage delimiters.
PROPOSED FIX: Ideally, there would be a "mimimalistic" CURIEprofile, ideally something like SPARQL's abbreviation mechanism.Even QNames would be fine (though we'd recommend the spec point outthat to cover all URIs there should be a non-abbreviated form).
The lexical form of a CURIE is an optional prefix, separator, and areference. Are you saying that the characters permitted in prefix(NCName) or reference (irelative-ref as defined in the IRI spec) aretoo rich a set of characters?


Reference, yes.

And that in your use you needed to make this collection ofcharacters less rich?


Yes.

 If so, I agree that this is permitted by the specification.

But this gives me no reason to use the spec, esp. with a normativereference.

Without a specific subsetting mechanism (e.g., for the datatype, onecould define by restriction) I think adopting a different set ofCURIEs just means not adopting the CURIE spec.


Contrast our use of the IRI  and SPARQL spec:
        http://www.w3.org/2007/OWL/wiki/Syntax#IRIs

fullIRI := an IRI as defined in [RFC3987], enclosed in a pair of < (U+3C) and > (U+3E) charactersprefixName := a finite sequence of characters matching the as PNAME_NSproduction of [SPARQL]


I think there are three reasonable categories of CURIE:

        Exactly QName
        What SPARQL currently does
        Full irelative-ref for reference

There are a couple of others I could imagine (i.e., with %encoding forstrict acsii). But without at least these I don't think the CURIE specis something SPARQL or OWL should use.

Note that *permission* to make a subset isn't all that helpful. Imean, then we're
just doing our own thing, yeah?
Not really - it means you are defining a subset or profile of acommon mechanism,

We disagree strongly. Without a defined subsetting mechanism, it'sjust not helpful. It *might* have been helpful with defined processingmodels...but we don't have that.

Thus, you've not convinced me. At the moment I am better off ignoringthe CURIE spec.

and that a CURIE expressed in that subset would be semanticallystill a CURIE. One reason for using a common datatype is that ithelps with comprehension.

? Comprehension support is not a goal. Specification factoring orimplementation interop are.

I find it very hard to believe that having to read another specimproves comprehension.

EDITORIAL NOTE: Many of us found the organization of the spec, andespecially of the normative parts, very confusing. See:<http://www.w3.org/mid/[email protected]>
I suggest that "Usage" and "Examples" be consolidated, and thatthere are two normative sections, "Syntax" and "IncorporatingCURIEs into Host Languages" which contain the respectiveconstraints. The second section could usefully be broken down into"XML host languages" and "Non-XML host languages".
Thanks for this. We are already done with CR more or less, but Iwill see what I can do.

I don't see how you can get out of CR to PR, looking at yourimplementation report. At this stage, I'm now asking Sean, my AC rep,to oppose such a transition.

Speaking as a spec implementor who sincerely tried to use the CURIEspec, I think there are problem that merit serious changes to thedesign of the language. This means another LC, if I'm not mistaken.

2) For XML host languages: The requirement to support the XMLnamespace based prefix declaration mechanism, even when analternative mechanism is supplied, is simply a non-starter. Many inthe XML world are hostile to the namespace based overloaded (evenfor proper QNames! see RELAX NG and Schematron). But being forcedto support *two* mechanisms, especially when one of them isn'tdesired, is unnecessarily restrictive and leads to the secondmechanism not being used:
  <http://www.w3.org/mid/29397.1237034...@ubehebe>
The XHTML 2 Working Group has already agreed to remove thisrestriction.


Great. That seems to trigger another LC.

In fact, what we agreed was that it was the host language'sresponsibility to define its prefix mapping mechanim(s).

Well...if that means that we all reinvent ours, then I don't thinkit's a good idea. For me, this means that the CURIE spec is not a rectrack sensible document, but would be better as a note.

3) For XML host languages: There's no reason not to have a standardprefix declaration mechanism in the XML namespace. What value isthere in letting XML host languages coin a bunch of different ones?
For example, <xml:Prefix name="" IRI=""/> is (basically) the syntaxwe're adopting, except with Prefix in the OWL namespace.
Perhaps. The XHTML 2 Working Group does not have authority to messin the xml space.


Ok, use your own, namespace. xml namespace would be better.

 I am sure the group will discuss your suggestion.


Thanks!

4) Processing: In some languages, multiple declarations of a prefixhave an overriding behavior. In OWL we chose to make that a syntaxerror. The CURIE spec should make clear the processing model.
We believe the processing model is completely host-language specific.

I don't think that's helpful. There are at least 2 sensible, fairlycommon, processing models:

        Error on redefinition
        Lexically nearest wins

Both are in common use. Define them. Provide a way to reference them.

The concept of a CURIE, that is an abbreviation that maps to anIRI, is general. The expression of that concept in a host languageis necessarily going to be related to that host language. Forexample, were you to use CURIEs in HTML you would not want to usesome "xml" mechanism to map a prefix.

Sure, but, uhm, HTML is not an XML host language. And I'm confused asto why we're talking syntax rather than processing.

To sum, I, personally, don't think the CURIE spec helps either withimplementation interop or with spec factoring, though I think itcould be made to. Thus, in its current form, there's no point inciting it and, thus, no real point in it being a recommendation.The minimal necessary changes from my pov are:
   A) A proper XML mechanism with no requirement to suport xmlns
   B) Sensible profiles (I suggest, QName/RDF, SPARQL, and ALL)
   C) A processing model
C could maybe be dropped. A is totally required. I just won'tadhere, or recommend anyone adhere, to the requirement to usexmlns. It's a nonstarter. Thus I won't use or recommend people usethe CURIE spec (in its current form) for XML based host languages.
I think we have already addressed this requirement. Thanks forreinforcing it though.


Great! I look forward to the next LC.

I won't use or recommend citing the CURIE spec without B for non-XML host languages. If you are happy with this being "using curies"then ok :)
Hope this helps.
I think it did! I really appreciate your taking the time to sendthis. The working group will get you a formal response in due course.


Great!

Cheers,
Bijan.

Re: Various issues with using CURIEs in OWL

Reply via email to