Re: How do you deprecate URIs? Re: OWL-DL and linked data

Dan Brickley Wed, 09 Jul 2008 03:21:18 -0700


Richard Cyganiak wrote:

On 9 Jul 2008, at 00:11, Bijan Parsia wrote:
[big snip]
Complaining that the Big Nasty People Who Know What They're TalkingAbout are raining on your sameAs parade isn't constructive.
Ah Bijan. How about *you* grow up, flameboy?

(Please soften your language, both of you. Consider picking up the phoneinstead.)

You keep asserting that There Are Technical Problems With Using sameAs.It would help your argument if you told us what those technical problemsactually *are*. I heard you say that using owl:sameAs could bite us inthe butt. Could you be more specific?

The core idea is quite simple, and relates to the notion of when two(rdf/owl) documents are describing (typically amongst other things) thesingle same entity, ...the same thing. In cases where it is true to saythey describe the same entity, the term 'owl:sameAs' is one handy way toexpress that situation. In cases where the two documents describedifferent entities, it is not true to say that owl:sameAs holds betweenthem. This is all irrespective of which document (if any) the owl:sameAsclaims are made in, and purely cast in terms of whether the claim istrue. And the main thing to remember about OWL here is that if theowl:sameAs claim is true, and we believe both of the docs, allinformation about that entity written in both documents gets pooled.

Many people in this forum, including me, do not have a background informal logics. Without that background, it is hard to distinguish properuses of owl:sameAs from improper uses of owl:sameAs.

This is true, regarding the list. There are people from a great varietyof backgrounds around here. And on a good day, that is one of our strengths.

A side note: The reason why I advocate the use of owl:sameAs is not thatit's the *right* solution. But it's *the only solution that wasavailable*. The alternative would have been to argue for a year or twoinstead of linking up our datasets. Not compelling. That being said, I'mvery interested in hearing your take on when I should use owl:sameAs andwhen not.

One metric here might simply be: what % of owl:sameAs claims in the LODscene are false claims. However, that isn't itself always a bad thing.Sometimes publishing false information online has value - for example,historical data. Life is a lot easier though if at least the identityreasoning we do is based on reliable information. For this reason,publishing false identity claims can be a lot more destructive thanpublishing other kinds of falsehood. The LiveJournal RDF/FOAF datasetfor example might be full of 10s of 1000s of fake birthdate properties.We kinda expect that. And we should also expect to see a rise in spamblogs making false identity claims too about their owners. Dealing withthe latter is a bigger pain though. For datasets that come fromrelatively trusted sources, it is a big win if we can believe theidentity-related claims they make.

If the best data / tools you have suggest that two docs/datasets aredescribing the selfsame entity, using owl:sameAs seems fine, even if youhave a secret hunch you're only perhaps 95% confident of the dataquality or tool reliability. If the best information you have instead istelling you "these two documents seem to be talking about more or lessthe same notion", then owl:sameAs probably isn't for you: it doesn'tcommunicate what you know. Which of these situations you're in might besomething of a judgement call, but it should be a judgement callgrounded in clarity about what a use of owl:sameAs is claiming.

I doubt we can get very far with this in the absence of examples. Wouldanyone like to collect up a dozen various owl:sameAs claims publishedexplicitly in the Web that might be considered questionable? (for nowlet's set aside cases where owl:sameAs is implied by other constructs).


cheers,

Dan

--
http://danbri.org/

Re: How do you deprecate URIs? Re: OWL-DL and linked data

Reply via email to