Re: Minting URIs is bad?

Dan Brickley Sun, 01 Feb 2009 23:53:45 -0800


Sergio:

do we want to create this (artificial) URIs?


Richard:

> You don't state any reasons against using URIs, you just say that you
> prefer not to use them. So please clarify: What do you gain by not
> introducing your own URI?

There are a few considerations...

One reason to avoid creating artificial URIs is when we do not want to raise expectations about their longevity or maintenance.

Another is when we don't want to confuse others about the 'real' / main / official URI, i.e. we suspect the things have well-known identifiers, we just don't know what they are. Or perhaps we have other reasons (business, IP etc.) for not yet publishing the real identifiers.

These two cases can be addressed by providing some more minimal metadata about the identifiers. For example, that everything beginning http://tmpid.danbri.org/ is transient and may not be dereferenceable after 2 weeks. Some pieces of POWDER might be re-usable here.
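
A minimal sketch of how a data consumer might act on that kind of prefix-level metadata. This is not POWDER itself: the policy table, the function names and the "first seen" timestamp are assumptions made for illustration, and only the prefix and the two-week lifetime come from the example above.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical policy table keyed by URI prefix. Only the prefix and the
# two-week lifetime are taken from the message; the structure is assumed.
PREFIX_POLICIES = {
    "http://tmpid.danbri.org/": {"transient": True, "lifetime": timedelta(weeks=2)},
}


def policy_for(uri):
    """Return the policy of the longest matching prefix, or None."""
    matches = [prefix for prefix in PREFIX_POLICIES if uri.startswith(prefix)]
    if not matches:
        return None
    return PREFIX_POLICIES[max(matches, key=len)]


def may_be_gone(uri, first_seen, now=None):
    """True if the URI is declared transient and its lifetime has elapsed."""
    policy = policy_for(uri)
    if policy is None or not policy["transient"]:
        return False
    now = now or datetime.now(timezone.utc)
    return now - first_seen > policy["lifetime"]
```

A consumer could call may_be_gone() before dereferencing a stored URI and skip, or re-resolve, identifiers whose stated lifetime has passed.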

A third case (not directly Void-related) is where the entity being identified is a Person or other entity that has associated social or business sensitivities.

If I convince the world that http://ids.danbri.org/richard_cyganiak is a fine identifier for the person whose personal mailbox is [email protected], then I put myself in some position of advantage (and responsibility) with respect to online information-linking regarding that person. My webserver sees every de-referencing of the URI. I see timing information, HTTP REFERER, HTTP USER AGENT, and more. I also probably have some responsibility to publish accurate (non-libelous etc.) information. This covers the nature of the RDF claims I intentionally publish (e.g. there have been various cases like http://news.cnet.com/2100-1025_3-5984880.html w.r.t. Wikipedia accuracy; DBpedia re-users should bear this in mind). But it also covers things like server security. If the server is hacked or otherwise compromised, the descriptions served at the URI are at risk.

Also, if the URIs are http: rather than https: because someone didn't want to run SSL or pay an admin fee for a certificate check, the data service is less reliable (faked wifi access could substitute bad data, for example). For many cases on the Web, this is not a big deal. But when you are claiming that some URI serves as a reliable "identifier" for the thing it describes, there are extra layers of care and expectation to consider.

The authenticity aspects of this third case can probably be addressed, at least partially, with digital signatures. I have been poking around XML Signature lately. The privacy aspect is harder. Parties who claim to be publishing URI identifiers for entities such as people, businesses, or content owned by others should at least have very clear terms-of-service and privacy policy documents. This is easier said than done, particularly in large or legally cautious organizations. Or with informal opensource-style projects, for that matter.

In such scenarios, uuid:, tag: or description-by-reference identification practices still have some value. But I agree, everything goes much more smoothly when we have the luxury of a nice URI to join the data with!
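
For illustration, a small sketch of minting such non-dereferenceable identifiers. The helper names and the "people/example" path are hypothetical. Python's standard library emits the urn:uuid: form rather than a bare uuid: scheme, and the tag: helper only follows the general tag:<authority>,<date>:<specific> shape from RFC 4151 without validating its inputs.

```python
import uuid
from datetime import date


def mint_uuid_urn():
    """Mint a random identifier of the form urn:uuid:<uuid>."""
    return uuid.uuid4().urn


def mint_tag_uri(authority, on, specific):
    """Build a tag: URI shaped as tag:<authority>,<date>:<specific>.

    No validation is done here; the caller is assumed to pass a domain
    name (or email address) that it controlled on the given date.
    """
    return f"tag:{authority},{on.isoformat()}:{specific}"


if __name__ == "__main__":
    print(mint_uuid_urn())
    print(mint_tag_uri("danbri.org", date(2009, 2, 1), "people/example"))
```

Neither kind of identifier causes a request to anyone's webserver when it is used, which sidesteps the logging and hosting concerns above, at the cost of having nothing to dereference.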


cheers,

Dan

--
http://danbri.org/
