On 23 Mar 2012, at 22:42, Jonathan A Rees wrote: > On Thu, Mar 22, 2012 at 4:21 PM, Jeni Tennison <[email protected]> wrote: > >> While there are instances of linked data websites using 303 redirections, >> there are also many examples of people making statements about URIs >> (particularly using HTML link relations, RDFa, microdata, and microformats) >> where those statements indicate that the URI is supposed to identify a >> non-information resource such as a Person or Book. > > Can you provide a handful of these Doing It Wrong URIs please from > various sites? I think it would really be helpful to have them on hand > during discussions.
OK. These picked up from dumps made available by webdatacommons.org, so very grateful to them for making that available; it can be quite hard to locate this kind of markup generally. Also I've used Gregg's distiller [1] to extract the RDFa out of the documents to double-check. http://www.logosportswear.com/product/1531 -> 301 -> http://www.logosportswear.com/product/1531/harbor-cruise-boat-tote which contains the RDFa statement <http://www.logosportswear.com/product/1531> a <http://rdf.data-vocabulary.org/#Product> ; . The URI is intended to identify a product, not a web page. http://developer.yahoo.com/yui/docs/YAHOO.util.Dom.html contains RDFa statements that state that this web page contains events, methods and properties: <http://developer.yahoo.com/yui/docs/YAHOO.util.Dom.html> yui:attributes <#configattributes>; yui:description """ Provides helper methods for DOM elements. """; yui:events <#events>; yui:methods <#methods>; yui:name "YAHOO.util.Dom"; yui:properties <#properties> . From the statements, the intention is for the URI to identify the (programming language) Object, not a web page (despite the .html on the end!). http://gondwanaland.com/mlog/2005/03/13/semweb-not-by-committee/ contains the RDFa statements <http://gondwanaland.com/mlog/2005/03/13/semweb-not-by-committee/> dcterms:publisher <http://gondwanaland.com/mlog/> ; sioc:has_owner <https://creativecommons.net/ml/> ; . The range of dcterms:publisher is a dcterms:Agent, but http://gondwanaland.com/mlog/ returns a 200. The range of sioc:has_owner is a sioc:UserAccount, but https://creativecommons.net/ml/ returns a 200. http://www.feedbooks.com/book/2679 contains microdata statements. How you should interpret these as RDF is obviously debatable but the obvious thing to do is for a href attribute to indicate the resource that it targets, so the page includes the statements [ a schema:Book ; schema:author <http://www.feedbooks.com/author/496> ; ] The range for schema:author is intended (I think) to be a person rather than a web page about a person, but resolving http://www.feedbooks.com/author/496 gives you a 200. (Based on the webdatacommons.org dumps, this site used to serve up RDFa that stated that <http://www.feedbooks.com/book/2679> identified a Book; books are not web pages.) http://www.mybanktracker.com/Citibank/Profile contains the RDFa statements <http://www.mybanktracker.com/Citibank/Profile> v:dtreviewed "2012-01-05 16:42:49"@en-US; v:itemreviewed <http://www.mybanktracker.com/Citibank/Profile>; v:rating "4"@en-US; v:reviewer <http://www.mybanktracker.com/member/lisaehrlich>; . The review is clearly about Citibank and not the web page. The object of the v:reviewer property should, I imagine, be a person but is instead a web page. http://www.flickr.com/photos/andreaweckerle/2559011937/ used to contain the triples (according to the webdatacommons.org data) <http://www.flickr.com/photos/andreaweckerle/2559011937/> dcterms:creator <http://www.flickr.com/photos/andreaweckerle/> . <http://www.flickr.com/photos/andreaweckerle/> foaf:name "andreaweckerle" . where http://www.flickr.com/photos/andreaweckerle/ resolves to a 200 but is plainly intended here to be a Person. Those statements don't seem to be there any more. I hope that gives a flavour. Jeni [1]: http://rdf.greggkellogg.net/distiller -- Jeni Tennison http://www.jenitennison.com
