[Fwd: Re: [whatwg] Trying to work out the problems solved by RDFa]

Dan Brickley Sat, 10 Jan 2009 07:02:29 -0800


Hi Steven,

(cc www-archive, libby)

Re the alumni/people page scenario, I asked on the whatwg list aboutwhether html5 is attempting any particular mechanism for saying whichbits of a page are 'comments' or untrusted. But it seems from Toby'sreply that RDFa is quite handy here.

I've been thinking about how one might use the hypertext path fromhttp://www.w3.org/ to /People and ..etc/Alumni to indicate that theyhave the same creator/publisher.


1st idea - use a custom relation like 'alumniPage'
2nd idea - generalise that - 'staffInfoPage', 'aboutOrg page'

3rd idea - generalise further - use RDF to state that those pages have adc:creator / foaf:maker which is the organization W3C4th idea - use POWDER to claim that all pages matching some URI prefixhave these properties

I think 4. is probably the way to go, but haven't dug into current stateof POWDER. The others would cause needless proliferation of propertiesand clutter each hyperlink with additional link-typing annotations.

This would allow some Org (companies, nonprofits, whatever) to say inRDF on their homepage "all HTML pages whose URI matcheshttp://eg.example.com/aboutus/*html"; are pages whose foaf:maker is theorganization whose homepage is http://eg.example.com/ and whose name is"E.G. Org.".

The point of this being that we need a way of picking out those pages(and pieces of pages) whose provenance/source is the main publisher,versus other things on the site (or in the page) that might be usersupplied. On w3.org, the msgid: proxy that includes all of lists.w3.orginto www.w3.org is a good use case; but also various W3C-linked people,WG/IG members etc., have write access to bits of the site.

In parallel to this I'm still exploring the xmldsig route. Here is atest (linked by wot:assurance from foaf.rdf) signing of my foaf file:http://danbri.org/foaf.rdf.sigdata ... although done with a randomgenerated key that I didn't write the java code to manage properly.

Use case for that is: how do we know whether to believe the foaf:tipjarproperty claim in http://danbri.org/foaf.rdf and buy danbri a book?

Hope this makes some sense! So I think next step is to check out POWDER.http://www.w3.org/TR/2008/WD-powder-primer-20081114/

I think they're using GRDDL due to the need to include quoted fragmentsof full RDF within each site 'label', something that's ugly to do inpure RDF (we tried in the earlier WCL design)...


cheers,

Dan


-------- Original Message --------
Subject: Re: [whatwg] Trying to work out the problems solved by RDFa
Date: Sat, 10 Jan 2009 13:51:26 +0000
From: Toby A Inkster <[email protected]>
To: [email protected]

Dan Brickley wrote:

While I'm unsure about the "commercial relationship" clause quite
capturing what's needed, the basic idea seems sound. Is there any
provision (or plans) for applying this notion to entire blocks of
markup, rather than just to simple hyperlinks? This would be rather
useful for distinguishing embedded metadata that comes from the page
author from that included from blog comments or similar.


While that might be useful for natural language processing, for RDFa
it is actually completely unneeded. The syntax of RDFa allows for
blocks of markup to be made "invisible" by making an ancestor node
into an XMLLiteral.

For example, a comment might be marked up as:

<section typeof="atom:Entry" xmlns:foaf="http://xmlns.com/foaf/0.1/";
xmlns:atom="http://bblfish.net/work/atom-owl/2006-06-06/#";>
  <address rel="atom:author">
    On <time property="atom:published" content="2009-01-10"
    >10 Jan 2009</time>,
    <a property="foaf:name" rel="foaf:page"
    href="http://joe.example.com";>Joe Bloggs</a> wrote:
  </address>
  <div rel="atom:content">
    <blockquote property="atom:xhtml">
      <!-- The comment goes here. -->
    </blockquote>
  </div>
</section>

The RDFa processing instructions say that as the blockquote doesn't
have an explicit datatype set, it is to be treated entirely as a
string literal (if it doesn't have any child elements) or an XML
literal (if it does), and that parsers must not look inside it for
triples. Thus spammers can't use the comment form for stuffing
triples into the page.

It should be noted in this case that RDFa also allows natural
language parsers to be made more useful. By looking at the RDFa which
marks up the author's name and website, they may be able to determine
that the comment has been written by someone other than the page's
main author, and thus not afford it the same level of trust granted
to the rest of the page. So the natural language processing can
benefit from RDFa.

--
Toby A Inkster
<mailto:[email protected]>
<http://tobyinkster.co.uk>

[Fwd: Re: [whatwg] Trying to work out the problems solved by RDFa]

Reply via email to