Re: RDFa in HTML 5

Philip Taylor Fri, 22 May 2009 08:21:43 -0700

As an attempt to clarify my current views:

I'm working from the basis that for any arbitrary stream of bytes servedas text/html, it should be possible to determine the set of triples thatare extracted when you apply the standard RDFa triple-extractionalgorithm to the document, and determine whether the document is valid.That behaviour should be well-defined (including for invalid inputs) andwell tested and should (eventually) match implementations, and ideallyit should be easy to implement and should match authors' expectationsand should be similar to the XHTML syntax and so on.

As an example of some arbitrary inputs, I've madehttp://philip.html5.org/demos/rdfa/tests.html [currently veryexperimental; only tested in Firefox 3.0 and Opera 9.6, has too littledocumentation and too many bugs, etc] to illustrate various cases thatmight be interesting. That also shows that current implementations arequite varied in how they handle the inputs, and so presumably theimplicit mapping from text/html to RDFa-in-XHTML is not obvious enoughby itself to ensure interoperable implementations.

Given that basis, I can't see a sensible way to solve the problemwithout relying on HTML 5 to define the mapping from an arbitrary streamof bytes to a DOM (because practical text/html parsing isn't definedanywhere else), and then defining the RDFa processing on top of that(perhaps via an explicit mapping onto another RDFa spec if that's possible).

My document was a rough attempt to show how I imagined it could bedefined in a way that would give clear answers to all the test casesabove, by building on top of HTML 5, as an alternative approach to whatI saw in http://www3.aptest.com/standards/rdfa-html/ andhttp://www.w3.org/TR/rdfa-syntax/

I don't intend this to be a competing specification - fragmentationwould certainly be bad, and (in the long term) everything should beconsistent and integrated and clear and it should all be defined inofficial RDFa specifications. (I don't think I have the time ormotivation or skill to write a proper specification for this anyway, soI'm more than happy to let other people do the work!)

It may have been a bad idea to make the document look like a spec, butI'm not sure of a better way to express what I imagine a solution couldlook like.


Responding to some specific points:

Shane McCarron wrote:

I'm sorry that my draft "profile" document doesn't answer yourquestions. Of course my intent is to evolve that profile so that, inconjunction with the other RFCs, Candidate Recommendations, andRecommendations it normatively references, it represents a thoroughdescription of the model for embedding RDFa in HTML documents.

That sounds like the best approach to the problem. My criticism of yourpublished document is coming from an understanding that it's an earlydraft and doesn't claim to be perfect and there's plenty of opportunityfor any problems to be solved in the future. (My intent is for thecriticism to be constructive, not rude - apologies if it's too much ofthe latter!)

http://lists.w3.org/Archives/Public/public-html/2009May/0127.htmlhighlighted some specific issues, but I didn't see how they could beresolved by localised changes to your existing document, which is why Iwanted to look at a more radical way of trying to resolve those issues.My way certainly isn't the best way, but I hope it can be used as apiece of feedback that will lead to a better solution in the end.

If there arethings in the CURIE spec that need clarification, then that is the placeto fix those.

Sure - perhaps my document should have said "I think the CURIE specshould be clarified by changing it to say something more like: ...".That would still have been missing the reasons why I think it should bechanged: the reasons are basically that for some of the examples inhttp://philip.html5.org/demos/rdfa/tests.html I don't see what theRDFa/CURIE specs say the output should be (mainly in terms of handlingerrors), but I don't have an exhaustive list of cases. (Would such alist be useful?)

Personally, I would rather have a quality test suite that exercises thespecification and ensure that suite gets extended to clarify any edgecases that implementors are curious about.

I would agree that's the best way to ensure the quality ofimplementations - it'd be great if the tests I linked above couldperhaps become useful as part of that.


--
Philip Taylor
pj...@cam.ac.uk

Re: RDFa in HTML 5

Reply via email to