Re: Why bound prefixes are an anti-pattern in language design

Martin McEvoy Tue, 11 Aug 2009 15:26:02 -0700

Hello all. My, this has turned into a mighty discussion ...


Ian Hickson wrote:

On Sat, 8 Aug 2009, Martin McEvoy wrote:
Three things one must do to avoid becoming a Cargo Cult scientist...

1, " researchers must first of all avoid fooling themselves"

  Reverse DNS Identifiers, They are just backwards urls right!
No, they have several properties that URIs do not: They can't be dereferenced, so there's no illusion of extra meaning; they are purely identifiers, not locators. They're shorter, and they use less punctuation, leading to a cleaner syntax.
Note that Microdata allows URIs to be used as well, though. You don't have to use reverse DNS identifiers if you don't want to.


Yes I read that in section 5.1.3

http://dev.w3.org/html5/spec/Overview.html#selecting-names-when-defining-vocabularies

I must have missed it ;) Still, reverse DNS identifiers are not really people-friendly, and they make your markup very bulky. I think microdata should not have included them, but that's my personal taste, I guess.

2, "be willing to question and doubt their own theories and their own results"
  'Prefixes are an anti-pattern and notoriously hard for authors to
understand'.
I think there's ample evidence of this. I haven't just jumped to this conclusion, I've thought about

I think that's more to do with where your thinking started from. In the RDF world (which is what the RDFa logical model is based on) prefixes are good, even necessary to convey the intended semantics of RDF; most people who are used to RDF have no trouble understanding what prefixes are for. Prefixes in the HTML world, however, are not common or convenient, and are little understood.


So another way....

The reason I have taken so long to answer is that I have been testing my own theory: "are prefixes necessary?" Forget whether they are understood or not; some say yes, some say no, and whether you use them is a matter of personal taste and style.

The simplest solution I have found is based on something you said about Twitter and its use of JSON. I'm calling it a "dataset", for want of a better word, and it uses JSON. The reason I chose JSON is that it can be parsed relatively easily by pretty much everything, and it's easy to build a validator for JSON data.

A dataset is kind of like a semantic style sheet: it's used to convey the author's intended meaning of a page to a machine without embedding the raw data into the page. The physical model and the logical model are separated.


Some examples:

Here is a page marked up with HTML5 microdata:

http://getsemantic.info/test/dataset.html

There is nothing unusual about it other than a link in the head of the page using @rel=dataset, which tells the parser where the data is.


This is the dataset http://getsemantic.info/test/data.json

You are only able to define four attributes:

"term" : "your term", e.g. date
"prefix" : "the scope of your term", e.g. dcterms
"ref" : "how the term is to be referenced", e.g. http://purl.org/dc/terms/date
"datatype" : "the datatype of your term", e.g. http://www.w3.org/2001/XMLSchema#date

The JSON data is parsed along with the HTML, matching terms from the HTML with terms in the dataset.
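
A minimal sketch of that matching step, in Python. The layout of the dataset (a JSON array of objects) and the helper names build_index and resolve are assumptions made for illustration; only the four attribute names and the example values come from the description above, and the real data.json or parser may be organised differently.

```python
import json

# Hypothetical dataset shaped after the four attributes described above.
# The real file at data.json may be laid out differently.
DATASET_JSON = """
[
  {
    "term": "date",
    "prefix": "dcterms",
    "ref": "http://purl.org/dc/terms/date",
    "datatype": "http://www.w3.org/2001/XMLSchema#date"
  }
]
"""


def build_index(dataset_text):
    """Index the dataset entries by their "term" value."""
    entries = json.loads(dataset_text)
    return {entry["term"]: entry for entry in entries}


def resolve(found_terms, index):
    """Pair each (term, value) found in the HTML with its dataset entry.

    Terms that are absent from the dataset are skipped, not guessed at.
    """
    resolved = []
    for term, value in found_terms:
        entry = index.get(term)
        if entry is None:
            continue
        resolved.append({
            "property": entry["ref"],
            "value": value,
            "datatype": entry.get("datatype"),
        })
    return resolved


# Stand-in for the (term, value) pairs a parser would pull from the page.
page_terms = [("date", "2009-08-11"), ("unknown", "ignored")]
result = resolve(page_terms, build_index(DATASET_JSON))
print(result)
```

The page itself only carries the short term ("date"); the full reference and datatype live in the dataset, which is the separation of physical and logical model described earlier.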

Here is an example of the parsed data:

http://weborganics.co.uk/test/test.php?url=http://getsemantic.info/test/dataset.html

It's all been a pretty cool experience, all in all. I have tested the above theory in RDFa too, and it works just as well.

3, "investigate possible flaws in a theory "
The whole of your design concept (linking machine data together causing a long string "foo.example.directory.page#" ) was discussed in depth over on Microformats New around two and a half years ago but if you had talked to somebody about your "Idea" maybe someone could have stopped you from wasting your time, in short It was generally thought of as a bad Idea.
Microdata is not 'linking machine data together causing a long string "foo.example.directory.page#"'; what suggested that? If the spec isn't clear about this, I should fix it. What gave you that impression?

The big long strings, i.e. org.example.animal.cat and org.example.name. OK, they are not "particularly" long strings, but I can see authors writing things like com.example.tag.cat#, and there is no real difference from what I stated above. I think it's a good idea to drop reverse DNS from the HTML5 spec; there is really no need for it to be there. If you do, I expect people will warm to microdata a lot more.


Best wishes

--
Martin McEvoy
http://weborganics.co.uk/
