easy RDF from XML (was RDFa + RDF/XML Considered Harmful?)

Paul Tyson Tue, 15 Jul 2008 20:11:04 -0700


Mark Birbeck wrote:


I did think though, that one of the things about the RDF/XML structure
was an attempt to enable many XML layouts to be interpreted as RDF.
But obviously that's enormously difficult.

The striping design of RDF/XML, by design or accident, makes it very well suited to be the target of XSLT transformations. See http://lists.w3.org/Archives/Public/semantic-web/2008Jul/0037.html for a stylesheet that will transform any XML document to Infoset RDF/XML. You could of course write out the RDF graph in any other notation you choose, but RDF/XML is no more difficult than any other.

Infoset RDF might not be a big step forward, but at least it puts you into the RDF world where you can merge graphs and do whatever semantic processing you like.

What we would really like to do is vivify the meaning that the XML author was aiming for when he marked up the character stream in the first place. We won't get at that meaning from the grammar alone; we must look at the semantics of the markup itself. The direction was pointed out years ago in this article: http://xml.coverpages.org/xmlAndSemantics.html, and possibly in other articles I have not yet discovered.

In this discussion I will set aside DTDs and XML Schemas and all other such tools of the grammarians and computer scientists; for I wish to focus on the basic semantic gestures of markup itself. Structural markup, as in SGML and XML, is a means of breaking up a sequence of characters into components of interest. The syntactical rules for well-formed XML enable a primitive--yet reliable and robust--set of semantic gestures, to wit:

        - naming (components of interest can be named)
        - attributing (components can have properties)
        - sequence (a component can have a positional predecessor)
        - containment (a component can be contained in another)

Nothing could be easier than making an RDFS vocabulary of these notions.

And it is only slightly harder to modify the stylesheet referenced above to emit RDF/XML using this vocabulary. (If I were to implement this I would add a "Chunk" class to contain character strings, instead of representing them as sequences of named things with a common parent.) So you can have, with very little effort, a system that reveals, for any XML instance, the fundamental semantic gestures of its author.
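
To make the four gestures concrete, here is a minimal sketch in Python (not the XSLT stylesheet referenced above) that walks an XML instance and records naming, attributing, sequence and containment as triples. The namespaces G and A, the term names (name, contains, follows, text, Chunk) and the function gestures are all hypothetical, chosen only for illustration; the "Chunk" handling is one possible reading of the parenthetical remark above.

```python
import xml.etree.ElementTree as ET
from itertools import count

# Hypothetical namespaces and terms, invented for this illustration.
G = "http://example.org/gesture#"   # the four gestures plus Chunk
A = "http://example.org/attr#"      # one property per XML attribute name
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def gestures(xml_text):
    """Return (subject, predicate, object) triples for an XML string."""
    ids = count(1)
    triples = []

    def new_node():
        # Blank-node style labels: _:n1, _:n2, ...
        return "_:n%d" % next(ids)

    def chunk(text):
        # A Chunk holds a run of character data.
        me = new_node()
        triples.append((me, RDF_TYPE, G + "Chunk"))
        triples.append((me, G + "text", text.strip()))
        return me

    def visit(elem):
        me = new_node()
        # Naming. ElementTree spells namespaced tags as "{uri}local".
        triples.append((me, G + "name", elem.tag))
        # Attributing.
        for key, value in elem.attrib.items():
            triples.append((me, A + key, value))
        # Gather child components in document order, skipping
        # whitespace-only character data.
        members = []
        if elem.text and elem.text.strip():
            members.append(chunk(elem.text))
        for child in elem:
            members.append(visit(child))
            if child.tail and child.tail.strip():
                members.append(chunk(child.tail))
        previous = None
        for member in members:
            triples.append((me, G + "contains", member))           # containment
            if previous is not None:
                triples.append((member, G + "follows", previous))  # sequence
            previous = member
        return me

    visit(ET.fromstring(xml_text))
    return triples


if __name__ == "__main__":
    for triple in gestures('<memo id="m1"><to>Mark</to><body>Hello</body></memo>'):
        print(triple)
```

Writing the triples out as RDF/XML, or in any other notation, is a separate serialization step that this sketch leaves out.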

In XML, as in natural language, we have many ways of expressing nearly the same meaning. If we must decide if two utterances have the same meaning, we cannot do it by comparing the sounds of the utterances--we must consult some rules about the language: word definitions, grammatical rules, and usage conventions. Just so with XML--it is useless to compare the surface structure. We must first of all expose the semantic structure of each instance, then apply some rules of synonymy. Putting an XML document into some such RDF as described above makes it easier to apply these rules.
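
As one illustration of a rule of synonymy, the Python sketch below treats an attribute and a text-only child element of the same name as two ways of saying the same thing. The rule itself, the function name properties and the sample documents are assumptions made for this example. To stay self-contained it reads the parsed XML tree directly; the same rule could equally be applied to the RDF form.

```python
import xml.etree.ElementTree as ET


def properties(xml_text):
    """Collect (owner name, property name, value) facts from an XML string.

    Assumed synonymy rule: an attribute k="v" on an element means the same
    as a child element <k>v</k> that has no attributes and no children.
    """
    facts = set()
    for elem in ET.fromstring(xml_text).iter():
        # Attribute form of a property.
        for key, value in elem.attrib.items():
            facts.add((elem.tag, key, value))
        # Child-element form of a property.
        for child in elem:
            is_leaf = len(child) == 0 and not child.attrib
            if is_leaf and child.text and child.text.strip():
                facts.add((elem.tag, child.tag, child.text.strip()))
    return facts


# Two surface structures, hypothetical sample data.
a = '<person name="Paul" list="semantic-web"/>'
b = '<person><list>semantic-web</list><name>Paul</name></person>'

if __name__ == "__main__":
    print(a == b)                          # surface comparison of the strings
    print(properties(a) == properties(b))  # comparison after applying the rule
```

The two strings differ, yet they yield the same set of facts once the rule is applied. Order of children is deliberately ignored here, which is itself a usage convention that a real rule set would have to state.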


--Paul
