The Semantic Web, Syllogism, and Worldview
First published November 7, 2003 on the "Networks, Economics, and Culture"
mailing list.
Clay Shirky
The W3C's Semantic Web project has been described in many ways over the last
few years: an extension of the current web in which information is given
well-defined meaning, a place where machines can analyze all the data on the
Web, even a Web in which machine reasoning will be ubiquitous and
devastatingly powerful. The problem with descriptions this general, however,
is that they don't answer the obvious question: What is the Semantic Web
good for?
The simple answer is this: The Semantic Web is a machine for creating
syllogisms. A syllogism is a form of logic, first described by Aristotle,
where "...certain things being stated, something other than what is stated
follows of necessity from their being so." [Organon]
The canonical syllogism is:
Humans are mortal
Greeks are human
Therefore, Greeks are mortal
with the third statement derived from the previous two.
The Semantic Web is made up of assertions, e.g. "The creator of shirky.com
is Clay Shirky." Given the two statements
- Clay Shirky is the creator of shirky.com
- The creator of shirky.com lives in Brooklyn
you can conclude that I live in Brooklyn, something you couldn't know from
either statement on its own. From there, other expressions that include Clay
Shirky, shirky.com, or Brooklyn can be further coupled.
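The coupling described above can be sketched as a toy forward-chainer over (subject, predicate, object) triples. This is an illustration of the idea only, not actual RDF or OWL tooling, and the predicate name is invented:

```python
# Toy forward-chainer: if (a, p, b) and (b, p, c) both hold,
# derive (a, p, c). Real RDF/OWL reasoners are far more elaborate;
# this only illustrates how assertions combine.

def derive(facts):
    inferred = set(facts)
    changed = True
    while changed:
        changed = False
        for (a, p, b) in list(inferred):
            for (b2, p2, c) in list(inferred):
                if b == b2 and p == p2 and (a, p, c) not in inferred:
                    inferred.add((a, p, c))
                    changed = True
    return inferred

facts = {
    ("Greeks", "is-a", "human"),   # Greeks are human
    ("human", "is-a", "mortal"),   # Humans are mortal
}
print(("Greeks", "is-a", "mortal") in derive(facts))  # True
```

The derived triple is exactly the canonical syllogism's conclusion, and each derived statement can in turn couple with others.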
The Semantic Web specifies ways of exposing these kinds of assertions on the
Web, so that third parties can combine them to discover things that are true
but not specified directly. This is the promise of the Semantic Web -- it
will improve all the areas of your life where you currently use syllogisms.
Which is to say, almost nowhere.
Syllogisms are Not Very Useful #
Though the syllogism has been around since Aristotle, it reached its
apotheosis in the 19th century, in the work of Charles Dodgson (better known
as Lewis Carroll). Dodgson wrote two books of syllogisms and methods for
representing them in graphic form, and his syllogisms often took the form of
sorites, where the conclusion from one pair of linked assertions becomes a
new assertion to be linked to others.
One of Dodgson's sorites goes:
- Remedies for bleeding, which fail to check it, are a mockery
- Tincture of Calendula is not to be despised
- Remedies, which will check the bleeding when you cut your finger, are
useful
- All mock remedies for bleeding are despicable
which lets you conclude that Tincture of Calendula will check the bleeding
when you cut your finger.
Despite their appealing simplicity, syllogisms don't work well in the real
world, because most of the data we use is not amenable to such effortless
recombination. As a result, the Semantic Web will not be very useful either.
The people working on the Semantic Web greatly overestimate the value of
deductive reasoning (a persistent theme in Artificial Intelligence projects
generally). The great popularizer of this error was Arthur Conan Doyle,
whose Sherlock Holmes stories have done more damage to people's
understanding of human intelligence than anyone other than Rene Descartes.
Doyle has convinced generations of readers that what seriously smart people
do when they think is to arrive at inevitable conclusions by linking
antecedent facts. As Holmes famously put it "when you have eliminated the
impossible, whatever remains, however improbable, must be the truth."
This sentiment is attractive precisely because it describes a world simpler
than our own. In the real world, we are usually operating with partial,
inconclusive or context-sensitive information. When we have to make a
decision based on this information, we guess, extrapolate, intuit, we do
what we did last time, we do what we think our friends would do or what
Jesus or Joan Jett would have done, we do all of those things and more, but
we almost never use actual deductive logic.
As a consequence, almost none of the statements we make, even seemingly
obvious ones, are true in the way the Semantic Web needs them to be true.
Drew McDermott, in his brilliant Critique of Pure Reason [Computational
Intelligence, 3:151-237, 1987], took on the notion that you could create
Artificial Intelligence by building a sufficiently detailed deductive
scaffolding. He concluded that this approach was fatally flawed, noting that
"It must be the case that a significant portion of the inferences we want
[to make] are deductions, or it will simply be irrelevant how many theorems
follow deductively from a given axiom set." Though Critique of Pure Reason
predates not just the Semantic Web but the actual web as well, the criticism
still holds.
Consider the following statements:
- The creator of shirky.com lives in Brooklyn
- People who live in Brooklyn speak with a Brooklyn accent
You could conclude from this pair of assertions that the creator of
shirky.com pronounces it "shoiky.com." This, unlike assertions about my
physical location, is false. It would be easy to shrug this error off as
Garbage In, Garbage Out, but it isn't so simple. The creator of shirky.com
does live in Brooklyn, and some people who live in Brooklyn do speak with a
Brooklyn accent, just not all of them (us).
Each of those statements is true, in other words, but each is true in a
different way. It is tempting to note that the second statement is a
generalization that can only be understood in context, but that way madness
lies. Any requirement that a given statement be cross-checked against a
library of context-giving statements, which would have still further
context, would doom the system to death by scale.
We Describe The World In Generalities #
We can't disallow generalizations because we can't know which statements are
generalizations by looking at them. Even if we could, it wouldn't help,
because generalizations are a fundamental tool of expression. "People who
live in France speak French" is structurally no different from "People
live in Brooklyn speak with a Brooklyn accent." In any human context "People
who live in France speak French" is true, but it is false if universals are
required, as there are French immigrants and expatriates who don't speak
the language.
Syllogisms sound stilted in part because they traffic in absurd absolutes.
Consider this gem from Dodgson:
- No interesting poems are unpopular among people of real taste
- No modern poetry is free from affectation
- All your poems are on the subject of soap-bubbles
- No affected poetry is popular among people of real taste
- No ancient poetry is on the subject of soap-bubbles
This, of course, allows you to conclude that all your poems are bad.
This 5-line syllogism is the best critique of the Semantic Web ever
published, as it illustrates the kind of world we would have to live in for
this form of reasoning to work, a world where language is merely math done
with words. Actual human expression must take into account the ambiguities
of the real world, where people, even those with real taste, disagree about
what is interesting or affected, and where no poets, even the most
uninteresting, write all their poems about soap bubbles.
The Semantic Web's Proposed Uses #
Dodgson's syllogisms actually demonstrate the limitations of the form, a
pattern that could be called "proof of no concept", where the absurdity of
an illustrative example undermines the point being made. So it is with the
Semantic Web. Consider the following, from the W3C's own site:
Q: How do you buy a book over the Semantic Web?
A: You browse/query until you find a suitable offer to sell the book you
want. You add information to the Semantic Web saying that you accept the
offer and giving details (your name, shipping address, credit card
information, etc). Of course you add it (1) with access control so only you
and seller can see it, and (2) you store it in a place where the seller can
easily get it, perhaps the seller's own server, (3) you notify the seller
about it. You wait or query for confirmation that the seller has received
your acceptance, and perhaps (later) for shipping information, etc.
[http://www.w3.org/2002/03/semweb/]
One doubts Jeff Bezos is losing sleep.
This example sets the pattern for descriptions of the Semantic Web. First,
take some well-known problem. Next, misconstrue it so that the hard part is
made to seem trivial and the trivial part hard. Finally, congratulate
yourself for solving the trivial part.
All the actual complexities of matching readers with books are waved away in
the first sentence: "You browse/query until you find a suitable offer to
sell the book you want." Who knew it was so simple? Meanwhile, the trivial
operation of paying for it gets a lavish description designed to obscure the
fact that once you've found a book for sale, using a credit card is a pretty
obvious next move.
Consider another description of the Semantic Web that similarly misconstrues
the problem:
Merging databases simply becomes a matter of recording in RDF somewhere
that "Person Name" in your database is equivalent to "Name" in my database,
and then throwing all of the information together and getting a processor to
think about it. [http://infomesh.net/2001/swintro/]
No one who has ever dealt with merging databases would use the word
'simply'. If making a thesaurus of field names were all there was to it,
there would be no need for the Semantic Web; this process would work today.
Contrariwise, to adopt a Lewis Carroll-ism, the use of hand-waving around
the actual problem -- human names are not globally unique -- masks the
triviality of linking Name and Person Name. Is your "Person Name = John
Smith" the same person as my "Name = John Q. Smith"? Who knows? Not the
Semantic Web. The processor could "think" about this til the silicon smokes
without arriving at an answer.
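The mechanics of that "simple" merge fit in a few lines (field names and record values here are hypothetical); notice what the field mapping does and does not settle:

```python
# Declared equivalence between field names: the easy, mechanical part.
equivalences = {"Person Name": "Name"}  # maps your field to my field

def normalize(record, equivalences):
    """Rename a record's fields according to the declared equivalences."""
    return {equivalences.get(field, field): value
            for field, value in record.items()}

yours = {"Person Name": "John Smith"}
mine = {"Name": "John Q. Smith"}

merged = [normalize(yours, equivalences), mine]
print(merged)
# [{'Name': 'John Smith'}, {'Name': 'John Q. Smith'}]
```

The schemas now agree, but whether those two records describe the same person is exactly the question the mapping cannot answer.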
From time to time, proselytizers of the Semantic Web try to give it a human
face:
For example, we may want to prove that Joe loves Mary. The way that we
came across the information is that we found two documents on a trusted
site, one of which said that ":Joe :loves :MJS", and another of which said
that ":MJS daml:equivalentTo :Mary". We also got the checksums of the files
in person from the maintainer of the site.
To check this information, we can list the checksums in a local file,
and then set up some FOPL rules that say "if file 'a' contains the
information Joe loves mary and has the checksum md5:0qrhf8q3hfh, then record
SuccessA", "if file 'b' contains the information MJS is equivalent to Mary,
and has the checksum md5:0892t925h, then record SuccessB", and "if SuccessA
and SuccessB, then Joe loves Mary". [http://infomesh.net/2001/swintro/]
You may want to read that second paragraph again, to savor its delicious mix
of minutia and cluelessness.
Anyone who has ever been 15 years old knows that protestations of love,
checksummed or no, are not to be taken at face value. And even if we wanted
to take love out of this example, what would we replace it with? The
universe of assertions that Joe might make about Mary is large, but the
subset of those assertions that are universally interpretable and
uncomplicated is tiny.
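For what it's worth, the quoted machinery reduces to a few lines of checksum comparison (the file contents and digests below are invented for illustration):

```python
import hashlib

def digest(data: bytes) -> str:
    """MD5 hex digest, as in the quoted example's md5:... checksums."""
    return hashlib.md5(data).hexdigest()

# Invented file contents standing in for the two trusted documents.
file_a = b":Joe :loves :MJS"
file_b = b":MJS daml:equivalentTo :Mary"

# Checksums "got in person from the maintainer of the site".
known = {"a": digest(file_a), "b": digest(file_b)}

def joe_loves_mary(a, b, known):
    success_a = digest(a) == known["a"]   # the rule recording SuccessA
    success_b = digest(b) == known["b"]   # the rule recording SuccessB
    return success_a and success_b        # "if SuccessA and SuccessB..."

print(joe_loves_mary(file_a, file_b, known))  # True
```

All this verifies is that two files are unaltered. Whether Joe actually loves Mary is untouched by any of it, which is the point.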
One final entry in the proof of no concept category:
Here's an example: Let's say one company decides that if someone sells
more than 100 of our products, then they are a member of the Super Salesman
club. A smart program can now follow this rule to make a simple deduction:
"John has sold 102 things, therefore John is a member of the Super Salesman
club." [http://logicerror.com/semanticWeb-long]
This is perhaps the high-water mark of presenting trivial problems
as worthy of Semantic intervention: a program that can conclude that 102 is
greater than 100 is labeled smart. Artificial Intelligence, here we come.
Meta-data is Not A Panacea #
The Semantic Web runs on meta-data, and much meta-data is untrustworthy, for
a variety of reasons that are not amenable to easy solution. (See for
example Doctorow, Pilgrim, Shirky.) Though at least some of this problem
comes from people trying to game the system, the far larger problem is that
even when people publish meta-data that they believe to be correct, we still
run into trouble.
Consider the following assertions:
- Count Dracula is a Vampire
- Count Dracula lives in Transylvania
- Transylvania is a region of Romania
- Vampires are not real
You can draw only one non-clashing conclusion from such a set of
assertions -- Romania isn't real. That's wrong, of course, but the wrongness
is nowhere reflected in these statements. There is simply no way to cleanly
separate fact from fiction, and this matters in surprising and subtle ways
that relate to matters far more weighty than vampiric identity. Consider
these assertions:
- US citizens are people
- The First Amendment covers the rights of US citizens
- Nike is protected by the First Amendment
You could conclude from this that Nike is a person, and of course you would
be right. In the context of First Amendment law, corporations are treated
as people. If, however, you linked this conclusion with a medical database,
you could go on to reason that Nike's kidneys move poisons from Nike's
bloodstream into Nike's urine.
Ontology is Not A Requirement #
Though proponents of the Semantic Web gamely try to illustrate simple uses
for it, the kind of explanatory failures above are baked in, because the
Semantic Web is divided between two goals, one good but unnecessary, the
other audacious but doomed.
The first goal is simple: get people to use more meta-data. The Semantic Web
was one of the earliest efforts to rely on the idea of XML as a common
interchange format for data. With such a foundation, making formal
agreements about the nature of whatever was being described -- an
ontology -- seemed a logical next step.
Instead, it turns out that people can share data without having to share a
worldview, so we got the meta-data without needing the ontology. Exhibit A
in this regard is the weblog world. In a recent paper discussing the
Semantic Web and weblogs, Matt Rothenberg details the invention and rapid
spread of "RSS autodiscovery", where an existing HTML tag was pressed into
service as a way of automatically pointing to a weblog's syndication feed.
About this process, which went from suggestion to implementation in mere
days, Rothenberg says:
Granted, RSS autodiscovery was a relatively simplistic technical
standard compared to the types of standards required for the environment of
pervasive meta-data stipulated by the semantic web, but its adoption
demonstrates an environment in which new technical standards for publishing
can go from prototype to widespread utility extremely quickly. [PDF offline,
cache here]
This, of course, is the standard Hail Mary play for anyone whose technology
is caught on the wrong side of complexity. People pushing such technologies
often make the "gateway drug" claim that rapid adoption of simple
technologies is a precursor to later adoption of much more complex ones.
Lotus claimed that simple internet email would eventually leave people
clamoring for the more sophisticated features of CC:Mail (RIP), PointCast
(also RIP) tried to label email a "push" technology so they would look like
a next-generation tool rather than a dead-end, and so on.
Here Rothenberg follows the script to a tee, labeling RSS autodiscovery
'simplistic' without entertaining the idea that simplicity may be a
requirement of rapid and broad diffusion. The real lesson of RSS
autodiscovery is that developers can create valuable meta-data without
needing any of the trappings of the Semantic Web. Were the whole effort to
be shelved tomorrow, successes like RSS autodiscovery would not be affected
in the slightest.
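The convention itself is small enough to show whole: a page advertises its feed with a single <link> element, and a client scans for it. A minimal sketch using Python's standard HTML parser (the page and feed URL are invented):

```python
from html.parser import HTMLParser

# A page head carrying the autodiscovery <link>; the URL is invented.
PAGE = """<html><head>
<link rel="alternate" type="application/rss+xml"
      title="RSS" href="http://example.com/index.xml">
</head><body></body></html>"""

class FeedFinder(HTMLParser):
    """Collect the hrefs of rel=alternate RSS <link> elements."""
    def __init__(self):
        super().__init__()
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if (tag == "link" and a.get("rel") == "alternate"
                and a.get("type") == "application/rss+xml"):
            self.feeds.append(a.get("href"))

finder = FeedFinder()
finder.feed(PAGE)
print(finder.feeds)  # ['http://example.com/index.xml']
```

That the whole convention fits in one tag and a short scan is precisely why it spread in days.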
Artificial Intelligence Reborn #
If the sole goal of the Semantic Web were pervasive markup, it would be
nothing more than a "Got meta-data?" campaign -- a generic exhortation for
developers to do what they are doing anyway. The second, and larger goal,
however, is to take up the old Artificial Intelligence project in a new
context.
After 50 years of work, the performance of machines designed to think about
the world the way humans do has remained, to put it politely, sub-optimal.
The Semantic Web sets out to address this by reversing the problem. Since
it's hard to make machines think about the world, the new goal is to
describe the world in ways that are easy for machines to think about.
Descriptions of the Semantic Web exhibit an inversion of trivial and hard
issues because the core goal does as well. The Semantic Web takes for
granted that many important aspects of the world can be specified in an
unambiguous and universally agreed-on fashion, then spends a great deal of
time talking about the ideal XML formats for those descriptions. This puts
the stress on the wrong part of the problem -- if the world were easy to
describe, you could do it in Sanskrit.
Likewise, statements in the Semantic Web work as inputs to syllogistic logic
not because syllogisms are a good way to deal with slippery, partial, or
context-dependent statements -- they are not, for the reasons discussed
above -- but rather because syllogisms are things computers do well. If the
world can't be reduced to unambiguous statements that can be effortlessly
recombined, then it will be hard to rescue the Artificial Intelligence
project. And that, of course, would be unthinkable.
Worldviews Differ For Good Reasons #
Many networked projects, including things like business-to-business markets
and Web Services, have started with the unobjectionable hypothesis that
communication would be easier if everyone described things the same way.
From there, it is a short but fatal leap to conclude that a particular brand
of unifying description will therefore be broadly and swiftly adopted (the
"this will work because it would be good if it did" fallacy).
Any attempt at a global ontology is doomed to fail, because meta-data
describes a worldview. The designers of the Soviet library's cataloging
system were making an assertion about the world when they made the first
category of books "Works of the classical authors of Marxism-Leninism."
Melvil Dewey was making an assertion about the world when he lumped all
books about non-Christian religions into a single category, listed last
among books about religion. It is not possible to neatly map these two
systems onto one another, or onto other classification schemes -- they
describe different kinds of worlds.
Because meta-data describes a worldview, incompatibility is an inevitable
by-product of vigorous argument. It would be relatively easy, for example,
to encode a description of genes in XML, but it would be impossible to get a
universal standard for such a description, because biologists are still
arguing about what a gene actually is. There are several competing standards
for describing genetic information, and the semantic divergence is an
artifact of a real conversation among biologists. You can't get a standard
til you have an agreement, and you can't force an agreement to exist where
none actually does.
Furthermore, when we see attempts to enforce semantics on human situations,
it ends up debasing the semantics, rather than making the connection more
informative. Social networking services like Friendster and LinkedIn assume
that people will treat links to one another as external signals of deep
association, so that the social mesh as represented by the software will be
an accurate model of the real world. In fact, the concept of friend, or even
the type and depth of connection required to say you know someone, is quite
slippery, and as a result, links between people on Friendster have been
drained of much of their intended meaning. Trying to express implicit and
fuzzy relationships in ways that are explicit and sharp doesn't clarify the
meaning, it destroys it.
Worse is Better #
In an echo of Richard Gabriel's Worse is Better argument, the Semantic Web
imagines that completeness and correctness of data exposed on the web are
the cardinal virtues, and that any amount of implementation complexity is
acceptable in pursuit of those virtues. The problem is that the more
semantic consistency required by a standard, the sharper the tradeoff
between complexity and scale. It's easy to get broad agreement in a narrow
group of users, or vice-versa, but not both.
The systems that have succeeded at scale have made simple implementation the
core virtue, up the stack from Ethernet over Token Ring to the web over
gopher and WAIS. The most widely adopted digital descriptor in history, the
URL, regards semantics as a side conversation between consenting adults, and
makes no requirements in this regard whatsoever: sports.yahoo.com/nfl/ is a
valid URL, but so is 12.0.0.1/ftrjjk.ppq. The fact that a URL itself doesn't
have to mean anything is essential -- the Web succeeded in part because it
does not try to make any assertions about the meaning of the documents it
contains, only about their location.
There is a list of technologies that are actually political philosophy
masquerading as code, a list that includes Xanadu, Freenet, and now the
Semantic Web. The Semantic Web's philosophical argument -- the world should
make more sense than it does -- is hard to argue with. The Semantic Web,
with its neat ontologies and its syllogistic logic, is a nice vision.
However, like many visions that project future benefits but ignore present
costs, it requires too much coordination and too much energy to effect in
the real world, where deductive logic is less effective and shared worldview
is harder to create than we often want to admit.
Much of the proposed value of the Semantic Web is coming, but it is not
coming because of the Semantic Web. The amount of meta-data we generate is
increasing dramatically, and it is being exposed for consumption by machines
as well as, or instead of, people. But it is being designed a bit at a time,
out of self-interest and without regard for global ontology. It is also
being adopted piecemeal, and it will bring with it all the
incompatibilities and complexities that implies. There are significant
disadvantages to this process relative to the shining vision of the Semantic
Web, but the big advantage of this bottom-up design and adoption is that it
is actually working now.