Re: [uf-discuss] Re: Apple Data Detectors

Alex Faaborg Fri, 08 Feb 2008 19:36:56 -0800

On the other end, if, as I type this, I get an intellisense-like list of my contacts that I can select from, then I can just select Joe from the list and have the microformat markup added for me

I've been thinking a lot about how a Web browser could help end users author microformatted content in blogs and wikis, and I think we need to consider the user's goals and motivations. I can't imagine people associating a contact in their address book with Joe as they casually mention him in a blog post just because they have an appreciation for the beauty of structured data. However, if their goal is closely aligned with the goal of their readers, then I can see users going to the extra effort. For instance, let's say you want to review something: because you want your vote to count, and you want other people to be able to take advantage of your review once it gets aggregated, I can see users going to the extra effort of filling out a form like the hReview creator (http://microformats.org/code/hreview/creator) to get information into the structure of an hReview. The same goes for people who want to promote an event: since their motivation is for people to attend, they make it easy for users to add the event to their calendar. We already see this type of behavior in applications like Outlook or Zimbra, where people create events for other people so that they are easy to accept. Microformats allow us to take that interaction out of closed systems and apply it to HTML emails, blog posts, wikis, etc.

I'm all for building systems that attempt to infer structure from natural language, because like we see in Apple's 1998 article, and now in Mail.app, these types of systems can be really useful when they work. But I also don't think we should discount situations where the user may actually have a clear motivation for creating structured data by filling out a form.

In case anyone is interested in reading more about Data Detectors, you might find this paper interesting. It catalogs all of the research done throughout the late 90s, and discusses a prototype system that leverages large knowledge bases like Stanford's TAP and MIT's ConceptNet to disambiguate natural language and provide structure to unstructured text:


http://alumni.media.mit.edu/~faaborg/files/thesis/draft/complete/CHI06_goalOrientedWebBrowser.pdf

-Alex



On Feb 8, 2008, at 8:40 AM, Guillaume Lebleu wrote:

Toby A Inkster wrote:
Guillaume Lebleu wrote:
What I have been thinking more and more and what this tells me again is
that the same way we talk of POSH and microformats, we could talk of
plain text or plain old english formats, essentially standardizing how people write dates, addresses, etc on the Web or on their emails. Asking
people to write "Tuesday, February 5, 2008" in this order, with the
commas, etc. is very likely even simpler for normal people than writing <abbr class="foo" title="2008-02-05">Tuesday, February 5, 2008</abbr>.
One problem with that is that it will find matches on people who aren't even intending to use your plain-old-english format. They may happen to be including "Tuesday, February 5, 2008" on their pages with a different intended meaning. 2008 could refer to eight minutes past eight PM in military time -- unlikely, but possible. And as you move away from dates, phone numbers and postcodes which have relatively parseable formats, towards locations, people's names and job titles and so on, the likelihood of false matches increases.
The use of explicit tags to mark up information does make microformats slightly harder to use, yes. But the key is that they also make microformats much easier to explicitly not use.
Toby,
I understand the challenge of disambiguation and the value microformats bring in terms of easier parser implementation and a more reliable information consumption experience. The challenge for average people writing microformats can't be underestimated though. I strongly believe that the time when disambiguation costs are the lowest is publishing time, but this is also the time when you are focused on the english content, not the microformats. This is why in the second part of the post you cited, I suggested the use of Apple Data Detectors-like functionality, not to detect objects in plain old english (POE) in published content, but to detect objects in POE at the time they are written and ask the user for disambiguation at the same time, in a way that the underlying microformat markup is generated, but without the user having to know the syntax. I'm thinking of this particularly in the context of writing a blog post: writing an hCard just to say "My friend Joe" is way too much for normal people. On the other end, if, as I type this, I get an intellisense-like list of my contacts that I can select from, then I can just select Joe from the list and have the microformat markup added for me (just like Wordpress adds a lot of markup that isn't in the visual editor or like Wiki converts simplified markup into HTML markup).
Guillaume
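
As a rough illustration of the write-time detection described above, the Python sketch below looks for dates written in the strict "Tuesday, February 5, 2008" form, asks a caller-supplied callback to confirm each one, and only then wraps it in abbr markup with an ISO 8601 (year-month-day) title. The names parse_candidate, mark_up and confirm are hypothetical, "foo" is just the placeholder class from the example above, and the weekday check is one assumed way to reduce false matches. It is not how Apple Data Detectors or any existing editor works.

```python
import re
from datetime import date

MONTHS = ["January", "February", "March", "April", "May", "June", "July",
          "August", "September", "October", "November", "December"]
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday",
            "Saturday", "Sunday"]

# Matches only the strict "Weekday, Month D, YYYY" form, commas included.
DATE_RE = re.compile(
    r"\b(%s), (%s) (\d{1,2}), (\d{4})\b"
    % ("|".join(WEEKDAYS), "|".join(MONTHS))
)


def parse_candidate(match):
    """Return a date for a regex match, or None if it is not a real date."""
    weekday, month, day, year = match.groups()
    try:
        parsed = date(int(year), MONTHS.index(month) + 1, int(day))
    except ValueError:
        return None  # e.g. "February 30"
    # Reject candidates whose weekday disagrees with the calendar.
    if WEEKDAYS[parsed.weekday()] != weekday:
        return None
    return parsed


def mark_up(text, confirm):
    """Wrap each date the author confirms in <abbr>; leave the rest alone.

    confirm(matched_text, parsed_date) stands in for the editor's prompt
    and returns True when the author wants the markup added.
    """
    def replace(match):
        parsed = parse_candidate(match)
        if parsed is None or not confirm(match.group(0), parsed):
            return match.group(0)
        return '<abbr class="foo" title="%s">%s</abbr>' % (
            parsed.isoformat(), match.group(0))

    return DATE_RE.sub(replace, text)


if __name__ == "__main__":
    # Accept every candidate; a real editor would ask the author instead.
    print(mark_up("Lunch on Tuesday, February 5, 2008.", lambda t, d: True))
```

Because nothing is marked up unless the callback says yes, an author who means something else by the same string can simply decline, which keeps the markup as easy to not use as explicit tags are.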
_______________________________________________
microformats-discuss mailing list
[email protected]
http://microformats.org/mailman/listinfo/microformats-discuss

