On Oct 29, 2006, at 8:39 PM, Chris Messina wrote:

Whohoo! A point of real debate!

:)

Well, as I did when we first spoke months ago, I firmly disagree, and
over time, I think you will be proven wrong. There's simply no point
in having multiple instantiations of the same data in a text-based
format (I'm exempting relational databases).

Mmm, I think you're confusing the issues of immediate presentation to the user and long term storage (one of which is a *human* problem and the other is a *machine* problem). In many cases -- specifically Adium's case -- the on-disk logs are going to be read by things that aren't even *close* to a web browser -- Spotlight, for instance. Gaim's log viewer is another example.

Also, think about an AJAXy web app (like Meebo) that constantly is modifying its DOM tree. I don't think that that markup and all of its included JS (and also the fact that it might be necessary to have a server connection) is suitable for long term storage and parsing. This is another reason against using a uF for long term storage -- you can have uFs in web pages with all sorts of extraneous information on them.

I think it's very telling that you exempt relational databases. I could imagine a uF for specifying a table (<table> anyone?) and the relations therein -- this could of course be easily parsed and modified by JS. The way I see it, you've already acknowledging the need for machine-friendly data that is separate from human-friendly data (uFs).

In any case -- I know where you're coming from and I appreciate you
making a public statement about your position. This will allow us to
finally have this conversation out in daylight.

Thanks! I agree, it's important to have this out in the open. On that note, I'm curious if anything you've worked on might provide additional examples for discussion?

Your point has been that straight XML is easier to parse than XHTML --
which could include linked CSS for use in presentation, or JS to
add/modify behavior.

A nitpicky, off-topic point: XML can include CSS and JS and can be styled.

In any case, my real point is that it's not the XHTML-nature of uFs makes them harder to parse. Rather, it's something about microformats. Microformats have to be much more flexible to interact with the other kinds of markup and presentation issues that exist on the web. A good example that popped up on the list recently was hCard's powerful (but tricky to parse) value class. A log that exists for machines to parse definitely doesn't need something like that.


Furthermore, as new chat programs emerge, the ability to move data
between them in a format that works in *any browser* seems like an
investment in the future, as opposed to the present and recent past.

My main problem with this is the word "browser". Adium is not a web browser. It is a user agent, in that it displays (X)HTML, but it does no browsing of the web. As mentioned above, this is another instance of a non-browser working with HTML. See above for another example of the need for something that is easily read by a non-browser.

As far as in-browser elements like Google's Gtalk web chat, those logs are stored on-disk by the server and then displayed to the user. Storing the entire web page (or a section thereof) for the log file that will be shown to the user is impossible -- Gmail is a highly dynamic web page. Thus, there's a uF offers no real advantage in this case: the file is going to have to be read in and parsed anyway, so why not use something easier for machines (like XML).

Lastly, the unknown uses and mashability of XHTML is one of the most
important elements of using XHTML as opposed to custom XML.

Another nitpicky, offtopic issue: ULF isn't exactly custom, being that it's a published standard and has multiple implementations.

Otherwise, this is a good point, and one of the reasons why I'm not discounting a uF for chat all together -- I'm just arguing about the specific *focus* the format will take.

These are a few of the basic assumptions that I work under. I would be
very happy if you would discount them, one at a time. ;)

Hopefully I've done a satisfactory job. Anyone else can feel free to jump in, btw.

-Colin

On 10/29/06, Colin Barrett <[EMAIL PROTECTED]> wrote:
Particularly, I'm going to be talking about "hChat".

Personally, I see the primary use case of a chat microformat to be for
displaying of chat contents in a live (or semi-live) way. There are
quite a few examples of this in the wild -- web-based IM clients like
Meebo, and Adium and Kopete's message view, both of which are HTML
based.

Contrarily, most data formats are not HTML based (XML based, usually),
or if they are, do not use modern HTML (i.e. AOL's HTML log format).

I think the chat group should alter its focus to providing ways to
semantically talk about the structure of a chat and a particular
message entry. One of the most obvious benefits in this area is much
better clipboard support -- copying things out of Meebo and Adium/
Kopete's HTML view usually results in an un-useful mess.

Another important use case, that I think there has been some research
towards already, is "snippets". That is, small snippets of an IM
conversation in a blog post. This, conveniently, goes along with my
earlier point about copy-paste.

Just a couple of other points that have been percolating:
  - the use of hCard to mark up people's names
  - Time zone information is important -- particularly wrt. time time
zone of the sender and the time zone of the receiver.

This email has been a long time coming -- I've waited for a couple of
months, just lurking and trying to understand the process. Should
people agree that this is the direction research should go, I'll start
working on getting links on the wiki to examples of documentations of
chat representations. I think that the idea of chat *logging* is a
well-solved problem by XML and microformats shouldn't be butting their
heads in -- especially since most of the time full logs aren't
available on the web (the uF log bot being a notable, and interesting,
exception).

In the interest of full disclosure: I'm one of the developers of Adium
-- we're looking into improving our message view, and copy/paste has
been something that's plagued us for a while. I'm also one of the
primary authors of an XML based log format that's been adopted by
Adium, Gaim and Kopete.

--
Colin Barrett
Developer, Adium
http://adiumx.com
_______________________________________________
microformats-discuss mailing list
[email protected]
http://microformats.org/mailman/listinfo/microformats-discuss



--
Chris Messina
Citizen Provocateur &
 Open Source Ambassador-at-Large
Work: http://citizenagency.com
Blog: http://factoryjoe.com/blog
Cell: 412 225-1051
Skype: factoryjoe
This email is:   [ ] bloggable    [X] ask first   [ ] private
_______________________________________________
microformats-discuss mailing list
[email protected]
http://microformats.org/mailman/listinfo/microformats-discuss

_______________________________________________
microformats-discuss mailing list
[email protected]
http://microformats.org/mailman/listinfo/microformats-discuss

Reply via email to