Reviving Atom Export

Alastair Rankine Mon, 11 Dec 2006 02:01:41 -0800

So I got a bit distracted for a bit. Here's what we were talking about.


### What is it?

Atom Export is a proposal for a standard/convention/something for theexport of content from a CMS (think blogging engine) in a format thatis similar to other Atom standards. The assumption here is that thecontent being exported is a good fit for Atom. I believe this to bethe case (hence discussing it on this list) but it is not establishedyet.

More rationale, and initial attempt, here: http://girtby.net/articles/2006/08/14/towards-a-common-blog-export-format


### Where did we get up to?

We have a list of requirements for the export format, specificallythe data from a typical blogging engine that should be included inthe export. We also had identified some preliminary issues withrepresenting the content in an Atom-based document format. Lastly wehad clarified some scope, namely that the primary use case should befor migrating from one content engine to another, and not necessarilyfor backup purposes within a single application.


### What data is to be represented by this?

Here is a list of data to be included in the export. For conveniencea blog engine is assumed, although other compatible CMSs mayimplement this as well:


   1. Complete list of authors defined. For each author:
         1. Name
         2. URI
         3. email
   2. Complete list of categories defined:
         1. Name
         2. URI
   3. All articles. For each article:
         1. Source text

2. All the relevant metadata from the Atom spec, namely:author, ID, published, rights, title, updated, summary, categories

         3. Some other metadata: draft status, syntax of source
   4. All comments and trackbacks. For each comment or trackback:
         1. Source text

2. Atom spec metadata: author, ID, title, published,summary, avatar?3. Additional metadata: pointer to parent article orcomment (ie "in-reply-to")

   5. All "Owned" media. For each media object:
         1. URI
         2. MIME type
         3. Binary data

### What are the issues identified so far?

Briefly:

* It was felt that the inclusion of binary media objects warrantsthe use of a binary archive format. The issue of how to organise thecontent within the archive file is unresolved.* There was some discussion about the inclusion of generated text,specifically HTML generated from the source text. It is redundant butmay be useful.

### What now?

I think the best way to proceed is to propose a fresh mapping of theabove data requirements to Atom-derived constructs. I already had ago at this previously, but I think it might be better to startafresh. This time I'd like to try an approach based around theconcepts introduced in APP, particularly that of collections, whichlook like a good fit for items 3,4, and 5 above.

Reviving Atom Export

Reply via email to