Re: Atom Export

Alastair Rankine Tue, 17 Oct 2006 05:08:05 -0700


On 05/10/2006, at 1:08 AM, James M Snell wrote:

1. Complete list of authors and categories defined


With permissions, metadata, etc.


Metadata like URL and email, sure.

I'm not so sure about permissions. Do we do groups then? And do wetry to represent permissions for actions that are not directlyrelated to authoring content? (eg changing the theme) There is hugepotential for scope creep here.

My intent with an Atom-based export file format is to facilitatemigration from one blogging engine (or similar CMS) to another. Butimplementations will differ in the functions they offer, and the waythey offer them, so it is unreasonable I think to expect that asingle file format would allow seamless migration in all cases. Sucha format would be a union of the collective data models of allcurrent and future blogging engines, and hence doomed tounimplementable complexity.

So there is a compromise to be made here: on the one hand we want theexported file to contain as much important data as possible, but onthe other hand the format needs to be simple enough to implementacross multiple blogging engines.

I think it's a worthy goal to minimize, rather than eliminate, theamount of manual reworking required when migrating from one platformto another. In other words it is not a goal for this format to besuitable for backup/restore.

All this is a roundabout way of saying that each blogging engine islikely to have a unique permissions model, and that it is not a bigwin to attempt to represent this in the export file format.

I would add to this information about what plugins have beenapplied and

what templates have been used.  These, of course, are not going to be
portable to different blog environments but the information would be
necessary in order to faithfully recreate the entries later.


Agree, but there is a minefield of complexity here.

I currently use Typo as a blogging engine and it supports macros andfilters in addition to a set of basic markup langauges. Getting theright combination of macros and filters is important to producingcorrect HTML output. So it would seem pretty important to includethis information in the export file.

But on the other hand the macros and filters are unlikely to besupported on other platform, so probably the safest course of actionis to expand these when the content is exported? There is some lossof information here, and an export/import operation is no longersemantically neutral, but the alternative is potentially worse.

The tricky bit is defining what is meant by "owned" media.


I would say that it's any media linked to by an entry located on the
same host as the entry, with the exporter given some discretion as to
what to include and what not to include.

Agree. In practice the decision is likely to be based on whether theexporter can find the linked media file on a local filesystem.

Each image included probably needs to be accompanied by the(relative?) URL at which it was originally published.


So the list is now:

1. Complete list of authors defined. For each author:
        a. Name
        b. URI
        c. email
2. Complete list of categories defined:
        a. Name
        b. URI
3. All articles. For each article:
        a. Source text
        b. All the relevant metadata from the Atom spec, namely:
                author, ID, published, rights, title, updated, summary, 
categories
        c. Some other metadata:
                draft status, syntax of source
4. All comments and trackbacks. For each comment or trackback:
        a. Source text
        b. Atom spec metadata:
                author, ID, title, published, summary, avatar?
        c. Additional metadata:
                pointer to parent article or comment (ie "in-reply-to")
5. All "Owned" media. For each media object:
        a. URI
        b. MIME type
        c. Binary data

Does this look about right? Obviously there would need to be aliberal sprinkling of extension points for proprietary information.

Re: Atom Export

Reply via email to