Atom, files and directories: what it's all about

Henry Story Wed, 23 Nov 2005 03:34:42 -0800

I suppose I am kind of slow in some ways. Perhaps everyone hasrealized this all along. But the relation between atom and a filesystem has just struck me forcefully recently.

REST does not have the notion of a directory hierarchy. You may thinkit does because urls have slashes in them, and because you can browsethe file system in a web browser. But really you are just lucky thatmost people use hierarchical file systems to construct their urls.The pages you get back if you point your browser to something likehttp://bblfish.net/bloged/ is an html human readable representationof the contents of a directory.

What is a feed? It is a machine readable representation of theRESTful equivalent of a directory.

What is an entry? It is the machine readable representation of theRESTful equivalent of a file.

Directories contain files. Feeds contain entries. Files contain'content'. Entries contain content too. When you type 'ls -ali' in adirectory


bash-2.03$ ls -ali
total 61
2705932 drwxr-xr-x   3 hjs  vuser    512 Nov 22 16:10 ./
2662433 drwxr-xr-x  34 hjs  vuser   1024 Nov 22 16:10 ../
2705933 -rw-r--r--   1 hjs  vuser   2059 Nov 22 16:10 BlogEd.jnlp
2705934 -rw-r--r--   1 hjs  vuser  49674 Nov 22 16:10 BlogEd.tiff
2705935 drwxr-xr-x   2 hjs  vuser   1024 Nov 22 16:10 lib/

you get all kinds of metadata about the directory '.', and about thefiles in the directory. The first number is the inode, which is alittle like the src url of the content.


Same when you look at a basic feed

<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom";>

  <title>Example Feed</title>
  <link href="http://example.org/"/>
  <updated>2003-12-13T18:30:02Z</updated>
  <author>
    <name>John Doe</name>
  </author>
  <id>urn:uuid:60a76c80-d399-11d9-b93C-0003939e0af6</id>

  <entry>
    <title>Atom-Powered Robots Run Amok in France</title>
    <link href="http://example.org/2003/12/13/atom03"/>
    <id>urn:uuid:1225c695-cfb8-4ebb-aaaa-80da344efa6a</id>
    <updated>2005-11-13T18:30:02Z</updated>
    <summary>Some text.</summary>
    type="xhtml" xml:lang="en"
     xml:base="http://diveintomark.org/";>
      <div xmlns="http://www.w3.org/1999/xhtml";>

<p><i>After a crash course in french culturalintegration...</i></p>

      </div>
    </content>
  </entry>

</feed>

The information is just presented differently. The xml makes iteasier for machines to parse the content, as it deals with all theproblems of unicode and separation of the data. Yes you can use sedand grep in unix, but you always will come across weird corner cases,such as when a file has a blank space in it, non ascii weirdcharacters, ...

Entries and Feeds are therefore both containers. In Unix we haveknown this all along: everything is a file. It is just thatDirectories are special kinds of files, that contain filesthemselves. So, as I have argued for a long time, Feeds are reallytypes of Entries.

Feeds can have images (icons) associated with them. In most end userfile systems, so can folders. The atom group is working on adding theability to add icons and images to entries too, then the symmetrywill be complete.

Entries are really containers for file metadata. As OSX is slowlymoving to enabling file system metadata [1],


% xattr --set name John file
% xattr --set color red file

% xattr --list file
file
        color   red
         name   John

so using the link relation one can add all types of metadata on feedsor entries


 <entry>
    <title>Atom draft-07 snapshot</title>
    <link rel="alternate" type="text/html"
     href="http://example.org/2005/04/02/atom"/>
    <link rel="enclosure" type="audio/mpeg" length="1337"
     href="http://example.org/audio/ph34r_my_podcast.mp3"/>
    <id>tag:example.org,2003:3.2397</id>
...
  </entry>

The attribute value of the rel relation is in fact a url, as befits aweb document text, and people can add whatever metadata they like there.

By adding a "feed" link relation one allows feeds to point to feeds,which is the equivalent of directories inside directories.

A few improvements of atom over directories is that our feed cancontain not just the current version of an entry, but all previousversions as well, which I think I remember was a feature supported bythe vms file system.

Of course the URI id construct and the fact that the web does nothave partitions, allows one to keep track of the identity of a fileover the whole web, which simple inodes just cannot do.


Henry Story

[1] http://arstechnica.com/reviews/os/macosx-10.4.ars/7

Atom, files and directories: what it's all about

Reply via email to