[linuxkernelnewbies] Development [LWN.net]

Peter Teoh Sat, 27 Dec 2008 07:24:19 -0800


The libferris virtual filesystem


November 19, 2008

This article was contributed by Ben Martin

The Unix mantra "everything is a file" gives you great flexibility over where you store your data and how information is manipulated and replicated. Unfortunately, many things in Unix and Linux are not files, or at least not files that you would want to interact with directly. For example, a PostgreSQL database is ultimately stored in a collection of binary files, though you probably wouldn't want to interact with those files directly. Instead of storing settings in a collection of tiny files, many applications use XML to store settings in a single file, but then have to deal with parsing XML instead of just reading little files. libferris lets you mount both PostgreSQL and XML and provides you with a useful way to interact with the data contained in both as a virtual filesystem.

Other operating systems like Plan 9 pushed the envelope further than Unix, making more things "just a file". Unfortunately, to use Plan 9 you had to abandon your trusty old Unix roots and jump to an entirely new operating system.

I started the libferris virtual filesystem project back in 2001 to push the "everything is a file" concept further while staying on a Linux base. Libferris is a virtual filesystem implemented as a shared library with FUSE bindings. Because FUSE is already in the Linux kernel, you don't have to do any kernel patching to use libferris. Because libferris is a shared library and not in the kernel, it can use other libraries to help it mount data sources like XML, relational databases and Emacs, to name a few. And as an upshot of being out of kernel, I can work on letting libferris mount anything I like, no matter how strange it might be, without any third party approval.

There are actually two ways to use libferris -- through a native C++ interface and through the normal Unix APIs with FUSE. The FUSE interface is very useful if you want to rsync(1) some structured information from an XML file into a PostgreSQL database. Just mount them both with FUSE and rsync away. Other interesting things you can do with the FUSE interface include exposing data as a virtual office document, using XSLT stylesheets that libferris processes for you, and geotagging with Google Earth.

The design of libferris revolves around two primitives: exposing file contents as C++ std::iostreams, and rich metadata support through an interface similar to Extended Attributes (EA) attr_get(3). Over time libferris has also gained sophisticated support for indexing both the full text contents of files and their metadata. Libferris is written in C++ and aims to take full advantage of the language. Interfaces are designed to be as easy as possible for C++ programmers to pick up; for example, displaying a directory can be done using iterators, find(), begin() to end() etc.

Both the types of things that libferris can provide as virtual filesystems and the metadata handling are done through a plugin interface. The handling of metadata is done through the Extended Attributes (EA) interface. This EA interface is also virtualized -- if you write an attribute to file:///foo/bar and the kernel filesystem supports extended attributes, then the value will be saved in a kernel level EA using attr_set(3). On the other hand, if file:///foo/bar happens to exist on a network filesystem that does not support EA, then your value is saved in RDF by libferris. In both cases the value can be read again using an identical interface.
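The fallback behaviour described above can be sketched in a few lines of plain C++. This is only an illustration of the idea, not libferris code: AttrBackend and VirtualAttrs are hypothetical names, the "native" backend stands in for kernel EA, and the "fallback" backend stands in for the RDF store.

```cpp
#include <map>
#include <optional>
#include <string>

// Hypothetical backend; none of these names come from libferris.
struct AttrBackend {
    bool supported;                              // models "this filesystem supports EA"
    std::map<std::string, std::string> values;   // key: path + '\0' + attribute name

    bool set(const std::string& path, const std::string& name,
             const std::string& value) {
        if (!supported) return false;            // caller must try another backend
        values[path + '\0' + name] = value;
        return true;
    }

    std::optional<std::string> get(const std::string& path,
                                   const std::string& name) const {
        auto it = values.find(path + '\0' + name);
        if (it == values.end()) return std::nullopt;
        return it->second;
    }
};

// One interface for callers; the storage location is decided per write.
class VirtualAttrs {
public:
    explicit VirtualAttrs(bool nativeSupported)
        : native_{nativeSupported, {}}, fallback_{true, {}} {}

    void set(const std::string& path, const std::string& name,
             const std::string& value) {
        if (!native_.set(path, name, value))
            fallback_.set(path, name, value);    // stand-in for the RDF store
    }

    std::optional<std::string> get(const std::string& path,
                                   const std::string& name) const {
        if (auto v = native_.get(path, name)) return v;
        return fallback_.get(path, name);
    }

    bool storedNatively(const std::string& path,
                        const std::string& name) const {
        return native_.get(path, name).has_value();
    }

private:
    AttrBackend native_;
    AttrBackend fallback_;
};
```

A caller uses set() and get() the same way whether or not the underlying filesystem supports extended attributes; only storedNatively() reveals where the value ended up.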

Looking at filesystems in an abstract way -- a hierarchy of files, file contents, and metadata associated with files and directories as key-value pairs -- there is somewhat of a resemblance to the data model of XML. There are obvious differences, though: XML elements can have multiple text nodes as contents, an XML element does not need to have a unique name for each child XML element, and so on. In many cases it can be advantageous to smooth over the differences and view a filesystem as XML and vice versa. Over the years libferris has gained the ability to interact with its virtual filesystems as virtual Document Object Models (DOMs). The reverse is also true: you can take an xerces-c DOM and interact with it as a virtual filesystem. Using virtual DOMs makes it easy to create a view of a filesystem using a browser and XSLT. See xml.com for information on using XQuery against a libferris virtual filesystem.

The ability to mount XML and Berkeley db4 data as filesystems has long been a part of libferris. If you want to store a filesystem inside a platform independent format, then using XML is great, whereas the speed of individual file look up in a Berkeley db4 database of many, many file records can come in handy. Each format has its advantages, but they are all just virtual filesystems as far as libferris is concerned.

When a filesystem can offer what it likes through key-value pairs (EA) associated with files, relational databases can also be viewed as a virtual filesystem. Databases, views, tables and result sets become directories, tuples become files named by the value of their primary key, and the individual values of tuples are exposed as Extended Attributes on their tuple file. Again, PostgreSQL appears just like another virtual filesystem. For relational data there are a few caveats: for example, to create a new "file" in a table you must supply at least the primary key EA as well as any EA which are explicitly marked "not null" in the database.
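To make the mapping and its caveat concrete, here is a small toy model in C++. It is a sketch under stated assumptions and does not use the libferris API: TableDir, Attrs and createFile() are hypothetical names, a table is modelled as a directory, each tuple as a file named by its primary key value, and each column value as an attribute on that file.

```cpp
#include <map>
#include <set>
#include <stdexcept>
#include <string>

using Attrs = std::map<std::string, std::string>;

// Hypothetical model of a table viewed as a directory; not libferris API.
struct TableDir {
    std::string primaryKey;               // column whose value names each "file"
    std::set<std::string> notNull;        // columns declared NOT NULL
    std::map<std::string, Attrs> files;   // file name -> attributes (one tuple each)

    // Create a new "file" (tuple) from its attributes and return its name.
    // Throws if the primary key or any not-null attribute is missing.
    std::string createFile(const Attrs& ea) {
        auto pk = ea.find(primaryKey);
        if (pk == ea.end())
            throw std::invalid_argument("missing primary key attribute: " + primaryKey);
        for (const auto& column : notNull)
            if (ea.find(column) == ea.end())
                throw std::invalid_argument("missing not-null attribute: " + column);
        auto inserted = files.emplace(pk->second, ea);
        if (!inserted.second)
            throw std::invalid_argument("file already exists: " + pk->second);
        return pk->second;
    }
};
```

With primaryKey set to "id" and "name" in notNull, creating a file with attributes id=42 and name=Alice yields a file called "42", while an attempt that omits either attribute is rejected before anything is stored.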

Libferris will automatically mount many filesystems for the user. For example, if you try to read an XML file as though it is a directory, then libferris will implicitly mount it as one for you. This does blur the lines between what is a directory and what is a file in the system. There is some additional metadata that libferris makes available if you would like to avoid the automatic mounting. For example, if you do not wish to descend into XML files, then read the is-file metadata and, if it is true, do not attempt to descend into the file.
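The is-file check can be illustrated with a recursive walk over a toy tree. This is a sketch only: Node is a hypothetical stand-in for a libferris context, and representing a true is-file value as the string "1" is an assumption made for the example.

```cpp
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a libferris context; the real API differs.
struct Node {
    std::string name;
    std::map<std::string, std::string> ea;   // metadata, e.g. "is-file" -> "1"
    std::vector<Node> children;              // what reading it "as a directory" yields
};

// Assumption: a true is-file value is stored as the string "1".
bool isFile(const Node& n) {
    auto it = n.ea.find("is-file");
    return it != n.ea.end() && it->second == "1";
}

// Record every path visited, but never descend into something that
// reports itself as a file, so no implicit mount would be triggered.
void walk(const Node& n, const std::string& prefix,
          std::vector<std::string>& visited) {
    const std::string path = prefix + "/" + n.name;
    visited.push_back(path);
    if (isFile(n)) return;
    for (const Node& child : n.children)
        walk(child, path, visited);
}
```

In this model an XML file still has children available, but the walk stops at the file itself because the metadata is consulted before any attempt to read it as a directory.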

One of the motivations for creating libferris as a project of its own was to be able to expose, as a filesystem, anything that I felt could be interacted with in an interesting manner that way. So libferris can mount some things that folks might not think of as filesystems -- including Firefox, Emacs, DBus, LDAP, Evolution, Amarok, klipper, xmms, the X Window System and gphoto2.

The metadata plugins for libferris currently support extracting information from file formats automatically, for example, EXIF, XMP and ID3 tags. Metadata overlays are also supported, so you can see what tags you have associated with an image in f-spot through extended attributes in libferris. I use the term overlays because a central repository of tag data (in this case from f-spot) is scattered over an entire filesystem in libferris. The lower level metadata plugins handle more standard extended attributes usage, for example using attr_set(3) to store values or saving them in RDF.

Many of the standard utilities have been rewritten to use the native libferris API and take advantage of extra features it offers. Things like ls, cp, mv, rm, cat, io-redirection, touch, head and tail all have native libferris versions which are shipped with the main tarball. These also serve as code samples for how to use the libferris API. Extensions to the normal clients include the ability of ferrisls to output directory listings in XML, and the ability of ferriscp to use memory mapped IO as well as the more standard open(), read() and write() calls to perform the copy. When memory mapped IO is used this way, ferriscp also makes the madvise(2) MADV_SEQUENTIAL call to let the kernel select the correct caching policy.
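For readers unfamiliar with the technique, a memory mapped copy with a sequential-access hint looks roughly like the following on Linux. This is a minimal sketch using only POSIX calls, not the ferriscp source; the function name mmapCopy is made up for the example.

```cpp
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

#include <cerrno>
#include <cstddef>

// Copy src to dst by mapping src read-only and writing the mapping out.
// Returns true on success. A sketch only; this is not ferriscp's code.
bool mmapCopy(const char* src, const char* dst) {
    int in = open(src, O_RDONLY);
    if (in < 0) return false;

    struct stat st;
    if (fstat(in, &st) != 0) { close(in); return false; }

    int out = open(dst, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (out < 0) { close(in); return false; }

    bool ok = true;
    if (st.st_size > 0) {                    // mmap() rejects zero-length mappings
        const size_t len = static_cast<size_t>(st.st_size);
        void* map = mmap(nullptr, len, PROT_READ, MAP_PRIVATE, in, 0);
        if (map == MAP_FAILED) {
            ok = false;
        } else {
            // Advisory hint: the mapping will be read front to back.
            // A failure here is harmless, so the result is ignored.
            madvise(map, len, MADV_SEQUENTIAL);

            const char* p = static_cast<const char*>(map);
            size_t left = len;
            while (left > 0) {               // write() may be partial
                ssize_t n = write(out, p, left);
                if (n < 0) {
                    if (errno == EINTR) continue;
                    ok = false;
                    break;
                }
                p += n;
                left -= static_cast<size_t>(n);
            }
            munmap(map, len);
        }
    }
    close(in);
    if (close(out) != 0) ok = false;
    return ok;
}
```

MADV_SEQUENTIAL is only advice: the kernel may read ahead more aggressively and drop pages sooner after they are used, which suits a one-pass copy.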

The indexing support in libferris is also handled using plugins. Two different indexing plugin types exist: full text and metadata. There are two types of plugin because the strategy for creating an index can be quite different depending on whether you are searching for some words in a document's text or looking for files with certain metadata values. Using inverted files can be great for resolving a ranked full text query for "alice wonderland", but finding all files in either my home directory or /pictures that have been modified in December 2008 can be solved in a number of ways.
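As a reminder of what an inverted file is, the sketch below builds one in memory and answers a ranked query such as "alice wonderland". It is a toy illustration, not a libferris index plugin: InvertedIndex is a hypothetical class, tokens are split on whitespace and matched case-sensitively, and the score is simply the summed term frequency.

```cpp
#include <algorithm>
#include <map>
#include <set>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Toy inverted file: term -> (document id -> term frequency).
class InvertedIndex {
public:
    void add(int docId, const std::string& text) {
        std::istringstream in(text);
        std::string word;
        while (in >> word) postings_[word][docId]++;
    }

    // Ids of documents containing every query term, highest score first.
    // Ties are broken by ascending document id.
    std::vector<int> query(const std::string& q) const {
        std::istringstream in(q);
        std::set<std::string> terms;
        std::string word;
        while (in >> word) terms.insert(word);

        // docId -> (number of query terms matched, summed frequency)
        std::map<int, std::pair<int, int>> acc;
        for (const auto& term : terms) {
            auto it = postings_.find(term);
            if (it == postings_.end()) return {};   // a term no document has
            for (const auto& entry : it->second) {
                acc[entry.first].first++;
                acc[entry.first].second += entry.second;
            }
        }

        std::vector<std::pair<int, int>> ranked;    // (score, docId)
        for (const auto& entry : acc)
            if (entry.second.first == static_cast<int>(terms.size()))
                ranked.push_back({entry.second.second, entry.first});

        std::sort(ranked.begin(), ranked.end(),
                  [](const std::pair<int, int>& a, const std::pair<int, int>& b) {
                      if (a.first != b.first) return a.first > b.first;
                      return a.second < b.second;
                  });

        std::vector<int> result;
        for (const auto& r : ranked) result.push_back(r.second);
        return result;
    }

private:
    std::map<std::string, std::map<int, int>> postings_;
};
```

A metadata query such as "modified in December 2008" does not fit this structure well, since it is a range lookup on a value rather than a lookup by term, which is one reason a separate plugin type makes sense.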

There are currently indexing plugins for CLucene, Lucene, LDAP, federations of other libferris indexes, ODBC, PostgreSQL, Redland (RDF), Xapian, Beagle, Strigi and some custom designs. There are likely to be more index plugins explicitly designed to work on NAND Flash in the future. Those interested in indexing and libferris should see this article.

A major advantage of closely combining the index and search operations into the virtual filesystem is that anything the virtual filesystem can see can be indexed. When searches are performed you should also be able to interact with any of the results as a virtual filesystem. This avoids the issue where a discrete search library might return a URL that the client cannot do anything with.

So, what does it look like to code using libferris? Most objects in ferris are smart pointers, many using intrusive reference counting. The type for such objects is prefixed with "fh_" to indicate a ferris handle. The notion of files and directories is amalgamated into a single "Context" abstraction. To get a smart pointer to a filesystem path the Resolve() function is used. So without further ado, to get a file and its metadata with libferris:

// Resolve() returns a smart pointer to the file or directory
fh_context c  = Resolve( "~/myfile" );
{
  // open for read/write, truncating any old contents;
  // let the scope close it for me
  fh_iostream ss = c->getIOStream( ios::trunc );
  ss << "Bah!" << endl;
}
// std::string getStrAttr( fh_context, eaname, default-value, ... )
string filename = getStrAttr( c, "name", "" );
string md5sum   = getStrAttr( c, "md5", "" );
cout << "the filename should be myfile:" << filename << endl;
cout << "the md5 checksum is:" << md5sum << endl;
// write a new attribute, then read it back as a stream
setStrAttr( c, "foo", "bar" );
fh_attribute a = c->getAttribute("foo");
fh_istream ass = a->getIStream();
cout << "Getting the metadata again:";
copy( istreambuf_iterator<char>(ass),
      istreambuf_iterator<char>(),
      ostreambuf_iterator<char>(cout));
cout << endl;

Libferris is steadily gaining commercial interest. Currently I provide things like custom builds of libferris, explicit support for new test cases in the core regression test suite that are important to clients and of course extensions to libferris to perform a specific task that might be desired.

There are packages available for both 32 and 64-bit Fedora 8, 9 and Ubuntu 7.10 gutsy as well as 32-bit packages for openSUSE 10.3. Unfortunately there is currently a bug in building 64-bit stldb4 on openSUSE. Install the libferris-suite package to pull in all the dependencies.

Feel free to email the witme-ferris mailing list or add comments to this article suggesting any weird and wonderful (and obscure) filesystems you have experienced in the past. Though my libferris.TODO file always grows more than it shrinks, I'm always happy to add new and exciting suggestions near the top of it.

