On Tue, Oct 7, 2014 at 11:29 AM, Fernando Cacciola <
fernando.cacci...@gmail.com> wrote:

>
> I do realise of course that maintaining a gigantic master index of all
> files is not applicable to the real world, but nonetheless I think the
> general form of the solution (split identity from location) is worth being
> considered.
>
>
> OTOH, it just ocurred to me that this can be made to scale to the real
world in the following way:

A file could be formally classified a standalone or multiply-referenced.
A standalone file would be one which doesn't have an entry in the master
index.
A multiply-referenced file would be listed in the master index, given a
unique id and mapped to its current location.

When you create a new link, the file is given a unique id, added to the
master index and the link is associated to the target's id.
The master index can even have a reference count allowing the file to
transition to standalone state when the last link is removed.

This way the master index size is kept bounded by the number of effective
links in the file system.


Best

-- 
Fernando Cacciola
SciSoft Consulting, Founder
http://www.scisoft-consulting.com
_______________________________________________
fonc mailing list
fonc@vpri.org
http://vpri.org/mailman/listinfo/fonc

Reply via email to