On Apr 26, 2007, at 11:46 AM, Olivier Grisel wrote:

Hi,

We will probably soon need to store checksums of file attachments as part of a customer project. The goal is to be able to quickly find dupes when importing a
bunch of files from a file-system folder to a nuxeo workspace.

I wondered if it would be relevant to add sha (or md5 ?) checksums by default in nuxeo, either as a Blob new feature or in a dedicated field of the file schema
computed by a core event listener.

A dedicated field would make it trivial to leverage with the SearchService. I don't have much to say about other aspects. Maybe make that unpluggable for highly loaded sites that don't care that much
about integrity.


Possible usage:

- search / query for documents by checksum provided that the checksum field is
indexed by the search service;
 - invalidation key for the transform service hypothetical cache;
- making it easier to do integrity checks on the client side by having some
browser plugin on sign the checksum instead of the complete binary;
- additional metadata displayed in the UI so that geeks can check the integrity of their latest Prison Break^W^W^W Wikipedia iso before burning it to a DVD;
...


Cool.


----
Georges Racinet,   Nuxeo SAS
Open Source Enterprise Content Management (ECM)
Web: http://www.nuxeo.com/ and http://www.nuxeo.org/ - Tel: +33 1 40 33 79 87



_______________________________________________
ECM mailing list
[email protected]
http://lists.nuxeo.com/mailman/listinfo/ecm

Reply via email to