On Apr 26, 2007, at 11:46 AM, Olivier Grisel wrote:
Hi,
We will probably soon need to store checksums of file attachments
as part of a
customer project. The goal is to be able to quickly find dupes when
importing a
bunch of files from a file-system folder to a nuxeo workspace.
I wondered if it would be relevant to add sha (or md5 ?) checksums
by default in
nuxeo, either as a Blob new feature or in a dedicated field of the
file schema
computed by a core event listener.
A dedicated field would make it trivial to leverage with the
SearchService. I don't have much to
say about other aspects. Maybe make that unpluggable for highly
loaded sites that don't care that much
about integrity.
Possible usage:
- search / query for documents by checksum provided that the
checksum field is
indexed by the search service;
- invalidation key for the transform service hypothetical cache;
- making it easier to do integrity checks on the client side by
having some
browser plugin on sign the checksum instead of the complete binary;
- additional metadata displayed in the UI so that geeks can check
the integrity
of their latest Prison Break^W^W^W Wikipedia iso before burning it
to a DVD;
...
Cool.
----
Georges Racinet, Nuxeo SAS
Open Source Enterprise Content Management (ECM)
Web: http://www.nuxeo.com/ and http://www.nuxeo.org/ - Tel: +33 1 40
33 79 87
_______________________________________________
ECM mailing list
[email protected]
http://lists.nuxeo.com/mailman/listinfo/ecm