Hi,

On 26 Apr. 07, at 10:46, Olivier Grisel wrote:
We will probably soon need to store checksums of file attachments as part of a customer project. The goal is to be able to quickly find dupes when importing a bunch of files from a file-system folder into a Nuxeo workspace. I wondered if it would be relevant to add SHA (or MD5?) checksums by default in Nuxeo, either as a new Blob feature or in a dedicated field of the file schema computed by a core event listener.
Yes it is, as long as the checksum is computed asynchronously after an upload. It can definitely open up many possible use cases for the platform, especially when dealing with large files. So go ahead for NXP, and don't forget to add the corresponding JIRA ticket. :-)
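
To make the "asynchronously after an upload" point concrete, here is a minimal sketch in plain Java. It is only an illustration under assumptions: ChecksumSketch, hexDigest and hexDigestAsync are hypothetical names, nothing here is Nuxeo API, and a real implementation would presumably hook into a core event listener as Olivier suggests. The stream is read in small chunks so large files never have to fit in memory.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.concurrent.CompletableFuture;

// Hypothetical helper, not part of any Nuxeo API.
class ChecksumSketch {

    // Reads the stream chunk by chunk and returns the digest as lowercase hex.
    // "algorithm" is a standard JCA name such as "SHA-1" or "MD5".
    static String hexDigest(InputStream in, String algorithm) {
        MessageDigest md;
        try {
            md = MessageDigest.getInstance(algorithm);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalArgumentException("Unknown digest algorithm: " + algorithm, e);
        }
        byte[] buffer = new byte[8192];
        try {
            int read;
            while ((read = in.read(buffer)) != -1) {
                md.update(buffer, 0, read);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : md.digest()) {
            hex.append(String.format("%02x", b & 0xff));
        }
        return hex.toString();
    }

    // Runs the computation on the common fork-join pool so that the caller
    // (for example the upload request thread) is not blocked.
    // The stream is closed once the digest has been computed.
    static CompletableFuture<String> hexDigestAsync(InputStream in, String algorithm) {
        return CompletableFuture.supplyAsync(() -> {
            try (InputStream stream = in) {
                return hexDigest(stream, algorithm);
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        });
    }
}
```

The caller would typically chain a callback on the returned future (thenAccept) to store the checksum in whatever field ends up holding it, rather than waiting for the result.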

Possible usage:

- search / query for documents by checksum, provided that the checksum field is
  indexed by the search service;
- invalidation key for a hypothetical transform service cache;
- making it easier to do integrity checks on the client side by having some
  browser plugin sign the checksum instead of the complete binary;
- additional metadata displayed in the UI so that geeks can check the integrity of their latest Prison Break^W^W^W Wikipedia ISO before burning it to a DVD.
:-)
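
For the first use case (lookup by checksum, which is also what finding dupes at import time boils down to), a rough sketch could look like the following. Everything in it is an assumption made for illustration: ChecksumIndexSketch, register and findByChecksum are hypothetical names, and the in-memory map only stands in for an indexed checksum field in the search service.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory stand-in for an indexed checksum field.
class ChecksumIndexSketch {

    // checksum (hex) -> ids of the documents whose attachment has that checksum
    private final Map<String, List<String>> idsByChecksum = new HashMap<>();

    // SHA-1 of the content as lowercase hex.
    static String sha1Hex(byte[] content) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-1");
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest(content)) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-1 is one of the algorithms every Java platform must provide.
            throw new IllegalStateException(e);
        }
    }

    // Registers a document and returns the ids of documents registered
    // earlier with the same checksum, i.e. the probable duplicates.
    List<String> register(String docId, byte[] content) {
        List<String> ids = idsByChecksum.computeIfAbsent(sha1Hex(content), k -> new ArrayList<>());
        List<String> duplicates = new ArrayList<>(ids);
        ids.add(docId);
        return duplicates;
    }

    // Query by checksum; returns a copy so callers cannot alter the index.
    List<String> findByChecksum(String checksum) {
        return new ArrayList<>(idsByChecksum.getOrDefault(checksum, Collections.emptyList()));
    }
}
```

A matching checksum only says the contents are very probably identical. Neither MD5 nor SHA-1 resists deliberately crafted collisions, so an importer that cannot trust its input may want a byte-by-byte comparison before treating two files as the same.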

Bogdan: what do you think about the potential impact of this feature on the planned refactoring of the schema for blobs?


Thanks,

EB.

--
Éric Barroca - Ex. VP of Operations - +33 6 21 74 77 64
www.nuxeo.com - Nuxeo: Open Source ECM - www.nuxeo.org
Nuxeo EP 5: extensible, Java EE and standards based ECM Platform !


_______________________________________________
ECM mailing list
[email protected]
http://lists.nuxeo.com/mailman/listinfo/ecm
