[ https://issues.apache.org/jira/browse/OAK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15343021#comment-15343021 ]
Dheeraj Khanna commented on OAK-3140:
-------------------------------------
[~tmueller]
{quote}Disable calculating the content hash (de-duplication) for some binaries.
{quote}
Would it make sense to calculate the hash before uploading the binary itself,
maybe in some kind of preprocessing step?
Use case: for large assets (>5GB), the file is uploaded first, and only then
does the user learn that it is a duplicate. If the check could be done in
advance, the user would not have to wait for a large upload to finish (which
could take many minutes) to get this information.
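To make the idea concrete, here is a minimal sketch of such a pre-upload check. It is only an illustration: the class name PreUploadHashCheck, the method names, the in-memory set of known hashes, and the choice of SHA-256 are all assumptions and not part of Oak; the digest would have to match whatever the actual DataStore uses for its content identifiers.

```java
import java.io.IOException;
import java.io.InputStream;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Set;

// Hypothetical helper, not an Oak class.
class PreUploadHashCheck {

    /** Streams the input through SHA-256 and returns the digest as lowercase hex. */
    static String contentHash(InputStream in) throws IOException {
        MessageDigest digest;
        try {
            digest = MessageDigest.getInstance("SHA-256"); // assumed algorithm
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
        byte[] buffer = new byte[8192];
        int n;
        // Read in chunks so that a multi-GB file never has to fit in memory.
        while ((n = in.read(buffer)) != -1) {
            digest.update(buffer, 0, n);
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    /**
     * Returns true if the locally computed hash is already known.
     * knownHashes stands in for a lookup against the repository.
     */
    static boolean isDuplicate(Set<String> knownHashes, InputStream in) throws IOException {
        return knownHashes.contains(contentHash(in));
    }
}
```

The client would hash the local file first, ask the repository whether that hash exists, and start the upload only if it does not. Reading a 5GB file locally is typically much faster than sending it over the network.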
> DataStore / BlobStore: add a method to pass a "type" when writing
> -----------------------------------------------------------------
>
> Key: OAK-3140
> URL: https://issues.apache.org/jira/browse/OAK-3140
> Project: Jackrabbit Oak
> Issue Type: New Feature
> Components: blob
> Reporter: Thomas Mueller
> Assignee: Thomas Mueller
> Labels: performance
>
> Currently, the BlobStore interface has a method "String writeBlob(InputStream
> in)". This issue is about adding a new method "String writeBlob(String type,
> InputStream in)", for the following reasons (in no particular order):
> * Store some binaries (for example Lucene index files) in a different place,
> in order to safely and quickly run garbage collection just on those files.
> * Store some binaries in slow storage and some in fast storage (or location).
> * Disable calculating the content hash (de-duplication) for some binaries.
> * Store some binaries in a shared storage (for fast cross-repository
> copying), and some in local storage.
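The following is a rough sketch of how a typed write could behave, under stated assumptions. Only the two writeBlob signatures come from the issue description; the class TypedBlobStoreSketch, the in-memory maps, the "default" type, the noDedupTypes set, and the returned "type:key" identifier format are hypothetical and do not reflect the actual Oak implementation.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.UUID;

// Hypothetical in-memory illustration, not an Oak class.
class TypedBlobStoreSketch {

    // One separate store per type, so a type can be cleaned up on its own.
    private final Map<String, Map<String, byte[]>> storesByType = new HashMap<>();
    // Types for which the content hash (de-duplication) is skipped.
    private final Set<String> noDedupTypes;

    TypedBlobStoreSketch(Set<String> noDedupTypes) {
        this.noDedupTypes = noDedupTypes;
    }

    /** Existing method: falls back to an assumed "default" type. */
    String writeBlob(InputStream in) throws IOException {
        return writeBlob("default", in);
    }

    /** Proposed method: the type selects the store and the de-duplication policy. */
    String writeBlob(String type, InputStream in) throws IOException {
        byte[] data = readAll(in);
        String key = noDedupTypes.contains(type)
                ? UUID.randomUUID().toString() // no hashing, every write is a new blob
                : sha256Hex(data);             // same content maps to the same key
        storesByType.computeIfAbsent(type, t -> new HashMap<>()).put(key, data);
        return type + ":" + key;
    }

    /** Number of blobs currently held for a type. */
    int blobCount(String type) {
        Map<String, byte[]> store = storesByType.get(type);
        return store == null ? 0 : store.size();
    }

    /** Drops every blob of one type without touching the others. */
    void deleteAll(String type) {
        storesByType.remove(type);
    }

    private static byte[] readAll(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }

    private static String sha256Hex(byte[] data) {
        try {
            StringBuilder hex = new StringBuilder();
            for (byte b : MessageDigest.getInstance("SHA-256").digest(data)) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }
}
```

In this sketch, writing the same bytes twice under a de-duplicated type yields one stored blob, while a type listed in noDedupTypes (for example one used for Lucene index files) stores each write separately and can be removed as a whole with deleteAll.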