[ 
https://issues.apache.org/jira/browse/OAK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15343021#comment-15343021
 ] 

Dheeraj Khanna commented on OAK-3140:
-------------------------------------

[~tmueller]
{quote}Disable calculating the content hash (de-duplication) for some binaries. 
{quote}
Would it make sense to calculate the hash before uploading the binary itself, 
may be using some kind of preprocessing. 
Use case: in case of large assets (>5GB) the file gets uploaded first and then 
the user gets to know that this is a duplicate file, if this can be done in 
advance, the user will not have to wait for a large upload to finish (which 
could take many minutes) to get this information.

> DataStore / BlobStore: add a method to pass a "type" when writing
> -----------------------------------------------------------------
>
>                 Key: OAK-3140
>                 URL: https://issues.apache.org/jira/browse/OAK-3140
>             Project: Jackrabbit Oak
>          Issue Type: New Feature
>          Components: blob
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>              Labels: performance
>
> Currently, the BlobStore interface has a method "String writeBlob(InputStream 
> in)". This issue is about adding a new method "String writeBlob(String type, 
> InputStream in)", for the following reasons (in no particular order):
> * Store some binaries (for example Lucene index files) in a different place, 
> in order to safely and quickly run garbage collection just on those files.
> * Store some binaries in a slow, some in a fast storage or location.
> * Disable calculating the content hash (de-duplication) for some binaries.
> * Store some binaries in a shared storage (for fast cross-repository 
> copying), and some in local storage.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to