[
https://issues.apache.org/jira/browse/JCR-3534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13608985#comment-13608985
]
Jukka Zitting commented on JCR-3534:
------------------------------------
Note that there is no guarantee that the userIds of the two repositories have
no relation with each other, so I don't think it should be included in a
mechanism like the one proposed.
Instead, a HMAC of the content identifier, signed by the underlying data store
should be good enough. The target repository can check that the message comes
from the same underlying data store, so anyone who has access to such a code
already has access to the related binary through one of the repositories
attached to that data store. Thus allowing the user to access the binary
doesn't reveal anything he or she couldn't already access by other means.
Ideally I'd see such a HMAC to be passed transparently as a part of Binary
instances acquired from a remote RMI or Davex connection to the source
repository. The target repository would automatically extract and evaluate the
information based on the type of the passed Binary instance, and could always
fall back to streaming the data if for example the HMAC doesn't match. The
client would use the pattern I outlined above, the only difference being that
repositoryA would be a remote RMI or Davex connection instead of a local
cluster node.
> Add JackrabbitSession.getValueByContentId method
> ------------------------------------------------
>
> Key: JCR-3534
> URL: https://issues.apache.org/jira/browse/JCR-3534
> Project: Jackrabbit Content Repository
> Issue Type: New Feature
> Components: jackrabbit-api, jackrabbit-core
> Affects Versions: 2.6
> Reporter: Felix Meschberger
> Attachments: JCR-3534.patch
>
>
> we have a couple of use cases, where we would like to leverage the global
> data store to prevent sending around and copying around large binary data
> unnecessarily: We have two separate Jackrabbit instances configured to use
> the same DataStore (for the sake of this discussion assume we have the
> problems of concurrent access and garbage collection under control). When
> sending content from one instance to the other instance we don't want to send
> potentially large binary data (e.g. video files) if not needed.
> The idea is for the sender to just send the content identity from
> JackrabbitValue.getContentIdentity(). The receiver would then check whether
> the such content already exists and would reuse if so:
> String ci = contentIdentity_from_sender;
> try {
> Value v = session.getValueByContentIdentity(ci);
> Property p = targetNode.setProperty(propName, v);
> } catch (ItemNotFoundException ie) {
> // unknown or invalid content Identity
> } catch (RepositoryException re) {
> // some other exception
> }
> Thus the proposed JackrabbitSession.getValueByContentIdentity(String) method
> would allow for round tripping the JackrabbitValue.getContentIdentity()
> preventing superfluous binary data copying and moving.
> See also the dev@ thread
> http://jackrabbit.markmail.org/thread/gedk5jsrp6offkhi
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira