[ https://issues.apache.org/jira/browse/HDFS-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13586246#comment-13586246 ]
Aaron T. Myers commented on HDFS-4523:
--------------------------------------
Hi Nicholas, unless I'm misunderstanding something, I question the premise of
this JIRA. Judging by the description, it sounds like you're saying that when a
concat(...) is performed on a set of files after those files have been included
in a snapshot, the snapshot should be modified to include the final concat'ed
file instead of the constituent pre-concat files. That would seem to violate
the "read-only" premise of snapshots, which doesn't seem like the correct
behavior to me.
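As a toy illustration of the read-only expectation, here is a minimal in-memory
sketch. It is not HDFS code: the ToyNamespace class, its methods, and the paths
are all hypothetical, and it only assumes that concat moves the source files'
blocks onto the target and then removes the sources.

```python
# Toy in-memory model, not HDFS code: every name here is hypothetical.

class ToyNamespace:
    def __init__(self):
        self.files = {}      # path -> list of block ids (live namespace)
        self.snapshots = {}  # snapshot name -> frozen copy of the namespace

    def create(self, path, blocks):
        self.files[path] = list(blocks)

    def snapshot(self, name):
        # Copy every block list so later changes cannot leak into the snapshot.
        self.snapshots[name] = {p: tuple(b) for p, b in self.files.items()}

    def concat(self, target, sources):
        # Move each source's blocks onto the end of target, then drop the source.
        for src in sources:
            self.files[target].extend(self.files.pop(src))


ns = ToyNamespace()
ns.create("/d/part-0", ["b0"])
ns.create("/d/part-1", ["b1"])
ns.create("/d/part-2", ["b2"])
ns.snapshot("s1")                                   # taken before the concat
ns.concat("/d/part-0", ["/d/part-1", "/d/part-2"])

live = ns.files             # only the concat'ed file remains
snap = ns.snapshots["s1"]   # still lists the three pre-concat files
```

Under this model the live namespace ends up with a single file holding all
three blocks, while snapshot s1 keeps the three constituent files exactly as
they were when it was taken.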
Please let me know if I misunderstood the intent of this JIRA.
Thanks.
> Fix concat for snapshots
> ------------------------
>
> Key: HDFS-4523
> URL: https://issues.apache.org/jira/browse/HDFS-4523
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: namenode
> Reporter: Tsz Wo (Nicholas), SZE
> Assignee: Tsz Wo (Nicholas), SZE
> Attachments: h4523_20130222.patch, h4523_20130223.patch
>
>
> The use case of concat is for copying large files across clusters using the
> following steps.
> - Step 1: The blocks of a file in the source cluster are copied in parallel
> to transient files in the destination cluster.
> - Step 2: Then the transient files in the destination cluster are
> concatenated in order to obtain the original file.
> If a snapshot is taken in the destination cluster before Step 2, some
> transient files may be captured in the snapshot. These transient files
> should be removed in Step 2.
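
The two steps in the description above can be sketched on a local file system
as follows. This is only an illustration under stated assumptions, not how HDFS
or any copy tool implements it: BLOCK_SIZE, the part-NNNNN.tmp naming, and the
copy_block, concat, and copy_file helpers are all made up for the example.

```python
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

BLOCK_SIZE = 4  # bytes; tiny on purpose (hypothetical value)


def copy_block(src, dst_dir, index):
    """Step 1: copy one block of src into its own transient file."""
    with src.open("rb") as f:
        f.seek(index * BLOCK_SIZE)
        data = f.read(BLOCK_SIZE)
    part = dst_dir / f"part-{index:05d}.tmp"
    part.write_bytes(data)
    return part


def concat(target, parts):
    """Step 2: append the parts to target in order, then remove them."""
    with target.open("ab") as out:
        for part in parts:
            out.write(part.read_bytes())
            part.unlink()


def copy_file(src, dst_dir):
    n_blocks = -(-src.stat().st_size // BLOCK_SIZE)  # ceiling division
    with ThreadPoolExecutor() as pool:
        # pool.map returns results in input order, so parts stay ordered.
        parts = list(pool.map(lambda i: copy_block(src, dst_dir, i),
                              range(n_blocks)))
    target = dst_dir / src.name
    target.touch()
    concat(target, parts)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "big.bin"
        src.write_bytes(b"0123456789")
        dst_dir = Path(tmp) / "dst"
        dst_dir.mkdir()
        result = copy_file(src, dst_dir)
        print(result.read_bytes() == src.read_bytes())
```

In this sketch, anything that records the contents of dst_dir between the
parallel copy and the concat call would see the part-NNNNN.tmp files, which is
the window the description is concerned with.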