[jira] [Commented] (HDFS-4523) Fix concat for snapshots

Tsz Wo (Nicholas), SZE (JIRA) Mon, 25 Feb 2013 15:06:13 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13586427#comment-13586427
 ]


Tsz Wo (Nicholas), SZE commented on HDFS-4523:
----------------------------------------------

The original files has to be set up specifically for concat.  It is not like 
that you can concat on any set of files.

On the other hand, we may fail concat if the transient files are in some 
snapshots.  However, these transient files will be remained in the system until 
the snapshots are deleted.  This is the inefficiency I am talking about.
                
> Fix concat for snapshots
> ------------------------
>
>                 Key: HDFS-4523
>                 URL: https://issues.apache.org/jira/browse/HDFS-4523
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: namenode
>            Reporter: Tsz Wo (Nicholas), SZE
>            Assignee: Tsz Wo (Nicholas), SZE
>         Attachments: h4523_20130222.patch, h4523_20130223.patch, 
> h4523_20130225.patch
>
>
> The use case of concat is for copying large files across clusters using the 
> following steps.
> - Step 1: The blocks of a file in the source cluster are copied in parallel 
> to transient files in the destination cluster.
> - Step 2: Then the transient files in the destination cluster are 
> concatenated in order to obtain the original file.
> If a snapshot is taken in the destination cluster before Step 2, some 
> transient files may be captured in the snapshot.  These transient files 
> should be removed in Step 2.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HDFS-4523) Fix concat for snapshots

Reply via email to