[ 
https://issues.apache.org/jira/browse/HDFS-12295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16137674#comment-16137674
 ] 

Chris Douglas commented on HDFS-12295:
--------------------------------------

If the subtree under {{/.reserved/bypassExtAttr}} is read-only, that should 
address many of the issues that [~daryn] raised. As long as it's only the split 
generation that's using this API, that limits the cases that break when this 
feature is used.

The requirement for this feature, that any user can perform backup-style copies 
using distcp, may be too broad. Your objective is to avoid cluttering the 
destination namesystem with xattrs from the external attribute provider at the 
source. Relying on _all_ users to set this flag correctly is unlikely to 
achieve this. What you want is the opposite: copying data between these 
clusters, by default, should take the path that reads the raw xattrs.

The less-invasive solutions attempt to relax the requirement that all users run 
distcp directly. While the user-facing solution satisfies all the requirements, 
it relies on cooperative users. Would it be feasible to add a layer of 
indirection in the deployments that need this functionality? If so, then we can 
make inter-cluster copies available to all users, without changing the 
internals of HDFS.

[Repeating|https://issues.apache.org/jira/browse/HDFS-12202?focusedCommentId=16120861&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16120861]
 from HDFS-12202, the {{distcp}} command can be swapped out in 3.x. In 
deployments with this requirement, users can contact a service to schedule an 
inter-cluster transfer. The backup user that such a service runs as could not 
only be special-cased in the NameNode plugin, it could also help users avoid 
copying data from encryption zones into unprotected clusters (HDFS-6509).

If that's not feasible, can this use case be supported by extending 
MAPREDUCE-6007? If the src/dst are under {{/.reserved/raw}}, then omitting the 
external attribute provider is reasonable behavior.
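
To make that last suggestion concrete, here is a minimal sketch of the check, not code from any patch. The class and method names are hypothetical, and requiring _both_ src and dst to be under {{/.reserved/raw}} is an assumption of this sketch; the actual condition would need to be settled on the JIRA.

```java
class RawPathCheck {
    // Reserved prefix named in the comment above.
    static final String RAW_PREFIX = "/.reserved/raw";

    // True when the path is /.reserved/raw itself or lies beneath it.
    // A sibling such as /.reserved/rawdata does not match.
    static boolean isUnderRaw(String path) {
        return path.equals(RAW_PREFIX) || path.startsWith(RAW_PREFIX + "/");
    }

    // Assumption for this sketch: the external attribute provider is
    // omitted only when BOTH ends of the copy are raw paths.
    static boolean shouldBypassProvider(String src, String dst) {
        return isUnderRaw(src) && isUnderRaw(dst);
    }

    public static void main(String[] args) {
        System.out.println(shouldBypassProvider(
            "/.reserved/raw/data/in", "/.reserved/raw/backup/in"));
        System.out.println(shouldBypassProvider(
            "/data/in", "/.reserved/raw/backup/in"));
    }
}
```

With a rule like this, ordinary (non-raw) copies keep consulting the provider, so no per-user flag is needed.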

> NameNode to support file path prefix /.reserved/bypassExtAttr
> -------------------------------------------------------------
>
>                 Key: HDFS-12295
>                 URL: https://issues.apache.org/jira/browse/HDFS-12295
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: hdfs, namenode
>            Reporter: Yongjun Zhang
>            Assignee: Yongjun Zhang
>         Attachments: HDFS-12295.001.patch, HDFS-12295.001.patch
>
>
> Let the NameNode support the prefix /.reserved/bypassExtAttr, so that a client 
> can add this prefix to a path before calling getFileStatus, e.g. /a/b/c becomes 
> /.reserved/bypassExtAttr/a/b/c. The NN will parse the path at the very beginning 
> and bypass the external attribute provider if the prefix is present.
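
The path rewriting described in the issue can be sketched as plain string handling. This is only an illustration of the proposal: the class and method names are hypothetical, and it does not reflect how the attached patch parses paths inside the NameNode.

```java
class BypassPrefix {
    // Prefix proposed in the issue description.
    static final String PREFIX = "/.reserved/bypassExtAttr";

    // True when the path is the prefix itself or lies beneath it.
    static boolean hasPrefix(String path) {
        return path.equals(PREFIX) || path.startsWith(PREFIX + "/");
    }

    // Client side: prepend the prefix to an absolute path (idempotent).
    static String addPrefix(String path) {
        if (!path.startsWith("/")) {
            throw new IllegalArgumentException("path must be absolute: " + path);
        }
        if (hasPrefix(path)) {
            return path;
        }
        return path.equals("/") ? PREFIX : PREFIX + path;
    }

    // Server side: recover the real path once the prefix has been detected.
    static String stripPrefix(String path) {
        if (!hasPrefix(path)) {
            return path;
        }
        String rest = path.substring(PREFIX.length());
        return rest.isEmpty() ? "/" : rest;
    }

    public static void main(String[] args) {
        String prefixed = addPrefix("/a/b/c");
        System.out.println(prefixed);
        System.out.println(stripPrefix(prefixed));
    }
}
```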



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
