[ 
https://issues.apache.org/jira/browse/HADOOP-13650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15788216#comment-15788216
 ] 

Chris Nauroth commented on HADOOP-13650:
----------------------------------------

bq. Would you mind to give me advice how to generate 
{{libexec/tools/hadoop-distcp.sh}}?

Hello [~eddyxu].  Here is a general recipe that should work:
# Create file hadoop-aws/src/main/shellprofile.d/hadoop-aws.sh.  The code of 
this shell profile would need to add hadoop-aws.jar and its dependencies to the 
classpath and also call something like {{hadoop_add_subcommand "s3a" "S3A 
Utilities"}} and define {{function hadoop_subcommand_s3a}}.
# Update hadoop-assemblies/src/main/resources/assemblies/hadoop-tools.xml so 
that the assembly copies the new file to libexec/shellprofile.d when building 
the distro.  You can probably copy and adapt the XML stanza that already does 
it for hadoop-distcp.  This is sufficient to land it into the distro as an 
optional shell profile, which individual deployments or users would have to 
enable explicitly.
# To turn it into a "built-in" like DistCp, which is always on the classpath 
and command set for every deployment, then you would need an additional step.  
Edit hadoop-tools/hadoop-aws/pom.xml and add a copy-dependencies execution.  
You can probably copy and adapt the XML stanza that already does it for 
hadoop-distcp.  Look for "tools-builtin" in hadoop-tools/hadoop-distcp/pom.xml. 
 This will cause it to land in libexec/tools and make it a built-in.

However, I'm not sure we really want it to be a built-in.  Doing so would put 
extra jars onto the classpath for all Hadoop deployments, even for users that 
won't use S3A at all.  That would have the usual side effects of bloating the 
classpath and possibly causing dependency management challenges for end users 
that weren't expecting to receive those jars.

> S3Guard: Provide command line tools to manipulate metadata store.
> -----------------------------------------------------------------
>
>                 Key: HADOOP-13650
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13650
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>            Reporter: Lei (Eddy) Xu
>            Assignee: Lei (Eddy) Xu
>         Attachments: HADOOP-13650-HADOOP-13345.000.patch, 
> HADOOP-13650-HADOOP-13345.001.patch, HADOOP-13650-HADOOP-13345.002.patch
>
>
> Similar systems like EMRFS has the CLI tools to manipulate the metadata 
> store, i.e., create or delete metadata store, or {{import}}, {{sync}} the 
> file metadata between metadata store and S3. 
> http://docs.aws.amazon.com//ElasticMapReduce/latest/ReleaseGuide/emrfs-cli-reference.html
> S3Guard should offer similar functionality. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to