[
https://issues.apache.org/jira/browse/HADOOP-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
LinJi updated HADOOP-15838:
---------------------------
Attachment: (was: 微信截图_20181010224330.png)
> Copy files from SFTP to HDFS using DistCp failed with error
> -----------------------------------------------------------
>
> Key: HADOOP-15838
> URL: https://issues.apache.org/jira/browse/HADOOP-15838
> Project: Hadoop Common
> Issue Type: Bug
> Components: tools/distcp
> Affects Versions: 2.5.0, 2.7.2
> Environment: Hadoop 2.5.0 + kerberos
> Reporter: LinJi
> Priority: Major
> Labels: distcp, hadoop-tools
> Fix For: 2.7.5
>
> Original Estimate: 96h
> Remaining Estimate: 96h
>
> 1. When I run command:
> {code:java}
> hadoop distcp sftp://mysftp:1qaz_@[email protected]:/upload/hosts
> /tmp/JOY{code}
>
> I got error like:
>
> {noformat}
> 2018-10-10 22:31:37,799 INFO util.KerberosUtil: Using principal pattern:
> HTTP/_HOST
> 2018-10-10 22:31:39,055 INFO tools.DistCp: Input Options:
> DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false,
> ignoreFailures=false, maxMaps=20, sslConfigurationFile='null',
> copyStrategy='uniformsize', sourceFileListing=null,
> sourcePaths=[sftp://mysftp:1qaz_@[email protected]:/upload/hosts],
> targetPath=/tmp/JOY, targetPathExists=false}
> 2018-10-10 22:31:39,365 ERROR tools.DistCp: Exception encountered
> java.io.IOException: Invalid host specified
> at
> org.apache.hadoop.fs.sftp.SFTPFileSystem.initialize(SFTPFileSystem.java:67)
> at
> org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2591)
> at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89)
> at
> org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2643)
> at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2625)
> at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368)
> at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)
> at
> org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:76)
> at
> org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
> at
> org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
> at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
> at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
> at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
> {noformat}
>
> 2. When I run command:
> {code:java}
> hadoop distcp sftp://mysftp:1qaz_%[email protected]:/upload/hosts
> /tmp/JOY{code}
> I got error like:
> {noformat}
> 2018-10-10 22:31:59,909 INFO util.KerberosUtil: Using principal pattern:
> HTTP/_HOST
> 2018-10-10 22:32:01,286 INFO tools.DistCp: Input Options:
> DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false,
> ignoreFailures=false, maxMaps=20, sslConfigurationFile='null',
> copyStrategy='uniformsize', sourceFileListing=null,
> sourcePaths=[sftp://mysftp:1qaz_%[email protected]:/upload/hosts],
> targetPath=/tmp/JOY, targetPathExists=false}
> 2018-10-10 22:32:02,190 ERROR tools.DistCp: Exception encountered
> java.io.IOException: SSH_MSG_DISCONNECT: 2 Too many authentication failures
> for mysftp
> at
> org.apache.hadoop.fs.sftp.SFTPFileSystem.connect(SFTPFileSystem.java:143)
> at
> org.apache.hadoop.fs.sftp.SFTPFileSystem.getFileStatus(SFTPFileSystem.java:371)
> at org.apache.hadoop.fs.Globber.getFileStatus(Globber.java:57)
> at org.apache.hadoop.fs.Globber.glob(Globber.java:252)
> at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:1623)
> at
> org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:77)
> at
> org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
> at
> org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
> at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
> at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
> at org.apache.hadoop.tools.DistCp.main(DistCp.java:401){noformat}
> The SFTP username is mysftp
> password is 1qaz_@WSX
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]