LinJi created HADOOP-15838:
------------------------------
Summary: Copy files from SFTP to HDFS using DistCp failed with error
Key: HADOOP-15838
URL: https://issues.apache.org/jira/browse/HADOOP-15838
Project: Hadoop Common
Issue Type: Bug
Components: tools/distcp
Affects Versions: 2.7.2, 2.5.0
Environment: Hadoop 2.5.0 + kerberos
Reporter: LinJi
Fix For: 2.7.5
Attachments: 微信截图_20181010224316.png, 微信截图_20181010224330.png
1. When I run the command:
{code:java}
hadoop distcp sftp://mysftp:1qaz_@WSX@<sftp-host>:/upload/hosts /tmp/JOY{code}
I got the following error:
{noformat}
2018-10-10 22:31:37,799 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
2018-10-10 22:31:39,055 INFO tools.DistCp: Input Options:
DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false,
ignoreFailures=false, maxMaps=20, sslConfigurationFile='null',
copyStrategy='uniformsize', sourceFileListing=null,
sourcePaths=[sftp://mysftp:1qaz_@WSX@<sftp-host>:/upload/hosts],
targetPath=/tmp/JOY, targetPathExists=false}
2018-10-10 22:31:39,365 ERROR tools.DistCp: Exception encountered
java.io.IOException: Invalid host specified
    at org.apache.hadoop.fs.sftp.SFTPFileSystem.initialize(SFTPFileSystem.java:67)
    at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2591)
    at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89)
    at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2643)
    at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2625)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368)
    at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)
    at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:76)
    at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
    at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
    at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
    at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
{noformat}
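One possible explanation for this first error, offered as a sketch rather than a confirmed diagnosis: java.net.URI treats the first '@' in the authority as the end of the user info, so the remainder ("WSX@host:") is not a valid hostname and getHost() returns null. An "Invalid host specified" error from SFTPFileSystem.initialize would be consistent with a null host. The host name sftp.example.com below is a placeholder, not the real server.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class RawAtInPassword {
    // Returns the host that java.net.URI extracts from the given string (may be null).
    static String hostOf(String uri) throws URISyntaxException {
        return new URI(uri).getHost();
    }

    public static void main(String[] args) throws URISyntaxException {
        // Placeholder host; the password contains an unencoded '@'.
        String raw = "sftp://mysftp:1qaz_@WSX@sftp.example.com:/upload/hosts";
        URI u = new URI(raw);
        // The authority cannot be split into user-info, host, and port,
        // so only the raw authority string is available.
        System.out.println("authority = " + u.getAuthority());
        System.out.println("userInfo  = " + u.getUserInfo());
        System.out.println("host      = " + hostOf(raw));
    }
}
```

If the host prints as null here, the URI never reaches the point where a user name or password is tried, which matches the failure happening in initialize rather than in connect.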
2. When I run the command:
{code:java}
hadoop distcp sftp://mysftp:1qaz_%40WSX@<sftp-host>:/upload/hosts /tmp/JOY{code}
I got the following error:
{noformat}
2018-10-10 22:31:59,909 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
2018-10-10 22:32:01,286 INFO tools.DistCp: Input Options:
DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false,
ignoreFailures=false, maxMaps=20, sslConfigurationFile='null',
copyStrategy='uniformsize', sourceFileListing=null,
sourcePaths=[sftp://mysftp:1qaz_%40WSX@<sftp-host>:/upload/hosts],
targetPath=/tmp/JOY, targetPathExists=false}
2018-10-10 22:32:02,190 ERROR tools.DistCp: Exception encountered
java.io.IOException: SSH_MSG_DISCONNECT: 2 Too many authentication failures for mysftp
    at org.apache.hadoop.fs.sftp.SFTPFileSystem.connect(SFTPFileSystem.java:143)
    at org.apache.hadoop.fs.sftp.SFTPFileSystem.getFileStatus(SFTPFileSystem.java:371)
    at org.apache.hadoop.fs.Globber.getFileStatus(Globber.java:57)
    at org.apache.hadoop.fs.Globber.glob(Globber.java:252)
    at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:1623)
    at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:77)
    at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
    at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
    at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
    at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
{noformat}
The SFTP username is mysftp and the password is 1qaz_@WSX (note the '@' inside the password).
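For the second command, assuming the '@' in the password was percent-encoded as %40, the following sketch shows how java.net.URI handles that form. The host sftp.example.com is a placeholder. Whether SFTPFileSystem in the affected versions reads the raw or the decoded user info, and whether Path re-encodes the URI on the way, is not established here; the difference between the two accessors is only one thing worth checking when looking into the "Too many authentication failures" error.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class EncodedAtInPassword {
    // Returns {host, raw user-info, decoded user-info} for the given URI string.
    static String[] parts(String uri) throws URISyntaxException {
        URI u = new URI(uri);
        return new String[] { u.getHost(), u.getRawUserInfo(), u.getUserInfo() };
    }

    public static void main(String[] args) throws URISyntaxException {
        // Placeholder host; '@' in the password is written as %40.
        String[] p = parts("sftp://mysftp:1qaz_%40WSX@sftp.example.com:/upload/hosts");
        System.out.println("host        = " + p[0]);
        // getRawUserInfo() keeps the escape sequence as typed.
        System.out.println("rawUserInfo = " + p[1]);
        // getUserInfo() decodes %40 back to '@'.
        System.out.println("userInfo    = " + p[2]);
    }
}
```

With the encoded form the host parses, so the connection attempt proceeds to authentication. A caller that splits the raw user info on ':' would send "1qaz_%40WSX" as the password, while one that uses the decoded form would send "1qaz_@WSX".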