[ 
https://issues.apache.org/jira/browse/HADOOP-3246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12593148#action_12593148
 ] 

Chris Douglas commented on HADOOP-3246:
---------------------------------------

A few points:
* +1 for this being a FileSystem
* Changing javac.version in build.xml from 1.5 to 1.6 should not be part of 
this patch
* ftp.server.host, ftp.server.username, and ftp.server.password should be 
{{o.a.h.conf.Configuration}} properties, not system properties. Conditionally 
running the unit tests based on a switch in build.xml is also not consistent 
with the existing unit tests; all the changes to build.xml should probably be 
reverted.
* ftpClient.sh doesn't seem necessary; it certainly won't be if this is a 
FileSystem
* It looks like some testing code accidentally made it into the patch, in 
FtpShell:
{noformat}
+    conf.set("fs.default.name", "hdfs://agoel-pc:9000");
+    conf.set("fs.hdfs.impl", "org.apache.hadoop.dfs.DistributedFileSystem");
+    conf.set("hadoop.tmp.dir", "/tmp/hadoop-aankurgoel");
{noformat}
* Process trivia: CHANGES.txt is added by the committer when the patch goes in. 
The "Release Notes" field in JIRA is filled out by the contributor
* FtpClient::EraserThread is a heavyweight way to hide the password. 
Unfortunately, java.io.Console::readPassword is only in Java 1.6, and it 
disables echo via a (platform-dependent) native call, so it's not clear how one 
would do this in Java 1.5. Leaving this as a configuration property is probably 
sufficient until we can move to Java 1.6; as a FileSystem, it needs to be a 
Configuration property, anyway.
* If you wanted to keep some of the code for an FTP client separate from 
existing tools, take a look at {{o.a.h.util.ToolBase}} to pick up some of the 
generic option parsing common to most Hadoop utilities
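
As a rough illustration of the point about ftp.server.host, ftp.server.username, and ftp.server.password, here is a minimal sketch of reading them from a configuration object rather than from System.getProperty. To keep it runnable without Hadoop on the classpath, a plain Map stands in for {{o.a.h.conf.Configuration}}; in the patch the lookups would presumably go through Configuration.get(name, defaultValue). The class name FtpSettings and the default values are hypothetical, not part of the patch.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Sketch only: holds the FTP connection settings named in the review.
 * A Map stands in for o.a.h.conf.Configuration (assumption made so the
 * example is self-contained).
 */
public class FtpSettings {
    // Property names taken from the review comment.
    public static final String HOST_KEY = "ftp.server.host";
    public static final String USER_KEY = "ftp.server.username";
    public static final String PASS_KEY = "ftp.server.password";

    public final String host;
    public final String username;
    public final String password;

    private FtpSettings(String host, String username, String password) {
        this.host = host;
        this.username = username;
        this.password = password;
    }

    // Same shape as a get(name, defaultValue) lookup on a configuration object.
    private static String get(Map<String, String> conf, String name, String defaultValue) {
        String value = conf.get(name);
        return value != null ? value : defaultValue;
    }

    /** Builds the settings from configuration entries instead of system properties. */
    public static FtpSettings fromConf(Map<String, String> conf) {
        return new FtpSettings(
            get(conf, HOST_KEY, "localhost"),  // hypothetical default
            get(conf, USER_KEY, "anonymous"),  // hypothetical default
            get(conf, PASS_KEY, ""));          // hypothetical default
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<String, String>();
        conf.put(HOST_KEY, "ftp.example.org");
        FtpSettings settings = fromConf(conf);
        System.out.println(settings.username + "@" + settings.host);
    }
}
```

Because the values come from the configuration passed in, a unit test can supply them directly and no switch in build.xml is needed.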

> FTP client over HDFS
> --------------------
>
>                 Key: HADOOP-3246
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3246
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: util
>    Affects Versions: 0.16.3
>            Reporter: Ankur
>            Priority: Minor
>         Attachments: ftpClient.patch
>
>
> An FTP client that stores content directly into HDFS allows data from FTP 
> servers to be stored directly into HDFS instead of first copying the data 
> locally and then uploading it into HDFS. The benefits are apparent from an 
> administrative perspective as large datasets can be pulled from FTP servers 
> with minimal human intervention.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
