Removal of default port# in NameNode.getUri() cause a map/reduce job failed to 
prompt temporay output
-----------------------------------------------------------------------------------------------------

                 Key: HADOOP-4717
                 URL: https://issues.apache.org/jira/browse/HADOOP-4717
             Project: Hadoop Core
          Issue Type: Bug
          Components: dfs
    Affects Versions: 0.18.0
            Reporter: Hairong Kuang
             Fix For: 0.18.3


Problem reported here is that when the default port number (8020) is specified 
in the output, job succeeds but no output is created. The cause of the problem 
is that "listStatus" call drops the port number because NameNode.getUri removes 
the default port#.

Assuming that a map/reduce output directory is set to be 
"hdfs://localhost:8020/out", A call "listStatus" on any of its sub directory, 
for example, "hdfs://localhost:8020/out/tempXX", returns results like below: 

hdfs://localhost/out/tempXX/part-00005

Because of this, Task.java
    574   private Path getFinalPath(Path jobOutputDir, Path taskOutput) {
    575     URI relativePath = 
taskOutputPath.toUri().relativize(taskOutput.toUri());

does not get the correct relativePath because TaskOutputPath contain ports, but 
taskOutput doesn't.

It seems to me that the problem could be fixed if we make Path.makeQualified() 
to return the same path not matter the input path contains the default port or 
not. 


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to