[ 
https://issues.apache.org/jira/browse/HADOOP-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12579899#action_12579899
 ] 

Alejandro Abdelnur commented on HADOOP-3041:
--------------------------------------------

If there is a method that returns the original path, that is OK. 

But, following the rule of least surprise, wouldn't it make more sense to have a 
getTaskOutputPath() that returns the path to the part file for the current task 
and leave getOutputPath() returning the user-entered value?

Also, in that case the javadoc should not say 'Get the Path to the output 
directory for the map-reduce job' in its one-line description.
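
As a rough sketch of the split suggested above (not the actual JobConf code): 
the class OutputPathSketch and its fields are hypothetical, and java.nio.file.Path 
stands in for Hadoop's own Path type. The point is only that the user-entered 
value stays untouched while the task-specific path gets its own accessor.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical stand-in for a job configuration; this is NOT the real JobConf.
public class OutputPathSketch {
    // Value entered by the user; never rewritten by the framework.
    private final Path outputPath;
    // Part file name assigned to the current task, e.g. "part_00000" (assumed).
    private final String taskPartName;

    public OutputPathSketch(Path outputPath, String taskPartName) {
        this.outputPath = outputPath;
        this.taskPartName = taskPartName;
    }

    /** Get the output directory exactly as the user configured it. */
    public Path getOutputPath() {
        return outputPath;
    }

    /** Get the path to the part file for the current task. */
    public Path getTaskOutputPath() {
        return outputPath.resolve(taskPartName);
    }

    public static void main(String[] args) {
        OutputPathSketch conf =
            new OutputPathSketch(Paths.get("/user/foo/myoutput"), "part_00000");
        System.out.println(conf.getOutputPath());
        System.out.println(conf.getTaskOutputPath());
    }
}
```

With this shape, the one-line javadoc of getOutputPath() remains accurate both 
on the client and inside a task.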




> Within a task, the value of JobConf.getOutputPath() method is modified
> ----------------------------------------------------------------------
>
>                 Key: HADOOP-3041
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3041
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.16.1
>         Environment: all
>            Reporter: Alejandro Abdelnur
>            Priority: Blocker
>             Fix For: 0.16.2
>
>
> Until 0.16.0, the value returned by the getOutputPath() method, if queried 
> within a task, pointed to the part file assigned to the task. 
> For example: /user/foo/myoutput/part_00000
> In 0.16.1, it now returns an internal Hadoop path for the task's temporary 
> output location.
> For the above example: /user/foo/myoutput/_temporary/part_00000
> This change breaks applications that use getOutputPath() to compute other 
> directories.
> IMO, this has always been broken: Hadoop should not change the values of 
> properties injected by the client; instead it should use private properties 
> or internal helper methods. 
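
To illustrate the breakage described in the report, here is a small sketch of an 
application that derives another directory from the task's output path. The 
class DerivedPathSketch, the helper sideDir() and the "side-data" directory name 
are hypothetical; only the two example paths come from the report, and 
java.nio.file.Path is used in place of Hadoop's Path type.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical application code; names here are illustrative assumptions.
public class DerivedPathSketch {

    // Derive a directory next to the task's part file by going up one level.
    public static Path sideDir(Path taskOutput) {
        return taskOutput.getParent().resolve("side-data");
    }

    public static void main(String[] args) {
        // Value seen within a task up to 0.16.0 (from the report).
        Path before = Paths.get("/user/foo/myoutput/part_00000");
        // Value seen within a task in 0.16.1 (from the report).
        Path after = Paths.get("/user/foo/myoutput/_temporary/part_00000");

        // Same application logic, but the second result now lands under
        // the internal _temporary directory instead of the job output dir.
        System.out.println(sideDir(before));
        System.out.println(sideDir(after));
    }
}
```

The application logic is unchanged between the two calls; only the value handed 
back by the framework differs, which is why the derived directory moves.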

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
