[ 
https://issues.apache.org/jira/browse/HADOOP-9984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13785701#comment-13785701
 ] 

Colin Patrick McCabe commented on HADOOP-9984:
----------------------------------------------

Short summary of the webex:

* We discussed alternatives to HADOOP-9984, but concluded that they weren't 
workable.
* We agreed that doing the symlink resolution in each Filesystem subclass is 
what we ought to do in 9984, in order to keep compatibility with out-of-tree 
filesystems.
* We agreed to disable symlink resolution in Hadoop 2 GA.  We will spend a few 
weeks ironing out all the bugs and enable it in Hadoop 2.3.  However, we would 
like to make all backwards-incompatible API changes prior to Hadoop 2 GA.
* We agreed that HADOOP-9972 (new symlink-aware API for globStatus) should get 
into Hadoop 2 GA.
* We discussed the issue of returning resolved paths versus unresolved paths, 
but were unable to come to any conclusion.  Everyone agreed that there would be 
serious performance problems if we returned unresolved paths, but some claimed 
that programs would break when encountering resolved paths.

> FileSystem#globStatus and FileSystem#listStatus should resolve symlinks by 
> default
> ----------------------------------------------------------------------------------
>
>                 Key: HADOOP-9984
>                 URL: https://issues.apache.org/jira/browse/HADOOP-9984
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs
>    Affects Versions: 2.1.0-beta
>            Reporter: Colin Patrick McCabe
>            Assignee: Colin Patrick McCabe
>            Priority: Blocker
>         Attachments: HADOOP-9984.001.patch, HADOOP-9984.003.patch, 
> HADOOP-9984.005.patch, HADOOP-9984.007.patch, HADOOP-9984.009.patch, 
> HADOOP-9984.010.patch, HADOOP-9984.011.patch, HADOOP-9984.012.patch, 
> HADOOP-9984.013.patch, HADOOP-9984.014.patch
>
>
> During the process of adding symlink support to FileSystem, we realized that 
> many existing HDFS clients would be broken by listStatus and globStatus 
> returning symlinks.  One example is applications that assume that 
> !FileStatus#isFile implies that the inode is a directory.  As we discussed in 
> HADOOP-9972 and HADOOP-9912, we should default these APIs to returning 
> resolved paths.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to