Jason Dere created HADOOP-10340:
-----------------------------------

             Summary: FileInputFormat.listStatus() including directories in its 
results
                 Key: HADOOP-10340
                 URL: https://issues.apache.org/jira/browse/HADOOP-10340
             Project: Hadoop Common
          Issue Type: Bug
            Reporter: Jason Dere


Trying to track down HIVE-6401, where we see some "is not a file" errors 
because getSplits() is giving us directories.  I believe the culprit is 
FileInputFormat.listStatus():

{code}
                if (recursive && stat.isDirectory()) {
                  addInputPathRecursively(result, fs, stat.getPath(),
                      inputFilter);
                } else {
                  result.add(stat);
                }
{code}

Which seems to be allowing directories to be added to the results if recursive 
is false.  Is this meant to return directories? If not, I think it should look 
like this:

{code}
                if (stat.isDirectory()) {
                 if (recursive) {
                  addInputPathRecursively(result, fs, stat.getPath(),
                      inputFilter);
                 }
                } else {
                  result.add(stat);
                }
{code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to