[
https://issues.apache.org/jira/browse/HADOOP-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ian Nowland updated HADOOP-5836:
--------------------------------
Attachment: HADOOP-5836-1.patch
* Yes, I have run Jets3tNativeS3FileSystemContractTest. Multiple Times in fact,
including for the newest patch :).
* I have reworked logging, making everything debug, except the following:
+ LOG.info("Opening key '" + key + "' for reading at position '" + pos +
"'");
+ LOG.info("OutputStream for key '" + key + "' writing to tempfile '" +
this.backupFile + "'");
+ LOG.info("OutputStream for key '" + key + "' closed. Now beginning
upload");
+ LOG.info("OutputStream for key '" + key + "' upload complete");
+ LOG.info("Opening '" + f + "' for reading");
The basic idea is I want to always capture in a tasks syslog what S3 files it
is reading from as this is very useful when a subset of tasks fail. Also I
wanted to capture the time spent in actually uploading the file to S3 very
specifically.
* Good catch - must have happened as part of the diff I did ignoring
whitespace. I have now gone through with a fine tooth comb and fixed all
indentation issues I could see.
* Done
> Bug in S3N handling of directory markers using an object with a trailing "/"
> causes jobs to fail
> ------------------------------------------------------------------------------------------------
>
> Key: HADOOP-5836
> URL: https://issues.apache.org/jira/browse/HADOOP-5836
> Project: Hadoop Core
> Issue Type: Bug
> Components: fs/s3
> Affects Versions: 0.18.3
> Reporter: Ian Nowland
> Assignee: Ian Nowland
> Fix For: 0.21.0
>
> Attachments: HADOOP-5836-0.patch, HADOOP-5836-1.patch
>
>
> Some tools which upload to S3 and use a object terminated with a "/" as a
> directory marker, for instance "s3n://mybucket/mydir/". If asked to iterate
> that "directory" via listStatus(), then the current code will return an empty
> file "", which the InputFormatter happily assigns to a split, and which later
> causes a task to fail, and probably the job to fail.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.