[
https://issues.apache.org/jira/browse/HADOOP-16458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16892027#comment-16892027
]
Steve Loughran commented on HADOOP-16458:
-----------------------------------------
{code}
org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: s3a://demo/user/qa/schemaevolution/tests/data/all100k
2019-07-22 14:26:49,833 org.apache.hadoop.mapred.LocatedFileStatusFetcher.getFileStatuses(LocatedFileStatusFetcher.java:155)
2019-07-22 14:26:49,833 org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:247)
2019-07-22 14:26:49,834 org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:325)
2019-07-22 14:26:49,834 org.apache.hadoop.hive.ql.io.HiveInputFormat.addSplitsForGroup(HiveInputFormat.java:524)
2019-07-22 14:26:49,834 org.apache.hadoop.hive.ql.io.HiveInputFormat.getSplits(HiveInputFormat.java:781)
2019-07-22 14:26:49,834 org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator.initialize(HiveSplitGenerator.java:243)
{code}
> LocatedFileStatusFetcher.getFileStatuses failing intermittently with s3
> -----------------------------------------------------------------------
>
> Key: HADOOP-16458
> URL: https://issues.apache.org/jira/browse/HADOOP-16458
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 3.3.0
> Environment: S3 + S3Guard
> Reporter: Steve Loughran
> Assignee: Steve Loughran
> Priority: Major
>
> Intermittent failure of LocatedFileStatusFetcher.getFileStatuses(), which
> uses globStatus to find files.
> I'd say "turn S3Guard on", except it already appears to be on, and the
> dataset being read is over 1h old.
> That makes it harder than I'd like to blame S3 for what sounds like an
> inconsistency.
> We're hampered by the number of debug-level statements in the globber code
> being approximately none; there's no debugging to turn on. All we know is
> that globFiles returns null without any explanation.
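
To illustrate the kind of diagnostics the description says is missing, here is a minimal sketch in plain Java. It is not the Hadoop globber and not a proposed patch: GlobDiagnostics, Outcome, classify and globWithDiagnostics are hypothetical names, and the Function argument merely stands in for a call such as FileSystem.globStatus(Path). The sketch relies on the documented globStatus contract that null means a non-glob path did not exist, while an empty array means a glob pattern matched nothing.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

/** Hypothetical helper: records what a glob call returned instead of failing silently. */
public class GlobDiagnostics {

    /** The three results a caller usually needs to tell apart. */
    public enum Outcome { MISSING_PATH, NO_MATCHES, FOUND }

    /** null = path not found, empty = pattern matched nothing, otherwise matches exist. */
    public static Outcome classify(String[] matches) {
        if (matches == null) {
            return Outcome.MISSING_PATH;
        }
        return matches.length == 0 ? Outcome.NO_MATCHES : Outcome.FOUND;
    }

    /**
     * Runs the supplied glob function (an assumption standing in for the real
     * filesystem call), appends one diagnostic line to the log, and returns
     * the result unchanged so existing callers behave as before.
     */
    public static String[] globWithDiagnostics(String pattern,
                                               Function<String, String[]> globber,
                                               List<String> log) {
        long start = System.nanoTime();
        String[] matches = globber.apply(pattern);
        long elapsedMicros = (System.nanoTime() - start) / 1_000;
        log.add(String.format("glob(%s) -> %s in %d us",
                pattern, classify(matches), elapsedMicros));
        return matches;
    }

    public static void main(String[] args) {
        List<String> log = new ArrayList<>();
        // Simulate the reported failure: the glob call returns null.
        globWithDiagnostics("s3a://demo/user/qa/schemaevolution/tests/data/all100k",
                pattern -> null, log);
        log.forEach(System.out::println);
    }
}
```

In a real deployment the log line would go to a debug-level logger rather than a list; the list is used here only so the sketch can be run and checked without any logging framework.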