[
https://issues.apache.org/jira/browse/FLINK-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14244442#comment-14244442
]
ASF GitHub Bot commented on FLINK-1307:
---------------------------------------
Github user fhueske commented on the pull request:
https://github.com/apache/incubator-flink/pull/260#issuecomment-66800177
Ah, OK. I got it :-)
I think we need only one method to recursively enumerate files (and compute
their length).
Not doing the `acceptFile()` check in the `getFileStats()` call is a bug,
that you could fix with this PR. Otherwise, files which will not be read are
included in the size estimates for the optimizer.
> Allow input from nested directory structure
> -------------------------------------------
>
> Key: FLINK-1307
> URL: https://issues.apache.org/jira/browse/FLINK-1307
> Project: Flink
> Issue Type: New Feature
> Reporter: Vasia Kalavri
> Assignee: Vasia Kalavri
> Priority: Minor
> Labels: easyfix, features
>
> Currently it is not possible to read input from nested directories. When
> given a directory as input path, only the top-level files are considered.
> This issue proposes to add support for recursively enumerating nested files
> in FileInputFormat.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)