[
https://issues.apache.org/jira/browse/APEXMALHAR-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15965057#comment-15965057
]
Munagala V. Ramanath commented on APEXMALHAR-2250:
--------------------------------------------------
It would be good to merge https://github.com/apache/apex-malhar/pull/597 before
starting on this.
A workaround for this issue in particular situations might be to find a regex
that would not be
matched by any directories, if possible.
> AbstractFileInputOperator.DirectoryScanner does not handle directories
> correctly.
> ---------------------------------------------------------------------------------
>
> Key: APEXMALHAR-2250
> URL: https://issues.apache.org/jira/browse/APEXMALHAR-2250
> Project: Apache Apex Malhar
> Issue Type: Bug
> Reporter: Tushar Gosavi
>
> The default DirectoryScanner defined in AbstractFileInputOperator does not
> handle directories correctly. If there is a directory in the configured path,
> it gets added as a file in pendingFile list and when operator tries to open
> it for reading
> it fails, the operator keeps retrying for configured number of time and then
> ignore this file.
> The fix would be to not return directory name in scanned file names in the
> first place.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)