Munagala V. Ramanath created APEXMALHAR-2274:
------------------------------------------------

             Summary: AbstractFileInputOperator gets killed when there are a large number of files.
                 Key: APEXMALHAR-2274
                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2274
             Project: Apache Apex Malhar
          Issue Type: Bug
            Reporter: Munagala V. Ramanath


When the monitored directory contains a large number of files, the call to 
DirectoryScanner.scan() can take a long time because it calls 
FileSystem.listStatus(), which returns the entire listing at once. Meanwhile, 
the AppMaster deems the operator hung and restarts it, and the restarted 
operator runs into the same problem.

It should use FileSystem.listStatusIterator() to limit the number of files 
processed in a single call.
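
A rough sketch of the batching idea, not the actual Malhar fix: Hadoop's 
FileSystem.listStatusIterator(Path) returns a RemoteIterator<FileStatus>, so a 
scan can stop after a bounded number of entries and resume on the next call. 
To stay runnable without Hadoop on the classpath, the sketch below substitutes 
java.nio's DirectoryStream for that iterator. The names BatchedDirectoryScanner, 
maxFilesPerScan, nextBatch and hasMore are hypothetical and do not exist in 
Malhar.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

/**
 * Hypothetical helper: hands out directory entries in bounded batches so that
 * no single scan call has to walk the whole listing.
 * Assumption: java.nio's lazy DirectoryStream stands in for the
 * RemoteIterator<FileStatus> returned by FileSystem.listStatusIterator().
 */
final class BatchedDirectoryScanner implements AutoCloseable {
    private final DirectoryStream<Path> stream;
    private final Iterator<Path> iterator;
    private final int maxFilesPerScan;

    BatchedDirectoryScanner(Path dir, int maxFilesPerScan) throws IOException {
        if (maxFilesPerScan <= 0) {
            throw new IllegalArgumentException("maxFilesPerScan must be positive");
        }
        this.stream = Files.newDirectoryStream(dir);
        this.iterator = stream.iterator();
        this.maxFilesPerScan = maxFilesPerScan;
    }

    /** Returns up to maxFilesPerScan entries; an empty list means the listing is exhausted. */
    List<Path> nextBatch() {
        List<Path> batch = new ArrayList<>();
        while (batch.size() < maxFilesPerScan && iterator.hasNext()) {
            batch.add(iterator.next());
        }
        return batch;
    }

    /** True while the underlying listing still has unread entries. */
    boolean hasMore() {
        return iterator.hasNext();
    }

    @Override
    public void close() throws IOException {
        stream.close();
    }
}
```

An operator could call nextBatch() once per scan interval and keep the scanner 
open between calls, so each call returns quickly even for very large 
directories.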




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
