[ 
https://issues.apache.org/jira/browse/BEAM-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132472#comment-17132472
 ] 

Beam JIRA Bot commented on BEAM-1309:
-------------------------------------

This issue is assigned but has not received an update in 30 days so it has been 
labeled "stale-assigned". If you are still working on the issue, please give an 
update and remove the label. If you are no longer working on the issue, please 
unassign so someone else may work on it. In 7 days the issue will be 
automatically unassigned.

> FileIOChannelFactory.match() traverses entire parent directory recursively
> --------------------------------------------------------------------------
>
>                 Key: BEAM-1309
>                 URL: https://issues.apache.org/jira/browse/BEAM-1309
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-core
>            Reporter: Eugene Kirpichov
>            Assignee: Pei He
>            Priority: P2
>              Labels: stale-assigned
>
> I was running a pipeline that reads a single file from my local home 
> directory.
> The pipeline got stuck, and upon taking a stack snapshot, I noticed that it 
> was stuck in FileIOChannelFactory.match().
> The code currently works by traversing the whole parent directory of the 
> requested filepattern and checking which files match the filepattern. In my 
> case, that means traversing everything in my home directory, which is *a lot* 
> (and includes remotely mounted directories).
> https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/util/FileIOChannelFactory.java#L109
> This is very wasteful and should be fixed.
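
For illustration only, here is a minimal sketch of the cheaper behaviour the report asks for. It assumes the wildcards sit only in the last path segment of the filepattern. ShallowMatch and its match() method are hypothetical names, not Beam code and not the fix that was eventually applied. The sketch lists only the direct children of the parent directory and filters them by glob, so nothing below that directory is visited.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of the Beam SDK.
final class ShallowMatch {
    private ShallowMatch() {}

    /**
     * Returns the regular files directly inside parentDir whose file name
     * matches fileNameGlob (for example "*.txt").
     * Assumption: the glob applies to the last path segment only.
     */
    static List<Path> match(Path parentDir, String fileNameGlob) throws IOException {
        List<Path> matches = new ArrayList<>();
        // newDirectoryStream iterates over direct children only; it does not
        // descend into subdirectories, so large or remote subtrees are skipped.
        try (DirectoryStream<Path> children =
                Files.newDirectoryStream(parentDir, fileNameGlob)) {
            for (Path child : children) {
                if (Files.isRegularFile(child)) {
                    matches.add(child);
                }
            }
        }
        return matches;
    }
}
```

With this approach the cost is proportional to the number of entries directly in the parent directory rather than to the size of the whole tree beneath it.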



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
