Jerome MASSOT created BEAM-7998:
-----------------------------------

             Summary: MatchesFiles or MatchAll seems to return seveval time the 
same element
                 Key: BEAM-7998
                 URL: https://issues.apache.org/jira/browse/BEAM-7998
             Project: Beam
          Issue Type: Bug
          Components: beam-model
    Affects Versions: 2.14.0
         Environment: GCP for storage, DirectRunner and DataflowRunner both 
have the problem. PyCharm on Win10 for IDE and dev environment.
            Reporter: Jerome MASSOT


Hi team,

when I use MatcheFiles using wildcard and files located in a GCP bucket, the 
MatcheFiles transform returns several times (at least 2) the same file.

I have tried to follow the stack, and I can see that the MatchesAll is called 
twice when I run the pipeline on a debug project where a single element is 
present in the bucket.

But I am not good enough to say more than that. Sorry.

Best regards

Jerome



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to