Jerome MASSOT created BEAM-7998:
-----------------------------------
Summary: MatchFiles or MatchAll seems to return the same element
several times
Key: BEAM-7998
URL: https://issues.apache.org/jira/browse/BEAM-7998
Project: Beam
Issue Type: Bug
Components: beam-model
Affects Versions: 2.14.0
Environment: GCP for storage, DirectRunner and DataflowRunner both
have the problem. PyCharm on Win10 for IDE and dev environment.
Reporter: Jerome MASSOT
Hi team,
when I use MatchFiles with a wildcard pattern on files located in a GCP bucket,
the transform returns the same file several times (at least twice).
I have tried to follow the stack, and I can see that MatchAll is called
twice when I run the pipeline on a debug project where the bucket contains
a single element.
I do not know the code well enough to say more than that. Sorry.
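For reference, here is a minimal sketch of how the duplicates could be confirmed. It assumes the Python SDK (`apache_beam.io.fileio.MatchFiles`) and the default DirectRunner. The helper names `find_duplicates` and `report_duplicate_matches` are hypothetical and only for illustration; this is not the exact pipeline I ran.

```python
from collections import Counter


def find_duplicates(paths):
    """Return {path: count} for each path that occurs more than once."""
    return {path: n for path, n in Counter(paths).items() if n > 1}


def report_duplicate_matches(pattern):
    """Match `pattern` with MatchFiles and print any path emitted more than once."""
    # Imported lazily so find_duplicates can be used without Beam installed.
    import apache_beam as beam
    from apache_beam.io import fileio

    with beam.Pipeline() as pipeline:  # uses the DirectRunner by default
        (
            pipeline
            | "Match" >> fileio.MatchFiles(pattern)
            # MatchFiles emits FileMetadata objects; keep only the path.
            | "ToPath" >> beam.Map(lambda metadata: metadata.path)
            # Gather every matched path into one list (fine for a small debug bucket).
            | "Collect" >> beam.combiners.ToList()
            | "FindDuplicates" >> beam.Map(find_duplicates)
            # An empty dict means no path was returned twice.
            | "Print" >> beam.Map(print)
        )
```

Calling `report_duplicate_matches("gs://my-debug-bucket/*")` (the bucket name is a placeholder) should print an empty dict if every file is matched once, and a dict of path-to-count entries if the problem described above occurs.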
Best regards
Jerome