[
https://issues.apache.org/jira/browse/BEAM-7998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121927#comment-17121927
]
Kenneth Knowles commented on BEAM-7998:
---------------------------------------
This issue is assigned but has not received an update in 30 days so it has been
labeled "stale-assigned". If you are still working on the issue, please give an
update and remove the label. If you are no longer working on the issue, please
unassign so someone else may work on it. In 7 days the issue will be
automatically unassigned.
> MatchesFiles or MatchAll seems to return several times the same element
> -----------------------------------------------------------------------
>
> Key: BEAM-7998
> URL: https://issues.apache.org/jira/browse/BEAM-7998
> Project: Beam
> Issue Type: Bug
> Components: io-py-files
> Affects Versions: 2.14.0
> Environment: GCP for storage, DirectRunner and DataflowRunner both
> have the problem. PyCharm on Win10 for IDE and dev environment.
> Reporter: Jerome MASSOT
> Assignee: Pablo Estrada
> Priority: P2
> Labels: ccoss2019, stale-assigned
>
> Hi team,
> when I use MatcheFiles using wildcard and files located in a GCP bucket, the
> MatcheFiles transform returns several times (at least 2) the same file.
> I have tried to follow the stack, and I can see that the MatchesAll is called
> twice when I run the pipeline on a debug project where a single element is
> present in the bucket.
> But I am not good enough to say more than that. Sorry.
> Best regards
> Jerome
--
This message was sent by Atlassian Jira
(v8.3.4#803005)