[ https://issues.apache.org/jira/browse/BEAM-11934?focusedWorklogId=632500&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-632500 ]
ASF GitHub Bot logged work on BEAM-11934:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 02/Aug/21 17:18
Start Date: 02/Aug/21 17:18
Worklog Time Spent: 10m
Work Description: nehsyc commented on pull request #15256:
URL: https://github.com/apache/beam/pull/15256#issuecomment-891195968
If I understand correctly, the flakiness was caused by running a streaming
pipeline on a bounded source in the word count pipeline. Note that the
implementation of runner-determined sharding in `WriteFiles` differs
for bounded and unbounded sources. So the override was only used by bounded
sources in streaming, and without the override the word count pipeline
unexpectedly picked up the implementation for batch.
Perhaps a better fix would be to modify the override to also check the
boundedness of the input, so that unbounded data can still adopt the proper
runner-determined sharding implementation for streaming.
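
A rough sketch of the suggested check, in plain Java rather than against the
actual Beam or Dataflow classes. Every name here (`ShardingOverrideSketch`,
`Boundedness`, `shouldApplyOverride`) is hypothetical and only illustrates the
idea, assuming the override should apply to bounded input in streaming mode
and be skipped for unbounded input:

```java
// Hypothetical model of the decision only; none of these types are Beam APIs.
class ShardingOverrideSketch {

    // Stand-in for the boundedness of the input collection (assumed name).
    enum Boundedness { BOUNDED, UNBOUNDED }

    // Returns true when the override should replace the default write path.
    // Assumption: the override is needed only for bounded input in a
    // streaming pipeline; unbounded input keeps the streaming implementation
    // of runner-determined sharding.
    static boolean shouldApplyOverride(boolean streaming, Boundedness input) {
        return streaming && input == Boundedness.BOUNDED;
    }

    public static void main(String[] args) {
        // Print the decision for each combination of mode and boundedness.
        for (boolean streaming : new boolean[] {false, true}) {
            for (Boundedness input : Boundedness.values()) {
                System.out.println(
                    "streaming=" + streaming
                        + " input=" + input
                        + " applyOverride=" + shouldApplyOverride(streaming, input));
            }
        }
    }
}
```

In a real fix the boundedness would presumably come from the input
`PCollection` rather than from a parameter; the sketch only captures the
condition itself.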
Issue Time Tracking
-------------------
Worklog Id: (was: 632500)
Time Spent: 7.5h (was: 7h 20m)
> Dataflow GCS streaming file sink uses runner determined sharding
> ----------------------------------------------------------------
>
> Key: BEAM-11934
> URL: https://issues.apache.org/jira/browse/BEAM-11934
> Project: Beam
> Issue Type: Improvement
> Components: io-java-files, io-java-gcp, runner-dataflow
> Reporter: Siyuan Chen
> Priority: P2
> Labels: stale-P2
> Time Spent: 7.5h
> Remaining Estimate: 0h
>
> Integrate file sink with shardable `GroupIntoBatches` (BEAM-10475) to allow
> runner determined dynamic sharding for streaming use cases.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)