[ https://issues.apache.org/jira/browse/BEAM-5713?focusedWorklogId=154317&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-154317 ]
ASF GitHub Bot logged work on BEAM-5713:
----------------------------------------
Author: ASF GitHub Bot
Created on: 15/Oct/18 14:14
Start Date: 15/Oct/18 14:14
Worklog Time Spent: 10m
Work Description: mxm commented on issue #6654: [BEAM-5713] Make
ImpulseSourceFunction execute in parallel
URL: https://github.com/apache/beam/pull/6654#issuecomment-429869253
I believe the execution graph should look something like this:
```
                                 ParDo
                               /
Impulse -> ParDo -> Reshuffle |- ParDo
                               \
                                 ParDo
```
The first ParDo does the initial source creation and assigns work items, the
following ParDos process these in parallel.
That looks a lot more complicated than just having a source, but with
SplittableDoFn going forward, this will be the new way.
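
To make the shape of that graph concrete, here is a minimal plain-Python sketch of the fan-out. It is a simulation only and does not use the Beam or Flink APIs. The function names (`impulse`, `create_work_items`, `reshuffle`, `process`, `run`), the round-robin redistribution, and the thread pool are all assumptions chosen for illustration, not the implementation in the pull request.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def impulse():
    """Emit exactly one empty element, which kicks off the pipeline."""
    return [b""]


def create_work_items(_seed, num_items):
    """First ParDo (hypothetical): runs once per impulse and emits work items."""
    return [f"split-{i}" for i in range(num_items)]


def reshuffle(items, num_workers, rng):
    """Redistribute items so downstream work is not tied to the single
    upstream task. Round-robin after a shuffle is an illustrative choice."""
    shuffled = list(items)
    rng.shuffle(shuffled)
    buckets = [[] for _ in range(num_workers)]
    for i, item in enumerate(shuffled):
        buckets[i % num_workers].append(item)
    return buckets


def process(bucket):
    """Downstream ParDo (hypothetical): each worker handles its own bucket."""
    return [item.upper() for item in bucket]


def run(num_items=8, num_workers=3, seed=0):
    rng = random.Random(seed)
    # Impulse -> ParDo: a single element fans out into many work items.
    work = [w for elem in impulse() for w in create_work_items(elem, num_items)]
    # Reshuffle: spread the work items over the available workers.
    buckets = reshuffle(work, num_workers, rng)
    # Parallel ParDos: one task per bucket.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        results = list(pool.map(process, buckets))
    return buckets, results


if __name__ == "__main__":
    buckets, results = run()
    for worker_id, out in enumerate(results):
        print(worker_id, out)
```

The point of the sketch is that only the first stage is inherently single-threaded; once the work items have been redistributed, the downstream stages can run at whatever parallelism is available.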
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 154317)
Time Spent: 1h 20m (was: 1h 10m)
> Flink portable runner schedules all tasks of streaming job on same task
> manager
> -------------------------------------------------------------------------------
>
> Key: BEAM-5713
> URL: https://issues.apache.org/jira/browse/BEAM-5713
> Project: Beam
> Issue Type: Bug
> Components: runner-flink
> Affects Versions: 2.8.0
> Reporter: Thomas Weise
> Assignee: Maximilian Michels
> Priority: Major
> Labels: portability, portability-flink
> Attachments: Different SlotSharingGroup.png, With
> RichParallelSourceFunction and parallelism 5.png,
> image-2018-10-11-11-43-50-333.png, image-2018-10-11-16-20-45-221.png
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> The cluster has 9 task managers and 144 task slots total. A simple streaming
> pipeline with parallelism of 8 will get all tasks scheduled on the same task
> manager, causing the host to be fully booked and the remaining cluster idle.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)