[
https://issues.apache.org/jira/browse/BEAM-11403?focusedWorklogId=527347&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-527347
]
ASF GitHub Bot logged work on BEAM-11403:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 22/Dec/20 20:14
Start Date: 22/Dec/20 20:14
Worklog Time Spent: 10m
Work Description: je-ik commented on pull request #13592:
URL: https://github.com/apache/beam/pull/13592#issuecomment-749754986
> Hm, maybe Dataflow is not using SplittableDoFnViaKeyedWorkItems and has
some specific implementation?
Ah, got that. Dataflow uses cache for state access. So writing to state and
reading it back can return the same identical object. But that is runner
specific. Flink always serializes access to state.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 527347)
Time Spent: 2h 20m (was: 2h 10m)
> Unbounded SDF wrapper causes performance regression on DirectRunner
> -------------------------------------------------------------------
>
> Key: BEAM-11403
> URL: https://issues.apache.org/jira/browse/BEAM-11403
> Project: Beam
> Issue Type: Bug
> Components: runner-direct, sdk-java-core
> Reporter: Boyuan Zhang
> Assignee: Boyuan Zhang
> Priority: P2
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
> There is a significant performance regression when switching from
> UnboundedSource to Unbounded SDF wrapper. So far there are 2 IOs reported:
> * Pubsub Read:
> https://lists.apache.org/thread.html/re6b0941a8b4951293a0327ce9b25e607cafd6e45b69783f65290edee%40%3Cdev.beam.apache.org%3E
> * Kafka Read: https://the-asf.slack.com/archives/C9H0YNP3P/p1606155042346600
--
This message was sent by Atlassian Jira
(v8.3.4#803005)