[
https://issues.apache.org/jira/browse/BEAM-12222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17344072#comment-17344072
]
Kenneth Knowles commented on BEAM-12222:
----------------------------------------
The change has been rolled forward. I still need to confirm that it is on the release branch.
> Dataflow side input translation "Unknown producer for value"
> ------------------------------------------------------------
>
> Key: BEAM-12222
> URL: https://issues.apache.org/jira/browse/BEAM-12222
> Project: Beam
> Issue Type: Bug
> Components: runner-dataflow
> Reporter: Kenneth Knowles
> Assignee: Kenneth Knowles
> Priority: P1
> Fix For: 2.30.0
>
> Time Spent: 5h 50m
> Remaining Estimate: 0h
>
> I have identified a seemingly nondeterministic issue in Dataflow translation,
> where pipelines with side inputs are sometimes translated in the wrong order.
> {code}
> java.lang.NullPointerException: Unknown producer for value SimplePCollectionView{tag=Tag<org.apache.beam.sdk.values.PCollectionViews$SimplePCollectionView.<init>:1221#4dca087078898728>} while translating step TfIdf.ComputeTfIdf/Combine.globally(Count)/ProduceDefault
>     at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.base.Preconditions.checkNotNull(Preconditions.java:1227)
> {code}
> Seen on
> https://ci-beam.apache.org/job/beam_PostCommit_Java_Examples_Dataflow_V2_PR/32/testReport/junit/org.apache.beam.examples.complete/TfIdfIT/testE2ETfIdf/
> and also on other changes. I think the change itself is just triggering the
> nondeterministic problem, so there is a lurking problem with side inputs overall.
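
For readers unfamiliar with this failure mode, here is a minimal toy sketch of how an order-dependent translation can fail this way. It is not the Dataflow translator's code: the class `TranslationOrderSketch`, the `Step` type, and the `translate` method are hypothetical names invented for illustration, and the JDK's `Objects.requireNonNull` stands in for the vendored Guava `Preconditions.checkNotNull` seen in the trace. The assumption is simply that a step can only be translated once the producers of all its inputs (including side input views) have been registered.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Objects;

public class TranslationOrderSketch {

    /** Hypothetical pipeline step: a name, the values it reads, and the value it writes. */
    static final class Step {
        final String name;
        final List<String> inputs;
        final String output;

        Step(String name, List<String> inputs, String output) {
            this.name = name;
            this.inputs = inputs;
            this.output = output;
        }
    }

    /**
     * Translates steps in the order given. Each input must already have a
     * registered producer, otherwise a NullPointerException is thrown.
     * Returns the map from value name to producing step name.
     */
    static Map<String, String> translate(List<Step> steps) {
        Map<String, String> producers = new HashMap<>();
        for (Step step : steps) {
            for (String input : step.inputs) {
                Objects.requireNonNull(
                    producers.get(input),
                    "Unknown producer for value " + input
                        + " while translating step " + step.name);
            }
            producers.put(step.output, step.name);
        }
        return producers;
    }

    public static void main(String[] args) {
        // Hypothetical steps: one creates a view, the other reads it as a side input.
        Step createView = new Step("CreateView", Collections.<String>emptyList(), "view");
        Step consumer = new Step("ProduceDefault", Arrays.asList("view"), "result");

        // Producer first: translation succeeds.
        System.out.println(translate(Arrays.asList(createView, consumer)));

        // Consumer first: the lookup for "view" fails.
        try {
            translate(Arrays.asList(consumer, createView));
        } catch (NullPointerException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If the order in which steps are visited is not stable from run to run, the second case would show up only intermittently, which would match the flaky behavior described above. Whether that is the actual cause here is not established by this sketch.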
--
This message was sent by Atlassian Jira
(v8.3.4#803005)