[ 
https://issues.apache.org/jira/browse/BEAM-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16724274#comment-16724274
 ] 

Scott Wegner commented on BEAM-6261:
------------------------------------

I spoke with [[email protected]] who mentioned this is a bug in the Dataflow 
service; the backend is not properly expiring cached side input values.

A fix is in progress and when it rolls out will apply to all SDK versions. 

> Dataflow runner does not refresh updated side inputs
> ----------------------------------------------------
>
>                 Key: BEAM-6261
>                 URL: https://issues.apache.org/jira/browse/BEAM-6261
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-dataflow
>            Reporter: Scott Wegner
>            Assignee: Daniel Mills
>            Priority: Major
>
> See [this user@ 
> thread|https://lists.apache.org/thread.html/5eed0fc3beeb9f1c1fe4a623cbcad41cb15d0d80490cafb1f27e4577@%3Cuser.beam.apache.org%3E].
> The [Slowly-changing lookup 
> cache|https://cloud.google.com/blog/products/gcp/guide-to-common-cloud-dataflow-use-case-patterns-part-1]
>  pattern described on the GCP blog uses a side input to feed lookup data to 
> join with the main data input. However, the Dataflow runner doesn't update 
> side inputs.
> Example pipeline here: https://github.com/lbordwell/sideinput



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to