[ 
https://issues.apache.org/jira/browse/BEAM-9268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Niel Markwick updated BEAM-9268:
--------------------------------
    Fix Version/s: 2.20.0

> SpannerIO: Better documentation and warning about creating tables in the 
> pipeline
> ---------------------------------------------------------------------------------
>
>                 Key: BEAM-9268
>                 URL: https://issues.apache.org/jira/browse/BEAM-9268
>             Project: Beam
>          Issue Type: Improvement
>          Components: io-go-gcp
>    Affects Versions: 2.16.0, 2.17.0, 2.18.0, 2.19.0
>            Reporter: Niel Markwick
>            Assignee: Niel Markwick
>            Priority: Major
>              Labels: google-cloud-spanner, perfomance
>             Fix For: 2.20.0
>
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> The javadoc for SpannerIO.Write mentions in passing that the transform needs 
> to know the DB schema for optimal performance. If the schema is created 
> within the pipeline, then there is a race between the schema being created 
> and SpannerIO reading it, leading to a potential performance penalty if 
> SpannerIO does not know about the existence of some tables. 
>  
> Javadoc needs to make this clearer and more explicit, and point the user at 
> the Write.withSchemaReadySignal().
>  
> Pipeline needs to raise (rate limited) warnings if it sees writes being made 
> to tables it does not know about (warnings can refer back to javadocs)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to