[ https://issues.apache.org/jira/browse/BEAM-3506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16334773#comment-16334773 ]
Eugene Kirpichov commented on BEAM-3506: ---------------------------------------- Knut - any reason why you're not using SpannerIO for writing to Spanner? > JdbcIO: Support writing iterables (i.e. collections) of rows instead of only > single rows > ---------------------------------------------------------------------------------------- > > Key: BEAM-3506 > URL: https://issues.apache.org/jira/browse/BEAM-3506 > Project: Beam > Issue Type: Improvement > Components: sdk-java-extensions > Affects Versions: 2.3.0 > Reporter: Knut Olav Loite > Assignee: Jean-Baptiste Onofré > Priority: Minor > Labels: JdbcIO, jdbc > Original Estimate: 4h > Remaining Estimate: 4h > > The current JdbcIO write interface expects a PCollection<T> where T is the > row to be written. Each instance of T is then added to a batch and written to > the database. The user has little control over how many rows will be added to > one batch. If JdbcIO would also support writing a PCollection<Iterable<T>> > the user would have more control over the number of rows in one batch. > Especially when writing to cloud databases, such as Google Cloud Spanner, the > batching of multiple rows together is important for performance. > I already have a solution locally and I will submit a pull request. -- This message was sent by Atlassian JIRA (v7.6.3#76005)