[
https://issues.apache.org/jira/browse/BEAM-3506?focusedWorklogId=109804&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-109804
]
ASF GitHub Bot logged work on BEAM-3506:
----------------------------------------
Author: ASF GitHub Bot
Created on: 07/Jun/18 17:30
Start Date: 07/Jun/18 17:30
Worklog Time Spent: 10m
Work Description: jkff commented on issue #4457: [BEAM-3506] - Add a
feature in JdbcIO that allows writing PCollection<Iterable<T>>
URL: https://github.com/apache/beam/pull/4457#issuecomment-395502296
Apologies, I forgot about this PR. But yeah, we have a workaround and this
issue didn't come up again, so I'm going to close. Thanks!
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 109804)
Time Spent: 20m (was: 10m)
Remaining Estimate: 3h 40m (was: 3h 50m)
> JdbcIO: Support writing iterables (i.e. collections) of rows instead of only
> single rows
> ----------------------------------------------------------------------------------------
>
> Key: BEAM-3506
> URL: https://issues.apache.org/jira/browse/BEAM-3506
> Project: Beam
> Issue Type: Improvement
> Components: z-do-not-use-sdk-java-extensions
> Affects Versions: 2.3.0
> Reporter: Knut Olav Loite
> Assignee: Jean-Baptiste Onofré
> Priority: Minor
> Labels: JdbcIO, jdbc
> Fix For: Not applicable
>
> Original Estimate: 4h
> Time Spent: 20m
> Remaining Estimate: 3h 40m
>
> The current JdbcIO write interface expects a PCollection<T> where T is the
> row to be written. Each instance of T is then added to a batch and written to
> the database. The user has little control over how many rows will be added to
> one batch. If JdbcIO would also support writing a PCollection<Iterable<T>>
> the user would have more control over the number of rows in one batch.
> Especially when writing to cloud databases, such as Google Cloud Spanner, the
> batching of multiple rows together is important for performance.
> I already have a solution locally and I will submit a pull request.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)