[ https://issues.apache.org/jira/browse/BEAM-4049?focusedWorklogId=91362&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-91362 ]
ASF GitHub Bot logged work on BEAM-4049: ---------------------------------------- Author: ASF GitHub Bot Created on: 16/Apr/18 15:16 Start Date: 16/Apr/18 15:16 Worklog Time Spent: 10m Work Description: echauchot commented on issue #5112: [BEAM-4049] Improve CassandraIO write throughput by performing async queries URL: https://github.com/apache/beam/pull/5112#issuecomment-381622622 @adejanovski > I'm not sure why the tests are failing in Jenkins now since I just changed the comment style on CONCURRENT_ASYNC_QUERIES. Maybe it's unrelated to my push ? I don't know if it is the case here, but please know for the future that sometimes the jenkins build hits the misconfigured beam5 machine; so errors might be unrelated to the code sometimes ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 91362) Time Spent: 3h 50m (was: 3h 40m) > Improve write throughput of CassandraIO > --------------------------------------- > > Key: BEAM-4049 > URL: https://issues.apache.org/jira/browse/BEAM-4049 > Project: Beam > Issue Type: Improvement > Components: io-java-cassandra > Affects Versions: 2.4.0 > Reporter: Alexander Dejanovski > Assignee: Alexander Dejanovski > Priority: Major > Labels: performance > Time Spent: 3h 50m > Remaining Estimate: 0h > > The CassandraIO currently uses the mapper to perform writes in a synchronous > fashion. > This implies that writes are serialized and is a very suboptimal way of > writing to Cassandra. > The IO should use the saveAsync() method instead of save() and should wait > for completion each time 100 queries are in flight, in order to avoid > overwhelming clusters. -- This message was sent by Atlassian JIRA (v7.6.3#76005)