[jira] [Commented] (CASSANDRA-13299) Potential OOMs and lock contention in write path streams

ZhaoYang (JIRA) Wed, 27 Sep 2017 22:34:34 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-13299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16183689#comment-16183689
 ]


ZhaoYang commented on CASSANDRA-13299:
--------------------------------------

Thanks for the fix.

bq. Could you also modify complexThrottleWithTombstoneTest to test range 
deletions?

Added.

bq. I think that instead of throwing an AssertionError when the returned 
iterator is not exhausted, we could simply exhaust it
+1

bq. Right now we're verifying the results with all the nodes UP, but it's 
possible that another node responds the query even though one of the 
inconsistent nodes did not stream correctly. I think we should check the 
results on each node individually (with the others down) to ensure they 
streamed data correctly from other nodes.
bq. Add range deletions since that's when the range tombstones special cases 
will be properly exercised.
Added.

> Potential OOMs and lock contention in write path streams
> --------------------------------------------------------
>
>                 Key: CASSANDRA-13299
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-13299
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Materialized Views
>            Reporter: Benjamin Roth
>            Assignee: ZhaoYang
>             Fix For: 4.x
>
>
> I see a potential OOM, when a stream (e.g. repair) goes through the write 
> path as it is with MVs.
> StreamReceiveTask gets a bunch of SSTableReaders. These produce rowiterators 
> and they again produce mutations. So every partition creates a single 
> mutation, which in case of (very) big partitions can result in (very) big 
> mutations. Those are created on heap and stay there until they finished 
> processing.
> I don't think it is necessary to create a single mutation for each partition. 
> Why don't we implement a PartitionUpdateGeneratorIterator that takes a 
> UnfilteredRowIterator and a max size and spits out PartitionUpdates to be 
> used to create and apply mutations?
> The max size should be something like min(reasonable_absolute_max_size, 
> max_mutation_size, commitlog_segment_size / 2). reasonable_absolute_max_size 
> could be like 16M or sth.
> A mutation shouldn't be too large as it also affects MV partition locking. 
> The longer a MV partition is locked during a stream, the higher chances are 
> that WTE's occur during streams.
> I could also imagine that a max number of updates per mutation regardless of 
> size in bytes could make sense to avoid lock contention.
> Love to get feedback and suggestions, incl. naming suggestions.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (CASSANDRA-13299) Potential OOMs and lock contention in write path streams

Reply via email to