[
https://issues.apache.org/jira/browse/CASSANDRA-15164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17200940#comment-17200940
]
Benjamin Lerer edited comment on CASSANDRA-15164 at 9/23/20, 4:36 PM:
----------------------------------------------------------------------
After thinking about that problem overnight, I realized that corruption might
not necessarily be the problem. I had a quick chat with [~slebresne], who
believes that it should be possible to create a partition with more than
2 billion cells.
Consequently, I think that we should change the code to ensure that we do not
overflow if a partition has more than 2 billion cells. For that we can either
increase the default number of buckets (using {{118}} instead of {{114}} will
allow for more than 4 billion cells) or dynamically increase the number of
buckets if needed.
Otherwise, the approach of clearing the overflow seems reasonable to me, as it
will solve the problem of the already corrupted {{Statistics}}.
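As a rough illustration of the bucket arithmetic, here is a minimal sketch. It
assumes that bucket offsets start at 1 and grow by a factor of about 1.2 per
bucket (with a minimum step of 1), which is my understanding of how the
histogram offsets are generated. The class and method names are hypothetical
and this is not the actual Cassandra code.

```java
// Hypothetical sketch, not the Cassandra implementation.
// Assumption: each offset is round(previous * 1.2), bumped by 1 if unchanged.
final class BucketOffsetsSketch {

    /** Returns the upper bounds of {@code size} buckets under the assumed growth rule. */
    static long[] offsets(int size) {
        long[] result = new long[size];
        long last = 1;
        result[0] = last;
        for (int i = 1; i < size; i++) {
            long next = Math.round(last * 1.2);
            if (next == last) {
                next++; // small values would otherwise repeat after rounding
            }
            result[i] = next;
            last = next;
        }
        return result;
    }

    /** Largest value that still fits in a regular bucket, i.e. without overflowing. */
    static long maxTrackable(int size) {
        long[] o = offsets(size);
        return o[o.length - 1];
    }

    public static void main(String[] args) {
        // Compare the two bucket counts discussed above.
        System.out.println("114 buckets: max trackable = " + maxTrackable(114));
        System.out.println("118 buckets: max trackable = " + maxTrackable(118));
    }
}
```

Under that assumption, four extra buckets multiply the largest trackable value
by roughly 1.2^4 (about 2.07), which is where the jump from about 2 billion to
more than 4 billion cells comes from.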
WDYT?
> Overflowed Partition Cell Histograms Can Prevent Compactions from Executing
> ---------------------------------------------------------------------------
>
> Key: CASSANDRA-15164
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15164
> Project: Cassandra
> Issue Type: Bug
> Components: CQL/Interpreter
> Reporter: Ankur Jha
> Assignee: Caleb Rackliffe
> Priority: Urgent
> Labels: compaction, partition
> Fix For: 4.0-beta
>
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
> Hi, we are running a 6-node Cassandra cluster in production with 3 seed
> nodes, but since last night one of our seed nodes has been continuously
> throwing an error like this:
> cassandra.protocol.ServerError: <Error from server: code=0000 [Server error]
> message="java.lang.IllegalStateException: Unable to compute ceiling for max
> when histogram overflowed">
> To keep the cluster up and running, I drained this node.
> Can somebody help me out with this?
>
> Any help or lead would be appreciated.
>
> Note: We are using Cassandra version 3.7.