Re: [DISCUSS] changing default token behavior for 4.0

Dinesh Joshi Fri, 21 Sep 2018 21:30:50 -0700

Jon, thanks for starting this thread!

I have created CASSANDRA-14784 to track this.


Dinesh

> On Sep 21, 2018, at 9:18 PM, Sankalp Kohli <kohlisank...@gmail.com> wrote:
> 
> Putting it on JIRA is to make sure someone is assigned to it and it is 
> tracked. Changes should be discussed over ML like you are saying. 
> 
> On Sep 21, 2018, at 21:02, Jonathan Haddad <j...@jonhaddad.com> wrote:
> 
>>> We should create a JIRA to find what other defaults we need revisit.
>> 
>> Changing a default is a pretty big deal, I think we should discuss any
>> changes to defaults here on the ML before moving it into JIRA.  It's nice
>> to get a bit more discussion around the change than what happens in JIRA.
>> 
>> We (TLP) did some testing on 4 tokens and found it to work surprisingly
>> well.   It wasn't particularly formal, but we verified the load stays
>> pretty even with only 4 tokens as we added nodes to the cluster.  Higher
>> token count hurts availability by increasing the number of nodes any given
>> node is a neighbor with, meaning any 2 nodes that fail have an increased
>> chance of downtime when using QUORUM.  In addition, with the recent
>> streaming optimization it seems the token counts will give a greater chance
>> of a node streaming entire sstables (with LCS), meaning we'll do a better
>> job with node density out of the box.
>> 
>> Next week I can try to put together something a little more convincing.
>> Weekend time.
>> 
>> Jon
>> 
>> 
>> On Fri, Sep 21, 2018 at 8:45 PM sankalp kohli <kohlisank...@gmail.com>
>> wrote:
>> 
>>> +1 to lowering it.
>>> Thanks Jon for starting this.We should create a JIRA to find what other
>>> defaults we need revisit. (Please keep this discussion for "default token"
>>> only.  )
>>> 
>>>> On Fri, Sep 21, 2018 at 8:26 PM Jeff Jirsa <jji...@gmail.com> wrote:
>>>> 
>>>> Also agree it should be lowered, but definitely not to 1, and probably
>>>> something closer to 32 than 4.
>>>> 
>>>> --
>>>> Jeff Jirsa
>>>> 
>>>> 
>>>>> On Sep 21, 2018, at 8:24 PM, Jeremy Hanna <jeremy.hanna1...@gmail.com>
>>>> wrote:
>>>>> 
>>>>> I agree that it should be lowered. What I’ve seen debated a bit in the
>>>> past is the number but I don’t think anyone thinks that it should remain
>>>> 256.
>>>>> 
>>>>>> On Sep 21, 2018, at 7:05 PM, Jonathan Haddad <j...@jonhaddad.com>
>>> wrote:
>>>>>> 
>>>>>> One thing that's really, really bothered me for a while is how we
>>>> default
>>>>>> to 256 tokens still.  There's no experienced operator that leaves it
>>> as
>>>> is
>>>>>> at this point, meaning the only people using 256 are the poor folks
>>> that
>>>>>> just got started using C*.  I've worked with over a hundred clusters
>>> in
>>>> the
>>>>>> last couple years, and I think I only worked with one that had lowered
>>>> it
>>>>>> to something else.
>>>>>> 
>>>>>> I think it's time we changed the default to 4 (or 8, up for debate).
>>>>>> 
>>>>>> To improve the behavior, we need to change a couple other things.  The
>>>>>> allocate_tokens_for_keyspace setting is... odd.  It requires you have
>>> a
>>>>>> keyspace already created, which doesn't help on new clusters.  What
>>> I'd
>>>>>> like to do is add a new setting, allocate_tokens_for_rf, and set it to
>>>> 3 by
>>>>>> default.
>>>>>> 
>>>>>> To handle clusters that are already using 256 tokens, we could prevent
>>>> the
>>>>>> new node from joining unless a -D flag is set to explicitly allow
>>>>>> imbalanced tokens.
>>>>>> 
>>>>>> We've agreed to a trunk freeze, but I feel like this is important
>>> enough
>>>>>> (and pretty trivial) to do now.  I'd also personally characterize this
>>>> as a
>>>>>> bug fix since 256 is horribly broken when the cluster gets to any
>>>>>> reasonable size, but maybe I'm alone there.
>>>>>> 
>>>>>> I honestly can't think of a use case where random tokens is a good
>>>> choice
>>>>>> anymore, so I'd be fine / ecstatic with removing it completely and
>>>>>> requiring either allocate_tokens_for_keyspace (for existing clusters)
>>>>>> or allocate_tokens_for_rf
>>>>>> to be set.
>>>>>> 
>>>>>> Thoughts?  Objections?
>>>>>> --
>>>>>> Jon Haddad
>>>>>> http://www.rustyrazorblade.com
>>>>>> twitter: rustyrazorblade
>>>>> 
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
>>>>> For additional commands, e-mail: dev-h...@cassandra.apache.org
>>>>> 
>>>> 
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
>>>> For additional commands, e-mail: dev-h...@cassandra.apache.org
>>>> 
>>>> 
>>> 
>> 
>> 
>> -- 
>> Jon Haddad
>> http://www.rustyrazorblade.com
>> twitter: rustyrazorblade
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: dev-h...@cassandra.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org

Re: [DISCUSS] changing default token behavior for 4.0

Reply via email to