Yes. please do. We should also update our JVM defaults.

On Thu, Jan 23, 2020, 9:28 PM Jeremy Hanna <jeremy.hanna1...@gmail.com>
wrote:

> To summarize this thread, I think people are generally okay with updating
> certain defaults for 4.0 provided we make sure it doesn't unpleasantly
> surprise cluster operators.  I think with the num_tokens and
> compaction_throughput_in_mb we could go with a release note for the reasons
> in my last email.  I also agree that we should consider bump
> roles_validity_in_ms, permissions_validity_in_ms, and
>    credentials_validity_in_ms along with the default snitch (going to GPFS
> as the default) as that gives people a DC aware default at least to start.
>
> Is everyone okay if I create tickets for each of these and link them with
> an epic so that we can discuss them separately?
>
> Thanks,
>
> Jeremy
>
> On Thu, Jan 23, 2020 at 5:34 AM Alex Ott <alex...@gmail.com> wrote:
>
> > In addition to these, maybe we could consider to change other as well?
> > Like:
> >
> > 1. bump roles_validity_in_ms, permissions_validity_in_ms, and
> >    credentials_validity_in_ms as well - maybe at least to a minute, or
> 2. I
> >    have seen multiple times when authentication was failing under the
> heavy
> >    load because queries to system tables were timing out - with these
> >    defaults people may still have the possibility to get updates to
> >    roles/credentials faster when specifying _update_interval_ variants of
> >    these configurations.
> > 2. change default snitch from SimpleSnitch to
> GossipingPropertyFileSnitch -
> >    we're anyway saying that SimpleSnitch is only appropriate for
> >    single-datacenter deployments, and for real production we need to use
> >    GossipingPropertyFileSnitch - why not to set it as default?
> >
> >
> > Jeremy Hanna  at "Wed, 22 Jan 2020 11:22:36 +1100" wrote:
> >  JH> I mentioned this in the contributor meeting as a topic to bring up
> on
> > the list - should we
> >  JH> take the opportunity to update defaults for Cassandra 4.0?
> >
> >  JH> The rationale is two-fold:
> >  JH> 1) There are best practices and tribal knowledge around certain
> > properties where people
> >  JH> just know to update those properties immediately as a starting
> > point.  If it's pretty much
> >  JH> a given that we set something as a starting point different than the
> > current defaults, why
> >  JH> not make that the new default?
> >  JH> 2) We should align the defaults with what we test with.  There may
> be
> > exceptions if we
> >  JH> have one-off tests but on the whole, we should be testing with
> > defaults.
> >
> >  JH> As a starting point, compaction throughput and number of vnodes seem
> > like good candidates
> >  JH> but it would be great to get feedback for any others.
> >
> >  JH> For compaction throughput (
> > https://jira.apache.org/jira/browse/CASSANDRA-14902), I've made
> >  JH> a basic case on the ticket to default to 64 just as a starting point
> > because the decision
> >  JH> for 16 was made when spinning disk was most common.  Hence most
> > people I know change that
> >  JH> and I think without too much bikeshedding, 64 is a reasonable
> > starting point.  A case
> >  JH> could be made that empirically the compaction throughput throttle
> may
> > have less effect
> >  JH> than many people think, but I still think an updated default would
> > make sense.
> >
> >  JH> For number of vnodes, Michael Shuler made the point in the
> discussion
> > that we already test
> >  JH> with 32, which is a far better number than the 256 default.  I know
> > many new users that
> >  JH> just leave the 256 default and then discover later that it's better
> > to go lower.  I think
> >  JH> 32 is a good balance.  One could go lower with the new algorithm but
> > I think 32 is much
> >  JH> better than 256 without being too skewed, and it's what we currently
> > test.
> >
> >  JH> Jeff brought up a good point that we want to be careful with
> defaults
> > since changing them
> >  JH> could come as an unpleasant surprise to people who don't explicitly
> > set them.  As a
> >  JH> general rule, we should always update release notes to clearly state
> > that a default has
> >  JH> changed.  For these two defaults in particular, I think it's safe.
> > For compaction
> >  JH> throughput I think a release not is sufficient in case they want to
> > modify it.  For number
> >  JH> of vnodes, it won't affect existing deployments with data - it would
> > be for new clusters,
> >  JH> which would honestly benefit from this anyway.
> >
> >  JH> The other point is whether it's too late to go into 4.0.  For these
> > two changes, I think
> >  JH> significant testing can still be done with these new defaults before
> > release and I think
> >  JH> testing more explicitly with 32 vnodes in particular will give
> people
> > more confidence in
> >  JH> the lower number with a wider array of testing (where we don't
> > already use 32 explicitly).
> >
> >  JH> In summary, are people okay with considering updating these defaults
> > and possibly others
> >  JH> in the alpha stage of a new major release?  Are there other
> > properties to consider?
> >
> >  JH> Jeremy
> >  JH>
> ---------------------------------------------------------------------
> >  JH> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> >  JH> For additional commands, e-mail: dev-h...@cassandra.apache.org
> >
> >
> >
> > --
> > With best wishes,                    Alex Ott
> > Principal Architect, DataStax
> > http://datastax.com/
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> > For additional commands, e-mail: dev-h...@cassandra.apache.org
> >
> >
>

Reply via email to