Sent you an invite Sam. Welcome to the community!

On Fri, Oct 27, 2023 at 10:31 AM Sam <samueldlightf...@gmail.com> wrote:

> Please can I have an invite to the Slack workspace on this email. I'd like
> to take a look through some of the items for first time contributors :-)
>
> Thanks!
>
> On Fri, 27 Oct 2023 at 18:10, Josh McKenzie <jmcken...@apache.org> wrote:
>
>> In case you're keeping score on how frequently these are coming out: *please
>> stop*. ;)
>>
>> Silver lining - looks like we have a lot to discuss this round! Last
>> update was late July and we've been churning through the 5.0 freeze and
>> stabilization phase.
>>
>>
>>
>> *[New Contributors Getting Started]*
>> Check out https://the-asf.slack.com, channel #cassandra-dev. Reply
>> directly to me on this email if you need an invite for your account, and
>> reach out to the @cassandra_mentors alias in the channel if you need to get
>> oriented.
>>
>> We have a list of curated "getting started" tickets you can find here,
>> filtered to "ToDo" (i.e. not yet worked):
>> https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=484&quickFilter=2160&quickFilter=2162&quickFilter=2652
>> .
>>
>> *Helpful links:*
>> - Getting Started with Development on C*:
>> https://cassandra.apache.org/_/development/gettingstarted.html
>> - Building and IDE integration (worktrees are your friend; msg me on
>> slack if you need pointers):
>> https://cassandra.apache.org/_/development/ide.html
>> - Code Style: https://cassandra.apache.org/_/development/code_style.html
>>
>>
>>
>> *[Dev mailing list]*
>>
>> https://lists.apache.org/list?dev@cassandra.apache.org:dfr=2023-7-20%7Cdto=2023-10-27
>> :
>>
>> My last email of shame was 35 threads. Drumroll for this one...
>> 91. *Yeesh*. Let me stick to highlights.
>>
>> Ekaterina pushed through dropping JDK8 support and adding JDK17
>> support... back in July. If you didn't know about it by know, consider
>> yourself doubly notified. :) .
>> https://lists.apache.org/thread/9pwz3vtpf88fly27psc7yxvcv0lwbz8k I think
>> I can speak on behalf of all of us when I say: *Thank You Ekaterina.*
>>
>> This came up recently on another thread about when to branch 5.1, but we
>> discussed our freeze plans and exception rules for TCM and Accord here:
>> https://lists.apache.org/thread/mzj3dq8b7mzf60k6mkby88b9n9ywmsgw. Mick
>> was essentially looking for a similar waiver for Vector search since it was
>> well abstracted, depended on SAI and external libs, and in general
>> shouldn't be too big of a disruption to get into 5.0. General consensus at
>> the time was "sure", and the work has since been completed. But here's the
>> reminder and link for posterity (and in case you missed it).
>>
>> Jaydeep reached out about a potential short-term solution to detecting
>> token-ownership mismatch while we don't yet have TCM; this seems more
>> pressing now as we're looking at a 5.0 without yet having TCM in it. The
>> dev ML thread is here:
>> https://lists.apache.org/thread/4p0orhom42g36osnknqj3fqmqhvqml1g, and he
>> created https://issues.apache.org/jira/browse/CASSANDRA-18758 dealing
>> with the topic. There's a relatively modest (7 files, just over 300 lines)
>> PR available here: https://github.com/apache/cassandra/pull/2595/files;
>> I haven't looked into it, but it might be worth considering getting this
>> into 5.0 since it looks like we're moving to cutting w/out TCM. Any
>> thoughts?
>>
>> We had a pretty good discussion about automated repair scheduling,
>> discussing whether it should live in the DB proper vs. in the sidecar, pros
>> and cons, pressures, etc. Not sure if things moved beyond that; I know
>> there's at least a few implementations out there that haven't yet made
>> their way back to the ASF project proper. Thread:
>> https://lists.apache.org/thread/glvmkwknf91rxc5l6w4d4m1kcvlr6mrv. My
>> hope is we can avoid the gridlock we hit for a long time with the sidecar
>> where there are multiple implementations with different tradeoffs and
>> everyone's disincentivized from accepting a solution different from their
>> own in-house one since it'd theoretically require re-tooling. Tough problem
>> with no easy solutions, but would love to see this become a first class
>> citizen in the ecosystem.
>>
>> Paulo brought up a discussion about moving to disk_access_mode =
>> mmap_index_only on 5.0. Seemed to be a consensus there but I'm not sure we
>> actually changed that in the 5.0 branch? Thread:
>> https://lists.apache.org/thread/nhp6vftc4kc3dxskngxy5rpo1lp19drw. Just
>> pulled on cassandra-5.0 and it looks like auto + hasLargeAddressSpace() ==
>> .mmap rather than .mmap_index_only.
>>
>> David Capwell worked on adding some retries to repair messages when
>> they're failing to make the process more robust:
>> https://lists.apache.org/thread/wxv6k6slljqcw73xcmpxj4kn5lz95jd1.
>> Reception was positive enough that he went so far as to back-port it and
>> also work on some for IR. Looks like he could use a reviewer here:
>> https://issues.apache.org/jira/browse/CASSANDRA-18962 - and this is
>> patch available.
>>
>> Mike Adamson reached out about adding / taking a dependency on jvector:
>> https://lists.apache.org/thread/zkqg7mk9hp35zn0cf1tvywc2m3l63jrn. The
>> general gist of it was "looks good, written by committer(s) / pmc members,
>> permissvely licensed. Go for it". Some discussion about copyright holders
>> and whether that matters from an ASF perspective, and we've further had
>> some good discussion about the application of generative AI tooling to not
>> just code contributed to the ASF, but also in dependencies we bring into
>> the project. If you're curious about more details, check out the Apache
>> LEGAL-656 JIRA here: https://issues.apache.org/jira/browse/LEGAL-656.
>> The TL;DR comment is from Roman here:
>> https://issues.apache.org/jira/browse/LEGAL-656?focusedCommentId=17779813&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17779813
>> .
>>
>> Maxim Muzafarov keeps fighting the good fight of helping to clean up our
>> codebase; he opened a thread about Cassandra's code style and source
>> analysis here:
>> https://lists.apache.org/thread/lr90ckt7scgs4tqjwd2t7928plngo5zl. We
>> have a label for "code-polishing" that you can check that we're holding off
>> on until after Accord and TCM merge so they don't take on a painful rebase
>> burden mid-integration work (
>> https://issues.apache.org/jira/issues/?jql=labels%20%3D%20code-polishing
>> ).
>>
>> Mick had some ideas around improving how we announce and handle having
>> broken branches and merging to them:
>> https://lists.apache.org/thread/n7zhzk4svdh1v3pswkrfwxw4o3g2f6xy. The
>> gist of this: it's not great when a branch is straight up broken in ASF CI
>> and then folks merge more code on top of that break; makes it harder to
>> root out what's going on. We didn't _really_ get too far in closure on how
>> we'd prevent this case in the future beyond "email the dev ML, post in
>> #cassandra-dev slack, and... pray?". I'm in favor of a slack-bot that yells
>> at us hourly if our builds are formally broken so we can't forget, with the
>> assumption it _should_ be a pretty rare situation. If anyone else has input
>> here that'd be helpful.
>>
>> Builds for 5.0 and trunk are now based on in-tree build scripts (found in
>> .build). The scripts were moved from the cassandra-builds repo here:
>> https://github.com/apache/cassandra-builds, where you can find build
>> scripts used for other branches. Expect this to continue to evolve as we
>> take some of the best learnings from circleci and other build systems and
>> integrate them upstream.
>>
>> Claude discovered that our documentation for development dependencies is
>> out of date:
>> https://lists.apache.org/thread/91l7x7r0w7yycndslfc8kjs74s3jyqr2. Looks
>> like Abe's working on an update there, but if anyone has opinions or cycles
>> to help out this is high leverage work.
>>
>> Yifan Cai reached out about merging some changes for CQLSSTableWriter to
>> 4.0 and up. Since this is offline tools only the general consensus was "go
>> for it": https://lists.apache.org/thread/nwqdmqzoht2nyw9hg8o061vh6vk2oxd5
>>
>> Maxim could use a reviewer for allowing UPDATE on settings virtual tables
>> (ML: https://lists.apache.org/thread/rsgtwdlg411d76kptkbxv292hnv1s1c5,
>> original ML thread here:
>> https://lists.apache.org/thread/8kywzv24n0dp07mhvch7hwhjypssoh0l, JIRA:
>> https://issues.apache.org/jira/browse/CASSANDRA-15254). I have to
>> imagine most users would prefer to use CQL to interact w/their node
>> settings than JMX, though I assume most of us have some Stockholm Syndrome
>> at this point.
>>
>> Amit Pawar reached out about how we're approaching our defaults for the
>> CommitLog (mmap vs. the new DirectI/O they have a PR up for). The general
>> consensus was "that looks and sounds great, and we shouldn't change
>> defaults until it's had time to bake as an option".
>> https://lists.apache.org/thread/t6v0p10737p0joob2vcsdt0r3g8zt94q
>>
>>
>>
>> *[CI]*
>> https://butler.cassandra.apache.org/#/
>>
>> Since late July (~ 3 months):
>>
>> 3.0: 9 -> 18
>> • Was hovering around 12 ish for a good while there
>> 3.11: 16 -> 20
>> • There's a lot more variance on this one. Curious why the delta from 3.0.
>> 4.0: 24 -> 11
>> • Looks like long-term trend is around the 8? mark
>> 4.1: 12 -> 12
>> • Pretty stable around 12 failures here
>> 5.0: Averaging around 10
>> • Do we have too many branches yet?
>> trunk: 16 -> 12
>> • One pretty big spike in there when CI was transitioning over, but on
>> the whole in a pretty "tame" place.
>>
>> Low-grade noise on each of the branches. Spot-checking failures on 3.0,
>> 4.0, and trunk, nothing really pops as being commonalities between them.
>>
>>
>> *[What's been closed out]*
>> Updated quick-filter with new, ridiculous 90 day duration:
>> https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=484&quickFilter=2278
>> JQL sorted by priority then type:
>> https://issues.apache.org/jira/issues/?jql=project%20%3D%20CASSANDRA%20and%20resolution%20%3D%20fixed%20and%20resolved%20%3E%20-90d%20order%20by%20priority%20DESC%2C%20type%20DESC
>>
>> Due to the sheer volume of tickets (170 in the past 90 days!), I'll
>> refrain from including them all in this email thread here. I should be
>> considerably less "compressed for time" in the near future, so fingers
>> crossed we can get back to a more digestible volume on these updates on a
>> monthly cadence as we go into aggressive "release-mode".
>>
>> Being a part of an open-source community that's this mature, in a domain
>> this complex, that's not only firing on all cylinders but going further and
>> self-improving and accelerating is really gratifying and humbling for me.
>> Thanks everyone for being a part of this.
>>
>> ~Josh
>>
>

Reply via email to