RE: Looking for common cases in cassandra fit to autoheal
Michael is right. It's not the topic of this list. Write me offline and I'll be glad to direct you to some resources with the information you are looking for. Kenneth Brotman -Original Message- From: Michael Shuler [mailto:mshu...@pbandjelly.org] On Behalf Of Michael Shuler Sent: Friday, March 01, 2019 12:49 PM To: dev@cassandra.apache.org Subject: Re: Looking for common cases in cassandra fit to autoheal Your request is not related to the topic of this mailing list, the development of Apache Cassandra. Your question would be better suited for the user@ mailing list. Your question is also quite vague, and you might include some detail or context about what exactly you are looking for. You may also wish to drop the unenforceable email footer or use a personal email address, if it's appended by your mail server. -- Kind regards, Michael http://www.pbandjelly.org/2011/03/to-whom-it-may-concern/ On 3/1/19 2:33 PM, Sundaramoorthy, Natarajan wrote: > Can someone please reply? Thanks > > > > > -Original Message- > From: Sundaramoorthy, Natarajan [mailto:natarajan_sundaramoor...@optum.com] > Sent: Wednesday, February 27, 2019 3:17 PM > To: dev@cassandra.apache.org > Subject: Looking for common cases in cassandra fit to autoheal > > Can someone please provide some common cases in cassandra which can be > candidate for autoheal? Looking for log files and issue..Both from VM and/or > pod world welcome...Thanks in advance for help > > This e-mail, including attachments, may include confidential and/or > proprietary information, and may be used only by the person or entity > to which it is addressed. If the reader of this e-mail is not the intended > recipient or his or her authorized agent, the reader is hereby notified > that any dissemination, distribution or copying of this e-mail is > prohibited. If you have received this e-mail in error, please notify the > sender by replying to this message and delete this e-mail immediately. > > This e-mail, including attachments, may include confidential and/or > proprietary information, and may be used only by the person or entity > to which it is addressed. If the reader of this e-mail is not the intended > recipient or his or her authorized agent, the reader is hereby notified > that any dissemination, distribution or copying of this e-mail is > prohibited. If you have received this e-mail in error, please notify the > sender by replying to this message and delete this e-mail immediately. > > > - > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org > For additional commands, e-mail: dev-h...@cassandra.apache.org > - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: Roadmap for 4.0
So in a vibrant community like ours, each year it is reasonable to expect that some new features will be ready. We probably can't predict far in advance which ones. So each year whatever is ready, is included in that year's major release (using a 12 month cycle as an example) and then no one is rushing to meet a deadline because they are holding up a version, because that feature will just be able to go in the next version. No pressure; and no energy discussing how we will approach each version. That's a lot of high caliber talent we are consuming. Kenneth Brotman -Original Message- From: Kenneth Brotman [mailto:kenbrot...@yahoo.com.INVALID] Sent: Wednesday, April 04, 2018 4:51 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 The group seems to be trying to find a set of features that will define version 4.0. I'm saying that makes things way too complicated. You'll drift, time will go by, no release because of this or that. I'm saying instead, accept that you can't know the time frame really that it will take to properly develop and test a feature. You won't want to release it until it's ready. So take the pressure and complication out of it. Every once in a while a nice set of features will be ready and should not be delayed when they are. That's when you have a new major release. Let it happen more naturally instead of forcing it. Kenneth Brotman -Original Message- From: Kenneth Brotman [mailto:kenbrot...@yahoo.com.INVALID] Sent: Wednesday, April 04, 2018 4:23 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 I wouldn't want to add anything to a release that isn't ready. Whatever isn't ready can go in a future release. -Original Message- From: Scott Andreas [mailto:sc...@paradoxica.net] Sent: Wednesday, April 04, 2018 4:18 PM To: dev@cassandra.apache.org Subject: Re: Roadmap for 4.0 Re-raising a point made earlier in the thread by Jeff and affirmed by Josh: --- Jeff: >> A hard date for a feature freeze makes sense, a hard date for a >> release does not. Josh: > Strongly agree. We should also collectively define what "Done" looks > like post freeze so we don't end up in bike-shedding hell like we have > in the past. --- Another way of saying this: ensuring that the 4.0 release is of high quality is more important than cutting the release on a specific date. If we adopt Sylvain's suggestion of freezing features on a "feature complete" date (modulo a "definition of done" as Josh suggested), that will help us align toward the polish, performance work, and dog-fooding needed to feel great about shipping 4.0. It's a good time to start thinking about the approaches to testing, profiling, and dog-fooding various contributors will want to take on before release. I love how Ben put it: > An "exciting" 4.0 release to me is one that is stable and usable with > no perf regressions on day 1 and includes some of the big internal > changes mentioned previously. > > This will set the community up well for some awesome and exciting > stuff that will still be in the pipeline if it doesn't make it to 4.0. That sounds great to me, too. - Scott From: Kenneth Brotman <kenbrot...@yahoo.com.INVALID> Sent: Wednesday, April 4, 2018 2:20:59 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 Focusing on 4.0 release then, lets agree on a date next year. Whatever is ready for release by that date is what will be in that release. Kenneth Brotman -Original Message- From: Nate McCall [mailto:zznat...@gmail.com] Sent: Wednesday, April 04, 2018 12:59 PM To: dev Subject: Re: Roadmap for 4.0 On Thu, Apr 5, 2018 at 3:26 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Can I suggest a way of defining the next few progressions as a way of approaching this? > > How about something like this: > Version 4.0: A major release of as many improvements to the > code as can be ready for a release on a date sometime next year ;to be decided on by us this month. > Versions 4.x: minor releases about every three months starting after a major release with improvements to the code that can be ready for release, with bug fixes as needed done in between. > Version: 5.0: a major release of whatever significant > improvements are ready for release one year after the release of 4.0 > Versions 5.x: minor releases about every three months with improvements, with bug fixes as needed done in between, > And so on: > A Major release every 12 months of whatever can be > ready for release in that major version, > A minor release every 3 months of whatever can be > ready for release in that minor version. > Bug fixes as needed. > > The folks working on code could then get an idea
RE: Roadmap for 4.0
The group seems to be trying to find a set of features that will define version 4.0. I'm saying that makes things way too complicated. You'll drift, time will go by, no release because of this or that. I'm saying instead, accept that you can't know the time frame really that it will take to properly develop and test a feature. You won't want to release it until it's ready. So take the pressure and complication out of it. Every once in a while a nice set of features will be ready and should not be delayed when they are. That's when you have a new major release. Let it happen more naturally instead of forcing it. Kenneth Brotman -Original Message- From: Kenneth Brotman [mailto:kenbrot...@yahoo.com.INVALID] Sent: Wednesday, April 04, 2018 4:23 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 I wouldn't want to add anything to a release that isn't ready. Whatever isn't ready can go in a future release. -Original Message- From: Scott Andreas [mailto:sc...@paradoxica.net] Sent: Wednesday, April 04, 2018 4:18 PM To: dev@cassandra.apache.org Subject: Re: Roadmap for 4.0 Re-raising a point made earlier in the thread by Jeff and affirmed by Josh: --- Jeff: >> A hard date for a feature freeze makes sense, a hard date for a >> release does not. Josh: > Strongly agree. We should also collectively define what "Done" looks > like post freeze so we don't end up in bike-shedding hell like we have > in the past. --- Another way of saying this: ensuring that the 4.0 release is of high quality is more important than cutting the release on a specific date. If we adopt Sylvain's suggestion of freezing features on a "feature complete" date (modulo a "definition of done" as Josh suggested), that will help us align toward the polish, performance work, and dog-fooding needed to feel great about shipping 4.0. It's a good time to start thinking about the approaches to testing, profiling, and dog-fooding various contributors will want to take on before release. I love how Ben put it: > An "exciting" 4.0 release to me is one that is stable and usable with > no perf regressions on day 1 and includes some of the big internal > changes mentioned previously. > > This will set the community up well for some awesome and exciting > stuff that will still be in the pipeline if it doesn't make it to 4.0. That sounds great to me, too. - Scott From: Kenneth Brotman <kenbrot...@yahoo.com.INVALID> Sent: Wednesday, April 4, 2018 2:20:59 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 Focusing on 4.0 release then, lets agree on a date next year. Whatever is ready for release by that date is what will be in that release. Kenneth Brotman -Original Message- From: Nate McCall [mailto:zznat...@gmail.com] Sent: Wednesday, April 04, 2018 12:59 PM To: dev Subject: Re: Roadmap for 4.0 On Thu, Apr 5, 2018 at 3:26 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Can I suggest a way of defining the next few progressions as a way of approaching this? > > How about something like this: > Version 4.0: A major release of as many improvements to the > code as can be ready for a release on a date sometime next year ;to be decided on by us this month. > Versions 4.x: minor releases about every three months starting after a major release with improvements to the code that can be ready for release, with bug fixes as needed done in between. > Version: 5.0: a major release of whatever significant > improvements are ready for release one year after the release of 4.0 > Versions 5.x: minor releases about every three months with improvements, with bug fixes as needed done in between, > And so on: > A Major release every 12 months of whatever can be > ready for release in that major version, > A minor release every 3 months of whatever can be > ready for release in that minor version. > Bug fixes as needed. > > The folks working on code could then get an idea of when their code > would be ready for release version-wise. > > Kenneth Brotman Hi Kenneth, Appreciate the input, but this is quite a well-trodden path of discussion. Please see the following two (lengthy) threads from last year for background: https://lists.apache.org/thread.html/f7e1fa12ea2fb9c3eb366a04dfd7cab5d0d64eb 9f4057ad65bd62ace@%3Cdev.cassandra.apache.org%3E https://lists.apache.org/thread.html/684b559bf27b9deca0be0dd9629e6cd1fff5644 598180f950ff4f478@%3Cdev.cassandra.apache.org%3E Let's focus this thread on 4.0 release. - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org --
RE: Roadmap for 4.0
I wouldn't want to add anything to a release that isn't ready. Whatever isn't ready can go in a future release. -Original Message- From: Scott Andreas [mailto:sc...@paradoxica.net] Sent: Wednesday, April 04, 2018 4:18 PM To: dev@cassandra.apache.org Subject: Re: Roadmap for 4.0 Re-raising a point made earlier in the thread by Jeff and affirmed by Josh: --- Jeff: >> A hard date for a feature freeze makes sense, a hard date for a >> release does not. Josh: > Strongly agree. We should also collectively define what "Done" looks > like post freeze so we don't end up in bike-shedding hell like we have > in the past. --- Another way of saying this: ensuring that the 4.0 release is of high quality is more important than cutting the release on a specific date. If we adopt Sylvain's suggestion of freezing features on a "feature complete" date (modulo a "definition of done" as Josh suggested), that will help us align toward the polish, performance work, and dog-fooding needed to feel great about shipping 4.0. It's a good time to start thinking about the approaches to testing, profiling, and dog-fooding various contributors will want to take on before release. I love how Ben put it: > An "exciting" 4.0 release to me is one that is stable and usable with > no perf regressions on day 1 and includes some of the big internal > changes mentioned previously. > > This will set the community up well for some awesome and exciting > stuff that will still be in the pipeline if it doesn't make it to 4.0. That sounds great to me, too. - Scott From: Kenneth Brotman <kenbrot...@yahoo.com.INVALID> Sent: Wednesday, April 4, 2018 2:20:59 PM To: dev@cassandra.apache.org Subject: RE: Roadmap for 4.0 Focusing on 4.0 release then, lets agree on a date next year. Whatever is ready for release by that date is what will be in that release. Kenneth Brotman -Original Message- From: Nate McCall [mailto:zznat...@gmail.com] Sent: Wednesday, April 04, 2018 12:59 PM To: dev Subject: Re: Roadmap for 4.0 On Thu, Apr 5, 2018 at 3:26 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Can I suggest a way of defining the next few progressions as a way of approaching this? > > How about something like this: > Version 4.0: A major release of as many improvements to the code as can be ready for a release on a date sometime next year ;to be decided on by us this month. > Versions 4.x: minor releases about every three months starting after a major release with improvements to the code that can be ready for release, with bug fixes as needed done in between. > Version: 5.0: a major release of whatever significant improvements are ready for release one year after the release of 4.0 > Versions 5.x: minor releases about every three months with improvements, with bug fixes as needed done in between, > And so on: > A Major release every 12 months of whatever can be ready for release in that major version, > A minor release every 3 months of whatever can be ready for release in that minor version. > Bug fixes as needed. > > The folks working on code could then get an idea of when their code would be ready for release version-wise. > > Kenneth Brotman Hi Kenneth, Appreciate the input, but this is quite a well-trodden path of discussion. Please see the following two (lengthy) threads from last year for background: https://lists.apache.org/thread.html/f7e1fa12ea2fb9c3eb366a04dfd7cab5d0d64eb 9f4057ad65bd62ace@%3Cdev.cassandra.apache.org%3E https://lists.apache.org/thread.html/684b559bf27b9deca0be0dd9629e6cd1fff5644 598180f950ff4f478@%3Cdev.cassandra.apache.org%3E Let's focus this thread on 4.0 release. - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: Roadmap for 4.0
Focusing on 4.0 release then, lets agree on a date next year. Whatever is ready for release by that date is what will be in that release. Kenneth Brotman -Original Message- From: Nate McCall [mailto:zznat...@gmail.com] Sent: Wednesday, April 04, 2018 12:59 PM To: dev Subject: Re: Roadmap for 4.0 On Thu, Apr 5, 2018 at 3:26 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Can I suggest a way of defining the next few progressions as a way of > approaching this? > > How about something like this: > Version 4.0: A major release of as many improvements to the code as > can be ready for a release on a date sometime next year ;to be decided on by > us this month. > Versions 4.x: minor releases about every three months starting after > a major release with improvements to the code that can be ready for release, > with bug fixes as needed done in between. > Version: 5.0: a major release of whatever significant improvements > are ready for release one year after the release of 4.0 > Versions 5.x: minor releases about every three months with > improvements, with bug fixes as needed done in between, > And so on: > A Major release every 12 months of whatever can be ready for > release in that major version, > A minor release every 3 months of whatever can be ready for > release in that minor version. > Bug fixes as needed. > > The folks working on code could then get an idea of when their code would be > ready for release version-wise. > > Kenneth Brotman Hi Kenneth, Appreciate the input, but this is quite a well-trodden path of discussion. Please see the following two (lengthy) threads from last year for background: https://lists.apache.org/thread.html/f7e1fa12ea2fb9c3eb366a04dfd7cab5d0d64eb9f4057ad65bd62ace@%3Cdev.cassandra.apache.org%3E https://lists.apache.org/thread.html/684b559bf27b9deca0be0dd9629e6cd1fff5644598180f950ff4f478@%3Cdev.cassandra.apache.org%3E Let's focus this thread on 4.0 release. - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: Roadmap for 4.0
Can I suggest a way of defining the next few progressions as a way of approaching this? How about something like this: Version 4.0: A major release of as many improvements to the code as can be ready for a release on a date sometime next year ;to be decided on by us this month. Versions 4.x: minor releases about every three months starting after a major release with improvements to the code that can be ready for release, with bug fixes as needed done in between. Version: 5.0: a major release of whatever significant improvements are ready for release one year after the release of 4.0 Versions 5.x: minor releases about every three months with improvements, with bug fixes as needed done in between, And so on: A Major release every 12 months of whatever can be ready for release in that major version, A minor release every 3 months of whatever can be ready for release in that minor version. Bug fixes as needed. The folks working on code could then get an idea of when their code would be ready for release version-wise. Kenneth Brotman -Original Message- From: Jon Haddad [mailto:jonathan.had...@gmail.com] On Behalf Of Jon Haddad Sent: Wednesday, April 04, 2018 8:00 AM To: dev@cassandra.apache.org Subject: Re: Roadmap for 4.0 Agreed with Josh. There’s nothing set in stone after we release 4.0, trying to extrapolate what we do here for the rest of humanity’s timeline isn’t going to be a useful exercise. Regarding building a big list - it’s of no value. In fact, if we’re already talking about releasing 4.0 we should really only be merging in small features that enhance user experience like improving nodetool output or reasonable optimizations. Merging in big features at the very end of the merge window is a really great idea to have dozens of follow up bug fix releases that nobody considers stable, where the Coli conjecture always wins. IMO, it would be better / more responsible to merge them into trunk *after* we branch for 4.0. Yes, that makes the next release less exciting, but I really don’t think “exciting” is what we’re shooting for. I’m more in favor of stable. Regarding supporting 3.0 / 3.11, since we’re talking about feature freezing 4.0 2 months from now, and releasing it *sometime* after that, then add 6 months, we’re talking about close to an extra year of 3.0 support. People are, of course, free to continue patching 3.0, back porting fixes, etc, but I am completely OK with saying there’s only 9 more months of support starting today. I’m also in the go straight to 3.11 camp. I see no reason to upgrade to only 3.0 if you’re on 2.x. Jon > On Apr 4, 2018, at 6:29 AM, Josh McKenzie <jmcken...@apache.org> wrote: > >> >> This discussion was always about the release strategy. There is no >> separation between the release strategy for 4.0 and the release >> strategy for the project, they are the same thing and what is >> intended to be discussed here. > > Not trying to be pedantic here, but the email thread is titled > "Roadmap for 4.0" and has been concerned with how we get 4.0 out the > door. I don't think it's implicit that whatever strategy we settle on > for 4.0 is intended to apply to subsequent releases, since the 3.0.X > to 3.X to 4.0 relationship/delta is different than a 4.0 to 5.0 can be > expected to be. > > >> sidenote: 3.10 was released in January 2017, and while the changes >> list for >> 4.0 is getting quite large there's not much there that's going to win >> over users. It's mostly refactorings and improvements that affect >> developers more so than users. > > If you assume most 3. users are on 3.10, this argument makes sense. I > believe a majority are on 3.0.X or 2.1/2.2, which leaves a minority > looking at the small delta from 3.10 to 4.0 in the current form. > > > > On Wed, Apr 4, 2018 at 8:25 AM, kurt greaves <k...@instaclustr.com> wrote: > >>> >>> I'm also a bit sad that we seem to be getting back to our old demons >>> of trying to shove as much as we possibly can in the next major as >>> if having a feature miss it means it will never happen. >> >> That wasn't the intention of this thread, but that's the response I got. >> Thought I made it pretty clear that this was about compiling a list >> of things that people are currently working on and can commit to >> getting finished soon (which should be a relatively small list >> considering the limited number of full time contributors). >> >> Of course, we should probably (re-(re-(re-)))start a discussion on >> release >>> "strategy" in parallel because it doesn't seem we have one right >>> now, but that's im
RE: Repair scheduling tools
Why not make it configurable? auto_manage_repair_consistancy: true (default: false) Then users can use the built in auto repair function that would be created or continue to handle it as now. Default behavior would be "false" so nothing changes on its own. Just wondering why not have that option? It might accelerate progress as others have already suggested. Kenneth Brotman -Original Message- From: Nate McCall [mailto:zznat...@gmail.com] Sent: Tuesday, April 03, 2018 1:37 PM To: dev Subject: Re: Repair scheduling tools This document does a really good job of listing out some of the issues of coordinating scheduling repair. Regardless of which camp you fall into, it is certainly worth a read. On Wed, Apr 4, 2018 at 8:10 AM, Joseph Lynch <joe.e.ly...@gmail.com> wrote: > I just want to say I think it would be great for our users if we moved > repair scheduling into Cassandra itself. The team here at Netflix has > opened the ticket > <https://issues.apache.org/jira/browse/CASSANDRA-14346> > and have written a detailed design document > <https://docs.google.com/document/d/1RV4rOrG1gwlD5IljmrIq_t45rz7H3xs9G > bFSEyGzEtM/edit#heading=h.iasguic42ger> > that includes problem discussion and prior art if anyone wants to > contribute to that. We tried to fairly discuss existing solutions, > what their drawbacks are, and a proposed solution. > > If we were to put this as part of the main Cassandra daemon, I think > it should probably be marked experimental and of course be something > that users opt into (table by table or cluster by cluster) with the > understanding that it might not fully work out of the box the first > time we ship it. We have to be willing to take risks but we also have > to be honest with our users. It may help build confidence if a few > major deployments use it (such as Netflix) and we are happy of course > to provide that QA as best we can. > > -Joey > > On Tue, Apr 3, 2018 at 10:48 AM, Blake Eggleston > <beggles...@apple.com> > wrote: > >> Hi dev@, >> >> >> >> The question of the best way to schedule repairs came up on >> CASSANDRA-14346, and I thought it would be good to bring up the idea >> of an external tool on the dev list. >> >> >> >> Cassandra lacks any sort of tools for automating routine tasks that >> are required for running clusters, specifically repair. Regular >> repair is a must for most clusters, like compaction. This means that, >> especially as far as eventual consistency is concerned, Cassandra >> isn’t totally functional out of the box. Operators either need to >> find a 3rd party solution or implement one themselves. Adding this to >> Cassandra would make it easier to use. >> >> >> >> Is this something we should be doing? If so, what should it look like? >> >> >> >> Personally, I feel like this is a pretty big gap in the project and >> would like to see an out of process tool offered. Ideally, Cassandra >> would just take care of itself, but writing a distributed repair >> scheduler that you trust to run in production is a lot harder than >> writing a single process management application that can failover. >> >> >> >> Any thoughts on this? >> >> >> >> Thanks, >> >> >> >> Blake >> >> - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: A Simple List of New Major Features Desired for Apache Cassandra Version 4.0
Thanks Ben for taking the time to write your response. I am still learning the culture and ways of the group. I'm admittedly new to the open source way of doing things. I am always anxious to pitch in. Carry on. I will leave it alone for now. Kenneth Brotman -Original Message- From: Ben Bromhead [mailto:b...@instaclustr.com] Sent: Saturday, March 31, 2018 6:59 PM To: dev@cassandra.apache.org Subject: Re: A Simple List of New Major Features Desired for Apache Cassandra Version 4.0 Thank you, Kenneth, for listening to the PMC and for putting the discussion in the correct list. I also appreciate your enthusiasm. To address a few of your points from your previous emails in no particular order: - There is already a compiled list of features slated for 4.0, this list is simply a search using the following JQL on JIRA - `project = CASSANDRA AND fixVersion = 4.0 ORDER BY priority DESC, updated DESC`. By looking at this list, we can see fixes/patches/features that have been committed to trunk as well as tickets that are still in progress but someone at some point thought it was likely to land in 4.0 - I appreciate a desire to create a bucket list of any and all desired features for 4.0, the reality is that this is an open source project driven by individual contributors and so what goes in 4.0 is largely up to those who do the work and put those features in. - Whilst it has been a long time since 3.0 was first released, the 3.x tick-tock experiment resulted in a large number of major releases in short succession and as such, I think most folks are simply recovering from that push + there is no longer a march to a vendors drum beat. Hence the slow down in major feature release and a focus on getting existing features more stable. - The major release of 3.11 was only released at the end of June 2017 and it has had 2 point releases in the meantime. - Whilst not having a consistent major release schedule can be frustrating for end users and product marketers, not having a stable database is far more maddening. - Having said that, there is a good body of both large and small changes that are in "done/resolved", "patch ready", "awaiting feedback" etc and slated for 4.0 which would be good to get out the door. While this is a matter of opinion I think it's about reaching a nice balance of having not too many changes making it a larger adoption risk and having enough time to work on some good things (e.g. https://issues.apache.org/jira/browse/CASSANDRA-12229), such that it's actually worth working toward cutting a major release. - I'd respectfully disagree that we have a "basic collaboration challenge". This is primarily a community that communicates by gradually moving towards consensus (which takes time) and this thread is simply one of the many discussions and interactions that move towards consensus about 4.0. If anything we are resource/people constrained, but that is true of all open source communities. I've decided to respond on the previous thread, "Roadmap to 4.0" (however just via the dev list where the discussion should live) with my suggestions as I don't want to ignore both Kurt and Jeffs previous contributions on this subject. On Fri, Mar 30, 2018 at 6:49 PM Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Just list any desired new major features for 4.0 that you want added. > I will maintain a compiled list for all to see. Don't worry about any > steps beyond this. Don't make any judgements about or make any > comments at all about what others add. > > No judgments at this point. This is a list of everyone's suggestions. > Add your suggestions for new major features you desire to be added for > version > 4.0 only. Keep it simple, not detailed yet. That comes a few steps > from now. What we have is a basic collaboration challenge. No problem. > > Kenneth Brotman > > > > > > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1 650 284 9692 Reliability at Scale Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
A Simple List of New Major Features Desired for Apache Cassandra Version 4.0
Just list any desired new major features for 4.0 that you want added. I will maintain a compiled list for all to see. Don't worry about any steps beyond this. Don't make any judgements about or make any comments at all about what others add. No judgments at this point. This is a list of everyone's suggestions. Add your suggestions for new major features you desire to be added for version 4.0 only. Keep it simple, not detailed yet. That comes a few steps from now. What we have is a basic collaboration challenge. No problem. Kenneth Brotman
RE: Paying off tech debt and correctly naming things
Perfect! -Original Message- From: Jon Haddad [mailto:jonathan.had...@gmail.com] On Behalf Of Jon Haddad Sent: Thursday, March 22, 2018 8:10 AM To: dev@cassandra.apache.org Subject: Re: Paying off tech debt and correctly naming things Cool. I think there’s general agreement that doing this in as small bites as possible is going to be the best approach. I have no interest in mega patches. > The combined approach takes a > change that's already non-trivially dealing with complex subsystem > changes and injects a bunch of trivial renaming noise across unrelated > subsystems into the signal of an actual logic refactor. I agree. This is why I like the idea of proactively working to improve the readability of the codebase as a specific goal, rather than being wrapped into some other unrelated patch. Keeping the scope in check is the challenge. Simple class and method renames, as several have pointed out, is easy enough with IDEA. I’ll start with class renames, as individual patches for each of them. I’ll be sure to call it out on the ML. First one will be ColumnFamilyStore -> TableStore. Jon > On Mar 22, 2018, at 7:13 AM, Jason Brownwrote: > > Jon, > > Thanks for bringing up this topic. I'll admit that I've been around > this code base for long enough, and have enough accumulated history, > that I probably can't fully appreciate the impact for a newcomer wrt naming. > However, as Josh points out, this situation probably happens to "every > non-trivially aged code-base ever". > > One thing I'd like to add is that with these types of large > refactoring changes, the review effort is non-trivial. This is because > the review still has to ensure that correctness is preserved and it's > easy to overlook a seemingly innocuous change. > > That being said, I am supportive of this effort. However, I believe > it's going to be best, for contributor and reviewer, to break it up > into smaller, more digestible pieces. I'd also like to request that we > not go whole hog and try to do everything in a compressed time frame; > reviewer availability is already stretched thin and I'm afraid of > deepening the review queue, especially mine :) > > Thanks, > > -Jason > > > > > On Thu, Mar 22, 2018 at 6:41 AM, Josh McKenzie wrote: > >>> Some of us have big patches in flight, things that actually pay off >>> some technical debt, and dealing with such renames is rebase >> hell :\ >> For sure, but with a code-base this old / organically grown, I expect >> this will always be the case. If we're talking something as simple as >> an intellij rename refactor, while menial, couldn't someone with a >> giant patch just do the same thing on their side and spend half an >> hour of their life clicking next? ;) >> >>> That said, there is good time for such renames - it’s during those >>> major refactors and rewrites. When you are changing a subsystem, >>> might as well do the appropriate renames. >> Does that hold true for a code-base with as much static state and >> abstraction leaking / bad factoring as we have? (i.e. every >> non-trivially aged code-base ever) The combined approach takes a >> change that's already non-trivially dealing with complex subsystem >> changes and injects a bunch of trivial renaming noise across >> unrelated subsystems into the signal of an actual logic refactor. >> >> On Thu, Mar 22, 2018 at 9:31 AM, Aleksey Yeshchenko >> >> wrote: >>> Poor and out-of-date naming of things is probably the least serious >>> part >> of our technical debt. Bad factoring, and straight-up >>> poorly written components is where it’s really at. >>> >>> Doing a big rename for rename sake alone does more harm than it is >>> good, >> sometimes. Some of us have big patches >>> in flight, things that actually pay off some technical debt, and >>> dealing >> with such renames is rebase hell :\ >>> >>> That said, there is good time for such renames - it’s during those >>> major >> refactors and rewrites. When you are >>> changing a subsystem, might as well do the appropriate renames. >>> >>> — >>> AY >>> >>> On 20 March 2018 at 22:04:48, Jon Haddad (j...@jonhaddad.com) wrote: >>> >>> Whenever I hop around in the codebase, one thing that always manages >>> to >> slow me down is needing to understand the context of the variable >> names that I’m looking at. We’ve now removed thrift the transport, >> but the variables, classes and comments still remain. Personally, I’d >> like to go in and pay off as much technical debt as possible by >> refactoring the code to be as close to CQL as possible. Rows should >> be rows, not partitions, I’d love to see the term column family >> removed forever in favor of always using tables. That said, it’s a >> big task. I did a quick refactor in a branch, simply changing the >> ColumnFamilyStore class to TableStore, and pushed it up to GitHub. >> [1] >>> >>>
RE: Paying off tech debt and correctly naming things
I agree with Jason Brown: good topic to address, effect not trivial. Going whole hog too risky. Careful meticulous small areas at a time with warning to everyone so they can chime in if they are working on that area and would prefer it left alone for now. I wouldn't want it to delay others things. For example, not sure where everyone is on this but, if Reaper was integrated in Cassandra so it just worked away in Cassandra without needing any installation or attention of any kind - in time for version 4.0 - would be really cool. Kenneth Brotman -Original Message- From: Jason Brown [mailto:jasedbr...@gmail.com] Sent: Thursday, March 22, 2018 7:14 AM To: dev@cassandra.apache.org Subject: Re: Paying off tech debt and correctly naming things Jon, Thanks for bringing up this topic. I'll admit that I've been around this code base for long enough, and have enough accumulated history, that I probably can't fully appreciate the impact for a newcomer wrt naming. However, as Josh points out, this situation probably happens to "every non-trivially aged code-base ever". One thing I'd like to add is that with these types of large refactoring changes, the review effort is non-trivial. This is because the review still has to ensure that correctness is preserved and it's easy to overlook a seemingly innocuous change. That being said, I am supportive of this effort. However, I believe it's going to be best, for contributor and reviewer, to break it up into smaller, more digestible pieces. I'd also like to request that we not go whole hog and try to do everything in a compressed time frame; reviewer availability is already stretched thin and I'm afraid of deepening the review queue, especially mine :) Thanks, -Jason On Thu, Mar 22, 2018 at 6:41 AM, Josh McKenzie <jmcken...@apache.org> wrote: > > Some of us have big patches in flight, things that actually pay off > > some technical debt, and dealing with such renames is rebase > hell :\ > For sure, but with a code-base this old / organically grown, I expect > this will always be the case. If we're talking something as simple as > an intellij rename refactor, while menial, couldn't someone with a > giant patch just do the same thing on their side and spend half an > hour of their life clicking next? ;) > > > That said, there is good time for such renames - it’s during those > > major refactors and rewrites. When you are changing a subsystem, > > might as well do the appropriate renames. > Does that hold true for a code-base with as much static state and > abstraction leaking / bad factoring as we have? (i.e. every > non-trivially aged code-base ever) The combined approach takes a > change that's already non-trivially dealing with complex subsystem > changes and injects a bunch of trivial renaming noise across unrelated > subsystems into the signal of an actual logic refactor. > > On Thu, Mar 22, 2018 at 9:31 AM, Aleksey Yeshchenko > <alek...@apple.com> > wrote: > > Poor and out-of-date naming of things is probably the least serious > > part > of our technical debt. Bad factoring, and straight-up > > poorly written components is where it’s really at. > > > > Doing a big rename for rename sake alone does more harm than it is > > good, > sometimes. Some of us have big patches > > in flight, things that actually pay off some technical debt, and > > dealing > with such renames is rebase hell :\ > > > > That said, there is good time for such renames - it’s during those > > major > refactors and rewrites. When you are > > changing a subsystem, might as well do the appropriate renames. > > > > — > > AY > > > > On 20 March 2018 at 22:04:48, Jon Haddad (j...@jonhaddad.com) wrote: > > > > Whenever I hop around in the codebase, one thing that always manages > > to > slow me down is needing to understand the context of the variable > names that I’m looking at. We’ve now removed thrift the transport, but > the variables, classes and comments still remain. Personally, I’d like > to go in and pay off as much technical debt as possible by refactoring > the code to be as close to CQL as possible. Rows should be rows, not > partitions, I’d love to see the term column family removed forever in > favor of always using tables. That said, it’s a big task. I did a > quick refactor in a branch, simply changing the ColumnFamilyStore > class to TableStore, and pushed it up to GitHub. [1] > > > > Didn’t click on the link? That’s ok. The TL;DR is that it’s almost > > 2K > LOC changed across 275 files. I’ll note that my branch doesn’t change > any of the almost 1000 search results of “columnfamilystore” found in > the codebase and hundreds of tests f
RE: A JIRA proposing a seperate repository for the online documentation
I've been trying to withdraw from the conversation since Friday. -Original Message- From: Josh McKenzie [mailto:jmcken...@apache.org] Sent: Monday, March 19, 2018 9:56 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation And to be explicit about this: I'm going to withdraw from this conversation now. I don't think we're generating sufficient signal in this dialog compared to the noise. Thanks for the enthusiasm and input Kenneth. On Mon, Mar 19, 2018 at 11:48 AM, Josh McKenzie <jmcken...@apache.org> wrote: > You missed the point ... let them do it their way when it makes no >> difference > > If you're suggesting we use a static markdown based generator as the > "go to" method for content generation and accept PR's for "New > Hotness" (do it their way), then sure, that seems a totally fair > compromise. Pretty sure that's not what you're suggesting, and people > Wild Westing a contribution to a project, that the project then owns > when you get bored of it and move on, doesn't qualify for "makes no > difference". > > On Mon, Mar 19, 2018 at 11:10 AM, Kenneth Brotman < > kenbrot...@yahoo.com.invalid> wrote: > >> You missed the point. If someone can do a good job on something, >> just let them do it their way when it makes no difference. >> Specifically regarding the website, I should be able to submit HTML, >> CSS and JS if I want, others should be able to do it with the static >> generator thing if they want. I could have fixed the website with >> complete version specific pages, with some really good text to fill >> in where there is none now and so on; and done that several different >> ways by now. I'm donating my time too. Please keep that in mind. >> >> Let me ask again, where are the standards? How can you leave the >> website for the project you will be associated with looking that substandard? >> Rahul Singh has been making a very important point that different >> types of uses with different uses cases use the website. This is a >> public website; not an internal site for coders. >> >> Kenneth Brotman >> >> -Original Message- >> From: Josh McKenzie [mailto:jmcken...@apache.org] >> Sent: Monday, March 19, 2018 7:42 AM >> To: dev@cassandra.apache.org >> Subject: Re: A JIRA proposing a seperate repository for the online >> documentation >> >> Wow. Ok, let's try this again. >> >> Josh, you made my point for me. Lower the barriers to entry >> >> Kenneth, I disagree. The C* community is not expressing universal >> interest in having a hand-crafted bespoke website, but many have >> expressed being open to using markdown to create content. Former: >> high barrier to entry *for the community as a whole, not just you.* >> Important distinction. >> >> If someone like me offers to knock out something that's been a >> problem for >> > the group and can do a very professional job, next time get out of >> > the >> way. >> >> It's been a problem because people haven't devoted time to it, not >> because they couldn't figure out markdown. The Apache Way is about >> consensus, not telling people to get out of the way when they don't >> agree with you. >> >> I don't make junk Josh. >> >> Actions speak louder than words and thus far your tone, >> combativeness, and repeated unwillingness to acknowledge what looks >> to be a strong general consensus from your peers is the evidence we have to >> go on. >> >> On Mon, Mar 19, 2018 at 10:21 AM, Kenneth Brotman < >> kenbrot...@yahoo.com.invalid> wrote: >> >> > Josh, you made my point for me. Lower the barriers to entry, not throw >> > extra steps I don’t need at me. If someone like me offers to knock >> > out something that's been a problem for the group and can do a very >> > professional job, next time get out of the way. I don't make junk Josh. >> > Sorry that Apache is not interested in a site with multi-media >> > support; or even sites with complete pages. Show me one quality >> > open source Apache site. Wake up. Raise your bar! Engineers >> > shouldn't speak in the language of mediocrity. >> > >> > Kenneth Brotman >> > >> > -Original Message- >> > From: Josh McKenzie [mailto:jmcken...@apache.org] >> > Sent: Monday, March 19, 2018 5:27 AM >> > To: dev@cassandra.apache.org >> > Subject: Re: A JIRA proposing a seperate repository for the online >> >
RE: A JIRA proposing a seperate repository for the online documentation
You missed the point. If someone can do a good job on something, just let them do it their way when it makes no difference. Specifically regarding the website, I should be able to submit HTML, CSS and JS if I want, others should be able to do it with the static generator thing if they want. I could have fixed the website with complete version specific pages, with some really good text to fill in where there is none now and so on; and done that several different ways by now. I'm donating my time too. Please keep that in mind. Let me ask again, where are the standards? How can you leave the website for the project you will be associated with looking that substandard? Rahul Singh has been making a very important point that different types of uses with different uses cases use the website. This is a public website; not an internal site for coders. Kenneth Brotman -Original Message- From: Josh McKenzie [mailto:jmcken...@apache.org] Sent: Monday, March 19, 2018 7:42 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation Wow. Ok, let's try this again. Josh, you made my point for me. Lower the barriers to entry Kenneth, I disagree. The C* community is not expressing universal interest in having a hand-crafted bespoke website, but many have expressed being open to using markdown to create content. Former: high barrier to entry *for the community as a whole, not just you.* Important distinction. If someone like me offers to knock out something that's been a problem for > the group and can do a very professional job, next time get out of the way. It's been a problem because people haven't devoted time to it, not because they couldn't figure out markdown. The Apache Way is about consensus, not telling people to get out of the way when they don't agree with you. I don't make junk Josh. Actions speak louder than words and thus far your tone, combativeness, and repeated unwillingness to acknowledge what looks to be a strong general consensus from your peers is the evidence we have to go on. On Mon, Mar 19, 2018 at 10:21 AM, Kenneth Brotman < kenbrot...@yahoo.com.invalid> wrote: > Josh, you made my point for me. Lower the barriers to entry, not throw > extra steps I don’t need at me. If someone like me offers to knock > out something that's been a problem for the group and can do a very > professional job, next time get out of the way. I don't make junk Josh. > Sorry that Apache is not interested in a site with multi-media > support; or even sites with complete pages. Show me one quality open > source Apache site. Wake up. Raise your bar! Engineers shouldn't > speak in the language of mediocrity. > > Kenneth Brotman > > -Original Message- > From: Josh McKenzie [mailto:jmcken...@apache.org] > Sent: Monday, March 19, 2018 5:27 AM > To: dev@cassandra.apache.org > Subject: Re: A JIRA proposing a seperate repository for the online > documentation > > > > > I've been writing html a long time; since about 1990. You're asking > > me to learn a weird little program, a static site generator just to > > change something I can already do without using a program at all. > > You're one person among a community of back-end Java devs. If you want > people to contribute to things you need to lower the barriers to > entry, not deep-dive into some bespoke thing that may never see the > light of day and, when completed, misses the mark on what users of cassandra > care about: > clarity and speed. Also: bus-factor 1 is bad. > > Another weird thing: Wouldn't we want a website that is dynamic and > > multi-media rich? > > Personally? No. We're talking function here, not form. As an engineer, > I don't want to wade through someone else's idea of what "dynamic and > multi-media rich" means when I'm trying to find an answer to something > or learn something so I can get on with my job. > > > > On Sun, Mar 18, 2018 at 5:12 PM, Nate McCall <zznat...@gmail.com> wrote: > > > On Mon, Mar 19, 2018 at 12:41 AM, Kenneth Brotman > > <kenbrot...@yahoo.com.invalid> wrote: > > > I don't know what we are doing for the website technologies right > > > now > > because like everything else what we do is not documented anywhere. > > Where are the servers: the cloud? What server software are we > > running? How is the html, etc. generated and published? How is > > search > done for the website? > > > > http://cassandra.apache.org/doc/latest/development/documentation.htm > > l The resulting static HTML from Sphinx is hosted on ASF > > infrastructure. > > > > > > - To unsubscribe, e-ma
RE: Website front page broken links
Split the ticket up. Don't leave the home page broken for weeks! Move the third party discussion to another JIRA. Get the home page fixed ASAP! Kenneth Brotman -Original Message- From: Hannu Kröger [mailto:hkro...@gmail.com] Sent: Monday, March 19, 2018 12:43 AM To: dev@cassandra.apache.org Subject: Re: Website front page broken links ah, of course. Sorry for the duplicate. Hannu > On 19 Mar 2018, at 09:36, kurt greaves <k...@instaclustr.com> wrote: > > A ticket already exists that covers this, with a patch. just needs > some consensus around the third party page. > https://issues.apache.org/jira/browse/CASSANDRA-14128 > > On 19 Mar. 2018 18:26, "Hannu Kröger" <hkro...@gmail.com> wrote: > >> Hello, >> >> I created this ticket >> https://issues.apache.org/jira/browse/CASSANDRA-14324 >> >> Basically the website front page contains manybroken links to >> planetcassandra.org >> >> Hannu >> - >> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org >> For additional commands, e-mail: dev-h...@cassandra.apache.org >> >> - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: A JIRA proposing a seperate repository for the online documentation
Josh, you made my point for me. Lower the barriers to entry, not throw extra steps I don’t need at me. If someone like me offers to knock out something that's been a problem for the group and can do a very professional job, next time get out of the way. I don't make junk Josh. Sorry that Apache is not interested in a site with multi-media support; or even sites with complete pages. Show me one quality open source Apache site. Wake up. Raise your bar! Engineers shouldn't speak in the language of mediocrity. Kenneth Brotman -Original Message- From: Josh McKenzie [mailto:jmcken...@apache.org] Sent: Monday, March 19, 2018 5:27 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation > > I've been writing html a long time; since about 1990. You're asking > me to learn a weird little program, a static site generator just to > change something I can already do without using a program at all. You're one person among a community of back-end Java devs. If you want people to contribute to things you need to lower the barriers to entry, not deep-dive into some bespoke thing that may never see the light of day and, when completed, misses the mark on what users of cassandra care about: clarity and speed. Also: bus-factor 1 is bad. Another weird thing: Wouldn't we want a website that is dynamic and > multi-media rich? Personally? No. We're talking function here, not form. As an engineer, I don't want to wade through someone else's idea of what "dynamic and multi-media rich" means when I'm trying to find an answer to something or learn something so I can get on with my job. On Sun, Mar 18, 2018 at 5:12 PM, Nate McCall <zznat...@gmail.com> wrote: > On Mon, Mar 19, 2018 at 12:41 AM, Kenneth Brotman > <kenbrot...@yahoo.com.invalid> wrote: > > I don't know what we are doing for the website technologies right > > now > because like everything else what we do is not documented anywhere. > Where are the servers: the cloud? What server software are we > running? How is the html, etc. generated and published? How is search done > for the website? > > http://cassandra.apache.org/doc/latest/development/documentation.html > The resulting static HTML from Sphinx is hosted on ASF infrastructure. > > - > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org > For additional commands, e-mail: dev-h...@cassandra.apache.org > > - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: A JIRA proposing a seperate repository for the online documentation
The answer is right before us: Let's use Rahul Singh's Awesome Cassandra list for now: https://github.com/anant/awesome-cassandra while he works on some other things for us. I will be submitting contributions to it regularly and referencing it as the go to resource. It will serve us well for now. Kenneth Brotman -Original Message- From: Kenneth Brotman [mailto:kenbrot...@yahoo.com.INVALID] Sent: Sunday, March 18, 2018 4:41 AM To: dev@cassandra.apache.org Subject: RE: A JIRA proposing a seperate repository for the online documentation I don't know what we are doing for the website technologies right now because like everything else what we do is not documented anywhere. Where are the servers: the cloud? What server software are we running? How is the html, etc. generated and published? How is search done for the website? Kenneth Brotman -Original Message- From: Rahul Singh [mailto:rahul.xavier.si...@gmail.com] Sent: Saturday, March 17, 2018 7:12 AM To: dev@cassandra.apache.org Subject: RE: A JIRA proposing a seperate repository for the online documentation Static site generator just takes content from flat files or apis (that can be managed from a headless cms) and creates static files or progressive web apps that are optimized for speed. Nothing to do with multi-media or dynamic in terms of client side javascript / css. It’s just an old technology with a new name. Thats how we used to generate sites back in 1990s.. :) -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 17, 2018, 10:03 AM -0400, Kenneth Brotman <kenbrot...@yahoo.com.invalid>, wrote: > How about if we look at the website a little differently. Isn't it an > opportunity to showcase Cassandra and related technologies like search! > Shouldn't we be an extraordinary advocate and example of the technology > ourselves? > > Rahul, your comment on the different users with different use cases was very > wise. > > I've been writing html a long time; since about 1990. You're asking me to > learn a weird little program, a static site generator just to change > something I can already do without using a program at all. > > Another weird thing: Wouldn't we want a website that is dynamic and > multi-media rich? > > Kenneth Brotman > > > -Original Message- > From: Rahul Singh [mailto:rahul.xavier.si...@gmail.com] > Sent: Saturday, March 17, 2018 5:57 AM > To: dev@cassandra.apache.org > Subject: RE: A JIRA proposing a seperate repository for the online > documentation > > I’ve previously deep dived into Static Site generators and there are numerous > ones. > > http://leaves.anant.us/#!/leaf/7255?tag=static.site > > I don’t like changing technology for the sake of change. I think it’s a > stupid waste of time. In one hand I agree, the substance is more important > than the form. On the other hand. I [insert f-bomb] hate writing HTML / CSS, > or restructured text. Markdown is much easier. Hugo is one of many that if > setup right, it can save a ton of time and make it more accessible for people > to contribute. > > There is a difference however in developer documentation for developers of > cassandra, user documentation for cassandra users, documentation for and > administrators. They are different users and have different use cases Some > need reference style docs, others need guides. > > Some good examples, (the software quality not-withstanding), correlate with > software propularity are Wordpress. I am not wild about Wordpress, but their > codex.wordpress.org has been generally a good “user doc.” > > Envision the outcome even if you have to mimic someone else. I don’t mind > stealing/copying if it gets us one step further than we are. The reaper docs > look easy to maintain and I could care less about Hugo, Hexo, Jekyll, Hyde, > KafkasMom, EinsteinsDog, ShrodingersCat static site generator. > > I think action should come before decision in open source. Prove something > before suggesting a change. Jon’s reaper example is good. If anyone has > something better, show it. Prove it. > > -- > Rahul Singh > rahul.si...@anant.us > > Anant Corporation > > On Mar 16, 2018, 6:54 PM -0400, Kenneth Brotman > <kenbrotman@yahoo.cominvalid>, wrote: > > There is no need for another program. Keep the code in html, css and js > > People can modify that and show proposed changes in that. No need to > > convert back and forth from other formats. If someone is doing something > > more involved, they probably already have a program themselves. > > > > -Original Message- > > From: beggles...@apple.com [mailto:beggles...@apple.com] > > Sent: Friday, March 16, 2018 3:16 PM > > To: dev@cassandra.apache.org > > Subject: Re: A J
RE: A JIRA proposing a seperate repository for the online documentation
I don't know what we are doing for the website technologies right now because like everything else what we do is not documented anywhere. Where are the servers: the cloud? What server software are we running? How is the html, etc. generated and published? How is search done for the website? Kenneth Brotman -Original Message- From: Rahul Singh [mailto:rahul.xavier.si...@gmail.com] Sent: Saturday, March 17, 2018 7:12 AM To: dev@cassandra.apache.org Subject: RE: A JIRA proposing a seperate repository for the online documentation Static site generator just takes content from flat files or apis (that can be managed from a headless cms) and creates static files or progressive web apps that are optimized for speed. Nothing to do with multi-media or dynamic in terms of client side javascript / css. It’s just an old technology with a new name. Thats how we used to generate sites back in 1990s.. :) -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 17, 2018, 10:03 AM -0400, Kenneth Brotman <kenbrot...@yahoo.com.invalid>, wrote: > How about if we look at the website a little differently. Isn't it an > opportunity to showcase Cassandra and related technologies like search! > Shouldn't we be an extraordinary advocate and example of the technology > ourselves? > > Rahul, your comment on the different users with different use cases was very > wise. > > I've been writing html a long time; since about 1990. You're asking me to > learn a weird little program, a static site generator just to change > something I can already do without using a program at all. > > Another weird thing: Wouldn't we want a website that is dynamic and > multi-media rich? > > Kenneth Brotman > > > -Original Message- > From: Rahul Singh [mailto:rahul.xavier.si...@gmail.com] > Sent: Saturday, March 17, 2018 5:57 AM > To: dev@cassandra.apache.org > Subject: RE: A JIRA proposing a seperate repository for the online > documentation > > I’ve previously deep dived into Static Site generators and there are numerous > ones. > > http://leaves.anant.us/#!/leaf/7255?tag=static.site > > I don’t like changing technology for the sake of change. I think it’s a > stupid waste of time. In one hand I agree, the substance is more important > than the form. On the other hand. I [insert f-bomb] hate writing HTML / CSS, > or restructured text. Markdown is much easier. Hugo is one of many that if > setup right, it can save a ton of time and make it more accessible for people > to contribute. > > There is a difference however in developer documentation for developers of > cassandra, user documentation for cassandra users, documentation for and > administrators. They are different users and have different use cases Some > need reference style docs, others need guides. > > Some good examples, (the software quality not-withstanding), correlate with > software propularity are Wordpress. I am not wild about Wordpress, but their > codex.wordpress.org has been generally a good “user doc.” > > Envision the outcome even if you have to mimic someone else. I don’t mind > stealing/copying if it gets us one step further than we are. The reaper docs > look easy to maintain and I could care less about Hugo, Hexo, Jekyll, Hyde, > KafkasMom, EinsteinsDog, ShrodingersCat static site generator. > > I think action should come before decision in open source. Prove something > before suggesting a change. Jon’s reaper example is good. If anyone has > something better, show it. Prove it. > > -- > Rahul Singh > rahul.si...@anant.us > > Anant Corporation > > On Mar 16, 2018, 6:54 PM -0400, Kenneth Brotman > <kenbrotman@yahoo.cominvalid>, wrote: > > There is no need for another program. Keep the code in html, css and js > > People can modify that and show proposed changes in that. No need to > > convert back and forth from other formats. If someone is doing something > > more involved, they probably already have a program themselves. > > > > -Original Message- > > From: beggles...@apple.com [mailto:beggles...@apple.com] > > Sent: Friday, March 16, 2018 3:16 PM > > To: dev@cassandra.apache.org > > Subject: Re: A JIRA proposing a seperate repository for the online > > documentation > > > > It would probably be more productive to list some specific concerns you > > have with Hugo. Then explain why you think they make using it a bad idea > > Then offer some alternatives. > > > > On 3/16/18, 1:18 PM, "Kenneth Brotman" <kenbrot...@yahoo.com.INVALID> wrote: > > > > Thanks for that Eric Evans. > > > > I'm not sure Hugo is the way to go. I don't see how I would generate the > &
RE: A JIRA proposing a seperate repository for the online documentation
How about if we look at the website a little differently. Isn't it an opportunity to showcase Cassandra and related technologies like search! Shouldn't we be an extraordinary advocate and example of the technology ourselves? Rahul, your comment on the different users with different use cases was very wise. I've been writing html a long time; since about 1990. You're asking me to learn a weird little program, a static site generator just to change something I can already do without using a program at all. Another weird thing: Wouldn't we want a website that is dynamic and multi-media rich? Kenneth Brotman -Original Message- From: Rahul Singh [mailto:rahul.xavier.si...@gmail.com] Sent: Saturday, March 17, 2018 5:57 AM To: dev@cassandra.apache.org Subject: RE: A JIRA proposing a seperate repository for the online documentation I’ve previously deep dived into Static Site generators and there are numerous ones. http://leaves.anant.us/#!/leaf/7255?tag=static.site I don’t like changing technology for the sake of change. I think it’s a stupid waste of time. In one hand I agree, the substance is more important than the form. On the other hand. I [insert f-bomb] hate writing HTML / CSS, or restructured text. Markdown is much easier. Hugo is one of many that if setup right, it can save a ton of time and make it more accessible for people to contribute. There is a difference however in developer documentation for developers of cassandra, user documentation for cassandra users, documentation for and administrators. They are different users and have different use cases. Some need reference style docs, others need guides. Some good examples, (the software quality not-withstanding), correlate with software propularity are Wordpress. I am not wild about Wordpress, but their codex.wordpress.org has been generally a good “user doc.” Envision the outcome even if you have to mimic someone else. I don’t mind stealing/copying if it gets us one step further than we are. The reaper docs look easy to maintain and I could care less about Hugo, Hexo, Jekyll, Hyde, KafkasMom, EinsteinsDog, ShrodingersCat static site generator. I think action should come before decision in open source. Prove something before suggesting a change. Jon’s reaper example is good. If anyone has something better, show it. Prove it. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 16, 2018, 6:54 PM -0400, Kenneth Brotman <kenbrot...@yahoo.com.invalid>, wrote: > There is no need for another program. Keep the code in html, css and js > People can modify that and show proposed changes in that. No need to convert > back and forth from other formats. If someone is doing something more > involved, they probably already have a program themselves. > > -Original Message- > From: beggles...@apple.com [mailto:beggles...@apple.com] > Sent: Friday, March 16, 2018 3:16 PM > To: dev@cassandra.apache.org > Subject: Re: A JIRA proposing a seperate repository for the online > documentation > > It would probably be more productive to list some specific concerns you have > with Hugo. Then explain why you think they make using it a bad idea Then > offer some alternatives. > > On 3/16/18, 1:18 PM, "Kenneth Brotman" <kenbrot...@yahoo.com.INVALID> wrote: > > Thanks for that Eric Evans. > > I'm not sure Hugo is the way to go. I don't see how I would generate the > quality of work I would want with it. It seems like another example of coders > learning and using a more complicated program to generate the code they could > have already generated - it’s a disease in the I.T. industry right now. But I > could be wrong. > > Here's the thing. I've been spending a lot of my time for the past three > weeks now trying to help with the website. That is a tiny website. I've never > worked with a website that tiny. Bear with me. > > I'm studying Jeff Carpenter and Eben Hewitt's book: Cassandra The Definitive > Guide > https://www.amazon.com/Cassandra-Definitive-Guide-Distributed-Scale/dp/1491933666/ref=sr_1_1?ie=UTF8=1521230539=8-1=cassandra+the+definitive+guide > and have already have a terrible itch to start contributing some code. I > just want to get set up to do that. The book seems to be a good way to get > familiar with the internals and the code of Cassandra. > > I can only do so much for the group at one time just like anyone else. I'll > only do top quality work. I'll only be a part of top quality work. It could > be that I won't feel comfortable with what the group wants to do for the > website. > > Please keep working on it as it is really embarrassing, terrible, substandard > unacceptable beneath professional standards... > > I will contribute if it's possible for me to do so. Let's see what we decide > to do going forward f
RE: A JIRA proposing a seperate repository for the online documentation
Thanks for that Eric Evans. I'm not sure Hugo is the way to go. I don't see how I would generate the quality of work I would want with it. It seems like another example of coders learning and using a more complicated program to generate the code they could have already generated - it’s a disease in the I.T. industry right now. But I could be wrong. Here's the thing. I've been spending a lot of my time for the past three weeks now trying to help with the website. That is a tiny website. I've never worked with a website that tiny. Bear with me. I'm studying Jeff Carpenter and Eben Hewitt's book: Cassandra The Definitive Guide https://www.amazon.com/Cassandra-Definitive-Guide-Distributed-Scale/dp/1491933666/ref=sr_1_1?ie=UTF8=1521230539=8-1=cassandra+the+definitive+guide and have already have a terrible itch to start contributing some code. I just want to get set up to do that. The book seems to be a good way to get familiar with the internals and the code of Cassandra. I can only do so much for the group at one time just like anyone else. I'll only do top quality work. I'll only be a part of top quality work. It could be that I won't feel comfortable with what the group wants to do for the website. Please keep working on it as it is really embarrassing, terrible, substandard unacceptable beneath professional standards... I will contribute if it's possible for me to do so. Let's see what we decide to do going forward for the website. Kenneth Brotman (Cassandra coder?) -Original Message- From: Eric Evans [mailto:john.eric.ev...@gmail.com] Sent: Friday, March 16, 2018 7:59 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation On Thu, Mar 15, 2018 at 11:40 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: > Well pickle my cucumbers Jon! It's good to know that you have experience > with Hugo, see it as a good fit and that all has been well. I look forward > to the jira epic! > > How exactly does the group make such a decision: Call for final discussion? > Call for vote? Wait for the PMC to vote? Good question! Decisions like this are made by consensus; As the person who is attempting to do something, you discuss your ideas with the group, listen to the feedback of others, and develop consensus around a direction. This is more difficult than demanding your way, or having someone(s) in a position of absolute power tell you what you can and cannot do, but the result is better. > -Original Message- > From: Jon Haddad [mailto:jonathan.had...@gmail.com] On Behalf Of Jon > Haddad > Sent: Thursday, March 15, 2018 9:24 AM > To: dev@cassandra.apache.org > Subject: Re: A JIRA proposing a seperate repository for the online > documentation > > Murukesh is correct on a very useable, pretty standard process of > multi-versioned docs. > > I’ll put my thoughts in a JIRA epic tonight. I’ll be a multi-phase process. > Also correct in that I’d like us to move to Hugo for the site, I’d like us to > have a unified system between the site & the docs, and hugo has been > excellent. We run the reaper site & docs off hugo, it works well. We just > don’t do multi-versions (because we don’t support multiple): > https://github.com/thelastpickle/cassandra-reaper/tree/master/src/docs > <https://github.com/thelastpickle/cassandra-reaper/tree/master/src/docs>. > > Jon > >> On Mar 15, 2018, at 8:57 AM, Murukesh Mohanan <murukesh.moha...@gmail.com> >> wrote: >> >> On Fri, Mar 16, 2018 at 0:19 Kenneth Brotman >> <kenbrot...@yahoo.com.invalid <mailto:kenbrot...@yahoo.com.invalid>> >> wrote: >> >>> Help me out here. I could have had a website with support for more >>> than one version done several different ways by now. >>> >>> A website with several versions of documentation is going to have >>> sub-directories for each version of documentation obviously. I've >>> offered to create those sub-directories under the "doc" folder of >>> the current repository; and I've offered to move the online >>> documentation to a separate repository and have the sub-directories >>> there. Both were shot down. Is there a third way? If so please just >>> spill the beans. >>> >> >> There is. Note that the website is an independent repository. So to >> host docs for multiple versions, only the website's repository (or >> rather, the final built contents) needs multiple directories. You can >> just checkout each branch or tag, generate the docs, make a directory >> for that branch or tag in the website repo, and copy the generated >> docs there with appropriate modifications. >>
RE: NVIDIA TESLA: The World's Most Advance Data Center GPU's for accelerating demanding HPC workloads
If anyone sees interesting uses of this related to the development of C*, please email me privately as the very idea of such an inquiry seems to be bothering some folks. Kenneth Brotman -Original Message- From: Josh McKenzie [mailto:jmcken...@apache.org] Sent: Friday, March 16, 2018 5:30 AM To: dev@cassandra.apache.org Subject: Re: NVIDIA TESLA: The World's Most Advance Data Center GPU's for accelerating demanding HPC workloads > > It's off topic if there aren't any C* uses but I wasn't (still not > sure) there aren't. Maybe you're missing out. This dev list is for the development of C*. The type of discussion on this email thread should be kept to user@. On Thu, Mar 15, 2018 at 1:32 PM, Kenneth Brotman < kenbrot...@yahoo.com.invalid> wrote: > It's off topic if there aren't any C* uses but I wasn't (still not > sure) there aren't. Maybe you're missing out. > > -Original Message- > From: Jason Brown [mailto:jasedbr...@gmail.com] > Sent: Thursday, March 15, 2018 10:26 AM > To: dev@cassandra.apache.org > Subject: Re: NVIDIA TESLA: The World's Most Advance Data Center GPU's > for accelerating demanding HPC workloads > > All, > > This is completely OFF-TOPIC for this mailing list. Please stop. > > -Jason > > On Thu, Mar 15, 2018 at 10:09 AM, daemeon reiydelle > <daeme...@gmail.com> > wrote: > > > Not so sure they are C* relevant, I build (100's of GPU enabled > > node) HPC's that use them for ML, AI, Graph analytics, etc. with the > > sources in C* or more typically Hadoop/EMR data. > > > > > > <==> > > "Who do you think made the first stone spear? The Asperger guy. > > If you get rid of the autism genetics, there would be no Silicon Valley" > > Temple Grandin > > > > > > *Daemeon C.M. ReiydelleSan Francisco 1.415.501.0198London 44 020 > > 8144 > > 9872* > > > > > > On Thu, Mar 15, 2018 at 8:40 AM, Kenneth Brotman < > > kenbrot...@yahoo.com.invalid> wrote: > > > > > I see things like this > > > https://www.nvidia.com/en-us/data-center/tesla/#section3 as > > > something I might be using in things I help build. Does anyone > > > have any experience with them? > > > > > > > > > > > > Kenneth Brotman > > > > > > > > > > > - > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org > For additional commands, e-mail: dev-h...@cassandra.apache.org > > - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: NVIDIA TESLA: The World's Most Advance Data Center GPU's for accelerating demanding HPC workloads
It's off topic if there aren't any C* uses but I wasn't (still not sure) there aren't. Maybe you're missing out. -Original Message- From: Jason Brown [mailto:jasedbr...@gmail.com] Sent: Thursday, March 15, 2018 10:26 AM To: dev@cassandra.apache.org Subject: Re: NVIDIA TESLA: The World's Most Advance Data Center GPU's for accelerating demanding HPC workloads All, This is completely OFF-TOPIC for this mailing list. Please stop. -Jason On Thu, Mar 15, 2018 at 10:09 AM, daemeon reiydelle <daeme...@gmail.com> wrote: > Not so sure they are C* relevant, I build (100's of GPU enabled node) > HPC's that use them for ML, AI, Graph analytics, etc. with the sources > in C* or more typically Hadoop/EMR data. > > > <==> > "Who do you think made the first stone spear? The Asperger guy. > If you get rid of the autism genetics, there would be no Silicon Valley" > Temple Grandin > > > *Daemeon C.M. ReiydelleSan Francisco 1.415.501.0198London 44 020 8144 > 9872* > > > On Thu, Mar 15, 2018 at 8:40 AM, Kenneth Brotman < > kenbrot...@yahoo.com.invalid> wrote: > > > I see things like this > > https://www.nvidia.com/en-us/data-center/tesla/#section3 as > > something I might be using in things I help build. Does anyone have > > any experience with them? > > > > > > > > Kenneth Brotman > > > > > - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: A JIRA proposing a seperate repository for the online documentation
Well pickle my cucumbers Jon! It's good to know that you have experience with Hugo, see it as a good fit and that all has been well. I look forward to the jira epic! How exactly does the group make such a decision: Call for final discussion? Call for vote? Wait for the PMC to vote? Kenneth Brotman -Original Message- From: Jon Haddad [mailto:jonathan.had...@gmail.com] On Behalf Of Jon Haddad Sent: Thursday, March 15, 2018 9:24 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation Murukesh is correct on a very useable, pretty standard process of multi-versioned docs. I’ll put my thoughts in a JIRA epic tonight. I’ll be a multi-phase process. Also correct in that I’d like us to move to Hugo for the site, I’d like us to have a unified system between the site & the docs, and hugo has been excellent. We run the reaper site & docs off hugo, it works well. We just don’t do multi-versions (because we don’t support multiple): https://github.com/thelastpickle/cassandra-reaper/tree/master/src/docs <https://github.com/thelastpickle/cassandra-reaper/tree/master/src/docs>. Jon > On Mar 15, 2018, at 8:57 AM, Murukesh Mohanan <murukesh.moha...@gmail.com> > wrote: > > On Fri, Mar 16, 2018 at 0:19 Kenneth Brotman > <kenbrot...@yahoo.com.invalid <mailto:kenbrot...@yahoo.com.invalid>> > wrote: > >> Help me out here. I could have had a website with support for more >> than one version done several different ways by now. >> >> A website with several versions of documentation is going to have >> sub-directories for each version of documentation obviously. I've >> offered to create those sub-directories under the "doc" folder of the >> current repository; and I've offered to move the online documentation >> to a separate repository and have the sub-directories there. Both >> were shot down. Is there a third way? If so please just spill the beans. >> > > There is. Note that the website is an independent repository. So to > host docs for multiple versions, only the website's repository (or > rather, the final built contents) needs multiple directories. You can > just checkout each branch or tag, generate the docs, make a directory > for that branch or tag in the website repo, and copy the generated > docs there with appropriate modifications. > > I do this on a smaller scale using GitHub Pages (repo: > https://github.com/murukeshm/cassandra > <https://github.com/murukeshm/cassandra> site: > https://murukeshm.github.io/cassandra/ > <https://murukeshm.github.io/cassandra/> > <https://murukeshm.github.io/cassandra/3.11.1/ > <https://murukeshm.github.io/cassandra/3.11.1/>> ). The method is a > bit hacky as I noted in CASSANDRA-13907. A daily cronjobs updated the repo if > docs are updated. 3.9+ versions are available. > > > > >> Also, no offense to anyone at Sphinx but for a project our size it's >> not suitable. We need to move off it now. It's a problem. >> >> Can we go past this and on to the documenting! Please help resolve this. >> >> How are we going to: >> Make the submission of code changes include required changes to >> documentation including the online documentation? >> Allow, encourage the online documentation to publish multiple >> versions of documentation concurrently including all officially supported >> versions? > > > Only on this point: we'll need to start by improving the website build > process. Michael's comment on 13907 ( > https://issues.apache.org/jira/browse/CASSANDRA-13907?focusedCommentId > =16211365=com.atlassian.jira.plugin.system.issuetabpanels:comment > -tabpanel#comment-16211365 > <https://issues.apache.org/jira/browse/CASSANDRA-13907?focusedCommentI > d=16211365=com.atlassian.jira.plugin.system.issuetabpanels:commen > t-tabpanel#comment-16211365> > ) > shows it's a painful, fiddly process. That seems to be the main > blocker. I think Jon has shown interest in moving to Hugo from the > current Jekyll setup. > > > >> Move our project onto a more suitable program than Sphinx for our needs? >> >> Kenneth Brotman >> >> -Original Message- >> From: Eric Evans [mailto:john.eric.ev...@gmail.com] >> Sent: Thursday, March 15, 2018 7:50 AM >> To: dev@cassandra.apache.org >> Subject: Re: A JIRA proposing a seperate repository for the online >> documentation >> >> On Thu, Mar 15, 2018 at 4:58 AM, Rahul Singh >> <rahul.xavier.si...@gmail.com> >> wrote: >>> >>> I don’t understand why it’s so complicated. In tree docs
NVIDIA TESLA: The World's Most Advance Data Center GPU's for accelerating demanding HPC workloads
I see things like this https://www.nvidia.com/en-us/data-center/tesla/#section3 as something I might be using in things I help build. Does anyone have any experience with them? Kenneth Brotman
RE: A JIRA proposing a seperate repository for the online documentation
Help me out here. I could have had a website with support for more than one version done several different ways by now. A website with several versions of documentation is going to have sub-directories for each version of documentation obviously. I've offered to create those sub-directories under the "doc" folder of the current repository; and I've offered to move the online documentation to a separate repository and have the sub-directories there. Both were shot down. Is there a third way? If so please just spill the beans. Also, no offense to anyone at Sphinx but for a project our size it's not suitable. We need to move off it now. It's a problem. Can we go past this and on to the documenting! Please help resolve this. How are we going to: Make the submission of code changes include required changes to documentation including the online documentation? Allow, encourage the online documentation to publish multiple versions of documentation concurrently including all officially supported versions? Move our project onto a more suitable program than Sphinx for our needs? Kenneth Brotman -Original Message- From: Eric Evans [mailto:john.eric.ev...@gmail.com] Sent: Thursday, March 15, 2018 7:50 AM To: dev@cassandra.apache.org Subject: Re: A JIRA proposing a seperate repository for the online documentation On Thu, Mar 15, 2018 at 4:58 AM, Rahul Singh <rahul.xavier.si...@gmail.com> wrote: > > I don’t understand why it’s so complicated. In tree docs are as good as any. > All the old docs are there in the version control system. > > All we need to is a) generate docs for old versions b) improve user > experience on the site by having it clearly laid out what is latest vs. old > docs and c) have some semblance of a search maybe using something like > Algolia or whatever. This. Keeping the docs in-tree is a huge win, because they can move in lock-step with changes occurring in that branch/version. I don't think we've been enforcing this, but code-changes that alter documented behavior should be accompanied by corresponding changes to the documentation, or be rejected. Code-changes that correspond with undocumented behavior are an opportunity to include some docs (not grounds to reject a code-review IMO, but certainly an opportunity to politely ask/suggest). Publishing more than one version (as generated from the respective branches/tags), is a solvable problem. > > On Thu, Mar 15, 2018 at 1:22 Kenneth Brotman > > <kenbrot...@yahoo.com.invalid > > wrote: > > > > > https://issues.apache.org/jira/browse/CASSANDRA-14313 > > > > > > > > > > > > For some reason I'm told by many committers that we should not > > > have sets of documentation for other versions than the current > > > version in a tree for that version. This has made it difficult, > > > maybe impossible to have documentation for all the supported > > > versions on the website at one time. > > > > > > > > > > > > As a solution I propose that we maintain the online documentation > > > in a separate repository that is managed as the current repository > > > under the guidance of the Apache Cassandra PMC (Project Management > > > Committee); and that in the new repository . . . > > > > > > > > > > > > Please see the jira. I hope it's a good answer to everyone. -- Eric Evans john.eric.ev...@gmail.com - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
A JIRA proposing a seperate repository for the online documentation
https://issues.apache.org/jira/browse/CASSANDRA-14313 For some reason I'm told by many committers that we should not have sets of documentation for other versions than the current version in a tree for that version. This has made it difficult, maybe impossible to have documentation for all the supported versions on the website at one time. As a solution I propose that we maintain the online documentation in a separate repository that is managed as the current repository under the guidance of the Apache Cassandra PMC (Project Management Committee); and that in the new repository . . . Please see the jira. I hope it's a good answer to everyone. KennethBrotman
RE: Pull request question
Thanks. Something is not working right or I have brain fog. I tried twice. Got unwanted results each time so I closed the pull requests with uncommitted merges. I hope that is okay. Here is the JIRA: https://issues.apache.org/jira/browse/CASSANDRA-14312 I must need to do something to sync my repository back up with the Apache Cassandra mirror site. The pull requests were trying to bring over a bunch of other files. Thanks for the reply. Kenneth Brotman -Original Message- From: Dinesh Joshi [mailto:dinesh.jo...@yahoo.com.INVALID] Sent: Tuesday, March 13, 2018 11:13 PM To: dev@cassandra.apache.org Subject: Re: Pull request question Hi Kenneth, Normally you would open a Jira ticket and put the details in there including a link to your branch, pull request or patch. Is this pull request associated with a jira? If so, could you please point us to it? And yes, normally you would send pull requests against trunk unless they're for fixes to particular versions of Cassandra. Dinesh On Tuesday, March 13, 2018, 10:48:22 PM PDT, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> wrote: I made some sub directories and files so we could start to work on having separate versions of documentation. I did the pull request to the truck. Was that right? Kenneth Brotman - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
Pull request question
I made some sub directories and files so we could start to work on having separate versions of documentation. I did the pull request to the truck. Was that right? Kenneth Brotman
RE: Mentor
That one is JIRA 14264: https://issues.apache.org/jira/browse/CASSANDRA-14264 I won't be writing that one anytime soon but if it existed I would be reading it. If the guru's would throw some text in the JIRA as they have it handy, myself or someone else will take it from there. Kenneth Brotman -Original Message- From: Darshan Sreenivasamurthy [mailto:darshants.darsh...@gmail.com] Sent: Monday, February 26, 2018 11:33 AM To: dev@cassandra.apache.org Subject: Re: Mentor + 1 > On Feb 26, 2018, at 11:24 AM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> wrote: > > how about a nice Quick Tour Document for dev's that want to get oriented on the code efficiently. - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: Mentor
Yeah, how about a nice Quick Tour Document for dev's that want to get oriented on the code efficiently. I would like that too. Kenneth Brotman -Original Message- From: Sumant Sahney [mailto:sahneysum...@gmail.com] Sent: Monday, February 26, 2018 11:12 AM To: dev@cassandra.apache.org Subject: Mentor Hi, I am new to Open Source. I am really confused on how to get started with open source. I would really appreciate if someone can help me through the process and help me get started. Can someone please share the code documentation with me as well. Which I can refer to get an idea about all the modules. Thanks, Sumant - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
Alignment and coodination between DataStax's online documentation and Apache Cassandra online documentation
To the amazing people of DataStax, The DataStax website is a little unwieldy as it tries to support open source Cassandra and DataStax's version. Meanwhile some of that information would help fill in the Apache Cassandra web site. If you would like to work on this, I'm up for it. Kenneth Brotman
Filling in the blank To Do sections on the Apache Cassandra web site
These nine web pages on the Apache Cassandra web site have blank To Do sections. Most of the web pages are completely blank. Mind you there is a lot of hard work already done on the documentation. I'll make JIRA's for any of the blank sections where there is not already a JIRA. Then it will be on to writing up those sections. If you have any text to help me get started for any of these sections that would be really cool. http://cassandra.apache.org/doc/latest/architecture/overview.html http://cassandra.apache.org/doc/latest/architecture/dynamo.html http://cassandra.apache.org/doc/latest/architecture/guarantees.html http://cassandra.apache.org/doc/latest/data_modeling/index.html http://cassandra.apache.org/doc/latest/operating/read_repair.html http://cassandra.apache.org/doc/latest/operating/hints.html http://cassandra.apache.org/doc/latest/operating/backups.html http://cassandra.apache.org/doc/latest/operating/bulk_loading.html http://cassandra.apache.org/doc/latest/troubleshooting/index.html Kenneth Brotman
RE: Cassandra Needs to Grow Up by Version Five!
A sincere thank you for everyone that replied. I will heavy lift the docs for a while, do my Slender Cassandra reference project and then I’ll try to find one or two areas where I can contribute code to get going on that. I'll have a few JIRA's started by the end of the workday. Kenneth Brotman - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
RE: Cassandra Needs to Grow Up by Version Five!
Jeff, I already addressed everything you said. Boy! Would I like to bring up the out of date articles on the web that trip people up and the lousy documentation on the Apache website but I can’t because a lot of folks don’t know me or why I’m saying these things. I will be making another post that I hope clarifies what’s going on with me. After that I will either be a freakishly valuable asset to this community or I will be a freakishly valuable asset to another community. You sure have a funny way of reigning in people that are used to helping out. You sure misjudged me. Wow. Kenneth Brotman From: Jeff Jirsa [mailto:jji...@gmail.com] Sent: Wednesday, February 21, 2018 3:12 PM To: cassandra Cc: Cassandra DEV Subject: Re: Cassandra Needs to Grow Up by Version Five! On Wed, Feb 21, 2018 at 2:53 PM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: Hi Akash, I get the part about outside work which is why in replying to Jeff Jirsa I was suggesting the big companies could justify taking it on easy enough and you know actually pay the people who would be working at it so those people could have a life. The part I don't get is the aversion to usability. Isn't that what you think about when you are coding? "Am I making this thing I'm building easy to use?" If you were programming for me, we would be constantly talking about what we are building and how we can make things easier for users. If I had to fight with a developer, architect or engineer about usability all the time, they would be gone and quick. How do approach programming if you aren't trying to make things easy. There's no aversion to usability, you're assuming things that just aren't true Nobody's against usability, we've just prioritized other things HIGHER. We make those decisions in part by looking at open JIRAs and determining what's asked for the most, what members of the community have contributed, and then balance that against what we ourselves care about. You're making a statement that it should be the top priority for the next release, with no JIRA, and history of contributing (and indeed, no real clear sign that you even understand the full extent of the database), no sign that you're willing to do the work yourself, and making a ton of assumptions about the level of effort and ROI. I would love for Cassandra to be easier to use, I'm sure everyone does. There's a dozen features I'd love to add if I had infinite budget and infinite manpower. But what you're asking for is A LOT of effort and / or A LOT of money, and you're assuming someone's going to step up and foot the bill, but there's no real reason to believe that's the case. In the mean time, everyone's spending hours replying to this thread that is 0% actionable. We would all have been objectively better off had everyone ignored this thread and just spent 10 minutes writing some section of the docs. So the next time I get the urge to reply, I'm just going to do that instead.
RE: Cassandra Needs to Grow Up by Version Five!
Jon, Very sorry that you don't see the value of the time I'm taking for this. I don't have demands; I do have a stern warning and I'm right Jon. Please be very careful not to mischaracterized my words Jon. You suggest I put things in JIRA's, then seem to suggest that I'd be lucky if anyone looked at it and did anything. That's what I figured too. I don't appreciate the hostility. You will understand more fully in the next post where I'm coming from. Try to keep the conversation civilized. I'm trying or at least so you understand I think what I'm doing is saving your gig and mine. I really like a lot of people is this group. I've come to a preliminary assessment on things. Soon the cloud will clear or I'll be gone. Don't worry. I'm a very peaceful person and like you I am driven by real important projects that I feel compelled to work on for the good of others. I don't have time for people to hand hold a database and I can't get stuck with my projects on the wrong stuff. Kenneth Brotman -Original Message- From: Jon Haddad [mailto:jonathan.had...@gmail.com] On Behalf Of Jon Haddad Sent: Wednesday, February 21, 2018 12:44 PM To: u...@cassandra.apache.org Cc: dev@cassandra.apache.org Subject: Re: Cassandra Needs to Grow Up by Version Five! Ken, Maybe it’s not clear how open source projects work, so let me try to explain. There’s a bunch of us who either get paid by someone or volunteer on our free time. The folks that get paid, (yay!) usually take direction on what the priorities are, and work on projects that directly affect our jobs. That means that someone needs to care enough about the features you want to work on them, if you’re not going to do it yourself. Now as others have said already, please put your list of demands in JIRA, if someone is interested, they will work on it. You may need to contribute a little more than you’ve done already, be prepared to get involved if you actually want to to see something get done. Perhaps learning a little more about Cassandra’s internals and the people involved will reveal some of the design decisions and priorities of the project. Third, you seem to be a little obsessed with market share. While market share is fun to talk about, *most* of us that are working on and contributing to Cassandra do so because it does actually solve a problem we have, and solves it reasonably well. If some magic open source DB appears out of no where and does everything you want Cassandra to, and is bug free, keeps your data consistent, automatically does backups, comes with really nice cert management, ad hoc querying, amazing materialized views that are perfect, no caveats to secondary indexes, and somehow still gives you linear scalability without any mental overhead whatsoever then sure, people might start using it. And that’s actually OK, because if that happens we’ll all be incredibly pumped out of our minds because we won’t have to work as hard. If on the slim chance that doesn’t manifest, those of us that use Cassandra and are part of the community will keep working on the things we care about, iterating, and improving things. Maybe someone will even take a look at your JIRA issues. Further filling the mailing list with your grievances will likely not help you progress towards your goal of a Cassandra that’s easier to use, so I encourage you to try to be a little more productive and try to help rather than just complain, which is not constructive. I did a quick search for your name on the mailing list, and I’ve seen very little from you, so to everyone’s who’s been around for a while and trying to help you it looks like you’re just some random dude asking for people to work for free on the things you’re asking for, without offering anything back in return. Jon > On Feb 21, 2018, at 11:56 AM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> > wrote: > > Josh, > > To say nothing is indifference. If you care about your community, sometimes > don't you have to bring up a subject even though you know it's also > temporarily adding some discomfort? > > As to opening a JIRA, I've got a very specific topic to try in mind now. An > easy one I'll work on and then announce. Someone else will have to do the > coding. A year from now I would probably just knock it out to make sure it's > as easy as I expect it to be but to be honest, as I've been saying, I'm not > set up to do that right now. I've barely looked at any Cassandra code; for > one; everyone on this list probably codes more than I do, secondly; and > lastly, it's a good one for someone that wants an easy one to start with: > vNodes. I've already seen too many people seeking assistance with the vNode > setting. > > And you can expect as others have been mentioning that there should be > similar ones on compaction, repair and backup. >
RE: Cassandra Needs to Grow Up by Version Five!
Josh, To say nothing is indifference. If you care about your community, sometimes don't you have to bring up a subject even though you know it's also temporarily adding some discomfort? As to opening a JIRA, I've got a very specific topic to try in mind now. An easy one I'll work on and then announce. Someone else will have to do the coding. A year from now I would probably just knock it out to make sure it's as easy as I expect it to be but to be honest, as I've been saying, I'm not set up to do that right now. I've barely looked at any Cassandra code; for one; everyone on this list probably codes more than I do, secondly; and lastly, it's a good one for someone that wants an easy one to start with: vNodes. I've already seen too many people seeking assistance with the vNode setting. And you can expect as others have been mentioning that there should be similar ones on compaction, repair and backup. Microsoft knows poor usability gives them an easy market to take over. And they make it easy to switch. Beginning at 4:17 in the video, it says the following: "You don't need to worry about replica sets, quorum or read repair. You can focus on writing correct application logic." At 4:42, it says: "Hopefully this gives you a quick idea of how seamlessly you can bring your existing Cassandra applications to Azure Cosmos DB. No code changes are required. It works with your favorite Cassandra tools and drivers including for example native Cassandra driver for Spark. And it takes seconds to get going, and it's elastically and globally scalable." More to come, Kenneth Brotman -Original Message- From: Josh McKenzie [mailto:jmcken...@apache.org] Sent: Wednesday, February 21, 2018 8:28 AM To: dev@cassandra.apache.org Cc: User Subject: Re: Cassandra Needs to Grow Up by Version Five! There's a disheartening amount of "here's where Cassandra is bad, and here's what it needs to do for me for free" happening in this thread. This is open-source software. Everyone is *strongly encouraged* to submit a patch to move the needle on *any* of these things being complained about in this thread. For the Apache Way <https://www.apache.org/foundation/governance/> to work, people need to step up and meaningfully contribute to a project to scratch their own itch instead of just waiting for a random corporation-subsidized engineer to happen to have interests that align with them and contribute that to the project. Beating a dead horse for things everyone on the project knows are serious pain points is not productive. On Wed, Feb 21, 2018 at 5:45 AM, Oleksandr Shulgin < oleksandr.shul...@zalando.de> wrote: > On Mon, Feb 19, 2018 at 10:01 AM, Kenneth Brotman < > kenbrot...@yahoo.com.invalid> wrote: > > > > > >> Cluster wide management should be a big theme in any next major > release. > > >> > > >Na. Stability and testing should be a big theme in the next major > release. > > > > > > > Double Na on that one Jeff. I think you have a concern there about > > the need to test sufficiently to ensure the stability of the next > > major release. That makes perfect sense.- for every release, > > especially the major ones. Continuous improvement is not a phase of > > development for example. CI should be in everything, in every > > phase. Stability and testing a part of every release not just one. > > A major release should be > a > > nice step from the previous major release though. > > > > I guess what Jeff refers to is the tick-tock release cycle experiment, > which has proven to be a complete disaster by popular opinion. > > There's also the "materialized views" feature which failed to > materialize in the end (pun intended) and had to be declared > experimental retroactively. > > Another prominent example is incremental repair which was introduced > as the default option in 2.2 and now is not recommended to use because > of so many corner cases where it can fail. So again experimental as an > afterthought. > > Not to mention that even if you are aware of the default incremental > and go with full repair instead, you're still up for a sad surprise: > anti-compaction will be triggered despite the "full" repair. Because > anti-compaction is only disabled in case of sub-range repair (don't > ask why), so you need to use something advanced like Reaper if you > want to avoid that. I don't think you'll ever find this in the documentation. > > Honestly, for an eventually-consistent system like Cassandra > anti-entropy repair is one of the most important pieces to get right. > And Cassandra fails really badly on that one: the feature is not > really well d
RE: Cassandra Needs to Grow Up by Version Five!
If you watch this video through you'll see why usability is so important. You can't ignore usability issues. Cassandra does not exist in a vacuum. The competitors are world class. The video is on the New Cassandra API for Azure Cosmos DB: https://www.youtube.com/watch?v=1Sf4McGN1AQ Kenneth Brotman -Original Message- From: Daniel Hölbling-Inzko [mailto:daniel.hoelbling-in...@bitmovin.com] Sent: Tuesday, February 20, 2018 1:28 AM To: u...@cassandra.apache.org; James Briggs Cc: dev@cassandra.apache.org Subject: Re: Cassandra Needs to Grow Up by Version Five! Hi, I have to add my own two cents here as the main thing that keeps me from really running Cassandra is the amount of pain running it incurs. Not so much because it's actually painful but because the tools are so different and the documentation and best practices are scattered across a dozen outdated DataStax articles and this mailing list etc.. We've been hesitant (although our use case is perfect for using Cassandra) to deploy Cassandra to any critical systems as even after a year of running it we still don't have the operational experience to confidently run critical systems with it. Simple things like a foolproof / safe cluster-wide S3 Backup (like Elasticsearch has it) would for example solve a TON of issues for new people. I don't need it auto-scheduled or something, but having to configure cron jobs across the whole cluster is a pain in the ass for small teams. To be honest, even the way snapshots are done right now is already super painful. Every other system I operated so far will just create one backup folder I can export, in C* the Backup is scattered across a bunch of different Keyspace folders etc.. needless to say that it took a while until I trusted my backup scripts fully. And especially for a Database I believe Backup/Restore needs to be a non-issue that's documented front and center. If not smaller teams just don't have the resources to dedicate to learning and building the tools around it. Now that the team is getting larger we could spare the resources to operate these things, but switching from a well-understood RDBMs schema to Cassandra is now incredibly hard and will probably take years. greetings Daniel On Tue, 20 Feb 2018 at 05:56 James Briggs <james.bri...@yahoo.com.invalid> wrote: > Kenneth: > > What you said is not wrong. > > Vertica and Riak are examples of distributed databases that don't > require hand-holding. > > Cassandra is for Java-programmer DIYers, or more often Datastax > clients, at this point. > Thanks, James. > > ------ > *From:* Kenneth Brotman <kenbrot...@yahoo.com.INVALID> > *To:* u...@cassandra.apache.org > *Cc:* dev@cassandra.apache.org > *Sent:* Monday, February 19, 2018 4:56 PM > > *Subject:* RE: Cassandra Needs to Grow Up by Version Five! > > Jeff, you helped me figure out what I was missing. It just took me a > day to digest what you wrote. I’m coming over from another type of > engineering. I didn’t know and it’s not really documented. Cassandra > runs in a data center. Now days that means the nodes are going to be > in managed containers, Docker containers, managed by Kerbernetes, > Meso or something, and for that reason anyone operating Cassandra in a > real world setting would not encounter the issues I raised in the way I > described. > > Shouldn’t the architectural diagrams people reference indicate that in > some way? That would have help me. > > Kenneth Brotman > > *From:* Kenneth Brotman [mailto:kenbrot...@yahoo.com] > *Sent:* Monday, February 19, 2018 10:43 AM > *To:* 'u...@cassandra.apache.org' > *Cc:* 'dev@cassandra.apache.org' > *Subject:* RE: Cassandra Needs to Grow Up by Version Five! > > Well said. Very fair. I wouldn’t mind hearing from others still > You’re a good guy! > > Kenneth Brotman > > *From:* Jeff Jirsa [mailto:jji...@gmail.com <jji...@gmail.com>] > *Sent:* Monday, February 19, 2018 9:10 AM > *To:* cassandra > *Cc:* Cassandra DEV > *Subject:* Re: Cassandra Needs to Grow Up by Version Five! > > There's a lot of things below I disagree with, but it's ok. I > convinced myself not to nit-pick every point. > > https://issues.apache.org/jira/browse/CASSANDRA-13971 has some of > Stefan's work with cert management > > Beyond that, I encourage you to do what Michael suggested: open JIRAs > for things you care strongly about, work on them if you have time. > Sometime this year we'll schedule a NGCC (Next Generation Cassandra > Conference) where we talk about future project work and direction, I > encourage you to attend if you're able (I encourage anyone who cares > about the direction of Cassandra to attend, it's probably be either > free or very low cost, just to cover a venue and som
RE: Cassandra Needs to Grow Up by Version Five!
Jeff, you helped me figure out what I was missing. It just took me a day to digest what you wrote. I’m coming over from another type of engineering. I didn’t know and it’s not really documented. Cassandra runs in a data center. Now days that means the nodes are going to be in managed containers, Docker containers, managed by Kerbernetes, Meso or something, and for that reason anyone operating Cassandra in a real world setting would not encounter the issues I raised in the way I described. Shouldn’t the architectural diagrams people reference indicate that in some way? That would have help me. Kenneth Brotman From: Kenneth Brotman [mailto:kenbrot...@yahoo.com] Sent: Monday, February 19, 2018 10:43 AM To: 'u...@cassandra.apache.org' Cc: 'dev@cassandra.apache.org' Subject: RE: Cassandra Needs to Grow Up by Version Five! Well said. Very fair. I wouldn’t mind hearing from others still. You’re a good guy! Kenneth Brotman From: Jeff Jirsa [mailto:jji...@gmail.com] Sent: Monday, February 19, 2018 9:10 AM To: cassandra Cc: Cassandra DEV Subject: Re: Cassandra Needs to Grow Up by Version Five! There's a lot of things below I disagree with, but it's ok. I convinced myself not to nit-pick every point. https://issues.apache.org/jira/browse/CASSANDRA-13971 has some of Stefan's work with cert management Beyond that, I encourage you to do what Michael suggested: open JIRAs for things you care strongly about, work on them if you have time. Sometime this year we'll schedule a NGCC (Next Generation Cassandra Conference) where we talk about future project work and direction, I encourage you to attend if you're able (I encourage anyone who cares about the direction of Cassandra to attend, it's probably be either free or very low cost, just to cover a venue and some food). If nothing else, you'll meet some of the teams who are working on the project, and learn why they've selected the projects on which they're working. You'll have an opportunity to pitch your vision, and maybe you can talk some folks into helping out. - Jeff On Mon, Feb 19, 2018 at 1:01 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: Comments inline >-Original Message- >From: Jeff Jirsa [mailto:jji...@gmail.com] >Sent: Sunday, February 18, 2018 10:58 PM >To: u...@cassandra.apache.org >Cc: dev@cassandra.apache.org >Subject: Re: Cassandra Needs to Grow Up by Version Five! > >Comments inline > > >> On Feb 18, 2018, at 9:39 PM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> >> wrote: >> > >Cassandra feels like an unfinished program to me. The problem is not that > >it’s open source or cutting edge. It’s an open source cutting edge program > >that lacks some of its basic functionality. We are all stuck addressing > >fundamental mechanical tasks for Cassandra because the basic code that would > >do that part has not been contributed yet. >> >There’s probably 2-3 reasons why here: > >1) Historically the pmc has tried to keep the scope of the project very >narrow. It’s a database. We don’t ship drivers. We don’t ship developer tools. >We don’t ship fancy UIs. We ship a database. I think for the most part the >narrow vision has been for the best, but maybe it’s time to reconsider some of >the scope. > >Postgres will autovacuum to prevent wraparound (hopefully), but everyone I >know running Postgres uses flexible-freeze in cron - sometimes it’s ok to let >the database have its opinions and let third party tools fill in the gaps. > I can appreciate the desire to stay in scope. I believe usability is the King. When users have to learn the database, then learn what they have to automate, then learn an automation tool and then use the automation tool to do something that is as fundamental as the fundamental tasks I described, then something is missing from the database itself that is adversely affecting usability - and that is very bad. Where those big companies need to calculate the ROI is in the cost of acquiring or training the next group of users. Consider how steep the learning curve is for new users. Consider the business case for improving ease of use. >2) Cassandra is, by definition, a database for large scale problems. Most of >the companies working on/with it tend to be big companies. Big companies often >have pre-existing automation that solved the stuff you consider fundamental >tasks, so there’s probably nobody actively working on the solved problems that >you may consider missing features - for many people they’re already solved. > I could be wrong but it sounds like a lot of the code work is done, and if the companies would take the time to contribute more code, then the rest of the code needed could be generated easily. >3) It’s not nearly as basic as you think it
RE: Cassandra Needs to Grow Up by Version Five!
Well said. Very fair. I wouldn’t mind hearing from others still. You’re a good guy! Kenneth Brotman From: Jeff Jirsa [mailto:jji...@gmail.com] Sent: Monday, February 19, 2018 9:10 AM To: cassandra Cc: Cassandra DEV Subject: Re: Cassandra Needs to Grow Up by Version Five! There's a lot of things below I disagree with, but it's ok. I convinced myself not to nit-pick every point. https://issues.apache.org/jira/browse/CASSANDRA-13971 has some of Stefan's work with cert management Beyond that, I encourage you to do what Michael suggested: open JIRAs for things you care strongly about, work on them if you have time. Sometime this year we'll schedule a NGCC (Next Generation Cassandra Conference) where we talk about future project work and direction, I encourage you to attend if you're able (I encourage anyone who cares about the direction of Cassandra to attend, it's probably be either free or very low cost, just to cover a venue and some food). If nothing else, you'll meet some of the teams who are working on the project, and learn why they've selected the projects on which they're working. You'll have an opportunity to pitch your vision, and maybe you can talk some folks into helping out. - Jeff On Mon, Feb 19, 2018 at 1:01 AM, Kenneth Brotman <kenbrot...@yahoo.com.invalid> wrote: Comments inline >-Original Message- >From: Jeff Jirsa [mailto:jji...@gmail.com] >Sent: Sunday, February 18, 2018 10:58 PM >To: u...@cassandra.apache.org >Cc: dev@cassandra.apache.org >Subject: Re: Cassandra Needs to Grow Up by Version Five! > >Comments inline > > >> On Feb 18, 2018, at 9:39 PM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> >> wrote: >> > >Cassandra feels like an unfinished program to me. The problem is not that > >it’s open source or cutting edge. It’s an open source cutting edge program > >that lacks some of its basic functionality. We are all stuck addressing > >fundamental mechanical tasks for Cassandra because the basic code that would > >do that part has not been contributed yet. >> >There’s probably 2-3 reasons why here: > >1) Historically the pmc has tried to keep the scope of the project very >narrow. It’s a database. We don’t ship drivers. We don’t ship developer tools. >We don’t ship fancy UIs. We ship a database. I think for the most part the >narrow vision has been for the best, but maybe it’s time to reconsider some of >the scope. > >Postgres will autovacuum to prevent wraparound (hopefully), but everyone I >know running Postgres uses flexible-freeze in cron - sometimes it’s ok to let >the database have its opinions and let third party tools fill in the gaps. > I can appreciate the desire to stay in scope. I believe usability is the King. When users have to learn the database, then learn what they have to automate, then learn an automation tool and then use the automation tool to do something that is as fundamental as the fundamental tasks I described, then something is missing from the database itself that is adversely affecting usability - and that is very bad. Where those big companies need to calculate the ROI is in the cost of acquiring or training the next group of users. Consider how steep the learning curve is for new users. Consider the business case for improving ease of use. >2) Cassandra is, by definition, a database for large scale problems. Most of >the companies working on/with it tend to be big companies. Big companies often >have pre-existing automation that solved the stuff you consider fundamental >tasks, so there’s probably nobody actively working on the solved problems that >you may consider missing features - for many people they’re already solved. > I could be wrong but it sounds like a lot of the code work is done, and if the companies would take the time to contribute more code, then the rest of the code needed could be generated easily. >3) It’s not nearly as basic as you think it is. Datastax seemingly had a >multi-person team on opscenter, and while it was better than anything else >around last time I used it (before it stopped supporting the OSS version), it >left a lot to be desired. It’s probably 2-3 engineers working for a month to >have any sort of meaningful, reliable, mostly trivial cluster-managing UI, and >I can think of about 10 JIRAs I’d rather see that time be spent on first. How about 6-9 engineers working 12 months a year on it then. I'm not kidding. For a big company with revenues in the tens of billions or more, and a heavy use of Cassandra nodes, it's easy to make a case for having a full time person or more that involved. They aren't paying for using the open source code that is Cassandra. Let's see what would the licensing fees be for a big company if the costs where like Micr
RE: Cassandra Needs to Grow Up by Version Five!
Comments inline >-Original Message- >From: Jeff Jirsa [mailto:jji...@gmail.com] >Sent: Sunday, February 18, 2018 10:58 PM >To: u...@cassandra.apache.org >Cc: dev@cassandra.apache.org >Subject: Re: Cassandra Needs to Grow Up by Version Five! > >Comments inline > > >> On Feb 18, 2018, at 9:39 PM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> >> wrote: >> > >Cassandra feels like an unfinished program to me. The problem is not that > >it’s open source or cutting edge. It’s an open source cutting edge program > >that lacks some of its basic functionality. We are all stuck addressing > >fundamental mechanical tasks for Cassandra because the basic code that would > >do that part has not been contributed yet. >> >There’s probably 2-3 reasons why here: > >1) Historically the pmc has tried to keep the scope of the project very >narrow. It’s a database. We don’t ship drivers. We don’t ship developer tools. >We don’t ship fancy UIs. We ship a database. I think for the most part the >narrow vision has been for the best, but maybe it’s time to reconsider some of >the scope. > >Postgres will autovacuum to prevent wraparound (hopefully), but everyone I >know running Postgres uses flexible-freeze in cron - sometimes it’s ok to let >the database have its opinions and let third party tools fill in the gaps. > I can appreciate the desire to stay in scope. I believe usability is the King. When users have to learn the database, then learn what they have to automate, then learn an automation tool and then use the automation tool to do something that is as fundamental as the fundamental tasks I described, then something is missing from the database itself that is adversely affecting usability - and that is very bad. Where those big companies need to calculate the ROI is in the cost of acquiring or training the next group of users. Consider how steep the learning curve is for new users. Consider the business case for improving ease of use. >2) Cassandra is, by definition, a database for large scale problems. Most of >the companies working on/with it tend to be big companies. Big companies often >have pre-existing automation that solved the stuff you consider fundamental >tasks, so there’s probably nobody actively working on the solved problems that >you may consider missing features - for many people they’re already solved. > I could be wrong but it sounds like a lot of the code work is done, and if the companies would take the time to contribute more code, then the rest of the code needed could be generated easily. >3) It’s not nearly as basic as you think it is. Datastax seemingly had a >multi-person team on opscenter, and while it was better than anything else >around last time I used it (before it stopped supporting the OSS version), it >left a lot to be desired. It’s probably 2-3 engineers working for a month to >have any sort of meaningful, reliable, mostly trivial cluster-managing UI, and >I can think of about 10 JIRAs I’d rather see that time be spent on first. How about 6-9 engineers working 12 months a year on it then. I'm not kidding. For a big company with revenues in the tens of billions or more, and a heavy use of Cassandra nodes, it's easy to make a case for having a full time person or more that involved. They aren't paying for using the open source code that is Cassandra. Let's see what would the licensing fees be for a big company if the costs where like Microsoft or Oracle would charge for their enterprise level relational database? What's the contribution of one or two people in comparison. >> Ease of use issues need to be given much more attention. For an >> administrator, the ease of use of Cassandra is very poor. >> >>Furthermore, currently Cassandra is an idiot. We have to do everything for >>Cassandra. Contrast that with the fact that we are in the dawn of artificial >>intelligence. >> > >And for everything you think is obvious, there’s a 50% chance someone else >will have already solved differently, and your obvious new solution will be >seen as an inconvenient assumption and complexity they won’t appreciate. Open >source projects get to walk a fine line of trying to be useful without making >too many assumptions, being “too” opinionated, or overstepping bounds. We may >be too conservative, but it’s very easy to go too far in the opposite >direction. > I appreciate that but when such concerns result in inaction instead of resolution that is no good. >> Software exists to automate tasks for humans, not mechanize humans to >> administer tasks for a database. I’m an engineering type. My job is to >> apply science and technology to solve real world problems. And that’s
RE: Cassandra Needs to Grow Up by Version Five!
Hi Michael, actually I do very much like the database. thanks for the thoughts... a few comments: 1) Lots of big companies like, let's see, Apple is a big one, probably could easily justify contributing resources to finish up the basic development of Cassandra. 2) There are lots of big companies using Cassandra. Each could contribute a tiny effort and everyone would benefit greatly. 3) A focused effort by a small group of talented people like there are in this group could knock it out easily. 4) Not everyone is a Cassandra coder. It's not for me to do Michael. 5) I'm an individual. I am not working at a big company at the moment Michael. Best, Kenneth Brotman -Original Message- From: Michael Kjellman [mailto:kjell...@apple.com] Sent: Sunday, February 18, 2018 10:18 PM To: dev@cassandra.apache.org Subject: Re: Cassandra Needs to Grow Up by Version Five! hi ken, sorry you don’t like the database. some thoughts: 1) please file actionable jiras for places you feel need to be improved in the database... this is the best way to make and encourage the change you’re looking for. it seems you have quite a few ideas from your post that could be broken down into individual actionable jiras. 2) please don’t cross post between mailing lists. 3) pull requests are always welcomed! best, kjellman > On Feb 18, 2018, at 9:39 PM, Kenneth Brotman <kenbrot...@yahoo.com.INVALID> > wrote: > > Cassandra feels like an unfinished program to me. The problem is not > that it's open source or cutting edge. It's an open source cutting > edge program that lacks some of its basic functionality. We are all > stuck addressing fundamental mechanical tasks for Cassandra because > the basic code that would do that part has not been contributed yet. > > Ease of use issues need to be given much more attention. For an > administrator, the ease of use of Cassandra is very poor. > > Furthermore, currently Cassandra is an idiot. We have to do > everything for Cassandra. Contrast that with the fact that we are in > the dawn of artificial intelligence. > > Software exists to automate tasks for humans, not mechanize humans to > administer tasks for a database. I'm an engineering type. My job is > to apply science and technology to solve real world problems. And > that's where I need an organization's I.T. talent to focus; not in > crank starting an unfinished database. > > For example, I should be able to go to any node, replace the > Cassandra.yaml file and have a prompt on the display ask me if I want > to update all the yaml files across the cluster. I shouldn't have to > manually modify yaml files on each node or have to create a script for > some third party automation tool to do it. > > I should not have to turn off service, clear directories, restart > service in coordination with the other nodes. It's already a computer > system. It can do those things on its own. > > How about read repair. First there is something wrong with the name. > Maybe it should be called Consistency Repair. An administrator > shouldn't have to do anything. It should be a behavior of Cassandra > that is programmed in. It should consider the GC setting of each node, > calculate how often it has to run repair, when it should run it so all > the nodes aren't trying at the same time and when other circumstances > indicate it should also run it. > > Certificate management should be automated. > > Cluster wide management should be a big theme in any next major release. > What is a major release? How many major releases could a program have > before all the coding for basic stuff like installation, configuration > and maintenance is included! > > Finish the basic coding of Cassandra, make it easy to use for > administrators, make is smart, add cluster wide management. Keep > Cassandra competitive or it will soon be the old Model T we all remember > fondly. > > I ask the Committee to compile a list of all such items, make a plan, > and commit to including the completed and tested code as part of major > release 5.0. I further ask that release 4.0 not be delayed and then > there be an unusually short skip to version 5.0. > > Kenneth Brotman > - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org - To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org For additional commands, e-mail: dev-h...@cassandra.apache.org
Cassandra Needs to Grow Up by Version Five!
Cassandra feels like an unfinished program to me. The problem is not that it's open source or cutting edge. It's an open source cutting edge program that lacks some of its basic functionality. We are all stuck addressing fundamental mechanical tasks for Cassandra because the basic code that would do that part has not been contributed yet. Ease of use issues need to be given much more attention. For an administrator, the ease of use of Cassandra is very poor. Furthermore, currently Cassandra is an idiot. We have to do everything for Cassandra. Contrast that with the fact that we are in the dawn of artificial intelligence. Software exists to automate tasks for humans, not mechanize humans to administer tasks for a database. I'm an engineering type. My job is to apply science and technology to solve real world problems. And that's where I need an organization's I.T. talent to focus; not in crank starting an unfinished database. For example, I should be able to go to any node, replace the Cassandra.yaml file and have a prompt on the display ask me if I want to update all the yaml files across the cluster. I shouldn't have to manually modify yaml files on each node or have to create a script for some third party automation tool to do it. I should not have to turn off service, clear directories, restart service in coordination with the other nodes. It's already a computer system. It can do those things on its own. How about read repair. First there is something wrong with the name. Maybe it should be called Consistency Repair. An administrator shouldn't have to do anything. It should be a behavior of Cassandra that is programmed in. It should consider the GC setting of each node, calculate how often it has to run repair, when it should run it so all the nodes aren't trying at the same time and when other circumstances indicate it should also run it. Certificate management should be automated. Cluster wide management should be a big theme in any next major release. What is a major release? How many major releases could a program have before all the coding for basic stuff like installation, configuration and maintenance is included! Finish the basic coding of Cassandra, make it easy to use for administrators, make is smart, add cluster wide management. Keep Cassandra competitive or it will soon be the old Model T we all remember fondly. I ask the Committee to compile a list of all such items, make a plan, and commit to including the completed and tested code as part of major release 5.0. I further ask that release 4.0 not be delayed and then there be an unusually short skip to version 5.0. Kenneth Brotman
RE: Release votes
I'd like to help as well. For me the issue is I have only my time to contribute. The resources to test Cassandra extensively are beyond that of most individuals including me, aren't they? If resources are made available and I can help, count me in. Also, perhaps having a standard reference like Slender Cassandra (18 nodes total, two regions, three AZ's total, six nodes per AZ) would help. I'll have it done very soon. Kenneth Brotman -Original Message- From: kurt greaves [mailto:k...@instaclustr.com] Sent: Thursday, February 15, 2018 3:48 PM To: dev@cassandra.apache.org Subject: Re: Release votes It seems there has been a bit of a slip in testing as of recently, mostly due to the fact that there's no canonical testing environment that isn't flaky. We probably need to come up with some ideas and a plan on how we're going to do testing in the future, and how we're going to make testing accessible for all contributors. I think this is the only way we're really going to change behaviour. Having an incredibly tedious process and then being aggressive about it only leads to resentment and workarounds. I'm completely unsure of where dtests are at since the conversion to pytest, and there's a lot of failing dtests on the ASF jenkins jobs (which appear to be running pytest). As there's currently not a lot of visibility into what people are doing with CircleCI for this it's hard to say if things are better over there. I'd like to help here if anyone wants to fill me in. On 15 February 2018 at 21:14, Josh McKenzie <jmcken...@apache.org> wrote: > > > > We’ve said in the past that we don’t release without green tests. > > The PMC gets to vote and enforce it. If you don’t vote yes without > > seeing the > test > > results, that enforces it. > > I think this is noble and ideal in theory. In practice, the tests take > long enough, hardware infra has proven flaky enough, and the tests > *themselves* flaky enough, that there's been a consistent low-level of > test failure noise that makes separating signal from noise in this > context very time consuming. Reference 3.11-test-all for example re:noise: > https://builds.apache.org/view/A-D/view/Cassandra/job/ > Cassandra-3.11-test-all/test/?width=1024=768 > > Having spearheaded burning test failures to 0 multiple times and have > them regress over time, my gut intuition is we should have one person > as our Source of Truth with a less-flaky source for release-vetting CI > (dedicated hardware, circle account, etc) we can use as a reference to > vote on release SHA's. > > We’ve declared this a requirement multiple times > > Declaring things != changed behavior, and thus != changed culture. The > culture on this project is one of having a constant low level of test > failure noise in our CI as a product of our working processes. Unless > we change those (actually block release w/out green board, actually > aggressively block merge w/any failing tests, aggressively > retroactively track down test failures on a daily basis and RCA), the > situation won't improve. Given that this is a volunteer organization / > project, that kind of daily time investment is a big ask. > > On Thu, Feb 15, 2018 at 1:10 PM, Jeff Jirsa <jji...@gmail.com> wrote: > > > Moving this to it’s own thread: > > > > We’ve declared this a requirement multiple times and then we > > occasionally get a critical issue and have to decide whether it’s > > worth the delay. I assume Jason’s earlier -1 on attempt 1 was an > > enforcement of that earlier stated goal. > > > > It’s up to the PMC. We’ve said in the past that we don’t release > > without green tests. The PMC gets to vote and enforce it. If you > > don’t vote yes without seeing the test results, that enforces it. > > > > -- > > Jeff Jirsa > > > > > > > On Feb 15, 2018, at 9:49 AM, Josh McKenzie <jmcken...@apache.org> > wrote: > > > > > > What would it take for us to get green utest/dtests as a blocking > > > part > of > > > the release process? i.e. "for any given SHA, here's a link to the > tests > > > that passed" in the release vote email? > > > > > > That being said, +1. > > > > > >> On Wed, Feb 14, 2018 at 4:33 PM, Nate McCall <zznat...@gmail.com> > > wrote: > > >> > > >> +1 > > >> > > >> On Thu, Feb 15, 2018 at 9:40 AM, Michael Shuler < > mich...@pbandjelly.org > > > > > >> wrote: > > >>> I propose the following artifacts for release as 3.0.16. > > >>> > > >>> sha1: 890f319142ddd3cf2692ff45ff28e71001365e96 > > >>> Git: >