Re: Tools to manage repairs

2016-10-28 Thread Edward Capriolo
Big partitions are an anti-pattern; here is why. First, Cassandra is not an analytic datastore. Sure, it has some UDFs and aggregate UDFs, but the true purpose of the data store is to satisfy point reads. Operations have strict timeouts: # How long the coordinator should wait for read operations to

Re: Question on Read Repair

2016-10-11 Thread Edward Capriolo
This is the theory but not always the practice. The failure detector's heartbeating is a process happening outside the read. Take for example a cluster with Replication Factor 3. At time('1) the failure detector might read three nodes as UP. A request "soon after '1" issued at time('2) might start a read

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-12 Thread Edward Capriolo
The "2 billion column limit" is press-clipping "puffery". This statement seemingly became popular because of a highly trafficked story, in which a tech reporter embellished a statement to make a splashy article. The effect is something like this:

Re: Inconsistencies in materialized views

2016-10-17 Thread Edward Capriolo
https://issues.apache.org/jira/browse/CASSANDRA-11198 Which has problems "maybe" fixed by: https://issues.apache.org/jira/browse/CASSANDRA-11475 Which has its own set of problems. One of these patches was merged into 3.7, which tells you that you are running a version (3.6) with known bugs. Also as the

Re: Batch size warnings

2016-12-07 Thread Edward Capriolo
I have been circling around a thought process over batches. Now that Cassandra has aggregating functions, it might be possible to write a type of record that has an END_OF_BATCH type marker, and the data can be suppressed from view until it is all there. I.e., you write something like a checksum record

Re: Benefit of LOCAL_SERIAL consistency

2016-12-08 Thread Edward Capriolo
on.) >> >> As you said, SERIAL with multi-DCs incurs latency increase, >> but it's a trade-off between latency and high availability because one >> DC can be down from a disaster. >> I don't think there is any way to achieve global linearizability >> without

Re: implementing a 'sorted set' on top of cassandra

2017-01-13 Thread Edward Capriolo
On Fri, Jan 13, 2017 at 8:14 PM, Jonathan Haddad wrote: > I've thought about this for years and have never arrived on a particularly > great implementation. Your idea will be maybe OK if the sets are very > small and if the values don't change very often. But in a system

Re: implementing a 'sorted set' on top of cassandra

2017-01-13 Thread Edward Capriolo
On Fri, Jan 13, 2017 at 5:14 PM, Mike Torra wrote: > We currently use redis to store sorted sets that we increment many, many > times more than we read. For example, only about 5% of these sets are ever > read. We are getting to the point where redis is becoming difficult

Re: Strange issue wherein cassandra not being started from cron

2017-01-10 Thread Edward Capriolo
On Tuesday, January 10, 2017, Jonathan Haddad wrote: > Last I checked, cron doesn't load the same, full environment you see when > you log in. Also, why put Cassandra on a cron? > On Mon, Jan 9, 2017 at 9:47 PM Bhuvan Rawal

Re: Why does `now()` produce different times within the same query?

2016-12-03 Thread Edward Capriolo
On Saturday, December 3, 2016, Edward Capriolo <edlinuxg...@gmail.com> wrote: > > > On Saturday, December 3, 2016, Jonathan Haddad <j...@jonhaddad.com> wrote: > >> That isn't what the origin

Re: Why does `now()` produce different times within the same query?

2016-12-03 Thread Edward Capriolo
timestamp out of both? > On Fri, Dec 2, 2016 at 10:24 AM Edward Capriolo <edlinuxg...@gmail.com> wrote: > >> >> On Thu, Dec 1, 2016 at 11:09 AM, Sylvain Lebresne <sylv...@datastax.com

Re: Why does `now()` produce different times within the same query?

2016-12-03 Thread Edward Capriolo
> unexpected thing virtually every time. > On Sat, Dec 3, 2016 at 7:09 AM Edward Capriolo <edlinuxg...@gmail.com> wrote: > >> >> >> On Friday, December 2, 2016, Jonathan Haddad <j...@jonhaddad.com

Re: Why does `now()` produce different times within the same query?

2016-12-03 Thread Edward Capriolo
On Sat, Dec 3, 2016 at 11:01 AM, Edward Capriolo <edlinuxg...@gmail.com> wrote: > > > On Saturday, December 3, 2016, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> >> >> On Saturday, December 3, 2016, Jonathan Haddad <j...@jonhaddad.com>

Re: Why does `now()` produce different times within the same query?

2016-12-01 Thread Edward Capriolo
On Thu, Dec 1, 2016 at 4:06 AM, Sylvain Lebresne wrote: > One can of course always open a JIRA, but I'm going to strongly disagree > with a > change here (outside of a documentation one that is). > > The now() function is a timeuuid generator, and it thus generates a unique

Re: Why does `now()` produce different times within the same query?

2016-12-01 Thread Edward Capriolo
On Thu, Dec 1, 2016 at 11:09 AM, Sylvain Lebresne <sylv...@datastax.com> wrote: > On Thu, Dec 1, 2016 at 4:44 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> >> I am not sure you saw my reply on thread but I believe everyone's needs >> can be

Re: Why does `now()` produce different times within the same query?

2016-11-30 Thread Edward Capriolo
On Wed, Nov 30, 2016 at 10:53 PM, Cody Yancey wrote: > This is not a bug, and in fact changing it would be a serious bug. > > False. Absolutely no consumer would be broken by a change to guarantee an > identical time component that isn't broken already, for the simple reason

Re: Why does `now()` produce different times within the same query?

2016-12-02 Thread Edward Capriolo
On Thu, Dec 1, 2016 at 11:09 AM, Sylvain Lebresne <sylv...@datastax.com> wrote: > On Thu, Dec 1, 2016 at 4:44 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> >> I am not sure you saw my reply on thread but I believe everyone's needs >> can be

Re: weird jvm metrics

2017-01-05 Thread Edward Capriolo
On Thu, Jan 5, 2017 at 1:53 PM, Alain Rastoul <alf.mmm@gmail.com> wrote: > On 01/04/2017 11:12 PM, Edward Capriolo wrote: > >> The metric-reporter is actually leveraged from another project. >> >> https://github.com/addthis/metrics-reporter-config >> >

Re: Logs appear to contradict themselves during bootstrap steps

2017-01-06 Thread Edward Capriolo
On Fri, Jan 6, 2017 at 6:45 PM, Sotirios Delimanolis wrote: > I forgot to check nodetool gossipinfo. Still, why does the first check > think that the address exists, but the second doesn't? > > > On Friday, January 6, 2017 1:11 PM, David Berry >

Re: Help

2017-01-09 Thread Edward Capriolo
On Sun, Jan 8, 2017 at 11:30 PM, Anshu Vajpayee wrote: > Gossip shows - all nodes are up. > > But when we perform writes , coordinator stores the hints. It means - > coordinator was not able to deliver the writes to few nodes after meeting > consistency requirements.

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Edward Capriolo
On Mon, Jan 2, 2017 at 3:51 PM, Benjamin Roth <benjamin.r...@jaumo.com> wrote: > Does this discussion really make sense any more? To me it seems it turned > opinionated and religious. From my point of view anything that has to be > said was said. > > Am 02.01.2017 21:27 sch

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Edward Capriolo
On Mon, Jan 2, 2017 at 11:56 AM, Eric Evans <john.eric.ev...@gmail.com> wrote: > On Fri, Dec 23, 2016 at 9:15 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > > "I don't really have any opinions on Oracle per say, but Cassandra is a > > Free Software

Re: Query

2016-12-29 Thread Edward Capriolo
You should start with understanding your needs. Once you understand your needs you can pick the software that fits them. Starting with a software stack is backwards. On Thu, Dec 29, 2016 at 11:34 PM, Ben Slater wrote: > I wasn’t familiar with Gizzard either so I

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Edward Capriolo
; open the JDK source code and read it. you will encounter some great ideas > and Algorithms. > > > > > > On Mon, Jan 2, 2017 at 1:04 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> >> On Mon, Jan 2, 2017 at 3:51 PM, Benjamin Roth <benjamin

Re: weird jvm metrics

2017-01-04 Thread Edward Capriolo
The metric-reporter is actually leveraged from another project. https://github.com/addthis/metrics-reporter-config Check the version of metric-reporter (in cassandra/lib) and see if it has changed from your old version to your new version. On Wed, Jan 4, 2017 at 12:02 PM, Mike Torra

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-21 Thread Edward Capriolo
On Wednesday, December 21, 2016, Kant Kodali wrote: > https://www.youtube.com/watch?v=9ei-rbULWoA > > On Wed, Dec 21, 2016 at 2:59 AM, Kant Kodali > wrote: > >>

Re: All subsequent CAS requests time out after heavy use of new CAS feature

2016-12-23 Thread Edward Capriolo
Anecdotally, CAS works differently than the typical Cassandra workload. If you run a stress instance with 3 nodes on one host, you find that you typically run into CPU issues, but if you are doing a CAS workload you see things timing out before you hit 100% CPU. It is a strange beast. On Fri, Dec 23,

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-23 Thread Edward Capriolo
On Fri, Dec 23, 2016 at 6:01 AM, Kant Kodali wrote: > Java 9 Module system looks really interesting. I would be very curious to > see how Cassandra would leverage that. > > On Thu, Dec 22, 2016 at 9:09 AM, Kant Kodali wrote: > >> I would agree with Eric

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-26 Thread Edward Capriolo
hese days not many people are willing to > pay even for distributed databases so I don't think anyone would pay for > programming language. In short, Let me end by saying Oracle just has lot of > self interest but I really hope that I am wrong since I am a big fan of JVM. > > > > >

Re: Advice in upgrade plan from 1.2.18 to 2.2.8

2016-12-22 Thread Edward Capriolo
Also, before you get started, make sure: 1) no one attempts to change schema during the process 2) no one attempts to run a repair 3) no one attempts to join a node 4) no one attempts to remove/move nodes from the cluster. Each of these things triggers repair sessions and streams data which do not

Re: Benefit of LOCAL_SERIAL consistency

2016-12-07 Thread Edward Capriolo
On Wed, Dec 7, 2016 at 8:25 AM, DuyHai Doan wrote: > The reason you don't want to use SERIAL in multi-DC clusters is the > prohibitive cost of lightweight transaction (in term of latency), > especially if your data centers are separated by continents. A ping from > London

Re: Assertions being hit on Cassandra 3.5 cluster (UnfilteredRowIterators.concat)

2017-03-22 Thread Edward Capriolo
On Wed, Mar 22, 2017 at 4:34 PM, Daniel Miranda wrote: > I found out the problem is conditioned to having the row cache enabled. > Whenever a query would return an empty result set in a particular table, it > would fail instead with the exception being thrown in all nodes. >

Ye old singleton debate

2017-03-15 Thread Edward Capriolo
This question came up today: OK, say you mock, how do you construct a working multi-process representation of how C* actually works from within a unit test without running the code that actually constructs the cluster? 1) Don't do that (construct a multinode cluster in a test) just mock the crap

Re: OOM on Apache Cassandra on 30 Plus node at the same time

2017-03-04 Thread Edward Capriolo
On Saturday, March 4, 2017, Thakrar, Jayesh wrote: > LCS does not rule out frequent updates - it just says that there will be > more frequent compaction, which can potentially increase compaction > activity (which again can be throttled as needed). > > But STCS will

Re: scylladb

2017-03-11 Thread Edward Capriolo
On Sat, Mar 11, 2017 at 2:08 PM, Bhuvan Rawal wrote: > "Lastly, why don't you test Scylla yourself? It's pretty easy to set up, > there's nothing to tune." > - The details are indeed compelling to have a go ahead and test it for > specific use case. > > If it works out

Re: scylladb

2017-03-11 Thread Edward Capriolo
On Sat, Mar 11, 2017 at 9:41 PM, daemeon reiydelle wrote: > Recall that garbage collection on a busy node can occur minutes or seconds > apart. Note that stop the world GC also happens as frequently as every > couple of minutes on every node. Remove that and do the simple

Re: scylladb

2017-03-12 Thread Edward Capriolo
On Sun, Mar 12, 2017 at 1:38 AM, benjamin roth wrote: > There is no reason to be angry. This is progress. This is the circle of > live. > > It happens anywhere at any time. > > Am 12.03.2017 07:34 schrieb "Dor Laor" : > >> On Sat, Mar 11, 2017 at 10:02 PM,

Re: scylladb

2017-03-12 Thread Edward Capriolo
On Sun, Mar 12, 2017 at 11:40 AM, Edward Capriolo <edlinuxg...@gmail.com> wrote: > > > On Sun, Mar 12, 2017 at 1:38 AM, benjamin roth <brs...@gmail.com> wrote: > >> There is no reason to be angry. This is progress. This is the circle of >> live. >>

Re: scylladb

2017-03-12 Thread Edward Capriolo
On Sun, Mar 12, 2017 at 3:45 PM, Dor Laor <d...@scylladb.com> wrote: > On Sun, Mar 12, 2017 at 12:11 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> The simple claim that "Scylla IS a drop in replacement for C*" shows >> that they clear

Re: scylladb

2017-03-12 Thread Edward Capriolo
The simple claim that "Scylla IS a drop in replacement for C*" shows that they clearly don't know as much as they think they do. Even if it did supposedly "support everything" it would not actually work like that. For example, some things in Cassandra work "the way they work" . They are not

Re: question of keyspace that just disappeared

2017-03-03 Thread Edward Capriolo
On Fri, Mar 3, 2017 at 7:56 AM, Romain Hardouin wrote: > I suspect a lack of 3.x reliability. Cassandra could have given up with > dropped messages but not with a "drop keyspace". I mean I already saw some > spark jobs with too many executors that produce a high load average

Re: Cassandra 3.10: ClassCastException in ThreadAwareSecurityManager

2017-03-31 Thread Edward Capriolo
I created https://issues.apache.org/jira/browse/CASSANDRA-13396 for you /** * The purpose of this class is */ this purpose of this class is ...what ? this class is who? sicka sicka slim shady. On Thu, Mar 30, 2017 at 1:48 PM, Anton PASSIOUK
