UDF related question

2018-04-03 Thread Kant Kodali
Hi All, I was reading the article below and I was wondering how did one manage to block all I/O calls given that there is no byte code instruction for I/O in java instead all the I/O calls in java will go through *invokevirtual *byte code instruction. But this can call a C function that just add t

Is there a plan for Feature like this in C* ?

2018-07-02 Thread Kant Kodali
https://www.cockroachlabs.com/docs/v2.1/change-data-capture.html

Re: Is there a plan for Feature like this in C* ?

2018-07-02 Thread Kant Kodali
Hi Justin, Thanks, Looks like a very early stage feature and no integration with Kafka yet I suppose. Thanks! On Mon, Jul 2, 2018 at 6:24 PM, Justin Cameron wrote: > yes, take a look at http://cassandra.apache.org/ > doc/latest/operating/cdc.html > > On Tue, 3 Jul 2018 at 01:20

How to write a trigger in Cassandra to only detect updates of an existing row?

2016-10-04 Thread Kant Kodali
Hi all, How to write a trigger in Cassandra to detect updates? My requirement is that I want a trigger to alert me only when there is an update to an existing row and looks like given the way INSERT and Update works this might be hard to do because INSERT will just overwrite if there is an existin

Re: How to write a trigger in Cassandra to only detect updates of an existing row?

2016-10-04 Thread Kant Kodali
o overcome that, along with its associated overhead.  But all that said, it should be possible, though you'll have to write it for yourself in your trigger code. On Tue, Oct 4, 2016 at 12:29 PM Kant Kodali wrote: Hi all, How to write a trigger in Cassandra to detect updates? My requirement is t

Re: How to write a trigger in Cassandra to only detect updates of an existing row?

2016-10-04 Thread Kant Kodali
arth Verma On Wed, Oct 5, 2016 at 12:20 AM, Kant Kodali wrote: Thanks a lot, This helps me to make a decision on not to write one for the performance reasons you pointed out! On Tue, Oct 4, 2016 11:42 AM, Eric Stevens migh...@gmail.com wrote: You would have to perform a SELECT on the row in

Re: How to write a trigger in Cassandra to only detect updates of an existing row?

2016-10-04 Thread Kant Kodali
elete if length is zero."If you are sure case 3 will never occur,then checking the next.length(), you can decide whether it is an insert/update(length > 0) OR delete(length == 0) I would urge you to try the snippet once on you own, to see what kind of data it produces in next. You could dump

Re: Running Cassandra in Integration Tests

2016-10-06 Thread Kant Kodali
sure as long as that isolated instance is treated as separate cluster you shouldn't run into any problems. On Thu, Oct 6, 2016 4:08 PM, Ali Akhtar ali.rac...@gmail.com wrote: Is it possible to create an isolated cassandra instance which is run during integration tests and it disappears after

Re: Running Cassandra in Integration Tests

2016-10-06 Thread Kant Kodali
java api to programmatically start / destroy an instance during tests? On Fri, Oct 7, 2016 at 4:56 AM, Kant Kodali wrote: sure as long as that isolated instance is treated as separate cluster you shouldn't run into any problems. On Thu, Oct 6, 2016 4:08 PM, Ali Akhtar ali.rac...@gmai

Re: Running Cassandra in Integration Tests

2016-10-06 Thread Kant Kodali
M Ali Akhtar wrote: Ok, but that's not what this question is about. I'm looking for a way to run an embedded cassandra instance which is created & destroyed during tests and which doesn't persist any state outside the tests. On Fri, Oct 7, 2016 at 5:10 AM, Kant Kodali wrote: y

Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-12 Thread Kant Kodali
Hi All, I understand Cassandra can have a maximum of 2B rows per partition but in practice some people seem to suggest the magic number is 100K. why not create another partition/rowkey automatically (whenever we reach a safe limit that we consider would be efficient) with auto increment bigint

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-12 Thread Kant Kodali
st like with random > partition) > > Can you tell what rowkey1, page1, col(x) actually are ? Maybe there is a > way. > The most "recent", means there's a timestamp in there ? > > On Wed, Oct 12, 2016 at 9:58 AM, Kant Kodali wrote: > >> Hi All, >> >

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-12 Thread Kant Kodali
> >> Can you tell what rowkey1, page1, col(x) actually are ? Maybe there is a >> way. >> The most "recent", means there's a timestamp in there ? >> >> On Wed, Oct 12, 2016 at 9:58 AM, Kant Kodali wrote: >> >>> Hi All, >>> >>

what does this Note mean?

2016-10-13 Thread Kant Kodali
Note: Non-system keyspaces don't have the same replication settings, effective ownership information is meaningless

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-14 Thread Kant Kodali
e. >> >> The effect is something like this: >> http://www.healthnewsreview.org/2012/08/iced-tea-kidney-ston >> es-and-the-study-that-never-existed/ >> >> Iced tea does not cause kidney stones! Cassandra does not store rows with >> 2 billion columns! It is

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-14 Thread Kant Kodali
GB. (thats like 15 million columns where each column can have a data of size 1KB). On Fri, Oct 14, 2016 at 11:30 PM, Kant Kodali wrote: > "Robert said he could treat safely 10 15GB partitions at his presentation" > This sounds like there is there is a row limit too not only columns?

Is SASI index in Cassandra efficient for high cardinality columns?

2016-10-14 Thread Kant Kodali
I understand Secondary Indexes in general are inefficient on high cardinality columns but since SASI is built from scratch I wonder if the same argument applies there? If not, Why? Because I believe primary keys in Cassandra are indeed indexed and since Primary key is supposed to be the column with

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-15 Thread Kant Kodali
15GB partition". > What I wanted to say is we can store much more rows(and columns) in a > partition than before 3.6. > > 2016-10-15 15:34 GMT+09:00 Kant Kodali : > >> "Robert said he could treat safely 10 15GB partitions at his presentation" >> This sounds

Re: Is SASI index in Cassandra efficient for high cardinality columns?

2016-10-15 Thread Kant Kodali
1) a single indexed value is present in a lot of rows > 2) a single indexed value has only a few (if not just one) matching row > > > On Sat, Oct 15, 2016 at 8:37 AM, Kant Kodali wrote: > >> I understand Secondary Indexes in general are inefficient on high >> cardinality

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-15 Thread Kant Kodali
compaction > - bootstrapping of new nodes --> failure to stream a partition in the > middle will force to re-send the whole partition from the beginning again --> > the receiving node has a bunch of duplicate data --> pressure on compaction > > > > On Sat, Oct 15, 201

Re: Why does Cassandra need to have 2B column limit? why can't we have unlimited ?

2016-10-15 Thread Kant Kodali
dle of a partition, the operational pains will still > be there. Same for compaction > > > > On Sat, Oct 15, 2016 at 12:00 PM, Kant Kodali wrote: > >> 1) It will be great if someone can confirm that there is no limit >> 2) so what is optimal limit in terms of data size?

What is the point of counter type when we can do the same thing with int or bigint?

2016-10-17 Thread Kant Kodali
I just read the following link https://docs.datastax.com/en/cql/3.3/cql/cql_using/useCounters.html and I wonder what is the point of counter type when we can do the same thing with int or bigint? what are benefits of using counter data type?

Re: What is the point of counter type when we can do the same thing with int or bigint?

2016-10-17 Thread Kant Kodali
write using > lightweight transactions, which have a significant performance penalty. > > > > The primary reason for this is because no individual Cassandra node is > guaranteed to know the current state of any cell at the time the write > arrives – counters attempt to solve this > &g

Re: What is the point of counter type when we can do the same thing with int or bigint?

2016-10-17 Thread Kant Kodali
Also are you saying counters are atomic? On Mon, Oct 17, 2016 at 6:43 PM, Kant Kodali wrote: > How about “Set the value 1 above what it is now" ? The same principle > should apply right? > > > > > On Mon, Oct 17, 2016 at 6:21 PM, Jeff Jirsa > wrote: > >> Y

quick question

2016-10-19 Thread Kant Kodali
can Cassandra cluster direct or load balance the requests by detecting the resource usage of a particular node?

Re: quick question

2016-10-19 Thread Kant Kodali
/winguzone.com?from=list> - Hosted Cloud > CassandraLaunch your cluster in minutes.* > > > On Wed, 19 Oct 2016 06:14:27 -0400*Kant Kodali >* wrote > > can Cassandra cluster direct or load balance the requests by detecting the > resource usage of a particular node? > > >

Re: non incremental repairs with cassandra 2.2+

2016-10-19 Thread Kant Kodali
Another question on a same note would be what would be the fastest way to do repairs of size 10TB cluster ? Full repairs are taking days. So among repair parallel or repair sub range which is faster in the case of say adding a new node to the cluster? Sent from my iPhone > On Oct 19, 2016, at

Re: non incremental repairs with cassandra 2.2+

2016-10-19 Thread Kant Kodali
gt; use a proper tool to orchestrate it, that would save you from repairing all > 10TB each time. > CASSANDRA-12580 might help too as Romain showed us : > https://www.mail-archive.com/user@cassandra.apache.org/msg49344.html > > Cheers, > > > > On Wed, Oct 19, 2016 a

Re: non incremental repairs with cassandra 2.2+

2016-10-19 Thread Kant Kodali
you know what the caveats are and > use a proper tool to orchestrate it, that would save you from repairing all > 10TB each time. > CASSANDRA-12580 might help too as Romain showed us : > https://www.mail-archive.com/user@cassandra.apache.org/msg49344.html > > Cheers, > >

Re: non incremental repairs with cassandra 2.2+

2016-10-19 Thread Kant Kodali
Sorry I shouldn't have said adding a node. Sometimes data seems to be corrupted or inconsistent in which case would like to run a repair. Sent from my iPhone > On Oct 19, 2016, at 10:10 AM, Sean Bridges > wrote: > > Thanks, we will try that. > > Sean > >> On 16-10-19 09:34 AM, Alexander De

Re: Is SASI index in Cassandra efficient for high cardinality columns?

2016-10-21 Thread Kant Kodali
ee that for each search indexed value, you only > hit a single node (or N replicas depending on the used consistency level) > > However, materialized views have their own drawbacks (weeker consistency > guarantee) and you can't use range queries (<, >, ≤, ≥) or full text > se

What is the maximum value of Cassandra Counter Column?

2016-10-23 Thread Kant Kodali
What is the maximum value of Cassandra Counter Column?

Re: What is the maximum value of Cassandra Counter Column?

2016-10-23 Thread Kant Kodali
where does it say counter is implemented as long? On Sun, Oct 23, 2016 at 1:13 AM, Ali Akhtar wrote: > Probably: https://docs.oracle.com/javase/8/docs/api/java/ > lang/Long.html#MAX_VALUE > > On Sun, Oct 23, 2016 at 1:12 PM, Kant Kodali wrote: > >> What is the maxim

is there any problem having too many clustering columns?

2016-10-23 Thread Kant Kodali
Hi All, Is there any problem having too many clustering columns? My goal is to store data by columns in order and for any given partition (primary key) each of its non-clustering column (columns that are not part of primary key) can lead to a new column underneath or the CQL equivalent would be a

Re: is there any problem having too many clustering columns?

2016-10-23 Thread Kant Kodali
l create a bunch of tombstones and > duplicates of values > > > > On Sun, Oct 23, 2016 at 9:35 PM, Kant Kodali wrote: > >> Hi All, >> >> Is there any problem having too many clustering columns? My goal is to >> store data by columns in order and for any gi

which one of the following choices is more efficient?

2016-10-26 Thread Kant Kodali
If one were given a choice of fitting all the data into one table vs fitting the data into two tables while say (keeping all the runtime and space complexity for CRUD operations the same in either case) which one would you choose and why?

Re: which one of the following choices is more efficient?

2016-10-26 Thread Kant Kodali
I guess the question can be rephrased into "What is the overhead of creating and maintaining an additional table?" On Wed, Oct 26, 2016 at 1:12 AM, Ali Akhtar wrote: > Depends on the use case. No one right answer. > > On Wed, Oct 26, 2016 at 1:03 PM, Kant Kodali wrote: >

Re: which one of the following choices is more efficient?

2016-10-26 Thread Kant Kodali
just restating my first question. On Wed, Oct 26, 2016 at 1:19 AM, Ali Akhtar wrote: > You would need to do each write twice and data will take up twice the > space as its duplicated in two places. > > On Wed, Oct 26, 2016 at 1:17 PM, Kant Kodali wrote: > >> I guess the question c

Re: which one of the following choices is more efficient?

2016-10-26 Thread Kant Kodali
levels are not necessarily logN in my case because I may divide the number of nodes at each level by 1/4 or 1/8. On Wed, Oct 26, 2016 at 1:24 AM, Kant Kodali wrote: > @Ali hmm..I didn't mean to say I store the same data across two tables and > neither tables are dependent for me to do the w

How to handle dependent writes in the most efficient way?

2016-10-26 Thread Kant Kodali
I have a scenario where every write/row depends on some of the data written in the previous row so I end up doing a read before write which is degrading the performance by a significant margin so I am thinking if I should keep track of the last row written for every partition in a cache so every su

question on an article

2016-10-31 Thread Kant Kodali
Hi Guys, I keep reading the articles below but the biggest questions for me are as follows 1) what is the "data size" per request? without data size it hard for me to see anything sensible 2) is there batching here? http://www.datastax.com/1-million-writes http://techblog.netflix.com/2014/07/r

Re: question on an article

2016-10-31 Thread Kant Kodali
tching - it may be one of the parameters to >> stress.jar. >> >> Peter >> >> On Mon, Oct 31, 2016 at 4:07 PM, Kant Kodali wrote: >> >>> Hi Guys, >>> >>> >>> I keep reading the articles below but the biggest questions for m

Re: Cassandra Poor Read Performance Response Time

2016-11-01 Thread Kant Kodali
+1 On Tue, Nov 1, 2016 at 2:10 AM, _ _ wrote: > Hi > > Currently i am running a cassandra cluster of 3 nodes (with it replicating > to both nodes) and am experiencing poor performance, usually getting second > response times when running queries when i am expecting/needing millisecond > response

Re: Custom receiver for WebSocket in Spark not working

2016-11-02 Thread Kant Kodali
I don't see a store() call in your receive(). Search for store() in here http://spark.apache.org/docs/latest/streaming-custom-receivers.html On Wed, Nov 2, 2016 at 10:23 AM, Cassa L wrote: > Hi, > I am using spark 1.6. I wrote a custom receiver to read from WebSocket. > But when I start my spar

Is there a way to do Read and Set at Cassandra level?

2016-11-05 Thread Kant Kodali
I have a requirement where I need to know last value that is written successfully so I could read that value and do some computation and include it in the subsequent write. For now we are doing read before write which significantly degrades the performance. Light weight transactions are more of a c

Re: Is there a way to do Read and Set at Cassandra level?

2016-11-05 Thread Kant Kodali
lly so I could read that value and do some computation and include > it in the subsequent write" > > Maybe keeping the last written value in a distributed cache is cheaper > than doing a read before write in Cassandra ? > > On Sat, Nov 5, 2016 at 11:24 AM, Kant Kodali wrote:

Re: Java GC pauses, reality check

2016-11-25 Thread Kant Kodali
+1 Chris Lohfink response I would also restate the following sentence "java GC pauses are pretty much a fact of life" to "Any GC based system pauses are pretty much a fact of life". I would be more than happy to see if someone can counter prove. On Fri, Nov 25, 2016 at 1:41 PM, Chris Lohfink

Re: Java GC pauses, reality check

2016-11-26 Thread Kant Kodali
5.11.2016 23:38 schrieb "Kant Kodali" : > >> +1 Chris Lohfink response >> >> I would also restate the following sentence "java GC pauses are pretty >> much a fact of life" to "Any GC based system pauses are pretty much a >> fact of li

Re: Java GC pauses, reality check

2016-11-26 Thread Kant Kodali
Good to know about Zing! I will have to take a look. On Sat, Nov 26, 2016 at 8:27 PM, Kant Kodali wrote: > Benjamin Roth: How do you know Arc eliminates GC pauses completely? By > completely I mean no GC pauses whatsoever. > > When you say Java is NOT the First choice for Server

Re: Java GC pauses, reality check

2016-11-26 Thread Kant Kodali
writes/s > with azul .we never had a major gc above 10 ms . > > Sent from my iPhone > > > On Nov 25, 2016, at 3:49 PM, Martin Schröder wrote: > > > > 2016-11-25 23:38 GMT+01:00 Kant Kodali : > >> I would also restate the following sentence "java GC pauses ar

Re: Java GC pauses, reality check

2016-11-26 Thread Kant Kodali
t; are build in c or c++ has never been such a pain. > > On the other Hand Java is easier to handle for Developers. And coding > plain c is also a pain. > > Thats why i Said its a philosophic discussion. > Anyway Cassandra rund on Java so We have to Deal with it. > > Am 2

Re: Java GC pauses, reality check

2016-11-27 Thread Kant Kodali
looked very > interesting and promising! By the way it's written in C++. > > > 2016-11-27 7:06 GMT+01:00 Kant Kodali : > >> Automatic Reference counting sounds like college level idea that we all >> have been hearing for since GC is born! There seem to be bunch of co

Re: Java GC pauses, reality check

2016-11-28 Thread Kant Kodali
's Garbage Collector?) What timeouts are you referring to here? Thanks, kant On Sun, Nov 27, 2016 at 9:57 PM, Harikrishnan Pillai < hpil...@walmartlabs.com> wrote: > Hi @Kant Kodali, > > We have multiple clusters running zing . > > One cluster has 11/11 and another one

Re: Java GC pauses, reality check

2016-11-28 Thread Kant Kodali
C4 (Zing's Garbage Collector?)* *What timeouts are you referring to ?* On Mon, Nov 28, 2016 at 7:39 AM, Harikrishnan Pillai < hpil...@walmartlabs.com> wrote: > Hi @Kant Kodali, > > 11 /11 , 11 nodes in DC1 and 11 nodes in DC2. > > > ------

What is the size of each Virtual Node token range?

2016-11-28 Thread Kant Kodali
1) What is the size of each Virtual Node token range? 2) Are all Vnode token ranges in one server are of the same size? 3) If these token ranges are predefined then isn't it implying that the maximum total number of rows in a server is also predefined? maximum total number of rows in a server = nu

Re: What is the size of each Virtual Node token range?

2016-11-28 Thread Kant Kodali
ies and comparing them with a key you are looking for. > > This thesis is maybe more correct: > There can be no more than 2^64 nodes in a cluster as then 2 nodes would > share exactly the same token and this does not make really sense. > > 2016-11-28 17:28 GMT+01:00 Kant Kodali

Are Materialized views persisted on disk?

2016-12-13 Thread Kant Kodali
Are Materialized views persisted on disk? sorry for the naive question.

quick questions

2016-12-17 Thread Kant Kodali
I keep hearing that the minimum number of Cassandra nodes required to achieve Quorum consensus is 4 I wonder why not 3? In fact, many container deployments by default seem to deploy 4 nodes. Can anyone shine some light on this? What happens if I have 3 nodes and replication factor of 3 and consist

Re: quick questions

2016-12-17 Thread Kant Kodali
t, Dec 17, 2016 at 10:21 PM, Kant Kodali wrote: > >> I keep hearing that the minimum number of Cassandra nodes required to >> achieve Quorum consensus is 4 I wonder why not 3? In fact, many container >> deployments by default seem to deploy 4 nodes. Can anyone shine some light

Re: quick questions

2016-12-17 Thread Kant Kodali
; consistency then the read/write operation will fail. You could still do > reads/writes with CL=ONE, though (provided that at least 1 of the replicas > was up). > > - Max > > > On Dec 17, 2016, at 1:21 pm, Kant Kodali wrote: > > > > I keep hearing that the minim

Re: quick questions

2016-12-18 Thread Kant Kodali
kload > > > > Option (a) will impact more the cluster stability than (b). > > > > Dominique > > > > [@@ THALES GROUP INTERNAL @@] > > > > *De :* Kant Kodali [mailto:k...@peernova.com] > *Envoyé :* samedi 17 décembre 2016 22:21 > *À :* user@cassandra.apache.o

Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-20 Thread Kant Kodali
Looking at this http://www.theregister.co.uk/2016/12/16/oracle_targets_java_users_non_compliance/?mt=1481919461669 I don't know why Cassandra recommends Oracle JVM? JVM is a great piece of software but I would like to stay away from Oracle as much as possible. Oracle is just horrible the way they

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-21 Thread Kant Kodali
offer support but maybe only for Oracle JDK. > >Twitter uses OpenJDK, but they have their own JVM support team. Not >sure everyone can afford that. > > As a side note I’ll add that Oracle is paying talented engineers to work > on the JVM to make it great. > > Cheers,

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-21 Thread Kant Kodali
ice > > On Wed, Dec 21, 2016 at 11:34 AM, Kant Kodali wrote: > >> yeah well I don't think Oracle is treating Java the way Google is >> treating Go and I am not a big fan of Go mainly because I understand the >> JVM is far more robust than anything that is out there.

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-21 Thread Kant Kodali
https://www.elastic.co/guide/en/elasticsearch/guide/current/_java_virtual_machine.html On Wed, Dec 21, 2016 at 2:58 AM, Kant Kodali wrote: > The fact is Oracle is horrible :) > > > On Wed, Dec 21, 2016 at 2:54 AM, Brice Dutheil > wrote: > >> Let's not debate opin

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-21 Thread Kant Kodali
https://www.youtube.com/watch?v=9ei-rbULWoA On Wed, Dec 21, 2016 at 2:59 AM, Kant Kodali wrote: > https://www.elastic.co/guide/en/elasticsearch/guide/ > current/_java_virtual_machine.html > > On Wed, Dec 21, 2016 at 2:58 AM, Kant Kodali wrote: > >> The fact is Oracle is ho

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-22 Thread Kant Kodali
uff that was never free is still not free - >> the change is that Oracle may actually be interested in the fact that some >> are using non-free products for free. >> >> Pretty much a non-story, it seems like. >> >> On Tue, Dec 20, 2016 at 11:55 PM, Kant Kodali wr

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-23 Thread Kant Kodali
Java 9 Module system looks really interesting. I would be very curious to see how Cassandra would leverage that. On Thu, Dec 22, 2016 at 9:09 AM, Kant Kodali wrote: > I would agree with Eric with his following statement. In fact, I was > trying to say the same thing. > > "I

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-24 Thread Kant Kodali
nd by saying Oracle just has lot of self interest but I really hope that I am wrong since I am a big fan of JVM. On Fri, Dec 23, 2016 at 7:15 PM, Edward Capriolo wrote: > > On Fri, Dec 23, 2016 at 6:01 AM, Kant Kodali wrote: > >> Java 9 Module system looks really interest

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2016-12-26 Thread Kant Kodali
> > > -- Brice > > On Mon, Dec 26, 2016 at 3:55 PM, Edward Capriolo > wrote: > >> >> >> On Sat, Dec 24, 2016 at 5:58 AM, Kant Kodali wrote: >> >>> @Edward Agreed JVM is awesome and it is a work of many smart people and >>> this is

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Kant Kodali
This is a subjective question and of course it would turn into opinionated answers and I think we should welcome that (Nothing wrong in debating a topic). we have many such debates as SE's such as programming language comparisons, Architectural debates, Framework/Library debates and so on. people w

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Kant Kodali
ey. You look at Google or FB and see how much open source contribution they have done. Oracle doesnt come anywhere close to that. On Mon, Jan 2, 2017 at 8:08 PM, Edward Capriolo wrote: > > > On Mon, Jan 2, 2017 at 8:30 PM, Kant Kodali wrote: > >> This is a subjective question

Re: Cassandra cluster performance

2017-01-06 Thread Kant Kodali
yeah you should async writes also you cannot neglect data size so you might want to let us know what your data size is? On Thu, Jan 5, 2017 at 2:57 PM, kurt Greaves wrote: > you should try switching to async writes and then perform the test. sync > writes won't make much difference from a sing

question on multi DC setup and LWT's

2017-01-23 Thread Kant Kodali
HI Guys, Lets say I have 2 DC's and I have 3 node cluster on each DC and one replica on each DC. I would like to maintain Strong consistency and high availability so 1) First of all, How do I even set up one replica on each DC? 2) what should my read and write consistent levels be when I am using

quick question

2017-02-01 Thread Kant Kodali
When you initiate a LWT(write) and do a QUORUM read is there a chance that one might not see the LWT write ? If so, can someone explain a bit more? Thanks!

Re: quick question

2017-02-01 Thread Kant Kodali
- al...@thelastpickle.com > France > > The Last Pickle - Apache Cassandra Consulting > http://www.thelastpickle.com > > 2017-02-01 10:57 GMT+01:00 Kant Kodali : > >> When you initiate a LWT(write) and do a QUORUM read is there a chance >> that one might not see the LWT write ? If so, can someone explain a bit >> more? >> >> Thanks! >> > >

Re: quick question

2017-02-01 Thread Kant Kodali
What is the difference between accepting a value and committing a value? On Wed, Feb 1, 2017 at 4:25 AM, Kant Kodali wrote: > Hi, > > Thanks for the response. I finished watching this video but I still got > few questions. > > 1) The speaker seems to suggest that t

Re: quick question

2017-02-01 Thread Kant Kodali
Adding dev only for this thread. On Wed, Feb 1, 2017 at 4:39 AM, Kant Kodali wrote: > What is the difference between accepting a value and committing a value? > > > > On Wed, Feb 1, 2017 at 4:25 AM, Kant Kodali wrote: > >> Hi, >> >> Thanks for the response.

Why does CockroachDB github website say Cassandra has no Availability on datacenter failure?

2017-02-07 Thread Kant Kodali
Why does CockroachDB github website say Cassandra has no Availability on datacenter failure? https://github.com/cockroachdb/cockroach

Re: Why does CockroachDB github website say Cassandra has no Availability on datacenter failure?

2017-02-07 Thread Kant Kodali
lol. But seriously are they even allowed to say something that is not true about another product ? On Tue, Feb 7, 2017 at 4:05 AM, kurt greaves wrote: > Marketing never lies. Ever >

Re: AW: Why does CockroachDB github website say Cassandra has noAvailability on datacenter failure?

2017-02-07 Thread Kant Kodali
Re: Why does CockroachDB github website say Cassandra has > noAvailability on datacenter failure? > > > > The link you posted doesn't say anything about Cassandra > > Le 7 févr. 2017 11:41, "Kant Kodali" a écrit : > > Why does CockroachDB github website say Cassandra has no Availability on > datacenter failure? > > https://github.com/cockroachdb/cockroach > > > >

Re: AW: Why does CockroachDB github website say Cassandra has noAvailability on datacenter failure?

2017-02-07 Thread Kant Kodali
https://github.com/cockroachdb/cockroach/commit/f46a547827d3439b57baa5c3a11f8f9ad2d8b153 On Tue, Feb 7, 2017 at 3:20 PM, Kant Kodali wrote: > LOL They took down that image finally!! But I would still keep an eye on > what kind of fake benchmarks they might come up with. > > On Tue,

If reading from materialized view with a consistency level of quorum am I guaranteed to have the most recent view?

2017-02-09 Thread Kant Kodali
If reading from materialized view with a consistency level of quorum am I guaranteed to have the most recent view? other words is w + r > n contract maintained for MV's as well for both reads and writes? Thanks!

How does cassandra achieve Linearizability?

2017-02-09 Thread Kant Kodali
How does Cassandra achieve Linearizability with “Last write wins” (conflict resolution methods based on time-of-day clocks) ? Relying on synchronized clocks are almost certainly non-linearizable, because clock timestamps cannot be guaranteed to be consistent with actual event ordering due to clock

Re: How does cassandra achieve Linearizability?

2017-02-09 Thread Kant Kodali
stamps/ > > We use our own NTP cluster to reduce clock drift as much as possible, but > public NTP servers are good enough for most uses. https://www.instaclustr. > com/blog/2015/11/05/apache-cassandra-synchronization/ > > Cheers, > Justin > > On Thu, 9 Feb 2017 at 16:0

Re: How does cassandra achieve Linearizability?

2017-02-09 Thread Kant Kodali
ably stratum 2 or 3) source. > > As Jonathan mentioned, there's no guarantee from Cassandra, but if you > need as close as you can get, you'll probably need to do it yourself. > > (I run several stratum 2 ntpd servers for pool.ntp.org) > > -- > Kind regards, &

Re: How does cassandra achieve Linearizability?

2017-02-10 Thread Kant Kodali
going away anytime soon. > This problem is sometimes addressed with event sourcing rather than > mutating in place. > > Hope this helps. > > Jon > > > On Feb 9, 2017, at 5:21 PM, Kant Kodali wrote: > > @Justin I read this article http://www.datastax.com/dev/ > blo

Re: How does cassandra achieve Linearizability?

2017-02-10 Thread Kant Kodali
"That’s the safety blanket everyone wants but is extremely expensive, especially in Cassandra." yes LWT's are expensive. Are there any plans to make this better? On Fri, Feb 10, 2017 at 12:17 AM, Kant Kodali wrote: > Hi Jon, > > Thanks a lot for your response. I am

Re: How does cassandra achieve Linearizability?

2017-02-10 Thread Kant Kodali
pensive. CS is made for > A+P mostly with tunable C. In ACID databases this is a completely different > thing as they are mostly either not partition tolerant, not highly > available or not scalable (in a distributed manner, not speaking of > "monolithic super servers").

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-02-11 Thread Kant Kodali
Saw this one today... https://news.ycombinator.com/item?id=13624062 On Tue, Jan 3, 2017 at 6:27 AM, Eric Evans wrote: > On Mon, Jan 2, 2017 at 2:26 PM, Edward Capriolo > wrote: > > Lets be clear: > > What I am saying is avoiding being loose with the word "free" > > > > https://en.wikipedia.org

Re: How does cassandra achieve Linearizability?

2017-02-16 Thread Kant Kodali
o satisfy N = 2F + 1 constraint to tolerate F failures ? I understand it is not needed when not using LWT's since Cassandra is a master-less system. On Fri, Feb 10, 2017 at 10:25 AM, Kant Kodali wrote: > Thanks Ariel! Yes I knew there are so many variations and optimizations of > Paxos. I j

is there a query to find out the largest partition in a table?

2017-02-18 Thread Kant Kodali
is there a query to find out the largest partition in a table? Does the query below give me the largest partition? select max(mean_partition_size) from size_estimates ; Thanks, Kant

Re: is there a query to find out the largest partition in a table?

2017-02-18 Thread Kant Kodali
61.21 802187.44 10299432635 43388628 Min 0.00 5.72 9.89 125 5 Max 6.00 668489.538582860.53 10299432635 43388628 On Sat, Feb 18, 2017 at 12:28 AM, Kant Kodali wrote

question on uda/udf

2017-02-18 Thread Kant Kodali
Hi All, Goal: want to create check_duplicate UDA on a blob column Context: I have a partition of 10Million rows with size of 10GB (I know this is bad). I want to check if there are duplicate in a blob column in this partition. The blob column can at most be 256 bytes. Question: can I create stat

Are Cassandra Triggers Thread Safe? ("Tough questions perhaps!")

2017-02-20 Thread Kant Kodali
Hi, 1. Are Cassandra Triggers Thread Safe? what happens if two writes invoke the trigger where the trigger is trying to modify same row in a partition? 2. Had anyone used it successfully on production? If so, any issues? (I am using the latest version of C* 3.10) 3. I have partitions that are abou

Does C* coordinator writes to replicas in same order or different order?

2017-02-20 Thread Kant Kodali
Hi, when C* coordinator writes to replicas does it write it in same order or different order? other words, Does the replication happen synchronously or asynchrnoulsy ? Also does this depend sync or async client? What happens in the case of concurrent writes to a coordinator ? Thanks, kant

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Kant Kodali
s, you'll have a large penalty for latency. > > On Mon, Feb 20, 2017 at 10:11 PM, Kant Kodali wrote: > >> Thanks again for the response! if they mean it between client and server >> I am not sure why they would use the word "replication" in the statement >>

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Kant Kodali
was not designed to guarantee order. It was build to be linear > scalable, highly concurrent and eventual consistent. > To me it sounds like a ACID DB better serves what you are asking for. > > 2017-02-21 10:17 GMT+01:00 Kant Kodali : > >> Agreed that async performs better than sync i

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Kant Kodali
der based on > PK (locally) and update "the pointer" on every write (also locally). If you > then store your pointer with the last known timestamp of your base data, > you also have a LWW on your pointer so also the last pointer wins when > reading with > CL_ONE. > But t

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Kant Kodali
It looks like there is ordering within one client (ordering based on timestamp) and looks like this *order is preserved across all replicas* however the benefits of async given the ordering restriction are slightly blur for me. On Tue, Feb 21, 2017 at 2:35 AM, Kant Kodali wrote: > Agreed w

  1   2   >