Re: scylladb

2017-03-10 Thread Rakesh Kumar
ailto:mrbur...@gmail.com>> wrote: They spend an enormous amount of time focusing on performance. You can expect them to continue on with their optimization and keep crushing it. P.S., I don't work for ScyllaDB. On Thu, Mar 9, 2017 at 6:02 PM, Rakesh Kumar <rakeshkumar...@outlook.com&

Re: scylladb

2017-03-09 Thread Rakesh Kumar
In all of their presentation they keep harping on the fact that scylladb is written in C++ and does not carry the overhead of Java. Still the difference looks staggering. From: daemeon reiydelle Sent: Thursday, March 9, 2017 14:21

Re: Limit on number of keyspaces/tables

2017-03-05 Thread Rakesh Kumar
> I ask back: what's your intention May be documenting the limitations of Cassandra to show Oracle is better :-) Am 05.03.2017 11:58 schrieb "Lata Kannan" >: ^

Re: Which compaction strategy when modeling a dumb set

2017-02-27 Thread Rakesh Kumar
typo: " If yes, it is considered a good practice for Cassandra" should read as " If yes, is it considered a good practice for Cassandra ?" ________ From: Rakesh Kumar <rakeshkumar...@outlook.com> Sent: Monday, February 27, 2017 10:06 To:

Re: Which compaction strategy when modeling a dumb set

2017-02-27 Thread Rakesh Kumar
Do you update this table when an event is processed? If yes, it is considered a good practice for Cassandra. I read somewhere that using Cassandra as a queuing table is anti pattern. From: Vincent Rischmann Sent: Friday, February

Cassandra version numbering

2017-02-23 Thread Rakesh Kumar
Is ver 3.0.10 same as 3.10. Cassandra website mentions this: Cassandra 3.10 Changelog But in other places 3.0.10 is mentioned.

Re: Partition size

2016-09-09 Thread Rakesh Kumar
On Fri, Sep 9, 2016 at 11:46 AM, Mark Curtis wrote: > If your partition sizes are over 100MB iirc then you'll normally see > warnings in your system.log, this will outline the partition key, at least > in Cassandra 2.0 and 2.1 as I recall. Has it improved in C* 3.x.

Re: Cassandra: The Definitive Guide, 2nd Edition

2016-08-02 Thread Rakesh Kumar
I have this book thru early release program by O'Reilly. It is a must-have book for those who want to learn how C* works under the hood.

Re: Blog post on Cassandra's inner workings and performance - feedback welcome

2016-07-07 Thread Rakesh Kumar
http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/ This is not a valid address. On Thu, Jul 7, 2016 at 9:11 AM, Manuel Kiessling wrote: > Hi all, > > I'm currently in the process of understanding the inner workings of >

Re: Backup strategy

2016-06-16 Thread Rakesh Kumar
On Thu, Jun 16, 2016 at 7:30 PM, Bhuvan Rawal wrote: > 2. Snapshotting : Hardlinks of sstables will get created. This is a very > fast process and latest data is captured into sstables after flushing > memtables, snapshots will be created in snapshots directory. But snapshot

Time series data with only inserts

2016-05-30 Thread Rakesh Kumar
Let us assume that there is a table which gets only inserts and under normal circumstances no reads on it. If we assume TTL to be 7 days, what event will trigger a compaction/purge of old data if the old data is not in the mem cache and no session needs it. thanks.

Re: Why is write failing

2016-03-28 Thread Rakesh Kumar
> This is in my cassandra-topology.properties my bad. I used wrong file, instead of rackdc properties file.

Why is write failing

2016-03-28 Thread Rakesh Kumar
Cassandra: 3.0.3 I am new to Cassandra. I am creating a test instance of four nodes, two in each data center. The idea is to verify that Cassandra can continue with writes even if one DC is down and we further lose one machine in the surviving DC. This is in my cassandra-topology.properties

Re: How many nodes do we require

2016-03-25 Thread Rakesh Kumar
On Fri, Mar 25, 2016 at 11:45 AM, Jack Krupansky wrote: > It depends on how much data you have. A single node can store a lot of data, > but the more data you have the longer a repair or node replacement will > take. How long can you tolerate for a full repair or node

How many nodes do we require

2016-03-25 Thread Rakesh Kumar
We have two data centers. Our requirement is simple Assuming that we have equal number of nodes in each DC we should be able to run with the loss of one DC and loss of at most one node in the surviving DC. Can this be achieved with 6 nodes (3 in each). Obviously for that all data must be

Re: Client drivers

2016-03-24 Thread Rakesh Kumar
> Every language has a different means of working with dependencies. Some are > compiled in (java, c), some are pulled in via libraries (python). You'll > have to be more specific. I am interested mainly in C++ and Java. Thanks.

Client drivers

2016-03-24 Thread Rakesh Kumar
Is it possible to install multiple versions of language drivers on the client machines. This will be typically useful during an upgrade process, where by fallback to the old version can be easy. thanks.

Apache Cassandra's license terms

2016-03-19 Thread Rakesh Kumar
What type of Open source license does Cassandra follow? If we use open source Cassandra for a revenue generating product, are we expected to contribute back our code to the open source. thanks

Python to type field

2016-03-19 Thread Rakesh Kumar
Hi I have a type defined as follows CREATE TYPE etag ( ttype int, tvalue text ); And this is used in a col of a table as follows evetag list > I have the following value in a file [{ttype: 3 , tvalue: '90A1'}] This gets inserted via COPY command with no issues. However when I try

Questions about Datastax support

2016-03-19 Thread Rakesh Kumar
Few questions: 1 - Has there been an announcement as to when Datastax will stop supporting 2.x version. I am aware that the community will stop supporting 2.x in Nov 2016. What about support to paid customers of Datastax. Will it go beyond Nov. 2 - Are there any plans by

Re: Questions about Datastax support

2016-03-18 Thread Rakesh Kumar
> 1. They have a published support policy: > http://www.datastax.com/support-policy/supported-software Why is the version number so different from the cassandra community edition. Take a look at this: 4.8.2Release NotesNov 11, 2015Mar 23, 2016Sep 23, 2017 What is version 4.8.2

Re:

2016-03-15 Thread Rakesh Kumar
How are you trying to insert. Paste your code here.

Re: What is wrong in this token function

2016-03-10 Thread Rakesh Kumar
k Krupansky On Thu, Mar 10, 2016 at 5:14 PM, Rakesh Kumar <dcrunch...@aim.com> wrote: I am using default Murmur3. So are you saying in case of Murmur3 the following two queries select count*) where customer_id = '289' and event_time >= '2016-03-01 18:45:00+' and event_time &l

Re: What is wrong in this token function

2016-03-10 Thread Rakesh Kumar
the hashes/tokens, so that tokens will not be ordered even if your PKs are ordered. You probably want to use customer as your partition key and event time as a clustering column - then you can use RDBMS-like WHERE conditions to select a slice of the partition. -- Jack Krupansky On Thu, Mar

Re: What is wrong in this token function

2016-03-10 Thread Rakesh Kumar
typo: the primary key was (customer_id + event_time ) -Original Message- From: Rakesh Kumar <dcrunch...@aim.com> To: user <user@cassandra.apache.org> Sent: Thu, Mar 10, 2016 4:44 pm Subject: What is wrong in this token function C* 3.0.3 I have a table table1 which has

What is wrong in this token function

2016-03-10 Thread Rakesh Kumar
C* 3.0.3 I have a table table1 which has the primary key on ((customer_id,event_id)). I loaded 1.03 million rows from a csv file. Business case: Show me all events for a given customer in a given time frame In RDBMS it will be (Query1) where customer_id = '289' and event_time >=

Possible bug in Cassandra

2016-03-09 Thread Rakesh Kumar
Cassandra : 3.3 CQLSH : 5.0.1 If there is a typo in the column name of the copy command, we get this: copy mytable (event_id,event_class_cd,event_ts,receive_ts,event_source_instance,client_id,client_id_type,event_tag,event_udf,client_event_date) from '/pathtofile.dat' with DELIMITER =

Automatically connect to any up node via cqlsh

2016-03-09 Thread Rakesh Kumar
Cassandra : 3.3 CQLSH : 5.0.1 Is it possible to set up cassandra/cqlsh so that if any node is down, cqlsh will automatically try to connect to the other surviving nodes, instead of erroring out. I know it is possible to supply ip_address and port of the UP node as arguments to cqlsh,

Re: Querying on index

2016-03-01 Thread Rakesh Kumar
Looks like Bloom filter size was the issue. Once I disabled it, the query returns rows correctly, but it was terrible slow (expected since it will hit SStable every time). -Original Message- From: Rakesh Kumar <dcrunch...@aim.com> To: user <user@cassandra.apache.org> Sent

Re: Querying on index

2016-03-01 Thread Rakesh Kumar
At this time no one else is using this table. So the data is static. -Original Message- From: Rakesh Kumar To: user Sent: Tue, Mar 1, 2016 4:54 pm Subject: Querying on index Cassandra: 3.3On my test system I create a tablecreate table eventinput( event_id varchar

Querying on index

2016-03-01 Thread Rakesh Kumar
Cassandra: 3.3 On my test system I create a table create table eventinput ( event_id varchar , event_class_cd int , event_ts timestamp , client_id varchar , event_message text , primary key ((client_id,event_id),event_ts) ) I created an index on client_id create

Disable writing to debug.log

2016-03-01 Thread Rakesh Kumar
Version: Cassandra 3.3 Can anyone tell on how to disable writing to debug.log. thanks.

CRT

2016-02-23 Thread Rakesh Kumar
https://www.aphyr.com/posts/294-jepsen-cassandra How much of this is still valid in ver 3.0. The above seems to have been written for ver 1.0. thanks.