do Cassandra generate a event or log containing key value of column when a column expires due to TTL

2014-08-22 Thread Gaurav Bhatnagar
Hi, I have stored following data structure in cassandra RowKey: 119551747098 = (name=c:per:@batchId, value=ad1, timestamp=1408345109805011, ttl=1436489) = (name=c:per:@currency, value=USD, timestamp=1408345109805009, ttl=1436489) = (name=c:per:@decimalValue, value=2,

efficiently generate complete database dump in text format

2014-10-09 Thread Gaurav Bhatnagar
Hi, We have a Cassandra database column family containing 320 millions rows and each row contains about 15 columns. We want to take monthly dump of this single column family contained in this database in text format. We are planning to take following approach to implement this functionality 1.

how to scan all rows of cassandra using multiple threads

2015-02-24 Thread Gaurav Bhatnagar
Hi, I have a cassandra cluster of 3 nodes holding around 300 million rows of items. I have a replication factor of 3 with read/write consistency as Quorum. I want to scan all rows of database to generate sum of items having value available in column name state and value batch1 in column name

minimum bandwidth requirement between two Geo Redundant sites of Cassandra database

2015-04-27 Thread Gaurav Bhatnagar
Hi, Is there any minimum bandwidth requirement between two Geo Redundant data centres? What is the minimum latency that link between two Geo Redundant data centres should have to get best efficient operations? Regards, Gaurav

Re: Spark Memory Error - Not enough space to cache broadcast

2016-06-14 Thread Gaurav Bhatnagar
try setting the option --driver-memory 4G On Tue, Jun 14, 2016 at 3:52 PM, Ben Slater wrote: > A high level shot in the dark but in our testing we found Spark 1.6 a lot > more reliable in low memory situations (presumably due to >