Re: Getting all unique keys

2017-08-20 Thread Christophe Schmitz
Hi Avi, have you thought of using Spark for that work? If you co-locate the Spark workers on each Cassandra node, the spark-cassandra connector will automatically split the token range for you in such a way that each Spark worker only hits the local Cassandra node. This will also be done in

Re: Moving all LCS SSTables to a repaired state

2017-08-20 Thread kurt greaves
Correction: full repairs do mark SSTables as repaired in 2.2 (CASSANDRA-7586). My mistake, I thought that was only introduced in 3.0. Note that if you are mixing full and incremental repairs, you probably want to be using at least 2.2.10 because of

Re: Getting all unique keys

2017-08-20 Thread Avi Levi
Thank you very much. One question: you wrote that I do not need DISTINCT here since it's part of the primary key, but only the combination is unique (PRIMARY KEY (id, timestamp)). Also, if I take the last token and feed it back as you showed, wouldn't I get overlapping boundaries? On

Re: Getting all unique keys

2017-08-20 Thread Eric Stevens
You should be able to fairly efficiently iterate all the partition keys like:

select id, token(id) from table where token(id) >= -9204925292781066255 limit 1000;

id | system.token(id)
+-- ...
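The paging idea in this reply, and the overlapping-boundaries concern raised in the follow-up, can be sketched in Python. This is a minimal sketch, not the poster's code: `iterate_partition_keys` and `fetch_page` are hypothetical names, `fetch_page` stands in for whatever runs the CQL query against the cluster, and the token lower bound assumes the default Murmur3Partitioner. The first page uses an inclusive bound (`>=`); every later page uses a strict bound (`>`) on the last token seen, so pages do not overlap. Two different keys that hash to the same token could be missed at a page boundary; that is rare but possible.

```python
MIN_TOKEN = -2**63  # assumed lower bound of the Murmur3Partitioner token range


def iterate_partition_keys(fetch_page, page_size=1000):
    """Yield each partition key once, walking the token ring page by page.

    fetch_page(lower, inclusive, limit) is a hypothetical callable that
    returns (id, token) rows ordered by token, for example by running
    SELECT id, token(id) FROM my_table WHERE token(id) > ? LIMIT ?
    """
    lower, inclusive = MIN_TOKEN, True
    last_key = None
    while True:
        rows = fetch_page(lower, inclusive, page_size)
        if not rows:
            return
        for key, tok in rows:
            # Rows of one partition are adjacent, so comparing with the
            # previous key is enough to drop repeats.
            if key != last_key:
                yield key
                last_key = key
            lower = tok
        # Later pages start strictly after the last token already seen.
        inclusive = False


# In-memory stand-in for the table, for illustration only.
_ROWS = [("a", -5), ("a", -5), ("a", -5), ("b", 0), ("c", 7), ("c", 7)]


def fake_fetch_page(lower, inclusive, limit):
    matching = [r for r in _ROWS if (r[1] >= lower if inclusive else r[1] > lower)]
    return matching[:limit]
```

Calling `list(iterate_partition_keys(fake_fetch_page, page_size=2))` walks the stand-in table in pages of two rows and returns each key once.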

Re: Moving all LCS SSTables to a repaired state

2017-08-20 Thread kurt greaves
Pretty much. I wouldn't set your heart on having 0 unrepaired SSTables.

Re: Getting all unique keys

2017-08-20 Thread Avi Levi
I need to get all unique keys (not the complete primary key, just the partition key) in order to aggregate all the relevant records for each key and apply some calculations to them.

CREATE TABLE my_table (
    id text,
    timestamp bigint,
    value double,
    PRIMARY KEY (id, timestamp)
)
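To make the goal concrete, here is a minimal Python sketch of the per-key aggregation described above. It assumes the rows have already been read as (id, timestamp, value) tuples matching the table definition; `aggregate_by_partition_key` and `mean_value` are hypothetical names, and the mean is only a placeholder for whatever calculation is actually needed.

```python
from collections import defaultdict


def aggregate_by_partition_key(rows, calculate):
    """Group (id, timestamp, value) rows by id, then apply calculate() per key."""
    grouped = defaultdict(list)
    for key, timestamp, value in rows:
        grouped[key].append((timestamp, value))
    return {key: calculate(records) for key, records in grouped.items()}


def mean_value(records):
    # Placeholder calculation: average of the value column for one key.
    return sum(value for _, value in records) / len(records)
```

For a large table, grouping everything in memory like this may not be practical; processing one partition key at a time is an alternative.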