Re: Optimizing queries for partition keys

2018-04-24 Thread Sam Klock
ter to use seek. > Feel free to open a JIRA ticket and attach your patch. It will be great if > you could add to the ticket your table schema as well > as some information on your environment (e.g. disk type). > > On Tue, Apr 17, 2018 at 8:53 PM, Sam Klock wrote: > >> Thanks

Re: Optimizing queries for partition keys

2018-05-08 Thread Sam Klock
K On 2018-04-24 14:16, Sam Klock wrote: > Thanks. For those interested: opened CASSANDRA-14415. > > SK > > On 2018-04-19 06:04, Benjamin Lerer wrote: >> Hi Sam, >> >> Your finding is interesting. Effectively, if the number of bytes to skip is >> larger than th

Question about PartitionUpdate.singleRowUpdate()

2018-12-19 Thread Sam Klock
Cassandra devs, I have a question about the implementation of PartitionUpdate.singleRowUpdate(), in particular the choice to use EncodingStats.NO_STATS when building the resulting PartitionUpdate. Is there a functional reason for that -- i.e., is it safe to modify it to use an EncodingStats built

Re: Question about PartitionUpdate.singleRowUpdate()

2018-12-20 Thread Sam Klock
; mailing list post. > > > On Wed, Dec 19, 2018 at 1:58 PM Sam Klock wrote: > >> Cassandra devs, >> >> I have a question about the implementation of >> PartitionUpdate.singleRowUpdate(), in particular the choice to use >> EncodingStats.NO_STATS whe

Reconciling expiring cells and tombstones

2015-06-16 Thread Sam Klock
Hi folks, I have a question about a design choice on how expiring cells are reconciled with tombstones. For two cells with the same timestamp, if one is expiring and one is a tombstone, Cassandra *always* prefers the tombstone. This matches its behavior for normal/non-expiring cells, but the fol

Re: Reconciling expiring cells and tombstones

2015-06-18 Thread Sam Klock
log.logentries.com/2014/03/synchronizing-clocks-in-a-cassandra-cluster-pt-2-solutions/ >> < >> https://blog.logentries.com/2014/03/synchronizing-clocks-in-a-cassandra-cluster-pt-2-solutions/ >>> >> >> >> Josef Lindman Hörnlund >> Chief Data Scientist &

Optimizing queries for partition keys

2018-03-22 Thread Sam Klock
Cassandra devs, We use workflows in some of our clusters (running 3.0.15) that involve "SELECT DISTINCT key FROM..."-style queries. For some tables, we observed extremely poor performance under light load (i.e., a small number of rows per second and frequent timeouts), which we eventually traced

Re: Optimizing queries for partition keys

2018-04-17 Thread Sam Klock
redicate with a count of 1, to make sure we do have a live column before including the partition key. " By using ColumnFilter.selectionBuilder(); you do not select all the columns. By consequence, some partitions might be returned while they should not. On Thu, Mar 22, 2018 at 6:24 PM