Re: row cache hit is costlier for partiton with large rows

Sylvain Lebresne Wed, 21 Jan 2015 06:02:14 -0800

The row cache saves partition data off-heap, which means that every cache
hit require copying/deserializing the cached partition into the heap, and
the more rows per partition you cache, the long it will take. Which is why
it's currently not a good cache too much rows per partition (unless you
know what you're doing).


On Wed, Jan 21, 2015 at 1:15 PM, nitin padalia <padalia.ni...@gmail.com>
wrote:

> Hi,
>
> With two different families when I do a read, row cache hit is almost
> 15x costlier with larger rows (10000 rows per partition), in
> comparison to partition with only 100 rows.
>
> Difference in two column families is one is having 100 rows per
> partition another 10000 rows per partition. Schema for two tables is:
> CREATE TABLE table1_row_cache (
>   user_id uuid,
>   dept_id uuid,
>   location_id text,
>   locationmap_id uuid,
>   PRIMARY KEY ((user_id, location_id), dept_id)
> )
>
> CREATE TABLE table2_row_cache (
>   user_id uuid,
>   dept_id uuid,
>   location_id text,
>   locationmap_id uuid,
>   PRIMARY KEY ((user_id, dept_id), location_id)
> )
>
> Here is the tracing:
>
> Row cache Hit with Column Family table1_row_cache, 100 rows per partition:
>  Preparing statement [SharedPool-Worker-2] | 2015-01-20
> 14:35:47.540000 | x.x.x.x |           1023
>           Row cache hit [SharedPool-Worker-5] | 2015-01-20
> 14:35:47.542000 | x.x.x.x |           2426
>
> Row cache Hit with CF table2_row_cache, 10000 rows per partition:
> Preparing statement [SharedPool-Worker-1] | 2015-01-20 16:02:51.696000
> | x.x.x.x |            490
>          Row cache hit [SharedPool-Worker-2] | 2015-01-20
> 16:02:51.711000 | x.x.x.x |          15146
>
>
> If for both cases data is in memory why its not same? Can someone
> point me what wrong here?
>
> Nitin Padalia
>

Re: row cache hit is costlier for partiton with large rows

Reply via email to