[jira] [Commented] (CASSANDRA-7438) Serializing Row cache alternative (Fully off heap)

Pavel Yaskevich (JIRA) Sun, 23 Nov 2014 20:53:09 -0800

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14222660#comment-14222660
 ]


Pavel Yaskevich commented on CASSANDRA-7438:
--------------------------------------------

Personally I like what Vijay did a bit more just because main ideas where taken 
from the memcached which is proven to be working fine for the majority of the 
use-cases and is pretty simple inside.

Regarding Robert's implementation I have few comments which he'll have to 
address (if not already) before I would consider this for inclusion:

- rehashing is must have, we want to grow/shrink caches based on usage to 
lessen burden on users trying to size it appropriately from day 1;
- if "put" operation fails it should at least invalidate previously inserted 
value if any, and probably kick-off maintenance activities like LRU cleanup 
and/or rehashing;
- Fixed size data block create a lot of allocation "slop" which could be 
sometimes take majority of allocate memory (e.g. Firefox had that problem), 
cache should at least have blocks of different sizes to minimize that;
- would be great to have benchmarks for per-partition CAS vs. per-partition RW 
lock in different operation modes, cache invalidation could be noticeable 
factor for performance as well as CAS-races;
- metrics (if not yet added).

Also based on discussion [~snazy] had with [[email protected]], I would 
avoid using DirectByteBuffer because they are a problematic to GC.


> Serializing Row cache alternative (Fully off heap)
> --------------------------------------------------
>
>                 Key: CASSANDRA-7438
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7438
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>         Environment: Linux
>            Reporter: Vijay
>            Assignee: Vijay
>              Labels: performance
>             Fix For: 3.0
>
>         Attachments: 0001-CASSANDRA-7438.patch
>
>
> Currently SerializingCache is partially off heap, keys are still stored in 
> JVM heap as BB, 
> * There is a higher GC costs for a reasonably big cache.
> * Some users have used the row cache efficiently in production for better 
> results, but this requires careful tunning.
> * Overhead in Memory for the cache entries are relatively high.
> So the proposal for this ticket is to move the LRU cache logic completely off 
> heap and use JNI to interact with cache. We might want to ensure that the new 
> implementation match the existing API's (ICache), and the implementation 
> needs to have safe memory access, low overhead in memory and less memcpy's 
> (As much as possible).
> We might also want to make this cache configurable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (CASSANDRA-7438) Serializing Row cache alternative (Fully off heap)

Reply via email to