[ 
https://issues.apache.org/jira/browse/SAMZA-543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14359728#comment-14359728
 ] 

Chris Riccomini commented on SAMZA-543:
---------------------------------------

bq. should disableDataSync be set to true ? I couldn't figure out the default 
value used in the JNI bindings.

We should force this to true. Looks like I was mistaken. disableDataSync 
[defaults to 
false|https://github.com/facebook/rocksdb/blob/master/java/src/main/java/org/rocksdb/DBOptionsInterface.java#L228]
 according to the Javadocs.

bq. Do we want to run any performance tests before we commit this change?

I think so, just to prove that it's still faster with the WAL disabled. Maybe 
you can modify TestKeyValuePerformance to make it useful? Right now it just 
tests a very specific use case (all with deletes). It'd be nice to test bulk 
loading some data both with/without WAL/data sync, and see how it performs.

bq. My suggestion is to do this one change at a time and measure the 
performance improvement.

Agreed.

> Disable WAL in RocksDB KV store
> -------------------------------
>
>                 Key: SAMZA-543
>                 URL: https://issues.apache.org/jira/browse/SAMZA-543
>             Project: Samza
>          Issue Type: Bug
>          Components: kv
>    Affects Versions: 0.9.0
>            Reporter: Chris Riccomini
>            Assignee: Navina Ramesh
>             Fix For: 0.9.0
>
>         Attachments: SAMZA-543-0.patch
>
>
> RocksDB uses a write-ahead log by default. This is unnecessary in Samza, 
> since we have full durability from a state store's changelog topic. We should 
> [disable the 
> WAL|https://github.com/facebook/rocksdb/wiki/Basic-Operations#asynchronous-writes]
>  in the RocksDB KV store.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to