Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Peter Schüller
Could you please tell me why? There might be pending sstable removals on disk, which won't happen until GC or restart. If you just did a bulk insert and checked diskspace immediately afterwards, I think this is a possible explanation. (See Write path on

Re: Why are writes faster than reads?

2010-05-25 Thread Peter Schüller
I'm fairly certain the write path hits the commit log first, then the memtable. I didn't mean to imply an ordering between the two (I probably should not have said memtable plus commit log...), and yes I believe so. -- / Peter Schuller aka scode

Re: how does cassandra compare with mongodb?

2010-05-14 Thread Peter Schüller
Not sure if this was mentioned, but MongoDB is strongly consistent while Cassandra is eventually consistent -- at least about a month ago when I looked at it in more detail, though with vector clocks in 0.7, this may be less of an issue. Did Mongo switch away from the fsync() every now and

Re: replication impact on write throughput

2010-05-11 Thread Peter Schüller
The biggest impact on your write performance will most likely be the consistency level of your writes. In other words, how many nodes you want to wait for before you acknowledge the write back to the client. I believe the consistency level is only expected to have a significant impact on

Re: bloom filter

2010-05-07 Thread Peter Schüller
what is the benefit of creating bloom filter when cassandra writes data, how does it helps ? It allows Cassandra to answer requests for non-existent keys without going to disk, except in cases where the bloom filter gives a false positive. See: