Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread sharanabasava raddi
-- *From:* sharanabasava raddi *Sent:* 2010-05-25 13:46:38 *To:* user@cassandra.apache.org *Cc:* *Subject:* Why Cassandra is space inefficient compared to MySQL? Hi all, I am running Cassandra on a single-node Windows XP machine. I inserted about 10 million

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Peter Schüller
Could you please tell me why? There might be pending sstable removals on disk, which won't happen until GC or restart. If you just did a bulk insert and checked disk space immediately afterwards, I think this is a possible explanation. (See Write path on

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread sharanabasava raddi
Hi Peter, Thanks a lot. Regards, Sharan 2010/5/25 Peter Schüller sc...@spotify.com Could you please tell me why? There might be pending sstable removals on disk, which won't happen until GC or restart. If you just did a bulk insert and checked disk space immediately afterwards, I think

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Ryan King
Also, timestamps for each column. -ryan On Tue, May 25, 2010 at 5:41 AM, Jonathan Ellis jbel...@gmail.com wrote: That's true. But fundamentally Cassandra is expected to use more space than MySQL for a few reasons; usually the biggest factor is that Cassandra has to write out each column name

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Chris Goffinet
My money is on the fact that the serializer is just horribly verbose. It's using a basic set of the Java serializer. -Chris On Tue, May 25, 2010 at 10:02 AM, Ryan King r...@twitter.com wrote: Also, timestamps for each column. -ryan On Tue, May 25, 2010 at 5:41 AM, Jonathan Ellis

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Jonathan Ellis
The only place we use a Java serializer is for the BitSet in bloom filters. On Tue, May 25, 2010 at 12:37 PM, Chris Goffinet goffi...@digg.com wrote: My money is on the fact that the serializer is just horribly verbose. It's using a basic set of the Java serializer. -Chris On Tue, May 25,

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Robert Edmonds
On 2010-05-25, Jonathan Ellis jbel...@gmail.com wrote: That's true. But fundamentally Cassandra is expected to use more space than MySQL for a few reasons; usually the biggest factor is that Cassandra has to write out each column name in each row, since column names are dynamic unlike in
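
To make the two factors named in this thread concrete (a column name written out in every row, plus a timestamp per column), here is a rough back-of-the-envelope sketch. It is not Cassandra's actual on-disk format: the function names, the example column names, the value sizes, and the 8-byte timestamp width are all illustrative assumptions, and it ignores indexes, bloom filters, length prefixes, and not-yet-compacted sstables.

```python
# Rough size model only; every number and name here is an assumption
# chosen for illustration, not a measurement of Cassandra or MySQL.

def fixed_schema_bytes(num_rows, columns):
    """Column names live in the table schema, so each row stores only values."""
    return num_rows * sum(columns.values())


def self_describing_bytes(num_rows, columns, timestamp_bytes=8):
    """Each stored column repeats its name and carries its own timestamp."""
    per_row = sum(
        len(name.encode("utf-8")) + timestamp_bytes + value_size
        for name, value_size in columns.items()
    )
    return num_rows * per_row


# Hypothetical record layout: column name -> value size in bytes.
columns = {"user_id": 8, "created_at": 8, "status": 1}
rows = 10_000_000  # matches the record count mentioned in the original post

fixed = fixed_schema_bytes(rows, columns)
dynamic = self_describing_bytes(rows, columns)
print(f"fixed schema:    {fixed / 1e6:.0f} MB")
print(f"self-describing: {dynamic / 1e6:.0f} MB")
print(f"ratio:           {dynamic / fixed:.2f}x")
```

With small values like these, names and timestamps dominate the per-row cost, which is why shorter column names are a common way to reduce the footprint in this kind of layout.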

Why Cassandra is space inefficient compared to MySQL?

2010-05-24 Thread sharanabasava raddi
Hi all, I am running Cassandra on a single-node Windows XP machine. I inserted about 10 million records into Cassandra; it took around 90 minutes and 8 GB of space. For the same number of records, MySQL takes 3 GB of space. Could you please tell me why? And please Give