Re: how large cassandra could scale when it need to do manual operation?

2011-07-09 Thread Chris Goffinet
, e.g. Twitter currently runs a couple hundred Cassandra nodes across a half dozen clusters. http://www.datastax.com/2011/06/chris-goffinet-of-twitter-to-speak-at-cassandra-sf-2011 http://www.datastax.com/2011/06/chris-goffinet-of-twitter-to-speak-at-cassandra-sf-2011If you are working with a 3

Re: commodity server spec

2011-09-03 Thread Chris Goffinet
It will also depend on how long you can handle recovery time. So imagine this case: 3 nodes w/ RF of 3 Each node has 30TB of space used (you never want to fill up entire node). If one node fails and you must recover, that will take over 3.6 days in just transferring data alone. That's with a

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Chris Goffinet
Twitter runs 0.8 in production/closer to trunk. No big issues from us. On Thu, Sep 8, 2011 at 8:53 PM, Eric Czech e...@nextbigsound.com wrote: We just migrated from .7.5 to .8.4 in our production environment and it was definitely the least painful transition yet (coming all the way from the

Re: shutdown by KILL

2011-09-10 Thread Chris Goffinet
For things like rolling restarts, we do: disablethrift disablegossip (...wait for all nodes to see this node go down..) drain 2011/9/10 Radim Kolar h...@sendmail.cz what is recommended node stop method. drain or kill Java process? i haven't seen anybody using drain in stop scripts yet If i

Re: Massive writes when only reading from Cassandra

2011-09-10 Thread Chris Goffinet
You could tail the commit log with `strings` to see what keys are being inserted. On Sat, Sep 10, 2011 at 2:24 PM, Jonathan Ellis jbel...@gmail.com wrote: Two possibilities: 1) Hinted handoff (this will show up in the logs on the sending machine, on the receiving one it will just look like

Re: cassandra performance degrades after 12 hours

2011-10-03 Thread Chris Goffinet
Most likely what could be happening is you are running single threaded compaction. Look at the cassandra.yaml of how to enable multi-threaded compaction. As more data comes into the system, bigger files get created during compaction. You could be in a situation where you might be compacting at a

Re: cfstats - check Read Count per minute

2011-10-03 Thread Chris Goffinet
If he puts the mx4j jar (http://mx4j.sourceforge.net/) in his lib/ folder, he can fetch stats out over HTTP. mx4j is a bridge for JMX-HTTP. On Mon, Oct 3, 2011 at 2:53 AM, aaron morton aa...@thelastpickle.comwrote: Other than manually pull them from JMX, not really. Most monitoring templates

Re: cassandra performance degrades after 12 hours

2011-10-03 Thread Chris Goffinet
(5000 total), do you think it will help me in this case? ( smaller files and so smaller load on compaction ) Is it normal to have 5000 column families? thanks Ramesh On Mon, Oct 3, 2011 at 2:50 PM, Chris Goffinet c...@chrisgoffinet.comwrote: Most likely what could be happening is you

Re: Does anybody know why Twitter stop integrate Cassandra as Twitter store?

2011-10-04 Thread Chris Goffinet
At the time of that project, there wasn't enough resources and dedicated team. Since then we changed that (based on the presentation I gave). We decided to focus on other areas, and newer projects. We spent a lot of time with the community improving failure conditions, performance, etc. We chose

Re: Size calculations for off heap caching

2011-10-18 Thread Chris Goffinet
My best advice on this is, insert a bit of data into the tree, and then do a heap dump to calculate the extra overhead. It's unfortunately more than you would like from our testing. On Tue, Oct 18, 2011 at 8:14 PM, Todd Nine t...@spidertracks.com wrote: ** Hi guys, We've just built a K tree

Re: Cassandra cluster HW spec (commit log directory vs data file directory)

2011-10-29 Thread Chris Goffinet
RE: RAID0 Recommendation Cassandra supports multiple data file directories. Because we do compactions, it's just much easier to deal with (1) data file directory that is stripped across all disks as 1 volume (RAID0). There are other ways to accomplish this though. At Twitter we use software raid

Re: Cassandra cluster HW spec (commit log directory vs data file directory)

2011-10-30 Thread Chris Goffinet
memcache on each node and allocate the remaining to that. 2. What your network speed ? Do you use trunks ? Do you have a dedicated VLAN for gossip/store traffic ? No dedicated VLAN for gossip. We run at 2Gb/s. We have bonded NIC's. Cheers, Sorin On Sun, Oct 30, 2011 at 5:00 AM, Chris

Re: Cassandra cluster HW spec (commit log directory vs data file directory)

2011-10-30 Thread Chris Goffinet
No. We built a pluggable cache provider for memcache. On Sun, Oct 30, 2011 at 7:31 PM, Mohit Anchlia mohitanch...@gmail.comwrote: On Sun, Oct 30, 2011 at 6:53 PM, Chris Goffinet c...@chrisgoffinet.com wrote: On Sun, Oct 30, 2011 at 3:34 PM, Sorin Julean sorin.jul...@gmail.com wrote

Re: 0.8.1 Vs 1.0.7

2012-03-19 Thread Chris Goffinet
When creating a new CF, defaults are now in fact compression enabled. On Sat, Mar 17, 2012 at 5:50 AM, R. Verlangen ro...@us2.nl wrote: Check your log for messages about rebuilding indices: that might grow your dataset some. One thing is for sure: the data import removed all the crap that

Re: Read Latency

2010-10-20 Thread Chris Goffinet
If you are using Python, and raw Thrift, use the following: protocol = TBinaryProtocol.TBinaryProtocolAccelerated(transport) The serialization/deserialization is done directly in C. On Wed, Oct 20, 2010 at 11:53 AM, Wayne wav...@gmail.com wrote: We did some testing and the object is 23megs

Re: Cassandra and -XX:+UseLargePages

2011-01-16 Thread Chris Goffinet
I've seen about a 13% improvement in practice. -Chris On Jan 16, 2011, at 4:01 PM, David Dabbs wrote: Hello. Can anyone comment on the performance impact (positive or negative) of running Cassandra configured to use large pages under Linux? Yes, YMMV applies, but I thought I'd ask before

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
+1 On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote: It seems like it was just earlier this week that we announced the release of 0.6.10. Oh wait, it was. In the time since though, CASSANDRA-2058[1] was found and fixed, and that seemed like reason enough to

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
Err. I mean't, thanks Evan for getting this released so fast :) On Fri, Jan 28, 2011 at 3:18 PM, Chris Goffinet c...@chrisgoffinet.comwrote: +1 On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote: It seems like it was just earlier this week that we announced

Re: Subscribe

2011-02-15 Thread Chris Goffinet
I would like to subscribe to your newsletter. On Tue, Feb 15, 2011 at 8:04 AM, A J s5a...@gmail.com wrote:

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
Can you tell me how many SSTables on disk when you see GC pauses? In your 3 node cluster, what's the RF factor? On Mon, Mar 7, 2011 at 1:50 PM, ruslan usifov ruslan.usi...@gmail.comwrote: 2011/3/8 Jonathan Ellis jbel...@gmail.com It sounds like you're complaining that the JVM sometimes does

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
The rows you are inserting, what is your update ratio to those rows? On Mon, Mar 7, 2011 at 4:03 PM, ruslan usifov ruslan.usi...@gmail.comwrote: 2011/3/8 Chris Goffinet c...@chrisgoffinet.com Can you tell me how many SSTables on disk when you see GC pauses? In your 3 node cluster, what's

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
How large are your SSTables on disk? My thought was because you have so many on disk, we have to store the bloom filter + every 128 keys from index in memory. On Mon, Mar 7, 2011 at 4:35 PM, ruslan usifov ruslan.usi...@gmail.comwrote: 2011/3/8 Chris Goffinet c...@chrisgoffinet.com The rows

Re: How to use join_ring=false?

2011-03-21 Thread Chris Goffinet
-Dcassandra.join_ring=false -Chris On Mar 21, 2011, at 10:32 PM, Jason Harvey wrote: I set join_ring=false in my java opts: -Djoin_ring=false However, when the node started up, it joined the ring. Is there something I am missing? Using 0.7.4 Thanks, Jason

Re: nodetool move hammers the next node in the ring

2011-04-08 Thread Chris Goffinet
We also have a ticket open at https://issues.apache.org/jira/browse/CASSANDRA-2399 We have observed in production the impact of streaming data to new nodes being added. We actually have our entire dataset in page cache in one of our clusters, our 99th percentiles go from 20ms to 1 second on

Re: CASSANDRA-721

2010-03-22 Thread Chris Goffinet
-- Chris Goffinet

Re: Nodes Timing Out

2010-03-27 Thread Chris Goffinet
what's the ulimit set to? -Chris On Mar 27, 2010, at 10:29 AM, James Golick wrote: Hey, I put our first cluster in to production (writing but not reading) a couple of days ago. Right now, it's got two pretty sizeable nodes taking about 200 writes per second each and virtually no reads.

Re: Hackathon?!?

2010-03-28 Thread Chris Goffinet
Awesome! 2 tickets left. -Chris On Mar 27, 2010, at 11:42 PM, Evan Weaver wrote: Me too. On Tue, Mar 23, 2010 at 12:48 PM, Jeff Hodges jhod...@twitter.com wrote: I'll be there. -- Jeff On Mon, Mar 22, 2010 at 8:40 PM, Eric Florenzano flo...@gmail.com wrote: Nice, I'll go! -Eric

Re: Stalled Bootstrapping Process

2010-04-02 Thread Chris Goffinet
1, 2010 at 4:23 PM, Jonathan Ellis jbel...@gmail.com wrote: I would turn debug logging on globally on the new node, that will answer more questions than just the streaming package. -- Dan Di Spaltro -- Dan Di Spaltro -- Chris Goffinet

Re: if cassandra isn't ideal for keep track of counts, how does digg count diggs?

2010-04-06 Thread Chris Goffinet
! On Tue, Apr 6, 2010 at 12:54 PM, Chris Goffinet goffi...@digg.com wrote: That's not true. We have been using the Zookeper work we posted on jira. That's what we are using internally and have been for months. We are now just wrapping up our vector clocks + distributed counter patch so we can begin

Re: [RELEASE] 0.6.0

2010-04-17 Thread Chris Goffinet
I wonder if that might be related to this: https://issues.apache.org/jira/browse/CASSANDRA-896 We switched from a Concurrent structure to LinkedBlockingQueue in 0.6. -Chris On Apr 17, 2010, at 9:26 PM, Schubert Zhang wrote: We are testing 0.6.0, compares with 0.5.1, and it seems: 1.

Re: Cassandra cluster runs into OOM when bulk loading data

2010-04-26 Thread Chris Goffinet
; if a compaction is going on the individual node is I/O limited tpstats: caught me, didn't know this. I will set up a test and try to catch a node during the critical time. Thanks, Roland 2010/4/26 Chris Goffinet goffi...@digg.com Which version of Cassandra? Which version of Java JVM are you using

Re: zookeeper, how do you feed the pets?

2010-05-16 Thread Chris Goffinet
If you are running multiple datacenters, intend to have a lot of writes for counters, I highly advise against it. We got rid of ZK because of that. -Chris On May 16, 2010, at 7:04 PM, S Ahmed wrote: Can someone quickly go over how you go about using zookeeper if you want to store counts and

Re: Why Cassandra is space inefficient compared to MySQL?

2010-05-25 Thread Chris Goffinet
My money is on the fact that the serializer is just horribly verbose. It's using a basic set of the java serializer. -Chris On Tue, May 25, 2010 at 10:02 AM, Ryan King r...@twitter.com wrote: Also, timestamps for each column. -ryan On Tue, May 25, 2010 at 5:41 AM, Jonathan Ellis

Re: Digg 4 Preview on TWiT

2010-06-28 Thread Chris Goffinet
Digg is not forking Cassandra. We use 0.6 for production, with a few in-house patches (related to our infrastructure). The biggest difference with our branch and apache 0.6 branch is we have the work Kelvin and Twitter has done in regards to Vector Clocks + Distributed Counters. This will never

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-30 Thread Chris Goffinet
When you can't get the number of threads, that means you have way too many running (8,000+) usually. Try running `ps -eLf | grep cassandra`. How many threads? -Chris On Jul 29, 2010, at 8:40 PM, Dathan Pattishall wrote: To Follow up on this thread. I blew away the data for my entire