,
e.g. Twitter currently runs a couple hundred Cassandra nodes across a half
dozen clusters.
http://www.datastax.com/2011/06/chris-goffinet-of-twitter-to-speak-at-cassandra-sf-2011
http://www.datastax.com/2011/06/chris-goffinet-of-twitter-to-speak-at-cassandra-sf-2011If
you are working with a 3
It will also depend on how long you can handle recovery time. So imagine
this case:
3 nodes w/ RF of 3
Each node has 30TB of space used (you never want to fill up entire node).
If one node fails and you must recover, that will take over 3.6 days in
just transferring data alone. That's with a
Twitter runs 0.8 in production/closer to trunk. No big issues from us.
On Thu, Sep 8, 2011 at 8:53 PM, Eric Czech e...@nextbigsound.com wrote:
We just migrated from .7.5 to .8.4 in our production environment and it was
definitely the least painful transition yet (coming all the way from the
For things like rolling restarts, we do:
disablethrift
disablegossip
(...wait for all nodes to see this node go down..)
drain
2011/9/10 Radim Kolar h...@sendmail.cz
what is recommended node stop method. drain or kill Java process? i haven't
seen anybody using drain in stop scripts yet
If i
You could tail the commit log with `strings` to see what keys are being
inserted.
On Sat, Sep 10, 2011 at 2:24 PM, Jonathan Ellis jbel...@gmail.com wrote:
Two possibilities:
1) Hinted handoff (this will show up in the logs on the sending
machine, on the receiving one it will just look like
Most likely what could be happening is you are running single threaded
compaction. Look at the cassandra.yaml of how to enable multi-threaded
compaction. As more data comes into the system, bigger files get created
during compaction. You could be in a situation where you might be compacting
at a
If he puts the mx4j jar (http://mx4j.sourceforge.net/) in his lib/ folder,
he can fetch stats out over HTTP. mx4j is a bridge for JMX-HTTP.
On Mon, Oct 3, 2011 at 2:53 AM, aaron morton aa...@thelastpickle.comwrote:
Other than manually pull them from JMX, not really.
Most monitoring templates
(5000
total), do you think it will help me in this case? ( smaller files and so
smaller load on compaction )
Is it normal to have 5000 column families?
thanks
Ramesh
On Mon, Oct 3, 2011 at 2:50 PM, Chris Goffinet c...@chrisgoffinet.comwrote:
Most likely what could be happening is you
At the time of that project, there wasn't enough resources and dedicated
team. Since then we changed that (based on the presentation I gave). We
decided to focus on other areas, and newer projects. We spent a lot of time
with the community improving failure conditions, performance, etc. We chose
My best advice on this is, insert a bit of data into the tree, and then do a
heap dump to calculate the extra overhead. It's unfortunately more than you
would like from our testing.
On Tue, Oct 18, 2011 at 8:14 PM, Todd Nine t...@spidertracks.com wrote:
**
Hi guys,
We've just built a K tree
RE: RAID0 Recommendation
Cassandra supports multiple data file directories. Because we do
compactions, it's just much easier to deal with (1) data file directory
that is stripped across all disks as 1 volume (RAID0). There are other ways
to accomplish this though. At Twitter we use software raid
memcache on each node
and allocate the remaining to that.
2. What your network speed ? Do you use trunks ? Do you have a dedicated
VLAN for gossip/store traffic ?
No dedicated VLAN for gossip. We run at 2Gb/s. We have bonded NIC's.
Cheers,
Sorin
On Sun, Oct 30, 2011 at 5:00 AM, Chris
No. We built a pluggable cache provider for memcache.
On Sun, Oct 30, 2011 at 7:31 PM, Mohit Anchlia mohitanch...@gmail.comwrote:
On Sun, Oct 30, 2011 at 6:53 PM, Chris Goffinet c...@chrisgoffinet.com
wrote:
On Sun, Oct 30, 2011 at 3:34 PM, Sorin Julean sorin.jul...@gmail.com
wrote
When creating a new CF, defaults are now in fact compression enabled.
On Sat, Mar 17, 2012 at 5:50 AM, R. Verlangen ro...@us2.nl wrote:
Check your log for messages about rebuilding indices: that might grow your
dataset some.
One thing is for sure: the data import removed all the crap that
If you are using Python, and raw Thrift, use the following:
protocol = TBinaryProtocol.TBinaryProtocolAccelerated(transport)
The serialization/deserialization is done directly in C.
On Wed, Oct 20, 2010 at 11:53 AM, Wayne wav...@gmail.com wrote:
We did some testing and the object is 23megs
I've seen about a 13% improvement in practice.
-Chris
On Jan 16, 2011, at 4:01 PM, David Dabbs wrote:
Hello.
Can anyone comment on the performance impact (positive or negative)
of running Cassandra configured to use large pages under Linux?
Yes, YMMV applies, but I thought I'd ask before
+1
On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote:
It seems like it was just earlier this week that we announced the
release of 0.6.10. Oh wait, it was. In the time since though,
CASSANDRA-2058[1] was found and fixed, and that seemed like reason
enough to
Err. I mean't, thanks Evan for getting this released so fast :)
On Fri, Jan 28, 2011 at 3:18 PM, Chris Goffinet c...@chrisgoffinet.comwrote:
+1
On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote:
It seems like it was just earlier this week that we announced
I would like to subscribe to your newsletter.
On Tue, Feb 15, 2011 at 8:04 AM, A J s5a...@gmail.com wrote:
Can you tell me how many SSTables on disk when you see GC pauses? In your 3
node cluster, what's the RF factor?
On Mon, Mar 7, 2011 at 1:50 PM, ruslan usifov ruslan.usi...@gmail.comwrote:
2011/3/8 Jonathan Ellis jbel...@gmail.com
It sounds like you're complaining that the JVM sometimes does
The rows you are inserting, what is your update ratio to those rows?
On Mon, Mar 7, 2011 at 4:03 PM, ruslan usifov ruslan.usi...@gmail.comwrote:
2011/3/8 Chris Goffinet c...@chrisgoffinet.com
Can you tell me how many SSTables on disk when you see GC pauses? In your 3
node cluster, what's
How large are your SSTables on disk? My thought was because you have so many
on disk, we have to store the bloom filter + every 128 keys from index in
memory.
On Mon, Mar 7, 2011 at 4:35 PM, ruslan usifov ruslan.usi...@gmail.comwrote:
2011/3/8 Chris Goffinet c...@chrisgoffinet.com
The rows
-Dcassandra.join_ring=false
-Chris
On Mar 21, 2011, at 10:32 PM, Jason Harvey wrote:
I set join_ring=false in my java opts:
-Djoin_ring=false
However, when the node started up, it joined the ring. Is there
something I am missing? Using 0.7.4
Thanks,
Jason
We also have a ticket open at
https://issues.apache.org/jira/browse/CASSANDRA-2399
We have observed in production the impact of streaming data to new nodes being
added. We actually have our entire dataset in page cache in one of our
clusters, our 99th percentiles go from 20ms to 1 second on
--
Chris Goffinet
what's the ulimit set to?
-Chris
On Mar 27, 2010, at 10:29 AM, James Golick wrote:
Hey,
I put our first cluster in to production (writing but not reading) a couple
of days ago. Right now, it's got two pretty sizeable nodes taking about 200
writes per second each and virtually no reads.
Awesome! 2 tickets left.
-Chris
On Mar 27, 2010, at 11:42 PM, Evan Weaver wrote:
Me too.
On Tue, Mar 23, 2010 at 12:48 PM, Jeff Hodges jhod...@twitter.com wrote:
I'll be there.
--
Jeff
On Mon, Mar 22, 2010 at 8:40 PM, Eric Florenzano flo...@gmail.com wrote:
Nice, I'll go!
-Eric
1, 2010 at 4:23 PM, Jonathan Ellis jbel...@gmail.com
wrote:
I would turn debug logging on globally on the new node, that will
answer more questions than just the streaming package.
--
Dan Di Spaltro
--
Dan Di Spaltro
--
Chris Goffinet
!
On Tue, Apr 6, 2010 at 12:54 PM, Chris Goffinet goffi...@digg.com wrote:
That's not true. We have been using the Zookeper work we posted on jira.
That's what we are using internally and have been for months. We are now just
wrapping up our vector clocks + distributed counter patch so we can begin
I wonder if that might be related to this:
https://issues.apache.org/jira/browse/CASSANDRA-896
We switched from a Concurrent structure to LinkedBlockingQueue in 0.6.
-Chris
On Apr 17, 2010, at 9:26 PM, Schubert Zhang wrote:
We are testing 0.6.0, compares with 0.5.1, and it seems:
1.
; if a compaction is going
on the individual node is I/O limited
tpstats: caught me, didn't know this. I will set up a test and try to catch
a node during the critical time.
Thanks,
Roland
2010/4/26 Chris Goffinet goffi...@digg.com
Which version of Cassandra?
Which version of Java JVM are you using
If you are running multiple datacenters, intend to have a lot of writes for
counters, I highly advise against it. We got rid of ZK because of that.
-Chris
On May 16, 2010, at 7:04 PM, S Ahmed wrote:
Can someone quickly go over how you go about using zookeeper if you want to
store counts and
My money is on the fact that the serializer is just horribly verbose. It's
using a basic set of the java serializer.
-Chris
On Tue, May 25, 2010 at 10:02 AM, Ryan King r...@twitter.com wrote:
Also, timestamps for each column.
-ryan
On Tue, May 25, 2010 at 5:41 AM, Jonathan Ellis
Digg is not forking Cassandra. We use 0.6 for production, with a few in-house
patches (related to our infrastructure). The biggest difference with our branch
and apache 0.6 branch is we have the work Kelvin and Twitter has done in
regards to Vector Clocks + Distributed Counters. This will never
When you can't get the number of threads, that means you have way too many
running (8,000+) usually.
Try running `ps -eLf | grep cassandra`. How many threads?
-Chris
On Jul 29, 2010, at 8:40 PM, Dathan Pattishall wrote:
To Follow up on this thread. I blew away the data for my entire
35 matches
Mail list logo