Hi,
I am new to Cassandra and I am not sure if this is the normal behavior but
nodetool repair runs for too long even for small dataset per node. As I am
writing I started a nodetool repair last night at 18:41 and now it's 9:18
and it's still running, the size of my data is only ~500mb per node.
Hey,
Version 1.1 of Cassandra introduced live traffic sampling, which allows you
to measure the performance of a node without it really joining the cluster:
http://www.datastax.com/dev/blog/whats-new-in-cassandra-1-1-live-traffic-sampling
That page mentions that you can change the compaction
Just turn off dynamic snitch on survey node and make read requests from it
directly with CL.ONE, watch histograms, compare.
Regarding switching compaction strategy there're a lot of info already.
Best regards / Pagarbiai
Viktor Jevdokimov
Senior Developer
Email:
The 40 TB use case you heard about is probably one 40TB mysql machine
that someone migrated to mongo so it would be web scale Cassandra is
NOT good with drives that big, get a blade center or a high density
chassis.
On Mon, Feb 18, 2013 at 8:00 PM, Hiller, Dean dean.hil...@nrel.gov wrote:
I
Hey Guys,
I just wanted to follow up on this thread on how I go JNA to work with the
cassandra 1.2.1 tarball I downloaded.
On CentOS I went :
[root@cassandra-node01 ~]# yum provides */jna.jar
...
jna-3.4.0-4.el5.x86_64 : Pure Java access to native libraries
Repo:
based on your mount/selinux settings sometimes the os is unwilling to
tolerate so files outside certain directories.
Edward
On Tue, Feb 19, 2013 at 10:13 AM, Tim Dunphy bluethu...@gmail.com wrote:
Hey Guys,
I just wanted to follow up on this thread on how I go JNA to work with the
cassandra
This is very normal (unfortunately). Are you doing a repair –pr or a straight
up repair?
Does nodetool netstats show anything? I frequently see repair hang in 1.2.1,
and I haven't been able to figure out why yet though. Feel free to take a stack
dump with jstack on the node doing the repair
Well, that answer didn't really help. I know how to make a survey node, and
I know how to simulate reads to it, it's just that that's a lot of work,
and I wouldn't be sure that the simulated load is the same as the
production load.
We gather a lot of metrics from our production servers, so we
From my limited experience with Mongo, it seems that Mongo only performs when
the whole data set is in the memory which makes me wonder how the 40TB data
works..
- Original Message -
From: Edward Capriolo edlinuxg...@gmail.com
To: user@cassandra.apache.org
Sent: Tuesday, February 19,
It should not take that long. For my 200G node, it takes about an hour to
calculate the Merkle tree and then data streaming.
By the way, how do you know the repair is not done?
If you run nodetool tpstats, it should give you the AntiEntropy session info,
active/pending/completed etc. While
Unsubscribe me please.
Thanks
A
Read the message you answered to, and help yourself !
Alain
2013/2/19 Anurag Gujral anurag.guj...@gmail.com
Unsubscribe me please.
Thanks
A
Thank you Aaron.
From: aa...@thelastpickle.com
Subject: Re: Question on Cassandra Snapshot
Date: Mon, 18 Feb 2013 06:37:34 +1300
To: user@cassandra.apache.org
With incremental_backup turned OFF in Cassandra.yaml - Are all SSTables are
under /data/TestKeySpace/ColumnFamily at all times?No. They
I have a 5 node cluster and currently running ver 1.2. Prior to full scale
deployment, I'm running some benchmarks using YCSB. From a hadoop cluster
deployment we saw an excellent improvement using higher speed networks. However
Cassandra does not include network latencies and I would like to
With CQL or an API.
Does the rpc_timeout not control the client timeout ?
No it is how long a node will wait for a response from other nodes before
raising a TimedOutException if less than CL nodes have responded.
Set the client side socket timeout using your preferred client.
Is there any param which is
16 matches
Mail list logo