active queries

2015-03-19 Thread Rahul Bhardwaj
Hi, can we see active queries on a Cassandra cluster? Is there any tool? Please help. Regards, Rahul Bhardwaj

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal agarwalpran...@gmail.com wrote: I have a 14-node Cassandra cluster, and each node has around 50 GB of data. I added 3 new nodes to the cluster and I can see the status as UJ for the new nodes. They have been in that state for almost a day now and their data

Re: Seed Node

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 3:56 PM, jean paul researche...@gmail.com wrote: Please, I have a question about the seed node. As I read, it is the bootstrap node, and each new node joins via the seed node; is that it? If it leaves the cluster, how can a new node join the rest of the group? What a seed is

Re: best way to measure repair times?

2015-03-19 Thread Rahul Neelakantan
Wouldn't GC Grace set to 34 days increase the bloat in the DB? Rahul On Mar 19, 2015, at 3:02 PM, Robert Coli rc...@eventbrite.com wrote: On Thu, Mar 19, 2015 at 10:30 AM, Ian Rose ianr...@fullstory.com wrote: I'd like to (a) monitor how long my repairs are taking, and (b) know when a

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Pranay Agarwal
Thanks Rob, you are right. I am using ReleaseVersion: 2.1.0. What do you mean by point 3? Also, does doing one at a time mean waiting until nodetool status shows the new node change from UJ to UN? On Thu, Mar 19, 2015 at 5:44 PM, Robert Coli rc...@eventbrite.com wrote: On Thu, Mar 19, 2015 at 5:32 PM,

Seed Node

2015-03-19 Thread jean paul
Hello all, Please, I have a question about the seed node. As I read, it is the bootstrap node, and each new node joins via the seed node; is that it? If it leaves the cluster, how can a new node join the rest of the group? Thanks a lot for your answer. Best regards.

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-19 Thread Kai Wang
With your read path and data model, it doesn't matter how many nodes you have. All data with the same image_caseid is physically located on one node (well, on RF nodes, but only one of those will try to serve your query). You are not taking advantage of Cassandra by creating hot spots on both

Re: active queries

2015-03-19 Thread Rahul Bhardwaj
Thanks to all, I will get back to you after looking at all of these. On Fri, Mar 20, 2015 at 2:56 AM, Robert Coli rc...@eventbrite.com wrote: On Thu, Mar 19, 2015 at 1:30 PM, Jan cne...@yahoo.com wrote: Answer: nodetool tpstats, nodetool cfstats For a better view of this :

Re: Cassandra Read Timeout

2015-03-19 Thread Jan
Yulian; Quote: Row size is around 190 MB. There are bigger rows with a similar structure (index rows, which actually store keys) and everything is working fine on them; everything is also working fine on this CF, but on other rows. Table data from CFStats ( First table has bigger rows but

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Pranay Agarwal
Also, the new nodes (3 of them, in UJ state) are showing some data size (~10 GB). Is there any chance of data loss from stopping Cassandra on them? On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal agarwalpran...@gmail.com wrote: Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:13 PM, Ali Akhtar ali.rac...@gmail.com wrote: Cassandra doesn't guarantee eventual consistency? If you run regularly scheduled repair, it does. If you do not run repair, it does not. Hinted handoff, for example, is considered an optimization for repair, and does not

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:53 PM, Paulo Motta pauloricard...@gmail.com wrote: This is only true if you never have hardware failure. Hardware failure means that For the record, I hate this formulation for being a little too clever. This is never true, because we live in a world where hardware

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 1:03 PM, Jan cne...@yahoo.com wrote: to respond to your specific question: You could pipe the output of your repair into a file and subsequently determine the time taken. By this method, what is the duration of a repair which will never complete? =Rob

Re: best way to measure repair times?

2015-03-19 Thread Jan
Ian;  to respond to your specific question: You could pipe the output of your repair into a file and subsequently determine the time taken.    example: nodetool repair -dc DC1 [2014-07-24 21:59:55,326] Nothing to repair for keyspace 'system' [2014-07-24 21:59:55,617] Starting repair command #2,
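A minimal sketch of the "pipe the output to a file and determine the time taken" idea, assuming the saved output carries bracketed timestamps in the `[2014-07-24 21:59:55,326]` form quoted above. The function name `repair_duration` is hypothetical, and the sketch only measures the span between the earliest and latest timestamp it finds; it does not know whether the repair actually completed.

```python
import re
from datetime import datetime, timedelta
from typing import Optional

# Matches the "[YYYY-MM-DD HH:MM:SS,mmm]" stamps seen in the quoted repair output.
TIMESTAMP = re.compile(r"\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\]")


def repair_duration(output: str) -> Optional[timedelta]:
    """Return the span between the earliest and latest timestamp in the text.

    Returns None when the text contains no recognizable timestamp.
    """
    stamps = [
        datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S,%f")
        for match in TIMESTAMP.finditer(output)
    ]
    if not stamps:
        return None
    return max(stamps) - min(stamps)


if __name__ == "__main__":
    # The two timestamped fragments quoted in the message above.
    sample = (
        "[2014-07-24 21:59:55,326] Nothing to repair for keyspace 'system'\n"
        "[2014-07-24 21:59:55,617] Starting repair command #2,"
    )
    print(repair_duration(sample))
```

Because the function scans the whole text rather than line by line, it also copes with output that has been flattened onto a single line. A repair that hangs leaves no final timestamp, so this approach cannot tell a stalled repair from a finished one.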

best way to measure repair times?

2015-03-19 Thread Ian Rose
Howdy - I'd like to (a) monitor how long my repairs are taking, and (b) know when a repair is finished so that I can take some kind of followup action. What's the best way to tackle either or both of these? Some potentially relevant details: - running community apache cassandra (not DSE) -

Re: active queries

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:11 AM, Rahul Bhardwaj rahul.bhard...@indiamart.com wrote: Can we see active queries on cassandra cluster. Is there any tool? Not really : https://issues.apache.org/jira/browse/CASSANDRA-5084 =Rob

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 10:32 AM, Ali Akhtar ali.rac...@gmail.com wrote: Just wondering - why do you have to trigger the repairs? Is that necessary in Cassandra? Manual repair is the only mechanism in Cassandra which guarantees consistency. A repair must be run once per gc_grace_seconds in
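As a rough illustration of the "once per gc_grace_seconds" rule of thumb, the sketch below computes when the next repair would be due. The function names are hypothetical, and measuring from the end of the previous repair is an assumption made for this example, not something the thread specifies. The value 864000 (10 days) is Cassandra's default gc_grace_seconds.

```python
from datetime import datetime, timedelta

DEFAULT_GC_GRACE_SECONDS = 864000  # Cassandra's default: 10 days


def next_repair_deadline(last_repair_finished: datetime,
                         gc_grace_seconds: int = DEFAULT_GC_GRACE_SECONDS) -> datetime:
    """Latest time by which the next repair should have run (assumed rule)."""
    return last_repair_finished + timedelta(seconds=gc_grace_seconds)


def is_repair_overdue(last_repair_finished: datetime,
                      now: datetime,
                      gc_grace_seconds: int = DEFAULT_GC_GRACE_SECONDS) -> bool:
    """True when more than gc_grace_seconds have passed since the last repair."""
    return now > next_repair_deadline(last_repair_finished, gc_grace_seconds)


if __name__ == "__main__":
    last = datetime(2015, 3, 1)
    print(next_repair_deadline(last))
    # A 34-day grace period, as mentioned elsewhere in this thread.
    print(next_repair_deadline(last, gc_grace_seconds=34 * 86400))
```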

Re: best way to measure repair times?

2015-03-19 Thread Ali Akhtar
Cassandra doesn't guarantee eventual consistency? On Fri, Mar 20, 2015 at 12:04 AM, Robert Coli rc...@eventbrite.com wrote: On Thu, Mar 19, 2015 at 10:32 AM, Ali Akhtar ali.rac...@gmail.com wrote: Just wondering - why do you have to trigger the repairs? Is that necessary in Cassandra?

Re: active queries

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 1:30 PM, Jan cne...@yahoo.com wrote: Answer: nodetool tpstats, nodetool cfstats For a better view of this : https://github.com/hailocab/ctop =Rob

RE: best way to measure repair times?

2015-03-19 Thread Jason Kushmaul | WDA
Ian, In my experience I don’t get any output from repair (2.0.7) that is useful until the keyspace is finished. Perhaps this has been solved but we do something much more painful: We tail the log on the node having repair run on it, watching for the first repair session, and then count each

Re: active queries

2015-03-19 Thread Jan
Hi Rahul; your question: Can we see active queries on cassandra cluster. Is there any tool? Answer: nodetool tpstats, nodetool cfstats. The nodetool tpstats command provides statistics about the number of active, pending, and completed tasks for each stage of Cassandra operations by

Re: Delete columns

2015-03-19 Thread Jan
Benyi; have you considered using the TTL option, in case your columns are meant to be deleted after a predetermined amount of time? It's probably the easiest way to get the task accomplished. Cheers, Jan On Friday, February 27, 2015 10:38 AM, Benyi Wang bewang.t...@gmail.com wrote:

Re: best way to measure repair times?

2015-03-19 Thread Ian Rose
Thanks Jan, although I'm a bit unsure of the details. It looks like when you run a repair, it actually occurs over several sessions, e.g. in your example above there are 2 different "repair session [...] finished" lines. So does it make sense that I would want to measure between when I first

Re: FileNotFoundException

2015-03-19 Thread Jan
Hi Batranut; In both errors you described above, the files seem to be missing while compaction is running. Without knowing what else is going on in your system, I would presume that this error occurs on this single node only and not on your entire cluster. Some guesses: a) You may have a disk

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-19 Thread Jack Krupansky
Content management (large blobs such as images and video) can be done with Cassandra, but it is tricky and great care is needed. As with any Cassandra app, you need to model your data based on how you intend to query and access the data. You can certainly access large amounts of data with