Re: Working with legacy data via CQL

2014-11-19 Thread Erik Forsberg
On 2014-11-19 01:37, Robert Coli wrote: Thanks, I can reproduce the issue with that, and I should be able to look into it tomorrow. FWIW, I believe the issue is server-side, not in the driver. I may be able to suggest a workaround once I figure out what's going on. Is

Re: Repair completes successfully but data is still inconsistent

2014-11-19 Thread André Cruz
On 19 Nov 2014, at 00:43, Robert Coli rc...@eventbrite.com wrote: @OP : can you repro if you run a major compaction between the deletion and the tombstone collection? This happened in production and, AFAIK, for the first time in a system that has been running for 2 years. We have upgraded

Re: Repair completes successfully but data is still inconsistent

2014-11-19 Thread André Cruz
On 19 Nov 2014, at 11:37, André Cruz andre.c...@co.sapo.pt wrote: All the nodes were restarted on 21-23 October, for the upgrade (1.2.16 - 1.2.19) I mentioned. The delete happened after. I should also point out that we were experiencing problems related to CASSANDRA-4206 and CASSANDRA-7808.

cassandra-stress: Clarification on yaml profile needed

2014-11-19 Thread Preussner, Jens
Hi all, can someone point me to the latest documentation on how a yaml profile has to look for the latest cassandra-stress? There seem to be some differences between the format described in the blog (http://www.datastax.com/dev/blog/improved-cassandra-2-1-stress-tool-benchmark-any-schema) and

A tale of a node that never joins...

2014-11-19 Thread Stan Lemon
Hello, I'm working on a two data center cluster with 12 nodes in each data center. I recently wanted to add a thirteenth node to one of the data centers to try and validate some load improvements to our hardware configuration. I added the node following DataStax directions (

Re: A tale of a node that never joins...

2014-11-19 Thread DuyHai Doan
Hello Stan Which version of Cassandra are you using ? There are some known issues of streaming failure that prevent a node from finishing joining Regards On Wed, Nov 19, 2014 at 3:57 PM, Stan Lemon sle...@salesforce.com wrote: Hello, I'm working on a two data center cluster with 12 nodes

Re: A tale of a node that never joins...

2014-11-19 Thread Stan Lemon
We are currently using 2.0.11 Thanks, Stan Hello Stan Which version of Cassandra are you using ? There are some known issues of streaming failure that prevent a node from finishing joining Regards On Wed, Nov 19, 2014 at 3:57 PM, Stan Lemon sle...@salesforce.com wrote: Hello, I'm

Re: sstables keep growing on cassandra 2.1

2014-11-19 Thread Colin Kuo
Hi, Can you please firstly check the nodetool compactionstats during repair? I'm afraid that minor compaction may be blocked by whatever tasks that causes the number of SStable keep growing. On Sat, Nov 15, 2014 at 7:47 AM, James Derieg james.der...@uplynk.com wrote: Hi everyone, I'm hoping

Re: Repair completes successfully but data is still inconsistent

2014-11-19 Thread Robert Coli
On Wed, Nov 19, 2014 at 5:18 AM, André Cruz andre.c...@co.sapo.pt wrote: Each node has 4-9 of these exceptions as it is going down after being drained. It seems Cassandra was trying to delete an sstable. Can this be related? That seems plausible, though the versions of the files you indicate

Re: Removing commit log files

2014-11-19 Thread Robert Coli
On Tue, Nov 18, 2014 at 6:30 PM, Jacob Rhoden jacob.rho...@me.com wrote: Is it correct to assume that if you do a “nodetool drain” on a node and then shutdown a node, you can safely remove all commit logs on that node as long as all nodes are up? Assuming you are in a version where nodetool

Re: Deduplicating data on a node (RF=1)

2014-11-19 Thread Robert Coli
On Tue, Nov 18, 2014 at 10:04 AM, Alain Vandendorpe al...@tapstream.com wrote: Rob - thanks for that, I was wondering whether either of those would successfully deduplicate the data. We were hypothesizing that a decommission would merely stream the duplicates out as well as though they were

Re: Cassandra backup via snapshots in production

2014-11-19 Thread Robert Coli
On Tue, Nov 18, 2014 at 6:50 AM, Ngoc Minh VO ngocminh...@bnpparibas.com wrote: We are looking for a solution to backup data in our C* cluster (v2.0.x, 16 nodes, 4 x 500GB SSD, RF = 6 over 2 datacenters). The main purpose is to protect us from human errors (eg. unexpected manipulations:

Re: Force purging of tombstones

2014-11-19 Thread Robert Coli
On Tue, Nov 18, 2014 at 12:41 AM, Rahul Neelakantan ra...@rahul.be wrote: Is this page incorrect then and needs to be updated or am I interpreting it incorrectly ? http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_about_deletes_c.html Particularly this sentence After

Re: read repair across DC and latency

2014-11-19 Thread Jimmy Lin
Tyler, thanks for the detail explanation. Still have few questions in my mind # When you said send read digest request to the rest of the replica, do you mean all replica(s) in current and other DC? or just the one last replica in my current DC and one of the co-ordinate node in other DC?

any way to get nodetool proxyhistograms data for an entire cluster?

2014-11-19 Thread Clint Kelly
If I run this tool on a given host, it shows me stats for only the cases where that host was the coordinator node, correct? Is there any way (other than me cooking up a little script) to automatically get the proxyhistogram stats for my entire cluster? -Clint

Re: any way to get nodetool proxyhistograms data for an entire cluster?

2014-11-19 Thread Robert Coli
On Wed, Nov 19, 2014 at 3:22 PM, Clint Kelly clint.ke...@gmail.com wrote: Is there any way (other than me cooking up a little script) to automatically get the proxyhistogram stats for my entire cluster? OpsCenter might expose this as an aggregate, and can be used with free Apache Cassandra.

Re: Trying to build Cassandra for FreeBSD 10.1

2014-11-19 Thread Michael Shuler
On 11/18/2014 04:58 PM, William Arbaugh wrote: Happy to do so - but the ticket indicates that FreeBSD is unsupported and thus this is unlikely to get fixed. I'm the person that said that in the JIRA ticket :) I also quoted it to indicate that it's really not officially unsupported - it's

querying data from Cassandra through the Spark SQL Thrift JDBC server

2014-11-19 Thread Mohammed Guller
Hi - I was curious if anyone is using the Spark SQL Thrift JDBC server with Cassandra. It would be great be if you could share how you got it working? For example, what config changes have to be done in hive-site.xml, what additional jars are required, etc.? I have a Spark app that can

Question: How to monitor the QPS in Cassandra local node or cluster

2014-11-19 Thread luolee.me
Hi, everyone, I want to monitor the Cassandra cluster using Zabbix, but I have no idea about hot monitor the QPS on local Cassandra node ? I search the internet but haven't any result about how to get the QPS. anyone had any idea? Thanks!

Re: any way to get nodetool proxyhistograms data for an entire cluster?

2014-11-19 Thread Clint Kelly
Thanks for the reply. We have DSE so I can use opscenter. I was just looking for something more precise than the graphs that I get from opscenter. On Wed, Nov 19, 2014 at 5:53 PM, Rahul Neelakantan ra...@rahul.be wrote: So what do use as a good alternate to it? Rahul Neelakantan On Nov