Re: nodetool cleanup error

2015-03-31 Thread Amlan Roy
Hi, Thanks for the reply. Since nodetool cleanup is not working even after upgrading to 2.0.13, is it recommended to go to an older version (2.0.11 for example, with 2.0.12 also it did not work). Is there any other way of cleaning data from existing nodes after adding a new node. Regards,

Re: sstable writer and creating bytebuffers

2015-03-31 Thread Sylvain Lebresne
On Tue, Mar 31, 2015 at 7:42 AM, Peer, Oded oded.p...@rsa.com wrote: Thanks Sylvain. Is there any way to create a composite key with only one column in Cassandra when creating a table, or should creating a CompositeType instance with a single column be prohibited? It's hard to answer

Re: nodetool cleanup error

2015-03-31 Thread Marcus Eriksson
It should work on 2.0.13. If it fails with that assertion, you should just retry. If that does not work, and you can reproduce this, please file a ticket /Marcus On Tue, Mar 31, 2015 at 9:33 AM, Amlan Roy amlan@cleartrip.com wrote: Hi, Thanks for the reply. Since nodetool cleanup is not

Multinode Cassandra and sstableloader

2015-03-31 Thread Serega Sheypak
Hi, I have a simple question and can't find related info in docs. I have cluster1 with 3 nodes and cluster2 with 5 nodes. I want to transfer whole keyspace named 'mykeyspace' data from cluster1 to cluster2 using sstableloader. I understand that it's not the best solution, I need it for testing

Re: Why select returns tombstoned results?

2015-03-31 Thread Paulo Ricardo Motta Gomes
What version of Cassandra are you running? Are you by any chance running repairs on your data? On Mon, Mar 30, 2015 at 5:39 PM, Benyi Wang bewang.t...@gmail.com wrote: Thanks for replying. In cqlsh, if I change to Quorum (Consistency quorum), sometime the select return the deleted row,

RE: sstable writer and creating bytebuffers

2015-03-31 Thread Peer, Oded
I may have overcomplicated things. In my opinion creating a CompositeType with a single type should throw an exception. From: Sylvain Lebresne [mailto:sylv...@datastax.com] Sent: Tuesday, March 31, 2015 10:18 AM To: user@cassandra.apache.org Subject: Re: sstable writer and creating bytebuffers

Re: Multinode Cassandra and sstableloader

2015-03-31 Thread Alain RODRIGUEZ
Hi, Despite of I understand that it's not the best solution, I need it for testing purposes, I have to ask you if you considered doing an Alter keyspace, change RF 1 for mykeyspace on cluster2 and nodetool rebuild to add a new DC (your cluster2) ? In the case you go your way (sstableloader)

Re: How to store unique visitors in cassandra

2015-03-31 Thread Alain RODRIGUEZ
People keep asking me if we finally found a solution (even if this is 3+ years old) so I will just update this thread with our findings. We finally achieved doing this thanks to our bigdata and reporting stacks by storing blobs corresponding to HLL (HyperLogLog) structures. HLL is an algorithm

Help understanding aftermath of death by GC

2015-03-31 Thread Robert Wille
I moved my site over to Cassandra a few months ago, and everything has been just peachy until a few hours ago (yes, it would be in the middle of the night) when my entire cluster suffered death by GC. By death by GC, I mean this: [rwille@cas031 cassandra]$ grep GC system.log | head -5 INFO

Re: SSTable structure

2015-03-31 Thread Bharatendra Boddu
Some time back I created a blog article about the SSTable storage format with some code references. Cassandra: SSTable Storage Format http://distributeddatastore.blogspot.com/2013/08/cassandra-sstable-storage-format.html - bharat On Mon, Mar 30, 2015 at 5:24 PM, Jacob Rhoden jacob.rho...@me.com

Getting NoClassDefFoundError for com/datastax/spark/connector/mapper/ColumnMapper

2015-03-31 Thread Tiwari, Tarun
Hi Experts, I am getting java.lang.NoClassDefFoundError: com/datastax/spark/connector/mapper/ColumnMapper while running a app to load data to Cassandra table using the datastax spark connector Is there something else I need to import in the program or dependencies? RUNTIME ERROR: Exception

Cassandra 2.0.12 - lots of Got an IOException in internalRead!

2015-03-31 Thread Caraballo, Rafael
In Cassandra 2.0.12, we are seeing a hundreds of these warning in every node... WARN [Thrift-Selector_16] 2015-03-31 00:48:49,700 Message.java (line 364) Got an IOException in internalRead! java.io.IOException: Connection reset by peer at sun.nio.ch.FileDispatcherImpl.read0(Native

Re: High latencies for simple queries

2015-03-31 Thread Tyler Hobbs
To clarify, that's in Cassandra 2.1+. In 2.0 and earlier, we used http://code.google.com/a/apache-extras.org/p/cassandra-dbapi2/ for cqlsh. On Tue, Mar 31, 2015 at 10:40 AM, Tyler Hobbs ty...@datastax.com wrote: The python driver that we bundle with Cassandra for cqlsh is the normal python

Re: Multinode Cassandra and sstableloader

2015-03-31 Thread Alain RODRIGUEZ
IMHO, the most straight forward solution is to add cluster2 as a new DC for mykeyspace and then drop the old DC. That's how we migrated to VPC (AWS) and we love this approach since you don't have to mess with your existing cluster, plus sync is made automatically and you can then drop your old DC

Re: High latencies for simple queries

2015-03-31 Thread Tyler Hobbs
The python driver that we bundle with Cassandra for cqlsh is the normal python driver (https://github.com/datastax/python-driver), although sometimes it's patched for bugfixes or is not an official release. On Sat, Mar 28, 2015 at 5:36 PM, Ben Bromhead b...@instaclustr.com wrote: cqlsh runs on

Re: How to store unique visitors in cassandra

2015-03-31 Thread Laing, Michael
We use Alain's solution as well to make major operational revisions. We have a red team and a blue team in each AWS region, so we just add and drop datacenters to get where we want to be. Pretty simple. ml On Tue, Mar 31, 2015 at 8:16 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: People keep

Re: Getting NoClassDefFoundError for com/datastax/spark/connector/mapper/ColumnMapper

2015-03-31 Thread Dave Brosius
Is there an 'initial cause' listed under that exception you gave? As NoClassDefFoundError is not exactly the same as ClassNotFoundException. It meant that ColumnMapper couldn't initialize it's static initializer, it could be because some other class couldn't be found, or it could be some other

Re: Column value not getting updated

2015-03-31 Thread Mark Greene
Hey Saurabh, We're actually preparing for this ourselves and spinning up our own NTP server pool. The public NTP pools have a lot of drift and should not be relied upon for cluster technology that is sensitive to time skew like C*. The folks at Logentries did a great write up about this which we

Re: Help understanding aftermath of death by GC

2015-03-31 Thread Jens Rantil
Hi Robert, On Tue, Mar 31, 2015 at 2:22 PM, Robert Wille rwi...@fold3.com wrote: Can anybody help me understand why Cassandra wouldn’t recover? One issue when you are running a JVM and start running out of memory is that the JVM can start throwing `OutOfMemoryError` in any thread - not

Re: How to store unique visitors in cassandra

2015-03-31 Thread Alain RODRIGUEZ
Hi Laing, I think you answered the wrong mail =). This one is around UV on custom range model. But I am happy that you agree on my last message about the Datacenter switch. C*heers 2015-03-31 16:29 GMT+02:00 Laing, Michael michael.la...@nytimes.com: We use Alain's solution as well to make

Re: Cassandra 2.0.12 - lots of Got an IOException in internalRead!

2015-03-31 Thread Michael Shuler
On 03/31/2015 10:15 AM, Caraballo, Rafael wrote: In Cassandra 2.0.12, we are seeing a hundreds of these warning in every node… WARN [Thrift-Selector_16] 2015-03-31 00:48:49,700 Message.java (line 364) Got an IOException in internalRead! java.io.IOException: Connection reset by peer

Re: Help understanding aftermath of death by GC

2015-03-31 Thread Robert Coli
On Tue, Mar 31, 2015 at 9:12 AM, Jens Rantil jens.ran...@tink.se wrote: One issue when you are running a JVM and start running out of memory is that the JVM can start throwing `OutOfMemoryError` in any thread - not necessarily in the thread which is taking all the memory. I've seen this

Cross-datacenter requests taking a very long time.

2015-03-31 Thread Andrew Vant
I have a Cassandra 2.0.13 cluster with three datacenters, three nodes per datacenter. If I open cqlsh and do a select with any consistency level that crosses datacenters (e.g. QUORUM or ALL), it works, but takes 2+ minutes to return. The same statement with consistency ONE or LOCAL_QUORUM is as

Re: Why select returns tombstoned results?

2015-03-31 Thread Ken Hancock
Have you checked time sync across all servers? The fact that you've changed consistency levels and you're getting different results may indicate something inherently wrong with the cluster such as writes being dropped or time differences between the nodes. A brute-force approach to better

Re: Multinode Cassandra and sstableloader

2015-03-31 Thread Serega Sheypak
I have to ask you if you considered doing an Alter keyspace, change RF The idea is dead simple: get data from cluster1, put it to cluster2 vipe cluster1 I understand drawbacks of streaming sstableloader approach, I need right now something easy. Later we consider switch to Priam since it does

Re: Help understanding aftermath of death by GC

2015-03-31 Thread Jason Wee
Hey Robert, you might want to start by looking into the statistics of cassandra, either exposed via nodetool or if you have monitoring system monitor the important metrics. I have read this article moment ago and I hope it help you http://aryanet.com/blog/cassandra-garbage-collector-tuning to

Re: Cross-datacenter requests taking a very long time.

2015-03-31 Thread Robert Coli
On Tue, Mar 31, 2015 at 1:54 PM, Andrew Vant andrew.v...@rackspace.com wrote: I have a Cassandra 2.0.13 cluster with three datacenters, three nodes per datacenter. If I open cqlsh and do a select with any consistency level that crosses datacenters (e.g. QUORUM or ALL), it works, but takes 2+

Re: Column value not getting updated

2015-03-31 Thread Nate McCall
You would see that if the servers' clocks were out of sync. Make sure the time on the servers is in sync or set the client timestamps explicitly. On Tue, Mar 31, 2015 at 3:23 PM, Saurabh Sethi saurabh_se...@symantec.com wrote: I have written a unit test that creates a column family, inserts a

Re: Column value not getting updated

2015-03-31 Thread Jonathan Haddad
It's not enough to set up ntp, you're going to need to force the time to sync. ntp is only meant to correct for drift. You can either use ntpdate or I think there's a flag for ntpd (that I can't remember and am in a rush out the door) that you can use to force it to adjust to the correct time.

Re: Column value not getting updated

2015-03-31 Thread Saurabh Sethi
That’s what I found out that the clocks were not in sync. But I have setup NTP on all 3 nodes and would expect the clocks to be in sync. From: Nate McCall n...@thelastpickle.commailto:n...@thelastpickle.com Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org

Column value not getting updated

2015-03-31 Thread Saurabh Sethi
I have written a unit test that creates a column family, inserts a row in that column family and then updates the value of one of the columns. After updating, unit test immediately tries to read the updated value for that column, but Cassandra returns the old value. * I am using

Re: Cross-datacenter requests taking a very long time.

2015-03-31 Thread daemeon reiydelle
What is your replication factor? Any idea how much data has to be processed under the query? With that few nodes (3) in each DC, even with replication=1, you are probably not getting much inter-node data transfer in a local quorum, until of course you do cross data centers and at least one full