Counters and replication factor

2011-11-07 Thread Alain RODRIGUEZ
Hi, I trying to switch from a RF = 1 to a RF = 3, but I get wrong values from counters when doing so... I got a CF that contains many counters of some events. When I'm at RF = 1 and simulate 10 events, they are well counted. However, when I switch to a RF = 3, my counter show a wrong value that

Re: Counters and replication factor

2011-11-07 Thread Alain RODRIGUEZ
) counter.increment (v=3) counter.getValue = returns 7 counter.getValue = returns 7 counter.getValue = returns 7 or something inconsistent like that? On Mon, Nov 7, 2011 at 9:09 AM, Alain RODRIGUEZ arodr...@gmail.comwrote: I've tried with CL.All, but it doesn't wotk better. I still have strange

Re: Counters and replication factor

2011-11-08 Thread Alain RODRIGUEZ
, Alain RODRIGUEZ arodr...@gmail.comwrote: I retried it after restarting all the servers. I still have wrong results (I simulated an event 5 times and it was counted 3 times by some counters 4 or 5 times by others. What I meant by but now every request returns me always the same count value

OOM : key caches, mmap ?

2011-11-10 Thread Alain RODRIGUEZ
Hi, I faced a similar issue as described there : http://comments.gmane.org/gmane.comp.db.cassandra.user/11184. I was running Cassandra 1.0.0 with a 3 node cluster on 3 t1.micro from Amazon EC2. I have no error in cassandra logs, but an OOM in /var/log/kern.log which put one of my nodes down.

Compaction - CPU load 100% - time out

2011-11-15 Thread Alain RODRIGUEZ
Hi, I'm running a 3 node cassandra 1.0.2 cluster on 3 Amazon EC2 t1.micro. I managed to fix some OOM I had, but I still have some spike of cpu load. I know that t1.micro have small resources, but I think it could be enough if they were well managed. My application works well, excepted when

Re: key and storage proximity

2011-11-21 Thread Alain RODRIGUEZ
I think taht it depends on your partitioner. With an OPP you will have this behaviour. With a RP you won't have this behaviour. The hash randomize the keys, which is exactly its job to be sure that even if you are storing sequences you won't create hot spots. Alain 2011/11/21 Kent Tong

Re: Compaction - CPU load 100% - time out

2011-11-22 Thread Alain RODRIGUEZ
(cassandra.yaml) as low as you possibly can (1 is the minimum I believe), try disabling hinted handoff (on all nodes), and use lower read/write consistency levels if you can. ** ** Dan ** ** *From:* Alain RODRIGUEZ [mailto:arodr...@gmail.com] *Sent:* November-15-11 6:34 *To:* user

Re: Compaction - CPU load 100% - time out

2011-11-22 Thread Alain RODRIGUEZ
. start by turning compaction_throughput_mb_per_sec all the way down to 1MB/s. On Tue, Nov 22, 2011 at 9:58 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: I followed your advice and install a 3 m1.small instance cluster. The problem is still there. I've got less timeouts because I have less

Re: Increase replication factor

2011-12-05 Thread Alain RODRIGUEZ
http://www.datastax.com/docs/1.0/operations/cluster_management#changing-the-replication-factor It looks like a nodetool repair is needed after increasing your RF. I guess that a cleanup has to be performed after decreasing the RF to remove useless replica. Alain 2011/12/5 A J s5a...@gmail.com

Re: Schematool

2011-12-08 Thread Alain RODRIGUEZ
You should be able to use the CLI show schema yourkeyspace if your cassandra is recent enough ( = 0.8 if I remember well. I think it is better if you are in 0.8.7 because this command was fixed a couple of times in 8.6 and 8.7). You can put the show schema command into a file and call it with :

Re: BulkLoader

2011-12-09 Thread Alain RODRIGUEZ
Hi, I'm running a 4 nodes Cassandra cluster, and I'm facing the same problem (node not present on nodetool ring, but unreachable on CLI describe cluster...). I'm currently running version 1.0.2, but I have update from 0.8.x, the problem may exist since a while, I don't really know. I can't stop my

Re: BulkLoader

2011-12-09 Thread Alain RODRIGUEZ
By the way, nice comment on the patch // do not pass go, do not collect 200 dollars, just gtfo, it looks like you have some fun while developping Cassandra @Datastax ;) Alain 2011/12/9 Alain RODRIGUEZ arodr...@gmail.com Hi, I'm running a 4 nodes Cassandra cluster, and I'm facing the same

Re: Schematool

2011-12-11 Thread Alain RODRIGUEZ
it. ** ** Thanks Michael ** ** *From:* Alain RODRIGUEZ [mailto:arodr...@gmail.com] *Sent:* Thursday, December 08, 2011 11:52 AM *To:* user@cassandra.apache.org *Subject:* Re: Schematool ** ** You should be able to use the CLI show schema yourkeyspace if your cassandra

Re: show schema bombs in 0.8.6

2011-12-13 Thread Alain RODRIGUEZ
Hi, this reminds me a problem I had with a truncate. I think that to show schema, you're cluster must be stable and your schema agreed across the cluster. See if your cluster is really the one described by your nodetool ring and if the same schema is present on each node. You should visit this

Counters != Counts

2011-12-14 Thread Alain RODRIGUEZ
Hi everybody. I'm using a lot of counters to make statistics on a 4 nodes cluster (ec2 m1.small) with phpcassa (cassandra v1.0.2). I store some events and increment counters at the same time. Counters give me over-counts compared with the count of every corresponding events. I sure that my

Re: Counters != Counts

2011-12-16 Thread Alain RODRIGUEZ
, Alain RODRIGUEZ arodr...@gmail.comwrote: Hi everybody. I'm using a lot of counters to make statistics on a 4 nodes cluster (ec2 m1.small) with phpcassa (cassandra v1.0.2). I store some events and increment counters at the same time. Counters give me over-counts compared with the count

Re: Counters != Counts

2011-12-20 Thread Alain RODRIGUEZ
By the way, I wonder if a rolling restart is still possible when I am using counters ? Don't we lose the SPOF too, considering that counters are wrong every time we retry to write them with the client ? 2011/12/16 Alain RODRIGUEZ arodr...@gmail.com Can we have a hope that counters

Re: Presentations from NYC?

2011-12-27 Thread Alain RODRIGUEZ
Anything new about this ? I'm specifically interestead in the Joe Stein (Medialets) talk about how to manage real-time multidimensional metrics. 2011/12/10 Jonathan Ellis jbel...@gmail.com Not yet -- we're working on it. On Fri, Dec 9, 2011 at 1:48 PM, Brian O'Neill b...@alumni.brown.edu

Hadoop + Cassandra

2012-01-06 Thread Alain RODRIGUEZ
Hello. I have a 4 nodes cluster running Cassandra (without Datastax Brisk) in production. Now I want to add hadoop (and maybe Pig / Hive ?) to be able to perform some analytics. I don't know how to get started ? Is there a tutorial explaining how to install, configure and use hadoop

Re: Incremental backups

2012-01-18 Thread Alain RODRIGUEZ
As this option is in the cassandra.yaml file, you might need to perform a restart of your entire cluster (a rolling restart should work). Hope this will help. Alain 2012/1/18 Michael Vaknine micha...@citypath.com Hi, I am configured to do incremental backups on all my node on the

How to store unique visitors in cassandra

2012-01-18 Thread Alain RODRIGUEZ
I'm wondering how to modelize my CFs to store the number of unique visitors in a time period in order to be able to request it fast. I thought of sharding them by day (row = 20120118, column = visitor_id, value = '') and perform a getcount. This would work to get unique visitors per day, per week

Re: How to store unique visitors in cassandra

2012-01-19 Thread Alain RODRIGUEZ
Lucas de Souza Santos lucas...@gmail.com Why not http://www.countandra.org/ Lucas de Souza Santos (ldss) On Wed, Jan 18, 2012 at 3:23 PM, Alain RODRIGUEZ arodr...@gmail.comwrote: I'm wondering how to modelize my CFs to store the number of unique visitors in a time period in order to be able

Re: How to store unique visitors in cassandra

2012-01-19 Thread Alain RODRIGUEZ
/cassandra-nyc-2011-data-modeling Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 19/01/2012, at 10:30 PM, Alain RODRIGUEZ wrote: Hi thanks for your answer but I don't want to add more layer on top of Cassandra. I also have done all of my

Re: Any tools like phpMyAdmin to see data stored in Cassandra ?

2012-02-01 Thread Alain RODRIGUEZ
If you want to try any others UI to browse your Cassandra data I can suggest you 2 : - https://github.com/sebgiroux/Cassandra-Cluster-Admin - http://code.google.com/p/cassui/ The first one is built to be a phpMyAdmin Like. Alain 2012/1/31 Brandon Williams dri...@gmail.com On Sun, Jan 29,

Re: Unsubscribe

2012-02-03 Thread Alain RODRIGUEZ
Help yourself : http://wiki.apache.org/cassandra/FAQ#unsubscribe Alain 2012/2/3 francesco.tangari@gmail.com lol -- francesco.tangari@gmail.com Inviato con Sparrow http://www.sparrowmailapp.com/?sig Il giorno venerdì 3 febbraio 2012, alle ore 14.51, Larry Liu ha scritto: Please

Flume and Cassandra

2012-02-09 Thread Alain RODRIGUEZ
Hi, 1 - I would like to generate some statistics and store some raw events from log files tailed with flume. I saw some plugins giving Cassandra sinks but I would like to store data in a custom way, storing raw data but also incrementing counters to get near real-time statistcis. How to do it ?

Re: Flume and Cassandra

2012-02-21 Thread Alain RODRIGUEZ
/#_buffered_sink_and_decorator_semantics Hope that helps. A - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 10/02/2012, at 4:27 AM, Alain RODRIGUEZ wrote: Hi, 1 - I would like to generate some statistics and store some raw events from log files tailed

Re: Flume and Cassandra

2012-02-23 Thread Alain RODRIGUEZ
Tanks for all these informations. Twitter Kestrel-Storm-Cassandra solution looks very powerfull, scalable and well documented. I'll try to use this solution. Alain 2012/2/23 Milind Parikh milindpar...@gmail.com Coolwww.countandra.org calls them cascaded counters and it will be also based

Re: upgrade from 1.0.7 to 1.0.8

2012-03-14 Thread Alain RODRIGUEZ
Did everything go well with your update ? Do the version 1.0.8 worth the risk of an update? Alain 2012/3/11 Tamar Fraenkel ta...@tok-media.com Thanks! will check it in the following days :) *Tamar Fraenkel * Senior Software Engineer, TOK Media [image: Inline image 1]

Re: Adding nodes to cluster (Cassandra 1.0.8)

2012-03-14 Thread Alain RODRIGUEZ
It looks like Cassandra is not able to find your existing cluster. Do you use the same name of cluster in the 2 nodes ? Are you able to contact the existing node on port 7000 from the new node ? Is the first node defined as a seed in the cassandra.yaml of your 2 nodes ? If you can't make a

Re: Counter Column

2012-04-03 Thread Alain RODRIGUEZ
Sylvain explained a lot of things about counters at Cassandra SF 2011 : http://blip.tv/datastax/counters-in-cassandra-5497678 (video), http://www.datastax.com/wp-content/uploads/2011/07/cassandra_sf_counters.pdf(slides). I think it is always important knowing how the things work. Alain 2012/4/3

Trouble with wrong data

2012-04-10 Thread Alain RODRIGUEZ
Hi, I'm experimenting a strange and very annoying phenomena. I had a problem with the commit log size which grew too much and full one of the hard disks in all my nodes almost at the same time (2 nodes only, RF=2, so the 2 nodes are behaving exactly in the same way) My data are mounted in an

Re: Trouble with wrong data

2012-04-10 Thread Alain RODRIGUEZ
By the way, I am using Cassandra 1.0.7, CL = ONE (R/W), RF = 2, 2 EC2 c1.medium nodes cluster Alain 2012/4/10 Alain RODRIGUEZ arodr...@gmail.com Hi, I'm experimenting a strange and very annoying phenomena. I had a problem with the commit log size which grew too much and full one of the hard

Re: Trouble with wrong data

2012-04-13 Thread Alain RODRIGUEZ
are not idempotent. If you client retries a counter operation it may result in the increment been applied twice. Could this have been your issue ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 11/04/2012, at 2:35 AM, Alain RODRIGUEZ

Re: desuscribe

2012-05-03 Thread Alain RODRIGUEZ
Did you mean unsubscribe ? One more time : http://wiki.apache.org/cassandra/FAQ#unsubscribe. Message to all : You can unsubscribe, that's up to you, but please stop flooding people that continue using this mailing list. Alain 2012/5/2 Daniel Gimenez danie...@gmail.com

Re: Thrift error occurred during processing of message

2012-05-11 Thread Alain RODRIGUEZ
Hi, I guess you finally solved this issue. I'm experimenting the same one when trying to upgrade to phpcass 1.0.a.1. Do you remember how you fixed it or what the problem was exactly ? Thanks, Alain 2011/12/19 Tamil selvan R.S tamil.3...@gmail.com Hi, We are using PHPCassa to connect to

Re: Thrift error occurred during processing of message

2012-05-11 Thread Alain RODRIGUEZ
:) Alain 2012/5/11 Alain RODRIGUEZ arodr...@gmail.com Hi, I guess you finally solved this issue. I'm experimenting the same one when trying to upgrade to phpcass 1.0.a.1. Do you remember how you fixed it or what the problem was exactly ? Thanks, Alain 2011/12/19 Tamil selvan R.S tamil.3

Re: Thrift error occurred during processing of message

2012-05-11 Thread Alain RODRIGUEZ
and calling get()? On Fri, May 11, 2012 at 8:16 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: I got the error above in cassandra logs. In my web browser I have the following error : 500 | Internal Server Error | TApplicationException Required field 'reversed' was not found in serialized data

Tuning cassandra (compactions overall)

2012-05-15 Thread Alain RODRIGUEZ
Hi, I'm using a 2 node cluster in production ( 2 EC2 c1.medium, CL.ONE, RF = 2, using RP) 1 - I got this kind of message quite often (let's say every 30 seconds) : WARN [ScheduledTasks:1] 2012-05-15 15:44:53,083 GCInspector.java (line 145) Heap is 0.8081418550931491 full. You may need to

Re: Tuning cassandra (compactions overall)

2012-05-16 Thread Alain RODRIGUEZ
Using c1.medium, we are currently able to deliver the service. What is the the benefit of having more memory ? I mean, I don't understand why having 1, 2, 4, 8 or 16 GB of memory is so different. In my mind, Cassandra will fill the heap and from then, start to flush and compact to avoid OOMing

Re: 1.1 not removing commit log files?

2012-05-21 Thread Alain RODRIGUEZ
commitlog_total_space_in_mb: 4096 By default this line is commented in 1.0.x if I remember well. I guess it is the same in 1.1. You really should remove this comment or your commit logs will entirely fill up your disk as it happened to me a while ago. Alain 2012/5/21 Pieter Callewaert

Re: Tuning cassandra (compactions overall)

2012-05-21 Thread Alain RODRIGUEZ
* reduce column_index_size_in_kb * reduce in_memory_compaction_limit_in_mb * increase index_interval * set concurrent_compactors to 2 Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/05/2012, at 12:40 AM, Alain RODRIGUEZ wrote

Wrong data after rolling restart

2012-05-21 Thread Alain RODRIGUEZ
Hi, I re-post this here because it's a new subject far away from my initial tuning questions. I wanted to try a new config. After doing a rolling restart I have all my counters false, with wrong values. I stopped my servers with the following : nodetool -h localhost disablegossip nodetool -h

Re: Wrong data after rolling restart

2012-05-21 Thread Alain RODRIGUEZ
Here are my 2 nodes starting logs, I hop it can help... https://gist.github.com/2762493 https://gist.github.com/2762495 Alain 2012/5/21 Alain RODRIGUEZ arodr...@gmail.com: Hi, I re-post this here because it's a new subject far away from my initial tuning questions. I wanted to try a new

Re: Tuning cassandra (compactions overall)

2012-05-22 Thread Alain RODRIGUEZ
not sure what you mean by And after restarting the second one I have lost all the consistency of my data. All my statistics since September are totally false now in production Can you give some examples? After restarting my 2 nodes (one after the other), All my counters have become wrong. The

Re: Tuning cassandra (compactions overall)

2012-05-24 Thread Alain RODRIGUEZ
, at 12:42 AM, Alain RODRIGUEZ wrote: not sure what you mean by And after restarting the second one I have lost all the consistency of my data. All my statistics since September are totally false now in production Can you give some examples? After restarting my 2 nodes (one after the other

Re: Snapshot failing on JSON files in 1.1.0

2012-05-28 Thread Alain RODRIGUEZ
I have the same error with the last Datastax AMI (1.0.9). Is that the same bug ? Requested snapshot for: cassa_teads Exception in thread main java.io.IOError: java.io.IOException: Unable to create hard link from /raid0/cassandra/data/cassa_teads/stats_product-hc-233-Index.db to

Re: Doubt in Row key range scan

2012-05-28 Thread Alain RODRIGUEZ
You are using the Random Partitioner. Using the RP is a good thing because you avoid hot spots, but it has its defaults too. You can't scan a slice of row, they won't be ordered because all your keys are stored using their md5 values. You should review your data model to use columns to order

Re: Snapshot failing on JSON files in 1.1.0

2012-06-13 Thread Alain RODRIGUEZ
:27 PM, Alain RODRIGUEZ wrote: I have the same error with the last Datastax AMI (1.0.9). Is that the same bug ? Requested snapshot for: cassa_teads Exception in thread main java.io.IOError: java.io.IOException: Unable to create hard link from /raid0/cassandra/data/cassa_teads/stats_product-hc

Re: Snapshot failing on JSON files in 1.1.0

2012-06-13 Thread Alain RODRIGUEZ
this need a restart ? Alain 2012/6/13 Alain RODRIGUEZ arodr...@gmail.com: Hi Aaron, We are using Ubuntu (AMI Datastax 1.0.9 as I said). Release:        10.10 Codename:       maverick ERROR [RMI TCP Connection(37732)-10.248.10.94] 2012-06-13 15:00:17,157 CLibrary.java (line 153) Unable

Re: Snapshot failing on JSON files in 1.1.0

2012-06-19 Thread Alain RODRIGUEZ
/Support On Wed, Jun 13, 2012 at 10:28 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: if I do : locate jna /opt/java/64/jdk1.6.0_31/db/docs/html/ref/rrefsqljnaturaljoin.html /root/.m2/repository/net/java/dev/jna /root/.m2/repository/net/java/dev/jna/jna /root/.m2/repository/net/java/dev/jna/jna

Re: Snapshot failing on JSON files in 1.1.0

2012-06-20 Thread Alain RODRIGUEZ
...@palominodb.com: On Tue, Jun 19, 2012 at 8:55 PM, Rob Coli rc...@palominodb.com wrote: On Tue, Jun 19, 2012 at 2:55 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: Unable to create hard link from /raid0/cassandra/data/cassa_teads/stats_product-hc-233-Data.db to /raid0/cassandra/data/cassa_teads/snapshots

CQL / ASSUME for keys

2012-06-27 Thread Alain RODRIGUEZ
Hi, I'm trying to do the following : update keyspace.CF set '2' = '2' + 12 WHERE KEY = 'mykey'; And got this answer: Bad Request: cannot parse 'mykey' as hex bytes Using this doesn't help: assume keyspace.CF(KEY) VALUES ARE text; (Found here

Unreachable node, not in nodetool ring

2012-07-19 Thread Alain RODRIGUEZ
Hi, I tried to add a node a few days ago and it failed. I finally made it work with an other node but now when I describe cluster on cli I got this : Cluster Information: Snitch: org.apache.cassandra.locator.Ec2Snitch Partitioner: org.apache.cassandra.dht.RandomPartitioner Schema

Re: Unreachable node, not in nodetool ring

2012-07-19 Thread Alain RODRIGUEZ
removetoken (via nodetool) to remove the token associated with 10.56.62.211. in case of failure, you can use removetoken -f instead. then, the unreachable IP should have disappeared. HTH On Thu, Jul 19, 2012 at 10:38 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: Hi, I tried to add a node

Re: Unreachable node, not in nodetool ring

2012-07-19 Thread Alain RODRIGUEZ
and it went fine. Next I decommissionned the node 10.248.10.94 and moved 10.59.21.241 to the token 0. Now I am on the situation described before. Alain 2012/7/19 Alain RODRIGUEZ arodr...@gmail.com: Hi, I wasn't able to see the token used currently by the 10.56.62.211 (ghost node). I already

Re: Unreachable node, not in nodetool ring

2012-07-20 Thread Alain RODRIGUEZ
On 19/07/2012, at 9:37 PM, Alain RODRIGUEZ wrote: Not sure if this may help : nodetool -h localhost gossipinfo /10.58.83.109 RELEASE_VERSION:1.1.2 RACK:1b LOAD:5.9384978406E10 SCHEMA:e7e0ec6c-616e-32e7-ae29-40eae2b82ca8 DC:eu-west STATUS:NORMAL

Re: Unreachable node, not in nodetool ring

2012-07-23 Thread Alain RODRIGUEZ
Does anyone knows how to totally remove a dead node that only appears when doing a describe cluster from the cli ? I still got this issue in my production cluster. Alain 2012/7/20 Alain RODRIGUEZ arodr...@gmail.com: Hi Aaron, I have repaired and cleanup both nodes already and I did it after

Re: Unreachable node, not in nodetool ring

2012-07-27 Thread Alain RODRIGUEZ
Hi again, Nobody has a clue about this issue ? I'm still facing this problem. Alain 2012/7/23 Alain RODRIGUEZ arodr...@gmail.com: Does anyone knows how to totally remove a dead node that only appears when doing a describe cluster from the cli ? I still got this issue in my production

Re: Dead node still being pinged

2012-08-06 Thread Alain RODRIGUEZ
Hi sorry about reopening this old thread but I think that this function (unsafeAssassinateEndpoint) can help me to resolve a problem I have in production for a while : http://grokbase.com/t/cassandra/user/127knx7nn0/unreachable-node-not-in-nodetool-ring I have no Idea about how to use it, Can

Re: Project Management

2012-08-06 Thread Alain RODRIGUEZ
Cassandra modeling is well documented on the web and a bit too complex to be explained in one mail. I advice you reading a lot before you make modeling choices. You may start with these links :

Re: Unreachable node, not in nodetool ring

2012-08-08 Thread Alain RODRIGUEZ
Hi, I finally successfully removed the ghost node using unsafeAssassinateEndpoint() as described there : http://tumblr.doki-pen.org/post/22654515359/assinating-cassandra-nodes, I hope this can help more people. Nodetool gossipinfo gives me now the following info for the ghost node :

Re: Heap size question

2012-08-21 Thread Alain RODRIGUEZ
I have the same configuration and I recently change my cassandra-sh.yaml to : MAX_HEAP_SIZE=4G HEAP_NEWSIZE=200M I guess it depends on how much you use the cache (which is now in the off-heap memory). I don't use row cache and use the default key cache size. I have no more memory pressure nor

Re: Heap size question

2012-08-21 Thread Alain RODRIGUEZ
[image: Inline image 1] ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On Tue, Aug 21, 2012 at 12:57 PM, Alain RODRIGUEZ arodr...@gmail.comwrote: I have the same configuration and I recently change my cassandra-sh.yaml to : MAX_HEAP_SIZE=4G

Re: Ec2Snitch to Ec2MultiRegionSnitch

2013-04-24 Thread Alain RODRIGUEZ
://www.thelastpickle.com On 23/04/2013, at 7:26 PM, Alain RODRIGUEZ arodr...@gmail.com wrote: Hi,these advice are very welcome. @Dane, about the rack awareness, we use only one rack per DC, so I guess using EC2MultiRegionSnitch will do just fine and it doesn't need any configuration. Does

Deletes, null values

2013-04-25 Thread Alain RODRIGUEZ
Hi, I tried to delete some columns using cql2 as well as thrift on C*1.2.2 and instead of being unreachable, deleted columns have a null value. I am using no value in this CF, the only information I use is the existence of the column. So when I select all the column for a given key I have the

Re: Deletes, null values

2013-04-26 Thread Alain RODRIGUEZ
sor...@gmail.com On 2013-04-25 11:48, Alain RODRIGUEZ wrote: Hi, I tried to delete some columns using cql2 as well as thrift on C*1.2.2 and instead of being unreachable, deleted columns have a null value. I am using no value in this CF, the only information I use is the existence

Re: How to change existing cluster to multi-center

2013-04-26 Thread Alain RODRIGUEZ
I just asked this exact same question but after maybe after reading a bit more doc than you did. You may want to read this thread: http://grokbase.com/t/cassandra/user/134j85av4x/ec2snitch-to-ec2multiregionsnitch You also may want to read some doc. Datastax explain things quite well and update

Re: Deletes, null values

2013-04-26 Thread Alain RODRIGUEZ
I copied the wrong query: In CQL 2 it was: delete '1228#16857','1228#16866','1228#16875' from myCF where key = 'all'; Sorry about the mistake. 2013/4/26 Alain RODRIGUEZ arodr...@gmail.com Of course: From CQL 2 (cqlsh -2): delete '183#16684','183#16714','183#16717' from myCF where key

Re: CQL update and TTL

2013-04-26 Thread Alain RODRIGUEZ
This seems to be the correct behavior. An update refreshes the TTL, as it does in memcache for example. Yet, what I do not know is whether this behavior can be changed somehow to let the initial TTL, this might be useful on some use cases. Alain 2013/4/26 Shahryar Sedghi shsed...@gmail.com

Re: CQL update and TTL

2013-04-26 Thread Alain RODRIGUEZ
That is more or less what I was guessing, thanks for these precision. 2013/4/26 Sylvain Lebresne sylv...@datastax.com This is indeed intended. That behavior is largely dictated by how the storage engine works, and the fact that an update does no read internally in particular. Yet, what I

Re: Deletes, null values

2013-04-29 Thread Alain RODRIGUEZ
Zealand @aaronmorton http://www.thelastpickle.com On 27/04/2013, at 12:48 AM, Sorin Manolache sor...@gmail.com wrote: On 2013-04-26 11:55, Alain RODRIGUEZ wrote: Of course: From CQL 2 (cqlsh -2): delete '183#16684','183#16714','183#16717' from myCF where key = 'all'; And selecting

Re: How does a healthy node look like?

2013-05-02 Thread Alain RODRIGUEZ
Well, maybe should you describe us your hardware and the C* release toi are using. Also give us some metrics. Le 30 avr. 2013 18:48, Steppacher Ralf ralf.steppac...@derivativepartners.com a écrit : Hi, I have troubles finding some quantitative information as to how a healthy Cassandra node

Re: (unofficial) Community Poll for Production Operators : Repair

2013-05-14 Thread Alain RODRIGUEZ
Hi Rob, 1) 1.2.2 on 6 to 12 EC2 m1.xlarge 2) Quorum RW . Almost no deletes (just some TTL) 3) Yes 4) On each node once a week (rolling repairs using crontab) 5) The only behavior that is quite odd or unexplained to me is why a repair doesn't fix a counter mismatch between 2 nodes. I mean when I

Re: (unofficial) Community Poll for Production Operators : Repair

2013-05-15 Thread Alain RODRIGUEZ
on CL_QUOROM instead of just one (as itself was one as well) or something like that. (I don't know too much as my colleague was the one that debugged this issue) Dean From: Alain RODRIGUEZ arodr...@gmail.commailto:arodr...@gmail.com Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org

Re: Decommission nodes starts to appear from one node (1.0.11)

2013-05-16 Thread Alain RODRIGUEZ
Not sure to understand you correctly, but if you are dealing with ghost nodes that you want to remove, I never saw a node that could resist to an unsafeAssassinateEndpoint. http://grokbase.com/t/cassandra/user/12b9eaaqq4/remove-crashed-node

Re: (unofficial) Community Poll for Production Operators : Repair

2013-05-16 Thread Alain RODRIGUEZ
”, but they all required some form of user intervention. Riak 1.3 introduces automatic, self-healing properties that repair entropy on an ongoing basis. On Wed, May 15, 2013 at 5:32 PM, Robert Coli rc...@eventbrite.com wrote: On Wed, May 15, 2013 at 1:27 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: Rob

vnodes ready for production ?

2013-05-16 Thread Alain RODRIGUEZ
Hi, Adding vnodes is a big improvement to Cassandra, specifically because we have a fluctuating load on our Cassandra depending on the week, and it is quite annoying to add some nodes for one week or two, move tokens and then having to remove them and then move tokens again. Even more if we could

Re: (unofficial) Community Poll for Production Operators : Repair

2013-05-16 Thread Alain RODRIGUEZ
, Alain RODRIGUEZ arodr...@gmail.com wrote: Rob, I was wondering something. Are you a commiter working on improving the repair or something similar ? I am not a committer [1], but I have an active interest in potential improvements to the best practices for repair. The specific change that I

Re: Unreachable Nodes

2013-05-22 Thread Alain RODRIGUEZ
Hi. I think that the unsafeAssassinateEndpoint was the good solution here. I was going to lead you to this solution after reading the first part of your message. Does anyone know why the dead nodes still appear when we run nodetool gossipinfo but they don't when we run describe cluster from the

Re: Unreachable Nodes

2013-05-22 Thread Alain RODRIGUEZ
an eye on it then. Many thanks, Vasilis On Wed, May 22, 2013 at 10:54 AM, Alain RODRIGUEZ arodr...@gmail.comwrote: Hi. I think that the unsafeAssassinateEndpoint was the good solution here. I was going to lead you to this solution after reading the first part of your message. Does

Can't reach itself

2013-06-04 Thread Alain RODRIGUEZ
Hi, I have an issue since switch to multiple DC. I use AWS EC2 instances, C*1.2.2, 12 nodes eu-west + 6 nodes us-east (new DC). Datacenter: eu-west === Status=Up/Down |/ State=Normal/Leaving/Joining/Moving -- Address Load Owns Host ID UN public ip 133.43

Re: Can't reach itself

2013-06-04 Thread Alain RODRIGUEZ
I see a lot of hinted handoff compactions too. I might have not been clear enough, I see a lot of compaction of system.hints that I interpret as being due to a lot of data that couldn't reach their destination. 2013/6/4 Alain RODRIGUEZ arodr...@gmail.com Hi, I have an issue since switch

Looking for a fully working AWS multi DC configuration.

2013-06-05 Thread Alain RODRIGUEZ
Hi, We use to work on a single DC (EC2Snitch / SimpleStrategy). For latency reason we had top open a new DC in the US (us-east). We run C* 1.2.2. We don't use VPC. Now we use: - 2 DC (eu-west, us-east) - EC2MultiRegionSnitch / NTS - public IPs as broadcast_address and seeds - private IPs as

Re: Looking for a fully working AWS multi DC configuration.

2013-06-05 Thread Alain RODRIGUEZ
DC need access over both public and private IP on the storage port (7000/7001). Nodes from other DC will need access over public IP on the storage port. All Cassandra nodes also need access over the public IP on the Thrift port (9160). ** ** Dan ** ** *From:* Alain

Re: Dynamic Columns Question Cassandra 1.2.5, Datastax Java Driver 1.0

2013-06-06 Thread Alain RODRIGUEZ
Not sure if you remember this Jonathan, but Sylvain already wrote a very clear documentation about it : http://www.datastax.com/dev/blog/thrift-to-cql3 (OCTOBER 26, 2012) Yet a second page will give to this important topic a greater visibility. 2013/6/6 Jonathan Ellis jbel...@gmail.com This

Re: replication factor is zero

2013-06-06 Thread Alain RODRIGUEZ
But afaik you can set the RF only per Keyspace. So you will have to pull those tables apart, in a different Keyspace. 2013/6/6 Tyler Hobbs ty...@datastax.com On Thu, Jun 6, 2013 at 1:28 PM, Daning Wang dan...@netseer.com wrote: could we set replication factor to 0 on other data center? what

Re: Why so many vnodes?

2013-06-11 Thread Alain RODRIGUEZ
I think he actually meant *increase*, for this reason For small T, a random choice of initial tokens will in most cases give a poor distribution of data. The larger T is, the closer to uniform the distribution will be, with increasing probability. Alain 2013/6/11 Theo Hultberg t...@iconara.net

Re: Multiple data center performance

2013-06-12 Thread Alain RODRIGUEZ
counter will replicate to all replicas during write regardless the consistency level I that the normal behavior or a bug ? 2013/6/11 Daning Wang dan...@netseer.com It is counter caused the problem. counter will replicate to all replicas during write regardless the consistency level. In our

Re: [Cassandra] Expanding a Cassandra cluster

2013-06-12 Thread Alain RODRIGUEZ
We run it concurrently each RF nodes (If RF = 3, we run it on 3 waves). If the node is busy cleaning up, then the client will time out and ask to an other node having a copy of the data and that is not being cleaned up. Will node tool cleanup consume lot of IO and CPU even though there is nothing

Re: Multiple data center performance

2013-06-12 Thread Alain RODRIGUEZ
12, 2013 at 4:56 AM, Alain RODRIGUEZ arodr...@gmail.comwrote: counter will replicate to all replicas during write regardless the consistency level I that the normal behavior or a bug ? 2013/6/11 Daning Wang dan...@netseer.com It is counter caused the problem. counter will replicate to all

Re: Multiple data center performance

2013-06-12 Thread Alain RODRIGUEZ
replicas. The consistency level only decides how replica acknowledgement are waited for. -- Sylvain On Wed, Jun 12, 2013 at 4:56 AM, Alain RODRIGUEZ arodr...@gmail.comwrote: counter will replicate to all replicas during write regardless the consistency level I that the normal behavior

Re: Looking for a fully working AWS multi DC configuration.

2013-06-14 Thread Alain RODRIGUEZ
as the source. ** ** *From:* Alain RODRIGUEZ [mailto:arodr...@gmail.com] *Sent:* Wednesday, June 05, 2013 5:45 PM *To:* user@cassandra.apache.org *Subject:* Re: Looking for a fully working AWS multi DC configuration. ** ** Do you open all these nodes one by one on every Security Group

Re: vnodes ready for production ?

2013-06-18 Thread Alain RODRIGUEZ
Any insights on vnodes, one month after my original post ? 2013/5/16 Alain RODRIGUEZ arodr...@gmail.com Hi, Adding vnodes is a big improvement to Cassandra, specifically because we have a fluctuating load on our Cassandra depending on the week, and it is quite annoying to add some nodes

Re: opscentrer is spying

2013-06-20 Thread Alain RODRIGUEZ
Good, fast and appreciated reaction from Datastax. Also thanks to Radim for the warning. Alain, Opscenter-free user. 2013/6/20 Nick Bailey n...@datastax.com Thanks everyone. We always appreciate constructive criticism. Regarding what OpsCenter collects, we completely agree it should be

Re: going down from RF=3 to RF=2, repair constantly falls over with JVM OOM

2013-07-04 Thread Alain RODRIGUEZ
@Michal: all true, a clean up would certainly remove a lot of useless data there, and I also advice Evan to do it. However, Evan may want to continue repairing his cluster as a routine operation an there is no reason a RF change shouldn't lead to this kind of issues. @Evan : With this amount of

Restart node = hinted handoff flood

2013-07-04 Thread Alain RODRIGUEZ
Hi, Using C*1.2.2 12 EC2 xLarge cluster. When I restart a node, if it spend a few minutes down, when I bring it up, all the cpu are blocked at 100%, even once compactions are disabled, inducing a very big and intolerable latency in my app. I suspect Hinted Handoff to be the cause of this.

Re: Restart node = hinted handoff flood

2013-07-04 Thread Alain RODRIGUEZ
. How to avoid this Hinted Handoff flood on returning nodes ? Alain 2013/7/4 Alain RODRIGUEZ arodr...@gmail.com Hi, Using C*1.2.2 12 EC2 xLarge cluster. When I restart a node, if it spend a few minutes down, when I bring it up, all the cpu are blocked at 100%, even once compactions

Decommissioned nodes not leaving and Hinted Handoff flood

2013-07-09 Thread Alain RODRIGUEZ
Hi, C*1.2.2. I have removed 4 nodes with nodetool decommission. 2 of them have left with no issue, while the 2 others nodes remained leaving even after streaming their data. The only specific thing of these 2 nodes is that they had a lot of hints pending. Hints from a node that couldn't come

High performance hardware with lot of data per node - Global learning about configuration

2013-07-09 Thread Alain RODRIGUEZ
Hi, Using C*1.2.2. We recently dropped our 18 m1.xLarge (4CPU, 15GB RAM, 4 Raid-0 Disks) servers to get 3 hi1.4xLarge (16CPU, 60GB RAM, 2 Raid-0 SSD) servers instead, for about the same price. We tried it after reading some benchmark published by Netflix. It is awesome and I recommend it to

<    1   2   3   4   5   6   7   >