Read-repair working, repair not working?

2013-02-10 Thread Brian Fleming
** Hi, ** ** I have a 20 node cluster running v1.0.7 split between 5 data centres, each with an RF of 2, containing a ~1TB unique dataset/~10TB of total data. ** ** I’ve had some intermittent issues with a new data centre (3 nodes, RF=2) I brought online late last year with data

High CPU usage during repair

2013-02-10 Thread Tamar Fraenkel
Hi! I run repair weekly, using a scheduled cron job. During repair I see high CPU consumption, and messages in the log file INFO [ScheduledTasks:1] 2013-02-10 11:48:06,396 GCInspector.java (line 122) GC for ParNew: 208 ms for 1 collections, 1704786200 used; max is 3894411264 From time to time,

Re: Netflix/Astynax Client for Cassandra

2013-02-10 Thread Renato Marroquín Mogrovejo
Sorry to hijack into this email thread, but what are the use cases/benefits of using the new binary protocol? and why doesn't Cassandra offer a drive as a project driver? Renato M. 2013/2/8 aaron morton aa...@thelastpickle.com: I'm going to guess Netflix are running Astynax in production with

Re: High CPU usage during repair

2013-02-10 Thread aaron morton
During repair I see high CPU consumption, Repair reads the data and computes a hash, this is a CPU intensive operation. Is the CPU over loaded or is just under load? I run Cassandra version 1.0.11, on 3 node setup on EC2 instances. What machine size? there are compactions waiting. That's

Re: Read-repair working, repair not working?

2013-02-10 Thread aaron morton
I’d request data, nothing would be returned, I would then re-request the data and it would correctly be returned: What CL are you using for reads and writes? I see a number of dropped ‘MUTATION’ operations : just under 5% of the total ‘MutationStage’ count. Dropped mutations in a multi

Re: High CPU usage during repair

2013-02-10 Thread Tamar Fraenkel
Hi! Thanks for the response. See my answers and questions below. Thanks! Tamar *Tamar Fraenkel * Senior Software Engineer, TOK Media [image: Inline image 1] ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On Sun, Feb 10, 2013 at 10:04 PM, aaron morton

Re: Issues with writing data to Cassandra column family using a Hive script

2013-02-10 Thread aaron morton
Don't use the variable length Cassandra integer, use the Int32Type. It also sounds like you want to use a DoubleType rather than FloatType. http://www.datastax.com/docs/datastax_enterprise2.2/solutions/about_hive#hive-to-cassandra-table-mapping Cheers - Aaron Morton

Re: Cassandra 1.1.2 - 1.1.8 upgrade

2013-02-10 Thread aaron morton
I would do #1. You can play with nodetool setcompactionthroughput to speed things up, but beware nothing comes for free. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 10/02/2013, at 6:40 AM, Mike

Re: Cassandra flush spin?

2013-02-10 Thread aaron morton
Sounds like flushing due to memory consumption. The flush log messages include the number of ops, so you can see if this node was processing more mutations that the others. Try to see if there was more (serialised) data being written or more operations being processed. Also just for fun

Re: persisted ring state

2013-02-10 Thread aaron morton
Is that the right way to do? No. If you want to change the token for a node use nodetool move. Changing it like this will not make the node change it's token. Because after startup the token is stored in the System.LocationInfo CF. or -Dcassandra.load_ring_state=false|true is only limited

Re: Issues with writing data to Cassandra column family using a Hive script

2013-02-10 Thread Dinusha Dilrukshi
Hi Aaron, Thanks for the reply.. I ll try out your suggestion. Regards, Dinusha. On Mon, Feb 11, 2013 at 1:55 AM, aaron morton aa...@thelastpickle.comwrote: Don't use the variable length Cassandra integer, use the Int32Type. It also sounds like you want to use a DoubleType rather than

Querying composite keys

2013-02-10 Thread Rishabh Agrawal
Hello I have key and columns defined in following fashion: HotelName1:RoomNum1 HotelName2:RoomNum2 HotelName3:RoomNum3 Key1:TimeStamp:VersionNum Is there a way that I can query this schema by only 'key' or 'HotelName' i.e. querying using a part of composite key and not the full

Re: Querying composite keys

2013-02-10 Thread Vivek Mishra
You can query over composite columns as: 1) Partition key 2) First part of clustered key(using EQ ops). Secondary indexes over non composite columns are not possible. -Vivek On Mon, Feb 11, 2013 at 12:06 PM, Rishabh Agrawal rishabh.agra...@impetus.co.in wrote: Hello I have key and

Re: Cassandra 1.1.2 - 1.1.8 upgrade

2013-02-10 Thread Michal Michalski
2) Upgrade one node at a time, running the clustered in a mixed 1.1.2-1.1.9 configuration for a number of days. I'm about to upgrade my 1.1.0 cluster and http://www.datastax.com/docs/1.1/install/upgrading#info says: If you are upgrading to Cassandra 1.1.9 from a version earlier than 1.1.7,