Re: Amazingly bad compaction performance

2012-06-27 Thread Igor
Hello Too much GC? Check JVM heap settings and real usage. On 06/27/2012 01:37 AM, Dustin Wenz wrote: We occasionally see fairly poor compaction performance on random nodes in our 7-node cluster, and I have no idea why. This is one example from the log: [CompactionExecutor:45]

Re: Amazingly bad compaction performance

2012-06-27 Thread Derek Andree
Last I heard only Oracle's JDK was officially supported with Cassandra, possibly nitpicky but is this still the case? On Jun 26, 2012, at 3:37 PM, Dustin Wenz wrote: (OpenJDK 7) was pegged at 200% CPU

Re: bulk load problem

2012-06-27 Thread Nury Redjepow
What is your yaml setting for rpc and listen server on destination node? Nury Tue, 26 Jun 2012 17:07:49 -0700 от James Pirz james.p...@gmail.com: Dear all, I am trying to use sstableloader in cassandra 1.1.1, to bulk load some data into a single node cluster. I am running the

Re: How to use row caching to enable faster retrieval of rows in Cassandra

2012-06-27 Thread Nury Redjepow
You should read this page for settings http://www.datastax.com/dev/blog/caching-in-cassandra-1-1 Basically, you have to set row_cache size in yaml, and create table with caching flags - all or rows_only, but beware that you shouldn't use row cache, if you using wide rows. Tue, 26 Jun 2012

Re: Interpreting system.log MeteredFlusher messages

2012-06-27 Thread aaron morton
, but I do not understand the remedy to the problem. Is increasing this variable my only option? There is nothing to be fixed. This is Cassandra flushing data to disk to free memory and checkpoint the commit log. I see memtables of serialized size of 100-200 MB with estimated live size of

Re: Fat Client Commit Log

2012-06-27 Thread aaron morton
Fat clients aren't involved in writes or HH, and I think my previous thought about it having some info in the System KS may be wrong. Can you recreate the issue ? Care to raise a ticket on https://issues.apache.org/jira/browse/CASSANDRA ? Thanks - Aaron Morton Freelance

Re: Interpreting system.log MeteredFlusher messages

2012-06-27 Thread rohit bhatia
On Wed, Jun 27, 2012 at 2:27 PM, aaron morton aa...@thelastpickle.com wrote: , but I do not understand the remedy to the problem. Is increasing this variable my only option? There is nothing to be fixed. This is Cassandra flushing data to disk to free memory and checkpoint the commit log.

Re: Enable CQL3 from Astyanax

2012-06-27 Thread aaron morton
Had a quick look, the current master does not appear to support it. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 26/06/2012, at 7:46 AM, Thierry Templier wrote: Hello, How can I enable CQL3 support in Astyanax? Thanks very much for

Re: repair never finishing 1.0.7

2012-06-27 Thread aaron morton
Setting up a Cassandra ring across NAT ( without a VPN ) is impossible in my experience. The broadcast_address allows a node to broadcast an address that is different to the ones it's bound to on the local interfaces https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L270

Re: repair never finishing 1.0.7

2012-06-27 Thread Andras Szerdahelyi
Aaron, The broadcast_address allows a node to broadcast an address that is different to the ones it's bound to on the local interfaces https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L270 Yes and thats not where the problem is IMO.. If you broadcast your translated address

Re: Secondary index data gone after restart (1.1.1)

2012-06-27 Thread aaron morton
CASSANDRA-3954 disabled caches on secondary index CF's in 1.1.0 and CASSANDRA-4197 enabled it in 1.1.1 Can you create a ticket on https://issues.apache.org/jira/browse/CASSANDRA I guessing this has something to do with the local partitioner used for the secondary index Cf. That would explain

Re: Enable CQL3 from Astyanax

2012-06-27 Thread Thierry Templier
Hello Aaron, Thanks very much for your answer! Could you give me some hints on how implement that? So I could contribute a patch for that. After having a look at the Astyanax code, I saw that CQL is executed in the ThriftColumnFamilyQueryImpl class based on the

Re: Multi datacenter, WAN hiccups and replication

2012-06-27 Thread aaron morton
Therefore I was wondering if Cassandra already intelligently optimizes for HH-over-WAN (since this is common) or alternately if there's a way to enable HH for WAN replication? When the coordinator is preparing to process the request, done nodes in a foreign DC are treated like down nodes in a

CQL / ASSUME for keys

2012-06-27 Thread Alain RODRIGUEZ
Hi, I'm trying to do the following : update keyspace.CF set '2' = '2' + 12 WHERE KEY = 'mykey'; And got this answer: Bad Request: cannot parse 'mykey' as hex bytes Using this doesn't help: assume keyspace.CF(KEY) VALUES ARE text; (Found here

Re: Enable CQL3 from Astyanax

2012-06-27 Thread Thierry Templier
Hello Aaron, I created an issue on the Astyanax github for this problem. I added a fix to support CQL3 in the tool. See the link https://github.com/Netflix/astyanax/issues/75. Thierry Had a quick look, the current master does not appear to support it. Cheers

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Raj N
Great stuff!!! On Tue, Jun 26, 2012 at 5:25 PM, Edward Capriolo edlinuxg...@gmail.comwrote: Hello all, It has not been very long since the first book was published but several things have been added to Cassandra and a few things have changed. I am putting together a list of changed content,

Node crashing during read repair

2012-06-27 Thread Robin Verlangen
Hi there, Today I found one node (running 1.1.1 in a 3 node cluster) being dead for the third time this week, it died with the following message: ERROR [ReadRepairStage:3] 2012-06-27 14:28:30,929 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[ReadRepairStage:3,5,main]

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Robin Verlangen
Hi Edward, Looking forward to your book. It's always interesting to read what others have to say about a certain subject, and hopefully even learn new things! 2012/6/27 Raj N raj.cassan...@gmail.com Great stuff!!! On Tue, Jun 26, 2012 at 5:25 PM, Edward Capriolo edlinuxg...@gmail.comwrote:

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Aaron Turner
Hey Edward, I finally posted my (short) blog post on using Hector with Jruby: http://synfin.net/sock_stream/technology/code/cassandra-hector-jruby-awesome If you're interested in documenting that more in detail in your book, let me know and I can help you with that in your book if you'd like.

Re: Amazingly bad compaction performance

2012-06-27 Thread Tyler Hobbs
On Wed, Jun 27, 2012 at 1:42 AM, Derek Andree dand...@lacunasystems.comwrote: Last I heard only Oracle's JDK was officially supported with Cassandra, possibly nitpicky but is this still the case? On Jun 26, 2012, at 3:37 PM, Dustin Wenz wrote: (OpenJDK 7) was pegged at 200% CPU Java 7

Re: bulk load problem

2012-06-27 Thread James Pirz
Thank you so much ! The problem was the RPC address, it was different than the listen address. I appreciate your help. Best, James On Wed, Jun 27, 2012 at 1:29 AM, Nury Redjepow nreje...@mail.ru wrote: What is your yaml setting for rpc and listen server on destination node? Nury Tue, 26

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Courtney Robinson
Sounds good. One thing I'd like to see is more coverage on Cassandra Internals. Out of the box Cassandra's great but having a little inside knowledge can be very useful because it helps you design your applications to work with Cassandra; rather than having to later make endless optimizations that

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Edward Capriolo
On Wed, Jun 27, 2012 at 3:08 PM, Courtney Robinson court...@crlog.info wrote: Sounds good. One thing I'd like to see is more coverage on Cassandra Internals. Out of the box Cassandra's great but having a little inside knowledge can be very useful because it helps you design your applications

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Brian O'Neill
RE: API method signatures changing That triggers another thought... What terminology will you use in the book to describe the data model? CQL? When we wrote the RefCard on DZonehttp://refcardz.dzone.com/refcardz/apache-cassandra, we intentionally favored/used CQL terminology. On advisement

Adding New Nodes to a Production Cluster

2012-06-27 Thread Eran Chinthaka Withana
Hi, We have a production cluster with few nodes in each data center. Each node is being contacted in each data center to serve front end requests. I have a question about the method adding new nodes to the cluster (say, to improve RF or scalability). AFAIK, there are two methods to do this. 1.

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Edward Capriolo
On Wed, Jun 27, 2012 at 4:34 PM, Brian O'Neill b...@alumni.brown.edu wrote: RE: API method signatures changing That triggers another thought... What terminology will you use in the book to describe the data model?  CQL? When we wrote the RefCard on DZone, we intentionally favored/used CQL

Re: Adding New Nodes to a Production Cluster

2012-06-27 Thread Robin Verlangen
Hi Eran, As far as I'm aware of a node will not serve requests until the bootstrap (starts automatically these days) has been completed. So the problem of #1 is not really there Solution #2 is not straight forward and easy to make mistakes. When you're concerned about read consistency use a

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Bill
I'm looking forward to getting a few copies of this. Some areas that would be great to cover - Indexing strategies - Configuring clients/env for sane timestamping - Efficient CQL - Top 8/10 perf issues/stacktraces and common resolutions - understanding nodetool

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Rustam Aliyev
Hi Edward, That's a great news! One thing I'd like to see in the new edition is Counters, known issues and how to avoid them: - avoid double counting (don't retry on failure, use write consistency level ONE, use dedicated Hector connector?) - delete counters (tricky, reset to zero?) -

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Franc Carter
On Thu, Jun 28, 2012 at 7:32 AM, Edward Capriolo edlinuxg...@gmail.comwrote: On Wed, Jun 27, 2012 at 4:34 PM, Brian O'Neill b...@alumni.brown.edu wrote: RE: API method signatures changing That triggers another thought... What terminology will you use in the book to describe the data

Re: Ball is rolling on High Performance Cassandra Cookbook second edition

2012-06-27 Thread Aaron Turner
On Wed, Jun 27, 2012 at 1:34 PM, Brian O'Neill b...@alumni.brown.edu wrote: RE: API method signatures changing That triggers another thought... What terminology will you use in the book to describe the data model?  CQL? When we wrote the RefCard on DZone, we intentionally favored/used CQL

which high level Java client

2012-06-27 Thread James Pirz
Dear all, I am interested in using Cassandra 1.1.1 in a read-intensive scenario, where more than 95% of my operations are get(). I have a cluster with ~10 nodes, around 15-20 GB of data on each, while in the extreme case I expect to have 20-40 concurrent clients. I am kind of confused about

Re: which high level Java client

2012-06-27 Thread Roshan
Hello We are using Hector and it perfectly matching to our case. https://github.com/hector-client/hector -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/which-high-level-Java-client-tp7580842p7580844.html Sent from the