CMS GC initial-mark taking 6 seconds , bad?

2011-09-25 Thread Yang
I see the following in my GC log: 1910.513: [GC [1 CMS-initial-mark: 2598619K(26214400K)] 13749939K(49807360K), 6.0696680 secs] [Times: user=6.10 sys=0.00, real=6.07 secs] So there is a stop-the-world period of 6 seconds. Does this sound bad? Or is 6 seconds OK and we should expect the built-in

messages stopped for 3 minutes?

2011-09-25 Thread Yang
I constantly see TimedOutException, then followed by UnavailableException in my logs, so I added some extra debugging to Gossiper.notifyFailureDetector() void notifyFailureDetector(InetAddress endpoint, EndpointState remoteEndpointState) { IFailureDetector fd =

Seed nodes in cassandra.yaml can not be hostnames

2011-09-25 Thread Radim Kolar
I just discovered that using host names for seed nodes in cassandra.yaml does not work. Is this done on purpose?

Re: adding node without bootstrap

2011-09-25 Thread Radim Kolar
If you join a node with auto_bootstrap=false you had better be working at quorum or higher to avoid stale/not found reads. You should then repair the node right away to get all the missing data back on the node. This is not suggested. It is best to leave auto_bootstrap=true and let Cassandra
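The "quorum or higher" advice above can be illustrated with a little arithmetic. This is a minimal sketch, not Cassandra code: the function names are made up for illustration, and it only uses the standard quorum size of floor(RF / 2) + 1. It shows that a quorum read has to consult at least one replica other than the freshly joined, empty node only when RF is 2 or more.

```python
def quorum(rf):
    """Number of replicas that must answer a QUORUM request for a given RF."""
    return rf // 2 + 1


def quorum_read_needs_other_replica(rf):
    """True if a quorum read cannot be satisfied by a single (possibly empty) node.

    Hypothetical helper for illustration only: with quorum(rf) >= 2 the
    coordinator must hear from at least one replica besides the new node.
    """
    return quorum(rf) >= 2


for rf in (1, 2, 3, 5):
    print(rf, quorum(rf), quorum_read_needs_other_replica(rf))
```

Note that with RF=1 the quorum size is also 1, so reading at quorum gives no extra protection in that case.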

Token != DecoratedKey assertion

2011-09-25 Thread Philippe
Hello, I've seen a couple of these in my logs, running 0.8.4. This is a RF=3, 3-node cluster. 2 nodes including this one are on 0.8.4 and one is on 0.8.5. The node is still functioning hours later. Should I be worried? Thanks ERROR [ReadStage:94911] 2011-09-24 22:40:30,043

Seed vs non-seed in YAML

2011-09-25 Thread Philippe
Hello, I'm deploying my cluster with Puppet so it's actually easier for me to add all cassandra nodes to the seed list in the YAML file than to choose a few. Would there be any reason NOT to do this ? Thanks

Re: frequent node UP/Down?

2011-09-25 Thread Philippe
I have this happening on 0.8.x. It looks to me as though this happens when the node is under heavy load such as unthrottled compactions or a huge GC. 2011/9/24 Yang tedd...@gmail.com: I'm using 1.0.0 and there seem to be too many node Up/Dead events detected by the failure detector. I'm using a 2

Re: Increasing thrift_framed_transport_size_in_mb

2011-09-25 Thread aaron morton
Some discussion of large data here: http://wiki.apache.org/cassandra/LargeDataSetConsiderations When creating large rows you also need to be aware of in_memory_compaction_limit_in_mb (see the yaml) and that all columns for a row are stored on the same node. So if you store one file in one row

Re: Can not connect to cassandra 0.7 using CLI

2011-09-25 Thread aaron morton
Make sure that the directory /var/log/cassandra exists and the user running cassandra has permission to use it. There are some instructions here in the readme file https://github.com/apache/cassandra/blob/cassandra-0.7.9/README.txt#L27 Good luck. A - Aaron Morton Freelance

Re: Could not reach schema agreement when adding a new node.

2011-09-25 Thread aaron morton
Check the schema agreement using the CLI by running describe cluster; it will tell you if they are in agreement. It may have been a temporary thing while the new machine was applying its schema. If the nodes are not in agreement or you want to dig deeper look for log messages from

Re: progress of sstableloader keeps 0?

2011-09-25 Thread aaron morton
That can read data from previous versions, i.e. if you upgrade to 0.8 it can read the existing files from 0.7. But what you are doing with the sstable loader is (AFAIK) only copying the Data portion of the CF. Once the table is loaded the node will then build the Index and the Filter, this is

Re: Moving to a new cluster

2011-09-25 Thread aaron morton
sounds like it. A - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 25/09/2011, at 6:10 PM, Yan Chunlu wrote: thanks! is that similar problem described in this thread?

Re: CMS GC initial-mark taking 6 seconds , bad?

2011-09-25 Thread aaron morton
It does seem long and will be felt by your application. Are you running a 47GB heap? Most people seem to think 8 to 12 GB is about the viable maximum. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 25/09/2011, at 7:14 PM, Yang
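The heap size asked about here can be read straight off the GC log line quoted in the original post. The following is a rough sketch, assuming the usual layout of a CMS initial-mark entry (old generation used and capacity first, then whole-heap used and capacity, then the pause). The regular expression and the function name are illustrative, not part of any JVM tooling.

```python
import re

# The log line exactly as quoted in the original post.
GC_LINE = (
    "1910.513: [GC [1 CMS-initial-mark: 2598619K(26214400K)] "
    "13749939K(49807360K), 6.0696680 secs] "
    "[Times: user=6.10 sys=0.00, real=6.07 secs]"
)

# Assumed layout: old_used(old_capacity)] heap_used(heap_capacity), pause secs
PATTERN = re.compile(
    r"CMS-initial-mark: (\d+)K\((\d+)K\)\] (\d+)K\((\d+)K\), ([\d.]+) secs"
)

KIB_PER_GIB = 1024 * 1024


def parse_initial_mark(line):
    """Return sizes in GiB and the pause in seconds, or None if no match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    old_used, old_cap, heap_used, heap_cap, pause = match.groups()
    return {
        "old_used_gib": int(old_used) / KIB_PER_GIB,
        "old_capacity_gib": int(old_cap) / KIB_PER_GIB,
        "heap_used_gib": int(heap_used) / KIB_PER_GIB,
        "heap_capacity_gib": int(heap_cap) / KIB_PER_GIB,
        "pause_secs": float(pause),
    }


print(parse_initial_mark(GC_LINE))
```

On the quoted line this gives a total heap capacity of 47.5 GiB with a 25 GiB old generation, which matches the "47GB heap" question.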

Re: progress of sstableloader keeps 0?

2011-09-25 Thread Yan Chunlu
Thanks! Another problem: what if the number of nodes in the two clusters is not the same? In my case I am moving data from a 3-node cluster to 1 node, and the keyspace files on the 3 nodes might use the same names... I am using the new cluster only for emergency usage, so only 1 node is attached. On Sun, Sep 25, 2011 at 5:20 PM,

Re: CMS GC initial-mark taking 6 seconds , bad?

2011-09-25 Thread Peter Schuller
I see the following in my GC log 1910.513: [GC [1 CMS-initial-mark: 2598619K(26214400K)] 13749939K(49807360K), 6.0696680 secs] [Times: user=6.10 sys=0.00, real=6.07 secs] so there is a stop-the-world period of 6 seconds. does this sound bad ? or 6 seconds is OK  and we should expect the

Re: frequent node UP/Down?

2011-09-25 Thread Radim Kolar
On 25.9.2011 9:29, Philippe wrote: I have this happening on 0.8.x It looks to me as this happens when the node is under heavy load such as unthrottled compactions or a huge GC. I have this problem too. Node down detection must be improved - increase timeouts a bit or make more tries

Re: Token != DecoratedKey assertion

2011-09-25 Thread Jonathan Ellis
Assertion errors are bugs, so that should worry you. However, I'd upgrade before filing a ticket. There were a lot of fixes in 0.8.5. On Sun, Sep 25, 2011 at 2:27 AM, Philippe watche...@gmail.com wrote: Hello, I've seen a couple of these in my logs, running 0.8.4. This is a RF=3, 3-node

Re: messages stopped for 3 minutes?

2011-09-25 Thread Jonathan Ellis
What makes you think the problem is on the receiving node, rather than the sending node? On Sun, Sep 25, 2011 at 1:19 AM, Yang tedd...@gmail.com wrote: I constantly see TimedOutException , then followed by UnavailableException in my logs, so I added some extra debugging to Gossiper.

Re: frequent node UP/Down?

2011-09-25 Thread Radim Kolar
On 25.9.2011 14:31, Radim Kolar wrote: On 25.9.2011 9:29, Philippe wrote: I have this happening on 0.8.x It looks to me as this happens when the node is under heavy load such as unthrottled compactions or a huge GC. I have this problem too. Node down detection must be improved -

Re: messages stopped for 3 minutes?

2011-09-25 Thread Yang
Thanks Jonathan, I really don't know. I just did further tests to catch the jstack on the receiving side over the last night, and am going through these stacks now. If I can't find anything suspicious, I'll add this debugging to the sending side too. Another useful piece of info: when I did a

Re: frequent node UP/Down?

2011-09-25 Thread Brandon Williams
On Sat, Sep 24, 2011 at 4:54 PM, Yang tedd...@gmail.com wrote: I'm using 1.0.0 there seems to be too many node Up/Dead events detected by the failure detector. I'm using  a 2 node cluster on EC2, in the same region, same security group, so I assume the message drop rate should be

Re: CMS GC initial-mark taking 6 seconds , bad?

2011-09-25 Thread Yang
Thanks Peter and Aaron. right now I have too much logging so the CMS logging is flushed (somehow it does not appear in the system.log, only on stdout ), I'll keep an eye on the correlation with ParNew as I get more logging Yang On Sun, Sep 25, 2011 at 3:59 AM, Peter Schuller

Re: frequent node UP/Down?

2011-09-25 Thread Yang
Thanks Brandon. I suspected that, but I think that's precluded as a possibility since I set up another background job to do echo | nc other_box 7000 in a loop, and this job seems to be working fine all the time, so the network seems fine. Yang On Sun, Sep 25, 2011 at 10:39 AM, Brandon Williams
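The background job described above (echo | nc other_box 7000 in a loop) can be approximated as follows. This is only a sketch: other_box and port 7000 come from the message, while the function name, timeout and polling interval are assumptions. It checks that a TCP connection can be opened, which says nothing about whether messages on an established connection are being delayed or dropped.

```python
import socket
import time


def port_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened in time."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def watch(host, port, interval=1.0, attempts=5):
    """Probe host:port repeatedly and print a timestamped result each time."""
    for _ in range(attempts):
        status = "ok" if port_reachable(host, port) else "FAILED"
        print(time.strftime("%H:%M:%S"), host, port, status)
        time.sleep(interval)


# Example (hypothetical host name taken from the message):
# watch("other_box", 7000)
```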

Re: frequent node UP/Down?

2011-09-25 Thread Brandon Williams
On Sun, Sep 25, 2011 at 12:52 PM, Yang tedd...@gmail.com wrote: Thanks Brandon. I suspected that, but I think that's precluded as a possibility since I setup another background job to do echo | nc other_box 7000 in a loop, this job seems to be working fine all the time, so network seems

Re: frequent node UP/Down?

2011-09-25 Thread Yang
Thanks Brandon. I'll try this. but you can also see my later post regarding message drop : http://mail-archives.apache.org/mod_mbox/cassandra-user/201109.mbox/%3ccaanh3_8aehidyh9ybt82_emh3likbcdsenrak3jhfzaj2l+...@mail.gmail.com%3E that seems to show something in either code or background load

Re: frequent node UP/Down?

2011-09-25 Thread Brandon Williams
On Sun, Sep 25, 2011 at 1:10 PM, Yang tedd...@gmail.com wrote: Thanks Brandon. I'll try this. but you can also see my later post regarding message drop :

Re: adding node without bootstrap

2011-09-25 Thread aaron morton
That message will be logged if the RF on the keyspace is 1 or if the other nodes are not up. What's the RF? You should also sort out the tokens before going too far. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On

Re: Seed vs non-seed in YAML

2011-09-25 Thread aaron morton
Seeds will not auto-bootstrap themselves when you add them to the cluster. Normal approach is to have 2 or 3 per DC. You may also be interested in how Gossip uses the seed list http://wiki.apache.org/cassandra/ArchitectureGossip cheers - Aaron Morton Freelance Cassandra

Re: progress of sstableloader keeps 0?

2011-09-25 Thread aaron morton
If you had RF3 in a 3 node cluster and everything was repaired you *should* be ok to only take the data from 1 node, if the cluster is not receiving writes. If you want to merge the data from 3 nodes, rename the files; AFAIK they do not have to have contiguous file numbers. Cheers

Re: adding node without bootstrap

2011-09-25 Thread Radim Kolar
On 25.9.2011 22:40, aaron morton wrote: That message will be logged if there RF on the keyspace is 1 or if the other nodes are not up. What's the RF ? RF is 1.

Re: adding node without bootstrap

2011-09-25 Thread aaron morton
Then there is nothing to repair. Set a better token, use cassandra-cli to increase the RF to 2 and then kick off repair. A - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 26/09/2011, at 10:12 AM, Radim Kolar wrote: On 25.9.2011

Surgecon Meetup?

2011-09-25 Thread Chris Burroughs
Surge [1] is a scalability-focused conference in late September hosted in Baltimore. It's a pretty cool conference with a good mix of operationally minded people interested in scalability, distributed systems, systems level performance and good stuff like that. You should go! [2] Anyway, I'll be

Re: Possibility of going OOM using get_count

2011-09-25 Thread Boris Yen
Hi Aaron, Thanks for the explanation. I know the performance will vary when the offset is a very large number, just like what has been mentioned on CASSANDRA-261. Even if the users implement the offset on the client side, they suffer the same issues, I just think it would be nice if

Re: progress of sstableloader keeps 0?

2011-09-25 Thread Yan Chunlu
Thank you very much Aaron. Your explanation is clear enough and very helpful! On Mon, Sep 26, 2011 at 4:58 AM, aaron morton aa...@thelastpickle.com wrote: If you had RF3 in a 3 node cluster and everything was repaired you *should* be ok to only take the data from 1 node, if the cluster is not