Hello Bram

You're probably running into this:
https://issues.apache.org/jira/browse/CASSANDRA-4417

It's marked as Won't Fix because it's tied to the current counter
design. Fortunately, C* 2.1 will fix this.

Worth reading:
www.datastax.com/dev/blog/whats-new-in-cassandra-2-1-a-better-implementation-of-counters
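
For reference, the remove-and-repair sequence you describe below boils
down to roughly the following (a sketch only: the host ID and keyspace
name are placeholders, and the log path assumes a default package
install):

  # on a surviving node: note the dead node's Host ID, then remove it
  nodetool status
  nodetool removenode <host-id-of-dead-node>

  # once the replacement node is up with the updated cassandra.yaml
  nodetool status
  nodetool repair <keyspace>

  # watch stream progress and the counter-shard warnings
  nodetool netstats
  grep "invalid remote counter shard" /var/log/cassandra/system.log | tail

As the warning itself says, the affected shards pick the highest value
and self-heal on compaction, so they should taper off over time.
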
On 2 Sept. 2014 at 21:24, "Bram Avontuur" <b...@longtailvideo.com> wrote:

> Hi,
>
> Cassandra setup:
>
>  * 2 nodes on EC2, m1.large
>  * Cassandra version 2.0.10
>
> One node died over the weekend, and I couldn't revive it. I deleted it
> with nodetool removenode, and added a new node with a copy of the
> cassandra.yaml config with the IP addresses changed.
>
> Once reconfigured and started, nodetool status listed it as part of the
> 2-node cluster. I then ran nodetool repair on the new node to get it to
> take replication data from a keyspace with replication factor 2. The first
> 600-ish MB (of 14 GB) synced pretty fast, but then system.log started
> spewing "invalid remote counter shard detected" warnings at a rapid rate
> (too fast to follow with tail -f). Example log line:
>
>  WARN [CompactionExecutor:6] 2014-09-02 19:18:49,109 CounterContext.java
> (line 467) invalid remote counter shard detected;
> (03afc080-2f01-11e4-948b-15a04b0b4bd9, 1, 158) and
> (03afc080-2f01-11e4-948b-15a04b0b4bd9, 1, 79) differ only in count; will
> pick highest to self-heal on compaction
>
> Transfer speed from that point on was quite slow, a couple hundred MB per
> 10 minutes.
>
> After a while nodetool netstats stops listing transfers, and the warnings
> also calm down. There's still a handful of them per minute, even though the
> cluster is not being used.
>
> Any idea what could be going on here?
>
> Bram
>
