Re: auto_bootstrap for seed node

2018-03-27 Thread Peng Xiao
We followed this https://docs.datastax.com/en/cassandra/2.1/cassandra/operations/ops_add_dc_to_cluster_t.html, but it does not mention changing auto_bootstrap for seed nodes after the rebuild. Thanks, Peng Xiao -- Original -- From: "Ali Hubail";

auto_bootstrap for seed node

2018-03-27 Thread Peng Xiao
Dear All, When adding a new DC, we need to set auto_bootstrap: false and then run the rebuild; finally we need to change it to auto_bootstrap: true. But for seed nodes, it seems that we still need to keep auto_bootstrap false? Could anyone please confirm? Thanks, Peng Xiao

Re: disable compaction in bootstrap process

2018-03-23 Thread Peng Xiao
Many thanks Alain for the thorough explanation; we will not disable compaction for now. Thanks, Peng Xiao -- Original -- From: "arodrime"; Date: Mar 23, 2018 8:57 To: "Peng Xiao"<2535...@qq.com>; Cc: "user&

Re: replace dead node vs remove node

2018-03-22 Thread Peng Xiao
painful process. Thanks, Peng Xiao -- Original -- From: "Anthony Grasso"; Date: Mar 22, 2018 7:13 To: "user"; Subject: Re: replace dead node vs remove node Hi Peng, Depending on the hardware failure you can do one of

Re: disable compaction in bootstrap process

2018-03-22 Thread Peng Xiao
54 To: "user"; Subject: Re: disable compaction in bootstrap process Thanks Alain. We are using C* 2.1.18 on 7 cores/30G/1.5T SSD. As the cluster is growing too fast, bootstrap/rebuild/remove node are painful for us. Thanks, Peng Xiao -- -

Re: disable compaction in bootstrap process

2018-03-22 Thread Peng Xiao
Thanks Alain. We are using C* 2.1.18 on 7 cores/30G/1.5T SSD. As the cluster is growing too fast, bootstrap/rebuild/remove node are painful for us. Thanks, Peng Xiao -- Original -- From: "Alain RODRIGUEZ"; Date: Mar 22, 2018 7:31

replace dead node vs remove node

2018-03-22 Thread Peng Xiao
-03-12/replace-a-dead-node-in-cassandra.html, we can replace this dead node. Is it the same as bootstrapping a new node? Does that mean we don't need to remove the node and rejoin? Could anyone please advise? Thanks, Peng Xiao

disable compaction in bootstrap process

2018-03-22 Thread Peng Xiao
Dear All, We noticed that when bootstrapping a new node, the source nodes are also quite busy doing compactions, which impacts the response time (rt) severely. Is it reasonable to disable compaction on all the source nodes? Thanks, Peng Xiao

Re: does copy command will clear all the old data?

2018-02-12 Thread Peng Xiao
eb 13, 2018 at 12:49 PM, Peng Xiao <2535...@qq.com> wrote: Dear All, I'm trying to import a CSV file into a table with the COPY command. The question is: will the COPY command clear all the old data in this table? We only want to append the CSV file to this table. Thanks

does copy command will clear all the old data?

2018-02-12 Thread Peng Xiao
Dear All, I'm trying to import a CSV file into a table with the COPY command. The question is: will the COPY command clear all the old data in this table? We only want to append the CSV file to this table. Thanks

C* keyspace layout

2018-01-11 Thread Peng Xiao
Hi there, We plan to put keyspace1 in DC1 and DC2, and keyspace2 in DC3 and DC4, all still in the same cluster, to avoid interruption. Is there any potential risk with this architecture? Thanks, Peng Xiao

Re: secondary index creation causes C* oom

2018-01-10 Thread Peng Xiao
Thanks Kurt. -- Original -- From: "kurt";; Date: Jan 11, 2018 11:46 To: "User"; Subject: Re: secondary index creation causes C* oom 1.not sure if secondary index creation is the same as index rebuild Fairly sure they are the same. 2.we

secondary index creation causes C* oom

2018-01-09 Thread Peng Xiao
Dear All, Some of our C* nodes hit OOM during secondary index creation with C* 2.1.18. As per https://issues.apache.org/jira/browse/CASSANDRA-12796, the flush writer will be blocked by index rebuild, but we still have some questions: 1. Not sure if secondary index creation is the same as index rebuild

how to check C* partition size

2018-01-07 Thread Peng Xiao
Hi guys, Could anyone please help with this simple question? How can we check C* partition size and related information? It looks like nodetool ring only shows the token distribution. Thanks

Gossip stage pending tasks cause application rt jump

2017-12-23 Thread Peng Xiao
? Thanks, Peng Xiao

Re: run cleanup and rebuild simultaneously

2017-12-22 Thread Peng Xiao
be held by the rebuild streams, causing you to temporarily increase disk usage until rebuild finishes streaming -- Jeff Jirsa On Dec 22, 2017, at 4:30 PM, Peng Xiao <2535...@qq.com> wrote: Hi there, Can we run nodetool cleanup in DC1 and run rebuild in DC2 against DC1 simultaneously

run cleanup and rebuild simultaneously

2017-12-22 Thread Peng Xiao
Hi there, Can we run nodetool cleanup in DC1 and run rebuild in DC2 against DC1 simultaneously in C* 2.1.18? Thanks, Peng Xiao

Re: Re: rebuild in the new DC always failed

2017-12-16 Thread Peng Xiao
hat it was ever invoked? Did it calculate a streaming plan? Did it start sending files? -- Jeff Jirsa On Dec 16, 2017, at 4:56 PM, Peng Xiao <2535...@qq.com> wrote: Hi Jeff, This is the only information we found from system.log <2e06f...@9f7caf6b.d1c

Re: rebuild in the new DC always failed

2017-12-16 Thread Peng Xiao
ack beneath the null pointer exception? -- Jeff Jirsa > On Dec 16, 2017, at 4:11 PM, Peng Xiao <2535...@qq.com> wrote: > > Hi there, > > We need to rebuild a new DC, but the stream always fails with the > following errors. > We are using C* 2.1.18. Could anyone

rebuild in the new DC always failed

2017-12-16 Thread Peng Xiao
Hi there, We need to rebuild a new DC, but the stream always fails with the following errors. We are using C* 2.1.18. Could anyone please advise? error: null -- StackTrace -- java.io.EOFException at java.io.DataInputStream.readByte(DataInputStream.java:267) at sun.rmi.transp

nodetool rebuild data size

2017-12-13 Thread Peng Xiao
Hi there, if we have a Cassandra DC1 with data size 60T and RF=3, and we then rebuild a new DC2 (RF=3), how much data will stream to DC2? 20T or 60T? Thanks, Peng Xiao
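One way to reason about the two candidate answers is the back-of-the-envelope arithmetic below. It is only a sketch under stated assumptions: 60T is taken to be the load summed over all DC1 nodes (so it already includes the three replicas), compression and compaction differences are ignored, and the function names are made up for this illustration.

```python
def unique_data_tb(total_load_tb: float, rf: int) -> float:
    """Size of one copy of the data, given the load summed over all replicas."""
    return total_load_tb / rf


def total_load_tb(unique_tb: float, rf: int) -> float:
    """Load summed over all nodes of a DC that keeps rf copies of the data."""
    return unique_tb * rf


# Figures from the question; reading 60T as "all replicas included" is an assumption.
dc1_load, dc1_rf, dc2_rf = 60.0, 3, 3
one_copy = unique_data_tb(dc1_load, dc1_rf)
dc2_load = total_load_tb(one_copy, dc2_rf)
print(f"one copy: {one_copy:.0f}T, DC2 total after rebuild: {dc2_load:.0f}T")
```

Under these assumptions a single copy is about 20T, and DC2 ends up holding roughly 60T because each of its three replicas needs its own copy.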

rt jump during new node bootstrap

2017-12-11 Thread Peng Xiao
Dear All, We are using C* 2.1.18. When we bootstrap a new node, the rt jumps when the new node starts up, then it goes back to normal. Could anyone please advise? Thanks, Peng Xiao

Re: Re: rebuild stream issue

2017-12-10 Thread Peng Xiao
l compact away when it's done You can also use subrange repair instead of rebuild if you're short on disk space -- Jeff Jirsa On Dec 10, 2017, at 9:14 PM, Peng Xiao <2535...@qq.com> wrote: Then, how can we restore the rebuild? We are using C* 3.11.0. Can we just delete the data f

Re: rebuild stream issue

2017-12-10 Thread Peng Xiao
ser"; Subject: Re: rebuild stream issue The streams fail, the rebuild times out if you've set a timeout. Or you'll need to restart the nodes if you didn't set a streaming timeout. -- Jeff Jirsa > On Dec 10, 2017, at 9:05 PM, Peng Xiao <2535...@qq.com> wrote: > > Dear All, >

rebuild stream issue

2017-12-10 Thread Peng Xiao
Dear All, We are rebuilding a new DC. If one of the source nodes is restarted, what will happen to the rebuild? Thanks, Peng Xiao

Re: run Cassandra on physical machine

2017-12-07 Thread Peng Xiao
you probably need to wait for this storage port patch. https://issues.apache.org/jira/browse/CASSANDRA-7544 . On Thu, Dec 7, 2017 at 7:01 AM, Oliver Ruebenacker wrote: Hello, Yes, you can. Best, Oliver On Thu, Dec 7, 2017 at 7:12 AM, Peng Xiao <2535...@qq.com> wrote: Dear

run Cassandra on physical machine

2017-12-07 Thread Peng Xiao
, Peng Xiao

Re: update a record which does not exists

2017-12-03 Thread Peng Xiao
actually does not exist in Cassandra, will it generate a new record or exit? UPDATE columnfamily SET data = 'test data' WHERE key = 'row1'; In CQL, UPDATE and INSERT are semantically the same. Could anyone please advise? Thanks, Peng Xiao

update a record which does not exists

2017-12-03 Thread Peng Xiao
Dear All, If we update a record which actually does not exist in Cassandra, will it generate a new record or exit? UPDATE columnfamily SET data = 'test data' WHERE key = 'row1'; In CQL, UPDATE and INSERT are semantically the same. Could anyone please advise? Thanks, Peng Xiao

consistency against rebuild a new DC

2017-11-27 Thread Peng Xiao
Hi there, We know that we need to run repair regularly to keep data consistent. Suppose we have DC1 & DC2; if we add a new DC3 and rebuild from DC1, can we assume DC3 is consistent with DC1, at least at the time when DC3 is rebuilt successfully? Thanks, Peng Xiao

Re: best practice for repair

2017-11-13 Thread Peng Xiao
on, Nov 13, 2017 06:51 PM To: "user"; Subject: best practice for repair Hi there, we need to repair a huge CF and just want to clarify: 1. nodetool repair -pr keyspace cf 2. nodetool repair -st -et -dc Which will be better? Or any other advice? Thanks, Peng Xiao

best practice for repair

2017-11-13 Thread Peng Xiao
Hi there, we need to repair a huge CF and just want to clarify: 1. nodetool repair -pr keyspace cf 2. nodetool repair -st -et -dc Which will be better? Or any other advice? Thanks, Peng Xiao
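For the second option, the -st/-et pair has to be generated for many small token ranges. The Python sketch below shows one way to do that, under these assumptions: the start and end tokens describe a single, non-wrapping range already owned by the node (for example, one taken from nodetool ring output), the token values in the example are invented, and "keyspace" and "cf" are the placeholders from the question. The helper names are hypothetical.

```python
def split_range(start: int, end: int, parts: int) -> list[tuple[int, int]]:
    """Split the token range (start, end] into `parts` contiguous subranges."""
    if parts < 1 or end <= start:
        raise ValueError("need parts >= 1 and a non-wrapping range")
    bounds = [start + (end - start) * i // parts for i in range(parts + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def repair_commands(keyspace: str, table: str,
                    start: int, end: int, parts: int) -> list[str]:
    """Build one `nodetool repair -st ... -et ...` command per subrange."""
    return [
        f"nodetool repair -st {st} -et {et} {keyspace} {table}"
        for st, et in split_range(start, end, parts)
    ]


# Hypothetical token range owned by one node, split into 4 repair sessions.
for cmd in repair_commands("keyspace", "cf", -100, 100, 4):
    print(cmd)
```

Smaller subranges keep each repair session short, at the cost of running more sessions; the right split count depends on table size and is not something this sketch can decide.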

cassandra gc issue

2017-11-02 Thread Peng Xiao
1048576K Could anyone please advise? Thanks, Peng Xiao

Re: decommissioned node still in gossip

2017-11-01 Thread Peng Xiao
it's not showing up in the cluster it should be OK. On 1 Nov. 2017 20:25, "Peng Xiao" <2535...@qq.com> wrote: Dear All, We have decommissioned a DC, but according to system.log, it's still gossiping INFO [GossipStage:1] 2017-11-01 17:21:36,310 Gossiper.java:1008 - InetAddress

decommissioned node still in gossip

2017-11-01 Thread Peng Xiao
Dear All, We have decommissioned a DC, but according to system.log, it's still gossiping INFO [GossipStage:1] 2017-11-01 17:21:36,310 Gossiper.java:1008 - InetAddress /x.x.x.x is now DOWN Could you please advise? Thanks, Peng Xiao

Re: Tuning bootstrap new node

2017-10-31 Thread Peng Xiao
se stream throughput from default 200 to a higher value if your network can handle it. Sent from my iPhone On Oct 31, 2017, at 6:35 AM, Peng Xiao <2535...@qq.com> wrote: Can we stop the compaction during the new node bootstrapping and enable it after the new node joined? Than

Re: Tuning bootstrap new node

2017-10-31 Thread Peng Xiao
se advise how to tune this ? Many Thanks, Peng Xiao

Tuning bootstrap new node

2017-10-31 Thread Peng Xiao
how to tune this ? Many Thanks, Peng Xiao

how to identify the root cause of cassandra hang

2017-10-26 Thread Peng Xiao
Hi, We have a cluster with 48 nodes configured with racks. Sometimes it hangs for as long as 2 minutes, and the response time jumps from 300ms to 15s. Could anyone please advise how to identify the root cause? The following is from the system log INFO [Service Thread] 2017-10-26 21:45:46,796 GCInspec

can repair and bootstrap run simultaneously

2017-10-24 Thread Peng Xiao
Hi there, Can we add a new node (bootstrap) and run repair on another DC in the cluster or even run repair in the same DC? Thanks, Peng Xiao

Re: split one DC from a cluster

2017-10-20 Thread Peng Xiao
Thanks Kurt, we may still use snapshot and sstableloader to split this schema to another cluster. -- Original -- From: "kurt";; Date: Thu, Oct 19, 2017 06:11 PM To: "User"; Subject: Re: split one DC from a cluster Easiest way is to separate them via firewall/network

split one DC from a cluster

2017-10-19 Thread Peng Xiao
Hi, We want to split one DC from a cluster and make this DC a new cluster (rename this DC to a new cluster name). Could you please advise? Thanks, Peng Xiao

split one keyspace from one cluster to another

2017-10-16 Thread Peng Xiao
Dear All, We'd like to migrate one keyspace from one cluster to another; the keyspace is about 100G. If we use sstableloader, we have to stop the application during the migration. Any good ideas? Thanks, Peng Xiao

space left for compaction

2017-09-30 Thread Peng Xiao
Dear All, For STCS, Datastax suggests keeping half of the disk space free for compaction. This is not strict; could anyone advise how much space we should leave free on one node? Thanks, Peng Xiao

Re: limit the sstable file size

2017-09-29 Thread Peng Xiao
it will default to 160MB (except in cases where you have a very large partition - in those cases, the sstable will scale with your partition size, but you really shouldn't have partitions larger than 160MB) On Fri, Sep 29, 2017 at 8:41 PM, Peng Xiao <2535...@qq.com> wrote: Dear All,

limit the sstable file size

2017-09-29 Thread Peng Xiao
Dear All, Can we limit the sstable file size? As we have a huge cluster, the sstable files are too large for ETL to extract. Could you please advise? Thanks, Peng Xiao

cassandra hardware requirements (SATA/SSD)

2017-09-28 Thread Peng Xiao
Hi there, we are struggling with hardware selection. We all know that SSD is good, and Datastax suggests using SSD, but as Cassandra is a CPU-bound DB, we are considering SATA disks. We noticed that the normal IO throughput is 7MB/s. Could anyone give some advice? Thanks, Peng Xiao

Re: data loss in different DC

2017-09-28 Thread Peng Xiao
s those will be. For strong consistency with multiple DCs, You can either: - write at quorum and read at quorum from any dc, or - write each_quorum and read local_quorum from any dc, or - write at local_quorum and read local_quorum from the same DC only -- Jeff Jirsa > On Sep 28, 2017
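The quorum arithmetic behind these three options can be sketched in Python. This is a minimal illustration, not Cassandra's implementation: the function names are made up for the example, and the replication factors (DC1 RF=3, DC2 RF=1) are the ones given in the question in this thread. A read is guaranteed to see a write when the replicas that acknowledged the write and the replicas consulted by the read must overlap, i.e. W + R > N.

```python
def quorum(replicas: int) -> int:
    """Replicas that must respond for a quorum: floor(n / 2) + 1."""
    return replicas // 2 + 1


def overlaps(write_acks: int, read_acks: int, total_replicas: int) -> bool:
    """True if every read set must intersect every write set (W + R > N)."""
    return write_acks + read_acks > total_replicas


# Replication factor per datacenter, as described in the question.
rf = {"DC1": 3, "DC2": 1}
total = sum(rf.values())

# Option 1: QUORUM write + QUORUM read, counted across all DCs.
quorum_everywhere = overlaps(quorum(total), quorum(total), total)

# Option 2: EACH_QUORUM write + LOCAL_QUORUM read, checked per DC.
each_then_local = all(overlaps(quorum(n), quorum(n), n) for n in rf.values())

# Option 3: LOCAL_QUORUM write and LOCAL_QUORUM read in the same DC (DC1).
same_dc_local = overlaps(quorum(rf["DC1"]), quorum(rf["DC1"]), rf["DC1"])

# Counter-example: LOCAL_QUORUM write in DC1, LOCAL_QUORUM read in DC2.
# No DC2 replica is guaranteed to have acknowledged the write yet.
cross_dc_local = overlaps(0, quorum(rf["DC2"]), rf["DC2"])

print(quorum_everywhere, each_then_local, same_dc_local, cross_dc_local)
```

The first three checks come out true and the last one false, which matches the situation described in the thread: a record written with a LOCAL_* level in DC1 may not yet be readable in DC2.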

Re: data loss in different DC

2017-09-28 Thread Peng Xiao
loss in different DC If you're writing into DC1 with CL = LOCAL_xxx, there is no guarantee to be sure to read the same data in DC2. Only repair will help you On Thu, Sep 28, 2017 at 11:41 AM, Peng Xiao <2535...@qq.com> wrote: Dear All, We have a cluster with one DC1:RF=3,another

Re: data loss in different DC

2017-09-28 Thread Peng Xiao
DC2:RF=1, DC2 only for ETL, but we found that sometimes we can query records in DC1 while not being able to find the same records in DC2 with local_quorum. How does this happen? It looks like data loss in DC2. Could anyone please advise? It looks like we can only run repair to fix it. Thanks, Peng Xiao

data loss in different DC

2017-09-28 Thread Peng Xiao
repair to fix it. Thanks, Peng Xiao

data loss in different DC

2017-09-28 Thread Peng Xiao
, Peng Xiao

Re: nodetool cleanup in parallel

2017-09-26 Thread Peng Xiao
. On 27 Sep. 2017 13:20, "Peng Xiao" <2535...@qq.com> wrote: hi, nodetool cleanup will only remove those keys which no longer belong to those nodes, so theoretically we can run nodetool cleanup in parallel, right? The documentation suggests running it one node at a time, but that's too slow. Thanks, Peng Xiao

nodetool cleanup in parallel

2017-09-26 Thread Peng Xiao
hi, nodetool cleanup will only remove those keys which no longer belong to those nodes, so theoretically we can run nodetool cleanup in parallel, right? The documentation suggests running it one node at a time, but that's too slow. Thanks, Peng Xiao

add new nodes in two DCs at the same time

2017-09-22 Thread Peng Xiao
Hi, as Datastax suggests, we should only bootstrap one new node at a time. But can we add new nodes in two DCs at the same time? Thanks, Peng Xiao

Pending-range-calculator during bootstrapping

2017-09-22 Thread Peng Xiao
this? Thanks, Peng Xiao

Pending-range-calculator during bootstrapping

2017-09-21 Thread Peng Xiao
Dear All, when we bootstrap a new node, we experience high CPU load, which affects the rt. We noticed that most of the cost is in Pending-range-calculator; this did not happen before. We are using C* 2.1.13. Could anyone please advise on this? Thanks, Peng Xiao

Re: RE: network down between DCs

2017-09-21 Thread Peng Xiao
in the Cassandra log. Hint storage has been re-written in 3.0+ to flat files, thus tombstone related troubles in that area should be gone. Thomas From: Hannu Kröger [mailto:hkro...@gmail.com] Sent: Thursday, 21 September 2017 10:32 To: Peng Xiao <2535...@qq.com>;

network down between DCs

2017-09-21 Thread Peng Xiao
Hi there, We have two DCs in a Cassandra cluster. If the network is down for less than 3 hours (the default hint window), my understanding is that it will recover automatically, right? Do we need to run repair manually? Thanks, Peng Xiao

Re: RE: Row Cache hit issue

2017-09-19 Thread Peng Xiao
ache will be empty after restart. --Dikang. On Tue, Sep 19, 2017 at 8:27 PM, Peng Xiao <2535...@qq.com> wrote: And we are using C* 2.1.18. -- Original -- From: "";<2535...@qq.com>;

Re: Row Cache hit issue

2017-09-19 Thread Peng Xiao
ot work in this situation? but we can still see the row cache hit. Row Cache : entries 202787, size 100 MB, capacity 100 MB, 3095293 hits, 6796801 requests, 0.455 recent hit rate, 0 save period in seconds Could anyone please explain this? Thanks, Peng Xiao
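As a quick sanity check on the figures quoted above, the cumulative hit ratio can be recomputed from the hits and requests counters. This is only arithmetic on the numbers in the message; whether the "recent hit rate" reported by nodetool is computed over the same window is not something this sketch assumes.

```python
# Counters copied from the Row Cache line quoted in the message.
hits, requests = 3_095_293, 6_796_801

# Cumulative ratio of cache hits to cache requests.
hit_rate = hits / requests
print(f"{hit_rate:.3f}")
```

The ratio rounds to 0.455, in line with the reported rate, so the counters are at least internally consistent.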

Row Cache hit issue

2017-09-19 Thread Peng Xiao
Could anyone please explain this? Thanks, Peng Xiao

optimal value for native_transport_max_threads

2017-08-08 Thread Peng Xiao
Dear All, any suggestion for the optimal value of native_transport_max_threads? As per https://issues.apache.org/jira/browse/CASSANDRA-11363, max_queued_native_transport_requests=4096; how about native_transport_max_threads? Thanks, Peng Xiao

Re: MUTATION messages were dropped in last 5000 ms for cross nodetimeout

2017-08-04 Thread Peng Xiao
Hi, Does message drop mean data loss? Thanks -- Original -- From: Akhil Mehra Date: Aug 4, 2017 16:00 To: user Subject: Re: MUTATION messages were dropped in last 5000 ms for cross nodetimeout Glad I could be of help :) Hopefully the partition size

Re: Data Loss irreparabley so

2017-08-02 Thread Peng Xiao
Because of tombstones, we have set GC_GRACE_SECONDS to 6 hours. And for a huge table of 4T, repair is hard for us. -- Original -- From: "kurt";; Date: Thu, Aug 3, 2017 12:08 PM To: "User"; Subject: Re: Data Loss irreparabley so You should run repairs every G

Re: Data Loss irreparabley so

2017-08-02 Thread Peng Xiao
Hi, We are also experiencing the same issue. We have 3 DCs (DC1 RF=3, DC2 RF=3, DC3 RF=1). If we use local_quorum, we should not lose any data, right? If we use local_one, might we lose data? Then do we need to run repair regularly? Could anyone advise? Thanks -- Original --

Cassandra data loss in some DC

2017-08-02 Thread Peng Xiao
Hi there, We have a three-DC cluster (two DCs with RF=3, one remote DC with RF=1). We currently find that in DC1/DC2 select count(*) from t returns 1250, while in DC3 select count(*) from t returns 750. It looks like some data is missing in DC3 (the remote DC). There are no nodes down or anything exceptional. We only upgrade

Re: Re: Re: tolerate how many nodes down in the cluster

2017-07-27 Thread Peng Xiao
Thanks all for your thorough explanation. -- Original -- From: "Anuj Wadehra";; Date: Jul 28, 2017 0:49 To: "User cassandra.apache.org"; "Peng Xiao"<2535...@qq.com>; Subject: Re: Re: Re: tolerat

Re: Re: tolerate how many nodes down in the cluster

2017-07-26 Thread Peng Xiao
-- Original -- From: "Anuj Wadehra";; Date: Jul 27, 2017 1:41 To: "Brooke Thorley"; "user@cassandra.apache.org"; Cc: "Peng Xiao"<2535...@qq.com>; Subject: Re: Re: tolerate how many nodes d

Re: Re: tolerate how many nodes down in the cluster

2017-07-26 Thread Peng Xiao
As Brooke suggests, the number of racks should be a multiple of RF. https://www.youtube.com/watch?v=QrP7G1eeQTI If we have 6 machines with RF=3, we can set up either 6 racks or 3 racks; which will be better? Could you please advise further? Many thanks -- Original -- From: "My own mailbox";<2

Re: Re: tolerate how many nodes down in the cluster

2017-07-26 Thread Peng Xiao
One more question: why should the number of racks be equal to RF? For example, we have 4 machines, each virtualized into 8 VMs. Can we set 4 racks with RF=3? I mean one machine per rack. Thanks -- Original -- From: "My own mailbox";<2535...@qq.com>; Date: Wed, Jul 26, 2017 10:32 AM To: "

Re: Timeout while setting keyspace

2017-07-26 Thread Peng Xiao
https://datastax-oss.atlassian.net/browse/JAVA-1002 This one says it's a driver issue; we will give it a try. -- Original -- From: "";<2535...@qq.com>; Date: Wed, Jul 26, 2017 04:12 PM To: "user"; Subject: Timeout while setting keyspace Dear A

Timeout while setting keyspace

2017-07-26 Thread Peng Xiao
Dear All, We are experiencing a strange issue. Currently we have a cluster with Cassandra 2.1.13. When the applications start, they print the following warnings, and it takes a long time for the applications to start. Could you please advise? 2017-07-26 15:49:20.676 WARN 11706 --- [-] [cluster1-n

Re: Re: tolerate how many nodes down in the cluster

2017-07-25 Thread Peng Xiao
Thanks for the reminder; we will set up a new DC as suggested. -- Original -- From: "kurt greaves";; Date: Wed, Jul 26, 2017 10:30 AM To: "User"; Cc: "anujw_2...@yahoo.co.in"; Subject: Re: Re: tolerate how many nodes down in the cluster Keep in mind that you shouldn't just

Re: Re: tolerate how many nodes down in the cluster

2017-07-25 Thread Peng Xiao
Thanks all for your replies. We will begin using racks in our C* cluster. Thanks. -- Original -- From: "kurt greaves";; Date: Jul 25, 2017 6:27 To: "User"; "anujw_2...@yahoo.co.in"; Cc: "Pe

Re: tolerate how many nodes down in the cluster

2017-07-24 Thread Peng Xiao
letting cassandra know of the same. Regards, Bhuvan On Mon, Jul 24, 2017 at 3:28 PM, Peng Xiao <2535...@qq.com> wrote: Hi, Suppose we have a 30-node cluster in one DC with RF=3. How many nodes can be down? Can we tolerate 10 nodes down? It seems that we are not able to avoid the data

tolerate how many nodes down in the cluster

2017-07-24 Thread Peng Xiao
Hi, Suppose we have a 30-node cluster in one DC with RF=3. How many nodes can be down? Can we tolerate 10 nodes down? It seems that we cannot prevent all 3 replicas of some data from landing within those 10 nodes. Then can we only tolerate 1 node down even though we have 30 nodes? Could anyone please advise
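The per-range arithmetic behind this question can be sketched as follows. It is a simplified model, not Cassandra code: it only counts how many replicas of a single token range may be unavailable at a given consistency level, using RF=3 from the question, and the function names are invented for the illustration.

```python
def quorum(rf: int) -> int:
    """Replicas that must respond for a quorum: floor(rf / 2) + 1."""
    return rf // 2 + 1


def replicas_that_may_be_down(rf: int, required_acks: int) -> int:
    """Replicas of one token range that can be down while requests still succeed."""
    return rf - required_acks


rf = 3  # replication factor from the question
tolerance = {
    level: replicas_that_may_be_down(rf, acks)
    for level, acks in {"ONE": 1, "QUORUM": quorum(rf), "ALL": rf}.items()
}
print(tolerance)
```

At QUORUM only one replica of any given range may be down. Without rack awareness, two failed nodes could hold replicas of the same range, so the cluster size of 30 does not raise the guaranteed tolerance; this is why the replies in the thread point toward racks.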

gossip down failure detected

2017-07-06 Thread Peng Xiao
Hi, We are experiencing the following issue: the rt sometimes jumps to 15s. After adjusting the batch size it looks better, but we still have the following issue. Could anyone advise? INFO [GossipTasks:1] 2017-07-07 08:56:33,410 Gossiper.java:1009 - InetAddress /172.16.xx.39 is now DOWN on

How do you monitoring Cassandra Cluster?

2017-06-28 Thread Peng Xiao
Dear All, we are currently using Cassandra 2.1.13, and it has grown to 5TB with 32 nodes in one DC. For monitoring, OpsCenter does not send alarms and is not free for higher versions, so we have to use a simple JMX+Zabbix template. We also plan to use Jolokia+JMX2Graphite to draw the metrics chart