Any better solution to avoid TombstoneOverwhelmingException?

2014-06-30 Thread Jason Tang
Our application will use Cassandra to persistent for asynchronous tasks, so in one time period, lots of records will be created in Cassandra (more then 10M). Later it will be executed. Due to disk space limitation, the executed records will be deleted. After gc_grace_seconds, it is expected to be

Re: Any better solution to avoid TombstoneOverwhelmingException?

2014-06-30 Thread Jason Tang
) On Mon, Jun 30, 2014 at 8:43 AM, Jason Tang ares.t...@gmail.com wrote: Our application will use Cassandra to persistent for asynchronous tasks, so in one time period, lots of records will be created in Cassandra (more then 10M). Later it will be executed. Due to disk space limitation

Re: heap issues - looking for advices on gc tuning

2013-10-30 Thread Jason Tang
What's configuration of following parameters memtable_flush_queue_size: concurrent_compactors: 2013/10/30 Piavlo lolitus...@gmail.com Hi, Below I try to give a full picture to the problem I'm facing. This is a 12 node cluster, running on ec2 with m2.xlarge instances (17G ram , 2 cpus).

Re: Side effects of hinted handoff lead to consistency problem

2013-10-14 Thread Jason Tang
I change GC grace seconds to 10 days. our problem solved, but it is still a strange behavior when using index query. 2013/10/8 Jason Tang ares.t...@gmail.com I have a 3 nodes cluster, replicate_factor is 3 also. Consistency level is Write quorum, Read quorum. Traffic has three major steps

Re: Failed to solve Digest mismatch

2013-10-09 Thread Jason Tang
://issues.apache.org/jira/browse/CASSANDRA Thanks - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 2/07/2012, at 1:49 AM, Jason Tang wrote: For the create/update/deleteColumn/deleteRow test case, for Quorum consistency level, 6 nodes

Side effects of hinted handoff lead to consistency problem

2013-10-08 Thread Jason Tang
I have a 3 nodes cluster, replicate_factor is 3 also. Consistency level is Write quorum, Read quorum. Traffic has three major steps Create: Rowkey: Column: status=new, requests=x Update: Rowkey: Column: status=executing, requests=x Delete: Rowkey:

Why Cassandra so depend on client local timestamp?

2013-10-01 Thread Jason Tang
Following case may be logical correct for Cassandra, but difficult for user. Let's say: Cassandra consistency level: write all, read one replication_factor:3 For one record, rowkey:001, column:status Client 1, insert value for rowkey 001, status:True, timestamp 11:00:05 Client 2 Slice Query,

Gossiper in Cassandra using unicast/broadcast/multicast ?

2013-06-20 Thread Jason Tang
Hi We are considering using Cassandra in virtualization environment. I wonder is Cassandra using unicast/broadcast/multicast for node discover or communication? From the code, I find the broadcast address is used for heartbeat in Gossiper.java, but I don't know how actually it works when

Re: Consistent problem when solve Digest mismatch

2013-03-06 Thread Jason Tang
- Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/03/2013, at 4:30 PM, Jason Tang ares.t...@gmail.com wrote: Hi The timestamp provided by my client is unix timestamp (with ntp), and as I said, due to the ntp drift

Re: Consistent problem when solve Digest mismatch

2013-03-04 Thread Jason Tang
timestamp is just the default). The point being, unix timestamp is the better approximation we have in practice. -- Sylvain On Mon, Mar 4, 2013 at 9:26 AM, Jason Tang ares.t...@gmail.com wrote: Hi Previous I met a consistency problem, you can refer the link below for the whole story. http

Re: Cassandra Consistency problem with NTP

2013-01-16 Thread Jason Tang
PM, Jason Tang ares.t...@gmail.com wrote: Hi I am using Cassandra in a message bus solution, the major responsibility of cassandra is recording the incoming requests for later consumming. One strategy is First in First out (FIFO), so I need to get the stored request in reversed order. I

Re: Cassandra Consistency problem with NTP

2013-01-16 Thread Jason Tang
. In other words, Cassandra dependency on time synchronization is not a strong one even in that case. But again, that doesn't seem at all to be the problem you are trying to solve. -- Sylvain On Thu, Jan 17, 2013 at 2:56 AM, Jason Tang ares.t...@gmail.com wrote: Hi I am using Cassandra

Re: is it possible to disable compaction per CF ?

2012-07-27 Thread Jason Tang
setMaxCompactionThreshold(0) setMinCompactionThreshold(0) 2012/7/27 Илья Шипицин chipits...@gmail.com Hello! if we are dealing with append-only data model, so what if I disable compaction on certain CF ? any side effect ? can I do it with update column family with

Compaction not remove the deleted data from secondary index when use TTL

2012-07-19 Thread Jason Tang
Hi For some consistency problem, we can not use delete direct to delete one row, and then we use TTL for each column of the row. We using the Cassandra as the central storage of the stateful system. All request will be stored in Cassandra, and marked as status;NEW, and then we change

Re: Replication factor - Consistency Questions

2012-07-18 Thread Jason Tang
of QUORAM. ** ** *From:* Jason Tang [mailto:ares.t...@gmail.com] *Sent:* Tuesday, July 17, 2012 8:24 PM *To:* user@cassandra.apache.org *Subject:* Re: Replication factor - Consistency Questions ** ** Hi ** ** I am starting using Cassandra for not a long time, and also have

Re: Replication factor - Consistency Questions

2012-07-17 Thread Jason Tang
Hi I am starting using Cassandra for not a long time, and also have problems in consistency. Here is some thinking. If you have Write:Any / Read:One, it will have consistency problem, and if you want to repair, check your schema, and check the parameter Read repair chance:

Re: Cassandra take 100% CPU for 2~3 minutes every half an hour and mutation lost

2012-07-12 Thread Jason Tang
() @bci=28, line=908 (Compiled frame) - java.lang.Thread.run() @bci=11, line=662 (Interpreted frame) BRs //Jason 2012/7/11 Jason Tang ares.t...@gmail.com Hi I encounter the High CPU problem, Cassandra 1.0.3, happened on both sized and leveled compaction, 6G heap, 64bit Oracle java

Cassandra take 100% CPU for 2~3 minutes every half an hour and mutation lost

2012-07-10 Thread Jason Tang
Hi I encounter the High CPU problem, Cassandra 1.0.3, happened on both sized and leveled compaction, 6G heap, 64bit Oracle java. For normal traffic, Cassandra will use 15% CPU. But every half a hour, Cassandra will use almost 100% total cpu (SUSE, 12 Core). And here is the top

Re: Failed to solve Digest mismatch

2012-07-01 Thread Jason Tang
AM, Jason Tang ares.t...@gmail.com wrote: Hi First I delete one column, then I delete one row. Then try to read all columns from the same row, all operations from same client app. The consistency level is read/write quorum. Check the Cassandra log, the local node don't

Failed to solve Digest mismatch

2012-06-28 Thread Jason Tang
Hi First I delete one column, then I delete one row. Then try to read all columns from the same row, all operations from same client app. The consistency level is read/write quorum. Check the Cassandra log, the local node don't perform the delete operation but send the mutation to

Re: Consistency Problem with Quorum consistencyLevel configuration

2012-06-26 Thread Jason Tang
) (b20ac6ec0d29393d70e200027c094d13 vs d41d8cd98f00b204e9800998ecf8427e) 2012/6/25 Jason Tang ares.t...@gmail.com Hi I met the consistency problem when we have Quorum for both read and write. I use MultigetSubSliceQuery to query rows from super column limit size 100, and then read

Consistency Problem with Quorum consistencyLevel configuration

2012-06-24 Thread Jason Tang
Hi I met the consistency problem when we have Quorum for both read and write. I use MultigetSubSliceQuery to query rows from super column limit size 100, and then read it, then delete it. And start another around. But I found, the row which should be delete by last query, it still

Re: GCInspector works every 10 seconds!

2012-06-18 Thread Jason Tang
? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 18/06/2012, at 12:36 AM, Jason Tang wrote: Hi After I change log level to DEBUG, I found some log. Although we don't have traffic to Cassandra, but we have scheduled the task

GCInspector works every 10 seconds!

2012-06-17 Thread Jason Tang
Hi After running load testing for 24 hours(insert, update and delete), now no new traffic to Cassandra, but Cassnadra shows still have high load(CPU usage), from the system.log, it shows it always perform GC. I don't know why it work as that, seems memory is not low. Here is some

Re: GCInspector works every 10 seconds!

2012-06-17 Thread Jason Tang
: 7fff0137f6340e396cfdc9fa:true:4@133986545195 BRs //Ares 2012/6/17 Jason Tang ares.t...@gmail.com Hi After running load testing for 24 hours(insert, update and delete), now no new traffic to Cassandra, but Cassnadra shows still have high load(CPU usage), from the system.log

Re: Much more native memory used by Cassandra then the configured JVM heap size

2012-06-13 Thread Jason Tang
@aaronmorton http://www.thelastpickle.com On 12/06/2012, at 5:52 PM, Jason Tang wrote: Hi I found some information of this issue And seems we can have other strategy for data access to reduce mmap usage, in order to use less memory. But I didn't find the document to describe the parameters

Re: Much more native memory used by Cassandra then the configured JVM heap size

2012-06-11 Thread Jason Tang
spin up JConsole as it will give you much more detai kon what your VM is actually doing. On Mon, Jun 11, 2012 at 9:14 PM, Jason Tang ares.t...@gmail.com wrote: Hi We have some problem with Cassandra memory usage, we configure the JVM HEAP 6G, but after runing Cassandra for several hours

Re: Much more native memory used by Cassandra then the configured JVM heap size

2012-06-11 Thread Jason Tang
/gmane.comp.db.cassandra.user/7390 2012/6/12 Jason Tang ares.t...@gmail.com See my post, I limit the HVM heap 6G, but actually Cassandra will use more memory which is not calculated in JVM heap. I use top to monitor total memory used by Cassandra. = -Xms6G -Xmx6G -Xmn1600M 2012/6/12

TimedOutException caused by Stop the world activity

2012-05-27 Thread Jason Tang
Hi My system is 4 nodes 64 bit cassandra cluster, 6G big per node,default configuration (which means 1/3 heap for memtable), replicate number 3, write all, read one. When I run stress load testing, I got this TimedOutException, and some operation failed, and all traffic hang for a while. And

Re: Cassandra search performance

2012-05-12 Thread Jason Tang
I try to search one column, this column store the time as the type Long, 1,000,000 data equally distributed in 24 hours, I only want to search certain time rang, eg from 01:30 to 01:50 or 08:00 to 12:00, but something stranger happened. Search 00:00 to 23:59 limit 100 It took less then 1 second

Cassandra search performance

2012-04-25 Thread Jason Tang
Hi We have the such CF, and use secondary index to search for simple data status, and among 1,000,000 row records, we have 200 records with status we want. But when we start to search, the performance is very poor, and check with the command ./bin/nodetool -h localhost -p 8199 cfstats ,

Re: Cassandra search performance

2012-04-25 Thread Jason Tang
records, it only scan 200 records. So it impacted by scan execution plan, if we have several search conditions, how it works? Do we have the similar execution plan in Cassandra? 在 2012年4月25日 下午9:18,Jason Tang ares.t...@gmail.com写道: Hi We have the such CF, and use secondary index to search

Re: Cassandra search performance

2012-04-25 Thread Jason Tang
that helps. 2012/4/25 Jason Tang ares.t...@gmail.com And I found, if I only have the search condition status, it only scan 200 records. But if I combine another condition partition then it scan all records because partition condition match all records. But combine with other condition

Consistence for node shutdown and startup

2011-12-11 Thread Jason Tang
Hi Here is the case, if we have only two nodes, which share the data (write one, read one), node One node Two | Stopped Continue working and update the data. | stopped stopped | start working