Re: strange gossip messages after node reboot with different ip

2012-05-01 Thread Piavlo
Hi Aaron, Below is the reposted gossipinfo on a fresh 6 node cluster for which I stop/started all nodes one by one ~12hours ago, As you can see gossipinfo reports on 11 nodes, but what bothers me is why it reports STATUS:NORMAL for all of them and decides that non existing node is UP just o

RE: Taking a Cluster Wide Snapshot

2012-05-01 Thread Shubham Srivastava
On another thought I am writing a code/script for taking a backup of all the nodes in a single DC , renaming data files with some uid and then merging them . The storage however would happen on some storage medium nas for ex which would be in the same DC. This would help in data copying a non

Re: Question regarding major compaction.

2012-05-01 Thread Fredrik Stigbäck
Thank you Aaron. That explanation cleared things up. 2012/4/30 aaron morton aa...@thelastpickle.com: Depends on your definition of significantly, there are a few things to consider. * Reading from SSTables for a request is a serial operation. Reading from 2 SSTables will take twice as long

Re: AssertionError: originally calculated column size ...

2012-05-01 Thread Patrik Modesto
https://issues.apache.org/jira/browse/CASSANDRA-4206 Regards, Patrik On Tue, May 1, 2012 at 03:46, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Apr 30, 2012 at 2:11 PM, Patrik Modesto patrik.mode...@gmail.com wrote: I think the problem is somehow connected to an IntegerType secondary

Re: Question regarding major compaction.

2012-05-01 Thread Henrik Schröder
But what's the difference between doing an extra read from that One Big File, than doing an extra read from whatever SSTable happen to be largest in the course of automatic minor compaction? We have a pretty update-heavy application, and doing a major compaction can remove up to 30% of the used

Re: Question regarding major compaction.

2012-05-01 Thread Edward Capriolo
Henrik, There are use cases where major compaction works well like yours and mine. Essentially cases with a high amount of churn, updates and deletes we get a lot of benefit from forced tombstone removal in the form of less physical data. However we end up with really big sstables that naturally

unsubscribe

2012-05-01 Thread Gmail
unsubscribe

Re: Question regarding major compaction.

2012-05-01 Thread Jason Rutherglen
I wonder if TieredMergePolicy [1] could be used in Cassandra for compaction? 1. http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html On Tue, May 1, 2012 at 6:38 AM, Edward Capriolo edlinuxg...@gmail.com wrote: Henrik, There are use cases where major compaction works

Re: JNA + Cassandra security

2012-05-01 Thread Rob Coli
On Mon, Apr 30, 2012 at 6:48 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Apr 30, 2012 at 7:49 PM, Cord MacLeod cordmacl...@gmail.com wrote: Hello group, I'm a new Cassandra and Java user so I'm still trying to get my head around a few things.  If you've disabled swap on a machine

How Cassandra determines the splits

2012-05-01 Thread Filippo Diotalevi
Hi, I'm having problems in my Cassandra/Hadoop (1.0.8 + cdh3u3) cluster related to how cassandra splits the data to be processed by Hadoop. I'm currently testing a map reduce job, starting from a CF of roughly 1500 rows, with cassandra.input.split.size 10 cassandra.range.batch.size 1 but

Re: Bulkload into a different CF

2012-05-01 Thread Benoit Perroud
!! Without any guarantee. I know it works but I never used this in production !! You can copy the sstables (renaming them accordingly) and call nodetool refresh. Don't forget to create your column family CF2 before. 2012/5/1 Oleg Proudnikov ol...@cloudorange.com: Hello, Is it possible to

Re: Question regarding major compaction.

2012-05-01 Thread Rob Coli
On Tue, May 1, 2012 at 4:31 AM, Henrik Schröder skro...@gmail.com wrote: But what's the difference between doing an extra read from that One Big File, than doing an extra read from whatever SSTable happen to be largest in the course of automatic minor compaction? The primary differences, as I

Data modeling advice (time series)

2012-05-01 Thread Tim Wintle
I believe that the general design for time-series schemas looks something like this (correct me if I'm wrong): (storing time series for X dimensions for Y different users) Row Keys: {USET_ID}_{TIMESTAMP/BUCKETSIZE} Columns: {DIMENSION_ID}_{TIMESTAMP%BUCKETSIZE} - {Counter} But I've not found

Re: Server Side Logic/Script - Triggers / StoreProc

2012-05-01 Thread Praveen Baratam
The point with NoSQL is flexibility and RDBMS is structure and guarantees. Both patterns IMHO do overlap. But they do have different USPs. On Mon, Apr 30, 2012 at 3:51 AM, Maxim Potekhin potek...@bnl.gov wrote: About a year ago I started getting a strange feeling that the noSQL community is

Re: Data modeling advice (time series)

2012-05-01 Thread Aaron Turner
On Tue, May 1, 2012 at 10:20 AM, Tim Wintle timwin...@gmail.com wrote: I believe that the general design for time-series schemas looks something like this (correct me if I'm wrong): (storing time series for X dimensions for Y different users) Row Keys:  {USET_ID}_{TIMESTAMP/BUCKETSIZE}

Re: Question regarding major compaction.

2012-05-01 Thread Edward Capriolo
Also there are some tickets in JIRA to impose a max sstable size and some other related optimizations that I think got stuck behind levelDB in coolness factor. Not every use case is good for leveled so adding more tools and optimizations of the Size Tiered tables would be awesome. On Tue, May 1,

Re: Bulkload into a different CF

2012-05-01 Thread Oleg Proudnikov
Benoit Perroud benoit at noisette.ch writes: You can copy the sstables (renaming them accordingly) and call nodetool refresh. Thank you, Benoit. In that case could I try snapshot+moverename+refresh on a live system? Regards, Oleg

Re: Question regarding major compaction.

2012-05-01 Thread Mohit Anchlia
+1 On Tue, May 1, 2012 at 12:06 PM, Edward Capriolo edlinuxg...@gmail.comwrote: Also there are some tickets in JIRA to impose a max sstable size and some other related optimizations that I think got stuck behind levelDB in coolness factor. Not every use case is good for leveled so adding

Re: Question regarding major compaction.

2012-05-01 Thread Oleg Proudnikov
Henrik Schröder skrolle at gmail.com writes: But what's the difference between doing an extra read from that One Big File, than doing an extra read from whatever SSTable happen to be largest in the course of automatic minor compaction? There is this note regarding major compaction in the

Re: Bulkload into a different CF

2012-05-01 Thread Benoit Perroud
I would just try to copy instead of moving first, and dropping the old CF or the not needed snapshot if necessary when everything is ok. 2012/5/1 Oleg Proudnikov ol...@cloudorange.com: Benoit Perroud benoit at noisette.ch writes: You can copy the sstables (renaming them accordingly) and

Re: Data modeling advice (time series)

2012-05-01 Thread aaron morton
I would try to avoid 100's on MB's per row. It will take longer to compact and repair. 10's is fine. Take a look at in_memory_compaction_limit and thrift_frame_size in the yaml file for some guidance. Cheers - Aaron Morton Freelance Developer @aaronmorton

Re: Cassandra backup queston regarding commitlogs

2012-05-01 Thread aaron morton
If you delete the commit logs you are rolling back to exactly what was in the snapshot. When you take a snapshot it flushes the memtables first, so there is nothing in the commit log that is not in the snapshot. Rolling back to a snapshot is rollback to that point in time. If you want to

Using CNAMEs for broadcast_address

2012-05-01 Thread Tom Duffield
Hey, There is a push to use Akamai IPA to accelerate traffic between our Cassandra nodes. Ignoring all other complexities this introduces, is it possible to use CNAMEs for broadcast addresses? I'm also assuming this restricts us to using only the PropertyFileSnitch (since we are not strictly in

Re: Cassandra backup queston regarding commitlogs

2012-05-01 Thread Roshan
Many thanks Aaron. I will post a support issue for them. But will keep the snapshot + incremental backups + commitlogs to recover any failure situation. -- View this message in context:

Re: Taking a Cluster Wide Snapshot

2012-05-01 Thread Tamar Fraenkel
I think it make's sense and would be happy if you can share the incremental snapshot scripts. Thanks! *Tamar Fraenkel * Senior Software Engineer, TOK Media [image: Inline image 1] ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On Tue, May 1, 2012 at