RE: data model advice needed

2013-02-27 Thread Sloot, Hans-Peter
What would be the best book to read about data modeling in Cassandra? I have ‘Cassandra the definitive guide’ but that is relatively old and has only a very limited example of how to design a model. Hans-Peter From: ka...@comcast.net [mailto:ka...@comcast.net] Sent: woensdag 27 februari 2013 19

Re: is upgradesstables required for 1.1.4 to 1.2.2? (I don't think it is)

2013-02-27 Thread Michal Michalski
I'm currently migrating 1.1.0 to 1.2.1 and on our small CI cluster, that I was testing some stuff on, it seems that it's not required to run upgradesstables (this doc doesn't mention about it too: http://www.datastax.com/docs/1.2/install/upgrading but the previous versions did). Of course I'd l

Re: no other nodes seen on priam cluster

2013-02-27 Thread Ben Bromhead
Off the top of my head I would check to make sure the Autoscaling Group you created is restricted to a single Availability Zone, also Priam sets the number of EC2 instances it expects based on the maximum instance count you set on your scaling group (it did this last time i checked a few months ago

Re: misreports on nodetool ring command on 1.1.4

2013-02-27 Thread Hiller, Dean
I finally gave up as it was supposed to be creating SimpleStrategy by default but was creating NTS by default so eventually I forced it to SimpleStrategy which did not have the issue. I never really figured out what was wrong there but my simpleStrategy correctly shows every node owns 75% which

best way to clean up a column family? 60Gig of dangling data

2013-02-27 Thread Hiller, Dean
Okay, we had 6 nodes of 130Gig and it was slowly increasing. Through our operations to modify bloomfilter fp chance, we screwed something up as trying to relieve memory pressures was tough. Anyways, somehow, this caused nodes 1, 2, and 3 to jump to around 200Gig and our incoming data stream is

Reading old data problem

2013-02-27 Thread Víctor Hugo Oliveira Molinar
Hello, I need some help to manage my live cluster! I'm currently running a cluster with 2 nodes, RF:2, CL:1. Since I'm limited to hardware upgrade issues, I'm not able to increase my ConsitencyLevel for now. Anyway, * *I ran a full repair on each node of the cluster followed by a flush. Although

Re: misreports on nodetool ring command on 1.1.4

2013-02-27 Thread Tyler Hobbs
What are the replication settings for the keyspace you created? Perhaps you used NTS with a bad DC name? On Wed, Feb 27, 2013 at 7:50 AM, Hiller, Dean wrote: > I just installed 1.1.4 as I need to test upgrade to 1.2.2. I have an > existing 6 node cluster which shows 50% ownership on each node

Re: is upgradesstables required for 1.1.4 to 1.2.2? (I don't think it is)

2013-02-27 Thread Hiller, Dean
H, wouldn't I have to run upgradesstables BEFORE I start the 1.2.2 node? But running upgradesstables as I recall required cassandra to be running.so does it somehow understand the old format when it starts I suspect? I am thinking I just keep the node out of the ring while I run the upgra

Re: is upgradesstables required for 1.1.4 to 1.2.2? (I don't think it is)

2013-02-27 Thread Hiller, Dean
Hmmm, I have this info from Aaron, but what about bringing up version 1.2.2 with thrift off so I can run upgradesstables before I rejoin the ring? Quote from Aaron... In pre 1.2 add these jvm startup params -Dcassandra.join_ring=false -Dcassandra.start_rpc=false Thanks, Dean On 2/27/13 12:00

Re: is upgradesstables required for 1.1.4 to 1.2.2? (I don't think it is)

2013-02-27 Thread Michael Kjellman
Yes, it's required between majors. Which your upgrade would be. On 2/27/13 10:54 AM, "Hiller, Dean" wrote: >My script to upgrade our first node in QA is thus (basically, snapshot, >drain, stop, then switch over then start)Š > >#!/bin/bash > >export NODE=$1 >export VERSION=1.1.4 >export USER=cass

is upgradesstables required for 1.1.4 to 1.2.2? (I don't think it is)

2013-02-27 Thread Hiller, Dean
My script to upgrade our first node in QA is thus (basically, snapshot, drain, stop, then switch over then start)… #!/bin/bash export NODE=$1 export VERSION=1.1.4 export USER=cassandra #NOTE: This script requires you have cassandra 1.2.2 in /opt/cassandra-1.2.2 but # feel free to modify if you

Re: data model advice needed

2013-02-27 Thread Hiller, Dean
There are many different patterns in noSQL with 90% being different than an RDBMS. Check out this page for some things to get you thinking http://buffalosw.com/wiki/Patterns-Page/ If you ever consider playorm and you can figure out how to partition your data(perhaps by month), you can do querie

Re: data model advice needed

2013-02-27 Thread kadey
One possibility would be to use dynamic columns, with each column name being a composite made from a timestamp, and the value of each containing serialized json of the details. The host could be the key. Then you could slice the data by column name. Ken - Original Message - Fro

data model advice needed

2013-02-27 Thread Sloot, Hans-Peter
Hi, I would like to get some advice on how to model columnfamilies for storing log of firewalls. The columns are listed further below. All the possibilities confuse me a bit (super columns, secondary indexes etc). My main question is how can I create the columnfamily in order to be able to get

Re: System.log

2013-02-27 Thread Akshay V. Rao
The conf/ directory of cassandra contains log4j-server.properties. I would assume cassandra just rides on top of whatever log4j does. -- Akshay On Wednesday, February 27, 2013 at 12:44 PM, Andy Stec wrote: > Cassandra rotates system.log when it reaches 20MB. We see that old logs > are kept

System.log

2013-02-27 Thread Andy Stec
Cassandra rotates system.log when it reaches 20MB. We see that old logs are kept for over a month. Is Cassandra going to delete or compress these logs when a certain threshold is reached or are we supposed to do it ourselves?

Re: no other nodes seen on priam cluster

2013-02-27 Thread Marcelo Elias Del Valle
One additional important info, I checked here and the seeds seems really different on each node. The command echo `curl http://127.0.0.1:8080/Priam/REST/v1/cassconfig/get_seeds` returns ip2 on first node and ip1,ip1 on second node. Any idea why? It's probably what is causing cassandra to die, right

Re: no other nodes seen on priam cluster

2013-02-27 Thread Marcelo Elias Del Valle
Hello Ben, Thanks for the willingness to help, 2013/2/27 Ben Bromhead > > Have your added the priam java agent to cassandras JVM argurments (e.g. > -javaagent:$CASS_HOME/lib/priam-cass-extensions-1.1.15.jar) and does the > web container running priam have permissions to write to the cassandra >

misreports on nodetool ring command on 1.1.4

2013-02-27 Thread Hiller, Dean
I just installed 1.1.4 as I need to test upgrade to 1.2.2. I have an existing 6 node cluster which shows 50% ownership on each node which makes sense since RF=3 on everything I have. I brought up all 4 nodes in this cluster and ran nodetool ring and it shows every node with 25%. Then, I create

Re: Understanding system.log

2013-02-27 Thread Alain RODRIGUEZ
Hi Victor. AFAIK, there is nothing like that. But I am quite sure that if you don't understand a log entry someone in this mailing list will be able to help you with it. Alain 2013/2/25 Víctor Hugo Oliveira Molinar > Hello everyone! > I'd like to know if there is any guide or description of t

Re: range queries

2013-02-27 Thread Alain RODRIGUEZ
Things you can find searching on the web : http://wiki.apache.org/cassandra/DataModel#Range_queries 2013/2/27 Alain RODRIGUEZ > "What am I doing wrong here?" > > You are probably using a RandomPartitioner (or Murmur3Partitioner) which > randomize keys to avoid hot spots. > > Basically, you jus

Re: range queries

2013-02-27 Thread Alain RODRIGUEZ
"What am I doing wrong here?" You are probably using a RandomPartitioner (or Murmur3Partitioner) which randomize keys to avoid hot spots. Basically, you just can't use range query because 'nlxx' is stored as md5('nlxx'). You should better modify your model to use column slice, whi

range queries

2013-02-27 Thread Sloot, Hans-Peter
Hello, I have what is perhaps a silly question. Column family other2 which has a varchar as primary key and a uuid column. I have inserted 2000 rows All rows keys start with 'nl' followed by other characters. To my surprise when I do : select count(*) from other2 where key > 'z'; It shows :

RE: NULL values

2013-02-27 Thread Sloot, Hans-Peter
Ok so eventually the row which has only the initial key will disappear? -Original Message- From: Tristan Seligmann [mailto:mithra...@mithrandi.net] Sent: woensdag 27 februari 2013 11:02 To: user@cassandra.apache.org Subject: Re: NULL values Cassandra only stores keys, not columns; once

Re: NULL values

2013-02-27 Thread Michal Michalski
W dniu 27.02.2013 10:57, Marco Matarazzo pisze:> You may also be interested in this: > > https://issues.apache.org/jira/browse/CASSANDRA-3783 CASSANDRA-3783 might not be the case here. The question is about using null in SELECT statements, which will require modifications in secondary indexes

Re: NULL values

2013-02-27 Thread Tristan Seligmann
Cassandra only stores keys, not columns; once all of the columns in a row have been deleted, there is nothing left to delete, although the row may still appear in some queries (with no columns) until the tombstones for those columns have been removed (which occurs during compaction once gc_grace_se

RE: NULL values

2013-02-27 Thread Sloot, Hans-Peter
But how do you check whether it exists? Can I select rows from a columnfamily which do not have a column set? What if I have set a ttl on all columns. After the expiration everything will be removed except the key. How can I determine the keys that have no additional columns and delete them? Fr

Re: NULL values

2013-02-27 Thread Marco Matarazzo
You may also be interested in this: https://issues.apache.org/jira/browse/CASSANDRA-3783 Also, I *guess* that a deleted column takes some (very small?) space with its tombstone, until it's removed. But I leave the details to someone else, as I'm definitely not an expert. Il giorno 27/feb/20

Re: NULL values

2013-02-27 Thread Alain RODRIGUEZ
Cassandra (C*) has no NULL values. C* is column schemaless, meaning you can have different columns on each row of the same ColumnFamily (CF). So if you want to check if a certain column is NULL for a row, you just check if it exist. By the way, you can store a column with a name and no value (empt

NULL values

2013-02-27 Thread Sloot, Hans-Peter
Hi, How does Cassandra handle NULL values? I want to know how I can see rows where a certain column has no values. For example if I set the TTL for columns is it possible to select rows where the ttl has expired for deletion. Regards Hans-Peter Dit bericht is vertrouwelijk en kan geheime i