Using 5-6 bytes for cassandra timestamps vs 8…

2011-08-29 Thread Kevin Burton
I keep thinking about the usage of cassandra timestamps and feel that for a lot of applications swallowing a 2-4x additional cost to memory might be a nonstarter. Has there been any discussion of using alternative date encodings? Maybe 1ms resolution is too high … perhaps 10ms resolution?
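
To put rough numbers on the question, here is a minimal sketch of packing a timestamp into 5 bytes at 10 ms resolution. The function names, the Unix epoch, and the big-endian layout are all assumptions made for illustration; nothing here reflects how Cassandra actually stores timestamps.

```python
RESOLUTION_MS = 10   # assumed coarser resolution, as floated in the question
WIDTH_BYTES = 5      # assumed compact width (the subject line says 5-6 bytes)

def encode_timestamp(epoch_ms: int) -> bytes:
    """Drop sub-10ms precision and store the tick count in a fixed 5 bytes."""
    ticks = epoch_ms // RESOLUTION_MS
    # Big-endian keeps byte-wise ordering equal to numeric ordering.
    # to_bytes raises OverflowError if the value does not fit.
    return ticks.to_bytes(WIDTH_BYTES, "big")

def decode_timestamp(raw: bytes) -> int:
    """Return epoch milliseconds, rounded down to the resolution."""
    return int.from_bytes(raw, "big") * RESOLUTION_MS

def range_years(width_bytes: int, resolution_ms: int) -> float:
    """How many years of timestamps fit in the given width and resolution."""
    seconds = (2 ** (8 * width_bytes)) * resolution_ms / 1000
    return seconds / (365.25 * 24 * 3600)
```

Under these assumptions, 5 bytes at 10 ms covers roughly 348 years from the epoch, and 6 bytes at 1 ms covers roughly 8,900 years, so the range is not the limiting factor; the cost is the lost precision when two writes land in the same tick.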

Re: What are the things to watch out for with big nodes

2011-08-29 Thread Peter Schuller
If I have a cluster with 15-20T nodes, some things that I know will be a potential problem are: compactions taking longer, higher read latencies, and a long time for adding/removing nodes. What are other things that can be problematic with big nodes?

flushwriter all time blocked

2011-08-29 Thread Roland Gude
Hi all, on a 0.7.8 cluster, in tpstats I can see the flushwriter stage having several tasks in state all-time-blocked (immediately after node restart it's 8, but it grows over time to around 300). What does it mean (or how can I find out) and what can I do about it? -- YOOCHOOSE GmbH Roland Gude

Re: Using 5-6 bytes for cassandra timestamps vs 8…

2011-08-29 Thread Terje Marthinussen
I have a patch for trunk which I just have to get time to test a bit before I submit. It is for super columns and will use the super columns timestamp as the base and only store variant encoded offsets in the underlying columns. If the timestamp equals that of the SC, it will store nothing
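
The patch itself is not shown in this thread, so the following is only a sketch of the general idea: store each column's timestamp as a variable-length offset from the super column's timestamp, and store nothing when the two are equal. The zigzag and 7-bit varint scheme and all function names are assumptions chosen for illustration, not the patch's actual encoding.

```python
def zigzag(n: int) -> int:
    """Map signed ints to unsigned so small negative offsets stay short."""
    return (n << 1) if n >= 0 else ((-n << 1) - 1)

def unzigzag(z: int) -> int:
    """Inverse of zigzag."""
    return (z >> 1) if z % 2 == 0 else -((z + 1) >> 1)

def encode_offset(base_ts: int, column_ts: int) -> bytes:
    """Encode column_ts relative to base_ts; empty bytes when they match."""
    delta = column_ts - base_ts
    if delta == 0:
        return b""
    z = zigzag(delta)
    out = bytearray()
    while True:
        byte = z & 0x7F          # low 7 bits of the remaining value
        z >>= 7
        if z:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)
            return bytes(out)

def decode_offset(base_ts: int, raw: bytes) -> int:
    """Rebuild the absolute timestamp from the base and the stored offset."""
    if not raw:
        return base_ts
    z = 0
    for position, byte in enumerate(raw):
        z |= (byte & 0x7F) << (7 * position)
    return base_ts + unzigzag(z)
```

In this sketch a column written within 63 ticks of the super column costs one byte instead of eight, and a column with an identical timestamp costs none.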

The way to query a CF with start 10 and end 100

2011-08-29 Thread Guofeng Zhang
Hi, I have a standard CF that has column start and end. I need to query its rows using condition start10 and end100. Is there any better way to do it? Using native secondary index or creating a specific CF for the search. I do not know which one is better. If the latter is preferred, how the

unsubscribe

2011-08-29 Thread Stanislav Vodetskyi

Re: Scaling Out / Replication Factor too?

2011-08-29 Thread Boris Yen
I am not sure, but I think the problem might be the order preserving partitioner you used. When using order preserving partitioners, data might be skewed, meaning most data stays on only a few servers, which might create a few heavily loaded servers. On Mon, Aug 29, 2011 at 7:24 AM, Ryan Lowe
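
The skew described here can be illustrated with a toy model. The sketch below places keys on four nodes either by their first byte (a stand-in for an order preserving partitioner) or by an MD5 hash (a stand-in for a random partitioner). The node count, the key format, and the simplified placement rules are assumptions for illustration and do not reproduce Cassandra's actual token ring.

```python
import hashlib
from collections import Counter

NODES = 4  # hypothetical cluster size

def order_preserving_node(key: str) -> int:
    """Each node owns an equal slice of the first-byte range 0..255."""
    return key.encode()[0] * NODES // 256

def hashed_node(key: str) -> int:
    """Place the key by its MD5 digest, ignoring key order entirely."""
    digest = hashlib.md5(key.encode()).digest()
    return int.from_bytes(digest, "big") % NODES

def load_per_node(keys, placement) -> Counter:
    """Count how many keys each node ends up holding."""
    return Counter(placement(k) for k in keys)

# Keys sharing a common prefix, as application keys often do.
keys = ["user%04d" % i for i in range(1000)]
ordered_load = load_per_node(keys, order_preserving_node)
hashed_load = load_per_node(keys, hashed_node)
```

With these keys, every one of them starts with the same byte, so the order preserving placement puts all 1000 on a single node, while the hashed placement spreads them across all four.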

[no subject]

2011-08-29 Thread Stanislav Vodetskyi
unsubscribe

Re:

2011-08-29 Thread Joris van der Wel
send a message to user-unsubscr...@cassandra.apache.org On Mon, Aug 29, 2011 at 1:23 PM, Stanislav Vodetskyi stanislav.vodets...@globallogic.com wrote: unsubscribe

Re: inconsistent counter value?

2011-08-29 Thread Sylvain Lebresne
On Sat, Aug 13, 2011 at 9:38 AM, Boris Yen yulin...@gmail.com wrote: I posted a comment for Cassandra-3006 after 0.8.4 was released, but it seems not to have been noticed there, so I re-post here, wondering if anyone could help. --- Follow the same steps posted on

Re:

2011-08-29 Thread Stanislav Vodetskyi
Thanks On Mon, Aug 29, 2011 at 2:38 PM, Joris van der Wel jo...@jorisvanderwel.com wrote: send a message to user-unsubscr...@cassandra.apache.org On Mon, Aug 29, 2011 at 1:23 PM, Stanislav Vodetskyi stanislav.vodets...@globallogic.com wrote: unsubscribe

Re: flushwriter all time blocked

2011-08-29 Thread Jonathan Ellis
the javadoc for the mbeans explains: /** * Get the number of tasks that had blocked before being accepted (or * rejected). */ public int getTotalBlockedTasks(); /** * Get the number of tasks currently blocked, waiting to be accepted by * the executor (because
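
As a rough way to read the "blocked before being accepted" counter, here is a toy, single-threaded model of a stage with a bounded queue. The class and method names are hypothetical and this is not Cassandra's executor; it only assumes that a submission which finds the queue full has to wait for a slot and is counted once when that happens.

```python
from collections import deque

class BoundedStage:
    """Toy stage: a bounded pending queue plus an all-time blocked counter."""

    def __init__(self, queue_capacity: int):
        self.queue_capacity = queue_capacity
        self.pending = deque()
        self.total_blocked = 0  # submissions that found the queue full

    def run_one(self) -> None:
        """Simulate the worker finishing the oldest pending task."""
        if self.pending:
            task = self.pending.popleft()
            task()

    def submit(self, task) -> None:
        """Queue a task; if the queue is full, count it and wait for a slot."""
        if len(self.pending) >= self.queue_capacity:
            self.total_blocked += 1
            self.run_one()  # stand-in for the producer waiting on the worker
        self.pending.append(task)
```

In this model a growing counter means producers are handing over work faster than the stage drains it; the tasks are still queued and run eventually, they were just delayed on entry.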

hw requirements

2011-08-29 Thread Helder Oliveira
Hello guys, What is the type of profile of a cassandra server? Are SSDs an option? Does cassandra need a better CPU or lots of memory? Are SATA II disks ok? I am making some tests, and I started evaluating the possible hardware. If someone already has conclusions about it, please share :D

AW: flushwriter all time blocked

2011-08-29 Thread Roland Gude
Hi, This still leaves me puzzled. Is it a bad thing? Why is it happening? And what does blocked before being accepted mean? Does it mean Cassandra did not even try to put the task into a queue? Thanks for enlightening me, roland -Original Message- From: Jonathan Ellis

Matt Dennis' presentation on Cassandra best practices on EC2

2011-08-29 Thread Jeremy Hanna
Just wanted to let people know about a great presentation that Matt Dennis did here at the Cassandra Austin meetup. It's on Cassandra best practices on EC2. We found the presentation extremely helpful. http://www.slideshare.net/mattdennis/cassandra-on-ec2

can't keep node running.

2011-08-29 Thread Ernst D Schoen-René
Hi, I've written about this before, but I can't find the original thread. I have one node on my 16 node cassandra cluster (running 8.1) that will not stay running. All nodes were configured the same way, but this one kept going down, so I changed these lines in my cassandra-env.sh:

Re: hw requirements

2011-08-29 Thread Richard Low
Hi, The hardware you choose depends a bit on your workload - writes vs reads, amount of cacheable data, latency requirements, etc. What sort of workload do you expect? See http://wiki.apache.org/cassandra/CassandraHardware for some general advice. People typically have 8-24 GB RAM per node

Re: The way to query a CF with start 10 and end 100

2011-08-29 Thread Benoit Perroud
Queries like start 10 and end 100 are not straightforward to model; you should use the value of start as the column name, and check the second condition on the client side. Just for comparison, modeling 10 value 100 is much easier if you set your values as column names, or using CompositeType if
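
A minimal sketch of that approach follows. It assumes the comparison operators lost from the question are start > 10 and end < 100, and it models a row as a list of (start, end) pairs sorted by start, standing in for columns sorted by name. The function name and data layout are hypothetical.

```python
import bisect

def query(columns, start_gt, end_lt):
    """Slice on the sorted column name (start), then filter end client side.

    columns: list of (start, end) tuples sorted by start.
    """
    starts = [start for start, _ in columns]
    # Equivalent of a column slice beginning just past start_gt.
    first = bisect.bisect_right(starts, start_gt)
    # The second condition cannot use the sort order, so check it here.
    return [(s, e) for s, e in columns[first:] if e < end_lt]

row = [(5, 50), (10, 20), (11, 90), (20, 150), (30, 99)]
matches = query(row, 10, 100)
```

The slice on start is cheap because columns are stored in sorted order; the filter on end still has to look at every column the slice returns, which is the cost of this layout.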

Recommendations on moving to Hadoop/Hive with Cassandra + RDBMS

2011-08-29 Thread Tharindu Mathew
Hi, I have an already running system where I define a simple data flow (using a simple custom data flow language) and configure jobs to run against stored data. I use quartz to schedule and run these jobs and the data exists on various data stores (mainly Cassandra but some data exists in RDBMS

Querying a composite key with cassandra-cli

2011-08-29 Thread Anthony Ikeda
Is there currently a way to query a composite key in cassandra-cli? I was sure I used to be able to call: get ColumnFamily['comp1:comp2'] But this has recently stopped working. Anthony

Updates lost

2011-08-29 Thread Jiang Chen
Hi, Just started developing using Cassandra (0.8.4). I noticed when updating the same row and column repeatedly, say, in a test case, updates may get lost. I found it in a Java client but the following python script also exhibits the same problem.

cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
just found the data dir consumes a lot of space, which is because there were many snapshots in it. but I have set snapshot_before_compaction: false. is it possible that cassandra creates those snapshots automatically? could I delete them? the dir names are strange (normally it should contain date

Re: Querying a composite key with cassandra-cli

2011-08-29 Thread Jonathan Ellis
Sounds like you need to specify a key_validation_class so the cli knows how to encode the key. (It used to assume keys were always ascii.) On Mon, Aug 29, 2011 at 4:02 PM, Anthony Ikeda anthony.ikeda@gmail.com wrote: Is there currently a way to query a composite key in cassandra-cli? I was

Re: cassandra auto create snapshots?

2011-08-29 Thread Jonathan Ellis
Perhaps you are seeing auto-snapshots before destructive events such as truncate or drop CF/KS. On Mon, Aug 29, 2011 at 4:19 PM, Yan Chunlu springri...@gmail.com wrote: just found the data dir consume a lot of space, which is because there was many snapshots in it. but I have set

Disk usage for CommitLog

2011-08-29 Thread Derek Andree
I run a single node cassandra instance, and we have lots of overwrites on a hot CF and disk utilization seems to grow pretty fast. We've noticed that when we restart cassandra disk utilization decreases dramatically (dramatic being something close to 50%). Most of this growth seems to be in

Re: cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
so it was useless? I didn't drop any CF/KS, could nodetool move, nodetool repair cause the problem? On Tue, Aug 30, 2011 at 5:23 AM, Jonathan Ellis jbel...@gmail.com wrote: Perhaps you are seeing auto-snapshots before destructive events such as truncate or drop CF/KS. On Mon, Aug 29, 2011

Re: Disk usage for CommitLog

2011-08-29 Thread Dan Hendry
First off, what version of Cassandra are you using? We've noticed that when we restart cassandra disk utilization decreases dramatically Presumably you mean 'utilization' as in free space. Specifically on a restart, this type of behavior is likely due to Cassandra deleting compacted SSTables.

Re: Disk usage for CommitLog

2011-08-29 Thread Jonathan Ellis
On Mon, Aug 29, 2011 at 8:04 PM, Derek Andree dand...@lacunasystems.com wrote: Is there an easy way to keep commit log size down without killing performance? There is in 1.0: commitlog_total_space_in_mb, added in https://issues.apache.org/jira/browse/CASSANDRA-2427. Until then you'll have to

Re: cassandra auto create snapshots?

2011-08-29 Thread Jonathan Ellis
No. On Mon, Aug 29, 2011 at 8:15 PM, Yan Chunlu springri...@gmail.com wrote: so it was useless? I didn't drop any CF/KS, could nodetool move, nodetool repair cause the problem? On Tue, Aug 30, 2011 at 5:23 AM, Jonathan Ellis jbel...@gmail.com wrote: Perhaps you are seeing auto-snapshots

Cassandra 0.8 CLI: Inconsistent treatment of literals for keys/columns and values

2011-08-29 Thread Oleg Proudnikov
Hi, After installing Cassandra 0.8 I discovered that my app stopped working. The issue is that the app is now unable to read a row that was inserted by a CLI set command with a numeric string key. CLI in Cassandra 0.8 seems to be treating literals inconsistently. Please let me know if I am

Re: Cassandra 0.8 CLI: Inconsistent treatment of literals for keys/columns and values

2011-08-29 Thread Jonathan Ellis
I answered this earlier today, so I added it to the FAQ: http://wiki.apache.org/cassandra/FAQ#cli_keys On Mon, Aug 29, 2011 at 9:31 PM, Oleg Proudnikov ol...@cloudorange.com wrote: Hi, After installing Cassandra 0.8 I discovered that my app stopped working. The issue is that the app is now

Re: Disk usage for CommitLog

2011-08-29 Thread Derek Andree
Thanks Dan, good info. First off, what version of Cassandra are you using? Sorry my bad, 0.8.4 Provided you are using a recent Cassandra version (late 0.7 or 0.8.x) I doubt the commit log is your problem. My experience using Cassandra as a time series data store (with a full 30 days of

Re: cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
thanks for the help. have you tried using those snapshots to recover a node? I have not found anything related to those auto-created snapshots in the wiki page, they don't even have a timestamp, not sure how to use those files On Tue, Aug 30, 2011 at 10:27 AM, Jonathan Ellis jbel...@gmail.com

Re: Updates lost

2011-08-29 Thread Tyler Hobbs
On Mon, Aug 29, 2011 at 4:56 PM, Peter Schuller peter.schul...@infidyne.com wrote: If the client sleeps for a few ms at each loop, the success rate increases. At 15 ms, the script always succeeds so far. Interestingly, the problem seems to be sensitive to alphabetical order. Updating the
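
One way to see why sleeping helps and why the order of values matters is a simplified model of last-write-wins reconciliation. The sketch assumes that when two writes to the same column carry the same timestamp, the greater value wins, which would be consistent with the alphabetical sensitivity reported above; the function and class names are hypothetical, and the monotonic clock is one possible client-side workaround rather than something proposed in the thread.

```python
def reconcile(a, b):
    """Pick the surviving (timestamp, value) pair for one column."""
    if a[0] != b[0]:
        return a if a[0] > b[0] else b
    # Assumed tie-break: equal timestamps fall back to comparing values.
    return a if a[1] >= b[1] else b

class MonotonicClock:
    """Hand out strictly increasing timestamps even if the source repeats."""

    def __init__(self, now_fn):
        self._now = now_fn   # e.g. a function returning epoch microseconds
        self._last = 0

    def next(self) -> int:
        ts = max(self._now(), self._last + 1)
        self._last = ts
        return ts

# Two quick updates that a coarse clock stamps identically:
stale_tick = lambda: 100
lost = reconcile((100, b"b"), (100, b"a"))      # second write "a" loses the tie

clock = MonotonicClock(stale_tick)
first_ts, second_ts = clock.next(), clock.next()
kept = reconcile((first_ts, b"b"), (second_ts, b"a"))  # second write now wins
```

Under these assumptions the update is only "lost" when both writes share a timestamp and the newer value sorts lower, which matches a failure that disappears once the client waits long enough between writes.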

Cassandra 0.8 schematool

2011-08-29 Thread Jenny
Hi, I noticed that schematool was removed from the release of Cassandra 0.8. I would like to know the reason for doing that and how I can load the schema from a configuration file (like .yaml) automatically. Best Regards! Jenny