Re: SuperColumns

2010-04-15 Thread Vijay
Yes a super column can only have columns in it. Regards, /VJ On Wed, Apr 14, 2010 at 10:28 PM, Christian Torres chtor...@gmail.comwrote: I'm defining a ColumnFamily (Table) type Super, It's posible to have a SuperColumn inside another SuperColumn or SuperColumns can only have normal

Row key: string or binary (byte[])?

2010-04-15 Thread Roland Hänel
Is there any effort ongoing to make the row key a binary (byte[]) instead of a string? In the current cassandra.thrift file (0.6.0), I find: const string VERSION = 2.1.0 [...] struct KeySlice { 1: required *string* key, 2: required listColumnOrSuperColumn columns, } while on the current

AssertionError: DecoratedKey(...) != DecoratedKey(...)

2010-04-15 Thread Ran Tavory
When restarting one of the nodes in my cluster I found this error in the log. What does this mean? INFO [GC inspection] 2010-04-15 05:03:04,898 GCInspector.java (line 110) GC for ConcurrentMarkSweep: 712 ms, 11149016 reclaimed leaving 442336680 used; max is 4432068608 ERROR

Re: Time-series data model

2010-04-15 Thread Jean-Pierre Bergamin
Am 14.04.2010 15:22, schrieb Ted Zlatanov: On Wed, 14 Apr 2010 15:02:29 +0200 Jean-Pierre Bergaminja...@ractive.ch wrote: JB The metrics are stored together with a timestamp. The queries we want to JB perform are: JB * The last value of a specific metric of a device JB * The values of a

Re: AssertionError: DecoratedKey(...) != DecoratedKey(...)

2010-04-15 Thread Gary Dusbabek
Ran, It looks like you're seeing https://issues.apache.org/jira/browse/CASSANDRA-866. It's fixed in 0.6.1. Gary On Thu, Apr 15, 2010 at 04:06, Ran Tavory ran...@gmail.com wrote: When restarting one of the nodes in my cluster I found this error in the log. What does this mean?  INFO [GC

How to implement TOP TEN in Cassandra

2010-04-15 Thread Allen He
Hi , all How to implement *TOP TEN* in Cassandra, For example , *Top ten stories in Digg.com* How to model. Thanks

Get super-columns using SimpleCassie

2010-04-15 Thread Yésica Rey
I'm using SimpleCassie like cassandra client. I have a question: can I get all super-columns that there in one column-family? If yes, how can i do it? Regards!

Re: TException: Error: TSocket: timed out reading 1024 bytes from 10.1.1.27:9160

2010-04-15 Thread Jonathan Ellis
sounds like https://issues.apache.org/jira/browse/THRIFT-347 On Wed, Apr 14, 2010 at 11:58 PM, richard yao richard.yao2...@gmail.com wrote: I am having a try on cassandra, and I use php to access cassandra by thrift API. I got an error like this:     TException:  Error: TSocket: timed out

Re: TException: Error: TSocket: timed out reading 1024 bytes from 10.1.1.27:9160

2010-04-15 Thread richard yao
Thank you!

Re: AssertionError: DecoratedKey(...) != DecoratedKey(...)

2010-04-15 Thread Ran Tavory
yes, this looks like the same issue, thanks Gary. Other than seeing the errors in the log I haven't seen any other irregularities. (maybe there are, but they haven't surfaced). Does this assertion mean data corruption or something else that's worth waiting to 0.6.1 for? On Thu, Apr 15, 2010 at

Re: AssertionError: DecoratedKey(...) != DecoratedKey(...)

2010-04-15 Thread Gary Dusbabek
No data corruption. There was a bug in the way that the index was scanned that was manifesting itself when when the index got bigger than 2GB. Gary. On Thu, Apr 15, 2010 at 08:03, Ran Tavory ran...@gmail.com wrote: yes, this looks like the same issue, thanks Gary. Other than seeing the

Re: timestamp not found

2010-04-15 Thread Mike Malone
Looks like the timestamp, in this case, is 0. Does Cassandra allow zero timestamps? Could be a bug in Cassandra doing an implicit boolean coercion in a conditional where it shouldn't. Mike On Thu, Apr 15, 2010 at 8:39 AM, Lee Parker l...@socialagency.com wrote: We are currently migrating about

Re: timestamp not found

2010-04-15 Thread Lee Parker
When I am verifying the columns in the mutation map before sending it to cassandra, none of the timestamps are 0. I have had a difficult time recreating the error in a controlled environment so I can see the mutation map that was actually sent. Lee Parker l...@spredfast.com [image: Spredfast]

Re: timestamp not found

2010-04-15 Thread Lee Parker
I'm actually using PHP. I do have several php processes running, but each one should have it's own Thrift connection. Lee Parker l...@spredfast.com [image: Spredfast] On Thu, Apr 15, 2010 at 10:53 AM, Jonathan Ellis jbel...@gmail.com wrote: Looks like you are using C++ and not setting the

Re: RackAware and replication strategy

2010-04-15 Thread Benjamin Black
Have a look at locator/DatacenterShardStrategy.java. On Thu, Apr 15, 2010 at 8:16 AM, Ran Tavory ran...@gmail.com wrote: I'm reading this on this page http://wiki.apache.org/cassandra/ArchitectureInternals : AbstractReplicationStrategy controls what nodes get secondary, tertiary, etc.

busy thread on IncomingStreamReader ?

2010-04-15 Thread Ingram Chen
Hi all, We setup two nodes and simply set replication factor=2 for test run. After both nodes, say, node A and node B, serve several hours, we found that node A always keep 300% cpu usage. (the other node is under 100% cpu, which is normal) thread dump on node A shows that there are 3 busy

Re: BMT flush on windows?

2010-04-15 Thread Sonny Heer
From the jconsole, I go under ColumnFamilyStores-CF1-Column1-Operations and clicked force flush(). I'm getting a Operation return value null OK message box. what am I doing wrong? On Tue, Apr 13, 2010 at 3:12 PM, Jonathan Ellis jbel...@gmail.com wrote: you have three options (a) connect

Re: Recovery from botched compaction

2010-04-15 Thread Jonathan Ellis
On Tue, Apr 13, 2010 at 3:59 PM, Anthony Molinaro antho...@alumni.caltech.edu wrote: I actually got lucky and while it hovered in the 91-95% full, compaction finished and its now at 60%.  However, I still have around a dozen or so data files.  I thought 'nodeprobe compact' did a major

Re: batch_mutate silently failing

2010-04-15 Thread Jonathan Ellis
Could you create a ticket for us to return an error message in this situation? -Jonathan On Tue, Apr 13, 2010 at 4:24 PM, Lee Parker l...@socialagency.com wrote: nevermind. I figured out what the problem was. I was not putting the column inside a ColumnOrSuperColumn container. Lee Parker

Re: batch_mutate silently failing

2010-04-15 Thread Lee Parker
The entire thing was completely my own fault. I was making an invalid request and, somewhere in the code, I was catching the exception and not handling it at all. So it only appeared to be silent when in reality it was throwing a nice descriptive exception. Lee Parker l...@spredfast.com

Re: New User: OSX vs. Debian on Cassandra 0.5.0 with Thrift

2010-04-15 Thread Jonathan Ellis
You're right, to get those numbers on debian something is very wrong. Have you looked at http://spyced.blogspot.com/2010/01/linux-performance-basics.html ? What is the bottleneck on the linux machines? With the kind of speed you are seeing I wouldn't be surprised if it is swapping. -Jonathan

Re: batch_mutate silently failing

2010-04-15 Thread Jonathan Ellis
Ah, I see. Glad you resolved that. :) On Thu, Apr 15, 2010 at 12:31 PM, Lee Parker l...@socialagency.com wrote: The entire thing was completely my own fault. I was making an invalid request and, somewhere in the code, I was catching the exception and not handling it at all. So it only

Re: server crash - how to invertigate

2010-04-15 Thread Jonathan Ellis
There's a few things it could be: Out of memory: usually it can log the exception before dying but not always. there will be a java_$pid.hprof file with the heap dumped. JVM crash: there will be hs_err$pid.log file OS bug or hardware problem: sometimes your OS will log something -Jonathan On

Re: New User: OSX vs. Debian on Cassandra 0.5.0 with Thrift

2010-04-15 Thread Heath Oderman
I upgraded to 0.6 yesterday and it's bang on the same. I'll go read up on py_stress and give it a try. On Thu, Apr 15, 2010 at 1:57 PM, Jonathan Ellis jbel...@gmail.com wrote: What kind of numbers do you get from contrib/py_stress? (that's located somewhere else in 0.5, but you should really

Re: Time-series data model

2010-04-15 Thread Dan Di Spaltro
This is actually fairly similar to how we store metrics at Cloudkick. Below has a much more in depth explanation of some of that https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/ So we store each natural point in the NumericArchive table. ColumnFamily CompareWith=LongType

Re: framed transport

2010-04-15 Thread Nathan McCall
FWIW, We just exposed this as an option in hector. -Nate On Thu, Apr 15, 2010 at 8:38 AM, Miguel Verde miguelitov...@gmail.com wrote: On Thu, Apr 15, 2010 at 10:22 AM, Eric Evans eev...@rackspace.com wrote: But, if you've enabled framing on the server, you will not be able to use C# clients

Re: BMT flush on windows?

2010-04-15 Thread Sonny Heer
Hmmm. Same code runs on ubuntu, and I'm able to flush using the nodetool. What is the difference between inserting data using : StorageProxy.mutateBlocking vs. sending oneway message using the MessagingService? On Thu, Apr 15, 2010 at 10:14 AM, Jonathan Ellis jbel...@gmail.com wrote: probably

Re: timestamp not found

2010-04-15 Thread Lee Parker
I have done more error checking and I am relatively certain that I am sending a valid timestamp to the thrift library. I was testing a switch to the Framed Transport instead of Buffered Transport and I am getting fewer errors, but now the cassandra server dies when this happens. It is starting

json2sstable

2010-04-15 Thread Lee Parker
Has anyone used json2sstable to migrate a large amount of data into cassandra? What was your methodology? I assume that this will be much faster than stepping through my data and doing writes via PHP/Thrift. Lee Parker

Re: framed transport

2010-04-15 Thread Lee Parker
It appears that after some testing, the buffered transport seems more stable. I am occasionally getting a missing timestamp error during batch_mutate calls. It happens both on framed and buffered transports, but when it happens on a framed transport, the server crashes. Is this typical? Lee

Re: framed transport

2010-04-15 Thread Jonathan Ellis
Have you tried other client machines? It sounds like your client is generating garbage, which is Bad. https://issues.apache.org/jira/browse/THRIFT-601 On Thu, Apr 15, 2010 at 4:20 PM, Lee Parker l...@socialagency.com wrote: It appears that after some testing, the buffered transport seems more

Data model question - column names sort

2010-04-15 Thread Sonny Heer
Need a way to have two different types of indexes. Key: aTextKey ColumnName: aTextColumnName:55 Value: Key: aTextKey ColumnName: 55:aTextColumnName Value: All the valuable information is stored in the column name itself. Above two can be in different column families... Queries: Given a key,

Clarification on Ring operations in Cassandra 0.5.1

2010-04-15 Thread Anthony Molinaro
Hi, I have a cluster running on ec2, and would like to do some ring management. Specifically, I'd like to replace an existing node without another node (I want to change the instance type). I was looking over http://wiki.apache.org/cassandra/Operations and it seems like I could do something

Re: Is it possible to get all records in a CF?

2010-04-15 Thread Gary Dusbabek
You'll have to scan the CF. If you're using OrderPreservingPartitioner please see 'get_range_slices' (http://wiki.apache.org/cassandra/API). It would help if you had an idea of where the key might be, so you would know where to start scanning. Gary. On Thu, Apr 15, 2010 at 21:01, Jared Laprise

Re: json2sstable

2010-04-15 Thread 孔令华
I tried that and found that it cannot handle large file at present. But you can write a tool according to it. eg: first sorting your data file according to it's hash key; second, write to a SSTable directly On Fri, Apr 16, 2010 at 4:47 AM, Lee Parker l...@socialagency.com wrote: Has anyone used

Re: json2sstable

2010-04-15 Thread Brandon Williams
On Thu, Apr 15, 2010 at 3:47 PM, Lee Parker l...@socialagency.com wrote: Has anyone used json2sstable to migrate a large amount of data into cassandra? What was your methodology? I assume that this will be much faster than stepping through my data and doing writes via PHP/Thrift. If you're