Re: Install Cassandra on EC2

2011-08-04 Thread Eldad Yamin
HI Aaron, Thanks for your replay. I've already saw that, but at the moment I'm interesting in installing Cassandra from scratch - I want to learn. well, yesterday I've installed 1 node - now I'm looking on how to add more nodes and read more about Cassandra's tools (node reaper etc.) Thanks! On

Re: HOW TO select a column or all columns that start with X

2011-08-04 Thread CASSANDRA learner
Can you please gimme an example on this using hector client On Thu, Aug 4, 2011 at 7:18 AM, Boris Yen yulin...@gmail.com wrote: It seems to me that your column name consists of two components. If you have the luxury to upgrade your cassandra to 0.8.1+, I think you can think about using the

Fewer wide rows vs. more smaller rows

2011-08-04 Thread Benoit Perroud
Hi All, In a conceptual point of view, I'm wondering what is the pros cons, mainly in term of access efficiency, of both approach : - Grouping row keys together to reduce the number of keys, but having wider rows (with more columns) - One object in one row Let's illustrate with an example : I

Re: how to solve one node is in heavy load in unbalanced cluster

2011-08-04 Thread Yan Chunlu
I have tried the nodetool move but get the following error node3:~# nodetool -h node3 move 0 Exception in thread main java.lang.IllegalStateException: replication factor (3) exceeds number of endpoints (2) at

Re: how to solve one node is in heavy load in unbalanced cluster

2011-08-04 Thread Yan Chunlu
also nothing happens about the streaming: nodetool -h node3 netstats Mode: Normal Not sending any streams. Nothing streaming from /10.28.53.11 Pool NameActive Pending Completed Commandsn/a 0 165086750 Responses

Re: HOW TO select a column or all columns that start with X

2011-08-04 Thread Benoit Perroud
https://github.com/edanuff/CassandraIndexedCollections 2011/8/4 CASSANDRA learner cassandralear...@gmail.com: Can you please gimme an example on this using hector client On Thu, Aug 4, 2011 at 7:18 AM, Boris Yen yulin...@gmail.com wrote: It seems to me that your column name consists of two

Re: HOW TO select a column or all columns that start with X

2011-08-04 Thread Boris Yen
Assume you have a column family named testCF with comparator * CompositeType*(AsciiType, IntegerType(reversed=true), IntegerType); and a few columns have been inserted into record key a. Composite start = new Composite() ; Composite end = new Composite() ;

Re: Planet Cassandra (an aggregation site for Cassandra News)

2011-08-04 Thread Boris Yen
Looking forward to it. ^^ On Thu, Aug 4, 2011 at 1:56 PM, Eldad Yamin elda...@gmail.com wrote: Great! I hope it will be open soon! On Wed, Aug 3, 2011 at 10:33 PM, Ed Anuff e...@anuff.com wrote: Awesome, great news! On Wed, Aug 3, 2011 at 11:53 AM, Lynn Bender line...@gmail.com wrote:

RE: Trying to find the problem with a broken pipe

2011-08-04 Thread Tim Snyder
I am getting the same problem (Broken Pipe) on a loader program, after about 8 million read, write pairs. I am pushing serialized objects into a column with the program, the object it seems to be doing it on is much larger than the prior objects, so I am wondering if it is possibly a column size

Re: Exception encountered during startup - org.apache.cassandra.db.marshal.MarshalException: UUIDs must be exactly 16 bytes

2011-08-04 Thread jens müller
Hello Sylvain, thank you very much for your kind help. Thanks Jens 2011/8/3 Sylvain Lebresne sylv...@datastax.com This is a known bug: https://issues.apache.org/jira/browse/CASSANDRA-2867 (note that the issue title is not exact, in that this is not a problem with an upgrade from 0.7 to

Re: Problems using Thrift API in C

2011-08-04 Thread Konstantin Naryshkin
I have had similar issues when I generated Cassandra for Erlang. It seems that Thrift 0.6.1 (the latest stable version) does not work with Cassandra. Using Thrift 0.7 does. I had issues where it would give me run time errors when trying to send an insert (it would not serialize correctly).

Re: Problems using Thrift API in C

2011-08-04 Thread Rafael Almeida
- Original Message - From: Konstantin Naryshkin konstant...@a-bb.net To: user@cassandra.apache.org Cc: Sent: Thursday, August 4, 2011 10:36 AM Subject: Re: Problems using Thrift API in C I have had similar issues when I generated Cassandra for Erlang. It seems that Thrift

Re: CQL: How to escape single quotes?

2011-08-04 Thread Eric Evans
On Wed, 2011-08-03 at 17:22 +0200, Jens Hartung wrote: Odd, when you generally escape special chars with an backslash. Not in SQL (which CQL is patterned after). SQL escapes single quotes using an additional quote character. -- Eric Evans eev...@rackspace.com

Re: Secondary indexing and order clauses

2011-08-04 Thread Jason Rutherglen
Hi Todd, The Lucene secondary index will support sorting fairly easily (eg, out of the box). See: CASSANDRA-2915 Jason On Tue, Aug 2, 2011 at 8:26 PM, Todd Nine t...@spidertracks.com wrote: ** Hi guys, Now that dynamic composite have been introduced as part of cassandra core, has any

Re: how to solve one node is in heavy load in unbalanced cluster

2011-08-04 Thread Yan Chunlu
hi, any help? thanks! On Thu, Aug 4, 2011 at 5:02 AM, Yan Chunlu springri...@gmail.com wrote: forgot to mention I am using cassandra 0.7.4 On Thu, Aug 4, 2011 at 5:00 PM, Yan Chunlu springri...@gmail.com wrote: also nothing happens about the streaming: nodetool -h node3 netstats Mode:

Re: Last manual repair time

2011-08-04 Thread Jonathan Ellis
Not yet. https://issues.apache.org/jira/browse/CASSANDRA-2405 On Thu, Aug 4, 2011 at 4:44 AM, Bart Swedrowski b...@timedout.org wrote: Hello, I was going through documentation, however couldn't find anything about whether Cassandra stores somewhere information about when manual repair

Re: Fewer wide rows vs. more smaller rows

2011-08-04 Thread Jonathan Ellis
The principle is, keep data you retrieve at the same time, in the same row. Here the logical unit to query is one user so I don't see any benefit to forcing them into a single row unnaturally. On Thu, Aug 4, 2011 at 2:38 AM, Benoit Perroud ben...@noisette.ch wrote: Hi All, In a conceptual

Re: Problems using Thrift API in C

2011-08-04 Thread Konstantin Naryshkin
Rafael: Try using the latest version of Thrift (0.7) it seems that older versions cannot talk to Cassandra properly. - Original Message - From: Rafael Almeida almeida...@yahoo.com To: user@cassandra.apache.org Sent: Thursday, August 4, 2011 1:53:20 PM Subject: Re: Problems using Thrift

Re: Write everywhere, read anywhere

2011-08-04 Thread Mike Malone
2011/8/3 Patricio Echagüe patric...@gmail.com On Wed, Aug 3, 2011 at 4:00 PM, Philippe watche...@gmail.com wrote: Hello, I have a 3-node, RF=3, cluster configured to write at CL.ALL and read at CL.ONE. When I take one of the nodes down, writes fail which is what I expect. When I run a

RE: Write everywhere, read anywhere

2011-08-04 Thread Jeremiah Jordan
If you have RF=3 quorum won't fail with one node down. So R/W quorum will be consistent in the case of one node down. If two nodes go down at the same time, then you can get inconsistent data from quorum write/read if the write fails with TimeOut, the nodes come back up, and then read asks

Re: Write everywhere, read anywhere

2011-08-04 Thread Mike Malone
On Thu, Aug 4, 2011 at 10:25 AM, Jeremiah Jordan jeremiah.jor...@morningstar.com wrote: If you have RF=3 quorum won’t fail with one node down. So R/W quorum will be consistent in the case of one node down. If two nodes go down at the same time, then you can get inconsistent data from

VM Thread taking 100% CPU?

2011-08-04 Thread Yang
I'm running my cassandra cluster (one node), it went fine for a while under heavy load , using 270% CPU etc (this is a multi-core machine). but after a while, the CPU utilization dropped to 100%, and the cluster stopped responding to requests. I did a jstack , and did ps ax -L -o

Re: VM Thread taking 100% CPU?

2011-08-04 Thread Yang
never mind, I found it, I was trying to dump out the entire keyspace of 1mil rows, which is not that big, but somehow caused the GC to bring down the server http://stackoverflow.com/questions/3895940/what-does-java-vm-thread-do On Thu, Aug 4, 2011 at 1:02 PM, Yang tedd...@gmail.com wrote:

Re: Fewer wide rows vs. more smaller rows

2011-08-04 Thread Benoit Perroud
Thanks for your advise. Make sense. And without sticking to my dummy example, conceptually, what has a smaller memory footprint : 1M rows of 1 column or 1 row with 1M columns ? And if the row key and column name are known, is there any performance difference between both scenarios ? Thanks

get_range_slices( start=null, end=null ) ??

2011-08-04 Thread Yang
our keyspace is really not that big, about 1million rows, each about 500 bytes but doing a get_range_slices() on the entire key range gives OOM errors (I bumped up the -Xmx arg now, still trying, but giving such a large chunk of data in one RPC call is still bad), so that leaves me the option to

Re: get_range_slices( start=null, end=null ) ??

2011-08-04 Thread Robert Jackson
You should be able to specify the count parameter to your KeyRange and just use the last key returned as your start_key for the next page. Sent from my iPhone On Aug 4, 2011, at 5:00 PM, Yang tedd...@gmail.com wrote: our keyspace is really not that big, about 1million rows, each about

Re: Last manual repair time

2011-08-04 Thread Teijo Holzer
That's simple, set your log level to INFO in log4j-server.properties and then do the following: Start of every repair: grep 'Waiting for repair requests:' system.log End of every repair: grep 'No neighbors to repair with' system.log If you perform individual repairs for each keyspace/column

Re: get_range_slices( start=null, end=null ) ??

2011-08-04 Thread Yang
ah got it , thanks! On Thu, Aug 4, 2011 at 2:59 PM, Robert Jackson robe...@promedicalinc.com wrote: You should be able to specify the count parameter to your KeyRange and just use the last key returned as your start_key for the next page. Sent from my iPhone On Aug 4, 2011, at 5:00 PM,

Re: Cassandra encountered an internal error processing this request: TApplicationError type: 6 message:Internal error

2011-08-04 Thread aaron morton
The error log will contain a call stack, we need that. e.g. Failed with exception java.io.IOException:java.lang.NullPointerException ERROR 15:22:33,528 Failed with exception java.io.IOException:java.lang.NullPointerException java.io.IOException: java.lang.NullPointerException at

Force a garbage collection with jmxterm from the shell

2011-08-04 Thread Teijo Holzer
Hi, The following command line triggers a garbage collection via JMX: echo 'run -b java.lang:type=Memory gc' | java -jar jmxterm-1.0-alpha-4-uber.jar -l service:jmx:rmi:///jndi/rmi://hostname:8080/jmxrmi -n It uses: http://wiki.cyclopsgroup.org/jmxterm The GC is necessary after a major

Re: transaction principle between multiple column families

2011-08-04 Thread aaron morton
A write for a single row is atomic, including writing to multiple CF's in with the same row key. http://wiki.apache.org/cassandra/FAQ#batch_mutate_atomic They are not isolated though, reads may see the write partially applied. Have a look at the data modelling slides here

Re: Trying to find the problem with a broken pipe

2011-08-04 Thread aaron morton
It's probably a network thing. The only thing I can think of in cassandra is thrift_max_message_length_in_mb in the config. That config setting will result in a TException thrown on the server side (i think), not sure if that makes the server kill the socket. I would hope the error returns to

Re: Problems using Thrift API in C

2011-08-04 Thread aaron morton
Cassandra 0.8 uses Thrift 0.6+ If you are having problems using thrift 0.6 send over the error. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 5 Aug 2011, at 03:08, Konstantin Naryshkin wrote: Rafael: Try using the latest

no stack trace :(

2011-08-04 Thread Dean Hiller
There seems to be no stack trace on this message Error: Exception thrown by the agent : java.net.MalformedURLException: Local host name unknown: java.net.UnknownHostException: ip-10-0-1-10: ip-10-0-1-10 I am also on sure why cassandra is doing a getLocalHost when I tried to bind everything to

Re: Force a garbage collection with jmxterm from the shell

2011-08-04 Thread Jeffrey Kesselman
Technically, by the VM spec, you can never *force* a java VM to garbage collect. You can request, but thats it. Rather then open that whole debate again if anyone doubts this, i suggets they look back in the archives. JK On Thu, Aug 4, 2011 at 6:35 PM, Teijo Holzer thol...@wetafx.co.nz

Re: Cassandra encountered an internal error processing this request: TApplicationError type: 6 message:Internal error

2011-08-04 Thread Dikang Gu
Sure, I can find the stack trace for some exceptions: ERROR [pool-2-thread-132] 2011-07-23 13:29:04,869 Cassandra.java (line 3210) Internal error processing get_range_slices java.lang.NullPointerException at org.apache.cassandra.db.ColumnFamily.diff(ColumnFamily.java:298) at

what is taking up my JVM heap space?

2011-08-04 Thread Yang
I ran my cassandra cluster on one node, inserted about 50 rows, each about 400 bytes, then the heap went up to 800MB, and constantly goes into full GC. I guessed that the memtable was probably taking a lot of space, so I did a node tool flush, so that all memtable are changed to 0. after this,

Re: what is taking up my JVM heap space?

2011-08-04 Thread Jonathan Ellis
If you didn't GC post-flush then it's just leftovers. An empty Cassandra (nothing in memtables or cache) will use 5-10MB of heap. On Thu, Aug 4, 2011 at 9:38 PM, Yang tedd...@gmail.com wrote: I ran my cassandra cluster on one node, inserted about 50 rows, each about 400 bytes, then the

Re: what is taking up my JVM heap space?

2011-08-04 Thread Yang
Thanks Jonathan On Thu, Aug 4, 2011 at 7:44 PM, Jonathan Ellis jbel...@gmail.com wrote: If you didn't GC post-flush then it's just leftovers. An empty Cassandra (nothing in memtables or cache) will use 5-10MB of heap. On Thu, Aug 4, 2011 at 9:38 PM, Yang tedd...@gmail.com wrote: I ran

move one node for load re-balancing then it status stuck at Leaving

2011-08-04 Thread Yan Chunlu
I have 3 nodes and the RF used to be 2, after awhile I have changed it to 3;  using Cassandra 0.7.4 I have tried the nodetool move but get the following error node3:~# nodetool -h node3 move 0 Exception in thread main java.lang.IllegalStateException: replication factor (3) exceeds number of

Re: move one node for load re-balancing then it status stuck at Leaving

2011-08-04 Thread mcasandra
Check things like netstats, disk space etc to see why it's in Leaving state. Anything in the logs that shows Leaving? -- View this message in context: