How to add a new Keyspace?

2010-07-08 Thread ChingShen
Hi all, If I want to add a new Keyspace, does it mean I have to distribute my storage-conf.xml to whole nodes? and restart whole nodes? Shen

Re: How to add a new Keyspace?

2010-07-08 Thread Peter Schuller
  If I want to add a new Keyspace, does it mean I have to distribute my storage-conf.xml to whole nodes? and restart whole nodes? I *think* that is the case in Cassandra 0.6, but I'll let someone else comment. In trunk/upcoming 7 there are live schema upgrades that propagate through the

Re: How to add a new Keyspace?

2010-07-08 Thread Dave Viner
Here are my notes on how to make schema changes in 0.6: # Empty the commitlog with nodetool drain. = NOTE while this is running, the node will not accept writes. # Shutdown Cassandra and verify that there is no remaining data in the commitlog. = HOW to verify? # Delete the sstable files

Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Hello, I'm running Cassandra 0.6.0 on a cluster and have an application that needs to read all rows from a column family using the Cassandra Thrift API. Ideally, I'd like to be able to do this by having all nodes in the cluster read in parallel (i.e., each node reads a disjoint set of rows

Question about hinted handoff

2010-07-08 Thread ChingShen
Hi all, Please consider this case: (RF=1, CL=ONE) 1. I have A, B and C nodes. 2. A node is a coordinator node, it sends a request to B node to do write operation. 3. B node is down during write operation, so return failure message to client, and write a hint to C node. 4. B node comes

Re: Query on delete a column inside a super column

2010-07-08 Thread Moses Dinakaran
As per my knowledge in phpCassa I didnt find any option to remove a column from the supercolumn, The remove method removes the whole super column from the key, will check with thrift api. Through mutation object insert/update happens but removing a column dosent happen. Thank you all. Regards

How to stop Cassandra running in embeded mode

2010-07-08 Thread Andriy Kopachevsky
Hi, we are trying to set up intergation testing for Cassanrda, so we need to run and stop it as embeded service. Don't have any problem to start cassandra: import org.apache.cassandra.contrib.utils.service.CassandraServiceDataCleaner; class SomeTestClass { @Before public void setup()

Gossip round time

2010-07-08 Thread ChingShen
Hi, I found the http://www.slideshare.net/adorepump/cassandra-nosql ppt, that mentioned State disseminated in* O(logN)* rounds where N is the number of nodes in the cluster. about gossip on page 11. Is it wrong to draw on page 15? does it need round 4? Thanks. Shen

Re: Question about hinted handoff

2010-07-08 Thread ChingShen
So, am I correctly? Shen On Thu, Jul 8, 2010 at 5:33 PM, Anty anty@gmail.com wrote: Sorry I am wrong .Miss the CF=one. On Thu, Jul 8, 2010 at 5:27 PM, Anty anty@gmail.com wrote: On Thu, Jul 8, 2010 at 4:11 PM, ChingShen chingshenc...@gmail.comwrote: Hi all, Please consider

Re: High CPU usage on all nodes without any read or write

2010-07-08 Thread Olivier Rosello
Hi, Thank you for your help. I don't know if data is writing too fast to the cluster, but I don't think so (nodes are heavy, big CPU, 12GB RAM...) and there is no so much data (2000 inserts/sec for about 300 KB/sec of raw data). I trashed all data yesterday 6pm (GMT+2) and launched all

Re: Question about hinted handoff

2010-07-08 Thread Anty
On Thu, Jul 8, 2010 at 4:11 PM, ChingShen chingshenc...@gmail.com wrote: Hi all, Please consider this case: (RF=1, CL=ONE) 1. I have A, B and C nodes. 2. A node is a coordinator node, it sends a request to B node to do write operation. 3. B node is down during write operation, so

Re: Question about hinted handoff

2010-07-08 Thread ChingShen
If so, when does hinted handoff work? On Thu, Jul 8, 2010 at 9:55 PM, Anty anty@gmail.com wrote: On Thu, Jul 8, 2010 at 4:11 PM, ChingShen chingshenc...@gmail.com wrote: Hi all, Please consider this case: (RF=1, CL=ONE) 1. I have A, B and C nodes. 2. A node is a coordinator

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Jonathan Ellis
CFRR does this. Is this possible? I guess I don't understand the question. :) On Thu, Jul 8, 2010 at 2:21 AM, Brent N. Chun b...@nutanix.com wrote: Hello, I'm running Cassandra 0.6.0 on a cluster and have an application that needs to read all rows from a column family using the Cassandra

Re: Some questions about the write operation and hinted handoff

2010-07-08 Thread ChingShen
Thanks Jonathan Ellis, I want to make sure that after A return failure message to client at CL.ONE, *does A write a hint to C?* If so, although the write operation is failed, but the data is still stored in C? if B comes back up, then C forwards to B? Shen On Thu, Jul 8, 2010 at 10:08 PM,

Re: Some questions about the write operation and hinted handoff

2010-07-08 Thread Benjamin Black
On Thu, Jul 8, 2010 at 9:02 AM, ChingShen chingshenc...@gmail.com wrote: Hmm.. as you mentioned that it will write a hint and report success at CL.ANY, does the hinted handoff only work at CL.ANY? Still no. Hints are written when nodes are down, regardless of CL, unless HH is disabled. CL

Re: Use of multiple Keyspaces

2010-07-08 Thread Benjamin Black
(and I'm sure someone will correct me if I am wrong on that) On Thu, Jul 8, 2010 at 11:24 AM, Benjamin Black b...@b3k.us wrote: There is a memtable per CF, regardless of how many keyspaces you have.

Re: Some questions about the write operation and hinted handoff

2010-07-08 Thread Jonathan Ellis
On Thu, Jul 8, 2010 at 1:19 PM, Benjamin Black b...@b3k.us wrote: On Thu, Jul 8, 2010 at 9:02 AM, ChingShen chingshenc...@gmail.com wrote: Hmm.. as you mentioned that it will write a hint and report success at CL.ANY, does the hinted handoff only work at CL.ANY? Still no.  Hints are written

RE: Use of multiple Keyspaces

2010-07-08 Thread Dwight Smith
Thanks - I found on Wiki that the memtables and sstables are on a per CF basis. Sorry about the mail client formatting - I have no choice - corporate controlled:) Now I am concerned about the deletions - what areas should I investigate to understand the concerns you raise? Thanks again

Re: Use of multiple Keyspaces

2010-07-08 Thread Benjamin Black
as rcoli just reminded me, i should be more clear that it is 1 _active_ memtable per CF, but there may be several pending flush. space from deletions is only reclaimed after GCGraceSeconds has elapsed AND a major compaction is run. default for the former is 10 days. the latter is not automatic.

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Hi Jonathan, The code snippet below was from the repository. I mentioned 0.6.0 specifically just to confirm that reading a CF using token-based range queries with the RandomPartitioner should (or shouldn't) also work in that version. I've seen discussions about whether range queries are now

http://scale.metaoptimize.com/

2010-07-08 Thread Ran Tavory
Just found this site and thought it might be interesting to folks on this list. http://scale.metaoptimize.com/ It's a stack-overflow style qna site, in their words: A community interested in scalability, high availability, data stores, NoSQL, distributed computing, parallel computing, cloud

Visual Tools for Cassandra

2010-07-08 Thread Torla, William
Does anybody know of any recently developed UI based tools for Cassandra? Ideally a tool capable of seeing nodes across a cluster would be preferred. The information contained in this communication may be CONFIDENTIAL and is intended only for the use of the recipient(s) named above. If you are

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Jonathan Ellis
There have been a number of bug fixes to this since 0.6.0 -- as Thomas said, it works in 0.6.3. (Although there is one related bug scheduled to be fixed in 0.6.4, https://issues.apache.org/jira/browse/CASSANDRA-1042) On Thu, Jul 8, 2010 at 2:06 PM, Brent N. Chun b...@nutanix.com wrote: Hi

Re: Visual Tools for Cassandra

2010-07-08 Thread Eben Hewitt
Suguru Namura's Web Console may have some of what you need: http://github.com/suguru/cassandra-webconsole http://github.com/suguru/cassandra-webconsoleEben On Thu, Jul 8, 2010 at 1:00 PM, Torla, William william.to...@navteq.comwrote: Does anybody know of any recently developed UI based tools

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Jonathan Ellis wrote: There have been a number of bug fixes to this since 0.6.0 -- as Thomas said, it works in 0.6.3. (Although there is one related bug scheduled to be fixed in 0.6.4, https://issues.apache.org/jira/browse/CASSANDRA-1042) Ah, this is exactly one of the cases I've been seeing!

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Thomas Heller wrote: Hey, Is this possible in 0.6.0? (Note: for the next startToken, I was just planning on computing the MD5 digest of the last key directly since I'm accessing Cassandra through Thrift.) Can't speak for 0.6.0 but it works for 0.6.3. Just implemented this in ruby

get_range_slices

2010-07-08 Thread Jonathan Shook
Should I ever expect multiples of the same key (with non-empty column sets) from the same get_range_slices call? I've verified that the column data is identical byte-for-byte, as well, including column timestamps?

Why is cassandra named cassandra?

2010-07-08 Thread ChingShen
Hi, Why is cassandra named cassandra? Thanks. Shen

Re: Some questions about the write operation and hinted handoff

2010-07-08 Thread ChingShen
On Fri, Jul 9, 2010 at 4:32 AM, Jonathan Ellis jbel...@gmail.com wrote: If the coordinator knows it can't achieve the requested CL it won't do any writes, hinted or otherwise, and will immediately report UnavailableException to the client. To summarize: hinted writes are only generated when

Re: get_range_slices

2010-07-08 Thread Mike Malone
I think the answer to your question is no, you shouldn't. I'm feeling far too lazy to do even light research on the topic, but I remember there being a bug where replicas weren't consolidated and you'd get a result set that included data from each replica that was consulted for a query. That

Re: Some questions about the write operation and hinted handoff

2010-07-08 Thread Jonathan Ellis
On Thu, Jul 8, 2010 at 10:45 PM, ChingShen chingshenc...@gmail.com wrote: Ok, If so, I suppose that A sends requests to B, C and D nodes(RF=3) at CL.QUORUM, but D is down, then return success message to the client, and A write a hint to E node? until D comes back up then E forwards the data to