Re: Is Cassandra a document based DB?

2010-03-01 Thread HHB
What are the advantages/disadvantages of Cassandra over HBase? Thanks Ran. -- View this message in context: http://n2.nabble.com/Is-Cassandra-a-document-based-DB-tp4653418p4653644.html Sent from the cassandra-user@incubator.apache.org mailing list archive at Nabble.com.

Re: Is Cassandra a document based DB?

2010-03-01 Thread Brandon Williams
On Mon, Mar 1, 2010 at 5:34 AM, HHB hubaghd...@yahoo.ca wrote: What are the advantages/disadvantages of Cassandra over HBase? Ease of setup: all nodes are the same. No single point of failure: all nodes are the same. Speed: http://www.brianfrankcooper.net/pubs/ycsb-v4.pdf Richer model:

Use cases for Cassandra

2010-03-01 Thread HHB
Hey, What are the typical use cases for Cassandra? How to know if I should use Cassandra or documents-based data bases like CouchDB? I'm working for an ISP (Internet Service Provider), do you think we can employ Cassandra? and for what? Thanks all for help and time. -- View this message in

compaction threshold

2010-03-01 Thread Sonny Heer
* We recommend adjusting the compaction threshold to 0, while the import is running. After the import, you need * to run `nodeprobe -host IP flush_binary Keyspace` on every node, as this will flush the remaining data still left * in memory to disk. Then it's recommended to adjust the

Re: compaction threshold

2010-03-01 Thread Sonny Heer
I get: Min threshold must be at least 2 On Mon, Mar 1, 2010 at 8:55 AM, Brandon Williams dri...@gmail.com wrote: On Mon, Mar 1, 2010 at 10:53 AM, Sonny Heer sonnyh...@gmail.com wrote:  * We recommend adjusting the compaction threshold to 0, while the import is running. After the import, you

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
Hello Everyone, Jonathan, Thanks for your advice :-) I have started a loadbalance operation on a busy cassandra node. The http://wiki.apache.org/cassandra/Operations web page indicates that nodetool streams can be used to monitor the status of the load balancing operation. I can't seem to

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jonathan Ellis
nodetool is the 0.6 replacement for nodeprobe. the stream info is new in that version. (0.6 beta release is linked from http://wiki.apache.org/cassandra/GettingStarted) -Jonathan On Mon, Mar 1, 2010 at 12:40 PM, Jon Graham sjclou...@gmail.com wrote: Hello Everyone, Jonathan,  Thanks for

Storage format

2010-03-01 Thread Erik Holstad
I've been looking at the source, but not quite find the things I'm looking for, so I have a few questions. Are columns for a row stored in a serialized data structure on disk or stored individually and put into a data structure when the call is being made? Because of the slice query, does that

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
Jonathan, Thanks for the quick reply. After starting a loadbalance operation for about 30 minutes, I can see 3 ColumnFamily-tmp-Data, Filter and Index files on a lightly loaded node. The Data file has a size of 2,147,483,647 (max signed int) on the node being loaded. I hope I didn't run out of

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
Thanks Jonathan. It seems like the load balance operation isn't moving. I haven't seen any data file time changes in 2 hours and no location file time changes in over an hour. I can see a tcp port # 7000 opened on the node where I ran the loadbalance command. It is connected to port 39033 on the

Re: Storage format

2010-03-01 Thread Erik Holstad
Sorry about that! Continuing: And in that case when using rows as indexes instead of columns we only need to read that specific row and might be more efficient in that case than to read a big row every time? -- Regards Erik

Re: Is Cassandra a document based DB?

2010-03-01 Thread Stu Hood
In HBase you have table:row:family:key:val:version, which some people might consider richer Cassandra is actually table:family:row:key:val[:subval], where subvals are the columns stored in a supercolumn (which can be easily arranged by timestamp to give the versioned approach). -Original

In-Memory Storage (no disk)

2010-03-01 Thread Masood Mortazavi
Hi there - Is there a setting of storage config or some other *user-level* programmatic means that would cause Cassandra not to write to disk? \ - m.

Re: Storage format

2010-03-01 Thread Jonathan Ellis
On Mon, Mar 1, 2010 at 4:06 PM, Erik Holstad erikhols...@gmail.com wrote: So that is kinda of what I want to do, but I want to go from a row with multiple columns to multiple rows with one column Right, and I'm trying to tell you that this is a bad idea unless you are worried about exhausting

Process for removing an old CF in 0.5.0

2010-03-01 Thread Anthony Molinaro
Hi, I was just wondering what the process might be for removing an old column family in 0.5.0. Can I just update the config and restart the server? Does it require stopping the entire cluster at once or can it be done in a rolling fashion? Once I update the config can I just delete all the

Re: Process for removing an old CF in 0.5.0

2010-03-01 Thread Jonathan Ellis
On Mon, Mar 1, 2010 at 4:41 PM, Anthony Molinaro antho...@alumni.caltech.edu wrote: Hi,  I was just wondering what the process might be for removing an old column family in 0.5.0. Can I just update the config and restart the server? Yes, but make sure your commitlog is flushed first (and

Re: Storage format

2010-03-01 Thread Jonathan Ellis
On Mon, Mar 1, 2010 at 4:49 PM, Erik Holstad erikhols...@gmail.com wrote: Haha! Thanks. Well I'm z little bit worried about this but since the indexes are pretty small I don't think it is going to be too bad. But was mostly thinking about performance and and having the index row as a

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jonathan Ellis
On Mon, Mar 1, 2010 at 3:18 PM, Jon Graham sjclou...@gmail.com wrote: Thanks Jonathan. It seems like the load balance operation isn't moving. I haven't seen any data file time changes in 2 hours and no location file time changes in over an hour. I can see a tcp port # 7000 opened on the

Re: Storage format

2010-03-01 Thread Erik Holstad
On Mon, Mar 1, 2010 at 2:51 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Mar 1, 2010 at 4:49 PM, Erik Holstad erikhols...@gmail.com wrote: Haha! Thanks. Well I'm z little bit worried about this but since the indexes are pretty small I don't think it is going to be too bad. But

Re: Storage format

2010-03-01 Thread Jonathan Ellis
Then you definitely want one row, range queries are slower than we'd like right now. (Ticket to fix that: https://issues.apache.org/jira/browse/CASSANDRA-821) On Mon, Mar 1, 2010 at 5:00 PM, Erik Holstad erikhols...@gmail.com wrote: On Mon, Mar 1, 2010 at 2:51 PM, Jonathan Ellis

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
Hello, I did find these exceptions. I issued the loadbalance command on node 192.168.2.10. INFO [MESSAGING-SERVICE-POOL:3] 2010-03-01 10:34:40,764 TcpConnection.java (line 315) Closing errored connection java.nio.channels.SocketChannel[connected local=/192.168.2.10:55973 remote=/

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jonathan Ellis
On Mon, Mar 1, 2010 at 5:39 PM, Jon Graham sjclou...@gmail.com wrote: Reached an EOL or something bizzare occured. Reading from: /192.168.2.13 BufferSizeRemaining: 16 This one is harmless java.io.IOException: Value too large for defined data type     at

Error with Cassandra Only Example in contrib/client_only

2010-03-01 Thread JKnight JKnight
Dear all, I tried to run ClientOnlyExample.java on contrib/client_only. But the code did not run. The error is: Exception in thread main java.lang.IndexOutOfBoundsException: Index: 0, Size: 0 at java.util.ArrayList.rangeCheck(ArrayList.java:571) at

Re: Error with Cassandra Only Example in contrib/client_only

2010-03-01 Thread JKnight JKnight
Could you give me the config file? Thanks On Mon, Mar 1, 2010 at 11:34 PM, Jonathan Ellis jbel...@gmail.com wrote: That means it doesn't know any of your other nodes. Probably you don't have it configured with a seed. On Mon, Mar 1, 2010 at 9:31 PM, JKnight JKnight beukni...@gmail.com