Re: Connection issue with trunk using .net

2010-09-20 Thread Morten Wegelbye Nissen
On 19-09-2010 23:50, Michael Greene wrote: Trunk (and 0.7) use Thrift's framed transport, so you should wrap your TSocket in a TFramedTransport. On 0.6 and earlier you should have been wrapping with a TBufferedTransport for better performance but the framed transport is inherently buffered.

timestamp parameter for Thrift insert API ??

2010-09-20 Thread Kuan(謝冠生)
To whom it may concern, While timestamp parameter is auto generated by using cassandra-cli tool, why user have to manually insert the parameter in Cassandra-Thrift insert API?? In my point of view, leaving time-stamp to be inserted by client side will lead data into chaos, since clients'

Schema question

2010-09-20 Thread Morten Wegelbye Nissen
Hello List, No matter where you read, you almost every-where read the the noSQL datascema is completely different from the relational way - and after a little insight in cassandra everyone can 2nd that. But I miss to see some real-life examples on how a real system can be modelled. Lets

Re: Schema question

2010-09-20 Thread aaron morton
Here is a discussion about implementing twitter with Cassandra http://www.rackspacecloud.com/blog/2010/05/12/cassandra-by-example/ An example of the same on github http://github.com/ericflo/twissandra If you have not done already checkout the articles page on the wiki

Re: Schema question

2010-09-20 Thread vineet daniel
Hi Morten Simplest appraoch that comes to my mind (without considering any other use -cases just read and unread messages) is to use two CF's 'read' and 'unread', put all new messages in 'unread' and once user reads any one one them shift the same to 'read' and mark original for deletion.

Re: Schema question

2010-09-20 Thread Juho Mäkinen
We have built a facebook style messenger into our web site which uses cassandra as storage backend with two column families: TalkMessages and TalkLastMessages. I've uploaded a screenshot showing the feature in action to http://img138.imageshack.us/img138/3807/talkexample.jpg TalkMessages contains

commit log question

2010-09-20 Thread Даниел Симеонов
Hello,    I have one question about the 'commit log' in Cassandra, so imagine we issue a write with QUORUM, if the write was successful then we are sure that N/2 +1 replicas have the new data. If one of these replicas fail, no state is lost because the state is also available from another machine

FatClient Gossip error and some other problems

2010-09-20 Thread shimi
I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter). I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5). My old cluster was unbalanced since I was using Random Partitioner and I bootstrapped all the nodes without specifying their tokens. Since I wanted

Building a local cluster in Linux

2010-09-20 Thread Rodrigo Hjort
Hello all, In order to build a *Cassandra cluster exclusively for availability and replication testings*, I thought of a simple solution, based on a single Linux instance, with no virtualization at all. The idea was to initialize every node, run a testing client, and manually kill some nodes

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
As a follow up to this conversation; we are still having issues with our Cassandra cluster on EC2. It *looks* to be related to Garbage Collection; however we aren't sure what the root cause of the problem is. Here is an extract from logs: INFO [GMFD:1] 2010-09-20 15:22:00,242 Gossiper.java

Re: FatClient Gossip error and some other problems

2010-09-20 Thread Gary Dusbabek
On Mon, Sep 20, 2010 at 09:51, shimi shim...@gmail.com wrote: I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter). I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5). My old cluster was unbalanced since I was using Random Partitioner and I bootstrapped

Re: Building a local cluster in Linux

2010-09-20 Thread Gary Dusbabek
On Mon, Sep 20, 2010 at 10:02, Rodrigo Hjort rodrigo.hj...@gmail.com wrote: Hello all, In order to build a Cassandra cluster exclusively for availability and replication testings, I thought of a simple solution, based on a single Linux instance, with no virtualization at all. The idea was

Re: commit log question

2010-09-20 Thread Peter Schuller
   I have one question about the 'commit log' in Cassandra, so imagine we issue a write with QUORUM, if the write was successful then we are sure that N/2 +1 replicas have the new data. If one of these replicas fail, no state is lost because the state is also available from another machine in

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Peter Schuller
Can anyone help shed any light on why this might be happening? We've tried a variety of JVM settings to alleviate this; currently with no luck. Extremely long ParNew (young generations) pause times are almost always due to swapping. Are you swapping? -- / Peter Schuller

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
Nope - no swap enabled. top - 16:53:14 up 12 days, 6:11, 3 users, load average: 1.99, 2.63, 5.03 Tasks: 133 total, 1 running, 132 sleeping, 0 stopped, 0 zombie Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 35840228k total, 33077580k used, 2762648k

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
One other question for the list: I gather GMFD is gossip stage - but what does this actually mean? Is it an issue to have 203 pending operations? Thanks Dave INFO [GC inspection] 2010-09-20 16:56:12,792 GCInspector.java (line 129) GC for ParNew: 127970 ms, 570382800 reclaimed leaving

Re: Cassandra performance

2010-09-20 Thread Edward Capriolo
On Sat, Sep 18, 2010 at 9:26 AM, Peter Schuller peter.schul...@infidyne.com wrote:  - performance (it should be not as much less than shard of MySQL and scale linearly, we want to have not more that 10K inserts per second of writes, and probably not more than 1K/s reads which will be mostly

Re: a few generic questions

2010-09-20 Thread Mario Micklisch
Hello Scott, Thank you for your help and pointing me into some directions and also for making me aware that some things are more important! :-) I did read most of the wiki and many blogs and websites about Cassandra before deciding to go with it, even did several test installations and small

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Peter Schuller
Nope - no swap enabled. Something is seriously weird, unless the system clock is broken... Given: INFO [GC inspection] 2010-09-20 15:27:42,046 GCInspector.java (line 129) GC for ParNew: 325411 ms, 84284896 reclaimed leaving 640770336 used; max is 25907560448 INFO [GC inspection] 2010-09-20

Re: Connection issue with trunk using .net

2010-09-20 Thread Michal Augustýn
Check this thread too: http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html http://www.mail-archive.com/user@cassandra.apache.org/msg06024.htmlAugi 2010/9/20 Morten Wegelbye Nissen m...@monit.dk On 19-09-2010 23:50, Michael Greene wrote: Trunk (and 0.7) use Thrift's framed

How to get a list of keys of a CF?

2010-09-20 Thread Scott Mann
Hello, I need to get a list of all the keys of a particular CF. The API doc indicates that get_key_range is gone now (I'm using 0.6.5), and that I need to use get_range_slices, which returns the list of keys and their associated rows. Is this the only way to get the list of keys? It just seems a

Re: timestamp parameter for Thrift insert API ??

2010-09-20 Thread Michal Augustýn
Consider situation when you have more Cassandra servers. It would require time synchronization in Cassandra itself. So we have to have time synchronization - on applications servers or Cassandra servers. I'm not sure if this is concern of Cassandra but it would be good for me... 2010/9/20

Re: Connection issue with trunk using .net

2010-09-20 Thread Michael Greene
That... is this thread. I'm glad the solution works for you Morten. On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn augustyn.mic...@gmail.com wrote: Check this thread too: http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html

Re: Connection issue with trunk using .net

2010-09-20 Thread Michael Greene
Sorry, got threads mixed up, but yes the solution is the same. On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene michael.gre...@gmail.comwrote: That... is this thread. I'm glad the solution works for you Morten. On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn augustyn.mic...@gmail.com

Re: Connection issue with trunk using .net

2010-09-20 Thread Michal Augustýn
Yeah, the solution is the same. I wanted just to point out the connection lag issue that Morten could face too. Augi 2010/9/20 Michael Greene michael.gre...@gmail.com Sorry, got threads mixed up, but yes the solution is the same. On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene

Re: FatClient Gossip error and some other problems

2010-09-20 Thread shimi
I was patient (although it is hard when you have millions of requests which are not served in time). I was waiting for a long time. There was nothing in the Logs and in JMX. Shimi On Mon, Sep 20, 2010 at 6:12 PM, Gary Dusbabek gdusba...@gmail.com wrote: On Mon, Sep 20, 2010 at 09:51, shimi

Re: what are ways to keep the SSTable Count down low

2010-09-20 Thread Edward Capriolo
On Mon, Sep 20, 2010 at 3:14 PM, Dathan Pattishall datha...@gmail.com wrote: How do you set the compaction threshold from storage-conf.xml? is this possible? What is the consensus on a basic Key-Value store of setting the compactionthreshold min/max from ./nodetool --host=localhost

Re: How to get a list of keys of a CF?

2010-09-20 Thread Aaron Morton
use get_range_slices with either a SliceRange start="", finish="" and count=0 OR SlicePredicate with column_names = []. This will return no columns. Not sure if the server will detect you are asking for zero columns and take a more efficient path, however it will not return any columns with the

anything on the roadmap to know the status return of executing nodetool maintenance commands

2010-09-20 Thread Dathan Pattishall
I would love the ability to run nodetool repair and get status updates on what is being done and when to what, as well as an estimated time of completion Like Grabbing data from nodes populating memtable cleanups etc. Currently this stuff happens asynchronously from the nodetool point of view

Re: what are ways to keep the SSTable Count down low

2010-09-20 Thread Peter Schuller
How do you set the compaction threshold from storage-conf.xml? is this possible? Not in 0.6 I believe, but I believe this recently went into trunk for the upcoming 0.7. What is the consensus on a basic Key-Value store of setting the compactionthreshold min/max from ./nodetool

Re: How to get a list of keys of a CF?

2010-09-20 Thread Scott Mann
Aaron, Thanks. Also for the reminder about tombstones. -- -Scott

Re: 0.7 memory usage problem

2010-09-20 Thread Alaa Zubaidi
Thanks Peter, I decreased the heap size, it did not help, however, it delayed the problem. I noticed that its swapping, so, do you think that I should set windows to Not to swap? Do you think its related to this issue? https://issues.apache.org/jira/browse/CASSANDRA-1014 Thanks, Alaa On

Re: timestamp parameter for Thrift insert API ??

2010-09-20 Thread Kuan(謝冠生)
By using cassandra-cli tool, we don't have to input timestamp while insertion. Does it mean that Cassandra have time synchronization build-in already? Since cassandra depending on time-stamp parameter very much (both read/write). The most ideal way to deal with timestamp is by cassandra itself,