On 19-09-2010 23:50, Michael Greene wrote:
Trunk (and 0.7) use Thrift's framed transport, so you should wrap your TSocket
in a TFramedTransport. On 0.6 and earlier you should have been wrapping with a
TBufferedTransport for better performance but the framed transport is
inherently buffered.
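A note on why the transport mismatch matters: a framed server expects every message to arrive with a length prefix, so an unframed client will not be understood. The sketch below only illustrates the framing idea in plain Python, assuming the usual Thrift framed layout of a 4-byte big-endian length followed by the payload. The helper names `frame` and `unframe` are made up for this illustration and are not part of the Thrift library.

```python
import struct


def frame(payload: bytes) -> bytes:
    """Prefix the payload with its length as a 4-byte big-endian integer."""
    return struct.pack("!i", len(payload)) + payload


def unframe(buffer: bytes):
    """Split the first complete frame off the buffer.

    Returns (payload, remaining_bytes), or (None, buffer) when the buffer
    does not yet hold a complete frame.
    """
    if len(buffer) < 4:
        return None, buffer
    (length,) = struct.unpack("!i", buffer[:4])
    if len(buffer) < 4 + length:
        return None, buffer
    return buffer[4:4 + length], buffer[4 + length:]


if __name__ == "__main__":
    wire = frame(b"abc") + b"xy"
    print(unframe(wire))
```

With the Thrift Python bindings the wrapping itself is usually a one-liner along the lines of TTransport.TFramedTransport(TSocket.TSocket(host, port)); check the bindings you have installed before relying on that.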
To whom it may concern,
While the timestamp parameter is generated automatically by the cassandra-cli
tool, why does the user have to supply it manually in the Cassandra Thrift
insert API? In my view, leaving the timestamp to be set on the client side
will lead data into chaos, since clients'
Hello List,
No matter where you read, almost everywhere you read that the NoSQL
data schema is completely different from the relational way - and after a
little insight into Cassandra everyone can second that.
But I have yet to see real-life examples of how a real system can be
modelled. Lets
Here is a discussion about implementing twitter with Cassandra
http://www.rackspacecloud.com/blog/2010/05/12/cassandra-by-example/
An example of the same on github
http://github.com/ericflo/twissandra
If you have not done so already, check out the articles page on the wiki
Hi Morten
The simplest approach that comes to my mind (considering only read and unread
messages, no other use cases) is to use two CFs, 'read' and 'unread': put
all new messages in 'unread', and once the user reads one of them, move it
to 'read' and mark the original for deletion.
We have built a Facebook-style messenger into our web site which
uses Cassandra as its storage backend with two column families:
TalkMessages and TalkLastMessages. I've uploaded a screenshot showing
the feature in action to
http://img138.imageshack.us/img138/3807/talkexample.jpg
TalkMessages contains
Hello,
I have one question about the 'commit log' in Cassandra. Imagine
we issue a write with QUORUM; if the write was successful, then we are
sure that N/2 + 1 replicas have the new data. If one of these replicas
fails, no state is lost because the state is also available from
another machine
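For the N/2 + 1 arithmetic in the question above, a small sketch may help. It assumes N is the replication factor and that the division is integer division; the function names are only illustrative.

```python
def quorum(replication_factor: int) -> int:
    """Number of replicas that must acknowledge a QUORUM operation."""
    return replication_factor // 2 + 1


def tolerated_failures(replication_factor: int) -> int:
    """Replicas that can be down while QUORUM can still be reached."""
    return replication_factor - quorum(replication_factor)


if __name__ == "__main__":
    for rf in (1, 2, 3, 5):
        print(rf, quorum(rf), tolerated_failures(rf))
```

With a replication factor of 3, a QUORUM write reaches at least 2 replicas, so losing one of them still leaves a copy on another machine.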
I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter).
I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5).
My old cluster was unbalanced since I was using Random Partitioner and I
bootstrapped all the nodes without specifying their tokens.
Since I wanted
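On balancing a Random Partitioner ring: the common approach is to space the initial tokens evenly over the token range, which for RandomPartitioner runs from 0 to 2**127. The sketch below computes such tokens under that assumption. It treats the ring as a single group of nodes and does not address how to interleave nodes across two datacenters; `balanced_tokens` is a made-up helper name.

```python
RING_SIZE = 2 ** 127  # RandomPartitioner token space, assumed to be 0 .. 2**127


def balanced_tokens(node_count: int) -> list:
    """Return evenly spaced initial tokens, one per node."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    return [i * RING_SIZE // node_count for i in range(node_count)]


if __name__ == "__main__":
    for index, token in enumerate(balanced_tokens(6)):
        print(index, token)
```

Each value would go into the node's initial token setting before the node is bootstrapped, instead of letting the node pick a token itself.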
Hello all,
In order to build a *Cassandra cluster exclusively for availability and
replication testing*, I thought of a simple solution, based on a single
Linux instance, with no virtualization at all.
The idea was to initialize every node, run a testing client, and manually
kill some nodes
As a follow-up to this conversation, we are still having issues with our
Cassandra cluster on EC2.
It *looks* to be related to garbage collection; however, we aren't sure what
the root cause of the problem is. Here is an extract from the logs:
INFO [GMFD:1] 2010-09-20 15:22:00,242 Gossiper.java
On Mon, Sep 20, 2010 at 09:51, shimi shim...@gmail.com wrote:
I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter).
I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5).
My old cluster was unbalanced since I was using Random Partitioner and I
bootstrapped
On Mon, Sep 20, 2010 at 10:02, Rodrigo Hjort rodrigo.hj...@gmail.com wrote:
Hello all,
In order to build a Cassandra cluster exclusively for availability and
replication testings, I thought of a simple solution, based on a single
Linux instance, with no virtualization at all.
The idea was
I have one question about the 'commit log' in Cassandra, so imagine
we issue a write with QUORUM, if the write was successful then we are
sure that N/2 +1 replicas have the new data. If one of these replicas
fail, no state is lost because the state is also available from
another machine in
Can anyone help shed any light on why this might be happening? We've tried a
variety of JVM settings to alleviate this; currently with no luck.
Extremely long ParNew (young generation) pause times are almost
always due to swapping. Are you swapping?
--
/ Peter Schuller
Nope - no swap enabled.
top - 16:53:14 up 12 days, 6:11, 3 users, load average: 1.99, 2.63, 5.03
Tasks: 133 total, 1 running, 132 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 35840228k total, 33077580k used, 2762648k
One other question for the list:
I gather GMFD is gossip stage - but what does this actually mean? Is it an
issue to have 203 pending operations?
Thanks
Dave
INFO [GC inspection] 2010-09-20 16:56:12,792 GCInspector.java (line 129) GC
for ParNew: 127970 ms, 570382800 reclaimed leaving
On Sat, Sep 18, 2010 at 9:26 AM, Peter Schuller
peter.schul...@infidyne.com wrote:
- performance (it should be not much less than a shard of MySQL and
scale linearly; we want to have not more than 10K inserts per second
of writes, and probably not more than 1K/s reads which will be mostly
Hello Scott,
Thank you for your help, for pointing me in some directions, and also for
making me aware that some things are more important! :-)
I did read most of the wiki and many blogs and websites about Cassandra
before deciding to go with it, even did several test installations and small
Nope - no swap enabled.
Something is seriously weird, unless the system clock is broken... Given:
INFO [GC inspection] 2010-09-20 15:27:42,046 GCInspector.java (line
129) GC for ParNew: 325411 ms, 84284896 reclaimed leaving 640770336
used; max is 25907560448
INFO [GC inspection] 2010-09-20
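To see how odd these numbers are, it can help to pull the fields out of the GCInspector lines. The sketch below parses the message format exactly as it appears in the log extract above (joined onto one line); the regular expression and the `GcEvent` type are written for this illustration and are not part of Cassandra.

```python
import re
from typing import NamedTuple, Optional

# Matches the GCInspector message as quoted in this thread.
GC_LINE = re.compile(
    r"GC for (?P<collector>\w+): (?P<pause_ms>\d+) ms, "
    r"(?P<reclaimed>\d+) reclaimed leaving (?P<used>\d+) used; "
    r"max is (?P<max_heap>\d+)"
)


class GcEvent(NamedTuple):
    collector: str
    pause_ms: int
    reclaimed: int
    used: int
    max_heap: int

    def heap_fraction(self) -> float:
        """Fraction of the maximum heap still in use after the collection."""
        return self.used / self.max_heap


def parse_gc_line(line: str) -> Optional[GcEvent]:
    """Return a GcEvent for a GCInspector line, or None if it does not match."""
    match = GC_LINE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    return GcEvent(
        collector=fields["collector"],
        pause_ms=int(fields["pause_ms"]),
        reclaimed=int(fields["reclaimed"]),
        used=int(fields["used"]),
        max_heap=int(fields["max_heap"]),
    )


if __name__ == "__main__":
    sample = (
        "INFO [GC inspection] 2010-09-20 15:27:42,046 GCInspector.java (line 129) "
        "GC for ParNew: 325411 ms, 84284896 reclaimed leaving 640770336 used; "
        "max is 25907560448"
    )
    event = parse_gc_line(sample)
    print(event.pause_ms / 1000.0, "seconds;", event.heap_fraction(), "of max heap in use")
```

For the quoted line this gives a young-generation pause of more than five minutes on a heap that is only a few percent full, which is why the figures look suspicious.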
Check this thread too:
http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html
Augi
2010/9/20 Morten Wegelbye Nissen m...@monit.dk
On 19-09-2010 23:50, Michael Greene wrote:
Trunk (and 0.7) use Thrift's framed
Hello,
I need to get a list of all the keys of a particular CF. The API doc
indicates that get_key_range is gone now (I'm using 0.6.5), and that
I need to use get_range_slices, which returns the list of keys and
their associated rows.
Is this the only way to get the list of keys? It just seems a
Consider the situation when you have several Cassandra servers. It would
require time synchronization in Cassandra itself. So we need time
synchronization either way - on the application servers or on the Cassandra
servers. I'm not sure whether this is Cassandra's concern, but it would be
good for me...
2010/9/20
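To make the timestamp discussion concrete: the client attaches a timestamp to each write, and when two versions of a column conflict, the one with the higher timestamp wins. The sketch below illustrates that idea only. The microsecond unit is a common client convention rather than something the API enforces, and the tie-breaking here is arbitrary; both function names are made up for this example.

```python
import time


def client_timestamp_micros() -> int:
    """Current wall-clock time in microseconds, as a client might supply it."""
    return int(time.time() * 1_000_000)


def resolve(version_a: tuple, version_b: tuple) -> tuple:
    """Pick the winning (timestamp, value) pair: the higher timestamp wins.

    On equal timestamps this sketch simply keeps version_a.
    """
    return version_a if version_a[0] >= version_b[0] else version_b


if __name__ == "__main__":
    older = (client_timestamp_micros(), "first value")
    newer = (older[0] + 1, "second value")
    print(resolve(older, newer))
```

This is why clock skew between writers matters: a client whose clock runs behind can have its later write lose to an earlier one from another client.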
That... is this thread. I'm glad the solution works for you Morten.
On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn augustyn.mic...@gmail.com
wrote:
Check this thread too:
http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html
Sorry, got threads mixed up, but yes the solution is the same.
On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene
michael.gre...@gmail.com wrote:
That... is this thread. I'm glad the solution works for you Morten.
On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn
augustyn.mic...@gmail.com
Yeah, the solution is the same. I just wanted to point out the connection
lag issue that Morten could face too.
Augi
2010/9/20 Michael Greene michael.gre...@gmail.com
Sorry, got threads mixed up, but yes the solution is the same.
On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene
I was patient (although it is hard when you have millions of requests which
are not served in time). I waited for a long time. There was nothing in
the logs or in JMX.
Shimi
On Mon, Sep 20, 2010 at 6:12 PM, Gary Dusbabek gdusba...@gmail.com wrote:
On Mon, Sep 20, 2010 at 09:51, shimi
On Mon, Sep 20, 2010 at 3:14 PM, Dathan Pattishall datha...@gmail.com wrote:
How do you set the compaction threshold from storage-conf.xml? is this
possible?
What is the consensus on a basic Key-Value store of setting the
compactionthreshold min/max from
./nodetool --host=localhost
Use get_range_slices with either a SlicePredicate whose SliceRange has start="", finish="" and count=0, or a SlicePredicate with column_names = []. This will return no columns. I'm not sure whether the server detects that you are asking for zero columns and takes a more efficient path; however, it will not return any columns with the
I would love the ability to run nodetool repair and get status updates
on what is being done and when to what, as well as an estimated time
of completion
Like
Grabbing data from nodes
populating memtable
cleanups
etc.
Currently this stuff happens asynchronously from the nodetool point of
view
How do you set the compaction threshold from storage-conf.xml? is this
possible?
Not in 0.6, I believe, but I think this recently went into trunk for
the upcoming 0.7.
What is the consensus on a basic Key-Value store of setting the
compactionthreshold min/max from
./nodetool
Aaron,
Thanks. Also for the reminder about tombstones.
--
-Scott
Thanks Peter,
I decreased the heap size; it did not help, although it delayed the problem.
I noticed that it's swapping, so do you think that I should set Windows
not to swap?
Do you think it's related to this issue?
https://issues.apache.org/jira/browse/CASSANDRA-1014
Thanks,
Alaa
On
When using the cassandra-cli tool, we don't have to supply a timestamp on insertion.
Does that mean that Cassandra already has time synchronization built in?
Cassandra depends on the timestamp parameter very much (for both reads and writes).
The ideal way to deal with the timestamp would be for Cassandra to handle it itself,