Yes, it does. However there's no real answer what's the limit: it depends
on your hardware and cluster configuration.
You might even want to search the archives of this mailinglist, I remember
this has been asked before.
Cheers!
2012/5/21 Luís Ferreira zamith...@gmail.com
Hi,
Does the
In some cases Cassandra is really good and in some cases it is not.
The way I see your approach is your are recording all of your events in
single key is it? Not recommended. It can go really big also if your have
cluster of servers, It will hit only one server all the time make it
overwhelm, and
I also dont understand if all these nodes are replicas of each other why is
that the first node has almost double the data.
Have you performed any token moves ? Old data is not deleted unless you run
nodetool cleanup.
Another possibility is things like a lot of hints. Admittedly it would have
Continuous computation is the sort of thing Storm
(https://github.com/nathanmarz/storm) can help with.
And good news everybody, storing the output from Storm is the sort of thing
Cassandra can help with http://www.youtube.com/watch?v=cF8a_FZwULI
Cheers
-
Aaron Morton
It repairs the ranges they have in common.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 20/05/2012, at 4:05 PM, Raj N wrote:
Can I infer from this that if I have 3 replicas, then running repair without
-pr won 1 node will repair the
On 05/22/2012 12:45 AM, Tamar Fraenkel wrote:
Thanks for the response. But it still does not work.
I am running the script from a git bash on my windows 7.
adding some debug prints, this is what I am
running
With
heap size = 4 gigs
I would check for GC activity in the logs and consider setting it to 8 given
you have 16 GB. You can also check if the IO system is saturated
(http://spyced.blogspot.co.nz/2010/01/linux-performance-basics.html) Also take
a look at nodetool cfhistogram perhaps to see
In general read queries run on multiple nodes. But each node computes the
complete result to the query.
There is no support for aggregate queries.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 20/05/2012, at 6:49 PM, Majid Azimi
CASSANDRA-3712 has not been applied to 0.8.X.
If I understand the problem correctly the issue is 0.8.10.
You may be able to avoid the race condition by:
1) Isolating the node from the cluster to stop write activity. You can either
start the node with the -Dcassandra.join_ring=false JVM
The first part of the name is the current system time in milliseconds.
If you run it twice do you get log messages about failing to create the same
directory twice ?
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 21/05/2012, at 5:09 AM,
kinds of like https://issues.apache.org/jira/browse/CASSANDRA-3733 but maybe
different.
Have you recently dropped as CF ? it looks like the hints CF is only compacted
if hints are replayed. If they are dropped because the CF no longer exists
compaction is not forced (
not sure what you mean by
And after restarting the second one I have lost all the consistency of
my data. All my statistics since September are totally false now in
production
Can you give some examples?
Counter are not idempotent so if the client app retries TimedOut requests you
can get an
4096 is also the internal hard coded default for commitlog_total_space_in_mb
If you are seeing more that 4GB of commit log files let us know.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 22/05/2012, at 6:35 AM, Bryce Godfrey wrote:
Hi,
I use Cassandra 0.8.5 and am suddenly noticing some strange behavior. I run
a create column family command with some column meta-data and it runs
fine, but when I do describe keyspace, it shows me different column names
for those index columns.
a) Here is what I run:
create column family
I would look into the problems you are having with GC...
When ParNew runs the jvm pauses
https://blogs.oracle.com/jonthecollector/entry/our_collectors . If it's pausing
for 4 seconds it's not processing queries.
Then check the throughput on the san and the steal on the VM's.
Check to see
It's more the number of CF's than keyspaces.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 22/05/2012, at 6:58 PM, R. Verlangen wrote:
Yes, it does. However there's no real answer what's the limit: it depends on
your hardware and
Hmm, you got me on that. I assumed (~ wrong) that more keyspaces would mean
more CF's.
2012/5/22 aaron morton aa...@thelastpickle.com
It's more the number of CF's than keyspaces.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On
On Tue, May 22, 2012 at 9:19 PM, aaron morton aa...@thelastpickle.comwrote:
It's more the number of CF's than keyspaces.
Oh - does increasing the number of Column Families affect performance ?
The design we are working on at the moment is considering using a Column
Family per year. We were
Not ideally, now cass has global memtable tuning. Each cf correspond to
memory in ram. Year wise cf means it will be in read only state for next
year, memtable will still consume ram.
On 22-May-2012 5:01 PM, Franc Carter franc.car...@sirca.org.au wrote:
On Tue, May 22, 2012 at 9:19 PM, aaron
Host not found in client.
On 22-May-2012 4:34 PM, Abhijit Chanda abhijit.chan...@gmail.com wrote:
Hi All,
Can any one suggest me why i am getting this error in Astyanax
NoAvailableHostsException: [host=None(0.0.0.0):0, latency=0(0),
attempts=0] No hosts to borrow from
Thanks In Advance
Thanks for all the answers, they definitely helped.
Just out of curiosity, is there any underlying architectural reason why it's
not possible to order a row based on its counters values? or is it something
that might be in the roadmap in the future?
--
Filippo Diotalevi
On Tuesday, 22
Hi,
I've had my suspicions some months, but I think I am sure about it.
Data is being written by the SSTableSimpleUnsortedWriter and loaded by the
sstableloader.
The data should be alive for 31 days, so I use the following logic:
int ttl = 2678400;
long timestamp = System.currentTimeMillis() *
Secondary index is not supported for counters plus you must know column
name to support secondary index on regular column.
On 22-May-2012 5:34 PM, Filippo Diotalevi fili...@ntoklo.com wrote:
Thanks for all the answers, they definitely helped.
Just out of curiosity, is there any underlying
Samal,
But I am setting up the Host.
On Tue, May 22, 2012 at 5:30 PM, samal samalgo...@gmail.com wrote:
Host not found in client.
On 22-May-2012 4:34 PM, Abhijit Chanda abhijit.chan...@gmail.com
wrote:
Hi All,
Can any one suggest me why i am getting this error in Astyanax
Data will remain till next compaction but won't be available. Compaction
will delete old sstable create new one.
On 22-May-2012 5:47 PM, Pieter Callewaert pieter.callewa...@be-mobile.be
wrote:
Hi,
** **
I’ve had my suspicions some months, but I think I am sure about it.
Data is
Hi Samal,
Thanks for your time looking into this.
I force the compaction by using forceUserDefinedCompaction on only that
particular sstable. This gurantees me the new sstable being written only
contains the data from the old sstable.
The data in the sstable is more than 31 days old and
Are you able to connect through cli?
Can you share your client code?
On 22-May-2012 5:59 PM, Abhijit Chanda abhijit.chan...@gmail.com wrote:
Samal,
But I am setting up the Host.
On Tue, May 22, 2012 at 5:30 PM, samal samalgo...@gmail.com wrote:
Host not found in client.
On 22-May-2012
not sure what you mean by
And after restarting the second one I have lost all the consistency of
my data. All my statistics since September are totally false now in
production
Can you give some examples?
After restarting my 2 nodes (one after the other), All my counters
have become wrong. The
Change your comparator to utf8type.
On 22-May-2012 4:32 PM, Roshan Dawrani roshandawr...@gmail.com wrote:
Hi,
I use Cassandra 0.8.5 and am suddenly noticing some strange behavior. I
run a create column family command with some column meta-data and it runs
fine, but when I do describe
Data will not be deleted when those keys appear in other stables outside of
compaction. This is to prevent obsolete data from appearing again.
yuki
On Tuesday, May 22, 2012 at 7:37 AM, Pieter Callewaert wrote:
Hi Samal,
Thanks for your time looking into this.
Additionally, it will always take at least two compaction passes to
purge an expired column: one to turn it into a tombstone, and a second
(after gcgs) to remove it.
On Tue, May 22, 2012 at 9:21 AM, Yuki Morishita mor.y...@gmail.com wrote:
Data will not be deleted when those keys appear in other
Can you please let me know why? Because I have created very similar column
familes many times with comparator = BytesType, and never run into this
issue before.
Here is an example:
ColumnFamily: Client
Key Validation Class:
Congrats!
On Tue, May 22, 2012 at 10:43 AM, Jonathan Ellis jbel...@gmail.com wrote:
Thanks to both of you for your help!
--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com
What's the correct way to set the strategy options for the
networktopologystrategy with cqlsh?
I've tried several variations, but what's expected way to escape the hyphen in
us-east ?
Thanks,
-jeff
CREATE KEYSPACE something
... WITH strategy_class = 'NetworkTopologyStrategy'
... AND
AND strategy_options={us-east:1, us-west:1};
On Tue, May 22, 2012 at 11:10 AM, Damick, Jeffrey
jeffrey.dam...@neustar.biz wrote:
What’s the correct way to set the strategy options for the
networktopologystrategy with cqlsh?
I’ve tried several variations, but what’s expected way to escape
Thanks, but that would be for the cli, not cqlsh
CREATE KEYSPACE something
... WITH strategy_class = 'NetworkTopologyStrategy' AND
strategy_options={us-east:1};
Invalid syntax at line 2, char 72
WITH strategy_class = 'NetworkTopologyStrategy' AND
On Tue, May 22, 2012 at 3:00 AM, aaron morton aa...@thelastpickle.com wrote:
1) Isolating the node from the cluster to stop write activity. You can
either start the node with the -Dcassandra.join_ring=false JVM option or
use nodetool disablethrift and disablegossip to stop writes. Note that
Thanks for all the work and congratulations to both of you.
--
Sylvain
On Tue, May 22, 2012 at 5:06 PM, Edward Capriolo edlinuxg...@gmail.com wrote:
Congrats!
On Tue, May 22, 2012 at 10:43 AM, Jonathan Ellis jbel...@gmail.com wrote:
Thanks to both of you for your help!
--
Jonathan Ellis
could somebody clue me in to the cause of this exception? i see these
randomly.
AnalyzerService-2 2012-05-22 13:28:00,385 :: WARN
cassandra.connection.HConnectionManager - Exception:
me.prettyprint.hector.api.exceptions.HectorTransportException:
This is normally the result of not having the client properly set up to talk to
Cassandra. Can you send a code snippet of how you are initializing Astyanax?
- Eran
From: Abhijit Chanda
abhijit.chan...@gmail.commailto:abhijit.chan...@gmail.com
Reply-To:
The nodes appear to be holding steady at the 8G that I set it to in the config
file now. I'll keep an eye on them.
From: aaron morton [mailto:aa...@thelastpickle.com]
Sent: Tuesday, May 22, 2012 4:08 AM
To: user@cassandra.apache.org
Subject: Re: 1.1 not removing commit log files?
4096 is also
Correction: the first compaction after expiration + gcgs can remove
it, even if it hasn't been turned into a tombstone previously.
On Tue, May 22, 2012 at 9:37 AM, Jonathan Ellis jbel...@gmail.com wrote:
Additionally, it will always take at least two compaction passes to
purge an expired
Hi,
We are setting up a 6-node cassandra cluster within one data center. 3 in
rack1 and the other 3 in rack2. The tokens are assigned alternating
between rack 1 and rack 2. There is one seed node in each rack. Below is
the ring:
r1-node1DC1 r1 0 (seed)
r2-node1DC1
1 KS with 24 CF's will use roughly the same resources as 24 KS's with 1 CF.
Each CF:
* loads the bloom filter for each SSTable
* samples the index for each sstable
* uses row and key cache
* has a current memtable and potentially memtables waiting to flush.
* had secondary index CF's
I would
for what it's worth i've been having pretty good success using the
Datastax AMIs.
On 5/17/2012 6:59 PM, koji Lin wrote:
Hi
We use amazon ami 3.2.12-3.2.4.amzn1.x86_64
and some of our data file are more than 10G
thanks
koji
2012-5-16 下午6:00 於 aaron morton aa...@thelastpickle.com
Looks like this: https://issues.apache.org/jira/browse/CASSANDRA-4269
On Tue, May 22, 2012 at 4:10 PM, Yiming Sun yiming@gmail.com wrote:
Hi,
We are setting up a 6-node cassandra cluster within one data center. 3 in
rack1 and the other 3 in rack2. The tokens are assigned alternating
It indeed looks almost the same, except in our case, we are only using
UTF8Type. Hopefully when they release 1.1.1, all will be fixed. Thanks
for making me aware of this issue, Tyler.
-- Y.
On Tue, May 22, 2012 at 7:28 PM, Tyler Hobbs ty...@datastax.com wrote:
Looks like this:
Hi
I think amazon ami is based on RHEL.
thank you
2012/5/21 aaron morton aa...@thelastpickle.com
Are you using the Ubuntu operating system ?
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 18/05/2012, at 1:59 PM, koji Lin
Hi
Thanks for your information, we will try that.
koji
2012/5/23 Deno Vichas d...@syncopated.net
for what it's worth i've been having pretty good success using the
Datastax AMIs.
On 5/17/2012 6:59 PM, koji Lin wrote:
Hi
We use amazon ami 3.2.12-3.2.4.amzn1.x86_64
and some of our
Hi Aaron, Rob,
Thanks for the information, I will try it.
Regards,
Boris
On Tue, May 22, 2012 at 11:47 PM, Rob Coli rc...@palominodb.com wrote:
On Tue, May 22, 2012 at 3:00 AM, aaron morton aa...@thelastpickle.com
wrote:
1) Isolating the node from the cluster to stop write activity. You
On Wed, May 23, 2012 at 7:42 AM, aaron morton aa...@thelastpickle.comwrote:
1 KS with 24 CF's will use roughly the same resources as 24 KS's with 1
CF. Each CF:
* loads the bloom filter for each SSTable
* samples the index for each sstable
* uses row and key cache
* has a current memtable
Hi all,
I am a bit confused regarding the terms replica and
replication factor. Assume that I am using RandomPartitioner and
NetworkTopologyStrategy for replica placement.
From what I understand, with a RandomPartitioner, a row key will
always be hashed and be stored on the node that
I an not able to reproduce this in cli.
On 22-May-2012 8:12 PM, Roshan Dawrani roshandawr...@gmail.com wrote:
Can you please let me know why? Because I have created very similar column
familes many times with comparator = BytesType, and never run into this
issue before.
Here is an example:
Thanks I didn't knew two stage removal process.
On 23-May-2012 2:20 AM, Jonathan Ellis jbel...@gmail.com wrote:
Correction: the first compaction after expiration + gcgs can remove
it, even if it hasn't been turned into a tombstone previously.
On Tue, May 22, 2012 at 9:37 AM, Jonathan Ellis
Hello!
I'm about to schedule backups in the following way
a) snapshots are done daily
b) increment backups are enabled
so, backup will be consistent, very old snapshots must be removed (I guess,
a week depth should be enough).
couple of questions:
1) is there any good guide for scheduling
Did you upgrade DataStax AMIs? Did you add a node to an existing ring?
Thanks
*Tamar Fraenkel *
Senior Software Engineer, TOK Media
[image: Inline image 1]
ta...@tok-media.com
Tel: +972 2 6409736
Mob: +972 54 8356490
Fax: +972 2 5612956
On Wed, May 23, 2012 at 2:00 AM, Deno Vichas
56 matches
Mail list logo