Are you sure about those odds? Winning the UK national lottery has a chance of
13 983 816 to 1 so for just 2 days the odds are
13 983 816^2 = 1.9554711 x 10^14
Brendan Poole
Systems Developer
NewLaw Solicitors
Helmont House
Churchill Way
Cardiff
Thank you for the links, I did read a bit in the comments of the ticket,
but I couldn't get much out of it.
I am mainly interested in how the index is stored and partitioned, not how
it is used. I think the people in the dev list will probably be better
qualified to answer that. My questions
Hi All,
I'm sure people here have tried to solve similar questions.
Say I'm tracking pages, I want to access the least recently used 1000 unique
pages (i.e. columnnames). How can I achieve this?
Using a row with say, ttl=60 seconds would solve the problem of accessing
the least recently used
Alexander:
The secondary indexes in 0.7.0 (type KEYS) are stored internally in a column
family, and are kept synchronized with the base data via locking on a local
node, meaning they are always consistent on the local node. Eventual
consistency still applies between nodes, but a returned result
Until the release vote passes at mojo, you will need to do the
following to follow the example:
svn co https://svn.codehaus.org/mojo/trunk/sandbox/cassandra-maven-plugin
cd cassandra-maven-plugin
mvn install
cd ..
Otherwise the example should be fine.
It's a wiki page, so I'm hoping that people
Thank you very much, this is the information I was looking for. I started
adding secondary index functionality to Cassandra myself, and it turns out
I am doing almost exactly the same thing. I will try to change my code to
use your implementation as well to compare results.
Alexander
Alexander:
Yes i have done a mistake I know ! But I hoped nobody would notice :).
It is the odds of winning 3 days in a row (standard probability fail). Still
it is totally unlikely
Sorry about this mistake,
Best regards,
Victor K.
Aaron,
It looks like you're experiencing a side-effect of CASSANDRA-2083.
There was at least one place (when node B received updated schema from
node A) where gossip was not being updated with the correct schema
even though DatabaseDescriptor had the right version. I'm pretty sure
this is what
One more question: does each node keep an index of their own values, or is
the index global?
Alexander
Thank you very much, this is the information I was looking for. I started
adding secondary index functionality to Cassandra myself, and it turns out
I am doing almost exactly the same thing.
unsubscribe
unsubscribe
Hi,
There is already an email thread on memory issue on this email list, but I
creating a new thread as we are experiencing a different memory consumption
issue.
We are 12-server cluster. We use random partitioner with manually generated
server tokens. Memory usage on one server keeps growing
Iterating through all of the rows matching an index clause on your
cluster is guaranteed to touch N/RF of the nodes in your cluster,
because each node only knows about data that is indexed locally.
On Wed, Feb 9, 2011 at 9:13 AM, alta...@ceid.upatras.gr wrote:
One more question: does each node
On 02/09/2011 11:15 AM, Huy Le wrote:
There is already an email thread on memory issue on this email list, but I
creating a new thread as we are experiencing a different memory consumption
issue.
We are 12-server cluster. We use random partitioner with manually generated
server tokens.
oh you might have to check out and install mojo-sandbox-parent (a sibling
svn url) sandbox projects are not allowed to deploy releases... the vote on
dev@mojo will promote from sandbox and release in one vote 32 h to go
- Stephen
---
Sent from my Android phone, so random spelling mistakes,
We are 12-server cluster. We use random partitioner with manually generated
server tokens. Memory usage on one server keeps growing out of control. We
ran flush and cleared key and row caches but and ran GC but heap memory
usage won't go down. The only way to heap memory usage to go down
(If you're looking at e.g. jconsole graphs a screenshot of the graph
would not hurt.)
--
/ Peter Schuller
Is there any way to specify on per query basis(like we specify the
Consistency level), what rows be cached while you're reading them,
from a row_cache enabled CF. I believe, this could lead to much more
efficient use of the cache space!!( if you use same data for different
features/ parts in your
On Tue, Feb 8, 2011 at 2:03 AM, David Boxenhorn da...@lookin2.com wrote:
Shaun, I agree with you, but marking them as deprecated is not good enough
for me. I can't easily stop using supercolumns. I need an upgrade path.
David,
Cassandra is open source and community developed. The right thing
If the heap usages continues to grow an OOM will eventually be thrown.
Are you experiencing OOMs on these boxes? If you are not OOMing, then
what problem are you experiencing (excessive CPU use garbage collection
for one example)?
No OOM. The JVM just too busy doing GC when the used heap
I still think super-columns are useful you just need to be aware of
the limitations...
Bye,
Norman
2011/2/9 Mike Malone m...@simplegeo.com:
On Tue, Feb 8, 2011 at 2:03 AM, David Boxenhorn da...@lookin2.com wrote:
Shaun, I agree with you, but marking them as deprecated is not good enough
for
To be clear: You are not talking about the size of the Java process in
top, but the actual amount of heap used as reported by the JVM via
jmx/jconsole/etc?
This is memory usage shows in JMX that we are talking about.
Is the memory amount of memory that you consider high, the heap size
help update column family?
On Wed, Feb 9, 2011 at 1:15 PM, Eranda Sooriyabandara 0704...@gmail.com wrote:
Hi Vishan, Aron and all,
Thanks for the help. I tried it and successfully worked for me.
But I could not find a place where mention about the attributes of some
commands.
e.g.
update
Currently there is not.
On Wed, Feb 9, 2011 at 12:04 PM, Ertio Lew ertio...@gmail.com wrote:
Is there any way to specify on per query basis(like we specify the
Consistency level), what rows be cached while you're reading them,
from a row_cache enabled CF. I believe, this could lead to much
On Wed, Feb 9, 2011 at 11:04 AM, Huy Le hu...@springpartners.com wrote:
Memory usage grows overtime.
It is relatively typical for caches to exert memory pressure over time
as they fill. What are your cache settings, for how many
columnfamilies, and with what sized memtables? What version of
Is this under consideration for future releases ? or being thought about!?
On Thu, Feb 10, 2011 at 12:56 AM, Jonathan Ellis jbel...@gmail.com wrote:
Currently there is not.
On Wed, Feb 9, 2011 at 12:04 PM, Ertio Lew ertio...@gmail.com wrote:
Is there any way to specify on per query
Not really, no. If you can't trust LRU to cache the hottest rows
perhaps you should split the data into different ColumnFamilies.
On Wed, Feb 9, 2011 at 1:43 PM, Ertio Lew ertio...@gmail.com wrote:
Is this under consideration for future releases ? or being thought about!?
On Thu, Feb 10,
On Wed, Feb 9, 2011 at 2:43 PM, Ertio Lew ertio...@gmail.com wrote:
Is this under consideration for future releases ? or being thought about!?
On Thu, Feb 10, 2011 at 12:56 AM, Jonathan Ellis jbel...@gmail.com wrote:
Currently there is not.
On Wed, Feb 9, 2011 at 12:04 PM, Ertio Lew
I have a 4 node test cluster were I test the port to 0.7.0 from 0.6.X
On 3 out of the 4 nodes I get exceptions in the log.
I am using RP.
Changes that I did:
1. changed the replication factor from 3 to 4
2. configured the nodes to use Dynamic Snitch
3. RR of 0.33
I run repair on 2 nodes before I
Jonathan, what if the data is really homogeneous, but over a long period of
time. I decided that the users who hit the database for recent past should
have a better ride. Splitting into a separate CF also has costs, right?
In fact, if I were to go this way, do you think I can crank down the key
On Fri, Feb 4, 2011 at 11:13 AM, Dan Kuebrich dan.kuebr...@gmail.com wrote:
Is 2 seconds the normal I went to disk latency for cassandra?
Cassandra exposes metrics on a per-CF basis which indicate latency.
This includes both cache hits and misses, as well as requests for rows
which do not exist.
What's the easiest way to change the port nodes listen for comm on from other
nodes? It appears that the default is 8080 which collides with my tomcat server
on one of our dev boxes. I tried doing something in cassandra.yaml like
listen_address: 192.1.fake.2:
but that doesn't work it
On 02/09/2011 04:00 PM, jeremy.truel...@barclayscapital.com wrote:
What's the easiest way to change the port nodes listen for comm on
from other nodes? It appears that the default is 8080 which collides
with my tomcat server on one of our dev boxes. I tried doing
something in cassandra.yaml
Thanks for the heads up that worked.
-Original Message-
From: Chris Burroughs [mailto:chris.burrou...@gmail.com]
Sent: Wednesday, February 09, 2011 4:04 PM
To: user@cassandra.apache.org
Cc: Truelove, Jeremy: IT (NYK)
Subject: Re: Default Listen Port
On 02/09/2011 04:00 PM,
On Wed, Feb 9, 2011 at 4:00 PM, jeremy.truel...@barclayscapital.com wrote:
What’s the easiest way to change the port nodes listen for comm on from
other nodes? It appears that the default is 8080 which collides with my
tomcat server on one of our dev boxes. I tried doing something in
On Thu, 2011-02-03 at 15:35 -0800, Mike Malone wrote:
In my dealings with the Cassandra code, super columns end up making a
mess all over the place when algorithms need to be special cased and
branch based on the column/supercolumn distinction.
I won't even mention what it does to the
One of my nodes is 76% full. I know that one of CFs represents 90% of the
data, others are really minor. Can I still compact under these conditions?
Will it crash and lose the data? Will it try to create one very large file
out of fragments, for that dominating CF?
TIA
--
View this message in
Thanks Gary.I'll keep an eye on things and see if it happens again.From reading the code I'm wondering if there is a small chance of a race condition in HintedHandoffManager.waitForSchemaAgreement() .Could the following happen? I'm a little unsure on exactly how the endpoint state is removed from
unsubscribe
instructions are herehttp://wiki.apache.org/cassandra/FAQ#unsubscribeOn 10 Feb, 2011,at 02:38 PM, Chance Li chanc...@gmail.com wrote:unsubscribe
Hi all,
Thanks Jonathan and Eric, you both describes what I want. Now I am looking
forward to play with them.
thanks
Eranda
AFAIK 2nd index only works for operator EQ.
-邮件原件-
发件人: Kallin Nagelberg [mailto:kallin.nagelb...@gmail.com]
发送时间: 2011年2月9日 3:36
收件人: user@cassandra.apache.org
主题: Re: time to live rows
I'm thinking if this row expiry notion doesn't pan out then I might
create a 'lastAccessed' column
Did you set compare_with attribute of your ColumnFamily to TimeUUIDType?
-邮件原件-
发件人: Bill Speirs [mailto:bill.spe...@gmail.com]
发送时间: 2011年2月2日 0:47
收件人: Cassandra Usergroup
主题: Row Key Types
What is the type of a Row Key? Can you define how they are compared?
I ask because I'm using
SCFs are very useful and I hope lives forever. We need them!
Best regards/ Pagarbiai
Viktor Jevdokimov
Senior Developer
Email: viktor.jevdoki...@adform.com
Phone: +370 5 212 3063
Fax: +370 5 261 0453
Konstitucijos pr. 23,
LT-08105 Vilnius,
Lithuania
Disclaimer: The information contained in
Mike, my problem is that I have an database and codebase that already uses
supercolumns. If I had to do it over, it wouldn't use them, for the reasons
you point out. In fact, I have a feeling that over time supercolumns will
become deprecated de facto, if not de jure. That's why I would like to
45 matches
Mail list logo