Cassandra and count

2011-01-28 Thread Sébastien Druon
Hello, I have a question concerning count in cassandra, as I would like to count the rows of a CF: - is it mandatory to specify a range? - what is the cost of a count operation on a CF? Thanks in advance for the answers Sebastien

Secondary Index information

2011-01-28 Thread Sasha Dolgy
Hi there, Where can I find information regarding secondary indexes? Spent the past 2 days looking for some good details. http://wiki.apache.org/cassandra/SecondaryIndexes doesn't yet exist, although it's referenced from http://wiki.apache.org/cassandra/StorageConfiguration Trying to

Re: Secondary Index information

2011-01-28 Thread Jake Luciani
http://www.datastax.com/blog/whats-new-cassandra-07-secondary-indexes On Fri, Jan 28, 2011 at 7:15 AM, Sasha Dolgy sasha.do...@gmail.com wrote: Hi there, Where can I find information regarding secondary indexes? Spent the past 2 days looking for some good details.

Re: Secondary Index information

2011-01-28 Thread Sasha Dolgy
Thank you. So, after reading, I'm still unsure if this feature will afford me a larger benefit when compared to an inverted index solution. Has anyone done a pros / cons ? -sd On Fri, Jan 28, 2011 at 3:22 PM, Jake Luciani jak...@gmail.com wrote:

Re: Secondary Index information

2011-01-28 Thread Sasha Dolgy
Hi Victor, In my research and playing around with nosql, specifically cassandra, I see the true benefit in defining search requirements and then creating CF's and the hierarchy based on this. So for me, I see quite a bit of simplicity in defining alternate CF's to allow me to efficiently find a

unsubscribe

2011-01-28 Thread S Ahmed

Re: reduced cached mem; resident set size growth

2011-01-28 Thread sridhar basam
On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com wrote: java -version java version 1.6.0_20 Java(TM) SE Runtime Environment (build 1.6.0_20-b02) Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, mixed mode) cmd line arg (paths edited):

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 10:51 AM, sridhar basam wrote: On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com wrote: java -version java version 1.6.0_20 Java(TM) SE Runtime Environment (build 1.6.0_20-b02) Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, mixed mode) cmd line

Re: Cassandra and count

2011-01-28 Thread buddhasystem
As far as I know, there are no aggregate operations built into Cassandra, which means you'll have to retrieve all of the data to count it in the client. I had a thread on this topic 2 weeks ago. It's pretty bad. -- View this message in context:

Re: Cassandra and count

2011-01-28 Thread Victor Kabdebon
buddhasystem is right. A count returns the columns to the client, which counts them. My advice: do not count big columns / supercolumns. People in the dev team are trying to develop distributed counters but I don't know the state of this research. Best regards, Victor Kabdebon http://www.voxnucleus.fr
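The client-side counting described in this thread can be sketched as a paged scan over the row keys. This is only an illustration under stated assumptions: `ROW_KEYS` and `fetch_keys` are hypothetical in-memory stand-ins, not Cassandra or Thrift API calls, and the keys are assumed to come back in a stable sorted order with an inclusive start key.

```python
from bisect import bisect_left

# Hypothetical stand-in for a column family: sorted row keys held in memory.
ROW_KEYS = sorted(f"user{i:04d}" for i in range(2500))


def fetch_keys(start_key, limit):
    """Hypothetical range read: up to `limit` keys >= start_key (start is inclusive)."""
    i = bisect_left(ROW_KEYS, start_key)
    return ROW_KEYS[i:i + limit]


def count_rows(page_size=1000):
    """Count rows on the client by paging through the whole key range."""
    if page_size < 2:
        # With an inclusive start key, a page of one key would never advance.
        raise ValueError("page_size must be at least 2")
    total = 0
    start = ""
    first_page = True
    while True:
        page = fetch_keys(start, page_size)
        # Every page after the first repeats the previous page's last key.
        total += len(page) if first_page else len(page[1:])
        if len(page) < page_size:
            return total
        start = page[-1]
        first_page = False
```

Every key still crosses the wire once, so the cost grows with the number of rows; paging only bounds how much the client holds in memory at a time.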

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Jake Luciani
Are you using a row cache? If so, what is it set to? In general it should not be a percentage. On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com wrote: We have a 6 node Cassandra 0.6.8 cluster running on boxes with 4 GB of RAM. Over the course of several weeks

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 11:29 AM, Jake Luciani wrote: Are you using a row cache? If so, what is it set to? In general it should not be a percentage. ColumnFamily CompareWith=UTF8Type Name=mycolumnfamily KeysCached=0 KeyCacheSavePeriodInSeconds=0 RowsCached=40

Re: Secondary Index information

2011-01-28 Thread B. Todd Burruss
batch_mutate doesn't guarantee consistency. each mutation in the batch is guaranteed to be consistent based on your CL, but if it returns an error it means that it couldn't complete all mutations ... but the converse isn't true. it may have successfully completed some mutations. if you get

Re: unsubscribe

2011-01-28 Thread Eric Evans
Unsubscribing from this list is easy, provided you read the instructions sent to you when confirming your subscription, or those conveniently located on the web. http://lmgtfy.com/?q=apache+mailing+lists+unsubscribe&l=1 If after reading these directions, you still require help getting off the

Re: unsubscribe

2011-01-28 Thread Sasha Dolgy
it's also in the header info of every email ... List-Help: mailto:user-h...@cassandra.apache.org List-Unsubscribe: mailto:user-unsubscr...@cassandra.apache.org On Fri, Jan 28, 2011 at 6:28 PM, Eric Evans eev...@rackspace.com wrote: Unsubscribing from this list is easy, provided you read the

Re: reduced cached mem; resident set size growth

2011-01-28 Thread sridhar basam
What about your permgen usage? Do you track that? Use something like jstat -gc -t <pid> 5s 100 to track it. Or turn up verbose GC on your command line options to see what is happening. Sridhar On Fri, Jan 28, 2011 at 11:38 AM, Chris Burroughs chris.burrou...@gmail.com wrote: On 01/28/2011

Is it recommended to store two types of data (not related to each other but need to be retrieved together) in one super column family ?

2011-01-28 Thread Ertio Lew
Hi, I have two kinds of data that I would like to fit in one super column family; I am trying this, for the reasons of implementing fast database retrievals by combining the data of two rows into just one row. First kind of data, in supercolumn family, is named with timeUUIDs as supercolumn

0.7.1 release

2011-01-28 Thread B. Todd Burruss
any word on when to expect 0.7.1? lots of good fixes we need. trying to decide if i should apply patches or wait. thx!

Re: 0.7.1 release

2011-01-28 Thread Jonathan Ellis
Eric just posted to -dev that he's putting a new set of artifacts together for vote. On Fri, Jan 28, 2011 at 12:48 PM, B. Todd Burruss bburr...@real.com wrote: any word on when to expect 0.7.1?  lots of good fixes we need.  trying to decide if i should apply patches or wait. thx! --

Re: Node going down when streaming data, what next?

2011-01-28 Thread aaron morton
nodetool removetoken or nodetool decommission http://wiki.apache.org/cassandra/Operations#Removing_nodes_entirely Hope that helps Aaron On 28 Jan 2011, at 11:30, buddhasystem wrote: OK, after running repair and waiting overnight the rebalancing worked and now 3 nodes share the load as I

Re: Node going down when streaming data, what next?

2011-01-28 Thread buddhasystem
Sorry Aaron but this doesn't help. As I said, machine is dead, kaput, finished. So I can't do decommission. I can remove token to any other node, but -- the dead machine is going to hang around in my ring reports like a zombie. -- View this message in context:

[RELEASE] 0.6.11

2011-01-28 Thread Eric Evans
It seems like it was just earlier this week that we announced the release of 0.6.10. Oh wait, it was. In the time since though, CASSANDRA-2058[1] was found and fixed, and that seemed like reason enough to fast-track a new release. Source and binary archives are available from the Downloads

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
+1 On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote: It seems like it was just earlier this week that we announced the release of 0.6.10. Oh wait, it was. In the time since though, CASSANDRA-2058[1] was found and fixed, and that seemed like reason enough to

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
Err. I meant, thanks Evan for getting this released so fast :) On Fri, Jan 28, 2011 at 3:18 PM, Chris Goffinet c...@chrisgoffinet.com wrote: +1 On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans eev...@rackspace.com wrote: It seems like it was just earlier this week that we announced the

get_slice OOM on large row

2011-01-28 Thread Rajat Chopra
Hi! Trying to test the 0.7 release with some offbeat settings to check the behavior. - Single node cluster - Key_cache_size - default - Row_cache_size - default - Min/max compaction threshold - 0 (so this is disabled) - Disk_access_mode :

Re: Node going down when streaming data, what next?

2011-01-28 Thread Robert Coli
On Fri, Jan 28, 2011 at 1:51 PM, buddhasystem potek...@bnl.gov wrote: I can remove token to any other node, but -- the dead machine is going to hang around in my ring reports like a zombie. If you remove token on the other nodes and the dead machine hangs around, that sounds like a bug? I

Re: get_slice OOM on large row

2011-01-28 Thread Jonathan Ellis
Requesting too much data in a single request is user error. That is why you have start columns/rows, so you can page through a large set. On Fri, Jan 28, 2011 at 6:48 PM, Rajat Chopra rcho...@makara.com wrote: Hi!    Trying to test the 0.7 release with some offbeat settings to check the
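Paging through a wide row with a start column, as suggested here, might look like the sketch below. It assumes an inclusive start column and sorted column names; `ROW`, `NAMES` and `get_slice` are hypothetical in-memory stand-ins for illustration, not the Thrift `get_slice` call itself.

```python
from bisect import bisect_left

# Hypothetical stand-in for one wide row: column names mapped to values.
ROW = {f"col{i:05d}": i for i in range(10_000)}
NAMES = sorted(ROW)


def get_slice(start, count):
    """Hypothetical slice read: up to `count` (name, value) pairs with name >= start."""
    i = bisect_left(NAMES, start)
    return [(name, ROW[name]) for name in NAMES[i:i + count]]


def iter_columns(page_size=500):
    """Yield every column of the row while holding at most one page in memory."""
    if page_size < 2:
        # With an inclusive start column, a page of one column would never advance.
        raise ValueError("page_size must be at least 2")
    start, skip_first = "", False
    while True:
        page = get_slice(start, page_size)
        # The start column is inclusive, so later pages repeat one column.
        for name, value in (page[1:] if skip_first else page):
            yield name, value
        if len(page) < page_size:
            return
        start, skip_first = page[-1][0], True
```

Each request is bounded by `page_size`, so no single response has to carry the whole row.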

RE: get_slice OOM on large row

2011-01-28 Thread Rajat Chopra
Thanks Jonathan. But the read fails in all cases, even when start_column/end_column span is 10 columns here, and even when column_count is set appropriately. Or did I miss what you said? The trace seems to suggest an entire super_column is being deserialized. Rajat -Original Message-

Re: Node going down when streaming data, what next?

2011-01-28 Thread buddhasystem
It does remove tokens, and the ring shows that the problematic node owns 0 tokens, which is OK. However, it's still there, listed. It's not a bug but kind of like a feature -- you can move that node back in two days later and move tokens in same or different way. What I wish happened was that

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 12:42 PM, sridhar basam wrote: What about your permgen usage? Do you track that? Use something like jstat -gc -t <pid> 5s 100 to track it. Or turn up verbose GC on your command line options to see what is happening. http://img59.imageshack.us/img59/1056/permgen.png This is over

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 04:12 AM, Zhu Han wrote: Chris, Somebody else and I have the same problem as you, and reported it here: http://www.apacheserver.net/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-at1082970.htm [NB: It is not solved although the titles said so. Some response from

Re: get_slice OOM on large row

2011-01-28 Thread Jonathan Ellis
http://wiki.apache.org/cassandra/CassandraLimitations Any request for a subcolumn deserializes _all_ the subcolumns in that supercolumn, so you want to avoid a data model that requires large numbers of subcolumns. On Fri, Jan 28, 2011 at 7:40 PM, Rajat Chopra rcho...@makara.com wrote: Thanks

Cassandra events next week around Strata

2011-01-28 Thread Jonathan Ellis
Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: http://strataconf.com/strata2011/public/schedule/detail/16911 2. Cassandra Meetup organized by Jeremy Hanna Tuesday evening:

Re: Cassandra events next week around Strata

2011-01-28 Thread Roshan Dawrani
Hello. No India-based Cassandra / NoSQL events? :-( :-) On Sat, Jan 29, 2011 at 8:31 AM, Jonathan Ellis jbel...@gmail.com wrote: Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon:

Re: Cassandra events next week around Strata

2011-01-28 Thread B. Todd Burruss
web site says sold out, too bad for me ;) On 01/28/2011 07:01 PM, Jonathan Ellis wrote: Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: