Re: cant seem to figure out secondary index definition

2011-03-07 Thread Jürgen Link
On 07.03.2011 02:10, Tyler Hobbs wrote: You want to add an index on a CF with TimeUUIDType column names? I think you've probably mistaken the purpose of compare_with. If you haven't, I think you'll need to add the index programmatically in order to specify a non ascii/unicode column name for

Re: Nodes frozen in GC

2011-03-07 Thread ruslan usifov
2011/3/6 aaron morton aa...@thelastpickle.com Your node is under memory pressure, after the GC there is still 5.7GB in use. In fact it looks like memory usage went up during the GC process. Can you reduce the memtable size, caches or the number of CF's or increase the JVM size? Also is this

debugging cassandra with eclipse Helios

2011-03-07 Thread Sagar Kohli
Hi all, I am trying to build the Cassandra source (0.7.3) with Eclipse Helios and followed the instructions given at http://wiki.apache.org/cassandra/RunningCassandraInEclipse. Though I have followed the complete instructions, it still shows errors that it could not resolve a class (though it is in

Re: debugging cassandra with eclipse Helios

2011-03-07 Thread ಕರಿಯ
Following the instructions should give you an error-free build and set up the project in Eclipse. Otherwise, try something as follows: Once the project is checked out, try running just the following ant targets, and once the project has been stably built, choose the required targets: *clean,

Re: cassandra in-production experiences with .7 series

2011-03-07 Thread Chris Burroughs
On 03/05/2011 05:27 PM, Paul Pak wrote: Hello all, I was wondering if people could share their overall experiences with using .7 series of Cassandra in production? Is anyone using it? For what it's worth we are using a dozen-node 0.7.x cluster and have not had any major problems (our uses

Re: Reducing memory footprint

2011-03-07 Thread Chris Burroughs
On 03/04/2011 03:51 PM, Casey Deccio wrote: Are you saying: that you want a smaller heap and what settings to change to accommodate that, or that you have already set a small heap of x and Cassandra is using significantly more than that? Based on my observation above, the latter. Casey

Re: Designing a decent data model for an online music shop...confused/stuck on decisions

2011-03-07 Thread Courtney
Thanks for the response, I haven't checked on the status of phpcassa in a while but does it now work with 0.7? That was one of the main reasons I switched to pandra, it seemed more up to date From: Tyler Hobbs Sent: Monday, March 07, 2011 2:40 AM To: user@cassandra.apache.org Subject: Re:

Re: Designing a decent data model for an online music shop...confused/stuck on decisions

2011-03-07 Thread Sasha Dolgy
phpcassa: https://github.com/thobbs/phpcassa is maintained and works well ... -sd On Mon, Mar 7, 2011 at 4:22 PM, Courtney e-mailadr...@hotmail.com wrote: Thanks for the response, I haven't checked on the status of phpcassa in a while but does it now work with 0.7? That was one of the

Re: [RELEASE] 0.7.3

2011-03-07 Thread Karl Hiramoto
just updated and after doing a scrub we see exceptions. ERROR [CompactionExecutor:1] 2011-03-07 15:46:53,811 AbstractCassandraDaemon.java (line 114) Fatal exception in thread Thread[CompactionExecutor:1,1,main] java.io.IOError: java.io.EOFException at

Re: What would be a good strategy for Storing the large text contents like blog posts in Cassandra.

2011-03-07 Thread Jean-Christophe Sirot
Hello, On 03/06/2011 06:35 PM, Aditya Narayan wrote: Next, I also need to store the blogComments which I am planning to store all, in another single row. 1 comment per column. Thus the entire information about a single comment like commentBody, commentor would be serialized (using google

Is secondary index consistent with its base table?

2011-03-07 Thread Alvin UW
Hello, I was wondering whether Secondary Index is consistent with its base table? How did you guarantee the consistency? Thanks Alvin

Re: [RELEASE] 0.7.3

2011-03-07 Thread Stephen Connolly
Artifacts should be in the Maven Central Repository by now -Stephen On 4 March 2011 21:52, Eric Evans eev...@rackspace.com wrote: It's only been a couple of weeks since the last release, but a rather nasty bug (some details here[1]) has since been fixed, and it seemed best to get that out

Installation

2011-03-07 Thread Mark
Where do most people install Cassandra to? /var or /opt? Thanks

Re: Is secondary index consistent with its base table?

2011-03-07 Thread Jonathan Ellis
Yes, this is guaranteed the same way single-row updates are guaranteed to be atomic (the commitlog). On Mon, Mar 7, 2011 at 10:13 AM, Alvin UW alvi...@gmail.com wrote: Hello, I was wondering whether Secondary Index is consistent with its base table? How did you guarantee the consistency?

Re: Is secondary index consistent with its base table?

2011-03-07 Thread Jonathan Ellis
It does, but this is an implementation detail subject to change (e.g., the bitmap indexes being added do not). On Mon, Mar 7, 2011 at 10:55 AM, Alvin UW alvi...@gmail.com wrote: Thanks. Does Cassandra store secondary index with an extra CF? 2011/3/7 Jonathan Ellis jbel...@gmail.com Yes,

Alternative to repair

2011-03-07 Thread Daniel Doubleday
Hi all, we're still on 0.6 and are facing problems with repairs. I.e. a repair for one CF takes around 60h and we have to do that twice (RF=3, 5 nodes). During that time the cluster is under pretty heavy IO load. It kinda works but during peak times we see lots of dropped messages (including

Re: how large can a cluster over the WAN be?

2011-03-07 Thread Robert Coli
On Sun, Mar 6, 2011 at 1:39 PM, Mimi Aluminium mimi.alumin...@gmail.com wrote: Are you familiar with Cassandra clusters that are installed in datacenters that are spread across the WAN? Can you comment on the performance of such an installation? What is the largest size of such a cluster you are

CompactionExecutor EOF During Bootstrap

2011-03-07 Thread Erik Onnen
During a recent upgrade of our cassandra ring from 0.6.8 to 0.7.3 and prior to a drain on the 0.6.8 nodes, we lost a node for reasons unrelated to cassandra. We decided to push forward with the drain on the remaining healthy nodes. The upgrade completed successfully for the remaining nodes and the

Re: how large can a cluster over the WAN be?

2011-03-07 Thread John Lewis
When you say decent latency and throughput what numbers do you consider decent? I know throughput would be highly dependent on the quantity of kb shoved through the pipe so I would expect throughput needs would be highly dependent on the data actually in cassandra. Thanks for the info, I am

Re: Installation

2011-03-07 Thread Eric Gilmore
The DataStax packaged releases http://www.datastax.com/docs/0.7/configuration/packaged_releases follow standard practices for Linux-ish installation, so they might be a good model to follow. For instance, the RHEL/CentOS package installs the binaries (cassandra-cli, nodetool) in /usr/bin,

changing ip's ...

2011-03-07 Thread Sasha Dolgy
hi there, is there an easy way to 'un-mess' things when the ip of a server is changed? updating the cassandra.yaml didn't help. when the member with the changed ip comes up, it's fine ... but other members in the ring don't see it and keep the old ip address regardless of the yaml changing

recommended way to grow a cluster?

2011-03-07 Thread Paul Pak
Hello, I'm doing some testing of Cassandra and I've read a lot about people running into situations growing their clusters. So, I'm about to test it with .7.3. I've got a test node which is a single node with a replication factor of 1. I'd like to grow it to 3 nodes and a replication factor of

Re: Nodes frozen in GC

2011-03-07 Thread Aaron Morton
It's always possible to run out of memory. Can you provide... - number cf's and their Memtable settings - any row or key cache settings - any other buffer or memory settings you may have changed in Cassandra.yaml. - what load you are putting on the cluster, e.g. Inserting x rows/columns per

Re: CompactionExecutor EOF During Bootstrap

2011-03-07 Thread Erik Onnen
Thanks Jonathan. Filed: https://issues.apache.org/jira/browse/CASSANDRA-2283 We'll start the scrub during our normal compaction cycle and update this thread and the bug with the results. -erik On Mon, Mar 7, 2011 at 11:27 AM, Jonathan Ellis jbel...@gmail.com wrote: It sounds like it doesn't

Re: What would be a good strategy for Storing the large text contents like blog posts in Cassandra.

2011-03-07 Thread Aaron Morton
You can fill your boots. So long as your boots have a capacity of 2 billion. Background ... http://wiki.apache.org/cassandra/LargeDataSetConsiderations http://wiki.apache.org/cassandra/CassandraLimitations

Re: Nodes frozen in GC

2011-03-07 Thread Jonathan Ellis
It sounds like you're complaining that the JVM sometimes does stop-the-world GC. You can mitigate this but not (for most workloads) eliminate it with GC option tuning. That's simply the state of the art for Java garbage collection right now. On Sun, Mar 6, 2011 at 2:18 AM, ruslan usifov

Re: Nodes frozen in GC

2011-03-07 Thread ruslan usifov
2011/3/8 Jonathan Ellis jbel...@gmail.com It sounds like you're complaining that the JVM sometimes does stop-the-world GC. You can mitigate this but not (for most workloads) eliminate it with GC option tuning. That's simply the state of the art for Java garbage collection right now. Hm,

Re: Nodes frozen in GC

2011-03-07 Thread Paul Pak
So, are you saying this is normal and expected from Cassandra? So, under load, we can expect java garbage collection to stop the Cassandra process on that server from time to time, essentially taking out the node for short periods of time while it does garbage collection? Also, why is there so

Re: Nodes frozen in GC

2011-03-07 Thread Paul Pak
Hi Ruslan, It looks like Jonathan and Stu have already been working to reduce garbage collection on v.8 The ticket is at https://issues.apache.org/jira/browse/CASSANDRA-2252 Jonathan, is there any way to apply the patch to .73 and have ruslan test it to see if it fixes his issue with Garbage

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
Can you tell me how many SSTables on disk when you see GC pauses? In your 3 node cluster, what's the RF factor? On Mon, Mar 7, 2011 at 1:50 PM, ruslan usifov ruslan.usi...@gmail.com wrote: 2011/3/8 Jonathan Ellis jbel...@gmail.com It sounds like you're complaining that the JVM sometimes does

Re: Data model

2011-03-07 Thread aaron morton
Will answer on the user list. Aaron On 8/03/2011, at 1:11 AM, Baskar wrote: Does Cassandra allow nesting of column families? Here is the use case - we need to store calls made by employees - employees are associated with an account - accounts have phone numbers - many calls are made by

Re: Nodes frozen in GC

2011-03-07 Thread ruslan usifov
2011/3/8 Chris Goffinet c...@chrisgoffinet.com Can you tell me how many SSTables on disk when you see GC pauses? In your 3 node cluster, what's the RF factor? About 30-40, and i use RF=2, and insert rows with QUORUM consistency level
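A note on the RF=2 and QUORUM combination mentioned above: a quorum is floor(RF / 2) + 1 replicas, so with RF=2 every QUORUM write has to be acknowledged by both replicas. The arithmetic can be sketched as follows; the function names are hypothetical and are not part of any Cassandra API.

```python
def quorum(replication_factor: int) -> int:
    """Replicas that must acknowledge a QUORUM operation: floor(RF / 2) + 1."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return replication_factor // 2 + 1


def tolerated_replica_failures(replication_factor: int) -> int:
    """Replicas that may be unavailable while QUORUM can still be met."""
    return replication_factor - quorum(replication_factor)


if __name__ == "__main__":
    for rf in (1, 2, 3, 5):
        print(rf, quorum(rf), tolerated_replica_failures(rf))
```

With RF=2 the quorum is 2, so a single slow or paused replica (for example one in a long GC pause) delays every QUORUM write that touches it; with RF=3 the quorum is still 2, which leaves room for one unavailable replica.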

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
The rows you are inserting, what is your update ratio to those rows? On Mon, Mar 7, 2011 at 4:03 PM, ruslan usifov ruslan.usi...@gmail.com wrote: 2011/3/8 Chris Goffinet c...@chrisgoffinet.com Can you tell me how many SSTables on disk when you see GC pauses? In your 3 node cluster, what's

Re: Data model

2011-03-07 Thread aaron morton
No to nesting CF's. When creating the data model, it's best to think about how you need to read the data. Denormalise and group data to work towards getting all the data you need by reading one or more rows from the same CF. This is not a hard rule, just a guideline I use. Without knowing

Re: Nodes frozen in GC

2011-03-07 Thread ruslan usifov
2011/3/8 Chris Goffinet c...@chrisgoffinet.com The rows you are inserting, what is your update ratio to those rows? I don't update them, only insert, at a rate of 16000 per second

Re: Error when bringing up nodes during failure testing

2011-03-07 Thread aaron morton
It's failing because when the node bootstraps it does not know about enough nodes to support the RF... replication factor (3) exceeds number of endpoints (2) I *think* the normal work around is to disable autobootstrap, bring the nodes up then run nodetool join or StorageService.joinRing()
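The condition behind the quoted error can be illustrated with a minimal sketch. This is not Cassandra's actual code; the function name and signature are hypothetical, and only the error wording is taken from the message above.

```python
def check_replication(replication_factor: int, known_endpoints: int) -> None:
    """Raise if there are fewer known endpoints than replicas requested.

    Illustrative only: mirrors the quoted error wording, not Cassandra internals.
    """
    if replication_factor > known_endpoints:
        raise ValueError(
            f"replication factor ({replication_factor}) exceeds "
            f"number of endpoints ({known_endpoints})"
        )


if __name__ == "__main__":
    check_replication(3, 3)  # enough endpoints, no error
    try:
        check_replication(3, 2)  # the situation described in the thread
    except ValueError as err:
        print(err)
```

A bootstrapping node that has learned about only two endpoints cannot place three replicas, which is why the check fails until the node knows about more of the ring.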

when do snapshots go away?

2011-03-07 Thread Jeffrey Wang
Hi all, When I drop a column family, it creates a snapshot. When does the snapshot go away and free up the disk space? I was able to run nodetool clearsnapshot to get rid of them, but will they go away themselves? (Also, is there a purpose to keeping a snapshot around?) -Jeffrey

Re: Nodes frozen in GC

2011-03-07 Thread Chris Goffinet
How large are your SSTables on disk? My thought was because you have so many on disk, we have to store the bloom filter + every 128 keys from index in memory. On Mon, Mar 7, 2011 at 4:35 PM, ruslan usifov ruslan.usi...@gmail.com wrote: 2011/3/8 Chris Goffinet c...@chrisgoffinet.com The rows
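To get a feel for the "every 128 keys from index in memory" part, here is a rough back-of-the-envelope sketch. The interval of 128 comes from the message above; the key count, the average key size, and the function names are assumptions for illustration, and bloom filter and per-entry JVM object overhead are deliberately left out.

```python
import math


def sampled_index_entries(total_keys: int, interval: int = 128) -> int:
    """Index entries held in memory when every `interval`-th key is sampled."""
    if total_keys < 0 or interval < 1:
        raise ValueError("total_keys must be >= 0 and interval >= 1")
    return math.ceil(total_keys / interval)


def sampled_key_bytes(total_keys: int, avg_key_bytes: int, interval: int = 128) -> int:
    """Raw bytes of the sampled keys only (no object or bloom filter overhead)."""
    return sampled_index_entries(total_keys, interval) * avg_key_bytes


if __name__ == "__main__":
    # Hypothetical figures: 10 million keys across all SSTables, 32-byte keys.
    print(sampled_index_entries(10_000_000))
    print(sampled_key_bytes(10_000_000, 32))
```

The real footprint on the heap would be larger than this raw figure, so the sketch is only useful for comparing orders of magnitude between workloads.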

Re: changing ip's ...

2011-03-07 Thread aaron morton
Not sure this fits your problem, but if you pass -Dcassandra.load_ring_state=false as a JVM option it will stop the node from loading the saved endpoints. It should then rediscover them via gossip. It sounds like you've changed a seed ip, is that correct? Not sure there are any other implications

Re: Error when bringing up nodes during failure testing

2011-03-07 Thread aaron morton
1) yes 2) um, not sure. The nodetool output below looks like there are only 2 nodes in that cluster, i.e. there are no down nodes. Aaron On 8/03/2011, at 2:11 PM, mcasandra wrote: aaron morton wrote: It's failing because when the node bootstraps it does not know about enough nodes to

Re: Error when bringing up nodes during failure testing

2011-03-07 Thread mcasandra
aaron morton wrote: 2) um, not sure. The nodetool output below looks like there are only 2 nodes in that cluster, i.e. there are no down nodes. There are actually 3 nodes. Not sure why it's not showing the other node in the output which is currently down. The error I am getting is from the

TException: Error: TSocket read 0 bytes

2011-03-07 Thread ko...@vivinavi.com
Hi Jonathan Ellis, Thank you for your advice. It was useful for me but I still have another problem. Now Cassandra 0.6.12 is working. So I installed Thrift 0.7.2 for PHP interface. Then I tried an example program from Cassandra Wiki ThriftExamples. However fread or fwrite in TSocket.php doesn't

Splitting the data of a single blog into 2 CFs (to implement effective caching) according to views.

2011-03-07 Thread Aditya Narayan
My application displays a list of several blogs' overview data (like blogTitle/ nameOfBlogger/ shortDescription for each blog) on the 1st page (in much the same manner as Digg's newsfeed) and when the user selects a particular blog to see, the application takes him to that specific blog's full