Timeout reading row from CF with collections

2013-07-12 Thread Paul Ingalls
I'm running into a problem trying to read data from a column family that includes a number of collections. Cluster details: 4 nodes running 1.2.6 on VMs with 4 CPUs and 7 GB of RAM; RAID 0 striped across 4 disks for the data and logs; each node has about 500 MB of data currently loaded. Here is

Cassandra-CQL-Csharp-driver-sample

2013-07-12 Thread Murali
Hi, I created a very simple CRUD sample using the Cassandra CQL C# driver. If anybody is interested, please try it out; feedback and comments are welcome. https://github.com/muralidharand/cassandra-CQL-csharp-driver-sample -- Thanks, Murali

Re: Node tokens / data move

2013-07-12 Thread aaron morton
Can he not specify all 256 tokens in the YAML of the new cluster and then copy sstables? I know it is a bit ugly but should work. You can pass a comma separated list of tokens to the -Dcassandra.replace_token JVM param. AFAIK it's not possible to provide the list in the yaml

Re: how to determine RF on the fly ?

2013-07-12 Thread aaron morton
It's available on the Thrift API call describe_keyspaces() https://github.com/apache/cassandra/blob/trunk/interface/cassandra.thrift#L730 Cheers - Aaron Morton Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 11/07/2013, at 7:04 AM, Robert Coli

Re: Quorum reads and response time

2013-07-12 Thread aaron morton
But when I run the same query with consistency level as Quorum, it is taking ~2.3 seconds. It feels as if querying of the nodes are in sequence. No. As Sankalp says look for GC issues. If none then take a look at how much data you are pulling back, and tell us what sort of query you are

Re: Timeout reading row from CF with collections

2013-07-12 Thread Sylvain Lebresne
My bet is that you're hitting https://issues.apache.org/jira/browse/CASSANDRA-5677. -- Sylvain On Fri, Jul 12, 2013 at 8:17 AM, Paul Ingalls paulinga...@gmail.com wrote: I'm running into a problem trying to read data from a column family that includes a number of collections. Cluster

Re: temporarily running a cassandra side by side in production

2013-07-12 Thread aaron morton
We are starting to think we are going to try to run a side by side cassandra instance in production while we map/reduce from one cassandra into the new instance. What do you mean by side-by-side ? Can I assume a cassandra instance will not only bind to the new ports when I change these

Re: manually removing sstable

2013-07-12 Thread aaron morton
That sounds sane to me. Couple of caveats: * Remember that Expiring Columns turn into Tombstones and can only be purged after TTL and gc_grace. * Tombstones will only be purged if all fragments of a row are in the SStable(s) being compacted. Cheers - Aaron Morton Cassandra
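A rough sketch of the first caveat above, assuming timestamps in seconds and assuming that "after TTL and gc_grace" means the two intervals add up from the write time. The function names are hypothetical and not part of Cassandra. The second caveat (all fragments of the row must be in the SSTables being compacted) is not modelled here.

```python
def earliest_purge_time(write_time, ttl_seconds, gc_grace_seconds):
    """Earliest time (in seconds) at which an expiring column could be purged.

    Assumption: the column expires at write_time + ttl_seconds, becomes a
    tombstone at that point, and the tombstone must then outlive gc_grace.
    """
    return write_time + ttl_seconds + gc_grace_seconds


def is_purgeable(now, write_time, ttl_seconds, gc_grace_seconds):
    """True once both the TTL and gc_grace have elapsed."""
    return now >= earliest_purge_time(write_time, ttl_seconds, gc_grace_seconds)


if __name__ == "__main__":
    # Written at t=1000 with a one-hour TTL and a ten-day gc_grace.
    print(earliest_purge_time(1000, 3600, 864000))
```

Even when this check passes, a compaction may still keep the tombstone if other fragments of the row live in SSTables that are not part of that compaction.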

Re: IllegalArgumentException on query with AbstractCompositeType

2013-07-12 Thread aaron morton
The “ALLOW FILTERING” clause also has no effect. You only need that when the WHERE clause contains predicates for columns that are not part of the primary key. CREATE INDEX ON conv_msgdata_by_participant_cql(msgReadFlag); In general this is a bad idea in Cassandra (also in a relational DB

Re: manually removing sstable

2013-07-12 Thread Theo Hultberg
Thanks Aaron, the second point I had not considered, and it could explain why the sstables don't always disappear completely; sometimes a small file (but megabytes instead of gigabytes) is left behind. T# On Fri, Jul 12, 2013 at 10:25 AM, aaron morton aa...@thelastpickle.com wrote: That sounds

Extract meta-data using cql 3

2013-07-12 Thread Murali
Hi experts, How can I extract the metadata of a table or a keyspace using CQL 3.0? -- Thanks, Murali

Re: Extract meta-data using cql 3

2013-07-12 Thread Sylvain Lebresne
The raw answer is that you should query the system tables. The schema is stored in the 3 following tables: System.schema_keyspaces, System.schema_columnfamilies and System.schema_columns. Unfortunately, the information stored in there is, for different reasons, not in a form that makes a lot of

Re: Extract meta-data using cql 3

2013-07-12 Thread Theo Hultberg
There's a keyspace called system which has a few tables that contain the metadata, for example schema_keyspaces, which contains keyspace metadata, and schema_columnfamilies, which contains table metadata. There are more; just fire up cqlsh and do a describe keyspace in the system keyspace to find them.
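The two replies above can be sketched as a small helper that builds the CQL text for the three schema tables named in the thread. This is only an illustration: the helper name `schema_query` is hypothetical, the filter columns `keyspace_name` and `columnfamily_name` are an assumption about the 1.2-era layout of those tables, and the string quoting is naive. It produces query strings only and does not talk to a cluster.

```python
# Table names as given in the thread; they live in the "system" keyspace.
SCHEMA_TABLES = {
    "keyspaces": "schema_keyspaces",
    "tables": "schema_columnfamilies",
    "columns": "schema_columns",
}


def schema_query(kind, keyspace=None, table=None):
    """Build a CQL SELECT against one of the system schema tables.

    Assumption: rows are keyed by keyspace_name, then columnfamily_name.
    """
    if table is not None and (keyspace is None or kind == "keyspaces"):
        raise ValueError("a table filter needs a keyspace and a per-table kind")
    cql = "SELECT * FROM system.%s" % SCHEMA_TABLES[kind]
    clauses = []
    if keyspace is not None:
        clauses.append("keyspace_name = '%s'" % keyspace)
    if table is not None:
        clauses.append("columnfamily_name = '%s'" % table)
    if clauses:
        cql += " WHERE " + " AND ".join(clauses)
    return cql


if __name__ == "__main__":
    print(schema_query("keyspaces"))
    print(schema_query("columns", "my_ks", "my_table"))
```

The resulting strings can be pasted into cqlsh or passed to whichever driver is in use.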

Re: Alternate major compaction

2013-07-12 Thread Radim Kolar
With very little work (less than 10 KB of code) it is possible to have an online sstable splitter and to export this functionality over JMX.

Error: Main method not found in class org.apache.cassandra.service.CassandraDaemon

2013-07-12 Thread Vivek Mishra
Earlier, everything was working fine, but now I am getting this strange error. Initially I was working from a tarball installation and then installed a Cassandra RPM package. Since then, I am getting Error: Main method not found in class org.apache.cassandra.service.CassandraDaemon, please define the

[BETA RELEASE] Apache Cassandra 2.0.0-beta1 released

2013-07-12 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of the first beta for the future Apache Cassandra 2.0.0. Let me first stress that this is beta software and as such is *not* ready for production use. The goal of this release is to give a preview of what will become Cassandra 2.0 and to get

Re: temporarily running a cassandra side by side in production

2013-07-12 Thread Hiller, Dean
Heh, oops, yes, we have 12 nodes and are trying to run 2 instances of Cassandra on those 12 nodes. So far, in QA this appears to be working. I like the cluster name change idea as a just-in-case, so I will definitely be doing that one. Thanks, Dean From: aaron morton

Compression ratio

2013-07-12 Thread cem
Hi All, Can anyone explain the compression ratio? Is it compressed / original or original / compressed? Or something else? Thanks a lot. Best Regards, Cem

Representation of dynamically added columns in table (column family) schema using cqlsh

2013-07-12 Thread Shahab Yunus
A basic question and it seems that I have a gap in my understanding. I have a simple table in Cassandra with multiple column families. I add new columns to each of these column families on the fly. When I view (using the 'DESCRIBE table' command) the schema of a particular column family, I see

Re: node tool ring displays 33.33% owns on 3 node cluster with replication

2013-07-12 Thread Andrew Bialecki
Not sure if it's the best/intended behavior, but you should see it go back to 100% if you run: nodetool -h 127.0.0.1 -p 8080 ring keyspace. I think the rationale for showing 33% is that different keyspaces might have different RFs, so it's unclear what to show for ownership. However, if you
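The 33% versus 100% difference can be illustrated with a little arithmetic. This is a simplified sketch that assumes evenly balanced tokens and a single data center; the function names are hypothetical and this is not how nodetool itself computes the figure.

```python
def raw_ownership(node_count):
    """Share of the token ring each node owns, ignoring replication."""
    return 1.0 / node_count


def effective_ownership(node_count, replication_factor):
    """Share of a keyspace's data each node holds once replicas are counted.

    Assumption: tokens are evenly balanced, so each node stores
    replication_factor / node_count of the data, capped at 100%.
    """
    return min(1.0, replication_factor / node_count)


if __name__ == "__main__":
    # Three nodes: one third of the ring each without a keyspace argument.
    print("%.2f%%" % (raw_ownership(3) * 100))
    # With a keyspace whose replication factor is 3, every node holds everything.
    print("%.2f%%" % (effective_ownership(3, 3) * 100))
```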

Re: Compression ratio

2013-07-12 Thread Yuki Morishita
it's compressed/original. https://github.com/apache/cassandra/blob/cassandra-1.1.11/src/java/org/apache/cassandra/io/sstable/SSTableMetadata.java#L124 On Fri, Jul 12, 2013 at 10:02 AM, cem cayiro...@gmail.com wrote: Hi All, Can anyone explain the compression ratio? Is it the compressed data
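As a worked example of the answer above (compressed / original), here is a minimal sketch. The function name is hypothetical and is not taken from SSTableMetadata.java.

```python
def compression_ratio(compressed_bytes, original_bytes):
    """Compression ratio as compressed size divided by original size.

    With this convention, smaller is better: 0.25 means the data on disk
    takes a quarter of its uncompressed size.
    """
    if original_bytes <= 0:
        raise ValueError("original_bytes must be positive")
    return compressed_bytes / original_bytes


if __name__ == "__main__":
    print(compression_ratio(25, 100))
```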

Re: Compression ratio

2013-07-12 Thread cem
Thank you very much! On Fri, Jul 12, 2013 at 5:59 PM, Yuki Morishita mor.y...@gmail.com wrote: it's compressed/original. https://github.com/apache/cassandra/blob/cassandra-1.1.11/src/java/org/apache/cassandra/io/sstable/SSTableMetadata.java#L124 On Fri, Jul 12, 2013 at 10:02 AM, cem

Re: How many DCs can you have in a cluster?

2013-07-12 Thread sankalp kohli
More than the DC, I think you will be bound by the number of replicas. I don't know how it will work with a replication factor of 10-20, especially for range queries. On Thu, Jul 11, 2013 at 7:14 PM, Blair Zajac bl...@orcaware.com wrote: In this C* Summit 2013 talk titled A Deep Dive Into How

AUTO : Samuel CARRIERE is out of the office (returning 07/08/2013)

2013-07-12 Thread Samuel CARRIERE
I am out of the office until 07/08/2013. Note: this is an automatic reply to your message "Compression ratio" sent on 12/07/2013 17:02:11. This is the only notification you will receive while this person is away.

Re: How many DCs can you have in a cluster?

2013-07-12 Thread Blair Zajac
Yes, there's going to be a lot of replicas in total, but the replication factor will be 3 in each DC. Will it still be an issue? Blair On Jul 12, 2013, at 10:58 AM, sankalp kohli kohlisank...@gmail.com wrote: More than the DC, I think you will be bound by number of replicas. I dont know

Re: Timeout reading row from CF with collections

2013-07-12 Thread Paul Ingalls
Yep, that was it. I built from the cassandra 1.2 branch and no more timeouts. Thanks for getting that fix into 1.2! Paul On Jul 12, 2013, at 1:20 AM, Sylvain Lebresne sylv...@datastax.com wrote: My bet is that you're hitting https://issues.apache.org/jira/browse/CASSANDRA-5677. --

hot sstables evicted from page cache on compaction causing high latency

2013-07-12 Thread John Watson
Having a real issue where at the completion of large compactions, it will evict hot sstables from the kernel page cache causing huge read latency while it is backfilled. https://dl.dropboxusercontent.com/s/149h7ssru0dapkg/Screen%20Shot%202013-07-12%20at%201.46.19%20PM.png Blue line - page cache

Re: Alternate major compaction

2013-07-12 Thread Robert Coli
On Thu, Jul 11, 2013 at 9:43 PM, Takenori Sato ts...@cloudian.com wrote: I made the repository public. Now you can checkout from here. https://github.com/cloudian/support-tools checksstablegarbage is the tool. Enjoy, and any feedback is welcome. Thanks very much, useful tool! Out of

Re: Representation of dynamically added columns in table (column family) schema using cqlsh

2013-07-12 Thread Eric Stevens
If you're creating dynamic columns via the Thrift interface, they will not be reflected in the CQL3 schema. I would recommend not mixing paradigms like that; either stick with CQL3 or with Thrift / cassandra-cli. WITH COMPACT STORAGE creates column families which can be interacted with meaningfully via

Minimum CPU and RAM for Cassandra and Hadoop Cluster

2013-07-12 Thread Martin Arrowsmith
Dear Cassandra experts, I have an HP Proliant ML350 G8 server, and I want to put virtual servers on it. I would like to put the maximum number of nodes for a Cassandra + Hadoop cluster. I was wondering - what is the minimum RAM and memory per node that I need to have Cassandra + Hadoop before

Re: Representation of dynamically added columns in table (column family) schema using cqlsh

2013-07-12 Thread Shahab Yunus
Thanks Eric for the explanation. Regards, Shahab On Fri, Jul 12, 2013 at 11:13 AM, Shahab Yunus shahab.yu...@gmail.com wrote: A basic question and it seems that I have a gap in my understanding. I have a simple table in Cassandra with multiple column families. I add new columns to each of

Re: Node tokens / data move

2013-07-12 Thread Radim Kolar
Is it possible to change num_tokens on a node with data? I changed it and restarted the node, but it still shows the same number in nodetool status.

Re: Alternate major compaction

2013-07-12 Thread Takenori Sato
It's light. Without the -v option, you can even run it against just an SSTable file without needing the whole Cassandra installation. - Takenori On Sat, Jul 13, 2013 at 6:18 AM, Robert Coli rc...@eventbrite.com wrote: On Thu, Jul 11, 2013 at 9:43 PM, Takenori Sato ts...@cloudian.com wrote: I

Re: Rhombus - A time-series object store for Cassandra

2013-07-12 Thread Ananth Gundabattula
Hello Rob, Thanks for the pointer. I have a couple of queries: How does this project compare to the KairosDb project on github ( For one I see that Rhombus supports multi column query which is cool whereas kairos time series DB/OpenTSDB do not seem to have such a feature - although we can use