Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
The row is the unit of replication, all values with the same storage engine row key in a KS are on the same nodes. if they were per CF this would not hold. Not that it would be the end of the world, but that is the first thing that comes to mind. Cheers - Aaron Morton

Re: Does setstreamthroughput also throttle the network traffic caused by nodetool repair?

2013-01-28 Thread aaron morton
Will that throttle the network traffic caused by nodetool repair? yes. Should I call it to all the nodes on the cluster? Or set it in the yaml file. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 25/01/2013, at

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread Manu Zhang
On Mon 28 Jan 2013 04:42:49 PM CST, aaron morton wrote: The row is the unit of replication, all values with the same storage engine row key in a KS are on the same nodes. if they were per CF this would not hold. Not that it would be the end of the world, but that is the first thing that comes

CQL3 jdbc and Tomcat resource

2013-01-28 Thread Andy Cobley
I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a Eclipse project) as follows: Resource type=javax.sql.DataSource name=jdbc/CF1 factory=org.apache.tomcat.jdbc.pool.DataSourceFactory

Re: What is the default 'key_validation_class' on secondary INDEX(es)

2013-01-28 Thread Sylvain Lebresne
Your question is missing a what. What do you want to know the default of? If you are asking for the key_validation_class of the Index CF, then it's the column type that defines it. If you're asking about the index CF comparator, then in that example it would use a comparator that sorts like your

Re: CQL3 jdbc and Tomcat resource

2013-01-28 Thread Andy Cobley
Apologies, I was missing a few cassandra jar libs in the tomcat library. Andy On 28 Jan 2013, at 11:31, Andy Cobley acob...@computing.dundee.ac.uk wrote: I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a Eclipse project) as follows: Resource

Re: Denormalization

2013-01-28 Thread chandra Varahala
My experience we can design main column families and lookup column families. Main column family have all denormalized data,lookup column families have rowkey of denormalized column families's column. In users column family all user's denormalized data and lookup column family name like

RE: Accessing Metadata of Column Familes

2013-01-28 Thread Rishabh Agrawal
I found following issues while working on Cassandra version 1.2, CQL 3 and Thrift protocol 19.35.0. Case 1: Using CQL I created a table t1 with columns col1 and col2 with col1 being my primary key. When I access same data using CLI, I see col1 gets adopted as rowkey and col2 being another

Re: Accessing Metadata of Column Familes

2013-01-28 Thread Brian O'Neill
Through CQL, you see the logical schema. Through CLI, you see the physical schema. This may help: http://www.datastax.com/dev/blog/cql3-for-cassandra-experts -brian On Mon, Jan 28, 2013 at 7:26 AM, Rishabh Agrawal rishabh.agra...@impetus.co.in wrote: I found following issues while working on

data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
Hi, I´m a simple operations guy and new to Cassandra. I have the problem that one of our application is writing data into Cassandra (but not deleting them, because we should have a 90 days TTL). The application operates in 1 KS with 5 CF. my current setup: 3 node cluster and KS has a RF of 3 (I

[RELEASE] Apache Cassandra 1.2.1 released

2013-01-28 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra version 1.2.1. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here:

RE: data not shown up after some time

2013-01-28 Thread Viktor Jevdokimov
Are you sure your app is setting TTL correctly? TTL is in seconds. For 90 days it have to be 90*24*60*60=7776000. What If you set by accident 777600 (10 times less) - that will be 9 days, almost what you see. Best regards / Pagarbiai Viktor Jevdokimov Senior Developer Email:

RE: data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
Hi, No I have checked the TTL: 7776000 Very interesting is, if I do a simple list cf; the data is shown, but it I do a get cf where index='testvalue'; it returns 0 Row Returned. How can that be? Br, Matthias Zeilinger Production Operation - Shared Services P: +43 (0) 50 858-31185 M: +43 (0)

unsubscribe

2013-01-28 Thread Olivier Devos
unsubscribe From: Matthias Zeilinger [mailto:matthias.zeilin...@bwinparty.com] Sent: lundi 28 janvier 2013 15:32 To: user@cassandra.apache.org Subject: RE: data not shown up after some time Hi, No I have checked the TTL: 7776000 Very interesting is, if I do a simple “list cf;” the

Re: Cassandra timeout whereas it is not much busy

2013-01-28 Thread Nicolas Lalevée
I did some testing, I have a theory. First, we have it seems a lot of CF. And two are particularly every hungry in RAM, consuming a quite big amount of RAM for the bloom filters. Cassandra do not force the flush of the memtables if it has more than 6G of Xmx (luckily for us, this is the

unsubscribe

2013-01-28 Thread Olivier Devos
unsubscribe

Re: unsubscribe

2013-01-28 Thread Eric Evans
http://i.imgur.com/2ch9L.gif On Mon, Jan 28, 2013 at 8:36 AM, Olivier Devos olde...@gmail.com wrote: unsubscribe -- Eric Evans Acunu | http://www.acunu.com | @acunu

Re: unsubscribe

2013-01-28 Thread Alain RODRIGUEZ
You can try it a third time or you can rather try writing to user-unsubscr...@cassandra.apache.org Alain 2013/1/28 Olivier Devos olde...@gmail.com unsubscribe

Re: Unavaliable Exception

2013-01-28 Thread Everton Lima
Thanks for replies. 2013/1/25 Michael Kjellman mkjell...@barracuda.com More nodes! On Jan 25, 2013, at 7:21 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: fwiw, I have a mixed ubuntu 11.10 / 12.04 6 node cluster (AWS m1.xlarge). The load average is always between 0 and 5 for 11.10 nodes

cluster issues

2013-01-28 Thread S C
One of our node in a 3 node cluster drifted by ~ 20-25 seconds. While I figured this pretty quickly, I had few questions that am looking for some answers. We can always be proactive in keeping the time sync. But, Is there any way to recover from a time drift (in a reactive manner)? Since it

JDBC, Select * Cql2 vs Cql3 problem ?

2013-01-28 Thread Andy Cobley
I have the following code in my app using the JDBC (cassandra-jdbc-1.1.2.jar) drivers to CQL: try { rs= stmt.executeQuery(SELECT * FROM users); }catch(Exception et){ System.out.println(Can not execute statement +et); } When connecting to a CQL2 server (cassandra 1.1.5) the code

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Wei Zhu
Any thoughts? Thanks. -Wei - Original Message - From: Wei Zhu wz1...@yahoo.com To: user@cassandra.apache.org Sent: Friday, January 25, 2013 10:09:37 PM Subject: Re: Cassandra pending compaction tasks keeps increasing To recap the problem, 1.1.6 on SSD, 5 nodes, RF = 3, one CF only.

cql: show tables in a keystone

2013-01-28 Thread Paul van Hoven
Is there some way in cql to get a list of all tables or column families that belong to a keystore like show tables in sql?

Re: cql: show tables in a keystone

2013-01-28 Thread Brian O'Neill
cqlsh use keyspace; cqlsh:cirrus describe tables; For more info: cqlsh help describe -brian --- Brian O'Neill Lead Architect, Software Development Health Market Science The Science of Better Results 2700 Horizon Drive € King of Prussia, PA € 19406 M: 215.588.6024 € @boneill42

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Wei Zhu
Two fundamental questions: * Why did nodetool repairs bring so much data. A lot of SSTables are created, disk space almost doubled. * Why does level compactions run so slow? We turned off throtting completely and don't see much utilization of the SSD and CPU. One example, 0.7MB/s on

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
Another thing that's been confusing me is that when we talk about the data model should the row key be inside or outside a column family? My mental model is: cluster == database keyspace == table row == a row in a table CF == a family of columns in one row (I think that's different to

Re: Node selection when both partition key and secondary index field constrained?

2013-01-28 Thread aaron morton
It uses the index... cqlsh:dev tracing on; Now tracing requests. cqlsh:dev cqlsh:dev cqlsh:dev SELECT id, flag from foo WHERE TOKEN(id) '-9939393' AND TOKEN(id) = '0' AND flag=true; Tracing session: 128cab90-6982-11e2-8cd1-51eaa232562e activity |

Re: Issues with CQLSH in Cassandra 1.2

2013-01-28 Thread aaron morton
I was able to replicate it… $ bin/nodetool -h 127.0.0.1 -p 7100 describering foo Schema Version:253da4a3-e277-35b5-8d04-dbeeb3c9508e TokenRange: TokenRange(start_token:3074457345618258602, end_token:-9223372036854775808, endpoints:[], rpc_endpoints:[], endpoint_details:[])

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Derek Williams
I could be wrong about this, but when repair is run, it isn't just values that are streamed between nodes, it's entire sstables. This causes a lot of duplicate data to be written which was already correct on the node, which needs to be compacted away. As for speeding it up, no idea. On Mon, Jan

Re: cql: show tables in a keystone

2013-01-28 Thread Theo Hultberg
the DESCRIBE family of commands in cqlsh are wrappers around queries to the system keyspace, so if you want to inspect what keyspaces and tables exist from your application you can do something like: SELECT columnfamily_name, comment FROM system.schema_columnfamilies WHERE keyspace_name = 'test';

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread Hiller, Dean
If you write to 4 CF's with the same row key that is considered one mutation Hm, I never considered this, never knew either.(very un-intuitive from a user perspective IMHO). So If I write to CF Users with rowkey=dean and to CF Schedules with rowkey=dean, it is actually one row? (it's so

Problem on node join the ring

2013-01-28 Thread Daning Wang
I add a new node to ring(version 1.1.6), after more than 30 hours, it is still in the 'Joining' state Address DC RackStatus State Load Effective-Ownership Token 141784319550391026443072753096570088105 10.28.78.123datacenter1 rack1 Up

Understanding Virtual Nodes on Cassandra 1.2

2013-01-28 Thread Zhong Li
Hi All, Virtual Nodes is great feature. After I searched some document on Datastax website and some old ticket, seems that it works for random partitioner only, and leaves order preserved partitioner out of the luck. I may misunderstand, please correct me. if it doesn't love order preserved

JNA not found.

2013-01-28 Thread Tim Dunphy
Hey List, I just downloaded 1.21 and have set it up across my cluster. When I noticed the following notice: INFO 18:14:53,828 JNA not found. Native methods will be disabled. So I downloaded jna.jar from git hub and moved it to the cassandra /lib directory. I changed mod to 755 as per the

Re: What is the default 'key_validation_class' on secondary INDEX(es)

2013-01-28 Thread Alan Ristić
2013/1/28 Sylvain Lebresne sylv...@datastax.com If you are asking for the key_validation_class of the Index CF, then it's the column type that defines it Sylvain, that was the one I meant, great. Tnx for explanation. *Alan Ristić* *m*: 040 423 688

1.2 Authentication

2013-01-28 Thread Daning Wang
We were using SimpleAuthenticator on 1.1.x, it worked fine. While testing 1.2, I have put classes under example/simple_authentication in a jar and copy to lib directory, the class is loaded. however, when I try to connect with correct user/password, it gives me error ./cqlsh s2.dsat103-e1a -u

Re: Node selection when both partition key and secondary index field constrained?

2013-01-28 Thread Mike Sample
Thanks Aaron. So basically it's merging the results 2 separate queries: Indexed scan (token-range) intersect foo.flag_index=true where the latter query hits the entire cluster as per the secondary index FAQ entry. Thus the overall query would fail if LOCAL_QUORUM was requested, RF=3 and 2

Re: JNA not found.

2013-01-28 Thread Tim Dunphy
I went to github to try to download jna again. I downloaded version 3.5.1 [root@cassandra-node01 cassandrahome]# ls -l lib/jna-3.5.1.jar -rw-r--r-- 1 root root 692603 Jan 28 21:57 lib/jna-3.5.1.jar I noticed in the datastax docs that java 7 was not recommended so I downgraded to java 6

RE: Accessing Metadata of Column Familes

2013-01-28 Thread Rishabh Agrawal
Any points on the same. - Rishabh - Reply message - From: Rishabh Agrawal rishabh.agra...@impetus.co.in To: user@cassandra.apache.org user@cassandra.apache.org Subject: Accessing Metadata of Column Familes Date: Mon, Jan 28, 2013 5:56 pm I found following issues while working on

Re: data not shown up after some time

2013-01-28 Thread aaron morton
If you are seeing failed secondary index reads you may be seeing this https://issues.apache.org/jira/browse/CASSANDRA-5079 Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 29/01/2013, at 3:31 AM, Matthias Zeilinger

getting error for decimal type data

2013-01-28 Thread Kuldeep Mishra
while I an trying to list column family data using cassandra-cli then I am getting following problem for decimal type data, any suggestion will be appreciated. Exception in thread main java.lang.OutOfMemoryError: Java heap space at

Re: Cassandra timeout whereas it is not much busy

2013-01-28 Thread aaron morton
From what I could read there seems to be a contention issue around the flushing (the switchlock ?). Cassandra would then be slow, but not using the entire cpu. I would be in the strange situation I was where I reported my issue in this thread. Does my theory makes sense ? If you are

Re: cluster issues

2013-01-28 Thread aaron morton
We can always be proactive in keeping the time sync. But, Is there any way to recover from a time drift (in a reactive manner)? Since it was a lab environment, I dropped the KS (deleted data directory) There is a way to remove future dated columns, but it not for the faint hearted.

Re: JDBC, Select * Cql2 vs Cql3 problem ?

2013-01-28 Thread aaron morton
What is your table spec ? Do you have the full stack trace from the exception ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 29/01/2013, at 8:15 AM, Andy Cobley acob...@computing.dundee.ac.uk wrote: I have the

RE: data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
How can I check for this secondary index read fails? Is it in the system.log or over the nodetool? Br, Matthias Zeilinger Production Operation - Shared Services P: +43 (0) 50 858-31185 M: +43 (0) 664 85-34459 E: matthias.zeilin...@bwinparty.com bwin.party services (Austria) GmbH Marxergasse 1B

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread aaron morton
* Why nodetool repair increases the data size that much? It's not likely that much data needs to be repaired. Will that happen for all the subsequent repair? Repair only detects differences in entire rows. If you have very wide rows then small differences in rows can result in a large

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
So If I write to CF Users with rowkey=dean and to CF Schedules with rowkey=dean, it is actually one row? In my mental model that's correct. A RowMutation is a row key and a collection of (internal) ColumnFamilies which contain the columns to write for a single CF. This is the thing that is

Re: Problem on node join the ring

2013-01-28 Thread aaron morton
there is no streaming anymore Nodes only bootstrap once, when they are first started. I have turned on the debug, this is what it is doing now(cpu is pretty much idle), no any error message. Looks like it is receiving writes and reads, looks like it's part of the ring. Is this ring