Re: Error when using CQL driver : No indexed columns present in by-columns clause with equals operator

2013-01-31 Thread Sylvain Lebresne
birth_year is secondary indexed. When querying using 2nd indexes, Cassandra (it's not CQL specific) requires that you use an '=' for at least one of the indexed column in the where clause (in your example you only have one such column so it should be an '='). This is a limitation of 2ndary indexes

Re: too many warnings of Heap is full

2013-01-31 Thread Alain RODRIGUEZ
@Bryan Other space is used but can be more easily controlled by tuning for - memtable - compaction - key cache - *row cache* Isn't row cache stored off-heap since a while ? 2013/1/31 Bryan Talbot btal...@aeriagames.com On Wed, Jan 30, 2013 at 2:44 PM, Guillermo Barbero

Re: Understanding Virtual Nodes on Cassandra 1.2

2013-01-31 Thread Manu Zhang
On Thu 31 Jan 2013 03:43:32 AM CST, Zhong Li wrote: Are there tickets/documents explain how data be replicated on Virtual Nodes? If there are multiple tokens on one physical host, may a chance two or more tokens chosen by replication strategy located on same host? If move/remove/add a token

Re: too many warnings of Heap is full

2013-01-31 Thread Guillermo Barbero
Hi, replies to your answers: The CFs that seems to flush more often (users and messages) do not have multiget queries... There are 4 secondary indexes for users and 3 for messages. I've seen something odd here: the index of the CFs is also stored in a column AND is also indexed. There isn't any

Re: JDBC, Select * Cql2 vs Cql3 problem ?

2013-01-31 Thread Andy Cobley
Hi Edward, I've used Hector for years now myself and know how good it is. I think I've tracked this to a problem in the JDBC driver, I'll post a message and raise a ticket over there. (there is still a possibility that Cassandra 1.2 is sending an incorrect CQLresult set) Andy On 30 Jan

JDBC : CreateresultSet fails with null column in CqlResultSet

2013-01-31 Thread Andy Cobley
As you may be aware I've been trying to track down a problem using JDBC 1.1.2 with Cassandra 1.2.0 I was getting a null pointer exception in the result set. I've done some digging into the JDBC driver and found the following. In CassandraResultSet.java the new result set is Instantiated in

Re: Error when using CQL driver : No indexed columns present in by-columns clause with equals operator

2013-01-31 Thread Dinusha Dilrukshi
Thanks Sylvain.. Regards, Dinusha. On Thu, Jan 31, 2013 at 2:47 PM, Sylvain Lebresne sylv...@datastax.comwrote: birth_year is secondary indexed. When querying using 2nd indexes, Cassandra (it's not CQL specific) requires that you use an '=' for at least one of the indexed column in the

Re: CQL binary protocol

2013-01-31 Thread Gabriel Ciuloaica
Hi, You may take a look to java-driver https://github.com/datastax/java-driver project. It has an implementation for connection pool. Cheers, Gabi On 1/31/13 6:48 PM, Vivek Mishra wrote: Hi, Any connection pool API available for cassandra transport

General question regarding bootstrap and nodetool repair

2013-01-31 Thread Wei Zhu
Hi, After messing around with my Cassandra cluster recently, I think I need some basic understanding on how things work behind scene regarding data streaming. Let's say we have three node cluster with RF = 3.  If node 3 for some reason dies and I want to replace it with a new node with the same

initial_token

2013-01-31 Thread Stephen.M.Thompson
Hi folks, I'm trying to get a multimode setup working, which seems like it should be really simple from the documentation. ERROR 11:41:20,773 Fatal configuration error org.apache.cassandra.exceptions.ConfigurationException: For input string: 85070591730234615865843651857942052864 at

Re: initial_token

2013-01-31 Thread Edward Capriolo
Now by default a new partitioner is chosen Murmer3. The range of tokens used to be something like 0 - 2^127. Now the range of its tokens is -2^64 - 2^64 . You can switch back to random partitioner and follow the old instructions or try to find a new doc with the new instructions. On Thu, Jan 31,

Re: General question regarding bootstrap and nodetool repair

2013-01-31 Thread Wei Zhu
I decided to dig in to the source code, looks like in the case of nodetool repair, if the current node sees the difference between the remote nodes based on the merkle tree calculation, it will start a streamrepair session to ask the remote nodes to stream data between  each other.  But I am

Re: General question regarding bootstrap and nodetool repair

2013-01-31 Thread Rob Coli
On Thu, Jan 31, 2013 at 12:19 PM, Wei Zhu wz1...@yahoo.com wrote: But I am still not sure how about the my first question regarding the bootstrap, anyone? As I understand it, bootstrap occurs from a single replica. Which replica is chosen is based on some internal estimation of which is

Re: initial_token

2013-01-31 Thread Rob Coli
On Thu, Jan 31, 2013 at 12:17 PM, Edward Capriolo edlinuxg...@gmail.com wrote: Now by default a new partitioner is chosen Murmer3. Now = as of 1.2, to be unambiguous. =Rob -- =Robert Coli AIMGTALK - rc...@palominodb.com YAHOO - rcoli.palominob SKYPE - rcoli_palominodb

Re: initial_token

2013-01-31 Thread Edward Capriolo
This is the bad side of changing default. There are going to be a few groups unfortunates. The first group, who only can not setup their cluster, and eventually figure out their tokens. (this thread) The second group, who assume their tokens were correct and run around with an unbalanced cluster

Re: General question regarding bootstrap and nodetool repair

2013-01-31 Thread Wei Zhu
Thanks Rob. I think you are right on it. Here is what I found: https://github.com/apache/cassandra/blob/cassandra-1.1.0/src/java/org/apache/cassandra/dht/RangeStreamer.java#L140 It sorts the end point by proximity and in 

Re: too many warnings of Heap is full

2013-01-31 Thread Nate McCall
There are 4 secondary indexes for users and 3 for messages. I've seen something odd here: the index of the CFs is also stored in a column AND is also indexed. There isn't any query by that secondary index. I should probably erase the index, shouldn't I? What is the cardinality like on these

Re: General question regarding bootstrap and nodetool repair

2013-01-31 Thread Wei Zhu
One more question though, I tried to replace a node with a new node with the same IP, Here is what we did: * drain the node * check nodetool on other nodes, and this node is marked down (the token for this node is 100) * clear the data, commit log, saved cache on the down node. * change

Re: General question regarding bootstrap and nodetool repair

2013-01-31 Thread Rob Coli
On Thu, Jan 31, 2013 at 3:31 PM, Wei Zhu wz1...@yahoo.com wrote: The only reason I can think of is that the new node has the same IP as the dead node we tried to replace? After reading the bootstrap code, it shouldn't be the case. Is it a bug? Or anyone tried to replace a dead node with the

Re: Pycassa vs YCSB results.

2013-01-31 Thread Tyler Hobbs
Can you provide the python script that you're using? (I'm moving this thread to the pycassa mailing list ( pycassa-disc...@googlegroups.com), which is a better place for this discussion.) On Thu, Jan 31, 2013 at 6:25 PM, Pradeep Kumar Mantha pradeep...@gmail.comwrote: Hi, I am trying to

Re: Pycassa vs YCSB results.

2013-01-31 Thread Pradeep Kumar Mantha
Thanks.. Please find the script as attachment. Just re-iterating. Its just a simple python script which submit 4 threads. This script has been scheduled on 8 cores using taskset unix command , thus running 32 threads/node. and then scaling to 16 nodes thanks pradeep On Thu, Jan 31, 2013 at 4:38

Re: Start token sorts after end token

2013-01-31 Thread aaron morton
Out of interest do you have the ticket? A - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 31/01/2013, at 3:19 AM, Edward Capriolo edlinuxg...@gmail.com wrote: Fix is simply to switch to random partitioner. On Wednesday,

Re: Node selection when both partition key and secondary index field constrained?

2013-01-31 Thread aaron morton
So basically it's merging the results 2 separate queries: Indexed scan (token-range) intersect foo.flag_index=true NO. It is doing one query, one the secondary index. When it reads the row keys in that index is discards any outside of the token range, That query is sent to nodes which

Re: Nodetool can not get to 7199 after migrating to 1.2.1

2013-01-31 Thread aaron morton
Check the JMX port listed in /etc/cassandra/cassandra-env.sh If it's not default you can pass it to nodetool using the --jmx-port option. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 31/01/2013, at 3:52 AM,

Re: Suggestion: Move some threads to the client-dev mailing list

2013-01-31 Thread aaron morton
+1 Also where can I learn more about pyhtondra ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 31/01/2013, at 8:09 AM, Rob Coli rc...@palominodb.com wrote: On Wed, Jan 30, 2013 at 7:21 AM, Edward Capriolo