CQL where clause and OR

2011-05-02 Thread Sébastien Druon
Hi! Is it possible to use an OR operator in the WHERE clause of a SELECT statement? I do not find any documentation on that i.e. SELECT name1 FROM cf1 WHERE name1=value1 OR name1=value2 Thanks a lot in advance Sébastien Druon

Terrible CQL idea: and aliases of = and =

2011-05-02 Thread David Boxenhorn
Is this still true? *Note: The greater-than and less-than operators ( and ) result in key ranges that are inclusive of the terms. There is no supported notion of “strictly” greater-than or less-than; these operators are merely supported as aliases to = and =. * I think that making and aliases

Re: Experiences with MapReduce Stress Tests

2011-05-02 Thread Subscriber
Hi Jeremy, thanks for the link. I doubled the rpc_timeout (20 seconds) and reduced the range-batch-size to 2048, but I still get timeouts... Udo Am 29.04.2011 um 18:53 schrieb Jeremy Hanna: It sounds like there might be some tuning you can do to your jobs - take a look at the wiki's

Re: Strange corrupt sstable

2011-05-02 Thread Daniel Doubleday
Just for the record: The problem had nothing to do with bad memory. After some more digging it turned out that due to a bug we wrote invalid utf-8 sequences as row keys. In 0.6 the key tokens are constructed from string decoded bytes. This does not happen anymore in 0.7 files. So what

Re: Terrible CQL idea: and aliases of = and =

2011-05-02 Thread Jonathan Ellis
80% sure that is an obsolete comment. Eric, can you verify? On Mon, May 2, 2011 at 5:22 AM, David Boxenhorn da...@taotown.com wrote: Is this still true? Note: The greater-than and less-than operators ( and ) result in key ranges that are inclusive of the terms. There is no supported notion

Re: Terrible CQL idea: and aliases of = and =

2011-05-02 Thread Eric Evans
On Mon, 2011-05-02 at 09:29 -0500, Jonathan Ellis wrote: 80% sure that is an obsolete comment. Eric, can you verify? No, it's true. -- Eric Evans eev...@rackspace.com

Re: Terrible CQL idea: and aliases of = and =

2011-05-02 Thread Jonathan Ellis
Where is that happening then? RelationType.forString is not lossy, and neither is the RelationType - IndexExpression conversion in getIndexedSlices. On Mon, May 2, 2011 at 10:30 AM, Eric Evans eev...@rackspace.com wrote: On Mon, 2011-05-02 at 09:29 -0500, Jonathan Ellis wrote: 80% sure that is

Re: Terrible CQL idea: and aliases of = and =

2011-05-02 Thread Eric Evans
On Mon, 2011-05-02 at 10:41 -0500, Jonathan Ellis wrote: Where is that happening then? RelationType.forString is not lossy, and neither is the RelationType - IndexExpression conversion in getIndexedSlices. WhereClause.and(Relation) where it assigns start/end keys. -- Eric Evans

Re: Combining all CFs into one big one

2011-05-02 Thread Tyler Hobbs
On Mon, May 2, 2011 at 5:05 AM, David Boxenhorn da...@taotown.com wrote: Wouldn't it be the case that the once-used rows in your batch process would quickly be traded out of the cache, and replaced by frequently-used rows? Yes, and you'll pay a cache miss penalty for each of the replacements.

Re: Combining all CFs into one big one

2011-05-02 Thread David Boxenhorn
I guess I'm still feeling fuzzy on this because my actual use-case isn't so black-and-white. I don't have any CFs that are accessed purely, or even mostly, in once-through batch mode. What I have is CFs with more and less data, and CFs that are accessed more and less frequently. On Mon, May 2,

Re: Combining all CFs into one big one

2011-05-02 Thread Tyler Hobbs
On Mon, May 2, 2011 at 12:06 PM, David Boxenhorn da...@taotown.com wrote: I guess I'm still feeling fuzzy on this because my actual use-case isn't so black-and-white. I don't have any CFs that are accessed purely, or even mostly, in once-through batch mode. What I have is CFs with more and

Specifying exact nodes for Consistency Levels

2011-05-02 Thread A J
Is it possible in some way to specify what specific nodes I want to include (or exclude) from the Consistency Level fulfillment ? Example, I have a cluster of 4 nodes (n1,n2,n3 and n4) and set N=4. I want to set W=3 and want to ensure that it is n1,n2 and n3 only that are used to satisfy w=3 (i.e.

Re: Terrible CQL idea: and aliases of = and =

2011-05-02 Thread Jonathan Ellis
Created https://issues.apache.org/jira/browse/CASSANDRA-2592 to address this. On Mon, May 2, 2011 at 10:46 AM, Eric Evans eev...@rackspace.com wrote: On Mon, 2011-05-02 at 10:41 -0500, Jonathan Ellis wrote: Where is that happening then?  RelationType.forString is not lossy, and neither is the

Re: Specifying exact nodes for Consistency Levels

2011-05-02 Thread Jonathan Ellis
No. On Mon, May 2, 2011 at 12:26 PM, A J s5a...@gmail.com wrote: Is it possible in some way to specify what specific nodes I want to include (or exclude) from the Consistency Level fulfillment ? Example, I have a cluster of 4 nodes (n1,n2,n3 and n4) and set N=4. I want to set W=3 and want to

Re: Specifying exact nodes for Consistency Levels

2011-05-02 Thread Robert Jackson
The only way I can think of would be to use NetworkTopologyStrategy and PropertyFileSnitch to define two data centers. Set node1-node3 in DC1 then set node4 in DC2. Then set the strategy_options=[{DC1:3, DC2:1}] for that keyspace. This would make your cluster conceptually similar to a split

Re: Experiences with MapReduce Stress Tests

2011-05-02 Thread Jeremy Hanna
Udo, One thing to get out of the way - you're running task trackers on all of your cassandra nodes, right? That is the first and foremost way to get good performance. Otherwise you don't have data locality, which is really the point of map/reduce, co-locating your data and your processes

Replica data distributing between racks

2011-05-02 Thread Eric tamme
I am experiencing an issue where replication is not being distributed between racks when using PropertyFileSnitch in conjunction with NetworkTopologyStrategy. I am running 0.7.3 from a tar.gz on cassandra.apache.org I have 4 nodes, 2 data centers, and 2 racks in each data center. Each rack has

Re: Replica data distributing between racks

2011-05-02 Thread aaron morton
That appears to be working correctly, but does not sound great. When the NTS selects replicas in a DC it orders the tokens available in the DC, then (in the first pass) iterates through them placing a replica in each unique rack. e.g. if the RF in each DC was 2, the replicas would be put on

Re: Replica data distributing between racks

2011-05-02 Thread Jonathan Ellis
On Mon, May 2, 2011 at 2:18 PM, aaron morton aa...@thelastpickle.com wrote: When the NTS selects replicas in a DC it orders the tokens available in  the DC, then (in the first pass) iterates through them placing a replica in each unique rack.  e.g. if the RF in each DC was 2, the replicas

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 3:22 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, May 2, 2011 at 2:18 PM, aaron morton aa...@thelastpickle.com wrote: When the NTS selects replicas in a DC it orders the tokens available in  the DC, then (in the first pass) iterates through them placing a replica

Re: Replica data distributing between racks

2011-05-02 Thread aaron morton
My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() work. Eric, can you show the output from nodetool ring ? Aaron On 3 May 2011, at 07:30, Eric tamme wrote: On Mon, May 2, 2011 at 3:22 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, May 2, 2011 at 2:18 PM,

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 5:59 PM, aaron morton aa...@thelastpickle.com wrote: My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() work. Eric, can you show the output from nodetool ring ? Here is output from nodtool ring - ip addresses changed obviously. Address

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 5:59 PM, aaron morton aa...@thelastpickle.com wrote: My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() work. Eric, can you show the output from nodetool ring ? Sorry if the previous paste was way to unformatted, here is a pastie.org link

Cassandra Meetup in DC

2011-05-02 Thread Chris Burroughs
http://www.meetup.com/Cassandra-DC-Meetup/ *What*: First Cassandra DC Meetup *When*: Thursday, May 12, 2011 at 6:30 PM *Where*: Northside Social Coffee Wine - 3211 Wilson Blvd Arlington, VA I'm pleased to announce the the first Cassandra DC Meetup

Building from source from behind firewall since Maven switch?

2011-05-02 Thread Suan-Aik Yeo
Has anybody successfully built-from-source from behind a firewall since 0.7.1 when they switched from using Ivy to Maven? We were able to build from source in 0.7.0 by inserting proxy.host and proxy.port property names in build.xml and exporting -Dhttp.proxyHost and -Dhttp.proxyPort variables