CQL where clause and "OR"

2011-05-02 Thread Sébastien Druon
Hi! Is it possible to use an "OR" operator in the "WHERE" clause of a "SELECT" statement? I cannot find any documentation on that, e.g. SELECT name1 FROM cf1 WHERE name1=value1 OR name1=value2 Thanks a lot in advance, Sébastien Druon

Re: Combining all CFs into one big one

2011-05-02 Thread David Boxenhorn
Wouldn't it be the case that the once-used rows in your batch process would quickly be traded out of the cache, and replaced by frequently-used rows? This would be the case even if your batch process goes on for a long time, since caching is done on a row-by-row basis. In effect, it would mean that

Re: CQL where clause and "OR"

2011-05-02 Thread Miguel Auso
Hi! It's possible, for example: select * from aw_advancednewsletter_subscriptions where id='1' OR id='3'; Regards, Miguel Ángel Ausó 2011/5/2 Sébastien Druon > Hi! > > Is it possible to use an "OR" operator in the "WHERE" clause of a "SELECT" > statement? I do not find any documentation

Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread David Boxenhorn
Is this still true? *Note: The greater-than and less-than operators (> and <) result in key ranges that are inclusive of the terms. There is no supported notion of “strictly” greater-than or less-than; these operators are merely supported as aliases to >= and <=. * I think that making > and < ali

Re: Experiences with Map&Reduce Stress Tests

2011-05-02 Thread Subscriber
Hi Jeremy, thanks for the link. I doubled the rpc_timeout (20 seconds) and reduced the range-batch-size to 2048, but I still get timeouts... Udo On 29.04.2011 at 18:53, Jeremy Hanna wrote: > It sounds like there might be some tuning you can do to your jobs - take a > look at the wiki's Hado

Re: Strange corrupt sstable

2011-05-02 Thread Daniel Doubleday
Just for the record: The problem had nothing to do with bad memory. After some more digging it turned out that due to a bug we wrote invalid utf-8 sequences as row keys. In 0.6 the key tokens are constructed from string decoded bytes. This does not happen anymore in 0.7 files. So what apparentl

Re: CQL where clause and "OR"

2011-05-02 Thread Jonathan Ellis
OR will not be supported for a while yet; however, IN support is in trunk and will be in 0.8.1 (but not 0.8.0). On Mon, May 2, 2011 at 5:10 AM, Miguel Auso wrote: > Hi! > It's possible, > for example: > select * from aw_advancednewsletter_subscriptions where id='1' OR id='3'; > > > Regards, > Migu
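
A minimal sketch of the rewrite this reply points toward: turning a chain of ORs on a single column into one IN list. The helper name `or_to_in` is hypothetical, and the exact IN syntax accepted by 0.8.1 (and which columns may appear in it) is an assumption to verify against the CQL documentation for that release. Values are spliced in unquoted, as in the original question.

```python
def or_to_in(column_family, select_expr, column, values):
    """Build a CQL SELECT that uses IN instead of a chain of ORs on one column.

    Hypothetical helper for illustration only; it does no quoting or escaping.
    """
    if not values:
        raise ValueError("need at least one value")
    in_list = ", ".join(str(v) for v in values)
    return f"SELECT {select_expr} FROM {column_family} WHERE {column} IN ({in_list})"


# The query from the original question, rewritten without OR.
query = or_to_in("cf1", "name1", "name1", ["value1", "value2"])
print(query)
```

On versions without IN support, the same effect could be had by issuing one single-value query per term and merging the results client-side.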

Re: Strange corrupt sstable

2011-05-02 Thread Jonathan Ellis
Thanks for tracking that down! 0.7 OPP adds additional checks (and if you're starting from scratch you should use BOP instead) that keys are valid UTF8, so it shouldn't be an issue there. On Mon, May 2, 2011 at 7:39 AM, Daniel Doubleday wrote: > Just for the record: > > The problem had nothing t

Re: Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread Jonathan Ellis
80% sure that is an obsolete comment. Eric, can you verify? On Mon, May 2, 2011 at 5:22 AM, David Boxenhorn wrote: > Is this still true? > > Note: The greater-than and less-than operators (> and <) result in key > ranges that are inclusive of the terms. There is no supported notion of > “strictl

Re: Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread Eric Evans
On Mon, 2011-05-02 at 09:29 -0500, Jonathan Ellis wrote: > 80% sure that is an obsolete comment. > > Eric, can you verify? No, it's true. -- Eric Evans eev...@rackspace.com
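
A small sketch of what the documented behavior means in practice, assuming an order-preserving partitioner in which keys compare as plain strings (an assumption, not something stated in the thread). The function names are hypothetical. The first function models `>` and `<` acting as `>=` and `<=`; the second shows one way a client could emulate strict bounds by dropping the boundary keys.

```python
def key_range_inclusive(keys, start, end):
    """Model of the behavior described in the thread: bounds are inclusive."""
    return [k for k in sorted(keys) if start <= k <= end]


def key_range_strict(keys, start, end):
    """Client-side emulation of strict > and <: discard the boundary keys."""
    return [k for k in key_range_inclusive(keys, start, end) if k not in (start, end)]


keys = ["a", "b", "c", "d"]
print(key_range_inclusive(keys, "b", "d"))  # ['b', 'c', 'd']
print(key_range_strict(keys, "b", "d"))     # ['c']
```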

Re: Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread Jonathan Ellis
Where is that happening then? RelationType.forString is not lossy, and neither is the RelationType -> IndexExpression conversion in getIndexedSlices. On Mon, May 2, 2011 at 10:30 AM, Eric Evans wrote: > On Mon, 2011-05-02 at 09:29 -0500, Jonathan Ellis wrote: >> 80% sure that is an obsolete comm

Re: Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread Eric Evans
On Mon, 2011-05-02 at 10:41 -0500, Jonathan Ellis wrote: > Where is that happening then? RelationType.forString is not lossy, > and neither is the RelationType -> IndexExpression conversion in > getIndexedSlices. WhereClause.and(Relation) where it assigns start/end keys. -- Eric Evans eev...@ra

Re: Combining all CFs into one big one

2011-05-02 Thread Tyler Hobbs
On Mon, May 2, 2011 at 5:05 AM, David Boxenhorn wrote: > Wouldn't it be the case that the once-used rows in your batch process would > quickly be traded out of the cache, and replaced by frequently-used rows? > Yes, and you'll pay a cache miss penalty for each of the replacements. > This would

Re: Combining all CFs into one big one

2011-05-02 Thread David Boxenhorn
I guess I'm still feeling fuzzy on this because my actual use-case isn't so black-and-white. I don't have any CFs that are accessed purely, or even mostly, in once-through batch mode. What I have is CFs with more and less data, and CFs that are accessed more and less frequently. On Mon, May 2, 20

Re: Combining all CFs into one big one

2011-05-02 Thread Tyler Hobbs
On Mon, May 2, 2011 at 12:06 PM, David Boxenhorn wrote: > I guess I'm still feeling fuzzy on this because my actual use-case isn't so > black-and-white. I don't have any CFs that are accessed purely, or even > mostly, in once-through batch mode. What I have is CFs with more and less > data, and C

Specifying exact nodes for Consistency Levels

2011-05-02 Thread A J
Is it possible in some way to specify what specific nodes I want to include (or exclude) from the Consistency Level fulfillment ? Example, I have a cluster of 4 nodes (n1,n2,n3 and n4) and set N=4. I want to set W=3 and want to ensure that it is n1,n2 and n3 only that are used to satisfy w=3 (i.e.

Re: Terrible CQL idea: > and < aliases of >= and <=

2011-05-02 Thread Jonathan Ellis
Created https://issues.apache.org/jira/browse/CASSANDRA-2592 to address this. On Mon, May 2, 2011 at 10:46 AM, Eric Evans wrote: > On Mon, 2011-05-02 at 10:41 -0500, Jonathan Ellis wrote: >> Where is that happening then?  RelationType.forString is not lossy, >> and neither is the RelationType ->

Re: Specifying exact nodes for Consistency Levels

2011-05-02 Thread Jonathan Ellis
No. On Mon, May 2, 2011 at 12:26 PM, A J wrote: > Is it possible in some way to specify what specific nodes I want to > include (or exclude) from the Consistency Level fulfillment ? > Example, I have a cluster of 4 nodes (n1,n2,n3 and n4) and set N=4. I > want to set W=3 and want to ensure that i

Re: Specifying exact nodes for Consistency Levels

2011-05-02 Thread Robert Jackson
The only way I can think of would be to use NetworkTopologyStrategy and PropertyFileSnitch to define two data centers. Set node1-node3 in DC1 then set node4 in DC2. Then set the strategy_options=[{DC1:3, DC2:1}] for that keyspace. This would make your cluster conceptually similar to a split dat
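
As a rough illustration of the layout suggested here, the sketch below generates the `ip=DC:RACK` lines that PropertyFileSnitch reads from `cassandra-topology.properties`. The IP addresses, rack names, and the helper `topology_properties` are hypothetical placeholders; only the node1-node3 in DC1 / node4 in DC2 split comes from the message.

```python
# Hypothetical addresses and rack names; substitute the real values.
nodes = {
    "10.0.0.1": ("DC1", "RAC1"),  # node1
    "10.0.0.2": ("DC1", "RAC1"),  # node2
    "10.0.0.3": ("DC1", "RAC1"),  # node3
    "10.0.0.4": ("DC2", "RAC1"),  # node4
}


def topology_properties(nodes, default=("DC1", "RAC1")):
    """Render node -> (data center, rack) pairs as topology property lines."""
    lines = [f"{ip}={dc}:{rack}" for ip, (dc, rack) in sorted(nodes.items())]
    # Fallback entry used for nodes not listed explicitly.
    lines.append(f"default={default[0]}:{default[1]}")
    return "\n".join(lines)


print(topology_properties(nodes))
```

The keyspace would then use NetworkTopologyStrategy with the per-data-center replica counts given in the message (DC1:3, DC2:1).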

Re: Experiences with Map&Reduce Stress Tests

2011-05-02 Thread Jeremy Hanna
Udo, One thing to get out of the way - you're running task trackers on all of your cassandra nodes, right? That is the first and foremost way to get good performance. Otherwise you don't have data locality, which is really the point of map/reduce, co-locating your data and your processes oper

Replica data distributing between racks

2011-05-02 Thread Eric tamme
I am experiencing an issue where replication is not being distributed between racks when using PropertyFileSnitch in conjunction with NetworkTopologyStrategy. I am running 0.7.3 from a tar.gz from cassandra.apache.org. I have 4 nodes, 2 data centers, and 2 racks in each data center. Each rack has

Re: Replica data distributing between racks

2011-05-02 Thread aaron morton
That appears to be working correctly, but does not sound great. When the NTS selects replicas in a DC it orders the tokens available in the DC, then (in the first pass) iterates through them placing a replica in each unique rack. e.g. if the RF in each DC was 2, the replicas would be put on 2

Re: Replica data distributing between racks

2011-05-02 Thread Jonathan Ellis
On Mon, May 2, 2011 at 2:18 PM, aaron morton wrote: > When the NTS selects replicas in a DC it orders the tokens available in  the > DC, then (in the first pass) iterates through them placing a replica in each > unique rack.  e.g. if the RF in each DC was 2, the replicas would be put on 2 > uni

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 3:22 PM, Jonathan Ellis wrote: > On Mon, May 2, 2011 at 2:18 PM, aaron morton wrote: >> When the NTS selects replicas in a DC it orders the tokens available in  the >> DC, then (in the first pass) iterates through them placing a replica in each >> unique rack.  e.g. if th

Re: Replica data distributing between racks

2011-05-02 Thread aaron morton
My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() work. Eric, can you show the output from nodetool ring ? Aaron On 3 May 2011, at 07:30, Eric tamme wrote: > On Mon, May 2, 2011 at 3:22 PM, Jonathan Ellis wrote: >> On Mon, May 2, 2011 at 2:18 PM, aaron morton wrot

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 5:59 PM, aaron morton wrote: > My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() > work. > > Eric, can you show the output from nodetool ring ? > Here is output from nodetool ring - ip addresses changed obviously. Address Status State Lo

Re: Replica data distributing between racks

2011-05-02 Thread Eric tamme
On Mon, May 2, 2011 at 5:59 PM, aaron morton wrote: > My bad, I missed the way TokenMetadata.ringIterator() and firstTokenIndex() > work. > > Eric, can you show the output from nodetool ring ? > > Sorry if the previous paste was way too unformatted, here is a pastie.org link with nicer formatting

Cassandra Meetup in DC

2011-05-02 Thread Chris Burroughs
http://www.meetup.com/Cassandra-DC-Meetup/ *What*: First Cassandra DC Meetup *When*: Thursday, May 12, 2011 at 6:30 PM *Where*: Northside Social Coffee & Wine - 3211 Wilson Blvd Arlington, VA I'm pleased to announce the first Cassandra DC Meetup

Building from source from behind firewall since Maven switch?

2011-05-02 Thread Suan-Aik Yeo
Has anybody successfully built-from-source from behind a firewall since 0.7.1 when they switched from using Ivy to Maven? We were able to build from source in 0.7.0 by inserting "proxy.host" and "proxy.port" property names in build.xml and exporting -Dhttp.proxyHost and -Dhttp.proxyPort variabl

Unable to add columns to empty row in Column family: Cassandra

2011-05-02 Thread anuya joshi
Hello, I am using Cassandra for my application. My Cassandra client uses Thrift APIs directly. The problem I am facing currently is as follows: 1) I added a row and columns in it dynamically via Thrift API Client 2) Next, I used command line client to delete row which actually deleted all the colu

Re: Building from source from behind firewall since Maven switch?

2011-05-02 Thread Stephen Connolly
-autoproxy worked for me when I wrote the original patch, but as I no longer work for the company where I wrote the patch, I don't have a firewall to deal with. Worst case, you might have to create a ~/.m2/settings.xml with the proxy details... If that is the case, can you raise a JIRA in MANTTASKS (

Re: Unable to add columns to empty row in Column family: Cassandra

2011-05-02 Thread chovatia jaydeep
Hi Anuya, > However, columns are not being inserted. Do you mean to say that after insert operation you couldn't retrieve the same data? If so, then please check the time-stamp when you reinserted after delete operation. Your second insertion time-stamp has to be greater than the previous ins

Re: Unable to add columns to empty row in Column family: Cassandra

2011-05-02 Thread chovatia jaydeep
One small correction to my mail below: the second insertion time-stamp has to be greater than the delete time-stamp in order to retrieve the data. Thank you, Jaydeep From: chovatia jaydeep To: "user@cassandra.apache.org" Sent: Monday, 2 May 2011 11:52 PM Subject: Re:
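
The timestamp rule in this correction can be sketched with a toy in-memory model. This is not Cassandra code: the `Row` class and its methods are hypothetical, and tie-breaking between writes with equal timestamps is simplified. It only illustrates that a column written with a timestamp at or below the row's delete timestamp stays hidden.

```python
class Row:
    """Toy model of one row: columns plus an optional row-level tombstone."""

    def __init__(self):
        self.columns = {}       # column name -> (value, timestamp)
        self.deleted_at = None  # timestamp of the latest row delete, if any

    def insert(self, name, value, ts):
        # Keep the write with the highest timestamp for each column.
        current = self.columns.get(name)
        if current is None or ts > current[1]:
            self.columns[name] = (value, ts)

    def delete(self, ts):
        # Remember the newest delete timestamp.
        if self.deleted_at is None or ts > self.deleted_at:
            self.deleted_at = ts

    def read(self):
        # A column is visible only if it is newer than the row delete.
        return {
            name: value
            for name, (value, ts) in self.columns.items()
            if self.deleted_at is None or ts > self.deleted_at
        }


row = Row()
row.insert("c1", "x", ts=100)
row.delete(ts=200)
row.insert("c1", "y", ts=150)   # older than the delete: stays hidden
hidden = row.read()
row.insert("c1", "y", ts=201)   # newer than the delete: visible
visible = row.read()
print(hidden, visible)
```

In this model the re-insert at timestamp 150 is silently shadowed, which matches the symptom described in the thread when a client supplies timestamps that do not advance past the delete.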