Re: Doubt regarding CQL

2012-02-22 Thread Mateusz Korniak
On Wednesday 22 of February 2012, Rishabh Agrawal wrote: I have installed CQL drivers for python. When I try execute cqlsh I get following error cql-1.0.3$ cqlsh localhost 9160 (...) File /usr/local/lib/python2.7/dist-packages/cql/cassandra/ttypes.py, line 7, in module from thrift.Thrift

Re: [BETA RELEASE] Apache Cassandra 1.1.0-beta1 released

2012-02-22 Thread Sylvain Lebresne
Arf, you'r right sorry. I've fixed it (but it could take ~1 to get propagated to all apache mirrors). -- SYlvain On Wed, Feb 22, 2012 at 2:46 AM, Maki Watanabe watanabe.m...@gmail.com wrote: The link is wrong.

Re: What linux distro for the Cassandra nodes ?

2012-02-22 Thread aaron morton
Here are the platforms the data stax distro supports http://www.datastax.com/products/community/platforms Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 22/02/2012, at 12:11 AM, Aditya Gupta wrote: I am about to choose a linux distro

RE: Doubt regarding CQL

2012-02-22 Thread Rishabh Agrawal
Thanks for the reply I installed 0.8.0 drift package. But still problem persists. -Original Message- From: Mateusz Korniak [mailto:mateusz-li...@ant.gliwice.pl] Sent: Wednesday, February 22, 2012 1:47 PM To: user@cassandra.apache.org Subject: Re: Doubt regarding CQL On Wednesday 22 of

List all keys with RandomPartitioner

2012-02-22 Thread Flavio Baronti
I need to iterate over all the rows in a column family stored with RandomPartitioner. When I reach the end of a key slice, I need to find the token of the last key in order to ask for the next slice. I saw in an old email that the token for a specific key can be recoveder through

Re: List all keys with RandomPartitioner

2012-02-22 Thread Henrik Schröder
I had to port that piece of code to C#, and it's just a few lines of code, so just write your own. Here's the original so you can see what it does: http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=blob_plain;f=src/java/org/apache/cassandra/utils/FBUtilities.java;hb=refs/heads/trunk

Re: List all keys with RandomPartitioner

2012-02-22 Thread Franc Carter
On Wed, Feb 22, 2012 at 8:47 PM, Flavio Baronti f.baro...@list-group.comwrote: I need to iterate over all the rows in a column family stored with RandomPartitioner. When I reach the end of a key slice, I need to find the token of the last key in order to ask for the next slice. I saw in an

Re: List all keys with RandomPartitioner

2012-02-22 Thread Flavio Baronti
Il 2/22/2012 12:24 PM, Franc Carter ha scritto: On Wed, Feb 22, 2012 at 8:47 PM, Flavio Baronti f.baro...@list-group.com mailto:f.baro...@list-group.com wrote: I need to iterate over all the rows in a column family stored with RandomPartitioner. When I reach the end of a key slice, I

Re: List all keys with RandomPartitioner

2012-02-22 Thread Rafael Almeida
From: Franc Carter franc.car...@sirca.org.au To: user@cassandra.apache.org Sent: Wednesday, February 22, 2012 9:24 AM Subject: Re: List all keys with RandomPartitioner On Wed, Feb 22, 2012 at 8:47 PM, Flavio Baronti f.baro...@list-group.com wrote: I need to

Re: List all keys with RandomPartitioner

2012-02-22 Thread R. Verlangen
You can leave the end key empty. 1) Start with startkey = 2) Next iteration start with startkey = last key of the previous batch 3) Keep on going until you ran out of results 2012/2/22 Rafael Almeida almeida...@yahoo.com From: Franc Carter

Re: Doubt regarding CQL

2012-02-22 Thread paul cannon
Rishabh- It looks like you're not actually using the cqlsh that comes with Cassandra 1.0.7. Are you using an old version of the Python CQL driver? Old versions of the driver had cqlsh bundled with it, instead of with Cassandra. The 1.0.7 Debian/Ubuntu packages do not include cqlsh, because of

Please advise -- 750MB object possible?

2012-02-22 Thread Maxim Potekhin
Hello everybody, I'm being asked whether we can serve an object, which I assume is a blob, of 750MB size? I guess the real question is of how to chunk it and/or even it's possible to chunk it. Thanks! Maxim

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Dan Retzlaff
Chunking is a good idea, but you'll have to do it yourself. A few of the columns in our application got quite large (maybe ~150MB) and the failure mode was RPC timeout exceptions. Nodes couldn't always move that much data across our data center interconnect in the default 10 seconds. With enough

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Mohit Anchlia
In my opinion if you are busy site or application keep blobs out of the database. On Wed, Feb 22, 2012 at 9:37 AM, Dan Retzlaff dretzl...@gmail.com wrote: Chunking is a good idea, but you'll have to do it yourself. A few of the columns in our application got quite large (maybe ~150MB) and the

Re: reads/s suddenly dropped

2012-02-22 Thread Franc Carter
On Mon, Feb 20, 2012 at 9:42 PM, Franc Carter franc.car...@sirca.org.auwrote: On Mon, Feb 20, 2012 at 12:00 PM, aaron morton aa...@thelastpickle.comwrote: Aside from iostats.. nodetool cfstats will give you read and write latency for each CF. This is the latency for the operation on each

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Rafael Almeida
Keep them where? From: Mohit Anchlia mohitanch...@gmail.com To: user@cassandra.apache.org Cc: potek...@bnl.gov Sent: Wednesday, February 22, 2012 3:44 PM Subject: Re: Please advise -- 750MB object possible? In my opinion if you are busy site or application

Re: Please advise -- 750MB object possible?

2012-02-22 Thread R. Verlangen
I would suggest you chunk them down into small pieces (~ 10-50MB) and just fetch all the parts you need. A problem might be that if fetching one fails, the whole blob is useless. 2012/2/22 Rafael Almeida almeida...@yahoo.com Keep them where? -- *From:* Mohit

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Mohit Anchlia
Outside on the file system and a pointer to it in C* On Wed, Feb 22, 2012 at 10:03 AM, Rafael Almeida almeida...@yahoo.comwrote: Keep them where? -- *From:* Mohit Anchlia mohitanch...@gmail.com *To:* user@cassandra.apache.org *Cc:* potek...@bnl.gov *Sent:*

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Maxim Potekhin
The idea was to provide redundancy, resilience, automatic load balancing and automatic repairs. Going the way of the file system does not achieve any of that. Maxim On 2/22/2012 1:34 PM, Mohit Anchlia wrote: Outside on the file system and a pointer to it in C* On Wed, Feb 22, 2012 at 10:03

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Mohit Anchlia
unless you use distributed fs On Wed, Feb 22, 2012 at 10:37 AM, Maxim Potekhin potek...@bnl.gov wrote: The idea was to provide redundancy, resilience, automatic load balancing and automatic repairs. Going the way of the file system does not achieve any of that. Maxim On 2/22/2012 1:34

Re: Flume and Cassandra

2012-02-22 Thread aaron morton
Maybe Storm is what you are looking for (as well as flume to get the messages from the network) http://www.datastax.com/events/cassandranyc2011/presentations/marz Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 22/02/2012, at 2:23 AM,

Re: Issue regarding 'describe' keyword in 1.0.7 version.

2012-02-22 Thread aaron morton
Rishabh could you give an example ? Was this using the CLI ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 22/02/2012, at 2:55 AM, Dave Brosius wrote: What it's saying is if you define a KeySpace Foo and under it a ColumnFamily

Re: Logging 'write' operations

2012-02-22 Thread aaron morton
In theory you could grab the commit log, you would then have to work with internal cassandra structures to understand what is in there. Otherwise do it at the app level or design it into your data model. Cheers - Aaron Morton Freelance Developer @aaronmorton

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Rob Coli
On Wed, Feb 22, 2012 at 10:37 AM, Maxim Potekhin potek...@bnl.gov wrote: The idea was to provide redundancy, resilience, automatic load balancing and automatic repairs. Going the way of the file system does not achieve any of that. (Apologies for continuing slightly OT thread, but if people

Chicago Cassandra Meetup on 3/1 (Preview of my Pycon talk)

2012-02-22 Thread Jeremiah Jordan
I am going to be doing a trial run of my Pycon talk about setting up a development instance of Cassandra and accessing it from Python (Pycassa mostly, some thrift just to scare people off of using thrift) for a Chicago Cassandra Meetup. Anyone in Chicago feel free to come by. The talk is

nodetool ring runs very slow

2012-02-22 Thread Feng Qu
We noticed that nodetool ring sometimes returns in 17-20 sec while it normally runs in less than a sec. There were some compaction running when it happened. Did compaction cause nodetool slowness? Anything else I should check? time nodetool -h hostname ring real 0m17.595s user 0m0.339s sys

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Maxim Potekhin
Thank you so much, looks nice, I'll be looking into it. On 2/22/2012 3:08 PM, Rob Coli wrote: On Wed, Feb 22, 2012 at 10:37 AM, Maxim Potekhin potek...@bnl.gov mailto:potek...@bnl.gov wrote: The idea was to provide redundancy, resilience, automatic load balancing and automatic

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Rustam Aliyev
Hi Maxim, If you need to store Blobs, then BlobStores such as OpenStack Object Store (aka Swift) should be better choise. As far as I know, MogileFS (which is also a sort of BlobStore) has scalability bottleneck - MySQL. There are few reasons why BlobStores are better choise. In the

Best suitable value for flush_largest_memtables_at

2012-02-22 Thread Roshan Pradeep
Hi Experts Under massive write load what would be the best value for Cassandra * flush_largest_memtables_at* setting? Yesterday I got an OOM exception in one of our production Cassandra node under heavy write load within 5 minute duration. I change the above setting value to .45 and also change

Re: Flume and Cassandra

2012-02-22 Thread Edward Capriolo
I have been working on IronCount (https://github.com/edwardcapriolo/IronCount/) which is designed to do what you are talking about. Kafka takes care of the distributed producer/consumer message queues and IronCount sets up custom consumers to process those messages. It might be what your are

Re: Flume and Cassandra

2012-02-22 Thread Milind Parikh
Coolwww.countandra.org calls them cascaded counters and it will be also based on Kafka. /*** sent from my android...please pardon occasional typos as I respond @ the speed of thought / On Feb 22, 2012 7:22 PM, Edward Capriolo edlinuxg...@gmail.com

Re: Please advise -- 750MB object possible?

2012-02-22 Thread Edward Capriolo
Someone has backended mongo's gridfs into Cassandra but I can not find it on Github atm On Wed, Feb 22, 2012 at 6:51 PM, Rustam Aliyev rus...@code.az wrote: Hi Maxim, If you need to store Blobs, then BlobStores such as OpenStack Object Store (aka Swift) should be better choise. As far as I

insert performance

2012-02-22 Thread Deno Vichas
all, would i be better off (i'm in java land) with spawning a bunch of threads that all add a single item to a mutator or a single thread that adds a bunch of items to a mutator? thanks, deno

Re: How to delete a range of columns using first N components of CompositeType Column?

2012-02-22 Thread Praveen Baratam
More precisely, Lets say we have a CF with the following spec. create column family Test with comparator = 'CompositeType(UTF8Type,UTF8Type,UTF8Type)' and key_validation_class = 'UTF8Type' and default_validation_class = 'UTF8Type'; And I have columns such as: Jack:Name:First -