Re: atomicity of a row write

2012-01-24 Thread Sylvain Lebresne
On Mon, Jan 23, 2012 at 9:20 PM, Guy Incognito dnd1...@gmail.com wrote: hi all, having read: http://wiki.apache.org/cassandra/FAQ#batch_mutate_atomic i would like some clarification: is a write to a single row key in a single column family atomic in the sense that i can do a batch mutate

Re: Return list by limit 1 which is NOT null

2012-01-24 Thread aaron morton
No, you will need to filter client side. Can you do it in a single row ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 24/01/2012, at 10:00 AM, Eric Martell wrote: HI, I am trying to create a keys list which I will fetch the key

Re: SSTable compaction issue in our system

2012-01-24 Thread aaron morton
When the default compaction strategy the SSTables are grouped into buckets, where the size of every sstable int he bucket is within 50% of the average size of files in the bucket. There is also a catch all first bucket for all files less than 50MB (by default). The min_compaction_threshold CF

Re: Tips for using OrderedPartitioner

2012-01-24 Thread aaron morton
Nothing I can thin of other than making the keys uniform. Having a single index row with the RP can be a pain. Is there a way to partition it ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 23/01/2012, at 11:42 PM, Tharindu Mathew

Re: architectural understanding of write operation node flow

2012-01-24 Thread Peter Dijkshoorn
yeah, well main question remains then, is the node receiving the request from the client called the coordinator (even if it is not responsible for that key)? Or will that node forward the call to the first responsible node who does the coordinating stuff? (as the cassandra and dynamo paper state)

Re: architectural understanding of write operation node flow

2012-01-24 Thread Sylvain Lebresne
On Tue, Jan 24, 2012 at 9:57 AM, Peter Dijkshoorn peter.dijksho...@adyen.com wrote: yeah, well main question remains then, is the node receiving the request from the client called the coordinator (even if it is not responsible for that key)? Yes. Or will that node forward the call to the

Re: Tips for using OrderedPartitioner

2012-01-24 Thread R. Verlangen
If you would like to index your rows in an index-row, you could also choose for indexing the index-rows. This will scale up for any needs and create a tree structure. 2012/1/24 aaron morton aa...@thelastpickle.com Nothing I can thin of other than making the keys uniform. Having a single index

Remote Hadoop Job Deployment

2012-01-24 Thread Brian O'Neill
FYI... we finally got around to releasing a version of Virgil that includes the ability to deploy jobs to remote Hadoop clusters running against Cassandra Column Families. http://brianoneill.blogspot.com/2012/01/virgil-remote-hadoop-job-deployment-via.html This has enabled an army of people to

Thrift framed transport for cocoa

2012-01-24 Thread Jools Enticknap
Hi All, I've been writing a OSX cassandra data browser, and found that later versions now only work using the FramedTransport. As a result I've created a Framed Transport for the thrift cocoa library, and created a JIRA THRIFT-1503. Cheers, -- Jools Enticknap

Re: Return list by limit 1 which is NOT null

2012-01-24 Thread Eric Martell
Thanks Aaron. How do I implement that in the row? Does row will not have any tombstones? Please let me know. Thanks and Regards. --- On Tue, 1/24/12, aaron morton aa...@thelastpickle.com wrote: From: aaron morton aa...@thelastpickle.com Subject: Re: Return list by limit 1 which is NOT null To:

Unsubscribe

2012-01-24 Thread mike.li
This email was sent to you by Thomson Reuters, the global news and information company. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.

Command to display config values

2012-01-24 Thread A J
Is there a command in cqlsh or cassandra CLI that can display the various values of the configuration parameters at use. I am particularly interested in finding the value of ' commitlog_sync' that the current session is using ? Thanks. AJ

Enable thrift logging

2012-01-24 Thread ruslan usifov
Hello I try to log thrift log message (this need to us for solve communicate problem between Cassandra daemon and php client ), so in log4j-server.properties i write follow lines: log4j.logger.org.apache.thrift.transport=DEBUG,THRIFT log4j.appender.THRIFT=org.apache.log4j.RollingFileAppender

Re: Cassandra x MySQL Sharded - Insert Comparison

2012-01-24 Thread Maxim Potekhin
a) I hate to break it to you, but 6GB x 4 cores != 'high-end machine'. It's pretty much middle of the road consumer level these days. b) Hosting the client and Cassandra on the same node is a Bad Idea. It will depend on what exactly the client will do, but in my experience it won't work too

Re: Enable thrift logging

2012-01-24 Thread R. Verlangen
Pick a custom loglevel and redirect them with the /etc/syslog.conf ? 2012/1/24 ruslan usifov ruslan.usi...@gmail.com Hello I try to log thrift log message (this need to us for solve communicate problem between Cassandra daemon and php client ), so in log4j-server.properties i write follow

unsubscribe

2012-01-24 Thread mike.li
This email was sent to you by Thomson Reuters, the global news and information company. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.

Re: Return list by limit 1 which is NOT null

2012-01-24 Thread aaron morton
Tombstoned rows are returned without any columns. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 25/01/2012, at 4:37 AM, Eric Martell wrote: Thanks Aaron. How do I implement that in the row? Does row will not have any tombstones?

Re: Command to display config values

2012-01-24 Thread aaron morton
Nothing through those API's, can you check the yaml file ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 25/01/2012, at 10:10 AM, A J wrote: Is there a command in cqlsh or cassandra CLI that can display the various values of the

Re: Enable thrift logging

2012-01-24 Thread aaron morton
Do you want to log from inside the thrift code or from the cassandra thrift classes ? if it's the later try log4j.logger.org.apache.thrift=DEBUG,THRIFT org.apache.thrift.transport is part of thrift proper. Cheers - Aaron Morton Freelance Developer @aaronmorton

Re: SSTable compaction issue in our system

2012-01-24 Thread aaron morton
The issue with major / manual compaction is that it creates a one file. One big old file. That one file will not be compacted unless there are (min_compaction_threshold -1) other files of a similar size. So thombstones and overwrites in that file may not be purged for a long time. If you

Re: Enable thrift logging

2012-01-24 Thread ruslan usifov
2012/1/25 aaron morton aa...@thelastpickle.com Do you want to log from inside the thrift code or from the cassandra thrift classes ? Exceptions happens inside thrift, so inside thrift:-))) if it's the later try log4j.logger.org.apache.thrift=DEBUG,THRIFT org.apache.thrift.transport is

Re: Cassandra performance question

2012-01-24 Thread Jonathan Ellis
No argument there. Thanks for explaining what you were doing to encrypt client traffic! On Mon, Jan 23, 2012 at 10:11 PM, Chris Marino ch...@vcider.com wrote: Hi Jonathan, yes, when I say 'node encryption' I mean inter-Cassandra node encryption. When I say 'client encryption' I mean encrypted

unsubscribe

2012-01-24 Thread Hitomi Benedetti

unsubscribe

2012-01-24 Thread Eric Hunt

unsubscribe

2012-01-24 Thread Shen
Shen Sizhong 在 2012-1-25,上午8:57,Eric Hunt eric.o.h...@gmail.com 写道:

Re: unsubscribe

2012-01-24 Thread Dave Brosius
Folks who wish to unsubscribe should sent a blank email to the following address user-unsubscr...@cassandra.apache.org mailto:user-unsubscr...@cassandra.apache.org

Python CQL Batching is slower than single statements

2012-01-24 Thread Blake Visin
So I decided that it would be beneficial to use batching in my application since I am doing many, many inserts. When I implemented batching in CQL using 'BEGIN BATCH'..'APPLY BATCH' I saw a significant decrease in the speed of inserts, no matter the number of insert statements I included between

Re: Command to display config values

2012-01-24 Thread A J
Yes, I can see the yaml files. But I need to confirm through some database query that the change in yaml on node restart was picked up by the database. On Tue, Jan 24, 2012 at 7:07 PM, aaron morton aa...@thelastpickle.com wrote: Nothing through those API's, can you check the yaml file ? Cheers

Re: Cassandra usage

2012-01-24 Thread Maxim Potekhin
You provide zero information on what you are planning to do with the data. Thus, your question is impossible to answer. On 1/24/2012 9:38 PM, francesco.tangari@gmail.com wrote: Do you think that for a standard project with 50.000.000 of rows on 2-3 machines cassandra is appropriate or i

Re: Cassandra usage

2012-01-24 Thread francesco . tangari . inf
Standard analysis, display or aggregate some rows or standard operations that i can do on a normal dbms -- francesco.tangari@gmail.com Inviato con Sparrow (http://www.sparrowmailapp.com/?sig) Il giorno mercoledì 25 gennaio 2012, alle ore 04.26, Maxim Potekhin ha scritto: You

Re: Cassandra x MySQL Sharded - Insert Comparison

2012-01-24 Thread Gustavo Gustavo
I was able to make Cassandra beat MySQL MyISAM (~10k inserts/s against 6k inserts/s) using two physical machines (laptops) - one the client, and the other one the server, with 50 inserting threads. I don't know exactly why yet, but the high-level client that I was using to C# (Aquiles) was taking

CF Comparator type and get_slice

2012-01-24 Thread Gustavo Gustavo
I suppose that a CF Comparator type is used to sort the columns based on its type, right? So, let's suppose I have two columns: 1, 3. If I insert column 2 I will end up with 1, 2, 3, ok? I'm using the default BytesType as the Comparator type to store time series columns (actually the value is a

Re: Cassandra usage

2012-01-24 Thread Gustavo Gustavo
That's for sure not much. Your rdbms can probably hold the entire dataset in memory, and you can do all kinds for queries that you want. Cassandra is for some very specific use cases. If you really need a cluster, have you thought about MySQL Cluster? 2012/1/25 francesco.tangari@gmail.com

Re: Cassandra usage

2012-01-24 Thread francesco . tangari . inf
make example of cases please? -- francesco.tangari@gmail.com Inviato con Sparrow (http://www.sparrowmailapp.com/?sig) Il giorno mercoledì 25 gennaio 2012, alle ore 05.29, Gustavo Gustavo ha scritto: That's for sure not much. Your rdbms can probably hold the entire dataset in memory,

Re: Python CQL Batching is slower than single statements

2012-01-24 Thread aaron morton
There are few slight differences in the execution paths, nothing jumps out (it *looks* like the authorization to write to the CF is checked for each statement in the batch, not sure how heavy that is.). If you send a batch with more statements that concurrent_writers in the yaml some of those