Re: Performance problem with large wide row inserts using CQL

2014-02-22 Thread Rüdiger Klaehn
On Fri, Feb 21, 2014 at 11:51 AM, Sylvain Lebresne sylv...@datastax.comwrote: On Thu, Feb 20, 2014 at 10:49 PM, Rüdiger Klaehn rkla...@gmail.comwrote: Hi Sylvain, I applied the patch to the cassandra-2.0 branch (this required some manual work since I could not figure out which commit it was

Queuing System

2014-02-22 Thread Jagan Ranganathan
Hi, I need to decouple some of the work being processed from the user thread to provide better user experience. For that I need a queuing system with the following needs, High Availability No Data Loss Better Performance. Following are some libraries that were considered along with the

Re: Disabling opscenter data collection in Datastax community 2.0

2014-02-22 Thread Michael Shuler
On 02/22/2014 06:12 AM, user 01 wrote: I'm using dsc20 (datastax community edition for cassandra 2.0) in production environment. But since I am not authorized to use Opscenter for production use. So how do I disable the data recording that is being done for opscenter consumption, as this is just

Re: Queuing System

2014-02-22 Thread DuyHai Doan
Jagan Queue-like data structures are known to be one of the worst anti patterns for Cassandra: http://www.datastax.com/dev/blog/cassandra-anti-patterns-queues-and-queue-like-datasets On Sat, Feb 22, 2014 at 4:03 PM, Jagan Ranganathan ja...@zohocorp.comwrote: Hi, I need to decouple some of

Re: Queuing System

2014-02-22 Thread Laing, Michael
We use RabbitMQ for queuing and Cassandra for persistence. RabbitMQ with clustering and/or federation should meet your high availability needs. Michael On Sat, Feb 22, 2014 at 10:25 AM, DuyHai Doan doanduy...@gmail.com wrote: Jagan Queue-like data structures are known to be one of the

Re: Queuing System

2014-02-22 Thread Tupshin Harper
While, historically, it has been true that queuing in Cassandra has been an anti-pattern, it is also true that Leveled Compaction addresses the worst aspect of frequent deletes in Cassandra, and that overall, queuing in Cassandra is nowhere near the anti-pattern that it used to be. This is

Re: Queuing System

2014-02-22 Thread Jagan Ranganathan
Hi Michael, Yes I am planning to use RabbitMQ for my messaging system. But I wonder which will give better performance if writing directly into Rabbit with Ack support Vs a temporary Queue in Cassandra first and then dequeue and publish in Rabbit. Complexities involving - Handling scenarios

Re: Queuing System

2014-02-22 Thread Joe Stein
If performance and availability for messaging is a requirement then use Apache Kafka http://kafka.apache.org/ You can pass the same thrift/avro objects through the Kafka commit log or strings or whatever you want. /*** Joe Stein Founder, Principal

Re: Queuing System

2014-02-22 Thread Jagan Ranganathan
Hi Joe, If my understanding is right, Kafka does not satisfy the high availability/replication part well because of the need for leader and In-Sync replicas. Regards, Jagan On Sat, 22 Feb 2014 22:02:27 +0530 Joe Steinlt;crypt...@gmail.comgt; wrote If performance and

Re: Queuing System

2014-02-22 Thread Jagan Ranganathan
Hi, Thanks for the pointer. Following are some options given there, If you know where your live data begins, hint Cassandra with a start column, to reduce the scan times and the amount of tombstones to collect. A broker will usually have some notion of what’s next in the sequence and thus

Re: Queuing System

2014-02-22 Thread Jagan Ranganathan
Thanks Tupshin for your assistance. As I mentioned in the other mail, Yes I am planning to use RabbitMQ for my messaging system. But I wonder which will give better performance if writing directly into Rabbit with Ack support Vs a temporary Queue in Cassandra first and then dequeue and publish

Re: List support in Net::Async::CassandraCQL ?

2014-02-22 Thread Paul LeoNerd Evans
(resending for the list now I'm subscribed) On Sat, 22 Feb 2014 14:03:06 +1100 Jacob Rhoden jacob.rho...@me.com wrote: This perl library has been extremely useful for scripting up data migrations. I wonder if anyone knows of the easiest way to use lists with this driver? Throwing a perl array

Re: Queuing System

2014-02-22 Thread DuyHai Doan
Jagan Few time ago I dealed with a similar queuing design for one customer. *If you never delete messages in the queue*, then it is possible to use wide rows with bucketing and increasing monotonic column name to store messages. CREATE TABLE *read_only_queue *( bucket_number int,

Re: Queuing System

2014-02-22 Thread thunder stumpges
We use this same setup also and it works great. Thunder - Reply message - From: Laing, Michael michael.la...@nytimes.com To: user@cassandra.apache.org Subject: Queuing System Date: Sat, Feb 22, 2014 7:31 AM We use RabbitMQ for queuing and Cassandra for persistence. RabbitMQ with

Re: Queuing System

2014-02-22 Thread thunder stumpges
This seems a bit overkill. We run far more than 100mps (closer to 600) in rabbit with very good latency on a 3 node cluster. It has been very reliable as well. Thunder - Reply message - From: Jagan Ranganathan ja...@zohocorp.com To: user@cassandra.apache.org Subject: Queuing System

[OT]: Can I have a non-delivering subscription?

2014-02-22 Thread Paul LeoNerd Evans
A question about the mailing list itself, rather than Cassandra. I've re-subscribed simply because I have to be subscribed in order to send to the list, as I sometimes try to when people Cc questions about my Net::Async::CassandraCQL perl module to me. However, if I want to read the list, I

Re: Queuing System

2014-02-22 Thread Joe Stein
Without them you have no durability. With them you have guarantees... More than any other system with messaging features. It is a durable CP commit log. Works very well for data pipelines with AP systems like Cassandra which is a different system solving different problems. When a Kafka

Loading CQL PDO for CentOS PHP

2014-02-22 Thread Spencer Brown
I'm trying to get CQL going for my CentOS 5 cassandra PHP platform. I've installed thrift, but when I try to make cassandra-pdo or YACassandraPDO for that matter, none of the tests pass. And when I install it with PHP, phpinfo still doesn't show it loading and it doesn't work. Any ideas would be

Re: Update multiple rows in a CQL lightweight transaction

2014-02-22 Thread Tupshin Harper
#5633 was actually closed because the static columns feature ( https://issues.apache.org/jira/browse/CASSANDRA-6561) which has been checked in to the 2.0 branch but is not yet part of a release (it will be in 2.0.6). That feature will let you update multiple rows within a single partition by

abusing cassandra's multi DC abilities

2014-02-22 Thread Jonathan Haddad
Upfront TLDR: We want to do stuff (reindex documents, bust cache) when changed data from DC1 shows up in DC2. Full Story: We're planning on adding data centers throughout the US. Our platform is used for business communications. Each DC currently utilizes elastic search and redis. A message

Re: List support in Net::Async::CassandraCQL ?

2014-02-22 Thread Jacob Rhoden
Hi Paul, On 23 Feb 2014, at 4:15 am, Paul LeoNerd Evans leon...@leonerd.org.uk wrote: On Sat, 22 Feb 2014 14:03:06 +1100 Jacob Rhoden jacob.rho...@me.com wrote: my $q = $cass-prepare(update contact set name=?, address=? where uuid=?)-get; push @f, $q-execute([$name, @address, $uuid]);

Re: Queuing System

2014-02-22 Thread Jagan Ranganathan
Thanks Duy Hai for sharing the details. I have a doubt. If for some reason there is a Network Partition or more than 2 Node failure serving the same partition/load and you ended up writing hinted hand-off. Is there a possibility of a data loss? If yes, how do we avoid that? Regards, Jagan

Re: [OT]: Can I have a non-delivering subscription?

2014-02-22 Thread Robert Wille
Yeah, it¹s called a rule. Set one up to delete everything from user@cassandra.apache.org. On 2/22/14, 10:32 AM, Paul LeoNerd Evans leon...@leonerd.org.uk wrote: A question about the mailing list itself, rather than Cassandra. I've re-subscribed simply because I have to be subscribed in order to

Re: Disabling opscenter data collection in Datastax community 2.0

2014-02-22 Thread user 01
I would be using nodetool JConsole for monitoring. Though it would be less informative but I think it will do. Otherwise also I cannot use Opscenter as I am not using the DSE but DSC, in production. So I am not allowed to use it for prod. use, Isn't it ? Not everyone here as well is using DSE

Re: Disabling opscenter data collection in Datastax community 2.0

2014-02-22 Thread Tupshin Harper
You can use OpsCenter in production with DSC/Apache Cassandra clusters. Some features are only enabled with DSE, but the rest work fine with DSC. -Tupshin On Feb 22, 2014 11:20 PM, user 01 user...@gmail.com wrote: I would be using nodetool JConsole for monitoring. Though it would be less