Re: Hadoop jobs and data locality

2013-05-05 Thread Shamim
Hello,   We have also came across this issue in our dev environment, when we upgrade Cassandra from 1.1.5 to 1.2.1 version. I have mentioned this issue in few times in this forum but haven't got any answer yet. For quick work around you can use pig.splitCombination false in your pig script to

Re: hector or astyanax

2013-05-05 Thread Shamim
Hi,   Astyanax is just a refactoring of Hector and implements a few common cassandra use cases. Very easy to use api. In Astyanax you will found all the functions from hector. For better performance you can also check datastax java driver https://github.com/datastax/java-driver. There are

Node went down and came back up

2013-05-05 Thread Dan Kogan
Hello, Last night one of our nodes froze and the server had to be rebooted. After it came up, the node joined the ring and everything looked normal. However, this morning there seem to be some inconsistencies in the data (e.g. some nodes don't have a given record or have a different version of

Re: hector or astyanax

2013-05-05 Thread Renato Marroquín Mogrovejo
Hey Shamim, Why do you say that Java-Driver has better performance over Hector or Astyanax? Is there any reasons for this? Thanks. Renato M. 2013/5/5 Shamim sre...@yandex.ru: Hi, Astyanax is just a refactoring of Hector and implements a few common cassandra use cases. Very easy to use

Re: hector or astyanax

2013-05-05 Thread Derek Williams
The binary protocol is able to multiplex multiple requests using a single connection, which can lead to much better performance (similar to HTTP vs SPDY). This is without comparing the performance of thrift vs binary protocol, which I assume the binary protocol would be faster since it is

Re: hector or astyanax

2013-05-05 Thread Aaron Turner
On Sun, May 5, 2013 at 1:09 PM, Derek Williams de...@fyrie.net wrote: The binary protocol is able to multiplex multiple requests using a single connection, which can lead to much better performance (similar to HTTP vs SPDY). This is without comparing the performance of thrift vs binary

Re: How to use Write Consistency 'ANY' with SSTABLELOADER - DSE Cassandra 1.1.9

2013-05-05 Thread praveen.akunuru
Hi Aaron, Rob, Thank you for your responses. Sorry about the delay in getting back to you. To answer your questions: Is this a once off data load or something you need to do regularly? This will be a regular load. We will have to do a load with 10 million records once in every 2 hours on our

Re: hector or astyanax

2013-05-05 Thread Edward Capriolo
I am aware of no benchmark that shows the binary driver to be faster then thrift. Yes. Theoretically a driver that with multiplex *should be* faster in *some* cases. However I have never seen any evidence to back up this theory anecdotal or otherwise. In fact

Re: hector or astyanax

2013-05-05 Thread Derek Williams
I haven't done any performance testing with Cassandra 1.2, I was only giving possible reasons why the datastax binary driver might be faster. I have seen much better performance and reliability under heavy load in our internal rest services when switching from HTTP to SPDY. Possible reasons could

RE: local_quorum

2013-05-05 Thread Kanwar Sangha
Anyone ? From: Kanwar Sangha [mailto:kan...@mavenir.com] Sent: 03 May 2013 08:59 To: user@cassandra.apache.org Subject: local_quorum Hi - I have 2 data centres (DC1 and DC2) and I have local_quorum set as the CL for reads. Say there is a RF factor = 2. (so 2 copies each in DC). If both nodes

Re: local_quorum

2013-05-05 Thread shubham srivastava
You will get an error back . Also use RF:3 for the high availability in one DC. On Mon, May 6, 2013 at 7:05 AM, Kanwar Sangha kan...@mavenir.com wrote: Anyone ? ** ** *From:* Kanwar Sangha [mailto:kan...@mavenir.com] *Sent:* 03 May 2013 08:59 *To:* user@cassandra.apache.org