Re: Replication to second data center with different number of nodes

2015-03-28 Thread Colin Clark
I typically use a # a lot lower than 256, usually less than 20 for num_tokens as a larger number has historically had a dramatic impact on query performance. — Colin Clark co...@clark.ws +1 612-859-6129 skype colin.p.clark On Mar 28, 2015, at 3:46 PM, Eric Stevens migh...@gmail.com wrote: If

Re: ('Unable to complete the operation against any hosts', {})

2015-03-28 Thread Dhanasekaran
Rahul, Can you try reducing the batch size to 1000? Also what is the write consistency level? Thanks, Dhanasekaran On 29-Mar-2015, at 12:30 am, Rahul Bhardwaj rahul.bhard...@indiamart.com wrote: Hi All, awaiting any response.. please help regards: rahul On Fri, Mar 27, 2015 at

Re: ('Unable to complete the operation against any hosts', {})

2015-03-28 Thread Rahul Bhardwaj
Hi All, awaiting any response.. please help regards: rahul On Fri, Mar 27, 2015 at 5:54 PM, Rahul Bhardwaj rahul.bhard...@indiamart.com wrote: Hi All, We are using cassandra version 2.1.2 with cqlsh 5.0.1 (cluster of three nodes with rf 2) I need to load around 40 million records

Re: High latencies for simple queries

2015-03-28 Thread Ben Bromhead
cqlsh runs on the internal cassandra python drivers: cassandra-pylib and cqlshlib. I would not recommend using them at all (nothing wrong with them, they are just not built with external users in mind). I have never used python-driver in anger so I can't comment on whether it is genuinely slower

Re: ('Unable to complete the operation against any hosts', {})

2015-03-28 Thread Jonathan Haddad
Don't use batches for this. Use a lot of async queries. https://lostechies.com/ryansvihla/2014/08/28/cassandra-batch-loading-without-the-batch-keyword/ Jon On Mar 27, 2015, at 5:24 AM, Rahul Bhardwaj rahul.bhard...@indiamart.com wrote: Hi All, We are using cassandra version 2.1.2

Re: Replication to second data center with different number of nodes

2015-03-28 Thread Eric Stevens
If you're curious about how Cassandra knows how to replicate data in the remote DC, it's the same as in the local DC, replication is independent in each, and you can even set a different replication strategy per keyspace per datacenter. Nodes in each DC take up num_tokens positions on a ring,

Re: Arbitrary nested tree hierarchy data model

2015-03-28 Thread Robert Wille
Ben Bromhead sent an email to me directly and expressed an interest in seeing some of my queries. I may as well post them for everyone. Here are my queries for the part of my code that reads and cleans up browse trees. @NamedCqlQueries({ @NamedCqlQuery( name =

Re: Arbitrary nested tree hierarchy data model

2015-03-28 Thread daemeon reiydelle
Fascinating. Both the mysql front and and this delightful inverted search solution. Your creativity makes me wonder what other delights your query solutions might expose!! sent from my mobile Daemeon C.M. Reiydelle USA 415.501.0198 London +44.0.20.8144.9872 On Mar 27, 2015 7:08 PM, Robert Wille

Re: Arbitrary nested tree hierarchy data model

2015-03-28 Thread Peter Lin
that's neat, thanks for sharing. sounds like the solution is partly inspired by merkle tree to make lookup fast and easy. peter On Fri, Mar 27, 2015 at 10:07 PM, Robert Wille rwi...@fold3.com wrote: Okay, this is going to be a pretty long post, but I think its an interesting data model, and

Re: High latencies for simple queries

2015-03-28 Thread Artur Siekielski
On 03/28/2015 12:13 AM, Ben Bromhead wrote: One other thing to keep in mind / check is that doing these tests locally the cassandra driver will connect using the network stack, whereas postgres supports local connections over a unix domain socket (this is also enabled by default). Unix domain