I typically use a # a lot lower than 256, usually less than 20 for num_tokens
as a larger number has historically had a dramatic impact on query performance.
—
Colin Clark
co...@clark.ws
+1 612-859-6129
skype colin.p.clark
On Mar 28, 2015, at 3:46 PM, Eric Stevens migh...@gmail.com wrote:
If
Rahul,
Can you try reducing the batch size to 1000?
Also what is the write consistency level?
Thanks,
Dhanasekaran
On 29-Mar-2015, at 12:30 am, Rahul Bhardwaj rahul.bhard...@indiamart.com
wrote:
Hi All,
awaiting any response.. please help
regards:
rahul
On Fri, Mar 27, 2015 at
Hi All,
awaiting any response.. please help
regards:
rahul
On Fri, Mar 27, 2015 at 5:54 PM, Rahul Bhardwaj
rahul.bhard...@indiamart.com wrote:
Hi All,
We are using cassandra version 2.1.2 with cqlsh 5.0.1 (cluster of three
nodes with rf 2)
I need to load around 40 million records
cqlsh runs on the internal cassandra python drivers: cassandra-pylib and
cqlshlib.
I would not recommend using them at all (nothing wrong with them, they are
just not built with external users in mind).
I have never used python-driver in anger so I can't comment on whether it
is genuinely slower
Don't use batches for this. Use a lot of async queries.
https://lostechies.com/ryansvihla/2014/08/28/cassandra-batch-loading-without-the-batch-keyword/
Jon
On Mar 27, 2015, at 5:24 AM, Rahul Bhardwaj rahul.bhard...@indiamart.com
wrote:
Hi All,
We are using cassandra version 2.1.2
If you're curious about how Cassandra knows how to replicate data in the
remote DC, it's the same as in the local DC, replication is independent in
each, and you can even set a different replication strategy per keyspace
per datacenter. Nodes in each DC take up num_tokens positions on a ring,
Ben Bromhead sent an email to me directly and expressed an interest in seeing
some of my queries. I may as well post them for everyone. Here are my queries
for the part of my code that reads and cleans up browse trees.
@NamedCqlQueries({
@NamedCqlQuery(
name =
Fascinating. Both the mysql front and and this delightful inverted search
solution. Your creativity makes me wonder what other delights your query
solutions might expose!!
sent from my mobile
Daemeon C.M. Reiydelle
USA 415.501.0198
London +44.0.20.8144.9872
On Mar 27, 2015 7:08 PM, Robert Wille
that's neat, thanks for sharing.
sounds like the solution is partly inspired by merkle tree to make lookup
fast and easy.
peter
On Fri, Mar 27, 2015 at 10:07 PM, Robert Wille rwi...@fold3.com wrote:
Okay, this is going to be a pretty long post, but I think its an
interesting data model, and
On 03/28/2015 12:13 AM, Ben Bromhead wrote:
One other thing to keep in mind / check is that doing these tests
locally the cassandra driver will connect using the network stack,
whereas postgres supports local connections over a unix domain socket
(this is also enabled by default).
Unix domain
10 matches
Mail list logo