Re: Cassandra - Spark - Flume: best architecture for log analytics.

2015-07-22 Thread Pierre Devops
Cassandra is not very good at massive/bulk reads if you need to retrieve and process a large amount of data on multiple machines using something like Spark or Hadoop (or you'll need to hack and process the SSTables directly, which is not natively supported; you'll have to hack your

Compacting large partition warning

2015-07-20 Thread Pierre Devops
Hi, since I upgraded to 2.2-rc2 I get the CASSANDRA-9643 warning: WARN o.a.c.i.s.f.b.BigTableWriter - Compacting large partition bytes. Some of my partitions may have ~20 million rows, while others may have only a few hundred rows. It may grow up to 300 million rows per

Re: Cassandra 2.2, 3.0, and beyond

2015-06-11 Thread Pierre Devops
Hi, 3.x beta release date? 2015-06-11 16:21 GMT+02:00 Jonathan Ellis jbel...@gmail.com: 3.1 is EOL as soon as 3.3 (the next bug fix release) comes out. On Thu, Jun 11, 2015 at 4:10 AM, Stefan Podkowinski stefan.podkowin...@1und1.de wrote: We are also extending our backwards

Slow bulk loading

2015-05-07 Thread Pierre Devops
Hi, I'm streaming a big SSTable using the sstableloader bulk loader, but it's very slow (3 Mbytes/sec):
Summary statistics:
Connections per host: : 1
Total files transferred: : 1
Total bytes transferred: : 10357947484
Total duration (ms): : 3280229
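
The "3 Mbytes/sec" figure can be cross-checked against the summary statistics quoted in the message. The following is only a minimal sketch of that arithmetic in Python; the function name `transfer_rate` is made up for illustration and is not part of sstableloader or any Cassandra tooling. The two input numbers are the total bytes and total duration from the message.

```python
def transfer_rate(total_bytes, duration_ms):
    """Return (MB/s, MiB/s) for a transfer of total_bytes lasting duration_ms.

    Hypothetical helper for illustration only; not a Cassandra API.
    """
    seconds = duration_ms / 1000.0
    bytes_per_sec = total_bytes / seconds
    # Decimal megabytes (10^6 bytes) and binary mebibytes (2^20 bytes).
    return bytes_per_sec / 1e6, bytes_per_sec / 2**20


# Values taken from the summary statistics in the message.
mb_per_s, mib_per_s = transfer_rate(10_357_947_484, 3_280_229)
print(f"{mb_per_s:.2f} MB/s, {mib_per_s:.2f} MiB/s")  # 3.16 MB/s, 3.01 MiB/s
```

Both units land at roughly 3, which matches the rate reported in the message: about 10.4 GB moved in just under 55 minutes over a single connection.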