Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread vincent gromakowski
orte. >- Mature package management, Generics/inability to write DRY >code, a million if err statements (: >- Some other awesome operators/controllers are written in JVM based >languages. The sparkKubernetes resource manager (which is a k8s controller) >is written in

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread vincent gromakowski
Why did you choose java for the operator implementation when everybody seems to use the go client (probably for greater functionalities) ? 2018-05-23 15:39 GMT+02:00 Ben Bromhead : > You can get a good way with StatefulSets, but as Tom mentioned there are > still some

Re: Sorl/DSE Spark

2018-04-12 Thread vincent gromakowski
Best practise is to use a dedicated DC for analytics separated from the hot DC. Le jeu. 12 avr. 2018 à 15:45, sha p a écrit : > Got it. > Thank you so for your detailed explanation. > > Regards, > Shyam > > On Thu, 12 Apr 2018, 17:37 Evelyn Smith,

Re: What kind of Automation you have for Cassandra related operations on AWS ?

2018-02-09 Thread vincent gromakowski
<oleksandr.shul...@zalando.de>: > On Fri, Feb 9, 2018 at 1:01 PM, vincent gromakowski < > vincent.gromakow...@gmail.com> wrote: > >> Working on a Kubernetes operator for Cassandra (Alpha stage...) >> > > I would love to learn more about your approach. Do you h

Re: What kind of Automation you have for Cassandra related operations on AWS ?

2018-02-09 Thread vincent gromakowski
Working on a Kubernetes operator for Cassandra (Alpha stage...) Le 9 févr. 2018 12:56 PM, "Oleksandr Shulgin" a écrit : > On Fri, Feb 9, 2018 at 12:46 AM, Krish Donald > wrote: > >> Hi All, >> >> What kind of Automation you have for Cassandra

Re: Pluggable throttling of read and write queries

2017-02-20 Thread vincent gromakowski
Aren't you using mesos Cassandra framework to manage your multiple clusters ? (Seen a presentation in cass summit) What's wrong with your current mesos approach ? I am also thinking it's better to split a large cluster into smallers except if you also manage client layer that query cass and you

Re: cassandra user request log

2017-02-10 Thread vincent gromakowski
tx 2017-02-10 10:01 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > you could write a custom trigger that logs access to specific CFs. But be > aware that this may have a big performance impact. > > 2017-02-10 9:58 GMT+01:00 vincent gromakowski < > vincent.gromakow.

Re: cassandra user request log

2017-02-10 Thread vincent gromakowski
don't think this is viable. > There are many many JMX metrics on an aggregated level. But none per > authed used. > What exactly do you want to find out? Is it for debugging purposes? > > > 2017-02-10 9:42 GMT+01:00 vincent gromakowski < > vincent.gromakow...@gmail.com>: > &g

cassandra user request log

2017-02-10 Thread vincent gromakowski
Hi all, Is there any way to trace user activity at the server level to see which user is accessing which data ? Do you thin it would be simple to implement ? Tx

Re: [External] Re: Cassandra ad hoc search options

2017-01-31 Thread vincent gromakowski
You can also have a look at https://github.com/strapdata/elassandra 2017-01-31 9:50 GMT+01:00 vincent gromakowski <vincent.gromakow...@gmail.com >: > The problem with adhoc queries on casssandra (with spark or not) is the > partition model of cassandra that needs to be respected t

Re: [External] Re: Cassandra ad hoc search options

2017-01-31 Thread vincent gromakowski
The problem with adhoc queries on casssandra (with spark or not) is the partition model of cassandra that needs to be respected to avoid full scan queries (the link you mentioned explains all of them). With FiloDB, which works on cassandra, you can pushdown predicates of the partition key and

Re: [External] Re: Cassandra ad hoc search options

2017-01-30 Thread vincent gromakowski
I gave a try on spark+filodb and it's very interesting for ad-hoc queries Le 31 janv. 2017 7:20 AM, "siddharth verma" a écrit : Hi, *Are you using the DataStax connector as well? * Yes, we used it to query on lucene index. *Does it support querying against any

Re: are there any free Cassandra -> ElasticSearch connector / plugin ?

2016-10-13 Thread vincent gromakowski
Elassandra https://github.com/vroyer/elassandra Le 14 oct. 2016 12:02 AM, "Eric Ho" a écrit : > I don't want to change my code to write into C* and then to ES. > So, I'm looking for some sort of a sync tool that will sync my C* table > into ES and it should be smart enough

Re: Cassandra data modeling for a social network

2016-05-31 Thread vincent gromakowski
Or use graphframes (Spark) over cassandra to store separately a graph of users and followers and next a table of tweet. You will be able to join data between those 2 structures using spark. 2016-05-31 14:27 GMT+02:00 : > Hello, > > >* First, Is this data modeling

Re: Data platform support

2016-05-10 Thread vincent gromakowski
Maybe a SMACK stack would be a better option for using spark with Cassandra... Le 10 mai 2016 8:45 AM, "Srini Sydney" a écrit : > Thanks a lot..denise > > On 10 May 2016 at 02:42, Denise Rogers wrote: > >> It really depends how close you want to stay to

Re: Efficiently filtering results directly in CS

2016-04-09 Thread vincent gromakowski
ed-functions-in-cassandra-3-0/ > > On 9 April 2016 at 00:22, Kevin Burton <bur...@spinn3r.com> wrote: > >> Ha.. Yes... C*... I guess I need something like coprocessors in >> bigtable. >> >> On Fri, Apr 8, 2016 at 1:49 AM, vincent gromakowski < >> vincent.gr

Re: Efficiently filtering results directly in CS

2016-04-08 Thread vincent gromakowski
c* I suppose 2016-04-07 19:30 GMT+02:00 Jonathan Haddad : > What is CS? > > On Thu, Apr 7, 2016 at 10:03 AM Kevin Burton wrote: > >> I have a paging model whereby we stream data from CS by fetching 'pages' >> thereby reading (sequentially) entire

Re: cassandra disks cache on SSD

2016-04-01 Thread vincent gromakowski
Can you provide me a approximate estimation of performance gain ? 2016-04-01 19:27 GMT+02:00 Mateusz Korniak <mateusz-li...@ant.gliwice.pl>: > On Friday 01 April 2016 13:16:53 vincent gromakowski wrote: > > (...) looking > > for a way to use some kind of tiering with few

cassandra disks cache on SSD

2016-04-01 Thread vincent gromakowski
I am looking for way to optimize large reads. I have seen using SSD is a good option but out of budget, so I am looking for a way to use some kind of tiering with few SSD caching hot data from HDD. I have identified two solutions and would like to get opinions from you and if you have any