Re: C* data modeling for time series

2018-06-18 Thread mm
Hi, we're currently evaluating KairosDB for time series which looks quite nice. https://kairosdb.github.io/ The cool thing with KairosDB is that it uses Cassandra as storage engine and provide additional features (mainly a REST-based API for accessing data). Maybe you can take a look the

Re: C* data modeling for time series

2018-06-18 Thread Affan Syed
I have looked at this problem for a good year now. My feel is that Cassandra alone as the sole underlying DB for Timeseries just does not cut it. I am starting to look at C* along with another DB for executing the sort of queries we want here. Currently I am evaluating Druid vs Kudu to be this

Re: C* data modeling for time series

2017-07-26 Thread Jeff Jirsa
On 2017-07-26 05:15 (-0700), Junaid Nasir wrote: > I have a C* cluster (3 nodes) with some 60gb data (replication factor 2). > when I started using C* coming from SQL background didn't give much thought > about modeling the data correctly. so what I did was > > CREATE TABLE

Re: C* data modeling for time series

2017-07-26 Thread CPC
If all of your queries like this(i mean get all devices given a a time range) Hadoop would be more appropriate since those are analytical queries. Anyway, to query such data with spark Cassandra connector your partition key could include day and hash of your deviceid as pseudo partition key

Re: C* data modeling for time series

2017-07-26 Thread Junaid Nasir
all devices. After selecting the data I group them and perform other actions i.e sum, avg on fields and then display those to compare how devices are doing compared to each other. On Wed, Jul 26, 2017 at 5:32 PM, CPC wrote: > Hi Junaid, > > Given a time range do you want to

Re: C* data modeling for time series

2017-07-26 Thread CPC
Hi Junaid, Given a time range do you want to take all devices or a specific device? On Jul 26, 2017 3:15 PM, "Junaid Nasir" wrote: I have a C* cluster (3 nodes) with some 60gb data (replication factor 2). when I started using C* coming from SQL background didn't give much

C* data modeling for time series

2017-07-26 Thread Junaid Nasir
I have a C* cluster (3 nodes) with some 60gb data (replication factor 2). when I started using C* coming from SQL background didn't give much thought about modeling the data correctly. so what I did was CREATE TABLE data ( deviceId int, time timestamp,