Hi,
we're currently evaluating KairosDB for time series which looks quite
nice.
https://kairosdb.github.io/
The cool thing with KairosDB is that it uses Cassandra as storage engine
and provide
additional features (mainly a REST-based API for accessing data).
Maybe you can take a look the
I have looked at this problem for a good year now. My feel is that
Cassandra alone as the sole underlying DB for Timeseries just does not cut
it.
I am starting to look at C* along with another DB for executing the sort of
queries we want here.
Currently I am evaluating Druid vs Kudu to be this
On 2017-07-26 05:15 (-0700), Junaid Nasir wrote:
> I have a C* cluster (3 nodes) with some 60gb data (replication factor 2).
> when I started using C* coming from SQL background didn't give much thought
> about modeling the data correctly. so what I did was
>
> CREATE TABLE
If all of your queries like this(i mean get all devices given a a time
range) Hadoop would be more appropriate since those are analytical queries.
Anyway, to query such data with spark Cassandra connector your partition
key could include day and hash of your deviceid as pseudo partition key
all devices.
After selecting the data I group them and perform other actions i.e sum,
avg on fields and then display those to compare how devices are doing
compared to each other.
On Wed, Jul 26, 2017 at 5:32 PM, CPC wrote:
> Hi Junaid,
>
> Given a time range do you want to
Hi Junaid,
Given a time range do you want to take all devices or a specific device?
On Jul 26, 2017 3:15 PM, "Junaid Nasir" wrote:
I have a C* cluster (3 nodes) with some 60gb data (replication factor 2).
when I started using C* coming from SQL background didn't give much
I have a C* cluster (3 nodes) with some 60gb data (replication factor 2).
when I started using C* coming from SQL background didn't give much thought
about modeling the data correctly. so what I did was
CREATE TABLE data ( deviceId int,
time timestamp,