Ajay created CASSANDRA-9767:
-------------------------------

             Summary: Count with results in the CQL query
                 Key: CASSANDRA-9767
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-9767
             Project: Cassandra
          Issue Type: Bug
          Components: Core
         Environment: Cassandra 2.0.16
Ubuntu 15.04
            Reporter: Ajay
             Fix For: 2.0.x


Lets assume we have a column family as below:

create table sample ( track_id int, user_id int, country varchar, primary key 
((track_id), user_id));

where track_id is the partition key.

Now to aggregate the number of rows for a single track_id, we can query using 
CQL as below:
select count(*) where track_id = 1 and user_id = 1;

But that will return only the count. If we need the other columns along with 
the count, we cannot query as below as it throws error:

 select count(*), user_id  from sample where track_id = 1 and user_id = 1;

Bad Request: line 1:15 mismatched input ',' expecting K_FROM.

In this case, all rows for a given track_id and user_id will have the same 
value for country. So we should be able to query as above.  Also in SQL, it is 
possible to select columns along with aggregate functions.

Though I know that Cassandra is not analytics (unlike Hadoop and Spark), we 
need some basic aggregate functions like min, max, avg, count etc.... It cannot 
used just as a data store (as garbage-in garbage-out). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to