Re: Cassandra primary key design to cater range query

2015-01-09 Thread Tyler Hobbs
Your proposed model for the table to handle the last query looks good, so I would stick with that. On Mon, Jan 5, 2015 at 5:45 AM, Nagesh nageswara.r...@gmail.com wrote: Hi All, I have designed a column family prodgroup text, prodid int, status int, , PRIMARY KEY ((prodgroup), prodid,

Re: User audit in Cassandra

2015-01-09 Thread Ajay
Thanks, Tyler Hobbs. We need to capture which queries a user ran in a session and how long each took (we don't need the query plan or the like). Is that possible? With an Authenticator we can capture only the session creation, right? Thanks, Ajay On Sat, Jan 10, 2015 at 6:07 AM, Tyler Hobbs

Re: sstable structure

2015-01-09 Thread Tyler Hobbs
sstable2json can give you a pretty good idea of the format. Otherwise, your best option is to read the code, starting with org.apache.cassandra.io.sstable.SSTableWriter. On Fri, Jan 2, 2015 at 12:27 PM, Nikolay Mihaylov n...@nmmm.nu wrote: Hi, for some time I have been trying to find the structure of

Re: User audit in Cassandra

2015-01-09 Thread Tyler Hobbs
system_traces is for query tracing, which is for diagnosing performance problems, not logging activity. Cassandra is designed to allow you to write your own Authenticator pretty easily. You can just subclass PasswordAuthenticator and add logging where desired. Compile that into a jar, put it in

Re: Cassandra primary key design to cater range query

2015-01-09 Thread Ajay
Hi, I read somewhere that the order of columns in the clustering key matters. Please correct me if I am wrong. For example, PRIMARY KEY((prodgroup), status, prodid). Then the query below cannot run: select * from product where prodgroup='xyz' and prodid > 0 But this query can be run: select *

Re: User audit in Cassandra

2015-01-09 Thread DuyHai Doan
What you want is something like an audit logger, such as the one provided by DSE? (http://www.datastax.com/2014/12/enhanced-enterprise-security-in-datastax-enterprise-4-6) On Thu, Jan 8, 2015 at 1:34 PM, Ajay ajay.ga...@gmail.com wrote: Hi, Is there a way to enable user audit or trace if we have

Is there a way to add a new node to a cluster but not sync old data?

2015-01-09 Thread Yatong Zhang
Hi there, I am using C* 2.0.10 and I was trying to add a new node to a cluster (actually replacing a dead node). But after adding the new node, some other nodes in the cluster had a very high workload, which affected the performance of the whole cluster. So I am wondering: is there a way to add a new

Re: How to bulkload into a specific data center?

2015-01-09 Thread Robert Coli
On Fri, Jan 9, 2015 at 11:38 AM, Benyi Wang bewang.t...@gmail.com wrote: - Is it possible to modify SSTableLoader to allow it to access one data center? Even if you only write to nodes in DC A, if you replicate that data to DC B, it will have to travel over the WAN anyway? What are you

Re: Updated JMX metrics overview

2015-01-09 Thread Tyler Hobbs
On Thu, Jan 8, 2015 at 9:57 AM, Reik Schatz reik.sch...@gmail.com wrote: org.apache.cassandra.db type=StorageProxy TotalHints - is this the number of hints since the node was started or a lifetime value? Since the node was started. org.apache.cassandra.db type=StorageProxy

nodetool repair

2015-01-09 Thread Adil
Hi guys, We have two DCs and are planning to schedule nodetool repair weekly. My question is: is nodetool repair cross-cluster or not? Is it sufficient to run it without options on one node, or should it be scheduled on every node with the host option? Thanks

Re: C* throws OOM error despite use of automatic paging

2015-01-09 Thread DuyHai Doan
What is the data size of the column family you're trying to fetch with paging? Are you storing big blobs or just primitive values? On Fri, Jan 9, 2015 at 8:33 AM, Mohammed Guller moham...@glassbeam.com wrote: Hi – We have an ETL application that reads all rows from Cassandra (2.1.2),

Re: nodetool repair

2015-01-09 Thread Robert Coli
On Fri, Jan 9, 2015 at 8:01 AM, Adil adil.cha...@gmail.com wrote: We have two DCs and are planning to schedule nodetool repair weekly. My question is: is nodetool repair cross-cluster or not? Is it sufficient to run it without options on one node, or should it be scheduled on every node with

Re: How to bulkload into a specific data center?

2015-01-09 Thread Benyi Wang
Hi Ryan, Thanks for your reply. Now I understand how SSTableLoader works. - If I understand correctly, the current o.a.c.io.sstable.SSTableLoader doesn't use LOCAL_ONE or LOCAL_QUORUM. Is that right? - Is it possible to modify SSTableLoader to allow it to access one data center? Because

Re: High read latency after data volume increased

2015-01-09 Thread Roni Balthazar
Hi there, Compaction keeps running under our workload. We are using RAID arrays of SATA HDDs. When trying to run cfhistograms on our user_data table, we get this message: nodetool: Unable to compute when histogram overflowed. Please see what happens when running some queries on this cf:

Re: High read latency after data volume increased

2015-01-09 Thread Brian Tarbox
C* seems to have more than its share of "version x doesn't work, use version y" type issues. On Thu, Jan 8, 2015 at 2:23 PM, Robert Coli rc...@eventbrite.com wrote: On Thu, Jan 8, 2015 at 11:14 AM, Roni Balthazar ronibaltha...@gmail.com wrote: We are using C* 2.1.2 with 2 DCs. 30 nodes DC1