Fwd: Hints folder missing in Cassandra

2018-02-06 Thread test user
Has anyone run into the issue mentioned below? -- Forwarded message -- From: test user Date: Tue, Feb 6, 2018 at 3:28 PM Subject: Hints folder missing in Cassandra To: user-subscr...@cassandra.apache.org Hello All, I am using Cassandra 3.10. I have

Re: Add column if it does not exist?

2018-02-06 Thread Irtiza Ali
Hello, this link might also be helpful to you for querying the table schema. Link: https://docs.datastax.com/en/cql/3.3/cql/cql_using/useQuerySystemTable.html Best, Irtiza On Tue, Feb 6, 2018 at 9:55 PM, Oliver Ruebenacker wrote: > > Hello, > > Is there a describe
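A minimal sketch of the system-table approach the linked page describes, assuming Cassandra 3.x, where column metadata lives in `system_schema.columns`. The function name `column_exists` is hypothetical, and `session` is assumed to be any object with an `execute(query, params)` method, such as a Python driver session; nothing here comes from the original message beyond the idea of querying the schema tables.

```python
def column_exists(session, keyspace, table, column):
    """Return True if `column` is defined on keyspace.table.

    Assumption: `session.execute(query, params)` returns an iterable of rows,
    as a Python driver session does for a simple (non-prepared) statement.
    """
    rows = session.execute(
        "SELECT column_name FROM system_schema.columns "
        "WHERE keyspace_name = %s AND table_name = %s AND column_name = %s",
        (keyspace, table, column),
    )
    # Any returned row means the column is already part of the schema.
    return any(True for _ in rows)
```

A caller could issue `ALTER TABLE ... ADD` only when this returns False. The check and the ALTER are two separate requests, so a concurrent schema change can still slip in between them.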

Re: node restart causes application latency

2018-02-06 Thread Jeff Jirsa
Unless you abbreviated, your data model is questionable (SELECT without any equality in the WHERE clause on the partition key will always cause a range scan, which is super inefficient). Since you're doing LOCAL_ONE and a range scan, timeouts sorta make sense - the owner of at least one range

RE: Cassandra CDC

2018-02-06 Thread Rahul Singh
Ah got it. Good stuff. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Feb 6, 2018, 9:40 AM -0500, Nigel LEACH , wrote: > Not too much delving needed, I upgraded jamm to v0.3.2. I’m not entirely sure > why this was required, it seems a little obscure,

Re: node restart causes application latency

2018-02-06 Thread Michael Shuler
On 02/06/2018 12:58 PM, Mike Torra wrote: > > I restart a node like this: > > nodetool disablethrift && nodetool disablegossip && nodetool drain > sudo service cassandra restart Just a guess here - are you really only using thrift? (ie. `nodetool disablebinary`) > When I do that, I very often

Re: Heavy one-off writes best practices

2018-02-06 Thread Romain Hardouin
We use Spark2Cassandra (this fork works with C*3.0 https://github.com/leoromanovsky/Spark2Cassandra ) SSTables are streamed to Cassandra by Spark2Cassandra (so you need to open port 7000 accordingly). During benchmark we used 25 EMR nodes but in production we use fewer nodes to be more gentle

node restart causes application latency

2018-02-06 Thread Mike Torra
Hi - I am running a 29 node cluster spread over 4 DC's in EC2, using C* 3.11.1 on Ubuntu. Occasionally I have the need to restart nodes in the cluster, but every time I do, I see errors and application (nodejs) timeouts. I restart a node like this: nodetool disablethrift && nodetool

Re: [announce] Release of Cassandra Prometheus metrics exporter

2018-02-06 Thread Lucas Benevides
Hello Romain, I want to test criteo but have some doubts. Graphite is not good for me because data is stored in a whisper file, which is not accurate and I have scientific purposes. Do I have to run the java application (jar) in every node of my cluster? Is the internal storage a round-robin

Re: Add column if it does not exist?

2018-02-06 Thread Irtiza Ali
Yes, a describe query is available in cqlsh. Try the example given below: Example: cqlsh> describe keyspaceName.tableName; It will return the schema of the table. On 6 Feb 2018 21:55, "Oliver Ruebenacker" wrote: > > Hello, > > Is there a describe query in CQL? I don't see

Re: Add column if it does not exist?

2018-02-06 Thread Oliver Ruebenacker
Hello, Is there a describe query in CQL? I don't see one on http://cassandra.apache.org/doc/latest/cql/index.html. I also can't find such a query in the DataStax Java driver API. Thanks! Best, Oliver On Tue, Feb 6, 2018 at 11:48 AM, Irtiza Ali wrote: > Hello. >

Re: Add column if it does not exist?

2018-02-06 Thread Irtiza Ali
Hello. Another thing that you can try is to use the describe table query to get the table schema and parse it. Once done, you can check whether the column exists or not. With Regards Irtiza Ali On 6 Feb 2018 21:35, "Oliver Ruebenacker" wrote: > Thanks for the response! > >

Re: Add column if it does not exist?

2018-02-06 Thread Oliver Ruebenacker
Thanks for the response! So, the best solution I can come up with is catching the InvalidQueryException and checking whether its message contains the phrase "conflicts with an existing column". Seems to work, but super-ugly. I do assume that in general, if a request fails, it does not
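The workaround described above can be sketched roughly as follows. This is an illustration, not the poster's code: the message mentions the Java driver's InvalidQueryException, while this sketch is in Python, takes a hypothetical `execute` callable, and catches a generic exception so it stays self-contained. Only the phrase "conflicts with an existing column" is taken from the message.

```python
# Phrase quoted in the message; the server's wording may differ between versions.
DUPLICATE_COLUMN_PHRASE = "conflicts with an existing column"


def add_column_if_absent(execute, keyspace, table, column, cql_type):
    """Try to add a column; return True if added, False if it already existed.

    Assumption: `execute` is any callable that runs a CQL string and raises
    an exception carrying the server's error message on failure.
    """
    statement = f"ALTER TABLE {keyspace}.{table} ADD {column} {cql_type}"
    try:
        execute(statement)
        return True
    except Exception as exc:
        # Treat only the duplicate-column failure as benign.
        if DUPLICATE_COLUMN_PHRASE in str(exc):
            return False
        # Anything else is a real error and is re-raised unchanged.
        raise
```

Matching on message text is brittle, as the poster notes, so real code would narrow the `except` clause to the driver's invalid-request exception type.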

Re: Increased latency after setting row_cache_size_in_mb

2018-02-06 Thread mohsin k
@Rahul, I am using cassandra-stress tool. On Tue, Feb 6, 2018 at 7:37 PM, Rahul Singh wrote: > Could be the cause. I would run 2 and then 4 concurrent clients to see how > they behave. What’s your client written in? How are you managing your > connection? > > -- >

Re: Heavy one-off writes best practices

2018-02-06 Thread Julien Moumne
This does look like a very viable solution. Thanks. Could you give us some pointers/documentation on : - how can we build such SSTables using spark jobs, maybe https://github.com/Netflix/sstable-adaptor ? - how do we send these tables to cassandra? does a simple SCP work? - what is the

RE: Cassandra CDC

2018-02-06 Thread Nigel LEACH
Not too much delving needed, I upgraded jamm to v0.3.2. I’m not entirely sure why this was required, it seems a little obscure, but I’m back on track. Many Thanks Nigel From: Nigel LEACH Sent: 06 February 2018 14:25 To: user@cassandra.apache.org Subject: RE: Cassandra CDC Thanks Rahul, I

RE: Cassandra CDC

2018-02-06 Thread Nigel LEACH
Thanks Rahul, I looked at the smartcat implementation, and am doing something very similar. Unfortunately, I’m using a mixed DSE and Apache codebase, which may not help. Anyway, if CDC worked for you with a random partitioner that’s good news, as it’s more likely my code that is wrong. I will

Re: Cassandra CDC

2018-02-06 Thread Rahul Singh
Nigel, Are you using something like this or rolled your own? https://github.com/smartcat-labs/cassandra-kafka-connector/tree/master/cassandra-cdc I've used it in a docker composition and it seemed to work fine for me.

Re: Increased latency after setting row_cache_size_in_mb

2018-02-06 Thread Rahul Singh
Could be the cause. I would run 2 and then 4 concurrent clients to see how they behave. What’s your client written in? How are you managing your connection? -- Rahul Singh rahul.si...@anant.us Anant Corporation On Feb 6, 2018, 8:50 AM -0500, mohsin k , wrote: >

Re: Increased latency after setting row_cache_size_in_mb

2018-02-06 Thread mohsin k
Thanks, Jeff, will definitely check the trace. Also, one strange thing I noticed: with the number of threads up to 64, the latency is around 3ms, but as the number of threads increases, latency also increases. Eventually, at thread count 609, latency is around 30ms. I am using a single client to

Cassandra CDC

2018-02-06 Thread Nigel LEACH
Hello, I'm loading Cassandra (v3.10.0.1652) data into a Kafka (v1.0.0) topic via CDC and the org.apache.cassandra.db.commitlog.CommitLogReader. All seems to fit together, but I am seeing an "Invalid partitioner RandomPartitioner" error thrown. Is CDC compatible with the RandomPartitioner? There