RE: Attempted to write commit log entry for unrecognized table

2017-08-16 Thread Myron A. Semack
Restarting the Cassandra service resolved this issue. Thanks for your advice! Sincerely, Myron A. Semack From: kurt greaves [mailto:k...@instaclustr.com] Sent: Tuesday, August 15, 2017 6:10 PM To: User Subject: Re: Attempted to write commit log entry for

Re: Migrate from DSE (Datastax) to Apache Cassandra

2017-08-16 Thread Ioannis Zafiropoulos
We use NetworkTopologyStrategy as the replication strategy. The only DSE specific features we use (left untouched by default) are: authenticator: com.datastax.bdp.cassandra.auth.DseAuthenticator authorizer: com.datastax.bdp.cassandra.auth.DseAuthorizer role_manager:

Re: Cassandra Writes Duplicated/Concatenated List Data

2017-08-16 Thread Sagar Jambhulkar
What is your query to fetch rows. Can you share P1,pk2,time for the sample rows you pasted? On 17-Aug-2017 2:20 AM, "Nathan McLean" wrote: > Hello All, > > I have a Cassandra cluster with a table similar to the following: > > ``` > CREATE TABLE table ( > pk1

Re: Full table scan with cassandra

2017-08-16 Thread Ben Bromhead
Apache Cassandra is not great in terms of performance at the moment for batch analytics workloads that require a full table scan. I would look at FiloDB for all the benefits and familiarity of Cassandra with better streaming and analytics performance: https://github.com/filodb/FiloDB There are

Full table scan with cassandra

2017-08-16 Thread Alex Kotelnikov
Hey, we are trying Cassandra as an alternative for storage huge stream of data coming from our customers. Storing works quite fine, and I started to validate how retrieval does. We have two types of that: fetching specific records and bulk retrieval for general analysis. Fetching single record

Re: Migrate from DSE (Datastax) to Apache Cassandra

2017-08-16 Thread Felipe Esteves
Ioannis, As some people already said, there's one or two keyspaces that uses EverywhereStrategy, dse_system is one of them, if I'm not wrong. You must remember to change them to a community strategy or it will fail. --

Cassandra Writes Duplicated/Concatenated List Data

2017-08-16 Thread Nathan McLean
Hello All, I have a Cassandra cluster with a table similar to the following: ``` CREATE TABLE table ( pk1 text, pk2 int, time timestamp, ... probability list, PRIMARY KEY ((pk1, pk2), time) ) WITH CLUSTERING ORDER BY (time DESC) ``` Python processes write to this table

Re: live dsc upgrade from 2.0 to 2.1 behind the scenes

2017-08-16 Thread Park Wu
Thank you Eric, for advice. It's great help.-Park On Tuesday, August 15, 2017 3:42 AM, Erick Ramirez wrote: 1) You should not perform any streaming operations (repair, bootstrap, decommission) in the middle of an upgrade. Note that an upgrade is not complete

Re: Cassandra Writes Duplicated/Concatenated List Data

2017-08-16 Thread Christophe Schmitz
Hi Nathan, The code may occasionally write to the same row multiple times. > > Can you run a test using IF NOT EXISTS in your inserts to see if that makes a difference? That shouldn't make a difference, but I don't see what the problem might be at the moment. -- *Christophe

Re: Full table scan with cassandra

2017-08-16 Thread Dor Laor
Hi Alex, You probably didn't get the paralelism right. Serial scan has a paralelism of one. If the paralelism isn't large enough, perf will be slow. If paralelism is too large, Cassandra and the disk will trash and have too many context switches. So you need to find your cluster's sweet spot. We