Re: mysql based columnar DB to Cassandra DB - Migration

2014-11-27 Thread Kiran Ayyagari
On Fri, Nov 28, 2014 at 1:06 PM, Akshay Ballarpure < akshay.ballarp...@tcs.com> wrote: > Thanks Kiran for reply. > How about other column based databases like infobright , hbase .. can we > really migrate it to cassandra ? > > no, this is only for migrating data from RDBMS to Cassandra > > > > Fr

Re: mysql based columnar DB to Cassandra DB - Migration

2014-11-27 Thread Akshay Ballarpure
Thanks Kiran for reply. How about other column based databases like infobright , hbase .. can we really migrate it to cassandra ? From: Kiran Ayyagari To: user@cassandra.apache.org Date: 11/28/2014 08:27 AM Subject:Re: mysql based columnar DB to Cassandra DB - Migration O

Re: mysql based columnar DB to Cassandra DB - Migration

2014-11-27 Thread Kiran Ayyagari
On Wed, Nov 26, 2014 at 2:15 PM, Akshay Ballarpure < akshay.ballarp...@tcs.com> wrote: > Hello Folks, > I have one mysql based columnar DB, i want to migrate it to Cassandra. How > its possible ? > > see if Troop[1] helps, it was only tested with mysql 5.x and Cassandra 2.0.10 [1] https://github.c

Re: Storing time-series and geospatial data in C*

2014-11-27 Thread Jabbar Azam
Spico, Here's a link flor the time series data http://www.datastax.com/dev/blog/advanced-time-series-with-cassandra You'll also need to understand the composite key format http://www.datastax.com/documentation/cql/3.1/cql/cql_reference/refCompositePk.html Mike Malone has done videos and slides on

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-27 Thread Jens-U. Mozdzen
Hi DuyHai, Zitat von DuyHai Doan : Hello Peter I'm working with Peter and am the one initiating the table creation in my code. For safe concurrent table creation, use CREATE TABLE xxx IF NOT EXISTS. It unfortunately, my code already has the "IF NOT EXISTS" clause in the create stateme

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-27 Thread Jens-Uwe Mozdzen
Hi Eric, Zitat von Eric Stevens : Be careful with creating many dynamically created column families unless you're cleaning up old ones to keep the total number of CF's reasonable. Having many column families will increase memory pressure and reduce overall performance. will "inactive" CFs be r

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-27 Thread Eric Stevens
Be careful with creating many dynamically created column families unless you're cleaning up old ones to keep the total number of CF's reasonable. Having many column families will increase memory pressure and reduce overall performance. On Thu Nov 27 2014 at 8:19:35 AM DuyHai Doan wrote: > Hello

Re: Data synchronization between 2 running clusters on different availability zone

2014-11-27 Thread Eric Stevens
There's no reason you can't run on multiple cloud providers as long as you treat them as logically distinct DC's. It should largely work the same way as running in several AWS regions, but you'll need to use something like GossipingPropertyFileSnitch because the EC2 snitches are specific to AWS.

Re: Storing time-series and geospatial data in C*

2014-11-27 Thread Jack Krupansky
How you store the data will be mostly a matter of how you wish to access the data after it is stored. IOW, what kinds of queries or batch processing and how you intend to sequence through the data. And also what categories of “warmth” you intend to maintain, especially for data to be queried mos

Re: multiple threads updating result in TransportException

2014-11-27 Thread Eric Stevens
A lot of people do a lot of multi-threaded work with Datastax Java Driver. It looks like you're using Cassandra Driver 2.0.0-RC2, might I suggest as a first step, at least upgrade to 2.0.0 final? RC2 wasn't even the final release candidate for 2.0.0. On Wed Nov 26 2014 at 8:44:07 AM Brian Tarbox

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-27 Thread DuyHai Doan
Hello Peter For safe concurrent table creation, use CREATE TABLE xxx IF NOT EXISTS. It will use light weight transaction and you'll have to pay some penalty in term of performance but at least the table creation will be linearizable Le 27 nov. 2014 14:26, "Peter Lange" a écrit : > Hi, > > We use

Column family ID mismatch-Error on concurrent schema modifications

2014-11-27 Thread Peter Lange
Hi, We use a four-node Cassandra-Cluster in Version 2.1.2. Our Client-Applications creates Tables dynamically. At one point two (or more) of our Clients connected to two (or more) different Cassandra-Nodes will create the same table simultaneously. We get the "Column family ID mismatch"-E

Re: Cassandra COPY to CSV and DateTieredCompactionStrategy

2014-11-27 Thread Paulo Ricardo Motta Gomes
Regarding the first question you need to configure your application to write to both CFs (old and new) during the migration phase. I'm not sure about the second question, but my guess is that only the writeTime will be taken into account. On Thu, Nov 27, 2014 at 10:54 AM, Batranut Bogdan wrote:

Cassandra COPY to CSV and DateTieredCompactionStrategy

2014-11-27 Thread Batranut Bogdan
Hello all, I have a few things that I need to understand. 1 . Here is the scenario: we have a HUGE cf where there are daily writes it is like a time series. Now we want to change the type of a column in primary key. What I think we can do is to export to csv, create the new table and write back

Storing time-series and geospatial data in C*

2014-11-27 Thread Spico Florin
Hello! Can you please recommend me some new articles and case studies were Cassandra was used to store time-series and geo-spatial data? I'm particular interested in best practices, data models and retrieval techniques. Thanks. Regards, Florin

RE: A question to adding a new data center

2014-11-27 Thread Lu, Boying
If the node-to-node encryption is enabled among all current connected DCs, how to add a new DC in this case? After adding the new DC’s public key into the trust store file, are the current connected DCs needed to be restart? Thanks Boying From: Mark Reddy [mailto:mark.l.re...@gmail.com] Sent

Re: Repair completes successfully but data is still inconsistent

2014-11-27 Thread André Cruz
On 26 Nov 2014, at 19:07, Robert Coli wrote: > > Yes. Do you know if 5748 was created as a result of compaction or via a flush > from a memtable? It was the result of a compaction: INFO [CompactionExecutor:22422] 2014-11-13 13:08:41,926 CompactionTask.java (line 262) Compacted 2 sstables to

Re: Cassandra backup via snapshots in production

2014-11-27 Thread Jens Rantil
Late answer; You can find my backup script here: https://gist.github.com/JensRantil/a8150e998250edfcd1a3 Basically you need to set S3_BUCKET, PGP_KEY_RECIPIENT, configure s3cmd (using s3cmd --configure) and then issue `./backup-keyspace.sh your-keyspace` to backup it to S3. We run the script i

RE: Cassandra backup via snapshots in production

2014-11-27 Thread Ngoc Minh VO
Thanks a lot for your answers! What we plan to do is: - auto_snapshot = true - if the human errors happened on D-5: o we will bring the cluster offline o purge all data o import snapshots prior D-5 (and delete snapshots after D-5) o upload all missing data between D

Re: Data synchronization between 2 running clusters on different availability zone

2014-11-27 Thread Spico Florin
Hello! I have another question. What about the following scenario: two Cassandra instances installed on different cloud providers (EC2, Flexiant)? How do you synchronize them? Can you use some internal tools or do I have to implement my own mechanism? Thanks. Florin On Thu, Nov 27, 2014 at 11:

Re: Data synchronization between 2 running clusters on different availability zone

2014-11-27 Thread Spico Florin
Hello, Rob! Thank you very much for the detailed support. Regards, Florin On Wed, Nov 26, 2014 at 12:41 AM, Robert Coli wrote: > On Tue, Nov 25, 2014 at 7:09 AM, Spico Florin > wrote: > >> 1. For ensuring high availability I would like to install one Cassandra >> cluster on one availability