Re: Which hector version is suitable for cassandra 2.0.6 ?

2014-03-28 Thread ssiv...@gmail.com
Hello, DS JD On 03/27/2014 01:06 PM, DE VITO Dominique wrote: Hi, -Message d'origine- De : ssiv...@gmail.com [mailto:ssiv...@gmail.com] Envoyé : jeudi 27 mars 2014 10:41 À : user@cassandra.apache.org Objet : Re: Which hector version is suitable for cassandra 2.0.6 ? On 03/27/2014

RE: Question about rpms from datastax

2014-03-28 Thread Romain HARDOUIN
cassandra*.noarch.rpm - Install Cassandra Only dsc*.noarch.rpm - DSC stands for DataStax Community. Install Cassandra + OpsCenter Donald Smith donald.sm...@audiencescience.com a écrit sur 27/03/2014 20:36:57 : De : Donald Smith donald.sm...@audiencescience.com A : 'user@cassandra.apache.org'

Re: Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Jon Forrest
On 3/28/2014 8:20 AM, Michael Shuler wrote: # rpm -iv cassandra12-1.2.15-1.noarch.rpm error: Failed dependencies: java = 1.6.0 is needed by cassandra12-1.2.15-1.noarch This properly indicates the missing dependency, which is not installed, nor provided by a package in your 'rpm -i

Re: (SOLVED) Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Tyler Hobbs
On Fri, Mar 28, 2014 at 11:48 AM, Colin colpcl...@gmail.com wrote: OpenJDK will crash under load whilst running Cassandra. That's definitely the case for OpenJDK 6, but 7 *should* be okay. However, most people are running the Oracle JRE (even for 7), so there's not a ton of evidence out there

How to tear down an EmbeddedCassandraService in unit tests?

2014-03-28 Thread Clint Kelly
All, I have a question about how to use the EmbeddedCassandraService in unit tests. I wrote a short collection of unit tests here: https://github.com/wibiclint/cassandra-java-driver-keyspaces I'm trying to start up a new EmbeddedCassandraService for each unit test. I looked at the Cassandra

(SOLVED) Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Jon Forrest
In a previous message I described my guess at what was causing the Datastax Cassandra installation to require OpenJDK. Using the method I describe below, I'm now able to install the Datastax Cassandra rpm. Note that I have no idea (yet) whether Cassandra actually runs, but at least it installs.

Re: (SOLVED) Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Colin
OpenJDK will crash under load whilst running Cassandra. -- Colin +1 320 221 9531 On Mar 28, 2014, at 4:11 PM, Jon Forrest jon.forr...@xoom.com wrote: In a previous message I described my guess at what was causing the Datastax Cassandra installation to require OpenJDK. Using the

Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Russ Lavoie
We are using cassandra 1.2.10 (With JNA installed) on ubuntu 12.04.3 and are running our instances in Amazon Web Services. What I am trying to do. Our cassandra systems data is on an EBS volume so we can take snapshots of the data and create volumes based on those snapshots and restore them

Re: Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Blair Zajac
On Mar 27, 2014, at 2:16 PM, Michael Dykman mdyk...@gmail.com wrote: Java on linux has *always* been a hassle. Recently, installing ant via apt-get on an active ubuntu still want to yank in components of GCJ shudder. Back to the tar-ball. For Ubuntu and Debian, I use the webupd8team

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Robert Coli
On Fri, Mar 28, 2014 at 11:15 AM, Russ Lavoie ussray...@yahoo.com wrote: We are using cassandra 1.2.10 (With JNA installed) on ubuntu 12.04.3 and are running our instances in Amazon Web Services. Our cassandra systems data is on an EBS volume Best practice for Cassandra on AWS is to run

Re: How to tear down an EmbeddedCassandraService in unit tests?

2014-03-28 Thread Andre Sprenger
We are using version 1.2.4 and it is difficult to shutdown the embedded version. But you don't have to. Just check in each test setup method if embedded Cassandra is already running and start it if necessary. Than create keyspaces/tables in setup methods and drop them in teardown methods. For us

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Russ Lavoie
Thank you for your quick response. Is there a way to tell when a snapshot is completely done? On Friday, March 28, 2014 1:30 PM, Robert Coli rc...@eventbrite.com wrote: On Fri, Mar 28, 2014 at 11:15 AM, Russ Lavoie ussray...@yahoo.com wrote: We are using cassandra 1.2.10 (With JNA

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Laing, Michael
In your step 4, be sure you create a consistent EBS snapshot. You may have pieces of your sstables that have not actually been flushed all the way to EBS. See https://github.com/alestic/ec2-consistent-snapshot ml On Fri, Mar 28, 2014 at 3:21 PM, Russ Lavoie ussray...@yahoo.com wrote: Thank

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Robert Coli
On Fri, Mar 28, 2014 at 12:21 PM, Russ Lavoie ussray...@yahoo.com wrote: Thank you for your quick response. Is there a way to tell when a snapshot is completely done? IIRC, the JMX call blocks until the snapshot completes. It should be done when nodetool returns. =Rob

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Russ Lavoie
Robert, That is what I thought as well.  But apparently something is happening.  The only way I can get away with doing this is adding a sleep 60 right after the nodetool snapshot is executed.  I can reproduce this 100% of the time by not issuing a sleep after nodetool snapshot. This is the

StatusLogger output help

2014-03-28 Thread Tom van den Berge
Hi, In my cassandra logs, I see a lot of StatusLogger output lines. I'm trying to understand why this is logged, and how to interpret the output. Maybe someone can point me to some documentation on this particular logging aspect? I would like to know what is triggering the StatusLogger.java to

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Jonathan Haddad
I have a nagging memory of reading about issues with virtualization and not actually having durable versions of your data even after an fsync (within the VM). Googling around lead me to this post: http://petercai.com/virtualization-is-bad-for-database-integrity/ It's possible you're hitting this

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Jonathan Haddad
I will +1 the recommendation on using tablesnap over EBS. S3 is at least predictable. Additionally, from a practical standpoint, you may want to back up your sstables somewhere. If you use S3, it's easy to pull just the new tables out via aws-cli tools (s3 sync), to your remote, non-aws server,

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Laing, Michael
As I tried to say, EBS snapshots require much care or you get corruption such as you have encountered. Does Cassandra quiesce the file system after a snapshot using fsfreeze or xfs_freeze? Somehow I doubt it... On Fri, Mar 28, 2014 at 4:17 PM, Jonathan Haddad j...@jonhaddad.com wrote: I have

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Laing, Michael
+1 for tablesnap On Fri, Mar 28, 2014 at 4:28 PM, Jonathan Haddad j...@jonhaddad.com wrote: I will +1 the recommendation on using tablesnap over EBS. S3 is at least predictable. Additionally, from a practical standpoint, you may want to back up your sstables somewhere. If you use S3,

Re: Cassandra Snapshots giving me corrupted SSTables in the logs

2014-03-28 Thread Jonathan Haddad
Another thing to keep in mind is that if you are hitting the issue I described, waiting 60 seconds will not absolutely solve your problem, it will only make it less likely to occur. If a memtable has been partially flushed at the 60 second mark you will end up with the same corrupt sstable. On

Read performance in map data type

2014-03-28 Thread Apoorva Gaurav
Hello All, We've a schema which can be modeled as (studentID, subjectID, marks) where combination of studentID and subjectID is unique. Number of studentID can go up to 100 million and for each studentID we can have up to 10k subjectIDs. We are using apahce cassandra 2.0.4 and datastax java

Re: Read performance in map data type

2014-03-28 Thread Shrikar archak
Hi Apoorva, I assume this is the table with studentId and subjectId as primary keys and not other like like marks in that. create table marks_table(studentId int, subjectId int, marks int, PRIMARY KEY(studentId,subjectId)); Also could you give the cfhistogram stats? nodetool cfhistograms your

Timeuuid inserted with now(), how to get the value back in Java client?

2014-03-28 Thread Andy Atj2
I'm writing a Java client to a Cassandra db. One of the main primary keys is a timeuuid. I plan to do INSERTs using now() and have Cassandra generate the value of the timeuuid. After the INSERT, I need the Cassandra-generated timeuuid value. Is there an easy wsay to get it, without having to

Re: Read performance in map data type

2014-03-28 Thread Apoorva Gaurav
Hello Shrikar, Yes primary key is (studentID, subjectID). I had dropped the test table, recreating and populating it post which will share the cfhistogram. In such case is there any practical limit on the rows I should fetch, for e.g. should I do select * form marks_table where studentID =