persistent compaction issue (1.1.4 and 1.1.5)

2012-09-17 Thread Michael Kjellman
Hi All, I have an issue where each one of my nodes (currently all running at 1.1.5) is reporting around 30,000 pending compactions. I understand that a pending compaction doesn't necessarily mean it is a scheduled task; however, I'm confused about why this behavior is occurring. It is the same on all

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-18 Thread Michael Kjellman
for the numbers to start to rise? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 18/09/2012, at 7:39 AM, Michael Kjellman mkjell...@barracuda.com wrote: Hi All, I have an issue where each one of my nodes (currently all

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-18 Thread Michael Kjellman
: What Compaction Strategy are you using? Are there any errors in the logs? If you restart a node how long does it take for the numbers to start to rise? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 18/09/2012, at 7:39 AM, Michael Kjellman

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-18 Thread Michael Kjellman
(over 100K in a single directory) and a lot of thinking for the compaction executor to decide what to compact next. I can see numbers like 5K-10K and still think this is a high number. If I were you, I'd increase sstable_size_in_mb to 10-20 times what it is now. 2012/9/17 Michael Kjellman mkjell

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-18 Thread Michael Kjellman
be potentially stopping compactions? Will compactions still happen if there are different versions of the schema? On 9/18/12 11:38 AM, Michael Kjellman mkjell...@barracuda.com wrote: Thanks, I just modified the schema on the worst offending column family (as determined by the .json) from 10MB

Re: Solr Use Cases

2012-09-19 Thread Michael Kjellman
If I were you I would look into ElasticSearch unless you are okay updating the search cache very infrequently. I tried Solandra vs ElasticSearch in our use case and there was no contest. Also, Cassandra is great for writes but not as optimized for reads. Honestly, it all depends on your use

Re: Losing keyspace on cassandra upgrade

2012-09-19 Thread Michael Kjellman
Sounds like you are losing your system keyspace. When you say nothing important changed between yaml files do you mean with or without your changes? Did your data directories change in the migration? Permissions okay? I've done a 1.1.1 to 1.1.5 upgrade on many of my nodes without issue. On

Re: Setting the default replication factor for Solandra cores

2012-09-19 Thread Michael Kjellman
If I recall correctly you should make those changes in the schema through the CLI. I never ended up running Solandra in production though so I'm not sure if anyone else has better options. Why is the CLI not enough? On Sep 19, 2012, at 5:56 AM, Safdar Kureishy

Re: Data Modeling - JSON vs Composite columns

2012-09-19 Thread Michael Kjellman
Client code. CQL will only deserialize composites as you mention in A. On Sep 19, 2012, at 5:01 AM, Roshni Rajagopal roshni_rajago...@hotmail.com wrote: Hi, There was a conversation on this some time earlier, and to continue it Suppose I want to associate a

Re: Losing keyspace on cassandra upgrade

2012-09-19 Thread Michael Kjellman
ring restart was necessary. This was on Cass 1.1.1. Cheers, Edward On 12-09-19 08:12 AM, Michael Kjellman wrote: Sounds like you are losing your system keyspace. When you say nothing important changed between yaml files do you mean with or without your changes? Did your data directories change

Re:

2012-09-19 Thread Michael Kjellman
A few questions: what version of 1.1 are you running? What version of Hadoop? What is your job config? What is the buffer size you've chosen? How much data are you dealing with? On Sep 19, 2012, at 7:23 PM, Manu Zhang owenzhang1...@gmail.com wrote: I've been bulk loading data into Cassandra

Re:

2012-09-19 Thread Michael Kjellman
://www.datastax.com/dev/blog/bulk-loading#comment-127019; buffer size is 64 MB as in the example; I'm dealing with about 1GB data. job config, you mean? On Thu, Sep 20, 2012 at 10:32 AM, Michael Kjellman mkjell...@barracuda.com wrote: A few questions: what version

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-19 Thread Michael Kjellman
SSTables to level0. Best regards, Vitalii Tymchyshyn 2012/9/19 Michael Kjellman mkjell...@barracuda.com Potentially the pending compactions are a symptom and not the root cause/problem. When updating a 3rd column family with a larger sstable_size_in_mb it looks like

Re: Using the commit log for external synchronization

2012-09-20 Thread Michael Kjellman
+1. Would be a pretty cool feature. Right now I write once to cassandra and once to kafka. On 9/20/12 4:13 PM, Data Craftsman 木匠 database.crafts...@gmail.com wrote: This will be a good new feature. I guess the development team doesn't have time for this yet. ;) On Thu, Sep 20, 2012 at 1:29 PM,

Re: persistent compaction issue (1.1.4 and 1.1.5)

2012-09-20 Thread Michael Kjellman
Ended up switching the biggest offending column families back to size tiered compaction and pending compactions across the cluster dropped to 0 very quickly. On Sep 19, 2012, at 10:55 PM, Michael Kjellman mkjell...@barracuda.com wrote: After changing my sstable_size_in_mb as recommended my pending

Re: Is Cassandra right for me?

2012-09-21 Thread Michael Kjellman
Brisk is no longer actively developed by the original author or Datastax. It was left up for the community. https://github.com/steeve/brisk has a fork that is supposedly compatible with the 1.0 API. You're more than welcome to fork that and make it work with 1.1 :) DSE != (Cassandra + Brisk) From:

Re: Cassandra vs Couchbase benchmarks

2012-10-01 Thread Michael Kjellman
From their wiki: The replication is an incremental one way process involving two databases (a source and a destination). The aim of the replication is that at the end of the process, all active documents on the source database are also in the destination database and all documents that were

Perlcassa - Perl Cassandra 'Client'

2012-10-11 Thread Michael Kjellman
Hi- A few months back I wrote a Perl client for Cassandra and I realized I never sent it out to this list. While I realize that Perl is not the language du jour, hopefully this will help someone else out. :) Code is periodically thrown up on CPAN but look at

Re: Why my Cassandra is compacting like mad

2012-10-15 Thread Michael Kjellman
I had a similar bug with 1.1.5 but I couldn't reproduce it so I didn't file a bug. I did a rolling restart of my nodes and things went back to normal. From: Manu Zhang owenzhang1...@gmail.com Reply-To: user@cassandra.apache.org

Re: Why my Cassandra is compacting like mad

2012-10-15 Thread Michael Kjellman
And to clarify my reply, this was a loop in compactions on system-schema_columns specifically. From: Michael Kjellman mkjell...@barracuda.com Reply-To: user@cassandra.apache.org

Re: [RELEASE] Apache Cassandra 1.1.6 released

2012-10-15 Thread Michael Kjellman
Sylvain - Getting a 404 still. I assume this is due to a delay in syncing out to Apache content mirrors? On 10/15/12 9:46 AM, Sylvain Lebresne sylv...@datastax.com wrote: The Cassandra team is pleased to announce the release of Apache Cassandra version 1.1.6. Cassandra is a highly scalable

Re: Using Cassandra to store binary files?

2012-10-16 Thread Michael Kjellman
When we started with Cassandra almost 2 years ago in production originally it was for the sole purpose of storing blobs in a redundant way. I ignored the warnings as my own tests showed it would be okay (and two years later it is ok). If you plan on using Cassandra later (as we now as as features

Re: what happens while node is bootstrapping?

2012-10-16 Thread Michael Kjellman
Correct. Also, there is a new feature in 1.1+ that lets you play with live traffic on new nodes before they actually join the ring http://www.datastax.com/dev/blog/whats-new-in-cassandra-1-1-live-traffic-sampling On 10/16/12 9:42 AM, Andrey Ilinykh ailin...@gmail.com wrote: No. The

Re: Using Cassandra to store binary files?

2012-10-16 Thread Michael Kjellman
advantage vs. storing each file in one row….well at least from my understanding so read performance should be really really good in that case. Dean From: Michael Kjellman mkjell...@barracuda.com Reply-To: user@cassandra.apache.org user

EOFException with BulkOutputFormat in 1.1.6

2012-10-17 Thread Michael Kjellman
I'm getting EOFExceptions with BulkOutputFormat 2012-10-17 12:23:01,182 ERROR org.apache.cassandra.concurrent.DebuggableThreadPoolExecutor: Error in ThreadPoolExecutor java.lang.RuntimeException: java.io.EOFException at org.apache.cassandra.utils.FBUtilities.unchecked(FBUtilities.java:628) at

Re: EOFException with BulkOutputFormat in 1.1.6

2012-10-17 Thread Michael Kjellman
Apologies - looks like this is already being tracked in https://issues.apache.org/jira/browse/CASSANDRA-4813 From: Michael Kjellman mkjell...@barracuda.com Reply-To: user@cassandra.apache.org

Re: hadoop consistency level

2012-10-18 Thread Michael Kjellman
Unless you have Brisk (however as far as I know there was one fork that got it working on 1.0 but nothing for 1.1 and is not being actively maintained by Datastax) or go with CFS (which comes with DSE) you are not guaranteed all data is on that hadoop node. You can take a look at the forks if

Re: hadoop consistency level

2012-10-18 Thread Michael Kjellman
DataStax Enterprise service... On Thu, Oct 18, 2012 at 3:00 PM, Michael Kjellman mkjell...@barracuda.com wrote: Unless you have Brisk (however as far as I know there was one fork that got it working on 1.0 but nothing for 1.1 and is not being actively maintained by Datastax) or go with CFS (which

Re: hadoop consistency level

2012-10-18 Thread Michael Kjellman
. On 10/18/12 1:15 PM, Andrey Ilinykh ailin...@gmail.com wrote: On Thu, Oct 18, 2012 at 12:00 PM, Michael Kjellman mkjell...@barracuda.com wrote: Unless you have Brisk (however as far as I know there was one fork that got it working on 1.0 but nothing for 1.1 and is not being actively maintained

Re: hadoop consistency level

2012-10-18 Thread Michael Kjellman
. If you don't specify a CL in your job config it will default to ONE (at least that's what my read of the ConfigHelper source for 1.1.6 shows) On 10/18/12 1:29 PM, Andrey Ilinykh ailin...@gmail.com wrote: On Thu, Oct 18, 2012 at 1:24 PM, Michael Kjellman mkjell...@barracuda.com wrote: Well

Re: Replication factor and performance questions

2012-11-05 Thread Michael Kjellman
Rule of thumb is to try to keep nodes under 400GB. Compactions/Repairs/Move operations etc become a nightmare otherwise. How much data do you expect to have on each node? Also depends on caches, bloom filters etc On 11/5/12 8:57 AM, Oleg Dulin oleg.du...@gmail.com wrote: I have 4 nodes at my

Re: Single Node Cassandra Installation

2012-11-05 Thread Michael Kjellman
Should be fine if one node can deal with your read and write load. Switching from SimpleStrategy to RackAware can be a pain. That's a potential growth point way down the line (if you ever have your nodes on different switches). You might want to just set up your keyspace as RackAware if you intend

Re: [BETA RELEASE] Apache Cassandra 1.2.0-beta2 released

2012-11-10 Thread Michael Kjellman
Ohh lala. Any documentation yet? On Nov 10, 2012, at 6:27 PM, Edward Capriolo edlinuxg...@gmail.com wrote: just a note for all. The default partitioner is no longer randompartitioner. It is now murmur, and the token range starts in negative numbers. So you don't

Re: SchemaDisagreementException

2012-11-19 Thread Michael Kjellman
While this might not be helpful (I don't have all the thread history here), have you checked that all your servers are properly synced with NTP? From: Everton Lima peitin.inu...@gmail.com Reply-To: user@cassandra.apache.org

Re: continue seeing Finished hinted handoff of 0 rows to endpoint

2012-11-24 Thread Michael Kjellman
Some people (myself included) have seen issues when upgrading from 1.1.2 to 1.1.6 with tombstoned rows in the HintsColumnFamily. Some (mysql included) have fixed this by doing a nodetool scrub system HintsColumnFamily -mike On 11/24/12 7:37 AM, Chuan-Heng Hsiao hsiao.chuanh...@gmail.com wrote:

Re: continue seeing Finished hinted handoff of 0 rows to endpoint

2012-11-24 Thread Michael Kjellman
Sorry, auto correct changed 'myself' to 'mysql'. Sorry for the confusion... On Nov 24, 2012, at 9:29 AM, Michael Kjellman mkjell...@barracuda.com wrote: Some people (myself included) have seen issues when upgrading from 1.1.2 to 1.1.6 with tombstoned rows in the HintsColumnFamily. Some

Re: Java high-level client

2012-11-27 Thread Michael Kjellman
Netflix has a great client https://github.com/Netflix/astyanax On 11/27/12 7:40 AM, Peter Lin wool...@gmail.com wrote: I use hector-client master, which is pretty stable right now. It uses the latest thrift, so you can use hector with thrift 0.9.0. That's assuming you don't mind using the

Re: for a school project

2012-11-27 Thread Michael Kjellman
*because *which *be *with Please see: http://en.wikipedia.org/wiki/Codd's_12_rules http://wiki.apache.org/cassandra/DataModel That should get you going for your school report. If you have more specific questions about terms on the wiki please feel free to ask. On 11/27/12 4:02 AM,

Re: counters + replication = awful performance?

2012-11-27 Thread Michael Kjellman
Are you writing with QUORUM consistency or ONE? On 11/27/12 9:52 AM, Sergey Olefir solf.li...@gmail.com wrote: Hi Juan, thanks for your input! In my case, however, I doubt this is the case -- clients are able to push many more updates than I need to saturate replication_factor=2 case (e.g. I'm

Re: Java high-level client

2012-11-28 Thread Michael Kjellman
-level client +1 On Tue, Nov 27, 2012 at 10:10 AM, Michael Kjellman mkjell...@barracuda.com wrote: Netflix has a great client https://github.com/Netflix/astyanax

Re: Java high-level client

2012-11-28 Thread Michael Kjellman
at 2:51 PM, Michael Kjellman mkjell...@barracuda.com wrote: Lots of example code, a nice API, and good performance are the first things that come to mind for why I like Astyanax better than Hector From: Andrey Ilinykh ailin...@gmail.com Reply-To: user

progress of cleanup operations

2012-11-29 Thread Michael Kjellman
I know there is very little insight currently into long running JMX operations but has anyone figured out any clever way to determine approximate progress on cleanup operations? Merci. Best, Michael

Re: progress of cleanup operations

2012-11-29 Thread Michael Kjellman
operation, so you would estimate in terms of how long it would take to go through all of those. On Nov 29, 2012, at 10:37 AM, Michael Kjellman mkjell...@barracuda.com wrote: I know there is very little insight currently into long running JMX operations but has anyone figured out any clever way

Re: progress of cleanup operations

2012-11-29 Thread Michael Kjellman
that to the number of total sstables in your data dir for that cf for a rough percentage of sstables that have been processed. Thoughts? On 11/29/12 10:36 AM, Michael Kjellman mkjell...@barracuda.com wrote: compactionstats does not show the overall progress it has made on cleanup operations
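The rough percentage suggested above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it treats every file ending in `-Data.db` in the column family's data directory as one sstable, and it leaves the count of already-processed sstables to the reader, since the preview does not say where that number comes from. The function names are hypothetical.

```python
import os


def count_sstables(cf_dir):
    """Count sstables in a column family directory.

    Assumption: each sstable has exactly one component file whose
    name ends in "-Data.db".
    """
    return sum(1 for name in os.listdir(cf_dir) if name.endswith("-Data.db"))


def cleanup_progress(processed, total):
    """Return a rough percentage of sstables already processed by cleanup."""
    if total <= 0:
        return 0.0
    # Cap at 100 because compaction can change the sstable count mid-run.
    return min(100.0, 100.0 * processed / total)
```

The result is an estimate at best: sstables differ in size, and the total can shift while cleanup runs.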

Re: Freeing up disk space on Cassandra 1.1.5 with Size-Tiered compaction.

2012-12-06 Thread Michael Kjellman
+1 On Dec 6, 2012, at 10:06 PM, Wei Zhu wz1...@yahoo.com wrote: I think Aaron meant 300-400GB instead of 300-400MB. Thanks. -Wei - Original Message - From: Wade L Poziombka wade.l.poziom...@intel.com To: user@cassandra.apache.org Sent: Thursday, December 6, 2012 6:53:53 AM

Re: Vnode migration path

2012-12-11 Thread Michael Kjellman
ranges around to migrate to vnodes. Richard. On 11 December 2012 08:47, Michael Kjellman mkjell...@barracuda.com wrote: So I'm wondering if anyone has given thought to their migration path to Vnodes. Other than having a separate cluster and migrating the data from

Re: State of Cassandra and Java 7

2012-12-13 Thread Michael Kjellman
Works just fine for us. On 12/13/12 11:43 AM, Drew Kutcharian d...@venarc.com wrote: Hey Guys, With Java 6 being EOL-ed soon (https://blogs.oracle.com/java/entry/end_of_public_updates_for), what's the status of Cassandra's Java 7 support? Anyone using it in production? Any outstanding *known*

Re: State of Cassandra and Java 7

2012-12-15 Thread Michael Kjellman
What issues have you run into? Actually curious because we push 1.1.5-7 really hard and have no issues whatsoever. On Dec 15, 2012, at 7:51 AM, Brian Tarbox tar...@cabotresearch.com wrote: We've reverted all machines back to Java 6 after running into numerous

Re: State of Cassandra and Java 7

2012-12-16 Thread Michael Kjellman
:54 AM, Sylvain Lebresne sylv...@datastax.com wrote: On Sat, Dec 15, 2012 at 7:12 PM, Michael Kjellman mkjell...@barracuda.com wrote: What issues have you run into? Actually curious because we push 1.1.5-7 really hard and have no issues whatsoever. A related question is which

Re: Partition maintenance

2012-12-18 Thread Michael Kjellman
You could make a column family for each period of time and then drop the column family when you want to destroy it. Before you drop it you could use the sstabletojson converter and write the json files out to tape. Might make your life difficult however if you need an input split for map

Re: Partition maintenance

2012-12-18 Thread Michael Kjellman
a secondary index for that, I would assume. From: Michael Kjellman [mailto:mkjell...@barracuda.com] Sent: Tuesday, December 18, 2012 11:15 AM To: user@cassandra.apache.org Subject: Re: Partition maintenance You could make a column family for each period of time

Re: Monitoring the number of client connections

2012-12-18 Thread Michael Kjellman
netstat + cron is your friend at this point in time. On Dec 18, 2012, at 8:25 PM, aaron morton aa...@thelastpickle.com wrote: AFAIK the connection count is not exposed. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton
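As one way to act on the netstat + cron suggestion, the Python sketch below counts established client connections from text in the column layout that Linux `netstat -tn` prints (protocol, two queue columns, local address, foreign address, state). The default port 9160 and the function name are assumptions for illustration; adjust the port to whatever your clients actually connect to.

```python
def count_established(netstat_output, port=9160):
    """Count ESTABLISHED TCP connections whose local address ends in :port.

    Assumes the six-column layout of Linux `netstat -tn` output.
    """
    suffix = ":%d" % port
    count = 0
    for line in netstat_output.splitlines():
        fields = line.split()
        # Skip headers and anything that is not a TCP connection row.
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue
        local_address, state = fields[3], fields[5]
        if local_address.endswith(suffix) and state == "ESTABLISHED":
            count += 1
    return count
```

A cron job could capture the netstat output at a fixed interval, pass it to this function, and append the result to a log or send it to a metrics system.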

Re: Fixing the schema for a Column Family

2012-12-28 Thread Michael Kjellman
I've found that if you drop a column family, the data is still there/snapshotted. If you recreate the column family as expected the data will repopulate the cf. From: Charles Lamanna char...@metricshub.com Reply-To:

Re: Cassadra API for Java

2012-12-28 Thread Michael Kjellman
This was asked as recently as one month + 1 day ago, btw: http://grokbase.com/t/cassandra/user/12bve4d8e8/java-high-level-client has a longer discussion, in case you weren't subscribed to the group at the time. From: Baskar Sikkayan techba...@gmail.com Reply-To:

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
I'm having huge upgrade issues from 1.1.7 -> 1.2.0 atm but in a 12 node cluster which I am slowly massaging into a good state I haven't seen this in 15+ hours of operation… This looks related to JNA? From: Alain RODRIGUEZ arodr...@gmail.com Reply-To:

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
? Does it? The only thing logged about JNA is the following: JNA mlockall successful. What does this line *** java.lang.instrument ASSERTION FAILED ***: !errorOutstanding with message transform method call failed at ../../../src/share/instrument/JPLISAgent.c line: 806 mean? 2013/1/3 Michael

Re: num_tokens - virtual nodes

2013-01-04 Thread Michael Kjellman
http://www.mail-archive.com/user@cassandra.apache.org/msg26528.html From: Alain RODRIGUEZ arodr...@gmail.com Reply-To: user@cassandra.apache.org Date: Friday, January 4, 2013 6:00

Re: Specifying initial token in 1.2 fails

2013-01-04 Thread Michael Kjellman
Murmur3 != MD5 (RandomPartitioner) From: Dwight Smith dwight.sm...@genesyslab.com Reply-To: user@cassandra.apache.org Date: Friday, January 4, 2013 8:36 AM To:

Re: Specifying initial token in 1.2 fails

2013-01-04 Thread Michael Kjellman
token method supported? If so – how should these be generated, the token-generator seems to only apply to RandomPartitioner. Thanks again From: Michael Kjellman [mailto:mkjell...@barracuda.com] Sent: Friday, January 04, 2013 8:39 AM To: user@cassandra.apache.org
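For the token question above, a minimal Python sketch of evenly spaced initial tokens follows. It assumes the commonly documented token ranges: a signed 64-bit range (-2^63 to 2^63-1) for Murmur3Partitioner and 0 to 2^127 for RandomPartitioner. The function name is hypothetical and this is not a replacement for any tool shipped with Cassandra.

```python
def initial_tokens(node_count, partitioner="Murmur3Partitioner"):
    """Return evenly spaced initial tokens for a single-datacenter ring."""
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    if partitioner == "Murmur3Partitioner":
        # Assumed range: signed 64-bit, so the ring starts at -2**63.
        step = 2**64 // node_count
        return [step * i - 2**63 for i in range(node_count)]
    if partitioner == "RandomPartitioner":
        # Assumed range: 0 up to 2**127 (MD5-based tokens).
        step = 2**127 // node_count
        return [step * i for i in range(node_count)]
    raise ValueError("unsupported partitioner: %s" % partitioner)
```

Because the two ranges differ, tokens computed for one partitioner are not valid for the other, which is the point of "Murmur3 != MD5" earlier in this thread.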

Re: Cassandra / Windows Server 2008

2013-01-04 Thread Michael Kjellman
Use linux ;) More seriously, I'm wondering if it is binding to the IPv6 address? Is that enabled on that NIC? You could try disabling IPv6 and seeing if RPC binds correctly. From: stephen.m.thomp...@wellsfargo.com

Re: Cassandra 1.2 compatible Python bindings

2013-01-06 Thread Michael Kjellman
I know one outstanding issue is that CQL3 created column families won't be listed as CQL3 column families aren't exposed by the old thrift calls. I know work is in progress to fix this... On Jan 6, 2013, at 8:01 PM, aaron morton aa...@thelastpickle.com wrote: I'm

Re: help turning compaction..hours of run to get 0% compaction....

2013-01-07 Thread Michael Kjellman
Size tiered or leveled compaction? From: Brian Tarbox tar...@cabotresearch.com Reply-To: user@cassandra.apache.org Date: Monday, January 7, 2013 12:03 PM To:

Re: help turning compaction..hours of run to get 0% compaction....

2013-01-07 Thread Michael Kjellman
) insert once, read many, update-never. Brian On Mon, Jan 7, 2013 at 3:13 PM, Michael Kjellman mkjell...@barracuda.com wrote: Size tiered or leveled compaction? From: Brian Tarbox tar...@cabotresearch.com Reply-To: user

Re: help turning compaction..hours of run to get 0% compaction....

2013-01-07 Thread Michael Kjellman
in detail. Thanks. Brian On Mon, Jan 7, 2013 at 4:13 PM, Michael Kjellman mkjell...@barracuda.com wrote: http://www.datastax.com/dev/blog/when-to-use-leveled-compaction If you perform at least twice as many reads as you do writes, leveled compaction may actually save

Re: Script to load sstables from v1.0.x to v 1.1.x

2013-01-08 Thread Michael Kjellman
I thought this was to load between separate clusters not to upgrade within the same cluster. No? On Jan 8, 2013, at 11:29 AM, Rob Coli rc...@palominodb.com wrote: On Tue, Jan 8, 2013 at 8:41 AM, Todd Nine todd.n...@gmail.com wrote: I have recently been trying to restore backups from a v1.0.x

Re: Date Index?

2013-01-09 Thread Michael Kjellman
ElasticSearch is a nice option for ordered lists. In 2.0, triggers would make updates to ElasticSearch much easier; right now it's up to your application logic to detect changes and update. On Jan 9, 2013, at 7:55 AM, stephen.m.thomp...@wellsfargo.com

Re: change cluster name retaining keypsace

2013-01-09 Thread Michael Kjellman
I think Aaron meant /var/lib/cassandra (by default). Check there (unless you changed your data directories in your cassandra.yaml) On Jan 9, 2013, at 7:36 PM, Tim Dunphy bluethu...@gmail.com wrote: Hello, And thanks for your reply! Well so far it's just a single node.

Re: Starting Cassandra

2013-01-10 Thread Michael Kjellman
I've seen this with OpenJDK 7. Grab Java 7 u10 from Oracle and you should be good to go. From: Alain RODRIGUEZ arodr...@gmail.com Reply-To: user@cassandra.apache.org Date:

Re: inconsistent hadoop/cassandra results

2013-01-10 Thread Michael Kjellman
I found that overall Hadoop input/output from Cassandra could use a little more QA and input from the community. (Especially with large datasets). There were some serious BOF bugs in 1.1 that have been resolved in 1.2. (Yay!) But, the problems in 1.1 weren't immediately apparent. Testing in my

Re: Murmur3Partitioner cassandra 1.2

2013-01-11 Thread Michael Kjellman
There is no migration path from RandomPartitioner to Murmur3 On Jan 11, 2013, at 11:12 AM, Tim Dunphy bluethu...@gmail.com wrote: Hello, Will I have to recreate my schema in order to use the Murmur3Partitioner ? Thanks Tim -- GPG me!! gpg --keyserver

Re: java exception on start

2013-01-12 Thread Michael Kjellman
https://issues.apache.org/jira/browse/CASSANDRA-4916 Fixed for 1.2.1! From: Tim Dunphy bluethu...@gmail.com Reply-To: user@cassandra.apache.org Date: Saturday, January 12, 2013

Re: Cassandra 1.1.2 - 1.1.8 upgrade

2013-01-16 Thread Michael Kjellman
upgradesstables is safe, but it is essentially compaction (because sstables are immutable it rewrites the sstable in the new format) so you'll want to do it when traffic is low to avoid IO issues. upgradesstables always needs to be done between majors. While 1.1.2 -> 1.1.8 is not a major, due

Re: unsubscribe

2013-01-16 Thread Michael Kjellman
Writing to the list: user@cassandra.apache.org
Subscription address: user-subscr...@cassandra.apache.org
Digest subscription address: user-digest-subscr...@cassandra.apache.org
Unsubscription addresses: user-unsubscr...@cassandra.apache.org
Getting help with the list

Re: Pig / Map Reduce on Cassandra

2013-01-16 Thread Michael Kjellman
Brisk is pretty much stagnant. I think someone forked it to work with 1.0 but not sure how that is going. You'll need to pay for DSE to get CFS (which is essentially Brisk) if you want to use any modern version of C*. Best, Michael On 1/16/13 11:17 AM, cscetbon@orange.com

Re: BulkOutputFormat

2013-01-17 Thread Michael Kjellman
https://issues.apache.org/jira/browse/CASSANDRA-4813 Fixed in 1.2.0 Best, michael From: chandra Varahala hadoopandcassan...@gmail.com Reply-To: user@cassandra.apache.org

Re: BulkOutputFormat

2013-01-17 Thread Michael Kjellman
chandra On Thu, Jan 17, 2013 at 1:50 PM, Michael Kjellman mkjell...@barracuda.com wrote: https://issues.apache.org/jira/browse/CASSANDRA-4813 Fixed in 1.2.0 Best, michael From: chandra Varahala hadoopandcassan...@gmail.com Reply

Re: Cassandra source code explained

2013-01-22 Thread Michael Kjellman
http://wiki.apache.org/cassandra/ArchitectureInternals From: Radek Gruchalski radek.gruchal...@portico.io Reply-To: user@cassandra.apache.org Date: Tuesday, January 22,

Re: Unavaliable Exception

2013-01-25 Thread Michael Kjellman
More nodes! On Jan 25, 2013, at 7:21 AM, Alain RODRIGUEZ arodr...@gmail.com wrote: fwiw, I have a mixed ubuntu 11.10 / 12.04 6 node cluster (AWS m1.xlarge). The load average is always between 0 and 5 for 11.10 nodes while 12.04 nodes shows all the time a load between

Re: Issue when deleting Cassandra rowKeys.

2013-01-26 Thread Michael Kjellman
What is your gc_grace set to? Are your findings before or after this time after the deletion? From: Kasun Weranga kas...@wso2.com Reply-To: user@cassandra.apache.org Date: Saturday,

Re: ConfigHelper.setThriftContact() undefined in cassandra v1.2

2013-01-29 Thread Michael Kjellman
Pretty sure you are looking for something like:
    // thrift input job settings
    ConfigHelper.setInputRpcPort(job.getConfiguration(), "9160");
    ConfigHelper.setInputInitialAddress(job.getConfiguration(), "127.0.0.1");
    ConfigHelper.setInputPartitioner(job.getConfiguration(), "RandomPartitioner");
    // thrift

Re: How Cassandra guarantees the replicas if any node is down?

2013-01-29 Thread Michael Kjellman
To get started, look at: HintedHandoff: http://wiki.apache.org/cassandra/HintedHandoff Operations: http://wiki.apache.org/cassandra/Operations (specifically repair and repair -pr operations) There should be a ton of information on this you can easily Google. Best, Michael From: dong.yajun

Re: Is there any way to fetch all data efficiently from a column family?

2013-01-29 Thread Michael Kjellman
How often do you need to do this? How many rows in your column families? If it's not a frequent operation you can just page the data n number of rows at a time using nothing special but C* and a driver. Or another option is you can write a map/reduce job if you need an entire cf to be an input
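The "page the data n rows at a time" approach can be sketched generically in Python. Everything here is an assumption for illustration: `fetch_page` is a hypothetical callable standing in for whatever range query your driver provides, and the sketch assumes that query returns rows in a stable order and treats the start key as inclusive, so the first row of each later page repeats the last row of the previous one.

```python
def iter_all_rows(fetch_page, page_size=100):
    """Yield every (key, columns) pair by walking the data page by page.

    fetch_page(start_key, count) is a hypothetical callable returning up
    to `count` (key, columns) pairs starting at `start_key` inclusive;
    a start_key of None means "from the beginning".
    """
    if page_size < 2:
        # With an inclusive start key, a page of 1 would never advance.
        raise ValueError("page_size must be at least 2")
    start_key = None
    while True:
        page = fetch_page(start_key, page_size)
        fetched = len(page)
        # Drop the row we already returned at the end of the last page.
        if start_key is not None and page and page[0][0] == start_key:
            page = page[1:]
        for key, columns in page:
            yield key, columns
        # A short or empty page means there is nothing left to read.
        if fetched < page_size or not page:
            return
        start_key = page[-1][0]
```

Keeping the driver call behind `fetch_page` means the paging logic can be tested against an in-memory list before it is pointed at a real cluster.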

Re: Is there any way to fetch all data efficiently from a column family?

2013-01-29 Thread Michael Kjellman
is to purge the tombstones, another option is using the map/reduce job to do the purging things without major compactions. Best Rick. On Wed, Jan 30, 2013 at 1:15 PM, Michael Kjellman mkjell...@barracuda.com wrote: How often do you need to do this? How many rows

Re: Is there any way to fetch all data efficiently from a column family?

2013-01-29 Thread Michael Kjellman
from Datastax: http://www.datastax.com/docs/1.1/cluster_architecture/hadoop_integration From: Michael Kjellman mkjell...@barracuda.com Reply-To: user@cassandra.apache.org Date

Re: Inserting via thrift interface to column family created with Compound Key via cql3

2013-01-30 Thread Michael Kjellman
Are you using execute_cql3_query() ? On Jan 30, 2013, at 7:31 AM, Oleksandr Petrov oleksandr.pet...@gmail.com wrote: Hi, I'm creating a table via cql3 query like: CREATE TABLE posts ( userid text, blog_name text, entry_title text, posted_at text, PRIMARY KEY (userid,

Re: Inserting via thrift interface to column family created with Compound Key via cql3

2013-01-30 Thread Michael Kjellman
, Michael Kjellman mkjell...@barracuda.com wrote: Are you using execute_cql3_query() ? On Jan 30, 2013, at 7:31 AM, Oleksandr Petrov oleksandr.pet...@gmail.com wrote: Hi, I'm creating a table via cql3 query like: CREATE TABLE posts

Re: Inserting via thrift interface to column family created with Compound Key via cql3

2013-01-30 Thread Michael Kjellman
* allows to query everything that is greater than 3foo.getBytes(), but * not 3foo.getBytes() itself. */ Or am I missing the fact that you are inserting with cql3 as well? From: Michael Kjellman mkjell...@barracuda.com Reply-To: user@cassandra.apache.org

Re: Why do Datastax docs recommend Java 6?

2013-02-05 Thread Michael Kjellman
There have been tons of threads/convos on this. In the early days of Java 7 it was pretty unstable and there was pretty much no convincing reason to use Java 7 over Java 6. Now that Java 7 has stabilized and Java 6 is EOL it's a reasonable decision to use Java 7 and we do it in production with

Re: Netflix/Astynax Client for Cassandra

2013-02-06 Thread Michael Kjellman
It's a really great library and definitely recommended by me and many who are reading this. And if you are just starting out on 1.2.1 with C* you might also want to evaluate https://github.com/datastax/java-driver and the new binary protocol. Best, michael From: Cassa L

Re: UNSUBSCRIBE

2013-02-08 Thread Michael Kjellman
Sigh. Google please On Feb 8, 2013, at 11:00 AM, Rudy Jahchan r...@carbonfive.com wrote: -- Rudy Jahchan

Re: How to change the tokens?

2013-02-08 Thread Michael Kjellman
You need to move the node to the new token you would like. Take care to only do one move operation at a time in the ring and also do a repair -pr and cleanup after the move operation. See nodetool move for more information. Best, michael From: S C as...@outlook.com
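
The sequence described in this reply (move, then repair -pr, then cleanup, one node at a time) can be sketched roughly as below. This is only an illustration: the helper name `token_move_plan`, the host `10.0.0.1`, and the token value are made-up placeholders, and the function only builds the command lines rather than running them.

```python
import shlex
from typing import List


def token_move_plan(host: str, new_token: int) -> List[List[str]]:
    """Build the nodetool commands for moving ONE node, in the order to run them.

    Hypothetical helper for illustration only; it does not execute anything.
    """
    base = ["nodetool", "-h", host]
    return [
        base + ["move", str(new_token)],  # 1. move the node to the new token
        base + ["repair", "-pr"],         # 2. repair the node's primary range
        base + ["cleanup"],               # 3. drop data the node no longer owns
    ]


if __name__ == "__main__":
    # Placeholder host and token; substitute the real values for your ring.
    for cmd in token_move_plan("10.0.0.1", 56713727820156410577229101238628035242):
        print(shlex.join(cmd))
```

Finish all three steps on one node before starting the move on the next, since the reply advises doing only one move operation at a time in the ring.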

Re: How to change the tokens?

2013-02-08 Thread Michael Kjellman
Not sure why you would need to do that... And regardless you will need to move the data to change the tokens. On Feb 8, 2013, at 7:01 PM, S C as...@outlook.com wrote: I have a scenario where I have to swap the tokens of the two nodes in a three node cluster. Thanks, SC

Re: multiget_slice using CQL3

2013-02-14 Thread Michael Kjellman
I'm confused about what you are looking to do. CQL3 syntax (SELECT * FROM keyspace.cf WHERE user = 'cooldude') has nothing to do with thrift client calls (such as multiget_slice). What is your goal here? Best, michael On 2/14/13 5:57 PM, Drew Kutcharian d...@venarc.com wrote: Hi Guys, What's the

Re: Cassandra 1.20 with Cloudera Hadoop (CDH4) Compatibility Issue

2013-02-15 Thread Michael Kjellman
That bug is kinda wrong though. 1.0.x has been current for like a year now and C* works great with it :) On Feb 15, 2013, at 7:38 PM, Dave Brosius dbros...@mebigfatguy.com wrote: see https://issues.apache.org/jira/browse/CASSANDRA-5201 On 02/15/2013 10:05 PM, Yang

Re: Cassandra 1.20 with Cloudera Hadoop (CDH4) Compatibility Issue

2013-02-15 Thread Michael Kjellman
Sorry. I meant to say even though there *wasn't* a major change between 1.0.x and 0.22. The big change was 0.20 to 0.22. Sorry for the confusion. On Feb 15, 2013, at 9:53 PM, Michael Kjellman mkjell...@barracuda.com wrote: There were pretty big changes in Hadoop

Re: unsubscribe

2013-02-17 Thread Michael Kjellman
Please see the Mailing Lists section of the home page. http://cassandra.apache.org user-unsubscr...@cassandra.apache.org From: James Wong jwong...@gmail.com Reply-To: user@cassandra.apache.org

Re: Is C* common nickname for Cassandra?

2013-02-17 Thread Michael Kjellman
Why do you feel that link is unprofessional? Just wondering. I actually quite like the abbreviation personally. On Feb 17, 2013, at 1:37 PM, Boris Solovyov boris.solov...@gmail.com wrote: Thanks. I don't know if anyone cares my opinion, but as a newcomer to the

Re: Cassandra backup

2013-02-18 Thread Michael Kjellman
There is this: http://www.datastax.com/dev/blog/whats-new-in-cassandra-1-1-flexible-data-file-placement But you'll need to design your data model around the fact that this is only as granular as one column family. Best, michael From: Kanwar Sangha kan...@mavenir.com

Re: Long running nodetool repair

2013-02-19 Thread Michael Kjellman
This is very normal (unfortunately). Are you doing a repair -pr or a straight up repair? Does nodetool netstats show anything? I frequently see repair hang in 1.2.1, and I haven't been able to figure out why yet. Feel free to take a stack dump with jstack on the node doing the repair

Re: benchmark

2013-02-20 Thread Michael Kjellman
http://www.miraclelinux.com/jp/online-service/labs/pdf/zabbix-write-performance is a recent one that comes to mind. But that was just write performance. If you are really doing a case study you might want to do it yourself, in which case you can use the stress tool distributed with Cassandra
