Clarification on how multi-DC replication works

2014-02-10 Thread Sameer Farooqui
Hi, I was hoping someone could clarify a point about multi-DC replication. Let's say I have 2 data centers configured with replication factor = 3 in each DC. My client app is sitting in DC 1 and is able to intelligently pick a coordinator that will also be a replica partner. So the client app

Re: First SSTable file is not being compacted

2014-02-06 Thread Sameer Farooqui
). However, in OpsCenter the CF: SSTable Count (demodb.users) graph shows only one SSTable. The nodetool cfstats command also shows SSTable count: 1 for this CF. - SF On Thu, Feb 6, 2014 at 8:54 AM, Chris Burroughs chris.burrou...@gmail.com wrote: On 02/06/2014 01:17 AM, Sameer Farooqui wrote

First SSTable file is not being compacted

2014-02-05 Thread Sameer Farooqui
Hi, I'm running C* 2.0.4 and when I have a handful of SSTable files and trigger a manual compaction with 'nodetool compact', the first SSTable file doesn't get compacted away. Is there something special about the first SSTable that makes it remain even after a SizeTiered compaction? - SF

Looking for clarification on the gossip protocol... 3 random nodes every second?

2014-02-04 Thread Sameer Farooqui
Hi, I'm looking to get some clarification on how the gossip protocol works in Cassandra 2.0. Does a node contact 3 purely random nodes every second for gossip or is there more intelligence involved in how it selects the 3 nodes? *The Apache wiki on Cassandra states this:* Gossip timer task runs

Error during cleanup

2013-10-08 Thread Sameer Farooqui
Hi, When running cleanup on a node with C* 2.0.1, I got the following error: cassandra01 - Error during cleanup: javax.management.MBeanException: java.util.concurrent.ExecutionException: java.lang.ClassCastException: org.apache.cassandra.io.sstable.SSTableReader$EmptyCompactionScanner cannot be

How to determine which node(s) an insert would go to in C* 2.0 with vnodes?

2013-10-08 Thread Sameer Farooqui
Hi, When using C* 2.0 in a large 100 node cluster with the Murmur3Partitioner, vnodes and 256 tokens assigned to each node, is it possible to find out where a certain key is destined to go? If the keyspace defined has replication factor = 3, then a specific key like 'row-1' would be destined to go to 3

Re: Question about SizeTieredCompactionStrategy in C* 2.0: not all SSTables are being compacted

2013-10-08 Thread Sameer Farooqui
that are a similar size (by default, they basically need to be within 50% of each other). Perhaps your first SSTable was very large or small compared to the others? On Mon, Oct 7, 2013 at 8:06 PM, Sameer Farooqui sam...@blueplastic.com wrote: Hi, I have a fresh 1-node C* 2.0 install with a demo
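The "within 50% of each other" rule mentioned in the reply above can be illustrated with a minimal sketch. This is not Cassandra's actual implementation; the function names are hypothetical, and the parameter names and defaults (bucket_low=0.5, bucket_high=1.5, min_threshold=4) are assumptions based on the commonly documented SizeTieredCompactionStrategy options.

```python
def bucket_sstables(sizes, bucket_low=0.5, bucket_high=1.5):
    """Group SSTable sizes into buckets of roughly similar size.

    A size joins an existing bucket when it falls between bucket_low and
    bucket_high times that bucket's current average; otherwise it starts
    a new bucket. (Hypothetical helper, assumed defaults.)
    """
    buckets = []
    for size in sorted(sizes):
        for bucket in buckets:
            average = sum(bucket) / len(bucket)
            if average * bucket_low < size < average * bucket_high:
                bucket.append(size)
                break
        else:
            # No existing bucket is close enough in size.
            buckets.append([size])
    return buckets


def compaction_candidates(sizes, min_threshold=4):
    """Return only the buckets large enough to be compacted together."""
    return [b for b in bucket_sstables(sizes) if len(b) >= min_threshold]


if __name__ == "__main__":
    # One large first SSTable and several small flushed ones (sizes in MB).
    print(bucket_sstables([100, 10, 11, 12, 9]))
    print(compaction_candidates([100, 10, 11, 12, 9]))
```

Under these assumptions, a first SSTable that is much larger or smaller than the rest ends up alone in its bucket, which would be consistent with it being left out of automatic compaction.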

Re: Error during cleanup

2013-10-08 Thread Sameer Farooqui
at 2:46 PM, Tyler Hobbs ty...@datastax.com wrote: Do you have a complete stacktrace available? On Tue, Oct 8, 2013 at 2:08 AM, Sameer Farooqui sam...@blueplastic.com wrote: Hi, When running cleanup on a node with C* 2.0.1, I got the following error: cassandra01 - Error during cleanup

Re: Question about SizeTieredCompactionStrategy in C* 2.0: not all SSTables are being compacted

2013-10-08 Thread Sameer Farooqui
probably quite a bit smaller (~25% of the size). Once you have two more sstables of roughly that size, they should be compacted automatically. On Tue, Oct 8, 2013 at 2:01 PM, Sameer Farooqui sam...@blueplastic.com wrote: Thanks for the reply, Tyler. I thought that too.. that maybe the SSTables

Re: Question about info returned from 'nodetool gossipinfo'

2013-10-07 Thread Sameer Farooqui
to be using Murmur3Partitioner, which has negative tokens. -Ravi On Sunday, October 6, 2013 9:12 PM, Sameer Farooqui sam...@blueplastic.com wrote: Hi, Does anyone know what the information that comes from the nodetool gossipinfo command means? For example, what is the LOAD #'s meaning

Question about SizeTieredCompactionStrategy in C* 2.0: not all SSTables are being compacted

2013-10-07 Thread Sameer Farooqui
Hi, I have a fresh 1-node C* 2.0 install with a demo keyspace created with the SizeTiered compaction strategy. I've noticed that in the beginning this keyspace has just one SSTable: demodb-users-jb-1-Data.db But as I add more data to the table and do some flushes, the # of SSTables builds up.

Re: Question about info returned from 'nodetool gossipinfo'

2013-10-07 Thread Sameer Farooqui
A quick follow up question - If my node is configured with Virtual Nodes and 256 tokens are assigned to it, how does the STATUS line only show one token value? In this case the -12331...#. On Mon, Oct 7, 2013 at 9:00 PM, Sameer Farooqui sam...@blueplastic.com wrote: Thanks, Ravi! On Mon, Oct

Question about info returned from 'nodetool gossipinfo'

2013-10-06 Thread Sameer Farooqui
Hi, Does anyone know what the information that comes from the nodetool gossipinfo command means? For example, what is the LOAD #'s meaning or the SEVERITY #? My load is 91457.0 and SEVERITY: 22.448. Also, next to STATUS:NORMAL is a negative #: -123311655 what does that # mean?

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-26 Thread Sameer Farooqui
and the 646GB are just insanely high. Not sure why or how to troubleshoot. On Fri, Jul 22, 2011 at 1:28 PM, Sameer Farooqui cassandral...@gmail.com wrote: I don't see a JVM crashlog ( hs_err_pid[pid].log) in ~/brisk/resources/cassandra/bin or /tmp. So maybe JVM didn't crash? We're

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-25 Thread Sameer Farooqui
insanely high. Not sure why or how to troubleshoot. On Fri, Jul 22, 2011 at 1:28 PM, Sameer Farooqui cassandral...@gmail.com wrote: I don't see a JVM crashlog ( hs_err_pid[pid].log) in ~/brisk/resources/cassandra/bin or /tmp. So maybe JVM didn't crash? We're running a pretty up to date with Sun

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-22 Thread Sameer Farooqui
wrote: Did you check for a JVM crash log? You should make sure you're running the latest Sun JVM, older versions and OpenJDK in particular are prone to segfaulting. On Thu, Jul 21, 2011 at 6:53 PM, Sameer Farooqui cassandral...@gmail.com wrote: We are starting Cassandra with brisk cassandra

Repair fails with java.io.IOError: java.io.EOFException

2011-07-21 Thread Sameer Farooqui
While running Repair on a 0.8.1 node, we got this error in the system.log: ERROR [Thread-23] 2011-07-21 15:48:43,868 AbstractCassandraDaemon.java (line 113) Fatal exception in thread Thread[Thread-23,5,main] java.io.IOError: java.io.EOFException at

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-21 Thread Sameer Farooqui
Ellis jbel...@gmail.com wrote: Looks harmless to me. On Thu, Jul 21, 2011 at 1:41 PM, Sameer Farooqui cassandral...@gmail.com wrote: While running Repair on a 0.8.1 node, we got this error in the system.log: ERROR [Thread-23] 2011-07-21 15:48:43,868 AbstractCassandraDaemon.java (line

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-21 Thread Sameer Farooqui
2011, at 10:13, Sameer Farooqui wrote: Hmm. Just looked at the log more closely. So, what actually happened is while Repair was running on this specific node, the Cassandra java process terminated itself automatically. The last entries in the log are: INFO [ScheduledTasks:1] 2011-07-21 13:00

Re: Repair fails with java.io.IOError: java.io.EOFException

2011-07-21 Thread Sameer Farooqui
, at 10:50, Sameer Farooqui wrote: Hey Aaron, I don't have any output.log files in that folder: ubuntu@ip-10-2-x-x:~$ cd /var/log/cassandra ubuntu@ip-10-2-x-x:/var/log/cassandra$ ls system.log system.log.11 system.log.4 system.log.7 system.log.1 system.log.2 system.log.5

Re: Cassandra CLOUD . How its related

2011-07-20 Thread Sameer Farooqui
Are you talking about cloudsandra.com? Check out their website. Cassandra is a database. Cloud is just a fancy term for remote hosting. The two aren't really related. On Wed, Jul 20, 2011 at 3:19 AM, CASSANDRA learner cassandralear...@gmail.com wrote: Hi Guys, When we talk about cassandra,

Re: best example of indexing

2011-07-20 Thread Sameer Farooqui
More info: http://www.datastax.com/docs/0.8/data_model/secondary_indexes http://www.datastax.com/docs/0.8/data_model/cfs_as_indexes On Wed, Jul 20, 2011 at 10:49 AM, Konstantin Naryshkin konstant...@a-bb.net wrote: In the Cassandra CLI tutorial(

Re: Too many open files during Repair operation

2011-07-19 Thread Sameer Farooqui
I'm guessing you've seen this already? http://www.datastax.com/docs/0.8/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files Check out the # of File Descriptors opened with the lsof -n | grep java command. On Tue, Jul 19, 2011 at 8:30 AM, cbert...@libero.it

Re: Is there a way to read a Double value from the CLI?

2011-07-19 Thread Sameer Farooqui
Ellis jbel...@gmail.com wrote: org.apache.cassandra.db.marshal.DoubleType should work, then. the cast shouldn't be necessary if the doubletype is actually in the schema either. On Mon, Jul 18, 2011 at 4:34 PM, Sameer Farooqui cassandral...@gmail.com wrote: Yup, Jonathan, I was just

Re: Is anyone actually seriously using SimpleAuthenticator and SimpleAuthority?

2011-07-19 Thread Sameer Farooqui
We studied SimpleAuthority a few months back out of curiosity and took some notes on it to eventually use it in the future. Somebody getting started with this might find the following helpful... - - - - - - The following discusses ways to configure security best practices for a Cassandra cluster

Possible bug in Cassandra 0.8.1?

2011-07-18 Thread Sameer Farooqui
I'm running into a quirky issue with Brisk 1.0 Beta 2 (w/ Cassandra 0.8.1). I think the last node in our cluster is having problems (10.201.x.x). OpsCenter and nodetool ring (run from that node) show the node as down, but the rest of the cluster sees it as up. If I run nodetool ring from one of

Is there a way to read a Double value from the CLI?

2011-07-18 Thread Sameer Farooqui
I wrote some data to a standard column family (using Hector) with ASCII row-key, Long column name and Double column value. The CLI doesn't seem to have a double data type to interpret the column value for display. [default@DemoKS] get DemoCF[ascii('ABC_1020')][long(2009010100)] as

Re: Is there a way to read a Double value from the CLI?

2011-07-18 Thread Sameer Farooqui
: get X as LongType will give you the bytes in that column value interpreted as a long. So it did exactly what you asked it to do :) On Mon, Jul 18, 2011 at 4:16 PM, Sameer Farooqui cassandral...@gmail.com wrote: I wrote some data to a standard column family (using Hector) with ASCII row-key

Re: Brisk Unbalanced Ring

2011-07-18 Thread Sameer Farooqui
FYI - This manual reordering of the DCs and RACs might make it easier to see how the tokens are arranged. Pretty sure that the token ranges are picked correctly. Ignore the Owns column, b/c it is not multi-datacenter aware (so it thinks all of the nodes are in one ring as opposed to two (DC1

Re: Question about compaction

2011-07-14 Thread Sameer Farooqui
. In the current version, Cassandra deletes compacted SSTables at full GC and on startup. maki 2011/7/14 Sameer Farooqui cassandral...@gmail.com: Running Cassandra 0.8.1. Ran major compaction via: sudo /home/ubuntu/brisk/resources/cassandra/bin/nodetool -h localhost compact From what I'd read

Re: question on capacity planning

2011-07-14 Thread Sameer Farooqui
So, in our experience, the amount of storage overhead is much higher. If you plan on storing 120 TB of data, you should expect to store about 250 TB on disk once the data overhead is included. And then since you have to leave 50% of storage space free for compaction, you're looking at needing about

Data overhead discussion in Cassandra

2011-07-14 Thread Sameer Farooqui
We just set up a demo cluster with Cassandra 0.8.1 with 12 nodes and loaded 1.5 TB of data into it. However, the actual space on disk being used by data files in Cassandra is 3 TB. We're using a standard column family with a million rows (key=string) and 35,040 columns per key. The column name is

Question about compaction

2011-07-13 Thread Sameer Farooqui
Running Cassandra 0.8.1. Ran major compaction via: sudo /home/ubuntu/brisk/resources/cassandra/bin/nodetool -h localhost compact From what I'd read about Cassandra, I thought that after compaction all of the different SSTables on disk for a Column Family would be merged into one new file.

Re: advice for EC2 deployment

2011-06-23 Thread Sameer Farooqui
EC2Snitch doesn't currently support multi-Regions in Amazon. Tickets to track: https://issues.apache.org/jira/browse/CASSANDRA-2452 https://issues.apache.org/jira/browse/CASSANDRA-2491 Let us know if/how you get the OpenVPN connection to work across Regions. On Thu, Jun 23, 2011 at 6:29 AM,

Problem with PropertyFileSnitch in Amazon EC2

2011-06-20 Thread Sameer Farooqui
Hi, I'm setting up a 3 node test cluster in multiple Amazon Availability Zones to test cross-zone internode communication (and eventually cross-region communications). But I wanted to start with a cross-zone setup and am having trouble getting the nodes to connect to each other and join one

Re: Problem with PropertyFileSnitch in Amazon EC2

2011-06-20 Thread Sameer Farooqui
:20:09,692 HintedHandOffManager.java (line 274) schema for /10.198.x.x matches local schema DEBUG [HintedHandoff:1] 2011-06-21 00:20:09,692 HintedHandOffManager.java (line 288) Sleeping 11662ms to stagger hint delivery - Sameer On Mon, Jun 20, 2011 at 2:28 PM, Sameer Farooqui cassandral

Advice on configuring a Brisk cluster across regions in Amazon

2011-06-17 Thread Sameer Farooqui
Hi, I'd like to learn how to set up a Brisk cluster with HA/DR in Amazon. Last time I tried this a few months ago, it was tricky because we had to either set up a VPN or hack the Cassandra source to get internode communications to work across regions. But with v 0.8's new BriskSnitch or

When does it make sense to use TimeUUID?

2011-06-14 Thread Sameer Farooqui
I would like to store some timestamped user info in a Column Family with the usernames as the row key and different timestamps as column names. Each user might have a thousand timestamped data points. I understand that the version 1 UUIDs that Cassandra uses combine the MAC address of the computer generating
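As a rough illustration of the version 1 (time-based) UUIDs the question refers to, the following sketch uses Python's standard uuid module rather than a Cassandra client such as Hector, so it only shows the general structure: a timestamp plus a node identifier. The helper name uuid1_datetime is hypothetical.

```python
import uuid
from datetime import datetime, timedelta, timezone

# Version 1 UUID timestamps count 100-nanosecond intervals since this date.
GREGORIAN_EPOCH = datetime(1582, 10, 15, tzinfo=timezone.utc)


def uuid1_datetime(u):
    """Recover the creation time embedded in a version 1 UUID."""
    return GREGORIAN_EPOCH + timedelta(microseconds=u.time // 10)


if __name__ == "__main__":
    u = uuid.uuid1()
    print("uuid:     ", u)
    print("version:  ", u.version)
    print("node id:  ", hex(u.node))  # usually derived from a MAC address
    print("timestamp:", uuid1_datetime(u))
```

Two such UUIDs generated for the same user at the same instant on different machines would still differ in the node part, which is the usual argument for using them as column names instead of plain timestamps.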

Re: When does it make sense to use TimeUUID?

2011-06-14 Thread Sameer Farooqui
most-recent, or least-recent order. *From:* Sameer Farooqui [mailto:cassandral...@gmail.com] *Sent:* Tuesday, June 14, 2011 8:16 PM *To:* user@cassandra.apache.org *Subject:* When does it make sense to use TimeUUID? I would like to store some timestamped user info in a Column Family

A couple of links for people getting started with the NoSQL space

2011-06-01 Thread Sameer Farooqui
A $35 book was published today with a really excellent chapter describing the NoSQL ecosystem. If you’re interested in the NoSQL space, I highly recommend it (and you can read it for free from the links below): NoSQL Chapter: http://www.aosabook.org/en/nosql.html Rest of book:

Re: EC2 node adding trouble

2011-05-24 Thread Sameer Farooqui
What region and availability zones are the different nodes in? Are you using EC2 Snitch? Did you set up the cluster using the Datastax AMI? - Sameer On Tue, May 24, 2011 at 8:35 AM, Marcus Bointon mar...@synchromedia.co.uk wrote: Hi, First time here. I'm having trouble adding a third node to

Re: EC2 node adding trouble

2011-05-24 Thread Sameer Farooqui
...@synchromedia.co.uk wrote: On 24 May 2011, at 19:33, Sameer Farooqui wrote: What region and availability zones are the different nodes in? Are you using EC2 Snitch? Did you set up the cluster using the Datastax AMI? The two existing ones are in us-east-1c and us-east-1d, the new one

Re: How to configure internode encryption in 0.8.0?

2011-05-19 Thread Sameer Farooqui
advice on this. On May 17, 2011, at 7:47 PM, Sameer Farooqui wrote: Thanks for the link, Jeremy. I generated the keystore and truststore for inter-node communication using the link in the YAML file: http://download.oracle.com/javase/6/docs/technotes/guides/security/jsse

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-18 Thread Sameer Farooqui
from Jonathan confirming that statement :) Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 19/05/2011, at 1:19 AM, Ted Zlatanov t...@lifelogs.com wrote: On Tue, 17 May 2011 15:52:22 -0700 Sameer Farooqui cassandral

Snapshotting to a different volume?

2011-05-18 Thread Sameer Farooqui
As of 0.8.0, is it possible to take a Cassandra snapshot to a different volume (like an EBS volume dedicated for backups)? About a year ago, Jonathan Ellis said that this won't be implemented b/c snapshots are basically hard links:

Re: Snapshotting to a different volume?

2011-05-18 Thread Sameer Farooqui
watanabe.m...@gmail.com wrote: SSTables are immutable. Those won't change once written to disk. From iPhone On 2011/05/19, at 9:37, Sameer Farooqui cassandral...@gmail.com wrote: As of 0.8.0, is it possible to take a Cassandra snapshot to a different volume (like an EBS volume dedicated

Using Toad to access Cassandra

2011-05-18 Thread Sameer Farooqui
Has anybody heard of or used Toad to access Cassandra? http://www.quest.com/toad-for-cloud-databases/ They claim to: Toad® for Cloud Databases provides a SQL-based interface that makes it simple for you to generate queries, migrate, browse, and edit data, as well as create reports and tables in

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-17 Thread Sameer Farooqui
. - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 17 May 2011, at 10:59, Sameer Farooqui wrote: By the way, just noticed a typo in my email below. I'm using the correct keyspace name in all locations on the cluster... however in my

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-17 Thread Sameer Farooqui
On Tue, May 17, 2011 at 3:05 PM, Sameer Farooqui cassandral...@gmail.com wrote: Hey Aaron, Unfortunately it fails with plaintext password also: ubuntu@domU-12-31-39-0C-D9-13:~/apache-cassandra-0.8.0-beta1$ bin/cassandra-cli -h ec2-50-19-26-189.compute-1.amazonaws.com -p 9160 -u jdoe -pw 'nosql

Re: How to configure internode encryption in 0.8.0?

2011-05-17 Thread Sameer Farooqui
in your 0.8 download at the very bottom. There are docs and examples there. e.g. http://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.8.0-beta2/conf/cassandra.yaml On May 16, 2011, at 6:36 PM, Sameer Farooqui wrote: I understand that 0.8.0 has configurable internode encryption (CASSANDRA

Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
Hi all, We are trying to use MD5 encrypted passwords. Quick question first - Is SHA-2 supported yet? US-CERT of the U.S. Department of Homeland Security has said that MD5 “should be considered cryptographically broken and unsuitable for further use”, and SHA-2 family of hash functions is
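For readers unfamiliar with what an "MD5 encrypted" password entry looks like, here is a small sketch using Python's hashlib. It assumes the password file stores the hexadecimal digest of the password, which is my understanding of SimpleAuthenticator's MD5 mode and not something the thread confirms; the function names and the sample password are hypothetical. A SHA-256 digest (a member of the SHA-2 family) is shown only for comparison of output length.

```python
import hashlib


def md5_hex(password):
    """Hex MD5 digest of a password (assumed format for an MD5-mode entry)."""
    return hashlib.md5(password.encode("utf-8")).hexdigest()


def sha256_hex(password):
    """Hex SHA-256 digest, shown only to contrast with MD5."""
    return hashlib.sha256(password.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    sample = "example-password"  # hypothetical value, not from the thread
    print("md5:   ", md5_hex(sample))
    print("sha256:", sha256_hex(sample))
```

Note that MD5 here is a hash, not encryption: the digest cannot be decrypted, only compared against the digest of a submitted password.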

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
not the issue. - Sameer On Mon, May 16, 2011 at 3:55 PM, Sameer Farooqui cassandral...@gmail.com wrote: Hi all, We are trying to use MD5 encrypted passwords. Quick question first - Is SHA-2 supported yet? US-CERT of the U.S. Department of Homeland Security has said that MD5 should be considered

Re: Using composite column names in the CLI

2011-05-16 Thread Sameer Farooqui
Cassandra wouldn't know that the column name is composite of two different things. So you could just request the column names and values for a specific key like this and then just look at the column names that get returned: [default@MyKeyspace] get DemoCF[ascii('key_42')]; = (column=CA_SanJose,

How to configure internode encryption in 0.8.0?

2011-05-16 Thread Sameer Farooqui
I understand that 0.8.0 has configurable internode encryption (CASSANDRA-1567, 2152). I haven't been able to find any info on how to configure it though on this mailing list or the Datastax website. Can somebody point me towards how to set this up? - Sameer

Re: Hinted Handoff

2011-05-12 Thread Sameer Farooqui
I'm not sure about your first question. I believe the internal system keyspace holds the hinted handoff information. In 0.6 and earlier, HintedHandoffManager.sendMessage used to read the entire row into memory and then send the row back to the client in a single message. As of 0.7, Cassandra

Re: Crash when uploading large data sets

2011-05-12 Thread Sameer Farooqui
The key JVM options for Cassandra are in cassandra.in.sh. What is your min and max heap size? The default setting of max heap size is 1GB. How much RAM do your nodes have? You may want to increase this setting. You can also set the -Xmx and -Xms options to the same value to keep Java from having

Keyspace creation error on 0.8 beta2

2011-05-11 Thread Sameer Farooqui
When I run this from the Cassandra CMD-Line: create keyspace MyKeySpace with placement_strategy = 'org.apache.cassandra.locator.SimpleStrategy' and strategy_options = [{replication_factor:2}]; I get this error: Internal error processing system_add_keyspace My syntax is correct for creating the

Re: network topology issue

2011-05-11 Thread Sameer Farooqui
Anurag, The Cassandra ring spans datacenters, so you can't use token 0 on both nodes. Cassandra’s ring is from 0 to 2**127 in size. Try assigning one node the token of 0 and the second node 8.50705917 × 10^37 (input this as a single long number). To add a new keyspace in 0.8, run this from the
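The token arithmetic in the reply above can be checked with a short sketch. It assumes the 0 to 2**127 ring described in the message and spaces the tokens evenly; the helper name balanced_tokens is hypothetical.

```python
RING_SIZE = 2 ** 127  # ring size as described in the message above


def balanced_tokens(node_count):
    """Evenly spaced initial tokens for node_count nodes on the ring."""
    return [i * RING_SIZE // node_count for i in range(node_count)]


if __name__ == "__main__":
    for node, token in enumerate(balanced_tokens(2), start=1):
        # Print each token as a single long integer, ready to paste.
        print(f"node {node}: {token}")
```

For two nodes this yields 0 and 2**126, which is the full-precision form of the 8.50705917 × 10^37 value quoted in the message.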

Re: Keyspace creation error on 0.8 beta2

2011-05-11 Thread Sameer Farooqui
FYI - creating the keyspace with the syntax below works in beta1, just not beta2. jeromatron on the IRC channel commented that it looks like the java classpath is using the wrong library dependency for commons lang in beta2. - Sameer On Wed, May 11, 2011 at 4:09 PM, Sameer Farooqui cassandral

Re: network topology issue

2011-05-11 Thread Sameer Farooqui
...@gmail.com wrote: Thanks Sameer for your answer. I am using two DCs, DC1 and DC2, with both having one node each; my strategy_options values are DC1:1,DC2:1. I am not sure what my RF should be, should it be 1 or 2? Please Advise Thanks Anurag On Wed, May 11, 2011 at 5:27 PM, Sameer Farooqui

Re: installing cassandra on ec2 boxes

2011-05-10 Thread Sameer Farooqui
Hi Anurag, We're using an elastic IP for the seed address (public DNS name should also work) and using the private IP (10.255.x.x) for the listen address. As you're getting started, you may also find this blog that my team put together helpful:

Re: installing cassandra on ec2 boxes

2011-05-10 Thread Sameer Farooqui
Thanks Anurag On Tue, May 10, 2011 at 11:19 AM, Sameer Farooqui cassandral...@gmail.com wrote: Hi Anurag, We're using an elastic IP for the seed address (public DNS name should also work) and using the private IP (10.255.x.x) for the listen address. As you're getting started, you may also

How to build and use modified Cassandra src code?

2011-05-10 Thread Sameer Farooqui
I just edited the MessagingService.java and OutboundTcpConnection.java files in 0.8.0-beta2 and built it successfully using Ant (I just ran the ant command in the apache-cassandra-0.8.0-beta2-src directory). I need some help with how to deploy the newly built binaries to a new Cassandra cluster

EC2 Snitch

2011-05-10 Thread Sameer Farooqui
Has anybody successfully used EC2 Snitch for cross-region deployments on EC2? Brandon Williams has not recommended using this just yet, but I was curious if anybody is using it with 0.8.0. Also, the snitch just lets the cluster automatically discover what the different regions (aka data centers)

Re: Running Cassandra across different Amazon EC2 regions

2011-05-06 Thread Sameer Farooqui
vendor. - Sameer On Thu, May 5, 2011 at 10:48 PM, Sasha Dolgy sdo...@gmail.com wrote: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Re-IP-address-resolution-in-MultiDC-setup-EC2-VIP-td6306635.html On May 6, 2011 3:07 AM, Sameer Farooqui cassandral...@gmail.com wrote

Re: Decommissioning node is causing broken pipe error

2011-05-05 Thread Sameer Farooqui
Just wanted to update you guys that we turned on DEBUG level logging on the decommissioned node and the node receiving the decommissioned node's range. We did this by editing cassandra-home/conf/log4j-server.properties and changing the log4j.rootLogger to DEBUG. We ran decommission again and saw

Does anybody have experience with running Cassandra in Amazon's Virtual Private Cloud (VPC)?

2011-05-05 Thread Sameer Farooqui
Here is an image that shows what the Amazon VPC we're thinking about using looks like: http://i.imgur.com/OUe1i.png We would like to configure a 2 node Cassandra cluster in the private subnet and a read/write web application service in the public subnet. However, we also want to span the

Running Cassandra across different Amazon EC2 regions

2011-05-05 Thread Sameer Farooqui
We're trying to set up a Cassandra 0.8.0beta1 cluster across Amazon's East and West regions. It does not work out of the box with the binaries, and the nodes in different regions end up forming their own clusters. The problem is with Cassandra's Listening Address as described by Rui: Using external IP

How to find what node a key is on

2011-03-23 Thread Sameer Farooqui
Does anybody know if it's possible to find out what node a specific key/row lives on? We have a 30 node cluster and I'm curious how much faster it'll be to read data directly from the node that stores the data. We're using random partitioner, by the way. *Sameer Farooqui *Accenture Technology
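A rough way to reason about the question above is to hash the key the way the random partitioner is generally understood to (absolute value of the MD5 digest read as a signed integer) and then walk the token ring. The sketch below rests on that assumption plus a simple clockwise replica placement; the ring contents and function names are hypothetical, and it is not a substitute for asking the cluster itself.

```python
import bisect
import hashlib


def random_partitioner_token(key):
    """Token for a key: abs() of the MD5 digest as a signed big integer."""
    digest = hashlib.md5(key).digest()
    return abs(int.from_bytes(digest, "big", signed=True))


def replicas_for_token(token, ring, rf):
    """Walk the ring clockwise from the token and collect rf distinct nodes.

    ring is a list of (token, node) pairs sorted by token. A node owns the
    range ending at its token, so the first replica is the node with the
    smallest token >= the key's token, wrapping around at the end.
    """
    tokens = [t for t, _ in ring]
    wanted = min(rf, len({node for _, node in ring}))
    index = bisect.bisect_left(tokens, token)
    replicas = []
    while len(replicas) < wanted:
        node = ring[index % len(ring)][1]
        if node not in replicas:
            replicas.append(node)
        index += 1
    return replicas


if __name__ == "__main__":
    # Hypothetical three-node ring with evenly spaced tokens.
    ring = [(i * 2 ** 127 // 3, f"node{i + 1}") for i in range(3)]
    token = random_partitioner_token(b"row-1")
    print(token, replicas_for_token(token, ring, rf=2))
```

Whether reading from the owning node is noticeably faster depends on the extra network hop a non-replica coordinator adds, which this sketch does not measure.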

Re: How to find what node a key is on

2011-03-23 Thread Sameer Farooqui
. Aaron On 24 Mar 2011, at 11:53, Sameer Farooqui wrote: Does anybody know if it's possible to find out what node a specific key/row lives on? We have a 30 node cluster and I'm curious how much faster it'll be to read data directly from the node that stores the data. We're using random

How to use JConsole to connect to a Cassandra cluster in Amazon EC2?

2011-03-02 Thread Sameer Farooqui
I want to use JConsole to look at Cassandra's MBean attributes (like Memtable stats). But since my Cassandra nodes are running on Amazon EC2 instances, I have to use an ssh login with a ppk authentication file (when connecting via Putty, anyway). It looks like two popular approaches to

Re: Cassandra documentation

2011-02-14 Thread Sameer Farooqui
Here is a blog my team is working on at Accenture which is intended to be a complete beginner's guide to Cassandra. I'm still updating a few posts based on DataStax's recommendations and I need to add the last three posts (will get this done soon), but you can start checking it out via this link:

Re: Problems with Python Stress Test

2011-02-04 Thread Sameer Farooqui
...@gmail.com wrote: On Thu, Feb 3, 2011 at 7:02 PM, Sameer Farooqui cassandral...@gmail.com wrote: Hi guys, I was playing around with the stress.py test this week and noticed a few things. 1) Progress-interval does not always work correctly. I set it to 5 in the example below, but am

Problems with Python Stress Test

2011-02-03 Thread Sameer Farooqui
Hi guys, I was playing around with the stress.py test this week and noticed a few things. 1) Progress-interval does not always work correctly. I set it to 5 in the example below, but am instead getting varying intervals: *techlabs@cassandraN1:~/apache-cassandra-0.7.0-src/contrib/py_stress$