Re: Import/Export of Schema Migrations

2011-05-16 Thread David Boxenhorn
What you describe below sounds like what I want to do. I think that the only additional thing I am requesting is to export the migrations from the dev cluster (since Cassandra already has a table that saves them - I just want that information!) so I can import it to the other clusters. This would

Re: insert and batch_insert

2011-05-16 Thread aaron morton
batch_mutate() and insert() follow a similar execution path to a single insert in the server. It's not like putting multiple statements in a transaction in an RDBMS. Where they do differ is that you can provide multiple columns for a row in a column family, and these will be applied as

Re: Multiget_slice or composite column keys?

2011-05-16 Thread aaron morton
I'd stick with the RandomPartitioner until you have a really good reason to change :) I'd also go with your alternative design with some possible tweaks. Consider partitioning the rows by year or some other sensible value. If you will generally be getting the most recent data this can reduce
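A minimal sketch of the "partition the rows by year" suggestion above. The row key format `<entity>:<year>` and both function names are assumptions made for illustration; the thread does not specify a key layout.

```python
from datetime import datetime


def row_key_for(entity_id: str, timestamp: datetime) -> str:
    """Build a row key that buckets one entity's data by calendar year.

    The "<entity>:<year>" layout is a hypothetical convention, not from the thread.
    """
    return f"{entity_id}:{timestamp.year}"


def row_keys_for_range(entity_id: str, start: datetime, end: datetime) -> list:
    """List the row keys for every year from start to end, inclusive."""
    return [f"{entity_id}:{year}" for year in range(start.year, end.year + 1)]
```

With this layout, a query for recent data only needs the row for the current year, while a query spanning a year boundary reads one row per year, for example `row_keys_for_range("sensor42", datetime(2010, 12, 31), datetime(2011, 1, 1))`.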

Re: problems using property snitch file

2011-05-16 Thread aaron morton
Do all the lines have the same IP address or is that just how you redacted them? Is the single node in DC1 the one you are moving? Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 16 May 2011, at 12:28, Anurag Gujral wrote:

Re: Converting separate clusters in multiple dc to one cluster across multiple dcs

2011-05-16 Thread aaron morton
a) No, the tokens and the endpoints are for the entire cluster. You can only have one snitch per cluster, why do you want multiple ones? b) That happens if the endpoint / node is not listed in the topology, add the node to the topology first. created

Re: Multiget_slice or composite column keys?

2011-05-16 Thread Charles Blaxland
Thanks Aaron, very useful. I'll give some of your suggestions a go... On 16 May 2011 19:13, aaron morton aa...@thelastpickle.com wrote: I'd stick with the RandomPartitioner until you have a really good reason to change :) I'd also go with your alternative design with some possible tweaks.

AssertionError

2011-05-16 Thread Desimpel, Ignace
Environment : java 64 bit server, java client, thrift get_slice method, Cassandra 0.7.4, single node Depending on the data I pass for a query on a CF I get the following listed below. Any suggestions what could be wrong based on the stack trace? java.lang.AssertionError at

Concurrent Mark Sweep taking 12 seconds

2011-05-16 Thread Héctor Izquierdo Seliva
Hi everyone. I see in the logs that Concurrent Mark Sweep is taking 12 seconds to do its stuff. Is this normal? There is no stop-the-world GC, it just takes 12 seconds. Configuration: 0.7.5 , 8GB Heap, 16GB machines. 7 * 64 MB memtables.

Using composite column names in the CLI

2011-05-16 Thread David Boxenhorn
Is there a way to view composite column names in the CLI? Is there a way to input them (i.e. in the set command)?

Re: Concurrent Mark Sweep taking 12 seconds

2011-05-16 Thread Jonathan Ellis
Yes. 2011/5/16 Héctor Izquierdo Seliva izquie...@strands.com: Hi everyone. I see in the logs that Concurrent Mark Sweep is taking 12 seconds to do its stuff. Is this normal? There is no stop-the-world GC, it just takes 12 seconds. Configuration: 0.7.5 , 8GB Heap, 16GB machines. 7 * 64 MB

Re: problems using property snitch file

2011-05-16 Thread Anurag Gujral
All lines have different IP addresses. Yes, I am trying to move the single node in DC1. Thanks, Anurag On Mon, May 16, 2011 at 3:19 AM, aaron morton aa...@thelastpickle.com wrote: Do all the lines have the same IP address or is that just how you redacted them? Is the single node in DC1 the one you are

Re: insert and batch_insert

2011-05-16 Thread Xiaowei Wang
Thanks Aaron, really helpful! 2011/5/16 aaron morton aa...@thelastpickle.com batch_mutate() and insert() follow a similar execution path to a single insert in the server. It's not like putting multiple statements in a transaction in an RDBMS. Where they do differ is that you can provide

Cassandra and concurrent programming

2011-05-16 Thread Mark Kerzner
Hi, guys, what happens if I have two threads or two processes, both of which need to ask if some condition in Cassandra is fulfilled, and then, say, write the data based on that? If one receives the "no" answer and decides to write, but before he does, the other one receives the "no" answer and

Re: Cassandra and concurrent programming

2011-05-16 Thread Peter Schuller
In threading, you would do an atomic put if not present, is there such a thing in Cassandra? No. In general, one works to avoid the need for strong co-ordination. If strong co-ordination is truly required, some external method is required. Some people use ZooKeeper
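To illustrate the reply above, here is a hedged sketch of a check-then-write made safe by coordination outside the data store. `FakeStore` is an in-memory stand-in, not a Cassandra client, and the `threading.Lock` merely plays the role of the external coordinator (the thread mentions ZooKeeper); all names here are hypothetical.

```python
import threading


class FakeStore:
    """In-memory stand-in for a column family. This is NOT a Cassandra client."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        return self._data.get(key)

    def put(self, key, value):
        self._data[key] = value


def put_if_absent(store, lock, key, value):
    """Run the check and the write under one lock so no other writer can interleave.

    The lock stands in for an external coordination service; the store itself
    offers no atomic put-if-absent in this sketch.
    """
    with lock:
        if store.get(key) is None:
            store.put(key, value)
            return True
        return False


def run_demo(n_threads=8):
    """Start several writers racing for the same key; return the stored value and the winners."""
    store, lock = FakeStore(), threading.Lock()
    winners = []

    def worker(i):
        if put_if_absent(store, lock, "flag", f"writer-{i}"):
            winners.append(i)

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return store.get("flag"), winners
```

Without the lock, two writers could both read "absent" and both write, which is exactly the race described in the question. With it, exactly one writer wins, at the cost of a round trip to the coordinator on every conditional write.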

Re: Cassandra and concurrent programming

2011-05-16 Thread Mark Kerzner
Thank you for a quick answer - I could impress my colleagues at the meeting :) On Mon, May 16, 2011 at 2:54 PM, Peter Schuller peter.schul...@infidyne.com wrote: In threading, you would do an atomic put if not present, is there such a thing in Cassandra? No. In general, one works to

Re: AssertionError

2011-05-16 Thread Aaron Morton
The code is trying to follow the column index for a row in an sstable, but it cannot skip as many bytes as it would like in order to get to the column. Helpfully the help says running out of bytes is only one of the reasons why this could happen :) Can you provide some more information about the

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
Hi, Are you referring to the binary_memtable_throughput_in_mb which is a global parameter or the per col fam specific memtable_throughput_in_mb? The former is set to 256 and we don't override the default col fam specific value. Would just re-setting the global binary_memtable_throughput_in_mb to

Re: Commitlog Disk Full

2011-05-16 Thread mcasandra
You can try to update column family using cassandra-cli. Try to set memtable_throughput to 32 first. [default@unknown] help update column family; update column family Bar; update column family Bar with att1=value1; update column family Bar with att1=value1 and att2=value2...; Update a column
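As a rough sketch of applying the suggestion above to several column families, the helper below only generates cassandra-cli statements in the `update column family Bar with att1=value1;` form shown in the help text. The function name and the column family names passed to it are hypothetical; the statements would still need to be pasted into cassandra-cli by hand.

```python
def memtable_update_statements(column_families, throughput_mb=32):
    """Return one 'update column family' statement per column family.

    The statement shape follows the CLI help text quoted in the thread;
    memtable_throughput=32 is the value suggested there.
    """
    return [
        f"update column family {cf} with memtable_throughput={throughput_mb};"
        for cf in column_families
    ]


if __name__ == "__main__":
    # "Users" and "Events" are placeholder column family names.
    for statement in memtable_update_statements(["Users", "Events"]):
        print(statement)
```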

Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
Hi all, We are trying to use MD5 encrypted passwords. Quick question first - Is SHA-2 supported yet? US-CERT of the U.S. Department of Homeland Security has said that MD5 "should be considered cryptographically broken and unsuitable for further use", and SHA-2 family of hash functions is

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
By the way, just noticed a typo in my email below. I'm using the correct keyspace name in all locations on the cluster... however in my examples below, I used MyKeyspace in some spots and MDR in other spots, but in the cluster I'm specifying the same keyspace name everywhere, so that's not the

Re: Using composite column names in the CLI

2011-05-16 Thread Sameer Farooqui
Cassandra wouldn't know that the column name is composite of two different things. So you could just request the column names and values for a specific key like this and then just look at the column names that get returned: [default@MyKeyspace] get DemoCF[ascii('key_42')]; = (column=CA_SanJose,
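A small sketch of the approach described above, where the client (not Cassandra) interprets a column name such as `CA_SanJose` as two parts. The underscore separator is inferred from the example column name, and the function names and sample values are assumptions for illustration.

```python
def split_composite(column_name, separator="_"):
    """Split a manually concatenated column name into its component parts."""
    return tuple(column_name.split(separator))


def columns_with_prefix(columns, prefix, separator="_"):
    """Keep only the columns whose first component equals the given prefix.

    `columns` is a plain dict of column name -> value, as a client might
    build from the result of a get on one row key.
    """
    return {
        name: value
        for name, value in columns.items()
        if name.startswith(prefix + separator)
    }
```

For example, `split_composite("CA_SanJose")` yields `("CA", "SanJose")`. This only works if no component can itself contain the separator, which is a limitation of concatenating names by hand.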

How to configure internode encryption in 0.8.0?

2011-05-16 Thread Sameer Farooqui
I understand that 0.8.0 has configurable internode encryption (CASSANDRA-1567, 2152). I haven't been able to find any info on how to configure it though on this mailing list or the Datastax website. Can somebody point me towards how to set this up? - Sameer

Re: How to configure internode encryption in 0.8.0?

2011-05-16 Thread Jeremy Hanna
Take a look at cassandra.yaml in your 0.8 download at the very bottom. There are docs and examples there. e.g. http://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.8.0-beta2/conf/cassandra.yaml On May 16, 2011, at 6:36 PM, Sameer Farooqui wrote: I understand that 0.8.0 has configurable

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
Hey guys, I have updated all my column families with 32 as the memtable_throughput. I will let you know how cassandra behaves. Thanks! On Mon, May 16, 2011 at 3:52 PM, mcasandra mohitanch...@gmail.com wrote: You can try to update column family using cassandra-cli. Try to set

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
After I updated the memtable_throughput, I stopped all my writing processes. I did a du /commitlog to find how much was cassandra commitlog at that time. For the three nodes it was around 1.4G each. I waited for about 30 minutes to see whether cassandra flushes things. When I look at du now, it

Cassandra Hackathon?

2011-05-16 Thread Joseph Stein
Any interest for a Cassandra Hackathon evening in NYC? Any committer(s) going to be in the NYC area together that can lead/guide this? http://www.meetup.com/NYC-Cassandra-User-Group/events/18635801/ I have a thumbs up to use our office www.medialets.com in the Milk Studios building. It is a big

Re: Cassandra Hackathon?

2011-05-16 Thread Edward Capriolo
I had it on our list of ideas for the Cassandra NYC meetup. I am down for action. On Mon, May 16, 2011 at 9:40 PM, Joseph Stein crypt...@gmail.com wrote: Any interest for a Cassandra Hackathon evening in NYC? Any committer(s) going to be in the NYC area together that can lead/guide this?

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
It's now almost 4 hours. I still see commitlogs worth 1.2G on the machines. I see no activity. On Mon, May 16, 2011 at 6:33 PM, Sanjeev Kulkarni sanj...@locomatix.com wrote: After I updated the memtable_throughput, I stopped all my writing processes. I did a du /commitlog to find how much was