Re: Data tombstoned during bulk loading 1.2.10 - 2.0.3

2014-02-04 Thread olek.stas...@gmail.com
I don't know what is the real cause of my problem. We are still guessing. All operations I have done one cluster are described on timeline: 1.1.7- 1.2.10 - upgradesstable - 2.0.2 - normal operations -2.0.3 - normal operations - now normal operations means reads/writes/repairs. Could you please,

Maximum size and number of datafiles

2014-02-04 Thread Bonnet Jonathan .
Hello here, Is it possible to tell me if it possible to choose the maximum size for a datafile to prevent fs saturation. When cassandra choose to add a datafile ? Thanks 4 all your answears. Regards, Bonnet Jonathan.

Keyspace directory not getting created in 1 machine

2014-02-04 Thread Hari Rajendhran
Dear Team , I have a 3 node cassandra 1.1.12 opensource version installed in our lab.The db files for columnfamilies are getting created in 2 machines while in one of the machine the data directory is empty.I have tried with the following option nodetool  -h [IP address of the not working

Re: Keyspace directory not getting created in 1 machine

2014-02-04 Thread Duncan Sands
Hi Hari, On 04/02/14 10:38, Hari Rajendhran wrote: Dear Team , I have a 3 node cassandra 1.1.12 opensource version installed in our lab.The db files for columnfamilies are getting created in 2 machines while in one of the machine the data directory is empty.I have tried with the following

what tool will create noncql columnfamilies in cassandra 3a

2014-02-04 Thread Edward Capriolo
Cassandra 2.0.4 cli is informing me that it will no longer exist in the next major. How will users adjust the meta data of non cql column families and other cfs that do not fit into the cql model? -- Sorry this was sent from mobile. Will do less grammar and spell check than usual.

Re: Ultra wide row anti pattern

2014-02-04 Thread Edward Capriolo
I have actually been building something similar in my space time. You can hang around and wait for it or build your own. Here is the basics. Not perfect but it will work. Create column family queue with gc_grace_period=[1 day] set queue [timeuuid()] [z+timeuuid()] = [ work do do] The producer

Re: Ultra wide row anti pattern

2014-02-04 Thread Yogi Nerella
Sorry, I am not understanding the problem, and I am new to Cassandra, and want to understand this issue. Why do we need to use wide row for this situation, why not a simple table in cassandra? todolist (user, state) == is there any other information in this table which needs for processing

Re: Ultra wide row anti pattern

2014-02-04 Thread Edward Capriolo
Generally you need to make a wide row because the row keys in cassandra are ordered by their md5/murmer code. As a result you have no way of locating new rows, but if the row name is predictable the columns inside the row are ordered. On Tue, Feb 4, 2014 at 12:02 PM, Yogi Nerella

Re: Data tombstoned during bulk loading 1.2.10 - 2.0.3

2014-02-04 Thread Robert Coli
On Tue, Feb 4, 2014 at 12:21 AM, olek.stas...@gmail.com olek.stas...@gmail.com wrote: I don't know what is the real cause of my problem. We are still guessing. All operations I have done one cluster are described on timeline: 1.1.7- 1.2.10 - upgradesstable - 2.0.2 - normal operations -2.0.3

Re: Ultra wide row anti pattern

2014-02-04 Thread DuyHai Doan
Great idea for implementing queue pattern. Thank you Edward. However with your design there are still corner cases for 2 consumers to read from the same queue. Reading and writing with QUORUM does not prevent race conditions. I believe the new CAS feature of C* 2.0 might be useful here but with

Re: Ultra wide row anti pattern

2014-02-04 Thread Edward Capriolo
You could use another column of CAS as a management layer. You only have to consult it when picking up new rows. On Tue, Feb 4, 2014 at 3:45 PM, DuyHai Doan doanduy...@gmail.com wrote: Great idea for implementing queue pattern. Thank you Edward. However with your design there are still

Re: Cassandra 2.0 with Hadoop 2.x?

2014-02-04 Thread Cyril Scetbon
Hi, Look for posts from Thunder Stumpges in this mailing list. I know he has succeeded to make it Hadoop 2.x work with Cassandra 2.x For those who are interested in using it with Cassandra 1.2.13 you can use the patch

Re: Data tombstoned during bulk loading 1.2.10 - 2.0.3

2014-02-04 Thread olek.stas...@gmail.com
Seems good. I'll discus it with data owners and we choose the best method. Best regards, Aleksander 4 lut 2014 19:40 Robert Coli rc...@eventbrite.com napisał(a): On Tue, Feb 4, 2014 at 12:21 AM, olek.stas...@gmail.com olek.stas...@gmail.com wrote: I don't know what is the real cause of my

Question 1: JMX binding, Question 2: Logging

2014-02-04 Thread Kyle Crumpton (kcrumpto)
Hi all, I'm fairly new to Cassandra. I'm deploying it to a PaaS. One thing this entails is that it must be able to have more than one instance on a single node. I'm running into the problem that JMX binds to 0.0.0.0:7199. My question is this: Is there a way to configure this? I have actually

Re: Question 1: JMX binding, Question 2: Logging

2014-02-04 Thread srmore
Hello Kyle, For your first question, you need to create aliases to localhost e.g. 127.0.0.2,127.0.0.3 etc. this should get you going. About the logging issue, I think if your instance failing before it gets to long anything, as an example you can strart one instance and make sure it logs

Re: Question 1: JMX binding, Question 2: Logging

2014-02-04 Thread Andrey Ilinykh
JMX stuff is in /conf/cassandra-env.sh On Tue, Feb 4, 2014 at 2:25 PM, Kyle Crumpton (kcrumpto) kcrum...@cisco.com wrote: Hi all, I'm fairly new to Cassandra. I'm deploying it to a PaaS. One thing this entails is that it must be able to have more than one instance on a single node. I'm

Re: Cassandra 2.0 with Hadoop 2.x?

2014-02-04 Thread Thunder Stumpges
Hello Clint, Yes I was able to get it working after a bit of work. I have pushed the branch with the fix (which is currently quite a ways behind latest). You can compare to yours I suppose. Let me know if you have any questions.

Re: Lots of deletions results in death by GC

2014-02-04 Thread Robert Wille
I ran my test again, and Flush Writer¹s ³All time blocked² increased to 2 and then shortly thereafter GC went into its death spiral. I doubled memtable_flush_writers (to 2) and memtable_flush_queue_size (to 8) and tried again. This time, the table that always sat with Memtable data size = 0 now

Re: Lots of deletions results in death by GC

2014-02-04 Thread Benedict Elliott Smith
Is it possible you are generating *exclusively* deletes for this table? On 5 February 2014 00:10, Robert Wille rwi...@fold3.com wrote: I ran my test again, and Flush Writer's All time blocked increased to 2 and then shortly thereafter GC went into its death spiral. I doubled

Re: what tool will create noncql columnfamilies in cassandra 3a

2014-02-04 Thread Patricia Gorla
I am also curious as to how users will manage Thrift-based tables without the cli. PyCassaShell comes to mind, as does using Thrift-based clients. On Tue, Feb 4, 2014 at 9:53 AM, Edward Capriolo edlinuxg...@gmail.comwrote: Cassandra 2.0.4 cli is informing me that it will no longer exist in

Re: Lots of deletions results in death by GC

2014-02-04 Thread srmore
Sorry to hear that Robert, I ran into similar issue a while ago. I had an extremely heavy write and update load, as a result Cassandra (1.2.9) was constantly flushing to disk and used to GC, tried exactly the same steps you tried (tuning memtable_flush_writers (to 2) and memtable_flush_queue_size

Looking for clarification on the gossip protocol... 3 random nodes every second?

2014-02-04 Thread Sameer Farooqui
Hi, I'm looking to get some clarification on how the gossip protocol works in Cassandra 2.0. Does a node contact 3 purely random nodes every second for gossip or is there more intelligence involved in how it selects the 3 nodes? *The Apache wiki on Cassandra states this:* Gossip timer task runs