Re: Changing compression_parameters of existing CF

2012-11-27 Thread Шамим
Thank you, Aaron. It seems to be a bug in the CQL 2.*.* version; through cassandra-cli it works fine. I created issue CASSANDRA-4996. 27.11.2012, 11:34, aaron morton aa...@thelastpickle.com: Is it expected behaviour, or is it a bug? It exhibits bug-like qualities. Can you create a ticket on 

for a school project

2012-11-27 Thread davukovi
Hello, I have a question about codd's rules and cassandra cuz im doing a school project sith my colleagues. I know that cassandra is NOSQL type of dbms, but witch of 12 codd's rules can me applied? Thank you very much! :)

Re: for a school project

2012-11-27 Thread Tupshin Harper
Rules that apply:
2 - guaranteed access
3 - treatment of nulls (though different than an rdbms due to the inherent sparse nature of rows)
4 - online catalog (not really true until Cassandra 1.2 and CQL 3)
5 - comprehensive data sub language (only if you remove the word relational)
6 - view updating

Re: Other problem in update

2012-11-27 Thread Edward Capriolo
I am just taking a stab at this one. UUIDs interact with system time, and maybe your real-time OS is doing something funky there. The other option, which seems more likely, is that your unit tests are not cleaning up their data directory and there is some corrupt data in there. On Tue, Nov 27,

How to determine compaction bottlenecks

2012-11-27 Thread Derek Bromenshenkel
Setup: C* 1.1.6, 6 node (Linux, 64GB RAM, 16 Core CPU, 2x512 SSD), RF=3, 1.65TB total used Background: Client app is off - no reads/writes happening. Doing some cluster maintenance requiring node repairs and upgradesstables. I've been playing around with trying to figure out what is making

Re: Other problem in update

2012-11-27 Thread Tupshin Harper
Unless I'm misreading the git history, the stack trace you referenced isn't from 1.1.2. In particular, the writeHintForMutation method in StorageProxy.java wasn't added to the codebase until September 9th (

Java high-level client

2012-11-27 Thread Carsten Schnober
Hi, I'm aware that this has been a frequent question, but answers are still hard to find: what's an appropriate Java high-level client? I actually believe that the lack of a single maintained Java API that is packaged with Cassandra is quite an issue. The way the situation is right now, new users

Re: Java high-level client

2012-11-27 Thread Peter Lin
I use hector-client master, which is pretty stable right now. It uses the latest thrift, so you can use hector with thrift 0.9.0. That's assuming you don't mind using the active development branch. On Tue, Nov 27, 2012 at 10:36 AM, Carsten Schnober schno...@ids-mannheim.de wrote: Hi, I'm

Re: Java high-level client

2012-11-27 Thread Hiller, Dean
I am biased, of course, but you can find plenty of documentation on playorm here: https://github.com/deanhiller/playorm/wiki It uses astyanax because of the better node knowledge. Also feel free to post questions on Stack Overflow, as we heavily monitor Stack Overflow and are notified every hour of

Re: Hector (was: Java high-level client)

2012-11-27 Thread Carsten Schnober
On 27.11.2012 16:40, Peter Lin wrote: Hi Peter, thanks for your prompt reply! I use hector-client master, which is pretty stable right now. Please excuse my ignorance, but just to be sure I'd like to ask: does hector-client master differ from the Hector client linked from the Cassandra Wiki

Re: Hector (was: Java high-level client)

2012-11-27 Thread Peter Lin
I could be wrong, but the most recent release is against Cassandra 1.0.x; master tracks Cassandra 1.1.x. I've contributed a few patches related to CQL3 over the last few weeks, and master seems stable to me. For the record, I don't work for DataStax, so it's just my opinion. I needed the

Re: Java high-level client

2012-11-27 Thread Michael Kjellman
Netflix has a great client https://github.com/Netflix/astyanax On 11/27/12 7:40 AM, Peter Lin wool...@gmail.com wrote: I use hector-client master, which is pretty stable right now. It uses the latest thrift, so you can use hector with thrift 0.9.0. That's assuming you don't mind using the

Re: Java high-level client

2012-11-27 Thread Vivek Mishra
So I've had a look at Kundera and at Easy-Cassandra. Both seem to lack real documentation beyond the examples available in their Github repositories, right? Vivek: Could you please share exactly what documentation you are looking for that is not present? I suggest you join

Re: for a school project

2012-11-27 Thread Michael Kjellman
*because *which *be *with Please see: http://en.wikipedia.org/wiki/Codd's_12_rules http://wiki.apache.org/cassandra/DataModel That should get you going for your school report. If you have more specific questions about terms on the wiki please feel free to ask. On 11/27/12 4:02 AM,

counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
Hi, I have a serious problem with counters performance and I can't seem to figure it out. Basically I'm building a system for accumulating some statistics on the fly via Cassandra distributed counters. For this I need counter updates to work really fast and herein lies my problem -- as soon as I

Re: Java high-level client

2012-11-27 Thread Edward Capriolo
Hector does not require an outdated version of thrift; you are likely using an outdated version of hector. Here is the long and short of it: If the thrift API changes then hector can have compatibility issues. This happens from time to time. The main methods like get() and insert() have

Re: counters + replication = awful performance?

2012-11-27 Thread Juan Valencia
Hi Sergey, I know I've had similar issues with counters which were bottle-necked by network throughput. You might be seeing a problem with throughput between the clients and Cass or between the two Cass nodes. It might not be your case, but that was what happened to me :-) Juan On Tue, Nov

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
Hi Juan, thanks for your input! In my case, however, I doubt this is the case -- clients are able to push many more updates than I need to saturate replication_factor=2 case (e.g. I'm doing as many as 6x more increments when testing 2-node cluster with replication_factor=1), so bandwidth between

Re: counters + replication = awful performance?

2012-11-27 Thread Michael Kjellman
Are you writing with QUORUM consistency or ONE? On 11/27/12 9:52 AM, Sergey Olefir solf.li...@gmail.com wrote: Hi Juan, thanks for your input! In my case, however, I doubt this is the case -- clients are able to push many more updates than I need to saturate replication_factor=2 case (e.g. I'm

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
I'm using ONE like this (Hector): ConfigurableConsistencyLevel consistencyLevel = new ConfigurableConsistencyLevel(); consistencyLevel.setDefaultReadConsistencyLevel(HConsistencyLevel.ONE); consistencyLevel.setDefaultWriteConsistencyLevel(HConsistencyLevel.ONE); -- View this message in

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
I'm using ONE like this (Java, Hector): ConfigurableConsistencyLevel consistencyLevel = new ConfigurableConsistencyLevel(); consistencyLevel.setDefaultReadConsistencyLevel(HConsistencyLevel.ONE); consistencyLevel.setDefaultWriteConsistencyLevel(HConsistencyLevel.ONE); Michael Kjellman wrote Are you

Re: Java high-level client

2012-11-27 Thread Wei Zhu
FYI, we are using Hector 1.0-5, which comes with cassandra-thrift 1.09 - libthrift 0.6.1. It can work with Cassandra 1.1.6. Totally agree it's a pain to deal with different versions of libthrift. We use scribe for logging, and it's a bit messy over there. Thanks. -Wei

Re: Pagination

2012-11-27 Thread Derek Williams
Do you really require page numbers? I usually find them annoying while paging through a forum, especially if it is quite active. Threads from the bottom of the page get bumped to the next page so you end up seeing the same content again. I much prefer the first page being the current N results,
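The snippet above is cut off, so the following is only a rough sketch of one way to page without page numbers, assuming the idea is to ask for "the next N items older than the last one I saw". The class `CursorPaging`, the method `pageBefore`, and the in-memory `TreeMap` standing in for a time-ordered row are all hypothetical illustrations, not anything from the thread or from a Cassandra client API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;

public class CursorPaging {
    // Returns up to pageSize timestamps strictly older than `cursor`, newest first.
    // Pass Long.MAX_VALUE as the cursor to get the first page (assumes no entry
    // uses Long.MAX_VALUE itself as a timestamp).
    public static List<Long> pageBefore(NavigableMap<Long, String> threads, long cursor, int pageSize) {
        List<Long> page = new ArrayList<>();
        for (Long timestamp : threads.headMap(cursor, false).descendingKeySet()) {
            if (page.size() == pageSize) {
                break;
            }
            page.add(timestamp);
        }
        return page;
    }

    public static void main(String[] args) {
        // Hypothetical data: keys are last-activity timestamps, values are thread titles.
        NavigableMap<Long, String> threads = new TreeMap<>();
        for (long t = 1; t <= 5; t++) {
            threads.put(t, "thread-" + t);
        }
        List<Long> first = pageBefore(threads, Long.MAX_VALUE, 2);
        System.out.println(first); // [5, 4]
        // The next page starts from the last item already shown, not from an offset,
        // so newly bumped threads cannot push already-seen items onto it.
        List<Long> second = pageBefore(threads, first.get(first.size() - 1), 2);
        System.out.println(second); // [3, 2]
    }
}
```

Because each request is anchored to the last item seen rather than to a page offset, the reader should not see the same content twice when new activity arrives, at the cost of losing direct links to "page 7".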

RE: Pagination

2012-11-27 Thread Sam Hodgson
Well, I know what you mean and I have been doing that; however, I'm currently migrating an old MySQL site onto Cassandra and just trying to keep things consistent on the front end for the guy. I thought I might be missing a trick, but if not then yeah, I may well ditch the page linkage if it starts

Re: Generic questions over Cassandra 1.1/1.2

2012-11-27 Thread Bill de hÓra
I'm not sure I always understand what people mean by schema less exactly and I'm curious. For 'schema less', given this - {{{ cqlsh use example; cqlsh:example CREATE TABLE users ( ... user_name varchar, ... password varchar, ... gender varchar, ... session_token varchar, ... state

Re: counters + replication = awful performance?

2012-11-27 Thread Edward Capriolo
The difference between replication factor = 1 and replication factor > 1 is significant. Also, it sounds like your cluster has 2 nodes, so going from RF=1 to RF=2 means double the load on both nodes. You may want to experiment with the very dangerous column family attribute: - replicate_on_write:

Re: selective replication of keyspaces

2012-11-27 Thread Edward Capriolo
You can do something like this: Divide your nodes up into 4 datacenters art1,art2,art3,core [default@unknown] create keyspace art1 placement_strategy = 'org.apache.cassandra.locator.NetworkTopologyStrategy' and strategy_options=[{art1:2,core:2}]; [default@unknown] create keyspace art2

Re: counters + replication = awful performance?

2012-11-27 Thread Scott McKay
We're having a similar performance problem. Setting 'replicate_on_write: false' fixes the performance issue in our tests. How dangerous is it? What exactly could go wrong? On 12-11-27 01:44 PM, Edward Capriolo wrote: The difference between Replication factor =1 and replication factor > 1 is

Re: counters + replication = awful performance?

2012-11-27 Thread Edward Capriolo
I misspoke, really. It is not dangerous; you just have to understand what it means. This JIRA discusses it: https://issues.apache.org/jira/browse/CASSANDRA-3868 On Tue, Nov 27, 2012 at 6:13 PM, Scott McKay sco...@mailchannels.com wrote: We're having a similar performance problem. Setting

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
Hi, thanks for your suggestions. Regarding replicate=2 vs replicate=1 performance: I expected the configurations below to have similar performance: - single node, replicate = 1 - two nodes, replicate = 2 (okay, this probably should be a bit slower due to additional overhead). However what I'm

Re: counters + replication = awful performance?

2012-11-27 Thread Edward Capriolo
Cassandra's counters read on increment. Additionally, they are distributed, so there can be multiple reads per increment. If they are not fast enough and you have exhausted all tuning options, add more servers to handle the load. In many cases incrementing the same counter n times can be avoided.
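As a minimal sketch of the last point above, increments to the same counter can be summed in memory and flushed as one delta per counter. The class `CounterBuffer` and its methods are hypothetical names for illustration; the actual write to Cassandra is deliberately left out, and the buffer as written is not thread-safe.

```java
import java.util.HashMap;
import java.util.Map;

public class CounterBuffer {
    // Pending deltas keyed by counter name; not thread-safe as written.
    private final Map<String, Long> pending = new HashMap<>();

    // Record an increment locally instead of sending it straight to the cluster.
    public void increment(String counterKey, long delta) {
        pending.merge(counterKey, delta, Long::sum);
    }

    // Return the accumulated deltas and reset the buffer. The caller would then
    // issue one counter update per entry (the client call is omitted here).
    public Map<String, Long> drain() {
        Map<String, Long> snapshot = new HashMap<>(pending);
        pending.clear();
        return snapshot;
    }

    public static void main(String[] args) {
        CounterBuffer buffer = new CounterBuffer();
        buffer.increment("page:home", 1);
        buffer.increment("page:home", 1);
        buffer.increment("page:about", 5);
        // Three increments collapse into two counter updates.
        System.out.println(buffer.drain());
    }
}
```

The trade-off is that increments still held in the buffer are lost if the process dies before a flush, so the flush interval bounds how much can go missing.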

Re: counters + replication = awful performance?

2012-11-27 Thread Edward Capriolo
By the way the other issues you are seeing with replicate on write at false could be because you did not repair. You should do that when changing rf. On Tuesday, November 27, 2012, Edward Capriolo edlinuxg...@gmail.com wrote: Cassandra's counters read on increment. Additionally they are

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
I already do a lot of in-memory aggregation before writing to Cassandra. The question here is what is wrong with Cassandra (or its configuration) that causes a huge performance drop when moving from 1-replication to 2-replication for counters -- and, more importantly, how to resolve the problem.

Re: selective replication of keyspaces

2012-11-27 Thread jer...@simpleartmarketing.com
Thank you. This is a good start; I was beginning to think it couldn't be done. When I run the command I get the error: syntax error at position 21: missing EOF at 'placement_strategy'. That is probably because I still need to set the correct properties in the conf files. On November 27, 2012

Re: counters + replication = awful performance?

2012-11-27 Thread Edward Capriolo
Say you are doing 100 inserts at RF=1 on two nodes. That is 50 inserts a node. If you go to RF=2, that is 100 inserts a node. If you were at 75% capacity on each node, you're now at 150%, which is not possible, so things bog down. To figure out what is going on we would need to see tpstats, iostat, and
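The arithmetic above can be written out as a small sketch. It assumes writes are spread evenly across nodes and that load scales linearly with the number of replica writes, which is a simplification (counter writes also involve reads). The class and method names are hypothetical.

```java
public class ReplicaLoad {
    // Average writes each node handles when every client insert is stored on `rf` replicas.
    public static double writesPerNode(int clientInserts, int nodes, int rf) {
        return (double) clientInserts * rf / nodes;
    }

    // Projected utilisation after changing RF, assuming load scales linearly with replica writes.
    public static double scaledUtilisation(double currentUtilisation, int oldRf, int newRf) {
        return currentUtilisation * newRf / oldRf;
    }

    public static void main(String[] args) {
        System.out.println(writesPerNode(100, 2, 1));      // 50.0
        System.out.println(writesPerNode(100, 2, 2));      // 100.0
        System.out.println(scaledUtilisation(0.75, 1, 2)); // 1.5, i.e. 150% of capacity
    }
}
```

Any result above 1.0 from `scaledUtilisation` means the nodes are being asked for more than they can deliver, which matches the "things bog down" description.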

Re: selective replication of keyspaces

2012-11-27 Thread Edward Capriolo
My mistake, that is older cli syntax. I was just showing the concept: set up 4 datacenters and selectively replicate keyspaces between them. On Tuesday, November 27, 2012, jer...@simpleartmarketing.com jer...@simpleartmarketing.com wrote: Thank you. This is a good start I was beginning to think

Re: Generic questions over Cassandra 1.1/1.2

2012-11-27 Thread Edward Capriolo
@Bill Are you saying that now cassandra is less schema less ? :) Compact storage is the schemaless of old. On Tuesday, November 27, 2012, Bill de hÓra b...@dehora.net wrote: I'm not sure I always understand what people mean by schema less exactly and I'm curious. For 'schema less', given

Re: Upgrade

2012-11-27 Thread aaron morton
Do you have the error stack ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 28/11/2012, at 12:28 AM, Everton Lima peitin.inu...@gmail.com wrote: Hello people. I was using cassandra 1.1.6 and use the Object

Re: How to determine compaction bottlenecks

2012-11-27 Thread aaron morton
I've been playing around with trying to figure out what is making compactions run so slow. Is this regular compaction or table upgrades ? I *think* upgrade tables is single threaded. Do you have some compaction logs lines that say Compacted to…? It's handy to see the throughput and the

Re: Frame size exceptions occurring with ColumnFamilyInputFormat for very large rows

2012-11-27 Thread Marko Rodriguez
Hello, I was wondering if anyone had an answer to my previous message below. Seems another is having the same problem, but unfortunately with no response as well. http://mail-archives.apache.org/mod_mbox/cassandra-user/201211.mbox/%3c509a4a1f.8070...@semantico.com%3E Any help

Re: Frame size exceptions occurring with ColumnFamilyInputFormat for very large rows

2012-11-27 Thread Marko Rodriguez
Hi, Even when setting it to 32m in cassandra.yaml (and restarting Cassandra), the same problem emerges -- it's as if Cassandra doesn't register the update (it's always locked at 16mb). And I know that Cassandra is reading the property from cassandra.yaml because if I do -1, it complains saying

Re: need some help with row cache

2012-11-27 Thread Bryan Talbot
On Tue, Nov 27, 2012 at 8:16 PM, Yiming Sun yiming@gmail.com wrote: Hello, but it is not clear to me where this setting belongs to, because even in the v1.1.6 conf/cassandra.yaml, there is no such property, and apparently adding this property to the yaml causes a fatal configuration

Re: Hive on Cassandra : issues in setup

2012-11-27 Thread Naveen Reddy
Can someone please help me with this or share your experiences if you have tried this before please? From: Naveen Reddy naveen_2...@yahoo.co.in To: user@cassandra.apache.org user@cassandra.apache.org Sent: Monday, 26 November 2012 4:24 PM Subject: Hive on

Re: need some help with row cache

2012-11-27 Thread Yiming Sun
Hi Bryan, Thank you very much for this information. So in other words, the settings such as row_cache_size_in_mb in YAML alone are not enough, and I must also specify the caching attribute on a per column family basis? -- Y. On Tue, Nov 27, 2012 at 11:57 PM, Bryan Talbot

Re: need some help with row cache

2012-11-27 Thread Yiming Sun
Also, what command can I use to see the caching setting? DESC TABLE cf doesn't list caching at all. Thanks. -- Y. On Wed, Nov 28, 2012 at 12:15 AM, Yiming Sun yiming@gmail.com wrote: Hi Bryan, Thank you very much for this information. So in other words, the settings such as

Re: counters + replication = awful performance?

2012-11-27 Thread Sergey Olefir
I think there might be a misunderstanding as to the nature of the problem. Say, I have test set T. And I have two identical servers A and B. - I tested that server A (singly) is able to handle load of T. - I tested that server B (singly) is able to handle load of T. - I then join A and B in the

Re: need some help with row cache

2012-11-27 Thread Wz1975
Use cassandra-cli. Thanks. -Wei Sent from my Samsung smartphone on ATT Original message Subject: Re: need some help with row cache From: Yiming Sun yiming@gmail.com To: user@cassandra.apache.org CC: Also, what command can I used to see the caching setting? DESC

Always see strange error in logs on cassandra 1.1.6

2012-11-27 Thread Михаил Колесников
Hi. After updating Cassandra from 1.1.5 to 1.1.6, every schema update ends with a strange exception in system.log, and I must restart the nodes in the cluster that `describe cluster` reports as unreachable. Neither `nodetool repair` nor `nodetool upgradesstables` helps. Only