Re: svn commit: r756155

2009-03-20 Thread Jonathan Ellis
writeAtMost. As far as I can see, the rest of the changes are comment deletions and indentation changes, right? On Fri, Mar 20, 2009 at 7:27 AM, Jonathan Ellis jbel...@gmail.com wrote: it's actually not just indentation, it's brace placement -- from        while (len 0)        {            int n

OPHF

2009-03-30 Thread Jonathan Ellis
Avinash, You mentioned that you have a new order-preserving hash function that you think will be more generally useful. Can you post it? thanks, -Jonathan

Re: Per-Namespace / Per-Table Partitioner

2009-03-30 Thread Jonathan Ellis
But I do think there is nothing wrong with partitioner-per-namespace. It should be straightfoward to implement (once we have real namespace support to begin with) and it might be interesting for some apps to have that ability. -Jonathan On Mon, Mar 30, 2009 at 1:26 PM, Jonathan Ellis jbel

Re: handling deletes

2009-03-31 Thread Jonathan ellis
:32 PM, Jeremy Dunck jdu...@gmail.com wrote: On Mon, Mar 30, 2009 at 6:53 PM, Jonathan Ellis jbel...@gmail.com wrote: ... that's a really huge if, since (a) you'd add a whole new layer of complexity, and (b) it's not going to magically take any less space outside than inside. Ah, hmm. I

Re: OPHF

2009-04-01 Thread Jonathan Ellis
they are completely tested too in a distributed setting. Over here we do not have detailed test code, but we test by directing a % of the site traffic to a test cluster before we sign off on anything. Avinash On Wed, Apr 1, 2009 at 3:00 PM, Jonathan Ellis jbel...@gmail.com wrote: Say for instance

Roadmap

2009-04-02 Thread Jonathan Ellis
Someone asked on IRC if there is a roadmap for Cassandra. This is a good discussion to have. :) Personally my priority list looks like this: High priority: 1. range queries [which requires the partitioner changes we've been discussing] 2. make cassandra not allow itself to run out of memory

Re: Random Checkins

2009-04-02 Thread Jonathan Ellis
On Thu, Apr 2, 2009 at 6:37 PM, Avinash Lakshman avinash.laksh...@gmail.com wrote: Another issue with ConcurrentHashMap is that the dude is a memory hog. We got rid of it over a year ago because of the very same reason. Re CHM: what do you suggest instead? I assume you are talking about for

Re: working together

2009-04-08 Thread Jonathan Ellis
On Wed, Apr 8, 2009 at 6:26 PM, Sandeep Tata sandeep.t...@gmail.com wrote: I think it is reasonable that a codebase that has evolved for over two years has significant opportunity for refactoring when it is opened to a host of new developers. That said, large scale refactoring *at this stage*

Re: working together

2009-04-09 Thread Jonathan Ellis
and impressive. But the refactoring is still necessary because it would bring the code quality to a higher layer. But we should take it more seriously and more cautious, should we? best regards, hanzhu On Thu, Apr 9, 2009 at 9:38 AM, Jonathan Ellis jbel...@gmail.com wrote: On Wed, Apr 8, 2009

Re: Development process (was: working together)

2009-04-09 Thread Jonathan Ellis
+1 On Thu, Apr 9, 2009 at 10:49 AM, Johan Oskarsson jo...@oskarsson.nu wrote: Thanks Sandeep. Would we all be comfortable adopting this process going forward, hopefully reducing friction, bugs and problems in general? I assume +1 from me and Sandeep so far. /Johan Sandeep Tata wrote:

Re: [jira] Commented: (CASSANDRA-68) Bloom filters have much higher false-positive rate than expected

2009-04-11 Thread Jonathan Ellis
On Sat, Apr 11, 2009 at 9:53 PM, Prashant Malik pma...@gmail.com wrote: The results are a bit counter intuitive here I would have expected it to be faster with the same FP rate but   I am not sure why it is slower if you are just using a couple of hash functions and using double hashing.

Re: Roadmap

2009-04-16 Thread Jonathan ellis
On Thu, Apr 16, 2009 at 12:02 AM, Jonathan Ellis jbel...@gmail.com wrote: I went all Enterprise on our jira and assigned issues to version 0.3 that I'd like to get done in the relatively near future for our first official release. The list of issues is here: https://issues.apache.org/jira/secure

Row vs CF

2009-04-22 Thread Jonathan Ellis
In a bunch of places in the code we wrap a CF in a Row object, basically a key + multiple CFs. But currently only a single ColumnFamily will ever be in a Row object. (At least in the Rows involved in a client read op. Maybe Rows are used internally in other places with multiple CFs. But I am

moved source to src/java

2009-04-29 Thread Jonathan Ellis
Another heads up: I applied Johan's patches moving the human-editable source to src/java and the antlr-generated stuff to src/gen-java. -Jonathan

Re: Web site

2009-05-01 Thread Jonathan Ellis
I've done my bikeshedding. +1 :) -Jonathan On Fri, May 1, 2009 at 3:35 PM, Eric Evans eev...@rackspace.com wrote: The current project site (http://incubator.apache.org/cassandra) redirects to the wiki, which IMO leaves a lot to be desired. With the upcoming release, there are bound to be a

last api change for 0.3

2009-05-05 Thread Jonathan Ellis
I committed the patch for CASSANDRA-131 which (a) enables exception throwing on the insert methods (so you don't have to explicitly check return value to see if something worked), and (b) moves the _blocking method as a flag into the nonblocking ones. so instead of insert_blocking use insert with

Re: Project Wiki

2009-05-06 Thread Jonathan Ellis
I like the Moin look better. On Wed, May 6, 2009 at 4:11 PM, Eric Evans eev...@rackspace.com wrote: I'd like to draw some attention to https://issues.apache.org/jira/browse/CASSANDRA-145 which says: I believe the project would be better served by Apache's moinmoin wiki. Here's why:  *

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
On Thu, May 14, 2009 at 5:44 AM, Bertrand Delacretaz bdelacre...@apache.org wrote: Having it under http://incubator.apache.org/cassandra/releases/ clearly crosses that line. Would downloads/ be better? -Jonathan

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
On Thu, May 14, 2009 at 9:39 AM, Matthieu Riou matthieu.r...@gmail.com wrote:  On Thu, May 14, 2009 at 3:48 AM, Jonathan Ellis jbel...@gmail.com wrote: Do not include any links on the project website that might encourage non-developers to download and use nightly builds, snapshots, release

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
On Thu, May 14, 2009 at 9:45 AM, Matthieu Riou matthieu.r...@gmail.com wrote: Not to mention those who are in another timezone or those who would like to contribute on their spare time (like a day here and there). If the real development happens outside the ML, you won't even know those people

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
On Thu, May 14, 2009 at 9:48 AM, Matthieu Riou matthieu.r...@gmail.com wrote: On Thu, May 14, 2009 at 7:44 AM, Jonathan Ellis jbel...@gmail.com wrote: Do not include any links on the project website that might encourage non-developers to download and use nightly builds, snapshots, release

Re: Versioning scheme

2009-05-14 Thread Jonathan Ellis
. I don't care either way :) On Thu, May 14, 2009 at 11:06 AM, Johan Oskarsson jo...@oskarsson.nu wrote: I guess this time it's my OCD that thinks having a 0.3 and then a 0.3.1 feels wrong, something missing on the first one :) /Johan Jonathan Ellis wrote: There's nothing in 0.3

Submit patch link in jira

2009-05-14 Thread Jonathan Ellis
this marks the ticket as I have a patch available so it shows up here: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=truepid=12310865status=10002 let's try to remember to use this since it makes it easier to see what is ready for review. I know I have been sloppy here in the

Re: Submit patch link in jira

2009-05-14 Thread Jonathan Ellis
, May 14, 2009 at 12:31 PM, Jonathan Ellis jbel...@gmail.com wrote: this marks the ticket as I have a patch available so it shows up here: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=truepid=12310865status=10002 let's try to remember to use this since it makes it easier to see

Re: 0.3 and the OOM gremlin (CASSANDRA-208)

2009-06-03 Thread Jonathan Ellis
binary compatibility anyway? We might be stuck with having to write a tool that migrates from a 0.3 format to a 0.4 format. On Wed, Jun 3, 2009 at 2:44 PM, Jonathan Ellis jbel...@gmail.com wrote: The fix for 208 [1] is fairly invasive.  should we (a) release another RC and do more testing

Re: 0.3 and the OOM gremlin (CASSANDRA-208)

2009-06-03 Thread Jonathan Ellis
and rename get_slice_from to get_slice How does that sound? -Jonathan On Wed, Jun 3, 2009 at 4:59 PM, Jonathan Ellis jbel...@gmail.com wrote: You are right. Of course, there's no sense in making such a tool harder to write than it needs to be. But I don't care that strongly since I won't

Re: 0.3 and the OOM gremlin (CASSANDRA-208)

2009-06-04 Thread Jonathan Ellis
 - r/m old get_slice and rename get_slice_from to get_slice How does that sound? -Jonathan On Wed, Jun 3, 2009 at 4:59 PM, Jonathan Ellis jbel...@gmail.com wrote: You are right.  Of course, there's no sense in making such a tool harder to write than it needs to be. But I don't care

Time to release 0.3

2009-06-05 Thread Jonathan Ellis
The consensus was that it's better to release an imperfect-but-stable 0.3 now. We've resolved all the 0.3 issues in jira, added a BUGS.txt, and amended our NOTICES to include those of our dependencies. What's next? -Jonathan

Re: Constants class

2009-06-13 Thread Jonathan Ellis
To keep the bureaucrats happy, can you submit this suggestion to JIRA? https://issues.apache.org/jira/browse/CASSANDRA thanks, -Jonathan On Sat, Jun 13, 2009 at 2:42 PM, Edward Ribeiroedward.ribe...@gmail.com wrote: Hello, I've just checked out Cassandra source, and I would like to make a

Re: Column assert

2009-06-16 Thread Jonathan Ellis
checking user input is done with explicit if statements; verifying internal assumptions is done with asserts. Column is not a user-facing class (that would be column_t). -Jonathan On Tue, Jun 16, 2009 at 7:31 PM, Edward Ribeiroedward.ribe...@gmail.com wrote:   Dear developers,   I saw the

Re: Time to release 0.3

2009-06-17 Thread Jonathan Ellis
list and let people vote on it? I guess with the incubator there's extra steps after that, but I assume the mentors will let us know. /Johan Chris Goffinet wrote: ChangeLog? :) On Jun 5, 2009, at 1:00 PM, Jonathan Ellis wrote: The consensus was that it's better to release an imperfect

Re: Time to release 0.3

2009-06-17 Thread Jonathan Ellis
guess with the incubator there's extra steps after that, but I assume the mentors will let us know. /Johan Chris Goffinet wrote: ChangeLog? :) On Jun 5, 2009, at 1:00 PM, Jonathan Ellis wrote: The consensus was that it's better to release an imperfect-but-stable 0.3 now. We've resolved

Re: Time to release 0.3

2009-06-17 Thread Jonathan Ellis
it on the dev list and let people vote on it? I guess with the incubator there's extra steps after that, but I assume the mentors will let us know. /Johan Chris Goffinet wrote: ChangeLog? :) On Jun 5, 2009, at 1:00 PM, Jonathan Ellis wrote: The consensus was that it's better to release

Re: moving to framed transport (client breakage inevitable)

2009-06-19 Thread Jonathan Ellis
That's probably what we'll have to do if C# et al don't support Framed. Which is a shame, because framed makes a lot more sense -- there's really no reason to keep non-framed around. But that's Thrift for you. -Jonathan On Fri, Jun 19, 2009 at 11:44 PM, Ian Holsmani...@holsman.net wrote: Hey

proposal: rename table to namespace

2009-06-20 Thread Jonathan Ellis
Since we're proposing things that break stuff this weekend... :) I think we should rename table to namespace in the config file. Calling it table confuses people coming from an rdbms background (i.e. just about everyone). -Jonathan

Re: proposal: rename table to namespace

2009-06-20 Thread Jonathan Ellis
namespace might just be more confusing. On Jun 20, 2009, at 6:54 AM, Jonathan Ellis wrote: Since we're proposing things that break stuff this weekend... :) I think we should rename table to namespace in the config file. Calling it table confuses people coming from an rdbms background (i.e

Re: proposal: rename table to namespace

2009-06-22 Thread Jonathan Ellis
For those who don't follow -users, On Mon, Jun 22, 2009 at 9:35 AM, Matt Revellemreve...@gmail.com wrote: Cassandra only supports one table per instance (before today?). However, as Jonathan mentioned previously, what you consider to be tables are represented as column families in Cassandra.

Re: proposal: rename table to namespace

2009-06-22 Thread Jonathan Ellis
On Mon, Jun 22, 2009 at 11:32 AM, Alexander Staubomadevilgen...@gmail.com wrote: On Mon, Jun 22, 2009 at 4:43 PM, Jonathan Ellisjbel...@gmail.com wrote: On Mon, Jun 22, 2009 at 9:35 AM, Matt Revellemreve...@gmail.com wrote: Cassandra only supports one table per instance (before today?).  

Re: proposal: rename table to namespace

2009-06-22 Thread Jonathan Ellis
On Mon, Jun 22, 2009 at 10:59 AM, Chris Goffinetc...@chrisgoffinet.com wrote: But the thing is, BigTable uses 'Tables' as well. Calling it namespaces just makes it even more confusing. Well, (a) it's only confusing to people who have actually read the Bigtable paper, and they can cope. :P

Re: proposal: rename table to namespace

2009-06-22 Thread Jonathan Ellis
On Mon, Jun 22, 2009 at 11:58 AM, Alexander Stauboa...@purefiction.net wrote: If that's your point, you could have made it clearer. You will not achieve that goal by calling it a namespace. I would argue that users tend to be more familiar with the idea of a table That's exactly the problem:

Re: proposal: rename table to namespace

2009-06-22 Thread Jonathan Ellis
On Mon, Jun 22, 2009 at 12:26 PM, Alexander Stauboa...@purefiction.net wrote: On Mon, Jun 22, 2009 at 7:13 PM, Jonathan Ellisjbel...@gmail.com wrote: Actually, Cassandra doesn't support multidimensional cells a la Bigtable. I meant multidimensional in the column sense. If you mean CFs have

[POSTPONED] VOTE: Cassandra 0.3.0 release

2009-06-24 Thread Jonathan Ellis
the above potential problem. What do people feel? Jun IBM Almaden Research Center K55/B1, 650 Harry Road, San Jose, CA 95120-6099 jun...@almaden.ibm.com Jonathan Ellis jbel...@gmail.com Jonathan Ellis jbel...@gmail.com 06/22/2009 08:14 AM Please respond to cassandra-dev

Re: Time to move to (ASF-sanctioned) git?

2009-06-24 Thread Jonathan Ellis
Forgot my footnotes. :) [1] http://spyced.blogspot.com/2009/06/patch-oriented-development-made-sane.html [2] http://markmail.org/thread/2vtyrx56jwsloxhn [3] http://code.google.com/p/gerrit/ On Wed, Jun 24, 2009 at 10:28 AM, Jonathan Ellisjbel...@gmail.com wrote: Yesterday I posted about my

0.3.0 RC3

2009-06-24 Thread Jonathan Ellis
RC3 is out at http://people.apache.org/~jbellis/cassandra/cassandra-0.3.0-rc3.tar.gz svn tag is https://svn.apache.org/repos/asf/incubator/cassandra/tags/cassandra-0.3.0-rc3 changelog is https://svn.apache.org/repos/asf/incubator/cassandra/tags/cassandra-0.3.0-rc3/CHANGES.txt Planning to

Re: moving to framed transport (client breakage inevitable)

2009-06-24 Thread Jonathan Ellis
If we were talking about any other project I would say great idea! But with Thrift, simple patches can take 6+ weeks to be committed. (I am not exaggerating.) So I would vote for making it configurable to avoid screwing over people on those platforms. -Jonathan On Sat, Jun 20, 2009 at 3:32 PM,

Re: Alternative wire protocols

2009-06-24 Thread Jonathan Ellis
I'm interested in Avro as kind of a better Thrift, but that's still pretty alpha. I'm not really interested in stuff that's going to be Much Slower like anything over http (Jay from Voldemort said that's basically a waste of time and I believe him) or anything that requires hand-writing clients

Re: Alternative wire protocols

2009-06-25 Thread Jonathan Ellis
On Thu, Jun 25, 2009 at 1:35 PM, Michael Greenemichael.gre...@gmail.com wrote: A quick time-to-production could be achieved through distributing re-packaged client libraries.  I like that idea.  I brought it up on IRC at one time, and Jonathan convinced me I was wrong, but now I can't remember

[VOTE] 0.3.0-final

2009-06-26 Thread Jonathan Ellis
I propose releasing 0.3.0-rc3 as 0.3.0-final. We've had some unofficial voting on the rc3 thread but this is the official one. :) Voting is open for 72h. binary build is at http://people.apache.org/~jbellis/cassandra/cassandra-0.3.0-rc3.tar.gz svn tag is

Re: Website, an update

2009-06-27 Thread Jonathan Ellis
Wow, that would definitely make us the leader in website awesomeness. :) On Sat, Jun 27, 2009 at 4:22 PM, Daniel Lundind...@eintr.org wrote: I had some spare time, so I worked on a new design for the project site today. Check it out: http://cassandra-test.unempty.com/ Let me know what you

Re: Web sIte design, humble beginnings ... and a doodle

2009-06-27 Thread Jonathan Ellis
is .svg source? https://issues.apache.org/jira/browse/CASSANDRA-231 On Sat, Jun 27, 2009 at 5:49 PM, Evan Weaverewea...@gmail.com wrote: Do you have the vector source for this? I'd like to fuss with it a bit. If you want I can even give it to one of our designers to work on briefly. Evan

Re: Alternative wire protocols

2009-06-28 Thread Jonathan Ellis
On Sun, Jun 28, 2009 at 7:49 AM, Bill de hOrab...@dehora.net wrote: I've been playing around again with Cassandra recently and I agree Thrift is a pain point, and that was the case when I looked at the project originally. But I think it's not so much Thrift as how the data is presented to

Re: [VOTE] 0.3.0-final

2009-06-29 Thread Jonathan Ellis
, at 12:10 PM, Eric Evans wrote: On Fri, 2009-06-26 at 12:49 -0500, Jonathan Ellis wrote: I propose releasing 0.3.0-rc3 as 0.3.0-final. +1 -- Eric Evans eev...@racklabs.com -- Ian Holsman i...@holsman.net -- Evan Weaver

[RESULTS] [VOTE] 0.3.0-final

2009-06-29 Thread Jonathan Ellis
Voting is over. That's +3 from committers in favor. On Fri, Jun 26, 2009 at 12:49 PM, Jonathan Ellisjbel...@gmail.com wrote: I propose releasing 0.3.0-rc3 as 0.3.0-final. We've had some unofficial voting on the rc3 thread but this is the official one. :) Voting is open for 72h. binary

The future of 0.3

2009-06-29 Thread Jonathan Ellis
With 0.3.0 voted in (the mentors technically have the last word, but let's assume it does get approved :), we should think about the future of the 0.3 branch. Fundamentally 0.3 has issues (see BUGS.txt) and fixing those issues would turn it into 0.4, so I see the 0.3 maintenance mission as very

Re: cassandra-cli suggestion

2009-07-01 Thread Jonathan Ellis
could you submit a patch to JIRA? https://issues.apache.org/jira/browse/CASSANDRA thanks, -Jonathan On Wed, Jul 1, 2009 at 1:18 PM, Anthony Molinaroantho...@pinkbunny.net wrote: Hi,  I'm in the process of packaging cassandra into some rpms to deploy at work and noticed that cassandra-cli

Re: Cassandra version number policy

2009-07-01 Thread Jonathan Ellis
We have already committed to breaking disk format for 0.4 (to fix OOM conditions). To me 0.3 to 0.4 is major (like with postgresql 8.3 to 8.4) but I guess it's just semantics. -Jonathan On Wed, Jul 1, 2009 at 1:14 PM, Anthony Molinaroantho...@pinkbunny.net wrote: Hi,  I've been lurking on

Re: Debug Cassandra with Eclipse

2009-07-04 Thread Jonathan Ellis
you just need to specify the right port, and make sure that's not firewalled. if you look at bin/cassandra.in.sh, -Xrunjdwp:transport=dt_socket,server=y,address=,suspend=n \ that option tells the jvm to listen for a remote debugger on port . On Sat, Jul 4, 2009 at 10:53 AM,

0.3 release process, again

2009-07-06 Thread Jonathan Ellis
Ant's points over on incubator general. - there must be a source distribution - the artifact names must include -incubating - the distributions must be signed The confusion here is, I thought we were supposed to get release approval first, _then_ create and sign the official tarballs. (ant

Re: Column is immutable ... no, it's not

2009-07-07 Thread Jonathan Ellis
Please give us some credit. It's obvious that you can modify the contents of the byte[], but we're not trying to protect against malicious plugins or... something; we're only concerned with thread safety. For that, the existing code is fine; it's not worth introducing extra copies to a

Re: REST client interface

2009-07-09 Thread Jonathan Ellis
Sure, that's reasonable. On Thu, Jul 9, 2009 at 3:21 AM, Viktor Klangviktor.kl...@gmail.com wrote: Thank you Jonathan, perhaps there would be a market for a soclet pooling solution? -- Viktor On Jul 8, 2009 11:46 PM, Jonathan Ellis jbel...@gmail.com wrote: I've always used socket-per

Anyone want to maintain CQL?

2009-07-15 Thread Jonathan Ellis
CQL is (a) very incomplete and (b) about to get more so soon (see https://issues.apache.org/jira/browse/CASSANDRA-296). If anyone wants to step up and say this is important enough that I'm willing to maintain it, then great, let's talk. Otherwise, we're leaning towards consigning it to svn

Re: Anyone want to maintain CQL?

2009-07-17 Thread Jonathan Ellis
So, for 185, I have a half-assed CQL update that at least keeps it working for ascii data. Once that's in, for 296 I propose that we proceed with the remove, and since it will be a clean remove in svn history it will be easy for Edward or anyone else to revert and work on locally. When progress

heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Hi guys, We're trying to get the data format incompatibilities done with before we put out a 0.4 beta. There's 3 coming up. In order of when they are likely to be finished: https://issues.apache.org/jira/browse/CASSANDRA-341 -- to fix bugs (today)

Re: heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Re the index bugs: this is referring to column indexes, not key indexes. (The former are kept in the SSTable; the latter are in a separate file. Don't delete your -Index.db files. :) And a correction: 341 is the only one that the compaction process will fix, not 332 as well. -Jonathan On Tue,

Re: EOFException in get_slice_super_by_names

2009-08-05 Thread Jonathan Ellis
I found the bug: https://issues.apache.org/jira/browse/CASSANDRA-341 In 0.3 your best bet is to increase ColumnIndexSizeInKB in your config file to a value large enough that it never creates indexes. -Jonathan On Fri, Jul 31, 2009 at 4:18 PM, Mark McBridemark.mcbr...@gmail.com wrote: Has

default OrderPreservingPartitioner changed

2009-08-07 Thread Jonathan Ellis
The default OPP now does comparisons based strictly on byte order, and is no longer collation aware. This is a better default choice for those who don't need collation since it's much faster. If you do need collation, the old partitioner is still available as CollatingOPP: -

Re: build failed

2009-08-10 Thread Jonathan Ellis
Oops! My fault. I'll just run thrift -gen java one more time, just in case, I thought... :) Fix checked in. On Mon, Aug 10, 2009 at 7:28 PM, Mark Lewandowskim...@digg.com wrote: I just checked out the latest from trunk, and can't get it to build. This is what I've gotten so far:

Re: [vote] 0.4.0 beta 1

2009-08-12 Thread Jonathan Ellis
On Wed, Aug 12, 2009 at 5:49 AM, ant elderant.el...@gmail.com wrote: On Tue, Aug 11, 2009 at 9:44 PM, Jonathan Ellisjbel...@gmail.com wrote: What do you mean by an unofficial release? A release needs follow the ASF release procedures and be voted on so needs svn tags, and distributions to

Re: Fixing the data model names

2009-08-12 Thread Jonathan Ellis
My brief two cents: I think terminology + api changes need to be a big improvement to be worth breaking things at this point, and I don't think this proposal meets that bar. In fact I'm not sure any proposal could. On the specifics: * Keyspace vs Database Actually the right concept from the

Re: Fixing the data model names

2009-08-12 Thread Jonathan Ellis
On Wed, Aug 12, 2009 at 7:23 PM, Evan Weaverewea...@gmail.com wrote: Re. Jonathan on database: oracle/sqlserver/mysql/postgres call it a database. No. With a database (ignoring things like user accounts that don't apply) the difference is that you decide at connection time what database you

Re: Fixing the data model names

2009-08-12 Thread Jonathan Ellis
If thrift were sane it would look something like struct Column { byte[] name, optional listColumn subcolumns, optional int64 timestamp, optional byte[] value } you can either have the subcolumns, or the timestamp and value seems reasonable to me. of course in the real world, thrift

Re: Fixing the data model names

2009-08-12 Thread Jonathan Ellis
I agree with the proposition that the SuperColumn name is weak. (Although not, as I mentioned, Column or ColumnFamily.) And I could go with schema over keyspace. One option to deal with SC would be to excise the term SC (and SCF from the config) and instead just have Columns, which may or may

Re: Bootstrapping newly added nodes

2009-08-13 Thread Jonathan Ellis
see CASSANDRA-195 On Thu, Aug 13, 2009 at 5:27 AM, Mark Robsonmar...@gmail.com wrote: Hi, Looking at CASSANDRA-213 I see that there is now a procedure to Bootstrap a newly added node so it will copy the relevant parts of data from other nodes in the cluster. This is a Good Thing. What I

Re: Thrift API and SlicePredicate bug?

2009-08-13 Thread Jonathan Ellis
you're giving garbage as a slice predicate. thrift should error check that but does not. https://issues.apache.org/jira/browse/CASSANDRA-308 On Thu, Aug 13, 2009 at 6:04 AM, Sergio Bossasergio.bo...@gmail.com wrote: Hi guys, the following (client) method call: client.get_slice(Test1,      

Re: [vote] 0.4.0 beta 1

2009-08-13 Thread Jonathan Ellis
On Thu, Aug 13, 2009 at 1:20 AM, ant elderantel...@apache.org wrote: Ok I wonder if this is just misunderstandings based on naming and conventions. Could what you're calling here as an unofficial beta release be the same as what other ASF projects might name a release candidate? Is the 0.4.0

Re: Fixing the data model names

2009-08-13 Thread Jonathan Ellis
A row is the data associated with a key in a given CF. On Thu, Aug 13, 2009 at 12:17 AM, Arin Sarkissiana...@rspot.net wrote: Row? What are you guys referring to as a row? no - this isnt a joke Arin On Wed, Aug 12, 2009 at 9:39 PM, Evan Weaverewea...@gmail.com wrote: PS. How's Avro these

Re: Fixing the data model names

2009-08-13 Thread Jonathan Ellis
The assumption that within a CF only IColumns of the same type (C or SC) will be compared is baked in pretty deeply. -Jonathan On Wed, Aug 12, 2009 at 11:23 PM, Evan Weaverewea...@gmail.com wrote: Incidentally, is there any specific reason the collation has to be pre-defined at the CF? What if

Re: Fixing the data model names

2009-08-13 Thread Jonathan Ellis
On Thu, Aug 13, 2009 at 12:24 PM, Evan Weaverewea...@gmail.com wrote: What do you see as the benefit of ColumnFamily? It correctly implies group of columns w/o sounding excessively generic like ColumnCollection or something, and it means mostly the same thing as it does in Bigtable, which can be

Re: Fixing the data model names

2009-08-13 Thread Jonathan Ellis
On Thu, Aug 13, 2009 at 12:38 PM, Evan Weaverewea...@gmail.com wrote: I understand the BigTable precedent issue...but it's also a group of rows, and a group of keys, just as much. Column Family leaves out the keys Well, it does, or does not, to the same degree that a Table in a rbd does or

Re: [re-vote] cassandra 0.4 beta 1

2009-08-14 Thread Jonathan Ellis
-6099 jun...@almaden.ibm.com Jonathan Ellis ---08/14/2009 10:04:11 AM---I thought it best to take a new vote on making an *official* release of 0.4 beta 1. From: Jonathan Ellis jbel...@gmail.com To: cassandra-dev@incubator.apache.org Date: 08/14/2009 10:04 AM Subject: [re-vote

Re: [re-vote] cassandra 0.4 beta 1

2009-08-14 Thread Jonathan Ellis
.    [javac] Note: Recompile with -Xlint:unchecked for details.    [javac] 2 errors On Fri, Aug 14, 2009 at 12:35 PM, Jun Raojun...@almaden.ibm.com wrote: +1. Jun IBM Almaden Research Center K55/B1, 650 Harry Road, San Jose, CA 95120-6099 jun...@almaden.ibm.com Jonathan Ellis ---08/14

[results] [re-vote] cassandra 0.4 beta 1

2009-08-14 Thread Jonathan Ellis
The +1s have it. This vote is closed and we have submitted artifacts to be reviewed by the incubator PMC (see other thread). thanks, -Jonathan

Re: Exception at startup

2009-08-21 Thread Jonathan Ellis
that looks like a log replay bug we had in beta one but is fixed in current trunk. On 8/21/09, Anthony Molinaro antho...@alumni.caltech.edu wrote: Hi, I'm trying to make the transition from 0.3.0 to 0.4.0. I recreated all my packages, rejiggered my load scripts to load up data, and loaded

Re: Exception at startup

2009-08-22 Thread Jonathan Ellis
, Jonathan Ellis wrote: no, that should be recent enough. Was this cl written by the 0.3 install prior to the upgrade? On 8/21/09, Anthony Molinaro antho...@alumni.caltech.edu wrote: Maybe not, I'm using trunk (or at least trunk as of yesterday afternoon), was it fixed today by chance

Re: Data model names, reloaded

2009-08-24 Thread Jonathan Ellis
IMO the window for making this kind of change has passed. We've talked about finalizing the 0.4 api weeks ago, we got a beta out with it, and it does the job. The timeline wasn't a surprise to anyone paying attention to the list. It's time to move on. -Jonathan On Fri, Aug 21, 2009 at 1:36

Re: Data model names, reloaded

2009-08-24 Thread Jonathan Ellis
On Mon, Aug 24, 2009 at 10:26 AM, Toby DiPasqualecodeslin...@gmail.com wrote: That feels to me to be a short-sighted point of view. I'd imagine that its more important for people be able to understand the data model than meeting some kind of arbitrary timeline. I, too, find the current naming

Re: [VOTE] Change data model names for 0.5

2009-08-24 Thread Jonathan Ellis
-1 On Mon, Aug 24, 2009 at 2:29 PM, Evan Weaverewea...@gmail.com wrote: Resolved, that the data model names should be changed in Cassandra 0.5. Evan PS. Committers have the most weight, but everyone's voice is heard. -- Evan Weaver

Re: Wiki changes

2009-08-24 Thread Jonathan Ellis
It goes to -commits (with a reply-to of -dev). Jira generates way more spam to -commits, so no worries; we're used to it. :) -Jonathan On Mon, Aug 24, 2009 at 2:33 PM, Curt Micolasen...@gmail.com wrote: Hey Devs, I am not sure who receives the email mentioned whenever the wiki is edited,

Re: Wiki changes

2009-08-24 Thread Jonathan Ellis
Yes, definitely +1 thanks for the updates. On Mon, Aug 24, 2009 at 2:36 PM, Michael Greenemichael.gre...@gmail.com wrote: For those thinking, that's some spam I'd like to receive as well! you can get in on the action at http://mail-archives.apache.org/mod_mbox/incubator-cassandra-commits/

Re: [VOTE] 0.4 RC1

2009-09-02 Thread Jonathan Ellis
+1 from me.

Re: [VOTE] 0.4.0 RC2

2009-09-10 Thread Jonathan Ellis
+1

Calling all mentors

2009-09-16 Thread Jonathan Ellis
Hey mentors, We've had an RC2 vote over on incubator-general and cc'd here for days. Please vote. This is getting frustrating: we never did finish the RC1 vote, before it got lapped by RC2. Remember when I was explaining that the reason I was reluctant to go through the full release process

Re: [PROPOSAL] CTR for all non-code changes

2009-09-18 Thread Jonathan Ellis
On Fri, Sep 18, 2009 at 3:12 PM, Paul Querna p...@querna.org wrote: I am starting to believe it would be helpful if Cassandra adopted a similar CTR policy for non-code changes on trunk and all branches. Agreed. -Jonathan

Re: Binary Loading Question

2009-09-30 Thread Jonathan Ellis
So it looks like it takes multiple rows, one per column in the rowmutation. Which is confusing and weird if you ask me. On Wed, Sep 30, 2009 at 1:57 PM, Jonathan Ellis jbel...@gmail.com wrote: Yeah, just eyeballing it                ColumnFamilyStore cfStore = columnFamilyStores_.get

Re: Error in the latest Trunk - RackAware Statergy

2009-10-05 Thread Jonathan Ellis
and send the code across. if there is something which i need to know i will sync up @ IRC... (hopefully it will starts working for me)... Regards, /VJ On Mon, Oct 5, 2009 at 3:15 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Oct 5, 2009 at 5:02 PM, Vijay vijay2...@gmail.com wrote

want a ticket to NoSQL east?

2009-10-11 Thread Jonathan Ellis
Hi all, Rackspace is sponsoring NoSQL East (Oct 29/30 in Atlanta -- https://nosqleast.com/2009/) and we have two extra tickets to give away to the community. (Tickets for the original venue of 250 seats sold out, then they found another venue w/ 40 more seats, and it looks like those will sell

Re: want a ticket to NoSQL east?

2009-10-12 Thread Jonathan Ellis
These are spoken for now. On Sun, Oct 11, 2009 at 9:38 PM, Jonathan Ellis jbel...@gmail.com wrote: Hi all, Rackspace is sponsoring NoSQL East (Oct 29/30 in Atlanta -- https://nosqleast.com/2009/) and we have two extra tickets to give away to the community.  (Tickets for the original venue

Re: [VOTE] Release 0.4.1

2009-10-12 Thread Jonathan Ellis
+1 On Mon, Oct 12, 2009 at 1:45 PM, Eric Evans eev...@rackspace.com wrote: The 0.4 branch has received a number of important bug fixes[1] since we released 0.4.0, it feels about time for an 0.4.1. Shall we? :) SVN Tag:

Re: [RE-VOTE] Release 0.4.1

2009-10-12 Thread Jonathan Ellis
+1 On Mon, Oct 12, 2009 at 2:32 PM, Eric Evans eev...@racklabs.com wrote: Ok, sorry for the false start, here it is again (as r824462). The 0.4 branch has received a number of important bug fixes[1] since we released 0.4.0, it feels about time for an 0.4.1. Shall we? :) SVN Tag:

important performance note

2009-10-22 Thread Jonathan Ellis
Brandon Williams has been doing some testing on the garbage collector options we are using, and found that removing the line -XX:CMSInitiatingOccupancyFraction=1 \ from bin/cassandra.in.sh dramatically increases throughput by reducing the amount of CPU used by the garbage collector.

  1   2   >