Re: Cassandra Demo/Tutorial Applications

2010-03-12 Thread Jonathan Ellis
xtracted out > > > --- > Sent from my phone > Ian Holsman - 703 879-3128 > > On 13/03/2010, at 4:46 PM, Jonathan Ellis wrote: > >> On Fri, Mar 12, 2010 at 1:55 PM, Krishna Sankar >> wrote: >>> >>> I was looking at this from CASSANDRA-873 as well

Re: Cassandra Demo/Tutorial Applications

2010-03-12 Thread Jonathan Ellis
On Fri, Mar 12, 2010 at 1:55 PM, Krishna Sankar wrote: > I was looking at this from CASSANDRA-873 as well as hands-on homework (!) > for my OSCON tutorial. Have couple of questions. Would appreciate insights: > > A)  Cassandra-873 suggests Luenandra as one demo application > B)  Are there other id

Re: Re: wo did some test on cassandra ,but the result puzzled us

2010-03-12 Thread Jonathan Ellis
String(key), new ColumnPath( > "Standard1", null, "data".getBytes("UTF-8")), data,timestamp, > ConsistencyLevel.ONE); > totalSWriteTime += (System.currentTimeMillis() - start); >if(i % 1 == 0){ > System.out.println("Has write " + i); >

Re: wo did some test on cassandra ,but the result puzzled us

2010-03-11 Thread Jonathan Ellis
why reads are slower than writes: http://wiki.apache.org/cassandra/FAQ#reads_slower_writes no idea on seq vs random. i would not be surprised if there is a bug in your test code. On Fri, Mar 12, 2010 at 12:36 AM, Bingbing Liu wrote: > We did some test on on Cassandra, and the benchmark is from

Re: thinking about dropping hinted handoff

2010-03-10 Thread Jonathan Ellis
Read-only for a specific client is completely different from trying to read-only the entire node / cluster. So no, nothing wrong with that. 2010/3/10 Ted Zlatanov : > On Fri, 26 Feb 2010 08:18:49 -0600 Ted Zlatanov wrote: > > TZ> On Tue, 23 Feb 2010 12:30:52 -0600 Ted Zlatanov wrote: >> Can a C

Re: Re: how to do the filter in cassanda?

2010-03-10 Thread Jonathan Ellis
ter , which part of the code should > be modified? > > (i have tried to read the code , but as the cassandra is so big a system , i > failed) > > maybe someone can guide me ? > > > 2010-03-10 > > > > Bingbing Liu > > > > 发件人: Jonathan Ellis > 发送时间:

Re: Latest svn code

2010-03-10 Thread Jonathan Ellis
Both. The latest 0.6 code is in the 0.6 branch. The latest trunk code (will become 0.7) is in trunk. Trunk is in "breaking stuff" mode right now. On Wed, Mar 10, 2010 at 9:50 AM, David Dabbs wrote: > Hi. Is the latest code in trunk or the 0.6 branch? > > Thanks, > > david > > > >

Re: Further enhancments in j.a.c.auth

2010-03-10 Thread Jonathan Ellis
The latter. On Wed, Mar 10, 2010 at 2:33 AM, Morten Wegelbye Nissen wrote: > Jonathan Ellis wrote: >> >> We should probably use http://www.mindrot.org/projects/jBCrypt/. >> (Lots of background: >> >> http://chargen.matasano.com/chargen/2007/9/7/enough-with-th

Re: Further enhancments in j.a.c.auth

2010-03-09 Thread Jonathan Ellis
We should probably use http://www.mindrot.org/projects/jBCrypt/. (Lots of background: http://chargen.matasano.com/chargen/2007/9/7/enough-with-the-rainbow-tables-what-you-need-to-know-about-s.html) We kind of have a nagging feeling though that rolling our own auth framework in 2010 is the wrong ap

Re: how to do the filter in cassanda?

2010-03-09 Thread Jonathan Ellis
If you mean WHERE clause-like filtering, that's always done client side right now. On Tue, Mar 9, 2010 at 1:00 AM, Bingbing Liu wrote: > hi, > i mean when cassandra get data from the file system of each node (in other > words, read data from file) > > does the filtering condition also been trans

Re: seqid_ in Cassandra.Client

2010-03-02 Thread Jonathan Ellis
ra is headed? > > Thanks, > > david > >> -Original Message- >> From: Jonathan Ellis [mailto:jbel...@gmail.com] >> Sent: Tuesday, March 02, 2010 10:02 AM >> To: cassandra-dev@incubator.apache.org >> Subject: Re: seqid_ in Cassandra.Client

Re: seqid_ in Cassandra.Client

2010-03-02 Thread Jonathan Ellis
org.apache.cassandra.thrift.* (in 0.6) or .service (in 0.5) is autogenerated by Thrift. We try not to mess with the Thrift compiler except to fix bugs, but you're welcome to take a stab at it. :) https://issues.apache.org/jira/browse/THRIFT On Tue, Mar 2, 2010 at 9:58 AM, David Dabbs wrote: > H

Re: Commit log changes in 0.7

2010-02-26 Thread Jonathan Ellis
Remember that the header is per-segment. So I would just say that CF modification forces a new segment creation. We can support additions and deletions just by doing that, so that should probably be the first milestone. Renames are harder because currently we map id -> name -> files on disk. We

Re: consistent backups

2010-02-25 Thread Jonathan Ellis
Go ahead. 2010/2/25 Ted Zlatanov : > On Thu, 25 Feb 2010 08:22:38 -0600 Jonathan Ellis wrote: > > JE> 2010/2/25 Ted Zlatanov : >>> I want a consistent backup. > > JE> You can get an "eventually consistent backup" by flushing all nodes > JE> an

Re: consistent backups (was: thinking about dropping hinted handoff)

2010-02-25 Thread Jonathan Ellis
2010/2/25 Ted Zlatanov : > I want a consistent backup. You can get an "eventually consistent backup" by flushing all nodes and snapshotting; no individual node's backup is guaranteed to be consistent but if you restore from that snapshot then clients will get eventually consistent behavior as usua

Re: [VOTE] Release 0.5.1

2010-02-23 Thread Jonathan Ellis
+1 On Tue, Feb 23, 2010 at 5:27 PM, Eric Evans wrote: > > There have been some important bug fixes[1] in the 0.5 branch since we > released. Plus, I thought it would be cool to conduct two separate > release votes at the time. :) > > I propose the following tag and artifacts for 0.5.1: > > SVN Ta

Re: thinking about dropping hinted handoff

2010-02-23 Thread Jonathan Ellis
2010/2/23 Ted Zlatanov : > JE> because in a masterless environment there is no way to tell "when it's > over" > > Would it work to use an external agent?  It can get the list of nodes, > make them all read-only, then wait until every node reports no write > activity through JMX. At that point I'd

Re: thinking about dropping hinted handoff

2010-02-23 Thread Jonathan Ellis
2010/2/23 Ted Zlatanov : > You're welcome.  I don't understand why it doesn't help reach > consistency, though.  If you turn all the nodes in a cluster read-only > at the API level, what can make them inconsistent besides inter-node > traffic and scheduled writes?  I'd assume that activity will die

Re: thinking about dropping hinted handoff

2010-02-23 Thread Jonathan Ellis
2010/2/23 Ted Zlatanov : >>> Can a Cassandra node be made read-only (as far as clients know)? > > JE> no. > > Is there value (for reaching consistency) in adding that functionality? No. Thanks for the easy questions today. :) -Jonathan

Re: Tests ... large-volume, long-running, failure-case

2010-02-23 Thread Jonathan Ellis
contrib/py_stress is our standard performance tool. I think contrib/ is only in the source distro. On Tue, Feb 23, 2010 at 2:47 PM, Masood Mortazavi wrote: > Hi folks, > > Besides the regression tests described in "How to Contribute," are there > performance, large-data-volume, long-running or f

Re: [VOTE] Release 0.6.0-beta2

2010-02-23 Thread Jonathan Ellis
+1 On Tue, Feb 23, 2010 at 1:36 PM, Eric Evans wrote: > > There has been a lot of cool work done since our last release[1] (0.5.0), > and while there's still a bit more to be done[2], the dust is settling on > all of the big stuff. Now seems like a good time for a beta release. > > I propose the

Re: thinking about dropping hinted handoff

2010-02-23 Thread Jonathan Ellis
no. 2010/2/23 Ted Zlatanov : > On Mon, 22 Feb 2010 21:12:58 +0100 Peter Schüller wrote: > > PS> In general, what are people's thoughts on the appropriate mechanism to > PS> gain confidence that the cluster as a whole is reasonably consistent? > PS> In particular in relation to performing maintena

Re: thinking about dropping hinted handoff

2010-02-22 Thread Jonathan Ellis
On Mon, Feb 22, 2010 at 6:57 PM, Ryan King wrote: > I think I find it more compelling because we're currently experiencing > pain related to HH. I'd be ok with keeping it as long as we can make > the effects of a node down be less drastic. Can you open a ticket and tag it 0.6? I think I can impl

Re: thinking about dropping hinted handoff

2010-02-22 Thread Jonathan Ellis
On Mon, Feb 22, 2010 at 6:56 PM, Ryan King wrote: > Maybe I mis-read the code, but I thought it was trigger for every compaction. Full compactions is correct, at least that is the correct design. :) -Jonathan

Re: thinking about dropping hinted handoff

2010-02-22 Thread Jonathan Ellis
On Mon, Feb 22, 2010 at 1:53 PM, Ryan King wrote: > So, after having some more experience with HH, I've reformed my > opinion. I think we have 3 options: > > 1. Make the natural endpoints responsible for the hints. > 2. Make a random node responsible for hints. > 3. Get rid of HH. > > #1 reduces t

http://wiki.apache.org/cassandra/Improving%20Initial%20User%20Experiance

2010-02-21 Thread Jonathan Ellis
shouldn't a list of to-dos be in jira?

Re: Building Cassandra from Behind a Proxy

2010-02-19 Thread Jonathan Ellis
Can you add this to http://wiki.apache.org/cassandra/HowToContribute ? (that page looks like it could use a little refactoring) On Thu, Feb 18, 2010 at 5:21 PM, Gary Dusbabek wrote: > I found this: > > set ANT_OPTS=-Dhttp.proxyHost=myproxy -Dhttp.proxyPort=3128 > > from here:  http://www.jaya.fr

Re: 0.6, 0.7, and the future

2010-02-18 Thread Jonathan Ellis
On Thu, Feb 18, 2010 at 2:49 PM, Ryan King wrote: > On Thu, Feb 18, 2010 at 11:45 AM, Anthony Molinaro > wrote: >> +1 >> (although I'm dreading the export from old sstables into new sstables, >> any ideas on how fast that might be?, and I guess any idea if a tool >> for doing this will be provide

Re: 0.6, 0.7, and the future

2010-02-17 Thread Jonathan Ellis
On Wed, Feb 17, 2010 at 5:26 PM, David Strauss wrote: > How dependent are the 0.8 high-level queries on the 0.7 internal changes? It will definitely be affected by the String -> byte[] change with everything else. It also may benefit from being able to have more levels of subcolumns than the 1 w

0.6, 0.7, and the future

2010-02-17 Thread Jonathan Ellis
We're looking at branching 0.6 today and starting 0.7 work. 0.6 shaped up to be a really nice follow-up to 0.5, where we improved just about everything while keeping the upgrade path super easy. (We changed the network around again, but no disk changes, so it's just going to be shutdown-and-resta

Re: loadbalance and different strategies

2010-02-09 Thread Jonathan Ellis
On Tue, Feb 9, 2010 at 10:22 PM, Jaakko wrote: > Yes, that is of course true. However, I don't think this modification > would make the algorithm much less simple. We still consider the most > loaded node only, but take into account which DC the node is in. > Without that extra step, loadbalance o

Re: loadbalance and different strategies

2010-02-09 Thread Jonathan Ellis
On Tue, Feb 9, 2010 at 9:45 PM, Jonathan Ellis wrote: > That seems reasonable, although it feels a little weird for X to as G > for a token and be given one that G isn't the primary for. "for X to ask* G"

Re: loadbalance and different strategies

2010-02-09 Thread Jonathan Ellis
On Tue, Feb 9, 2010 at 6:12 PM, Jaakko wrote: > Let us suppose that all ranges are equal in size. In this case G's > range is A-G. If X boots in G's DC, it should take a token in the > middle of this range, which would be somewhere around D. If X boots > behind D Ah, I see, you are saying, "G has

Re: loadbalance and different strategies

2010-02-09 Thread Jonathan Ellis
On Tue, Feb 9, 2010 at 6:40 PM, Coe, Robin wrote: > Am I correct in assuming that a node given the flush command will not accept > new writes No. It's not designed to do that.

Re: loadbalance and different strategies

2010-02-09 Thread Jonathan Ellis
On Tue, Feb 9, 2010 at 3:13 AM, Jaakko wrote: > What they probably should do, is to just > consider nodes in the DC they are booting to, and try to balance load > evenly in that DC. I'm not sure what problem that would solve. It seems to me there are two goals: 1. don't transfer data across da

Re: get_range_slice() tester

2010-02-08 Thread Jonathan Ellis
I'm seeing failures on 0.5 but success against trunk, is that also what you see? -Jonathan On Mon, Feb 8, 2010 at 4:42 PM, Jack Culpepper wrote: > On Mon, Feb 8, 2010 at 2:34 PM, Jonathan Ellis wrote: >> This is supposed to pass on a single node but fail on two, correct? > &g

Re: get_range_slice() tester

2010-02-08 Thread Jonathan Ellis
This is supposed to pass on a single node but fail on two, correct? What are the tokens on your two nodes, in case that is relevant? (nodeprobe ring will tell you.) -Jonathan On Mon, Feb 8, 2010 at 4:01 PM, Jack Culpepper wrote: > Here's a tester program, for contrib. It generates 10 keys using

Re: nose tests -- run recently?

2010-02-08 Thread Jonathan Ellis
On Mon, Feb 8, 2010 at 3:53 PM, Jack Culpepper wrote: > $ python --version > Python 2.6.4 2.6.4 here, too. > One more stupid question if you can stand it: when I get "Connection > reset by peer" on the python side, where should I see the > corresponding error on the java side? Connection reset

Re: nose tests -- run recently?

2010-02-08 Thread Jonathan Ellis
On Mon, Feb 8, 2010 at 3:02 PM, Jack Culpepper wrote: > On Mon, Feb 8, 2010 at 12:38 PM, Jonathan Ellis wrote: >> On Mon, Feb 8, 2010 at 2:25 PM, Jack Culpepper >> wrote: >>> Are you running on a platform that doesn't care about capitalization? >> >> Y

Re: nose tests -- run recently?

2010-02-08 Thread Jonathan Ellis
On Mon, Feb 8, 2010 at 2:25 PM, Jack Culpepper wrote: > Are you running on a platform that doesn't care about capitalization? Yes. If you're building Thrift on windows you're only the second person I know to have done so. :) > error: [Errno 104] Connection reset by peer I've never seen Thrift

Re: nose tests -- run recently?

2010-02-08 Thread Jonathan Ellis
On Mon, Feb 8, 2010 at 3:17 AM, Jack Culpepper wrote: > ERROR: Failure: ImportError (No module named Cassandra) fixed in r907705, btw.

Re: nose tests -- run recently?

2010-02-08 Thread Jonathan Ellis
On Mon, Feb 8, 2010 at 3:17 AM, Jack Culpepper wrote: > Ok. Fixed that, but then run nosetests and I get a bunch of other > errors.. I must be doing something wrong. I just checked out the code > like 20 mins ago. I'm going to have to agree with your diagnosis -- everything passes for me -- but i

Re: batch file error on vista

2010-02-04 Thread Jonathan Ellis
The existing batch file works fine for me on windows 7. So does Tom's, modified to not hardcode stuff it shouldn't. (attached, w/ name mangling to make gmail happy). Can anyone test this on XP? If the code we're removing to "shorten lib path for older platforms" is required for XP we should pro

Re: bitmap slices

2010-02-04 Thread Jonathan Ellis
2010/2/4 Ted Zlatanov : > JE> The mask check needs to be done in the Slice Filter, not SP. > > Sorry, I don't know what you mean.  Are you referring to > o.a.c.db.filter.SliceQueryFilter?  So I'd just add an extra parameter to > the constructor and change the matching logic? Right, but make it opt

Re: bitmap slices

2010-02-03 Thread Jonathan Ellis
lve a real problem for you? -Jonathan 2010/2/3 Ted Zlatanov : > On Mon, 1 Feb 2010 11:14:12 -0600 Jonathan Ellis wrote: > > JE> 2010/2/1 Ted Zlatanov : >>> On Mon, 1 Feb 2010 10:41:28 -0600 Jonathan Ellis wrote: >>> > JE> I don't think this is very useful f

Re: column sizes (was: online codes (?))

2010-02-03 Thread Jonathan Ellis
gle node or am I off the mark. > > -Michael > > On Thu, Feb 4, 2010 at 8:28 AM, Jonathan Ellis wrote: >> On Wed, Feb 3, 2010 at 4:25 PM, Michael Pearson wrote: >>> I'd imagine the gossip overhead and key/column per disk limitation is >>> too open for a

Re: column sizes (was: online codes (?))

2010-02-03 Thread Jonathan Ellis
On Wed, Feb 3, 2010 at 4:25 PM, Michael Pearson wrote: > I'd imagine the gossip overhead and key/column per disk limitation is > too open for abuse to recommend storing lob columns with any level of > predictability, particularly if frequent updates are involved. Gossip overhead is constant for a

Re: column sizes

2010-02-03 Thread Jonathan Ellis
it puts a limit on result set size no matter what kind of columns are in the result 2010/2/3 Ted Zlatanov : > On Wed, 3 Feb 2010 07:23:06 -0600 Jonathan Ellis wrote: > > JE> At least one person is putting in chunks of up to 64MB, so at some > JE> level it "works" but

Re: column sizes (was: online codes (?))

2010-02-03 Thread Jonathan Ellis
At least one person is putting in chunks of up to 64MB, so at some level it "works" but it's not what it's designed for. 2010/2/3 Ted Zlatanov : > On Tue, 2 Feb 2010 23:05:04 -0600 Jonathan Ellis wrote: > > JE> The "atom" in cassandra is a single column

Re: online codes (?)

2010-02-02 Thread Jonathan Ellis
On Tue, Feb 2, 2010 at 9:05 PM, Anthony Di Franco wrote: > Taking the discussion below to the dev list. > > Continuing the discussion, it seems to me that objects in Cassandra > might be quite large from this passage: You've misunderstood. The "atom" in cassandra is a single column. These are al

Re: predicate queries (was: bitmap slices)

2010-02-01 Thread Jonathan Ellis
2010/2/1 Ted Zlatanov : > My list of things I need for predicate queries across column and > supercolumn names: > > - bitmask (OR AND1 AND2 AND3 ...).  This would make my life easier and >  take load off our Cassandra servers.  Currently I have to scan the >  result sets on the client side to find

Re: bitmap slices

2010-02-01 Thread Jonathan Ellis
2010/2/1 Ted Zlatanov : > On Mon, 1 Feb 2010 10:41:28 -0600 Jonathan Ellis wrote: > > JE> I don't think this is very useful for column names.  I could see it > JE> being useful for values but if we're going to add predicate queries > JE> then I'd rather do

Re: bitmap slices

2010-02-01 Thread Jonathan Ellis
I don't think this is very useful for column names. I could see it being useful for values but if we're going to add predicate queries then I'd rather do something more general. 2010/2/1 Ted Zlatanov : > On Mon, 1 Feb 2010 09:42:16 -0600 Jonathan Ellis wrote: > > J

Re: bitmap slices

2010-02-01 Thread Jonathan Ellis
how would this be different then the byte[] column name you can already match on? 2010/2/1 Ted Zlatanov : > On Fri, 29 Jan 2010 15:07:01 -0600 Ted Zlatanov wrote: > > TZ> On Fri, 29 Jan 2010 12:06:28 -0600 Jonathan Ellis > wrote: > JE> On Fri, Jan 29, 2010 at 9:09 AM,

Re: Understand how to provision nodes and use cassandra in the production

2010-01-30 Thread Jonathan Ellis
llocated is 3G. > What's a suggested KeysCachedFraction value? > > Suhail > > On Sat, Jan 30, 2010 at 5:58 AM, Jonathan Ellis wrote: > >> the thing that will help most in 0.5 is to increase your >> KeysCachedFraction to 0.2 or even more, depending on your workload. >> >&g

Re: Understand how to provision nodes and use cassandra in the production

2010-01-30 Thread Jonathan Ellis
the thing that will help most in 0.5 is to increase your KeysCachedFraction to 0.2 or even more, depending on your workload. On Sat, Jan 30, 2010 at 5:23 AM, Suhail Doshi wrote: > An issue I've been seeing is it's really hard to scale Cassandra with reads. > I've run top, vmstat, iostat. vmstat s

Re: Is this possible with cassandra

2010-01-29 Thread Jonathan Ellis
On Fri, Jan 29, 2010 at 9:09 AM, Mehar Chaitanya wrote: >   1. This would lead to enourmous amount of duplication of data, in short >   if I now want to view the data from IS_PUBLISHED dimenstion then my database >   size would scale up tremendously. Yes. But disk space is so cheap it's worth us

Re: Is this possible with cassandra

2010-01-29 Thread Jonathan Ellis
Cassandra does not support ad-hoc queries the way SQL does. If you want to ask "what rows have a column X containing value Y" then you need to create a columnfamily whose keys are the values of X, and whose columns are the keys of your original CF. Read http://arin.me/blog/wtf-is-a-supercolumn-ca

Re: Understand how to provision nodes and use cassandra in the production

2010-01-28 Thread Jonathan Ellis
On Thu, Jan 28, 2010 at 9:23 PM, Suhail Doshi wrote: > We've started to use Cassandra in production and just have one node right > now. Here's one of our ColumnFamilys: > > > 16G Jan 28 22:28 SomeIndex-5467-Index.db > 196M Jan 28 22:32 SomeIndex-5487-Index.db > > The first bottle neck you encounte

Re: How to write insert query in Cassandra

2010-01-28 Thread Jonathan Ellis
i believe cassandra_browser in contrib/ can do inserts with a gui, but it's nowhere near as mature as what you would see for mysql. you will also want to read http://wiki.apache.org/cassandra/API and http://wiki.apache.org/cassandra/ClientExamples and probably http://wiki.apache.org/cassandra/Thri

Re: EOFException after upgrading to 0.5.0

2010-01-28 Thread Jonathan Ellis
please read NEWS.txt, both of your problems are covered there (flush your commitlog, and don't mix 0.4 and 0.5 nodes in the same cluster) On Thu, Jan 28, 2010 at 6:44 AM, B R wrote: > Hi guys, > > We are in the process of upgrading from Cassandra 0.4.2 to 0.5.0 The first > issue I faced was : >

Re: thinking about dropping hinted handoff

2010-01-27 Thread Jonathan Ellis
On Wed, Jan 27, 2010 at 1:48 PM, Stu Hood wrote: >> The HH code currently tries to send the hints to nodes other than the >> natural endpoints. If small-scale performance is a problem, we could >> make the natural endpoints be responsible for the hints. This reduces >> durability a bit, but might

Re: thinking about dropping hinted handoff

2010-01-27 Thread Jonathan Ellis
On Wed, Jan 27, 2010 at 1:12 PM, Ryan King wrote: > On Wed, Jan 27, 2010 at 8:34 AM, Jonathan Ellis wrote: >> While being able to write (with CL.ZERO or new-in-0.6 ANY) even if all >> the real write targets are down is cool, but since your goal in real >> life is to keep

Re: Google SoC

2010-01-27 Thread Jonathan Ellis
I hadn't thought about that, but it's a great idea. I imagine the ASF will be a qualified organization once again with no further work necessary on our part in that area, so all we'd need to do would be come up with projects of appropriate scope. Any ideas there? On Wed, Jan 27, 2010 at 10:51 AM

Re: cassandra : How to handle joins

2010-01-27 Thread Jonathan Ellis
Have you read http://arin.me/code/wtf-is-a-supercolumn-cassandra-data-model ? On Wed, Jan 27, 2010 at 10:29 AM, Mehar Chaitanya wrote: > Hi Jonathan > > Thanks for ur reply > > I was wrong in my last posting asking about replication of changes. > > I want know actually how the cassandra works on

thinking about dropping hinted handoff

2010-01-27 Thread Jonathan Ellis
While being able to write (with CL.ZERO or new-in-0.6 ANY) even if all the real write targets are down is cool, but since your goal in real life is to keep enough replicas alive that you can actually do reads, I'm not sure how useful it is. HH also has a measurable performance problem in small clu

Re: Hint storage format.

2010-01-27 Thread Jonathan Ellis
On Wed, Jan 27, 2010 at 10:03 AM, Gary Dusbabek wrote: > The context of this discussion comes from CASSANDRA-293. > > Since it relies on keys, current hinted handoff scheme isn't going to > work for when a range-remove operation needs to be hinted for a downed > node. The idea I'm playing with no

Re: Help regarding cassandra

2010-01-27 Thread Jonathan Ellis
Cassandra supports clusters spanning multiple data centers (see RackAwareStrategy and contrib/property_snitch), but not replication between distinct clusters. On Wed, Jan 27, 2010 at 9:36 AM, Mehar Chaitanya wrote: > Hi All > > I was done with installing cassandra and inserting some data into Key

Re: error in apache-cassandra-incubating-0.5.0-src.tar.gz

2010-01-25 Thread Jonathan Ellis
1, 2: this is because you need to run ant to generate the thrift code 3: this is a warning, not an error 2010/1/25 Lu Ming : > I downloaded apache-cassandra-incubating-0.5.0-src.tar.gz and imported source > files into Eclipse. > and find three errors; > > 1)in org.apache.cassandra.cli.CliClient:

Re: [VOTE] Release 0.5.0 (final)

2010-01-18 Thread Jonathan Ellis
No, but we will definitely take a look at it for 0.5.1. 0.5.0 will not be perfect but it is a huge improvement over 0.4.2, which people are still using because that's the official "stable" release. We need to fix that. :) -Jonathan On Mon, Jan 18, 2010 at 6:39 PM, Ryan Daum wrote: > Any chance

Re: [VOTE] Release 0.5.0 (final)

2010-01-18 Thread Jonathan Ellis
+1 On Mon, Jan 18, 2010 at 2:28 PM, Eric Evans wrote: > > There have been a few changes[1] in the 0.5 branch since RC3. In a perfect > world, we'd probably push those into another release candidate, but I > feel pretty good about this one, and any remaining issues can always > be added to 0.5.1.

Re: the release after 0.5

2010-01-18 Thread Jonathan Ellis
I've moved the already-committed-as-0.9 issues to 0.6, and created a new 0.7 version for tickets that do not fit the goal of a quick release fully compatible with 0.5. -Jonathan On Fri, Jan 8, 2010 at 2:06 PM, Jonathan Ellis wrote: > In the month since 0.5 was branched, we've alre

Re: multiget_slice

2010-01-14 Thread Jonathan Ellis
    32   2557    0    0    71    31    5    5  1  0 94 >  3 > > On Thu, Jan 14, 2010 at 10:11 AM, Jonathan Ellis wrote: > >> how much data do you have on disk?  (only on enode?)  how large are >> the columns you are reading?  how much ram does vmstat say is being >> use

Re: multiget_slice

2010-01-14 Thread Jonathan Ellis
e 64 keys (max) with 200 columns per key worst case. > > Suhail > > On Thu, Jan 14, 2010 at 9:45 AM, Jonathan Ellis wrote: > >> how many keys are you fetching?  how many columns for each key? >> >> On Thu, Jan 14, 2010 at 1:49 AM, Suhail Doshi wrote: >> > I&#

Re: multiget_slice

2010-01-14 Thread Jonathan Ellis
how many keys are you fetching? how many columns for each key? On Thu, Jan 14, 2010 at 1:49 AM, Suhail Doshi wrote: > I've been seeing multiget_slice take an extremely long time: > > 2010-01-14 07:44:00,513 INFO -- Cassandra, delay: > 3.64020800591 --- > 2010-

Re: release policy

2010-01-11 Thread Jonathan Ellis
On Mon, Jan 11, 2010 at 12:03 PM, Ryan King wrote: > Both of the above are fine. I think we could even tolerate having to > run an upgrade tool with a node, as long as we can do it one at a time > and as long as... > >>  (3) network compatibility (from one cluster node to another): may >> change.

the release after 0.5

2010-01-08 Thread Jonathan Ellis
In the month since 0.5 was branched, we've already made some significant progress, particularly in performance. I can't find a way to easily link the full list in Jira, but these include 408+669 (mmapping sstables for reads instead of using buffered I/O): ~50% speed improvement 658 (better wr

release policy

2010-01-08 Thread Jonathan Ellis
I think we have enough people using Cassandra in production now that it would be useful to be explicit about the kinds of changes we will make between major and minor releases. Here is one possibility: Minor releases (e.g. 0.4.0 -> 0.4.1): minor releases will contain bug fixes only -- no new func

Re: API versioning

2010-01-06 Thread Jonathan Ellis
On Wed, Jan 6, 2010 at 2:47 PM, Eric Evans wrote: > On Wed, 2010-01-06 at 14:29 -0600, Jonathan Ellis wrote: >> The 0.5 api is a superset of the 0.4 one in method names and >> arguments, but the exceptions declared are different, so client code >> in compiled languages wi

Re: API versioning

2010-01-06 Thread Jonathan Ellis
+1, the release version is only tenously related to the API version and tracking the latter separately would be much more useful to clients for the reasons you gave. One question: do we need a 3-tuple? The 0.5 api is a superset of the 0.4 one in method names and arguments, but the exceptions decl

Re: [VOTE] Release 0.5.0-rc3

2010-01-06 Thread Jonathan Ellis
+1 On Wed, Jan 6, 2010 at 9:49 AM, Eric Evans wrote: > > Ok let's try this again, this time with the actual fix for #663. :) > > I propose the following tag and artifacts for 0.5.0-rc3: > > SVN Tag: > https://svn.apache.org/repos/asf/incubator/cassandra/tags/cassandra-0.5.0-rc3 > 0.5.0-rc3 artifa

Re: [VOTE] Release 0.5.0-rc2

2010-01-05 Thread Jonathan Ellis
oops, -1: I committed the wrong version of the fix for 663. Correct fix has been committed to 0.5 branch now. -Jonathan On Tue, Jan 5, 2010 at 4:22 PM, Jonathan Ellis wrote: > +1 > > On Tue, Jan 5, 2010 at 4:07 PM, Eric Evans wrote: >> >> There were some issues in rc1 t

Re: Chair

2010-01-05 Thread Jonathan Ellis
I can volunteer. On Tue, Jan 5, 2010 at 4:27 PM, Eric Evans wrote: > > When we graduate, we need to have someone who will act as the Chair of > the Project Management Committee, or "Chair". The Chair reports to the > ASF Board on behalf of the project, maintains information on the PMC > dispositi

Re: [VOTE] Release 0.5.0-rc2

2010-01-05 Thread Jonathan Ellis
+1 On Tue, Jan 5, 2010 at 4:07 PM, Eric Evans wrote: > > There were some issues in rc1 that warrant us taking another stab at > this, (see [1]). > > I propose the following tag and artifacts for 0.5.0-rc2: > > SVN Tag: > https://svn.apache.org/repos/asf/incubator/cassandra/tags/cassandra-0.5.0-rc

Re: Memtable Performance Problem

2009-12-30 Thread Jonathan Ellis
I went one better, all the way to column-level synchronization. :) Patches attached to https://issues.apache.org/jira/browse/CASSANDRA-658. review / testing appreciated. -Jonathan On Fri, Dec 11, 2009 at 3:53 AM, Stu Hood wrote: > Seems like we could just synchronize on the 'oldCf' object, sinc

Welcome new committers Gary Dusbabek and Jaakko Laine!

2009-12-30 Thread Jonathan Ellis
Thanks for the help, guys! -Jonathan

Re: [jira] Updated: (CASSANDRA-651) cassandra 0.5 version throttles and sometimes kills traffic to a node if you restart it.

2009-12-28 Thread Jonathan Ellis
nel was > reset , > after 10.237.1.135 come back, these socket channel remain closed, forever > -END-- > > > -Original Message- > From: Jonathan Ellis (JIRA) [mailto:j...@apache.org] > Sent: Thursday, December 24, 2009 10:47 AM > To

Re: [VOTE] Release 0.5.0-rc1

2009-12-23 Thread Jonathan Ellis
at 1:43 PM, Chris Goffinet > wrote: >> +1 >> >> On Wed, Dec 23, 2009 at 4:36 PM, Jonathan Ellis wrote: >> >>> +1 >>> >>> On Wed, Dec 23, 2009 at 3:28 PM, Eric Evans wrote: >>> > >>> > All of the 0.5 showstoppers are

Re: [VOTE] Release 0.5.0-rc1

2009-12-23 Thread Jonathan Ellis
+1 On Wed, Dec 23, 2009 at 3:28 PM, Eric Evans wrote: > > All of the 0.5 showstoppers are out of the way and things are looking > pretty solid. Shall we push out a release candidate? > > I propose the following tag and artifacts for 0.5.0-rc1 > > SVN Tag: > https://svn.apache.org/repos/asf/incuba

Re: build fails with "ant clean gen-thrift-java build"

2009-12-22 Thread Jonathan Ellis
2009/12/22 Ted Zlatanov : > But would you (as Gary IIRC mentioned > earlier) prefer the old constructors back instead to minimize changes to > Cassandra? In a perfect world, Thrift wouldn't go breaking stuff that wasn't causing problems, or if they did they would admit it and roll things back. It

Re: build fails with "ant clean gen-thrift-java build"

2009-12-22 Thread Jonathan Ellis
2009/12/22 Ted Zlatanov : > Looks like this is not getting changed and Cassandra must cope with > Thrift's new constructors instead.  Will the updated code make it into > SVN so I can do my auth patch against it? As soon as such a patch is contributed, sure. -Jonathan

Re: Small HintedHanoffManager improvement?

2009-12-22 Thread Jonathan Ellis
On Mon, Dec 21, 2009 at 12:46 PM, Ramzi Rabah wrote: > It seems that we still send a row mutation even if the cf of the row > is null. It should never happen, and if it does it's harmless, so adding a special case for it is counterproductive. > On a different note, I have a few questions about H

Re: Memtable Performance Problem

2009-12-10 Thread Jonathan Ellis
even with CHM, resolve is not threadsafe and needs to be synchronized. removing the synchronized could cause data loss. don't do that. :) On Thu, Dec 10, 2009 at 9:42 PM, 张洁 wrote: > When I was doing to write stress test,I found that the throughput is not > stable,from 200request/s to 12000requ

Re: dead code: org.apache.cassandra.net.sink.SinkManager

2009-12-09 Thread Jonathan Ellis
The authors of that class said (in a presentation?) that it's used to introduce artificial errors for testing. So I'm in no hurry to delete it. On Wed, Dec 9, 2009 at 8:53 PM, Kelvin Kakugawa wrote: > I've been looking around the codebase, and I was wondering if these > classes are dead code: >

Re: [VOTE] Release 0.5.0-beta2

2009-12-09 Thread Jonathan Ellis
+1

Re: Questions about weak reads

2009-12-09 Thread Jonathan Ellis
On Wed, Dec 9, 2009 at 11:01 AM, Sylvain Lebresne wrote: > And wouldn't it be possible/reasonable to do something like a strong read, but > with a modified quorumResponseHandler that return from get() as soon as it > gets an answer and do the responseResolver/read repair in the background. > Would

Re: Questions about weak reads

2009-12-09 Thread Jonathan Ellis
On Wed, Dec 9, 2009 at 10:33 AM, Sylvain Lebresne wrote: > Well, I just checkout from svn > (svn checkout https://svn.apache.org/repos/asf/incubator/cassandra/trunk > cassandra) Thanks, updated comments. > In any case, at least for readRemote, why when the "suitableEndpoint" > timeout another >

Re: Questions about weak reads

2009-12-09 Thread Jonathan Ellis
On Wed, Dec 9, 2009 at 9:37 AM, Sylvain Lebresne wrote: > But otherwise, the discrepancy between code and comments suggests that the > code was changed. If so, what was the rational behind the change ? I'm guessing you're reading the 0.4 source? This has been cleaned up in trunk. At least I'm p

Re: what are utils.Fast* for?

2009-12-05 Thread Jonathan Ellis
removed On Sat, Dec 5, 2009 at 1:55 PM, gabriele renzi wrote: > On Sat, Dec 5, 2009 at 3:23 PM, Jonathan Ellis wrote: >> Dead code.  Deleted. > > thanks, I love when this happens :) > I believe the same could be done for PrimeFinder.java > >> BTW, if you're

Re: what are utils.Fast* for?

2009-12-05 Thread Jonathan Ellis
Dead code. Deleted. BTW, if you're looking through the codebase you should probably start here: http://wiki.apache.org/cassandra/ArchitectureInternals On Sat, Dec 5, 2009 at 5:14 AM, gabriele renzi wrote: > Hi everyone, > > while trying to get a grasp of the codebase in cassandra, I run > FindB

  1   2   3   4   >