Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
I've been asked to change the download url to http://people.apache.org/%7Ejbellis/cassandra/cassandra-0.3-rc.tgz to avoid incorrectly implying that this is An Official Release which it is not. -Jonathan

Re: Node Recovery

2009-05-18 Thread Jonathan Ellis
That's the price you pay for (a) eventual consistency in general and (b) doing read repair in the background specifically. Cassandra also has functionality (called strong read) to do a quorum read in the foreground and repair if necessary but that is not exposed in Thrift yet -- but even with

Re: Ingesting from Hadoop to Cassandra

2009-05-27 Thread Jonathan Ellis
On Wed, May 27, 2009 at 6:39 PM, Alexandre Linares lina...@ymail.com wrote: So it actually doesn't look blocked, but it's crawling.  Of course, in Hadoop, it always timed out (10 mins), before I could tell that it was crawling (I think) So, back to the original hypothesis: you need to increase

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
We're basically in a roll-your-own benchmark state. Johan can probably give some pointers: http://blog.oskarsson.nu/2009/05/vpork.html. Also see the how fast is it section here: http://spyced.blogspot.com/2009/05/cassandra-03-release-candidate-and.html -Jonathan On Wed, Jun 3, 2009 at 3:06 AM,

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
and the write path will never hit disk. On Wed, Jun 3, 2009 at 5:48 PM, Jonathan Ellis jbel...@gmail.com wrote: Cassandra is not designed to work memory-only.  It's designed to use disk for durability and to accommodate using large sets of data, letting the OS use memory as a huge cache

Re: questions about operations

2009-06-04 Thread Jonathan Ellis
On Thu, Jun 4, 2009 at 10:01 AM, Thorsten von Eicken t...@rightscale.com wrote: Ah, got it, I forgot about the time-sorted CFs. So does this mean that if I call get_columns_since on a name-sorted CF I will get an invalid request exception? And also if I call get_slice_by_name_range or

Re: New table and column families

2009-06-23 Thread Jonathan Ellis
you'll need to (a) make sure you have the latest trunk (b) wipe your data, commitlog, and system directories, since adding new tables or columnfamilies non-destructively is not yet supported (see https://issues.apache.org/jira/browse/CASSANDRA-44) -Jonathan On Tue, Jun 23, 2009 at 8:55 AM,

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
cassandra Thanks a lot On Thu, Jul 9, 2009 at 10:27 AM, Jonathan Ellis jbel...@gmail.com wrote: what version are you trying to run?  on what platform? On Thu, Jul 9, 2009 at 12:04 PM, mobiledream...@gmail.com wrote: I did set it up as the readme file instructed but i encountered this error

Re: How to answer queries of form Give me the top 10 messages

2009-07-10 Thread Jonathan Ellis
Have you read this? http://blog.evanweaver.com/articles/2009/07/06/up-and-running-with-cassandra/ On Fri, Jul 10, 2009 at 4:43 PM, mobiledream...@gmail.com wrote: Hey guys how do we answer queries of type - give me the top 10 messages or top 10 users and so on thanks Example: SuperColumns

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
Cassandra is replaying the transaction log and preloading SSTable indexes. This is normal. On Mon, Jul 13, 2009 at 8:10 AM, rkmr...@gmail.comrkmr...@gmail.com wrote: when i stop cassandra and start it again, this is what is printed. it takes just a couple of seconds for this to run. and after

Re: WARN - Unable to find a live Endpoint we might be out of live nodes , This is dangerous !!!!

2009-07-17 Thread Jonathan Ellis
programs # that require network functionality will fail. 127.0.0.1 localhost.localdomain localhost localhost ::1 localhost6.localdomain6 localhost6 On Fri, Jul 17, 2009 at 6:09 AM, Jonathan Ellis jbel...@gmail.com wrote: Please don't repeat your question separately on -user, -dev, and irc

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
thought Hinted handoff will take care of this Right? Write will never fail instead it will write to another node right? correct me if i am wrong. Thanks and Regards, /VJ On Tue, Jul 14, 2009 at 7:26 AM, Jonathan Ellis jbel...@gmail.com wrote: N: guarantees consistent reads without having

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
availability.) -Jonathan During query it will fail if i only have block for to be 3? Regards, /VJ On Fri, Jul 17, 2009 at 1:36 PM, Jonathan Ellis jbel...@gmail.com wrote: ck_for to zero, then writes will indeed never fail (unless the node the client is ta

Re: thrift.Thrift.TApplicationException: Internal error processing insert

2009-07-19 Thread Jonathan Ellis
That should be partially solved in trunk now that 139 is committed, and more solved when we commit 185 soon. On Sun, Jul 19, 2009 at 3:43 AM, mobiledream...@gmail.com wrote: Any utf-8 keyword causes cassandra to crash!

Re: how to delete an entire column family

2009-07-19 Thread Jonathan Ellis
iterate through the keys with get_key_range, and delete the row associated with each key On Sun, Jul 19, 2009 at 3:51 AM, mobiledream...@gmail.com wrote: In Super-column family Super1 there is a column family Related How do i delete the entire related column family thanks
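A minimal sketch of the "iterate through the keys and delete each row" approach, in Python. `FakeStore`, `key_range`, and `remove_row` are hypothetical in-memory stand-ins, not the Thrift client API; check interface/cassandra.thrift for the real `get_key_range` and remove calls and their arguments.

```python
class FakeStore:
    """Hypothetical in-memory stand-in for one column family (not the Thrift client)."""

    def __init__(self, rows):
        self.rows = dict(rows)

    def key_range(self, start, count):
        """Return up to `count` keys >= start, in sorted order."""
        return [k for k in sorted(self.rows) if k >= start][:count]

    def remove_row(self, key):
        """Delete the row for `key`; return True if a row was actually removed."""
        return self.rows.pop(key, None) is not None


def delete_all_rows(store, page_size=100):
    """Page through the keys and delete the row behind each one."""
    deleted = 0
    start = ""
    while True:
        keys = store.key_range(start, page_size)
        for key in keys:
            if store.remove_row(key):
                deleted += 1
        if len(keys) < page_size:
            # A short page means the key range is exhausted.
            return deleted
        # Resume from the last key seen rather than restarting from the beginning.
        start = keys[-1]
```

Paging keeps each request small; the loop ends when a page comes back shorter than `page_size`.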

Re: python thrift cassandra: get_slice_super vs get_slice_super_by_names

2009-07-19 Thread Jonathan Ellis
, Jonathan Ellis jbel...@gmail.com wrote: I would guess because kw != 'tofu' On Sun, Jul 19, 2009 at 12:24 AM, mobiledream...@gmail.com wrote: Why doesnt res return ColumnFamily Related whereas res2 works just fine thanks? timestamp = time.time() res = client.get_slice_super('Table1', kw

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
Don't run trunk if you're not going to read svn log. The api changed with the commit of the 139 patches (and it will change again with the 185 ones). look at interface/cassandra.thrift to see what arguments are expected. On Sun, Jul 19, 2009 at 3:31 PM, mobiledream...@gmail.com wrote: Hey

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
It works fine, it's just not the same as it was two weeks ago. On Sun, Jul 19, 2009 at 3:36 PM, mobiledream...@gmail.com wrote: ok so which is the version where cassandra python thrift works out of the box thanks On 7/19/09, Jonathan Ellis jbel...@gmail.com wrote: Don't run trunk if you're

Re: AttributeError: 'str' object has no attribute 'write'

2009-07-19 Thread Jonathan Ellis
Building the java interface is part of the build, but ant has no way to guess which additional client interfaces you want to use, if any. On Sun, Jul 19, 2009 at 6:46 PM, Ian Holsmani...@holsman.net wrote: hi Gasol. shouldn't regeneration of the interface be part of the build process? On

Fwd: thrift API changes

2009-07-20 Thread Jonathan Ellis
Oops, I sent this to the old google -user list by mistake the first time. Now that that's gone, I realized the error. -- Forwarded message -- From: Jonathan Ellis jbel...@gmail.com Date: Mon, Jul 20, 2009 at 10:10 PM Subject: Re: thrift API changes To: cassandra-u

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
did you read the new section in the config xml explaining how to use a UTF8 comparator? also: thrift itself is just plain broken for unicode support in some languages; see THRIFT-395 I think the short version is that when you have a java server, unicode will work with java or C# clients but not

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, mobiledream...@gmail.com wrote: thanks jonathan trying this <ColumnFamily ColumnType="Super" CompareWith="UTF8Type" Name="Super1"/> On Tue, Jul 21, 2009 at 2:24 PM, Jonathan Ellis jbel...@gmail.com wrote: On Tue

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
clarify if this will guarantee proper python thrift utf8 behavior thanks On Tue, Jul 21, 2009 at 2:29 PM, Jonathan Ellis jbel...@gmail.com wrote: you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, mobiledream...@gmail.com wrote: thanks jonathan trying

Re: how to deploy cassadnra 0.30 in Running on multiple nodes

2009-07-24 Thread Jonathan Ellis
because nodeprobe is in 0.4, not 0.3 On Fri, Jul 24, 2009 at 2:26 AM, 李楠qingnan...@gmail.com wrote: i deploy in storage-conf.xml Seeds add <Seed>192.1.12.149</Seed> use bin/nodeprobe -host X ring where X is each machine in your cluster to make sure all the nodes see each other why do not i find

Re: ColumnFamily config options

2009-08-01 Thread Jonathan Ellis
On Sat, Aug 1, 2009 at 9:04 AM, Ivan Changivan.ch...@medigy.com wrote: Could someone point me to resources/examples how to use the config options below: 1) Extending index at run time using the Index node <ColumnFamily name="foo"> <Index generator="some java class"/> </ColumnFamily> Index

Re: [Fwd: Re: Greetings!]

2009-08-01 Thread Jonathan Ellis
On Fri, Jul 31, 2009 at 5:42 PM, Colin Mollenhourco...@mollenhour.com wrote: This reply keeps getting blocked as spam so I am just sending to you directly.. Jonathan, thank you very much for the excellent response. If I may, a few more questions (inline): One caveat is that the subcolumns

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 10:49 AM, Jun Raojun...@almaden.ibm.com wrote: Ivan, The original cassandra keeps multiple versions of the column data. No, it didn't. (It had versioning-related bugs but multiple versions a la Bigtable was never part of the design.) -Jonathan

Re: MVCC

2009-08-03 Thread Jonathan Ellis
cassandra had free multiple versions and we needed to manually delete the older versions On Mon, Aug 3, 2009 at 8:56 AM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Aug 3, 2009 at 10:49 AM, Jun Raojun...@almaden.ibm.com wrote: Ivan, The original cassandra keeps multiple versions

Re: MVCC

2009-08-03 Thread Jonathan Ellis
Strictly speaking, no; timestamp is client-provided. But in the sense that you'd better use ntpd on your clients, yes. On Mon, Aug 3, 2009 at 12:10 PM, Wilson Marwilson...@gmail.com wrote: So if different servers are not synchronized in time (to a Tier 1 time server), then updates from slower
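To illustrate why client clocks matter when the timestamp is client-provided, here is a small Python sketch of highest-timestamp-wins reconciliation. The `(timestamp, value)` tuple layout and the tie-breaking rule are assumptions made for the example, not Cassandra's actual code.

```python
def resolve(a, b):
    """Pick the winning version of a column.

    Each version is a (timestamp, value) tuple. The higher client-supplied
    timestamp wins; on a tie this sketch keeps `a` (an arbitrary choice).
    """
    return a if a[0] >= b[0] else b


# A client whose clock runs slow writes *later* in wall-clock time,
# but stamps its write with a smaller timestamp, so its update loses.
from_good_clock = (100, "old value")
from_slow_clock = (90, "newer value, stamped by a slow clock")
winner = resolve(from_good_clock, from_slow_clock)
```

Here `winner` is the older value, which is the situation running ntpd on the clients is meant to avoid.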

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 12:12 PM, Mark McBridemark.mcbr...@gmail.com wrote: Thanks, that makes sense.  Is it an ok general rule that the timestamps should be set to 1) The time that the data to be mutated was generated 2) The current system time if the time the data was mutated isn't available

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 3:39 PM, Ivan Changivan.ch...@medigy.com wrote: Is this going to be an inherent limitation of Cassandra? If someone writes a patch that adds multi-version support without compromising single-version performance then I don't see any reasons to turn it down. -Jonathan

Re: .NET client example

2009-08-04 Thread Jonathan Ellis
It should be fairly easy to port the Java example on http://wiki.apache.org/cassandra/ClientExamples to C#. -Jonathan On Tue, Aug 4, 2009 at 2:55 PM, Robinson, Michaelmichael.robins...@sage.com wrote: Does anyone have or know of a C#/.NET port of the example Cassandra client code?  Since I’m

heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Hi guys, We're trying to get the data format incompatibilities done with before we put out a 0.4 beta. There's 3 coming up. In order of when they are likely to be finished: https://issues.apache.org/jira/browse/CASSANDRA-341 -- to fix bugs (today)

Re: heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Re the index bugs: this is referring to column indexes, not key indexes. (The former are kept in the SSTable; the latter are in a separate file. Don't delete your -Index.db files. :) And a correction: 341 is the only one that the compaction process will fix, not 332 as well. -Jonathan On Tue,

default OrderPreservingPartitioner changed

2009-08-07 Thread Jonathan Ellis
The default OPP now does comparisons based strictly on byte order, and is no longer collation aware. This is a better default choice for those who don't need collation since it's much faster. If you do need collation, the old partitioner is still available as CollatingOPP: -
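The difference between byte order and collation-aware order can be seen with plain strings. This Python sketch is only an analogy: `str.casefold` is a crude stand-in for real collation, and none of it is Cassandra code.

```python
def byte_order(keys):
    """Sort keys by their raw UTF-8 bytes."""
    return sorted(keys, key=lambda k: k.encode("utf-8"))


def collated(keys):
    """Sort keys case-insensitively, as a rough stand-in for collation."""
    return sorted(keys, key=str.casefold)


keys = ["apple", "Banana", "cherry"]
by_bytes = byte_order(keys)     # uppercase 'B' (0x42) sorts before lowercase 'a' (0x61)
by_collation = collated(keys)   # alphabetical regardless of case
```

Byte comparison needs no locale rules, which is where the speed advantage comes from; the cost is that range scans follow byte order rather than dictionary order.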

Re: new thrift API

2009-08-08 Thread Jonathan Ellis
Stabilizing but not quite finished (329 and 311 are still waiting for review). But the fundamentals are the same. You have start/finish because that's what defines a range. You have count because you often want the First N results. -Jonathan On Sat, Aug 8, 2009 at 8:33 AM, Jonas

Re: new thrift API

2009-08-08 Thread Jonathan Ellis
a numeric offset since that can't be supported efficiently with a log-structured merge disk format. On Sat, Aug 8, 2009 at 8:57 AM, Jonas Bonérjo...@jonasboner.com wrote: 2009/8/8 Jonathan Ellis jbel...@gmail.com: (either of start and finish may be empty) Thanks for fast reply. So count

Re: Valid consistency level values on trunk

2009-08-11 Thread Jonathan Ellis
Thanks for looking into it, though. On Tue, Aug 11, 2009 at 10:59 AM, Mark McBridemark.mcbr...@gmail.com wrote: My shot at hero status has been thwarted, due to a preexisting Jira issue http://issues.apache.org/jira/browse/THRIFT-551 On Mon, Aug 10, 2009 at 5:49 PM, Mark

Re: New thrift api question.

2009-08-12 Thread Jonathan Ellis
abandon the old design? how to build 'one to many' relationship in a effective way with new api? what data model should i use? thank? 2009/8/13 Jonathan Ellis jbel...@gmail.com The easy way is to just show one page of results and have a Next button that uses start=last one from current page
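A sketch of the "Next button uses start=last one from current page" idea in Python. `get_page` is a hypothetical stand-in for a slice call over sorted, unique column names with an inclusive `start` and a `count`; it is not the Thrift API.

```python
def get_page(columns, start, count):
    """Return up to `count` names >= start from a sorted list ("" means from the beginning)."""
    return [c for c in columns if c >= start][:count]


def iter_pages(columns, page_size):
    """Yield successive pages, each one starting from the last name of the previous page."""
    page = get_page(columns, "", page_size)
    while page:
        yield page
        # `start` is inclusive, so ask for one extra item and drop the repeated first one.
        following = get_page(columns, page[-1], page_size + 1)
        page = following[1:]


names = ["a", "b", "c", "d", "e"]
all_pages = list(iter_pages(names, 2))
```

Only the last name shown needs to be remembered between requests, so no numeric offset is required.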

Re: Visual representation of Cassandra data model

2009-08-12 Thread Jonathan Ellis
Thanks for taking a stab at this, Mark. I'm not a fan of teaching this by showing CF-spanning rows. (The bigtable paper does this IIRC but it's wrong. :) You can have data in different CFs with the same key, yes, but all that means is they will be stored on the same nodes. Each CF is stored

Re: Cassandra performance

2009-08-17 Thread Jonathan Ellis
What happened about 20h in to make the latency drop so dramatically? On Mon, Aug 17, 2009 at 12:14 PM, Huming Wuhuming...@gmail.com wrote: I did some performance test and I am not impressed :). The data set is 880K unique keys and there are 4 columns with 2 columns being string and the other 2

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: It produces following error messages $VAR1 = 'Can\'t use string (0) as a SCALAR ref while strict refs in use at /usr/lib/perl5/site_perl/5.8.8/Thrift/BinaryProtocol.pm line 376.' If it's objecting to the 0 in reversed=0, it sounds like a perl specific

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: If it's objecting to the 0 in reversed=0, it sounds like a perl specific problem -- why would it be turning that into a string? It seems to me that it isn't connected to reversed. I changed to 1 and nothing was changed. Dunno, then. Bug in the

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: Dunno, then.  Bug in the generated perl code?  Wouldn't be the first time. Interesting, if column_names is added with known status ids then Cassandra returns them although it doesn't take into account reversed and count options. it's not supposed to.

Re: Anybody experience one Cassandra server locking up?

2009-08-19 Thread Jonathan Ellis
sounds like you are exhausting the memory on that instance and it is going into GC swap trying to free enough to continue. this is very easy to do on 0.3 -- try upgrading to the 0.4 beta if you are using 0.3. On Tue, Aug 18, 2009 at 3:36 PM, Brian Frank Coopercoop...@yahoo-inc.com wrote: Hi

Re: quorum read timeout

2009-08-19 Thread Jonathan Ellis
Is this 0.3 or 0.4/trunk? On Wed, Aug 19, 2009 at 2:36 PM, Phillip Michalakphil.micha...@digitalreasoning.com wrote: I'm running three Cassandra nodes in virtual machines. During a 'get' operation from Cassandra-remote directed at one of these nodes, I'm receiving the following output

Re: quorum read timeout

2009-08-19 Thread Jonathan Ellis
Looks like a bug in TcpConnectionManager. Can you file a ticket? thanks, -Jonathan On Wed, Aug 19, 2009 at 2:49 PM, Phillip Michalakphil.micha...@digitalreasoning.com wrote: It's cassandra-0.4-beta1. Thanks! Phil On Aug 19, 2009, at 4:43 PM, Jonathan Ellis wrote: Is this 0.3 or 0.4

Re: Cassandra performance

2009-08-19 Thread Jonathan Ellis
be careful when profiling blocking io -- I bet that means that I'm spending all my time blocking for more data to read since there is only one call per second. the internal Cassandra MessagingService uses nonblocking io, but the Thrift stuff is just your standard thread pool with blocking

Re: Server cannot startup after shutdown

2009-08-19 Thread Jonathan Ellis
The malformed input bug was fixed after beta1 and should be in a nightly build by now. (I introduced a regression where it couldn't handle the last entry in the commitlog being incomplete. So upgrading should be able to restart on the existing commitlogs.) The OOM puzzles me a little; I'm not

Re: Server cannot startup after shutdown

2009-08-20 Thread Jonathan Ellis
: Jonathan Ellis [jbel...@gmail.com] Sent: Wednesday, August 19, 2009 7:46 PM To: cassandra-user@incubator.apache.org Subject: Re: Server cannot startup after shutdown The malformed input bug was fixed after beta1 and should be in a nightly build by now.  (I introduced a regression where it couldn't

Re: Cassandra security

2009-08-21 Thread Jonathan Ellis
if your product is jvm based, just use the internal api and don't start the thrift listeners at all. On 8/21/09, Mark McBride mark.mcbr...@gmail.com wrote: I'm looking at the potential of embedding Cassandra in one of our products. This ships as one or more virtual appliances that runs at a

Re: Newbe´s question

2009-08-24 Thread Jonathan Ellis
lazyboy works vs an earlier version of trunk, so it's already incompatible with 0.3, but not yet compatible w/ latest 0.4 :) On Mon, Aug 24, 2009 at 12:28 PM, Bruno Coutobco...@gmail.com wrote: Jonathan, First, thanks for answering so fast. I´m using version 0.3-final of Cassandra, then I

Re: Newbe´s question

2009-08-24 Thread Jonathan Ellis
That's probably the best option at the moment. Once you're familiar with the thrift API I'm sure the lazyboy devs would welcome updates too. -Jonathan On Mon, Aug 24, 2009 at 1:54 PM, Bruno Coutobco...@gmail.com wrote: Thanks for helping me Jonathan! Well, now I know that I can´t use the

Re: Cassandra hardware setup

2009-08-26 Thread Jonathan Ellis
On Tue, Aug 25, 2009 at 7:07 PM, Scott Chaconscha...@gmail.com wrote: We're playing with Cassandra and would like to get a test cluster setup for evaluation.  I've been playing with it on my laptop and EC2, which are the resources easily available to me, but not that close to what I would be

Re: Server cannot startup after shutdown

2009-08-26 Thread Jonathan Ellis
On Wed, Aug 19, 2009 at 9:46 PM, Jonathan Ellisjbel...@gmail.com wrote: The OOM puzzles me a little; I'm not sure how it could be unable to replay a mutation that it was able to write to the commitlog in the first place. Ah, I think I know: if a compaction starts during recovery, that could

Re: Server cannot startup after shutdown

2009-08-26 Thread Jonathan Ellis
On Wed, Aug 26, 2009 at 12:26 PM, Brian Frank Coopercoop...@yahoo-inc.com wrote: Is the commitlog small enough that you can gzip it and attach to JIRA (10 MB limit)? /var/cassandra/commitlog has 215 files totaling about 28 GB. Most are 134 MB, the last one is 6MB. Which one would be useful

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
I thought about that, but I really don't want Cassandra committers to have to be in the business of updating them all when we make changes, and having them in the repo creates that expectation even in contrib. On Wed, Aug 26, 2009 at 6:57 PM, Ian Holsmani...@holsman.net wrote: would it be

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
or svn:link to pull them in from their own repos? (not sure how legal it would be). On Aug 27, 2009, at 10:03 AM, Jonathan Ellis wrote: I thought about that, but I really don't want Cassandra committers to have to be in the business of updating them all when we make changes, and having them

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
...@holsman.net wrote: isn't there a way to use svn:external or svn:link to pull them in from their own repos? (not sure how legal it would be). On Aug 27, 2009, at 10:03 AM, Jonathan Ellis wrote: I thought about that, but I really don't want Cassandra committers to have to be in the business

Re: random n00b question

2009-09-15 Thread Jonathan Ellis
We don't currently have any optimizations to provide lightweight session consistency (see #132), but if you do quorum reads + quorum writes then you are guaranteed to read the most recent write which should be fine for most apps. On Tue, Sep 15, 2009 at 5:30 AM, Eric Bowman ebow...@boboco.ie
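The reason quorum reads plus quorum writes see the most recent write is that any two quorums share at least one replica. A brute-force Python check of that overlap, with replicas modelled as plain integers (an assumption of the sketch):

```python
from itertools import combinations


def quorum(n):
    """Smallest majority of n replicas."""
    return n // 2 + 1


def always_overlap(n, r, w):
    """True if every set of r replicas shares a member with every set of w replicas."""
    replicas = range(n)
    return all(
        set(read_set) & set(write_set)
        for read_set in combinations(replicas, r)
        for write_set in combinations(replicas, w)
    )


n = 3
quorum_is_safe = always_overlap(n, quorum(n), quorum(n))  # R + W > N
single_read_is_safe = always_overlap(n, 1, quorum(n))     # R + W <= N
```

With three replicas, a quorum read always touches a replica that acknowledged the quorum write, while a single-replica read can miss it.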

Re: New Features - Future releases

2009-09-18 Thread Jonathan Ellis
On Fri, Sep 18, 2009 at 9:09 PM, Jonathan Mischo jmis...@quagility.com wrote:        • Multiple data center replication in the background. maybe a multi master type thing It already has this. It was built from the ground up for this. It's highly tolerant to partitioning and has always

Re: Configuring multiple nodes

2009-09-20 Thread Jonathan Ellis
Could you mention this on http://wiki.apache.org/cassandra/RunningCassandra ? On Sun, Sep 20, 2009 at 10:46 PM, Anthony Molinaro antho...@alumni.caltech.edu wrote: One other gotcha I've come across trying to use jmx was on machines with multiple interfaces.  The rmi server doesn't always bind

Re: perfomance issue

2009-09-25 Thread Jonathan Ellis
It's impossible to say given what you have told us. Step zero in benchmarking cassandra is turning the log level to INFO. Step one is testing on a machine where you can put the commitlog directory on its own disk. It's true that frequently cassandra will be slower than custom code writing to

Re: perfomance issue

2009-09-25 Thread Jonathan Ellis
. -Jonathan On Fri, Sep 25, 2009 at 10:41 AM, Kirill A. Korinskiy catap+cassan...@catap.ru wrote: At Fri, 25 Sep 2009 10:32:55 -0500, Jonathan Ellis jbel...@gmail.com wrote: Step zero in benchmarking cassandra is turning the log level to INFO. sure, i'm switching off all log messages Step

Re: commit logs are not deleted

2009-09-25 Thread Jonathan Ellis
This is fixed on the 0.4 branch (but not in trunk, yet) On Fri, Sep 25, 2009 at 1:57 PM, Jonathan Ellis jbel...@gmail.com wrote: https://issues.apache.org/jira/browse/CASSANDRA-455 will address FlushPeriod not working. On Fri, Sep 25, 2009 at 1:33 PM, Igor Katkov ikat...@gmail.com wrote: I

Re: read repair keeps occurring on every quorum read

2009-09-25 Thread Jonathan Ellis
No, you're mixing two related concepts. When you do a quorum read it will fetch the actual data from one replica and do digest reads from the others. If the data from the one does not match the hash from the others, then you have the digestmismatchexception Edmond is seeing and read repair is
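A rough Python sketch of the data-plus-digest comparison described here. MD5 as the hash and the function names are assumptions of the sketch, not a description of Cassandra's internals.

```python
import hashlib


def digest(value: bytes) -> str:
    """Hash a replica's value so that only the hash has to be compared."""
    return hashlib.md5(value).hexdigest()


def quorum_read(data_value: bytes, digests_from_others):
    """Compare the full data from one replica with digests from the others.

    Returns (value, mismatch). A mismatch is the condition under which
    read repair would be triggered.
    """
    expected = digest(data_value)
    mismatch = any(d != expected for d in digests_from_others)
    return data_value, mismatch


in_sync = quorum_read(b"v1", [digest(b"v1")])
stale_replica = quorum_read(b"v1", [digest(b"v0")])
```

Sending digests instead of full values keeps the common, consistent case cheap; the full data is only needed from the other replicas when the hashes disagree.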

Re: commit logs are not deleted

2009-09-26 Thread Jonathan Ellis
On Fri, Sep 25, 2009 at 11:55 PM, Igor Katkov ikat...@gmail.com wrote: I checked out and built 0.4 branch. It's all the same, files stays. If you turn on debug logging, does it show sstable flushing? -Jonathan

Re: commit logs are not deleted

2009-09-26 Thread Jonathan Ellis
response time is also growing. I assume this is because Cassandra now has to read through some of these files on each read/write request On Fri, Sep 25, 2009 at 4:53 PM, Jonathan Ellis jbel...@gmail.com wrote: This is fixed on the 0.4 branch (but not in trunk, yet) On Fri, Sep 25, 2009 at 1:57

Re: commit logs are not deleted

2009-09-27 Thread Jonathan Ellis
commit log c:/cassandra-4rc/data/commitlog\CommitLog-1254027292143.log; dirty is 1, ... P.S. OS: Windows server 2003 On Sat, Sep 26, 2009 at 2:50 PM, Jonathan Ellis jbel...@gmail.com wrote: Patch 0001 on https://issues.apache.org/jira/browse/CASSANDRA-459 fixes another bug that could cause

Re: commit logs are not deleted

2009-09-27 Thread Jonathan Ellis
for CommitLogContext(file='c:/cassandra-4rc/data/commitlog\CommitLog-1254064318865.log', position=67321599), column family 0. CFIDs are BenchMarkApp: TableMetadata(Channels: 0, }), system: TableMetadata(LocationInfo: 1, HintsColumnFamily: 2, }), } On Sun, Sep 27, 2009 at 8:49 AM, Jonathan Ellis jbel

Re: commit logs are not deleted

2009-09-28 Thread Jonathan Ellis
deleted. Log if filled with lines like Not safe to delete commit log d:/cassandra-4/data/commitlog\CommitLog-1254177667321.log; dirty is 1, On Mon, Sep 28, 2009 at 5:21 PM, Jonathan Ellis jbel...@gmail.com wrote: I have a fix now at https://issues.apache.org/jira/browse/CASSANDRA-459 -- only

Re: read repair keeps occurring on every quorum read

2009-09-30 Thread Jonathan Ellis
Since JIRA is mostly dead right now, here is the patch to test against 0.4. On Mon, Sep 28, 2009 at 4:30 PM, Edmond Lau edm...@ooyala.com wrote: On Fri, Sep 25, 2009 at 8:10 PM, Jonathan Ellis jbel...@gmail.com wrote: No, you're mixing two related concepts. When you do a quorum read

Re: client code supercolumn insertion from cassandra 0.3.0 to 0.4.0

2009-10-02 Thread Jonathan Ellis
On Fri, Oct 2, 2009 at 3:50 AM, ROGER PUIG GANZA rp...@tid.es wrote: Code for Cassandra 0.4.0 . It’s supposed to be working but when I fetch a superColumn using the superKey, it returns a NotFoundException and when using the Cassandra-cli, I get this InvalidRequestException(why:column

Re: Connection Pooling

2009-10-03 Thread Jonathan Ellis
It would be pretty easy to create one with http://commons.apache.org/pool/. If your number of ops-per-connection is already high then pooling is a lower priority. On Sat, Oct 3, 2009 at 5:46 AM, Johannes Schaback johannes.schab...@visual-meta.com wrote: Hi! Just a quick question out of
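The message points at Apache Commons Pool for Java clients. As a language-neutral illustration of the same idea, here is a minimal fixed-size pool in Python; `Pool`, `factory`, and `connection` are names invented for this sketch and are unrelated to the Commons Pool API.

```python
import queue
from contextlib import contextmanager


class Pool:
    """Fixed-size pool that hands out pre-built connections and takes them back."""

    def __init__(self, factory, size):
        self._idle = queue.Queue()
        for _ in range(size):
            self._idle.put(factory())

    @contextmanager
    def connection(self, timeout=None):
        # Blocks until a connection is free; raises queue.Empty on timeout.
        conn = self._idle.get(timeout=timeout)
        try:
            yield conn
        finally:
            # Always return the connection, even if the caller raised.
            self._idle.put(conn)

    def idle_count(self):
        return self._idle.qsize()
```

Usage is `with pool.connection() as conn: ...`, where `factory` would open a Thrift transport in a real client. As the message notes, this matters less when each connection already serves many operations.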

Re: backing up data from cassandra

2009-10-05 Thread Jonathan Ellis
You can take a snapshot and either leave it in place indefinitely or throw it into your existing backup ecosystem. That's your best option for backup no matter which kind of partitioner you're using. -Jonathan On Mon, Oct 5, 2009 at 12:52 AM, Edmond Lau edm...@ooyala.com wrote: For folks who

Re: backing up data from cassandra

2009-10-05 Thread Jonathan Ellis
that manually anymore. On Mon, Oct 5, 2009 at 8:05 AM, Joe Van Dyk joevan...@gmail.com wrote: How do you take the snapshot?  What's the restore process? On Mon, Oct 5, 2009 at 5:22 AM, Jonathan Ellis jbel...@gmail.com wrote: You can take a snapshot and either leave it in place indefinitely or throw

Re: [VOTE] Project Logo

2009-10-05 Thread Jonathan Ellis
 [ 4] 2    http://99designs.com/contests/28940/entries/002  [10] 30   http://99designs.com/contests/28940/entries/030  [ 3] 32   http://99designs.com/contests/28940/entries/032  [11] 33   http://99designs.com/contests/28940/entries/033  [ 8] 90  

Re: backing up data from cassandra

2009-10-05 Thread Jonathan Ellis
On Mon, Oct 5, 2009 at 11:10 AM, Anthony Molinaro antho...@alumni.caltech.edu wrote: I assume the server also needs to be stopped while you are swapping files, but what about if you have a cluster of several servers and need to restore.  Is the process to shutdown all the servers, move the

Re: [VOTE] Project Logo

2009-10-05 Thread Jonathan Ellis
Oops, I had two at #8. Fixed:  [ 8] 90   http://99designs.com/contests/28940/entries/090  [ 9] 175  http://99designs.com/contests/28940/entries/175

Re: backing up data from cassandra

2009-10-05 Thread Jonathan Ellis
On Mon, Oct 5, 2009 at 11:23 AM, Thorsten von Eicken t...@rightscale.com wrote: Isn't the question about how you back up a cassandra cluster, not a single node? Sure, but the generalization is straightforward. :) Can you snapshot the various nodes at different times or do they need to be

Re: Error in the latest Trunk - RackAware Statergy

2009-10-05 Thread Jonathan Ellis
On Mon, Oct 5, 2009 at 5:02 PM, Vijay vijay2...@gmail.com wrote: ERROR [pool-1-thread-11] 2009-10-05 21:32:03,646 StorageProxy.java (line 179) error writing key c8f3c84c-6253-46fc-b1da-f00a871904f1 java.lang.UnsupportedOperationException: Rack-aware bootstrapping not supported pretty

Re: Storage proxy write latency is too high

2009-10-06 Thread Jonathan Ellis
is also broken in .NET (C#). Now I start to wonder if there is some fundamental flaw in async IO on windows... On Mon, Oct 5, 2009 at 3:23 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Oct 5, 2009 at 2:17 PM, Igor Katkov ikat...@gmail.com wrote: measured via JMX console i.e. does

Re: cassandra upgrade strategy

2009-10-06 Thread Jonathan Ellis
So far, it looks like the next version after 0.4 will meet our goal of being 100% compatible. If there are incompatibilities introduced in a future version after that (which is likely) then it will probably be one-upgrade-tool-fits-all. -Jonathan On Tue, Oct 6, 2009 at 4:37 PM, Jurijs

Re: Storage proxy write latency is too high

2009-10-06 Thread Jonathan Ellis
#). Now I start to wonder if there is some fundamental flaw in async IO on windows... On Mon, Oct 5, 2009 at 3:23 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Oct 5, 2009 at 2:17 PM, Igor Katkov ikat...@gmail.com wrote: measured via JMX console i.e. does not include client-cassandra-client

Re: circular buffer

2009-10-07 Thread Jonathan Ellis
It's not a fantastic fit because as you say you need to read the contents to be able to see which if any need to be deleted. (On the bright side if you are sorting by time uuid, for instance, you won't need an extra sort step.) I would predict it would perform better than doing the same thing in

Re: Quick nodeprobe question

2009-10-08 Thread Jonathan Ellis
This is fixed in the 0.4 branch and will be included in a forthcoming 0.4.1 release. On Thu, Oct 8, 2009 at 3:55 AM, Dan Larsen d...@techba.se wrote: Every time cassandra does a flush, the dbs grows, but nothing happens to the commitlog Am I missing something? Isn't the commitlog suppose to

want a ticket to NoSQL east?

2009-10-11 Thread Jonathan Ellis
Hi all, Rackspace is sponsoring NoSQL East (Oct 29/30 in Atlanta -- https://nosqleast.com/2009/) and we have two extra tickets to give away to the community. (Tickets for the original venue of 250 seats sold out, then they found another venue w/ 40 more seats, and it looks like those will sell

Re: want a ticket to NoSQL east?

2009-10-12 Thread Jonathan Ellis
These are spoken for now. On Sun, Oct 11, 2009 at 9:38 PM, Jonathan Ellis jbel...@gmail.com wrote: Hi all, Rackspace is sponsoring NoSQL East (Oct 29/30 in Atlanta -- https://nosqleast.com/2009/) and we have two extra tickets to give away to the community.  (Tickets for the original venue

Re: eventual consistency question

2009-10-13 Thread Jonathan Ellis
On Tue, Oct 13, 2009 at 4:52 PM, Brian Bulkowski br...@bulkowski.org wrote: Question 1:   the bootstrap parameter: what does it do, exactly? It's for adding nodes to an existing cluster. (This is being reworked to be more automatic for 0.5.) If you start a node without it, it assumes it

Re: I am looking for cross data center HA solution

2009-10-14 Thread Jonathan Ellis
On Wed, Oct 14, 2009 at 1:53 AM, Santal Li santal...@gmail.com wrote: But it seems the RackAwareStrategy was not finished; some functions directly throw an exception, e.g.: public EndPoint[] getReadStorageEndPoints(Token token, Map<Token, EndPoint> tokenToEndPointMap) {

Re: I am looking for cross data center HA solution

2009-10-14 Thread Jonathan Ellis
On Wed, Oct 14, 2009 at 1:53 AM, Santal Li santal...@gmail.com wrote: There was another way for cross data center HA,  how about add some kind of action hook, which can catch all the Cassandra.Iface function call(insert , batch_insert , remove ...) , then send the call to another Cassandra

Re: Cassandra restart

2009-10-14 Thread Jonathan Ellis
Turn on debug logging and see what nodes it is sending requests to, and what happens on that node to that request. Maybe there is an exception on the data node. 2009/10/14 Teodor Sigaev teo...@sigaev.ru: Why is it timing out? Sorry, I can't say why, I just see on client side {              

Re: Cassandra restart

2009-10-14 Thread Jonathan Ellis
And what happens on 10.3.2.36 for message 2691? 2009/10/14 Teodor Sigaev teo...@sigaev.ru: After restart old key become readable, another key become unreadable :) DEBUG [pool-1-thread-61] 2009-10-14 18:03:30,695 StorageProxy.java (line 239) weakreadlocal reading

Re: Cassandra restart

2009-10-15 Thread Jonathan Ellis
Does this still happen after a few dozen queries to 10.2.3.38? It looks like .40 is finding the key and trying to send it back, so one possibility is that .40's pooled socket to .38 hasn't realized that .38 was restarted, so the first few messages might get lost before it reconnects. 2009/10/15

Re: cassandra fatal error - The name should match the name of the current column or super column

2009-10-15 Thread Jonathan Ellis
Quicker question: Is this ColumnFamily using UTF8Type as its comparator? On Thu, Oct 15, 2009 at 1:17 PM, Jonathan Ellis jbel...@gmail.com wrote: The exception on restart is occurring during a compaction of already-written sstables.  It logs what the files are beforehand (Compacting

Re: cassandra fatal error - The name should match the name of the current column or super column

2009-10-15 Thread Jonathan Ellis
, Oct 15, 2009 at 1:35 PM, Edmond Lau edm...@ooyala.com wrote: Yes, I'm using UTF8Type as the comparator. On Thu, Oct 15, 2009 at 11:33 AM, Jonathan Ellis jbel...@gmail.com wrote: Quicker question: Is this ColumnFamily using UTF8Type as its comparator? On Thu, Oct 15, 2009 at 1:17 PM, Jonathan

Re: Thrift Perl API Timeout Issues

2009-10-15 Thread Jonathan Ellis
Are you also using Perl? On Thu, Oct 15, 2009 at 1:38 PM, Anthony Molinaro antho...@alumni.caltech.edu wrote: I see a similar thing happening all the time.  I get around it by closing the current connection and reconnecting after a sleep.  Although I am able to do quite a few inserts between

Re: cassandra fatal error - The name should match the name of the current column or super column

2009-10-16 Thread Jonathan Ellis
Don't know if you tried this yet, but Eric noticed a bug in my patch. There's a new one now, fixed. -Jonathan

Re: Compile error on revision 826089.

2009-10-16 Thread Jonathan Ellis
fixed 2009/10/16 Teodor Sigaev teo...@sigaev.ru: % ant release Buildfile: build.xml build-subprojects: init:    [mkdir] Created dir: /spool/home/teodor/cvs/Cassandra/cassandra-0.4/build/classes    [mkdir] Created dir: /spool/home/teodor/cvs/Cassandra/cassandra-0.4/build/test/classes    

Re: cassandra fatal error - The name should match the name of the current column or super column

2009-10-16 Thread Jonathan Ellis
16, 2009 at 1:23 PM, Jonathan Ellis jbel...@gmail.com wrote: Don't know if you tried this yet, but Eric noticed a bug in my patch. There's a new one now, fixed. -Jonathan
