Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
I've been asked to change the download url to http://people.apache.org/%7Ejbellis/cassandra/cassandra-0.3-rc.tgz to avoid incorrectly implying that this is An Official Release which it is not. -Jonathan

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
/14 Jonathan Ellis jbel...@gmail.com: Short version: http://incubator.apache.org/cassandra/cassandra-0.3.0-rc.tgz Long version: http://spyced.blogspot.com/2009/05/cassandra-03-release-candidate-and.html Release Candidate means we fixed all the bugs we could find; help us find more so

Re: Node Recovery

2009-05-18 Thread Jonathan Ellis
That's the price you pay for (a) eventual consistency in general and (b) doing read repair in the background specifically. Cassandra also has functionality (called strong read) to do a quorum read in the foreground and repair if necessary but that is not exposed in Thrift yet -- but even with

Re: multi-table

2009-05-18 Thread Jonathan ellis
Different apps will have different performance characteristics (and different key domains, which can also be important). So there are operational reasons to prefer cluster-per-app. That said, multi table support is high on my priority list. The changes required are straightforward so I'd

Re: Ingesting from Hadoop to Cassandra

2009-05-27 Thread Jonathan Ellis
On Wed, May 27, 2009 at 6:39 PM, Alexandre Linares lina...@ymail.com wrote: So it actually doesn't look blocked, but it's crawling.  Of course, in Hadoop, it always timed out (10 mins), before I could tell that it was crawling (I think) So, back to the original hypothesis: you need to increase

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
We're basically in a roll-your-own benchmark state. Johan can probably give some pointers: http://blog.oskarsson.nu/2009/05/vpork.html. Also see the how fast is it section here: http://spyced.blogspot.com/2009/05/cassandra-03-release-candidate-and.html -Jonathan On Wed, Jun 3, 2009 at 3:06 AM,

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
and the write path will never hit disk. On Wed, Jun 3, 2009 at 5:48 PM, Jonathan Ellis jbel...@gmail.com wrote: Cassandra is not designed to work memory-only.  It's designed designed to use disk for durability and to accommodate using large sets of data, letting the OS use memory as a huge cache

Re: questions about operations

2009-06-04 Thread Jonathan Ellis
On Thu, Jun 4, 2009 at 10:01 AM, Thorsten von Eicken t...@rightscale.com wrote: Ah, got it, I forgot about the time-sorted CFs. So does this mean that if I call get_columns_since on a name-sorted CF I will get an invalid request exception? And also if I call get_slice_by_name_range or

Re: New table and column families

2009-06-23 Thread Jonathan Ellis
you'll need to (a) make sure you have the latest trunk (b) wipe your data, commitlog, and system directories, since adding new tables or columnfamilies non-destructively is not yet supported (see https://issues.apache.org/jira/browse/CASSANDRA-44) -Jonathan On Tue, Jun 23, 2009 at 8:55 AM,

Re: schema example

2009-07-03 Thread Jonathan Ellis
On Fri, Jul 3, 2009 at 8:53 PM, Evan Weaverewea...@gmail.com wrote: (From talking on IRC): I think this boils down to the offset/limit vs. token/limit debate. Token/limit is fine in all cases for me, but you still have to be able to query the head of the list (with a limit, but no token) to

Re: [Announce] CassandraClient 0.1 for Ruby released

2009-07-04 Thread Jonathan Ellis
Nice! On Sat, Jul 4, 2009 at 4:59 AM, Evan Weaverewea...@gmail.com wrote: I am pleased to release: cassandra_client 0.1 A Ruby client for the Cassandra distributed database. http://blog.evanweaver.com/files/doc/fauna/cassandra_client/ http://github.com/fauna/cassandra_client/ Evan --

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
cassandra Thanks a lot On Thu, Jul 9, 2009 at 10:27 AM, Jonathan Ellis jbel...@gmail.com wrote: what version are you trying to run?  on what platform? On Thu, Jul 9, 2009 at 12:04 PM, mobiledream...@gmail.com wrote: I did set it up as the readme file instructed but i encountered this error

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
, 2009 at 3:20 PM, mobiledream...@gmail.com wrote: why was the web interface removed? Is there a simple python client for cassandra like python-couchdb thanks a lot On Thu, Jul 9, 2009 at 12:25 PM, Jonathan Ellis jbel...@gmail.com wrote: for 0.3 you can connect to the web interface on port 7002

Re: How to answer queries of form Give me the top 10 messages

2009-07-10 Thread Jonathan Ellis
Have you read this? http://blog.evanweaver.com/articles/2009/07/06/up-and-running-with-cassandra/ On Fri, Jul 10, 2009 at 4:43 PM, mobiledream...@gmail.com wrote: Hey guys how do we answer queries of type - give me the top 10 messages or top 10 users and so on thanks Example: SuperColumns

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
Cassandra is replaying the transaction log and preloading SSTable indexes. This is normal. On Mon, Jul 13, 2009 at 8:10 AM, rkmr...@gmail.comrkmr...@gmail.com wrote: when i stop cassandra and start it again, this is what is printed. it takes just a couple of seconds for this to run. and after

Re: WARN - Unable to find a live Endpoint we might be out of live nodes , This is dangerous !!!!

2009-07-17 Thread Jonathan Ellis
programs # that require network functionality will fail. 127.0.0.1 localhost.localdomain localhost localhost ::1 localhost6.localdomain6 localhost6 On Fri, Jul 17, 2009 at 6:09 AM, Jonathan Ellis jbel...@gmail.com wrote: Please don't repeat your question separately on -user, -dev, and irc

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
thought Hinted handoff will take care of this Right? Write will never fail insted it will write to another node right? correct me if i am wrong. Thanks and Regards, /VJ On Tue, Jul 14, 2009 at 7:26 AM, Jonathan Ellis jbel...@gmail.com wrote: N: guarantees consistent reads without having

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
availability.) -Jonathan During query it will fail if i only have block for to be 3? Regards, /VJ On Fri, Jul 17, 2009 at 1:36 PM, Jonathan Ellis jbel...@gmail.com wrote: ck_for to zero, then writes will indeed never fail (unless the node the client is ta

Re: ì¤ ìì§ ì¤ ì thrift.Thrift.TApplicationEx ception: Internal error processing insert

2009-07-19 Thread Jonathan Ellis
That should be partially solved in trunk now that 139 is committed, and more solved when we commit 185 soon. On Sun, Jul 19, 2009 at 3:43 AM, mobiledream...@gmail.com wrote: Any utf-8 keyword causes cassandra to crash!

Re: how to delete an entire column family

2009-07-19 Thread Jonathan Ellis
iterate through the keys with get_key_range, and delete the row associated with each key On Sun, Jul 19, 2009 at 3:51 AM, mobiledream...@gmail.com wrote: In Super-column family Super1 there is a column family Related How do i delete the entire related column family thanks

Re: python thrift cassandra: get_slice_super vs get_slice_super_by_names

2009-07-19 Thread Jonathan Ellis
, Jonathan Ellis jbel...@gmail.com wrote: I would guess because kw != 'tofu' On Sun, Jul 19, 2009 at 12:24 AM, mobiledream...@gmail.com wrote: Why doesnt res return ColumnFamily Related whereas res2 works just fine thanks? timestamp = time.time() res = client.get_slice_super('Table1', kw

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
Don't run trunk if you're not going to read svn log. The api changed with the commit of the 139 patches (and it will change again with the 185 ones). look at interface/cassandra.thrift to see what arguments are expected. On Sun, Jul 19, 2009 at 3:31 PM, mobiledream...@gmail.com wrote: Hey

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
It works fine, it's just not the same as it was two weeks ago. On Sun, Jul 19, 2009 at 3:36 PM, mobiledream...@gmail.com wrote: ok so which is the version where cassandra python thrift works out of the box thanks On 7/19/09, Jonathan Ellis jbel...@gmail.com wrote: Don't run trunk if you're

Re: AttributeError: 'str' object has no attribute 'write'

2009-07-19 Thread Jonathan Ellis
Building the java interface is part of the build, but ant has no way to guess which additional client interfaces you want to use, if any. On Sun, Jul 19, 2009 at 6:46 PM, Ian Holsmani...@holsman.net wrote: hi Gasol. shouldn't regeneration of the interface be part of the build process? On

Fwd: thrift API changes

2009-07-20 Thread Jonathan Ellis
Oops, I sent this to the old google -user list by mistake the first time. Now that that's gone, I realized the error. -- Forwarded message -- From: Jonathan Ellis jbel...@gmail.com Date: Mon, Jul 20, 2009 at 10:10 PM Subject: Re: thrift API changes To: cassandra-u

Re: trunk

2009-07-21 Thread Jonathan Ellis
the internals should be solid but we are in the middle (towards the end of, actually) changing the thrift api pretty drastically. (the colons had to go, and the sooner we bit the bullet, the better. :) see this thread --

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
did you read the new section in the config xml explaining how to use a UTF8 comparator? also: thrift itself is just plain broken for unicode support in some languages; see THRIFT-395 I think the short version is that when you have a java server, unicode will work with java or C# clients but not

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, mobiledream...@gmail.com wrote: thanks jonathan trying this ColumnFamily ColumnType=Super CompareWith=UTF8Type Name=Super1/ On Tue, Jul 21, 2009 at 2:24 PM, Jonathan Ellis jbel...@gmail.com wrote: On Tue

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
clarify if this will guarantee proper python thrift utf8 behavior thanks On Tue, Jul 21, 2009 at 2:29 PM, Jonathan Ellis jbel...@gmail.com wrote: you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, mobiledream...@gmail.com wrote: thanks jonathan trying

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
clarify if this will guarantee proper python thrift utf8 behavior thanks On Tue, Jul 21, 2009 at 2:29 PM, Jonathan Ellis jbel...@gmail.com wrote: you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, mobiledream...@gmail.com wrote: thanks jonathan trying

Re: how to deploy cassadnra 0.30 in Running on multiple nodes

2009-07-24 Thread Jonathan Ellis
because nodeprobe is in 0.4, not 0.3 On Fri, Jul 24, 2009 at 2:26 AM, 李楠qingnan...@gmail.com wrote: i deploy in storage-conf.xml Seeds add Seed192.1.12.149/Seed use bin/nodeprobe -host X ring where X is each machine in your cluster to make sure all the nodes see each other why do not i find 

Re: ColumnFamily config options

2009-08-01 Thread Jonathan Ellis
On Sat, Aug 1, 2009 at 9:04 AM, Ivan Changivan.ch...@medigy.com wrote: Could someone points me to resources/examples how to use the config options below: 1) Extending index at run time using the Index node   ColumnFamily name=foo     Index generator=some java class/   /ColumnFamily Index

Re: [Fwd: Re: Greetings!]

2009-08-01 Thread Jonathan Ellis
On Fri, Jul 31, 2009 at 5:42 PM, Colin Mollenhourco...@mollenhour.com wrote: This reply keeps getting blocked as spam so I am just sending to you directly.. Jonathan, thank you very much for the excellent response. If I may, a few more questions (inline): One caveat is that the subcolumns

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 10:49 AM, Jun Raojun...@almaden.ibm.com wrote: Ivan, The original cassandra keeps multiple versions of the column data. No, it didn't. (It had versioning-related bugs but multiple versions a la Bigtable was never part of the design.) -Jonathan

Re: MVCC

2009-08-03 Thread Jonathan Ellis
cassandra had free multiple versions and we needed to manually delete the older versions On Mon, Aug 3, 2009 at 8:56 AM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Aug 3, 2009 at 10:49 AM, Jun Raojun...@almaden.ibm.com wrote: Ivan, The original cassandra keeps multiple versions

Re: MVCC

2009-08-03 Thread Jonathan Ellis
Strictly speaking, no; timestamp is client-provided. But in the sense that you'd better use ntpd on your clients, yes. On Mon, Aug 3, 2009 at 12:10 PM, Wilson Marwilson...@gmail.com wrote: So if different servers are not synchronized in time (to a Tier 1 time server), then updates from slower

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 12:12 PM, Mark McBridemark.mcbr...@gmail.com wrote: Thanks, that makes sense.  Is it an ok general rule that the timestamps should be set to 1) The time that the data to be mutated was generated 2) The current system time if the time the data was mutated isn't available

Re: MVCC

2009-08-03 Thread Jonathan Ellis
On Mon, Aug 3, 2009 at 3:39 PM, Ivan Changivan.ch...@medigy.com wrote: Is this going to be an inherent limitation of Cassandra? If someone writes a patch that adds multi-version support without compromising single-version performance then I don't see any reasons to turn it down. -Jonathan

Re: .NET client example

2009-08-04 Thread Jonathan Ellis
It should be fairly easy to port the Java example on http://wiki.apache.org/cassandra/ClientExamples to C#. -Jonathan On Tue, Aug 4, 2009 at 2:55 PM, Robinson, Michaelmichael.robins...@sage.com wrote: Does anyone have or know of a C#/.NET port of the example Cassandra client code?  Since I’m

heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Hi guys, We're trying to get the data format incompatibilities done with before we put out a 0.4 beta. There's 3 coming up. In order of when they are likely to be finished: https://issues.apache.org/jira/browse/CASSANDRA-341 -- to fix bugs (today)

Re: heads up on on-disk data format changes in trunk

2009-08-04 Thread Jonathan Ellis
Re the index bugs: this is referring to column indexes, not key indexes. (The former are kept in the SSTable; the latter are in a separate file. Don't delete your -Index.db files. :) And a correction: 341 is the only one that the compaction process will fix, not 332 as well. -Jonathan On Tue,

default OrderPreservingPartitioner changed

2009-08-07 Thread Jonathan Ellis
The default OPP now does comparisons based strictly on byte order, and is no longer collation aware. This is a better default choice for those who don't need collation since it's much faster. If you do need collation, the old partitioner is still available as CollatingOPP: -

Re: new thrift API

2009-08-08 Thread Jonathan Ellis
Stabilizing but not quite finished (329 and 311 are still waiting for review). But the fundamentals are the same. You have start/finish because that's what defines a range. You have count because you often want the First N results. -Jonathan On Sat, Aug 8, 2009 at 8:33 AM, Jonas

Re: new thrift API

2009-08-08 Thread Jonathan Ellis
a numeric offset since that can't be supported efficiently with a log-structured merge disk format. On Sat, Aug 8, 2009 at 8:57 AM, Jonas Bonérjo...@jonasboner.com wrote: 2009/8/8 Jonathan Ellis jbel...@gmail.com: (either of start and finish may be empty) Thanks for fast reply. So count

Re: Cassandra memory footprint

2009-08-10 Thread Jonathan Ellis
I turns out there were several bugs that make 0.3 run out of memory during sustained insert. These are fixed in trunk, which is almost stable (#233 is the last disk format change, and will be committed as soon as review is done). -Jonathan On Mon, Aug 10, 2009 at 7:20 PM, Huming

Re: Valid consistency level values on trunk

2009-08-11 Thread Jonathan Ellis
Thanks for looking into it, though. On Tue, Aug 11, 2009 at 10:59 AM, Mark McBridemark.mcbr...@gmail.com wrote: My shot at hero status has been thwarted, due to a preexisting Jira issue http://issues.apache.org/jira/browse/THRIFT-551 On Mon, Aug 10, 2009 at 5:49 PM, Mark

Re: New thrift api question.

2009-08-12 Thread Jonathan Ellis
abandon the old design? how to build 'one to many' relationship in a effective way with new api? what data model should i use? thank? 2009/8/13 Jonathan Ellis jbel...@gmail.com The easy way is to just show one page of results and have a Next button that uses start=last one from current page

Re: Visual representation of Cassandra data model

2009-08-12 Thread Jonathan Ellis
Thanks for taking a stab at this, Mark. I'm not a fan of teaching this by showing CF-spanning rows. (The bigtable paper does this IIRC but it's wrong. :) You can have data in different CFs with the same key, yes, but all that means is they will be stored on the same nodes. Each CF is stored

0.4.0 beta1 artifacts are now being voted on

2009-08-14 Thread Jonathan Ellis
If you would like to review the artifacts while the Apache Incubator PMC votes on them, you can find them at http://people.apache.org/~eevans. After the vote (assuming it is in the affirmative) we will push them to the mirrors and website. Changelog (from CHANGES.txt): * On-disk data format

Re: Cassandra performance

2009-08-17 Thread Jonathan Ellis
What happened about 20h in to make the latency drop so dramatically? On Mon, Aug 17, 2009 at 12:14 PM, Huming Wuhuming...@gmail.com wrote: I did some performance test and I am not impressed :). The data set is 880K unique keys and there are 4 columns with 2 columns being string and the other 2

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: It produces following error messages $VAR1 = 'Can\'t use string (0) as a SCALAR ref while strict refs in use at /usr/lib/perl5/site_perl/5.8.8/Thrift/BinaryProtocol.pm line 376.' If it's objecting to the 0 in reversed=0, it sounds like a perl specific

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: If it's objecting to the 0 in reversed=0, it sounds like a perl specific problem -- why would it be turning that into a string? It seems to me that it isn't connected to reversed. I changed to 1 and nothing was changed. Dunno, then. Bug in the

Re: Pls, help with fetching of super-column's value

2009-08-19 Thread Jonathan Ellis
2009/8/19 Teodor Sigaev teo...@sigaev.ru: Dunno, then.  Bug in the generated perl code?  Wouldn't be the first time. Interesting, if column_names is added with known status ids then Cassandra returns them although it doesn't take into account reversed and count options. it's not supposed to.

Re: Anybody experience one Cassandra server locking up?

2009-08-19 Thread Jonathan Ellis
sounds like you are exhausting the memory on that instance and it is going into GC swap trying to free enough to continue. this is very easy to do on 0.3 -- try upgrading to the 0.4 beta if you are using 0.3. On Tue, Aug 18, 2009 at 3:36 PM, Brian Frank Coopercoop...@yahoo-inc.com wrote: Hi

Re: quorum read timeout

2009-08-19 Thread Jonathan Ellis
Is this 0.3 or 0.4/trunk? On Wed, Aug 19, 2009 at 2:36 PM, Phillip Michalakphil.micha...@digitalreasoning.com wrote: I'm running three Cassandra nodes in virtual machines. During a 'get' operation from Cassandra-remote directed at one of these nodes, I'm receiving the following output

Re: quorum read timeout

2009-08-19 Thread Jonathan Ellis
Looks like a bug in TcpConnectionManager. Can you file a ticket? thanks, -Jonathan On Wed, Aug 19, 2009 at 2:49 PM, Phillip Michalakphil.micha...@digitalreasoning.com wrote: It's cassandra-0.4-beta1. Thanks! Phil On Aug 19, 2009, at 4:43 PM, Jonathan Ellis wrote: Is this 0.3 or 0.4

Re: Anybody experience one Cassandra server locking up?

2009-08-19 Thread Jonathan Ellis
On Wed, Aug 19, 2009 at 5:19 PM, Brian Frank Coopercoop...@yahoo-inc.com wrote: We are trying to learn what we can about the performance of Cassandra. I hope to have some results to share publicly in the next couple of weeks. The 0.4 version seems to have handled the insert load better, but

Re: Cassandra performance

2009-08-19 Thread Jonathan Ellis
be careful when profiling blocking io -- I bet that means that I'm spending all my time blocking for more data to read since there is only one call per second. the internal Cassandra MessagingService uses nonblocking io, but the Thrift stuff is just your standard thread pool with blocking

Re: Server cannot startup after shutdown

2009-08-19 Thread Jonathan Ellis
The malformed input bug was fixed after beta1 and should be in a nightly build by now. (I introduced a regression where it couldn't handle the last entry in the commitlog being incomplete. So upgrading should be able to restart on the existing commitlogs.) The OOM puzzles me a little; I'm not

Re: Server cannot startup after shutdown

2009-08-20 Thread Jonathan Ellis
: Jonathan Ellis [jbel...@gmail.com] Sent: Wednesday, August 19, 2009 7:46 PM To: cassandra-user@incubator.apache.org Subject: Re: Server cannot startup after shutdown The malformed input bug was fixed after beta1 and should be in a nightly build by now.  (I introduced a regression where it couldn't

Re: Cassandra security

2009-08-21 Thread Jonathan Ellis
if your product is jvm based, just use the internal api and don't stzrt the thrift listeners at all. On 8/21/09, Mark McBride mark.mcbr...@gmail.com wrote: I'm looking at the potential of embedding Cassandra in one of our products. This ships as one or more virtual appliances that runs at a

Re: Newbe´s question

2009-08-24 Thread Jonathan Ellis
There's two things going on here, I think. One is that you're passing a null where there shouldn't be one. This is a thrift bug and will be addressed in the next beta. The other is that IIRC lazyboy needs to be updated to work with the latest 0.4 api. -Jonathan On Mon, Aug 24, 2009 at 9:09

Re: Newbe´s question

2009-08-24 Thread Jonathan Ellis
lazyboy works vs an earlier version of trunk, so it's already incompatible with 0.3, but not yet compatible w/ latest 0.4 :) On Mon, Aug 24, 2009 at 12:28 PM, Bruno Coutobco...@gmail.com wrote: Jonathan, First, thanks for answering so fast. I´m using version 0.3-final of Cassandra, then I

Re: Newbe´s question

2009-08-24 Thread Jonathan Ellis
That's probably the best option at the moment. Once you're familiar with the thrift API I'm sure the lazyboy devs would welcome updates too. -Jonathan On Mon, Aug 24, 2009 at 1:54 PM, Bruno Coutobco...@gmail.com wrote: Thanks for helpping me Jonathan! Well, now I know that I can´t use the

Re: Cassandra + Hadoop + BMT

2009-08-24 Thread Jonathan Ellis
Thanks, Chris! On Mon, Aug 24, 2009 at 9:44 PM, Chris Goffinetgoffi...@digg.com wrote: Hi Guys This is long overdue but I have posted a very rough rough example (with Digg stuff removed) for getting BMT working with Cassandra. Patches are coming next up for the JIRA tickets. I'll try to get

Re: Server cannot startup after shutdown

2009-08-26 Thread Jonathan Ellis
On Wed, Aug 26, 2009 at 1:03 AM, Brian Frank Coopercoop...@yahoo-inc.com wrote: Hi, Jonathan, I have been trying to shutdown and restart Cassandra again this morning. I still get the malformed entry bug (which you say below your patch fixes.) I also get: ERROR - Exception encountered

Re: Cassandra hardware setup

2009-08-26 Thread Jonathan Ellis
On Tue, Aug 25, 2009 at 7:07 PM, Scott Chaconscha...@gmail.com wrote: We're playing with Cassandra and would like to get a test cluster setup for evaluation.  I've been playing with it on my laptop and EC2, which are the resources easily available to me, but not that close to what I would be

Re: Server cannot startup after shutdown

2009-08-26 Thread Jonathan Ellis
On Wed, Aug 19, 2009 at 9:46 PM, Jonathan Ellisjbel...@gmail.com wrote: The OOM puzzles me a little; I'm not sure how it could be unable to replay a mutation that it was able to write to the commitlog in the first place. Ah, I think I know: if a compaction starts during recovery, that could

Re: Server cannot startup after shutdown

2009-08-26 Thread Jonathan Ellis
On Wed, Aug 26, 2009 at 12:26 PM, Brian Frank Coopercoop...@yahoo-inc.com wrote: Is the commitlog small enough that you can gzip it and attach to JIRA (10 MB limit)? /var/cassandra/commitlog has 215 files totaling about 28 GB. Most are 134 MB, the last one is 6MB. Which one would be useful

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
I thought about that, but I really don't want Cassandra committers to have to be in the business of updating them all when we make changes, and having them in the repo creates that expectation even in contrib. On Wed, Aug 26, 2009 at 6:57 PM, Ian Holsmani...@holsman.net wrote: would it be

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
or svn:link to pull them in from their own repos? (not sure how legal it would be). On Aug 27, 2009, at 10:03 AM, Jonathan Ellis wrote: I thought about that, but I really don't want Cassandra committers to have to be in the business of updating them all when we make changes, and having them

Re: Newbe´s question

2009-08-26 Thread Jonathan Ellis
...@holsman.net wrote: isn't there a way to use svn:external or svn:link to pull them in from their own repos? (not sure how legal it would be). On Aug 27, 2009, at 10:03 AM, Jonathan Ellis wrote: I thought about that, but I really don't want Cassandra committers to have to be in the business

Re: when using nodeprobe: java.lang.OutOfMemoryError: Java heap space

2009-08-28 Thread Jonathan Ellis
On Fri, Aug 28, 2009 at 11:25 AM, Simon Smithsimongsm...@gmail.com wrote: I'm getting a traceback when using nodeprobe against Cassandra. That looks like a Thrift bug. :( Can you try an older version of Cassandra, e.g. trunk from a week ago, or the beta1 release, to see if the Thrift library

Re: when using nodeprobe: java.lang.OutOfMemoryError: Java heap space

2009-08-28 Thread Jonathan Ellis
Oh, I see the problem: nodeprobe uses the jmx port (specified in cassandra.in.sh -- default 8080), not the thrift port. Can you file a bug with Thrift/java not to OOM when someone connects to the socket and sends nonsense? :) https://issues.apache.org/jira/browse/THRIFT -Jonathan On Fri, Aug

Re: get_slice() inconsistent in 0.4?

2009-09-01 Thread Jonathan Ellis
You must be using an old version (beta1 maybe?) We switched LongType to big-endian in https://issues.apache.org/jira/browse/CASSANDRA-384. -Jonathan On Tue, Sep 1, 2009 at 9:28 PM, Bhaskar Rathodbhaskar_rat...@yahoo.com wrote: Hi, I have been playing with get_slice() in cassandra 0.4, and I

Re: Error message

2009-09-02 Thread Jonathan Ellis
If you can come up with a minimal script to reproduce that would be awesome. 2009/9/2 Teodor Sigaev teo...@sigaev.ru: That's a bug...  Somehow you got it to try to write out a row containing zero columns, which isn't supposed to be allowed. After rechecking of my application, I tried to play

Re: random n00b question

2009-09-15 Thread Jonathan Ellis
We don't currently have any optimizations to provide lightweight session consistency (see #132), but if you do quorum reads + quorum writes then you are guaranteed to read the most recent write which should be fine for most apps. On Tue, Sep 15, 2009 at 5:30 AM, Eric Bowman ebow...@boboco.ie

Re: random n00b question

2009-09-15 Thread Jonathan Ellis
On Tue, Sep 15, 2009 at 10:09 AM, Mark Robson mar...@gmail.com wrote: Even using quorum reads and writes, if a user in the same session has two pages active at once, session data would be trashed. True. But for most web apps I've seen, last-write-wins is just fine. YMMV. :) -Jonathan

Re: New Features - Future releases

2009-09-18 Thread Jonathan Ellis
On Fri, Sep 18, 2009 at 9:09 PM, Jonathan Mischo jmis...@quagility.com wrote:        • Multiple data center replication in the background. maybe a multi master type thing It already has this. It was built from the ground up for this. It's highly tolerant to partitioning and has always

Re: Configuring multiple nodes

2009-09-20 Thread Jonathan Ellis
Could you mention this on http://wiki.apache.org/cassandra/RunningCassandra ? On Sun, Sep 20, 2009 at 10:46 PM, Anthony Molinaro antho...@alumni.caltech.edu wrote: One other gotcha I've come across trying to use jmx was on machines with multiple interfaces.  The rmi server doesn't always bind

Re: Your Feedback Needed (was: Logo Design Contest)

2009-09-24 Thread Jonathan Ellis
You just need to sign up for an account and then you can comment on the main contest page. (Only the proposal submitter can comment on the individual entry pages.) -Jonathan On Thu, Sep 24, 2009 at 6:13 PM, Jayaraman Sampathkumar jayaraman@gmail.com wrote: Eric, 99designs does not permit

Re: commit logs are not deleted

2009-09-24 Thread Jonathan Ellis
When all the data from a given commit log segment has been flushed as sstables, that segment can be deleted. So if you do a bunch of inserts and then stop, it's normal to have some commitlogs around indefinitely. All CFs are flushed on server restart, and the log segments can then be removed, or

Re: perfomance issue

2009-09-25 Thread Jonathan Ellis
It's impossible to say given what you have told us. Step zero in benchmarking cassandra is turning the log level to INFO. Step one is testing on a machine where you can put the commitlog directory on its own disk. It's true that frequently cassandra will be slower than custom code writing to

Re: perfomance issue

2009-09-25 Thread Jonathan Ellis
. -Jonathan On Fri, Sep 25, 2009 at 10:41 AM, Kirill A. Korinskiy catap+cassan...@catap.ru wrote: At Fri, 25 Sep 2009 10:32:55 -0500, Jonathan Ellis jbel...@gmail.com wrote: Step zero in benchmarking cassandra is turning the log level to INFO. sure, i'm switching off all log messages Step

Re: differences between keyspaces and tables

2009-09-25 Thread Jonathan Ellis
0.4 RC2 is better than anything using Table :) -final should be out soon. On Fri, Sep 25, 2009 at 3:27 PM, Joe Van Dyk joevan...@gmail.com wrote: Ah, ok.  I was using http://blog.evanweaver.com/articles/2009/07/06/up-and-running-with-cassandra/ and didn't realize that it wasn't using the

Re: commit logs are not deleted

2009-09-25 Thread Jonathan Ellis
This is fixed on the 0.4 branch (but not in trunk, yet) On Fri, Sep 25, 2009 at 1:57 PM, Jonathan Ellis jbel...@gmail.com wrote: https://issues.apache.org/jira/browse/CASSANDRA-455 will address FlushPeriod not working. On Fri, Sep 25, 2009 at 1:33 PM, Igor Katkov ikat...@gmail.com wrote: I

Re: read repair keeps occurring on every quorum read

2009-09-25 Thread Jonathan Ellis
No, you're mixing two related concepts. When you do a quorum read it will fetch the actual data from one replica and do digest reads from the others. If the data from the one does not match the hash from the others, then you have the digestmismatchexception Edmond is seeing and read repair is

Re: commit logs are not deleted

2009-09-26 Thread Jonathan Ellis
On Fri, Sep 25, 2009 at 11:55 PM, Igor Katkov ikat...@gmail.com wrote: I checked out and built 0.4 branch. It's all the same, files stays. If you turn on debug logging, does it show sstable flushing? -Jonathan

Re: commit logs are not deleted

2009-09-26 Thread Jonathan Ellis
response time is also growing. I assume this is because Cassandra now has to read through some these files on reach read/write request On Fri, Sep 25, 2009 at 4:53 PM, Jonathan Ellis jbel...@gmail.com wrote: This is fixed on the 0.4 branch (but not in trunk, yet) On Fri, Sep 25, 2009 at 1:57

Re: commit logs are not deleted

2009-09-26 Thread Jonathan Ellis
of commit log segments is growing, server response time is also growing. I assume this is because Cassandra now has to read through some these files on reach read/write request On Fri, Sep 25, 2009 at 4:53 PM, Jonathan Ellis jbel...@gmail.com wrote: This is fixed on the 0.4 branch (but not in trunk

Re: commit logs are not deleted

2009-09-27 Thread Jonathan Ellis
commit log c:/cassandra-4rc/data/commitlog\CommitLog-1254027292143.log; dirty is 1, ... P.S. OS: Windows server 2003 On Sat, Sep 26, 2009 at 2:50 PM, Jonathan Ellis jbel...@gmail.com wrote: Patch 0001 on https://issues.apache.org/jira/browse/CASSANDRA-459 fixes another bug that could cause

Re: commit logs are not deleted

2009-09-27 Thread Jonathan Ellis
for CommitLogContext(file='c:/cassandra-4rc/data/commitlog\CommitLog-1254064318865.log', position=67321599), column family 0. CFIDs are BenchMarkApp: TableMetadata(Channels: 0, }), system: TableMetadata(LocationInfo: 1, HintsColumnFamily: 2, }), } On Sun, Sep 27, 2009 at 8:49 AM, Jonathan Ellis jbel

Re: commit logs are not deleted

2009-09-28 Thread Jonathan Ellis
, Igor Katkov ikat...@gmail.com wrote: There is no log lines with column family 1 There are only column family 0 On Sun, Sep 27, 2009 at 12:02 PM, Jonathan Ellis jbel...@gmail.com wrote: we're looking for column family 1

Re: commit logs are not deleted

2009-09-28 Thread Jonathan Ellis
deleted. Log if filled with lines like Not safe to delete commit log d:/cassandra-4/data/commitlog\CommitLog-1254177667321.log; dirty is 1, On Mon, Sep 28, 2009 at 5:21 PM, Jonathan Ellis jbel...@gmail.com wrote: I have a fix now at https://issues.apache.org/jira/browse/CASSANDRA-459 -- only

Re: read repair keeps occurring on every quorum read

2009-09-30 Thread Jonathan Ellis
Since JIRA is mostly dead right now, here is the patch to test against 0.4. On Mon, Sep 28, 2009 at 4:30 PM, Edmond Lau edm...@ooyala.com wrote: On Fri, Sep 25, 2009 at 8:10 PM, Jonathan Ellis jbel...@gmail.com wrote: No, you're mixing two related concepts. When you do a quorum read

Re: distributing tokens equally along the key distribution space

2009-10-01 Thread Jonathan Ellis
On Thu, Oct 1, 2009 at 11:26 AM, Igor Katkov ikat...@gmail.com wrote: Hi, Question#1: How to manually select tokens to force equal spacing of tokens around the hash space? (Answered by Jun.) Question#2: Let's assume that #1 was resolved somehow and key distribution is more or less even.

Re: distributing tokens equally along the key distribution space

2009-10-01 Thread Jonathan Ellis
spaced. I see that there is tokenupdater (org.apache.cassandra.tools.TokenUpdater) What happens with keys and data if I run it on A, B, C and D with new, better spaced tokens? Should I? is there a better procedure? On Thu, Oct 1, 2009 at 12:48 PM, Jonathan Ellis jbel...@gmail.com wrote

Re: distributing tokens equally along the key distribution space

2009-10-01 Thread Jonathan Ellis
A and restart it with -b? Then same for the rest? restarting only one node at a time? On Thu, Oct 1, 2009 at 1:22 PM, Jonathan Ellis jbel...@gmail.com wrote: tokenupdater does not move data around; it's just an alternative to setting initialtoken on each node.  so you really want to get your

Re: distributing tokens equally along the key distribution space

2009-10-01 Thread Jonathan Ellis
node to node and appropriately modifying tokens? On Thu, Oct 1, 2009 at 1:42 PM, Jonathan Ellis jbel...@gmail.com wrote: You basically have two options.  You can wipe your data, change the tokens, and reload things, or you can add new nodes with -b to rebalance things that way. On Thu, Oct 1

Re: Cassandra Full-Text Search?

2009-10-01 Thread Jonathan Ellis
over the weekend. Best wishes, Neophytos On Wed, Sep 30, 2009 at 4:00 AM, Jonathan Ellis jbel...@gmail.com wrote: Cassandra does not give you FTS or any secondary indexing out of the box.  It's a pretty low-level API, but it will scale just fine if you want to build a FTS index out

  1   2   3   4   5   6   >