Re: NPE in apache cassandra

2009-03-11 Thread Jonathan Ellis
the config format changed. now you need to specify the cfname as an attribute: -Jonathan On Wed, Mar 11, 2009 at 3:52 PM, Jiansheng Huang wrote: > > > -- Forwarded message -- > From: Jiansheng Huang > Date: Wed, Mar 11, 2009 at 2:49 PM > Subject: NPE in apache cas

Re: NPE in apache cassandra

2009-03-11 Thread Jonathan Ellis
Also, it will not work AT ALL with data from the old version. you need to start fresh. -Jonathan On Wed, Mar 11, 2009 at 4:00 PM, Jonathan Ellis wrote: > the config format changed.  now you need to specify the cfname as an > attribute: > >             > > -Jonathan > >

Re: OPHF vs. Random

2009-03-11 Thread Jonathan Ellis
Use Random for now. The OPHF is the same as the old one, i.e., not actually OP. :) I'm pretty convinced at this point that it's impossible to have an order-preserving hash that doesn't either (a) impose a relatively short key length past which no partitioning is done (i.e., all keys w/ the same p

Re: OPHF vs. Random

2009-03-16 Thread Jonathan Ellis
The order-preserving partitioner code (not hash-based anymore) is up now at https://issues.apache.org/jira/browse/CASSANDRA-3. -Jonathan On Wed, Mar 11, 2009 at 6:48 PM, Jonathan Ellis wrote: > Use Random for now.  The OPHF is the same as the old one, i.e., not > actually OP. :) > >

Re: OPHF vs. Random

2009-03-16 Thread Jonathan Ellis
.messageid") Would this solve the problem? What is this > approach missing? > > Do we just need to decouple the hash used for routing from the key > used in the end-point for storage? Is this essentially what the series > of patches does? > > > On Mon, Mar 16,

some "getting started" information

2009-03-28 Thread Jonathan Ellis
Hi all, There's a bunch of useful material about getting started with Cassandra but it's rather scattered. So until we get our wiki going I wrote a blog post pulling some of that together: http://spyced.blogspot.com/2009/03/why-i-like-cassandra.html HTH, -Jonathan

cassandra-20

2009-03-31 Thread Jonathan Ellis
Just a heads up that I committed Eric Evans's patch from #20, which replaces bin/start-server with bin/cassandra and bestows it with magical shell kung-fu to background the server cleanly by default. Should work out of the box on linux, OS X, and cygwin. Use the -f flag to put it in foreground mod

Cassandra at OSCON

2009-04-02 Thread Jonathan Ellis
My proposal to present on Cassandra at OSCON this year was accepted. OSCON will be July 22 to 24 in San Jose. My talk will be on Thursday: http://en.oreilly.com/oscon2009/public/schedule/grid/2009-07-23 I covered similar material at my PyCon open space talk last week (standing room only); it went

Re: Cassandra at OSCON

2009-04-02 Thread Jonathan Ellis
make it? > > /Johan > > Jonathan Ellis wrote: >> My proposal to present on Cassandra at OSCON this year was accepted. >> OSCON will be July 22 to 24 in San Jose.  My talk will be on Thursday: >> http://en.oreilly.com/oscon2009/public/schedule/grid/2009-07-23 >> &g

Re: Sample Client Code

2009-04-09 Thread Jonathan Ellis
That looks reasonable. How are you reading the data back out? The web interface only hits the local machine so it is not very useful in a clustered situation. -Jonathan On Thu, Apr 9, 2009 at 4:02 PM, Sam D wrote: > Hi, > > I am new to Cassandra, just installed the latest version on my machine

Re: Sample Client Code

2009-04-09 Thread Jonathan Ellis
quot;,"content:xml"); >         System.out.println("xml : " + u1.value); > > Sam. > > On Thu, Apr 9, 2009 at 2:07 PM, Jonathan Ellis wrote: >> >> That looks reasonable.  How are you reading the data back out?  The >> web interface only hits the l

Re: Sample Client Code

2009-04-09 Thread Jonathan Ellis
ot; > com.facebook.thrift.transport.TTransportException: Cannot write to null > outputStream >     at com.facebook.thrift.transport.TIOStreamTransport.write(Unknown > Source) >     at com.facebook.thrift.protocol.TBinaryProtocol.writeI32(Unknown Source) > > Thanks > > On

Re: Sample Client Code

2009-04-09 Thread Jonathan Ellis
s due to some connectivity issues.  Its working > fine now. > > I had one more question. > > Can we insert byte arrays as values for the columns ?. I am trying to store > JPEG images. > > Thanks > > On Thu, Apr 9, 2009 at 2:38 PM, Jonathan Ellis wrote: >> >>

change to client API

2009-04-20 Thread Jonathan Ellis
All column values that were declared `string` in thrift are now `binary`. (See https://issues.apache.org/jira/browse/CASSANDRA-29.) For Java that means byte[] instead of String. For Python, because thrift treatment of `string` is broken, that actually means no change -- values were str before an

Re: Questions around API changes

2009-05-01 Thread Jonathan Ellis
On Fri, May 1, 2009 at 5:59 AM, Jonas Bonér wrote: > Hi there. > > First, should I use this ML or the google forum? This one. > * What does the new timestamp arg in > public boolean remove(String tablename, String key, String > columnFamily_column, long timestamp, boolean block) > specify? It's

Re: Questions around API changes

2009-05-01 Thread Jonathan Ellis
On Fri, May 1, 2009 at 11:19 AM, Jonas Bonér wrote: > Thanks for the answers. > > Btw, is the CQL in usable state? No idea. Probably not. :) > If not, any plans? The third cassandra committer from FB who mostly remains silent (forget his name atm) is supposedly planning to work on it more. >

Re: Some questions.

2009-05-02 Thread Jonathan Ellis
On Sat, May 2, 2009 at 6:22 AM, Manuel Crotti wrote: > Now I have some questions: > 1. each "storage-conf.xml" should contain just one of the above > ip-addresses (obviously not the localhost's IP address) in the > section to let cassandra learn the whole topology? Or it must contain the > whole

last api change for 0.3

2009-05-05 Thread Jonathan Ellis
I committed the patch for CASSANDRA-131 which (a) enables exception throwing on the insert methods (so you don't have to explicitly check return value to see if something worked), and (b) moves the _blocking method as a flag into the nonblocking ones. so instead of insert_blocking use insert with

Re: Non relational db meetup - San Francisco, June 11th

2009-05-12 Thread Jonathan Ellis
That's true, but 100 people is about the largest space you're going to find for free, so past that you'd have to start charging people and worrying about taxes and such. Messy. Maybe next year... :) -Jonathan On Tue, May 12, 2009 at 2:02 PM, Jonas Bonér wrote: > Great initiative. > Just sad th

Cassandra 0.3 RC is out

2009-05-13 Thread Jonathan Ellis
Short version: http://incubator.apache.org/cassandra/cassandra-0.3.0-rc.tgz Long version: http://spyced.blogspot.com/2009/05/cassandra-03-release-candidate-and.html Release Candidate means "we fixed all the bugs we could find; help us find more so the release is even more solid." :) I've created

Re: Cassandra 0.3 RC is out

2009-05-13 Thread Jonathan Ellis
Oops, fat-fingered the url: http://incubator.apache.org/cassandra/releases/cassandra-0.3-rc.tgz :) On Wed, May 13, 2009 at 10:28 PM, Jonathan Ellis wrote: > Short version: http://incubator.apache.org/cassandra/cassandra-0.3.0-rc.tgz > Long version: > http://spyced.blogspot.co

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
I've been asked to change the download url to http://people.apache.org/%7Ejbellis/cassandra/cassandra-0.3-rc.tgz to avoid incorrectly implying that this is An Official Release which it is not. -Jonathan

Re: Cassandra 0.3 RC is out

2009-05-14 Thread Jonathan Ellis
of others also contributed patches, bug reports, and testing. -Jonathan On May 14, 2009, at 8:34 AM, Jonas Bonér wrote: > Awesome job Jonathan. > Just getting into the codebase so fast is admirable. > Churning out code like this (and releases) is amazing. Keep it up. > > 200

Re: Node Recovery

2009-05-18 Thread Jonathan Ellis
That's the price you pay for (a) eventual consistency in general and (b) doing read repair in the background specifically. Cassandra also has functionality (called "strong read") to do a quorum read in the foreground and repair if necessary but that is not exposed in Thrift yet -- but even with th

Re: multi-table

2009-05-18 Thread Jonathan ellis
Different apps will have different performance characteristics (and different key domains, which can also be important). So there are operational reasons to prefer cluster-per-app. That said, multi table support is high on my priority list. The changes required are straightforward so I'd

schema example

2009-05-18 Thread Jonathan Ellis
Does anyone have a simple app schema they can share? I can't share the one for our main app. But we do need an example here. A real one would be nice if we can find one. I checked App Engine. They don't have a whole lot of examples either. They do have a really simple one: http://code.google.

Re: schema example

2009-05-19 Thread Jonathan Ellis
ex > > etc. > > Evan > > On Mon, May 18, 2009 at 8:19 PM, Jonathan Ellis wrote: >> Does anyone have a simple app schema they can share? >> >> I can't share the one for our main app.  But we do need an example >> here.  A real one would be nice if we ca

Re: Ingesting from Hadoop to Cassandra

2009-05-21 Thread Jonathan Ellis
Have you benchmarked the batch insert apis? If that is "fast enough" then it's by far the simplest way to go. Otherwise you'll have to use the binarymemtable stuff which is undocumented and not exposed as a client api (you basically write a custom "loader" version of cassandra to use it, I think)

Re: Ingesting from Hadoop to Cassandra

2009-05-21 Thread Jonathan Ellis
MessagingService internally but it's > not that clean. > > Thanks again, > -Alex > > > From: Jonathan Ellis > To: cassandra-user@incubator.apache.org > Sent: Thursday, May 21, 2009 7:44:59 AM > Subject: Re: Ingesting from Hadoop to Cassandra

Re: Ingesting from Hadoop to Cassandra

2009-05-25 Thread Jonathan Ellis
nt.mapred.CassandraImport$PushReduce.sendOut(CassandraImport.java:146) > at > com.yahoo.carmot.client.mapred.CassandraImport$PushReduce.reduce(CassandraImport.java:127) > at > com.yahoo.carmot.client.mapred.CassandraImport$PushReduce.reduce(CassandraImport.java:1) > at org.a

Re: Ingesting from Hadoop to Cassandra

2009-05-26 Thread Jonathan Ellis
apred.CassandraImport$PushReduce.reduce(CassandraImport.java:127) >     at > com.yahoo.carmot.client.mapred.CassandraImport$PushReduce.reduce(CassandraImport.java:1) >     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:318) >     at > org.apache.hadoop.mapred.TaskTracker$Ch

Re: Ingesting from Hadoop to Cassandra

2009-05-27 Thread Jonathan Ellis
On Wed, May 27, 2009 at 6:39 PM, Alexandre Linares wrote: > So it actually doesn't look blocked, but it's crawling.  Of course, in > Hadoop, it always timed out (10 mins), before I could tell that it was > crawling (I think) So, back to the original hypothesis: you need to increase the memory you

Re: Ingesting from Hadoop to Cassandra

2009-05-28 Thread Jonathan Ellis
I can't reproduce with this, there is too much unspecified. (What is a Document? How do I get one?) Attached is a short program that successfully does 100k supercolumn inserts against a default configuration. Can you create a program like this for me to run? (Java is fine; Python is just more

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
We're basically in a roll-your-own benchmark state. Johan can probably give some pointers: http://blog.oskarsson.nu/2009/05/vpork.html. Also see the "how fast is it" section here: http://spyced.blogspot.com/2009/05/cassandra-03-release-candidate-and.html -Jonathan On Wed, Jun 3, 2009 at 3:06 AM

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
hat don't sync each update to disk." > Can Cassandra work in a memory-only mode? Can it be done by just > changing configuration? > > On Wed, Jun 3, 2009 at 10:38 PM, Jonathan Ellis wrote: >> We're basically in a roll-your-own benchmark state.  Johan can >> p

Re: cassandra's performance?

2009-06-03 Thread Jonathan Ellis
he > read path and the write path will never hit disk. > > > On Wed, Jun 3, 2009 at 5:48 PM, Jonathan Ellis wrote: >> Cassandra is not designed to work memory-only.  It's designed designed >> to use disk for durability and to accommodate using large sets of >> da

Re: questions about operations

2009-06-04 Thread Jonathan Ellis
On Thu, Jun 4, 2009 at 12:33 AM, Thorsten von Eicken wrote: > I'm looking at the cassandra data model and operations and I'm running into > a number of questions I have not been able to answer: > > - what does get_columns_since do? I thought there's only one version of a > column stored. I'm puzz

Re: questions about operations

2009-06-04 Thread Jonathan Ellis
On Thu, Jun 4, 2009 at 10:01 AM, Thorsten von Eicken wrote: > Ah, got it, I forgot about the time-sorted CFs. So does this mean that if I > call get_columns_since on a name-sorted CF I will get an invalid request > exception? And also if I call get_slice_by_name_range or get_slice_by_names > on a

Re: Database backstore

2009-06-11 Thread Jonathan Ellis
I suppose you could do that either directly from your client or with a proxy, but if your rdbms can handle the write volume then just use replication to handle the reads. Typically people move to Cassandra and other distributed dbs when they need to scale more writes than you can do on an rdbms.

Re: Viability of running on EC2

2009-06-13 Thread Jonathan Ellis
IMO the biggest downside to running on EC2 is that IO is terrible. I haven't done benchmarks, but anecdotally disk performance in particular seems like an order of magnitude slower than you'd get on non-virtual disks. So that is worth investigating before assuming that the price/performance on EC

Re: Viability of running on EC2

2009-06-13 Thread Jonathan Ellis
t; Thanks again, > > -Anthony > > On Sat, Jun 13, 2009 at 11:33:21AM -0500, Jonathan Ellis wrote: >> IMO the biggest downside to running on EC2 is that IO is terrible.  I >> haven't done benchmarks, but anecdotally disk performance in >> particular seems like an order of magni

Re: Querying columns return strange characters

2009-06-15 Thread Jonathan Ellis
byte[].toString is not the inverse of String.getBytes; you need to use new String(byte[]) for that. fyi, the characters you see are [: this is an array B: of bytes dcb03b: memory address this will let you recognize such output in the future :) -Jonathan On Mon, Jun 15, 2009 at 11:26 AM, Ivan C

Re: Distributed filtering / aggregation

2009-06-17 Thread Jonathan Ellis
There's some preliminary support for running server-side filters (see CalloutManager.java) but basically the first person who needs this functionality gets to finish coding it up. :) I'm happy to help you get started but it's not something we're going to need soon. -Jonathan On Wed, Jun 17, 2009

Re: Data persistency

2009-06-17 Thread Jonathan Ellis
You're using internal APIs. Don't do that unless you know what you're doing. :) The client API is in Cassandra.Client. We have some sample code here: http://wiki.apache.org/cassandra/ClientExamples (although none in Java yet, it should still be pretty clear.) -Jonathan On Wed, Jun 17, 2009 at

Re: Data persistency

2009-06-18 Thread Jonathan Ellis
;         column_arr2.add(new column_t("c2", "v2".getBytes(), now)); >         column_arr2.add(new column_t("c3", "v3".getBytes(), now)); >     superColumn_arr.add(new superColumn_t("sc1", column_arr2)); >     bt.c

Re: Database backstore

2009-06-22 Thread Jonathan Ellis
o it. Voldemort seems > to be quite extensible that way and I think it's quite suitable for an > application that has high read/write ratio. > > > Jonathan Ellis wrote: >> >> I suppose you could do that either directly from your client or with a >> proxy,

Re: New table and column families

2009-06-23 Thread Jonathan Ellis
you'll need to (a) make sure you have the latest trunk (b) wipe your data, commitlog, and system directories, since adding new tables or columnfamilies non-destructively is not yet supported (see https://issues.apache.org/jira/browse/CASSANDRA-44) -Jonathan On Tue, Jun 23, 2009 at 8:55 AM, Ivan

Re: Question about cassandra (replication)

2009-06-25 Thread Jonathan Ellis
Rather than post the same question verbatim, it would be more useful if you explained what you still don't understand after Alexander and Sandeep's explanations on the google group. (http://groups.google.com/group/cassandra-user/browse_thread/thread/4330e415e959e9d9) On Thu, Jun 25, 2009 at 9:11

Re: Question about cassandra (replication)

2009-06-25 Thread Jonathan Ellis
On Thu, Jun 25, 2009 at 10:10 AM, Harold Lim wrote: > > Hi, > > Is the replication factor configurable? For example, Can I configure the > replication factor per column-family (e.g., 5 for column-family a and 3 for > column-family b). It is currently only configurable globally. It may make sens

Re: schema example

2009-07-03 Thread Jonathan Ellis
ing error on r789419: > > InvalidRequestException: get_slice_from requires CF indexed by name > > Evan > > On Tue, May 19, 2009 at 8:00 PM, Jonathan Ellis wrote: >> Mail storage, man, I think pretty much anything I could come up with >> would look pretty simplistic compared to what "re

Re: schema example

2009-07-03 Thread Jonathan Ellis
On Fri, Jul 3, 2009 at 8:53 PM, Evan Weaver wrote: > (From talking on IRC): > > I think this boils down to the offset/limit vs. token/limit debate. > > Token/limit is fine in all cases for me, but you still have to be able > to query the head of the list (with a limit, but no token) to get > starte

Re: [Announce] CassandraClient 0.1 for Ruby released

2009-07-04 Thread Jonathan Ellis
Nice! On Sat, Jul 4, 2009 at 4:59 AM, Evan Weaver wrote: > I am pleased to release: > > cassandra_client 0.1 > > A Ruby client for the Cassandra distributed database. > > http://blog.evanweaver.com/files/doc/fauna/cassandra_client/ > http://github.com/fauna/cassandra_client/ > > Evan > > -- > Evan

Re: cassandra Cli example from wiki error

2009-07-06 Thread Jonathan Ellis
This is a known problem in trunk. It's fixed by the patch in issue 272, which should be applied tonight or tomorrow. -Jonathan On Mon, Jul 6, 2009 at 7:27 PM, Kevin Castiglione wrote: > hi > i just got cassandra compiled. > but the cli example from wiki is not working. the conf files are untouch

Re: cassandra Cli example from wiki error

2009-07-06 Thread Jonathan Ellis
n you tell me which svn version i can use so that it is working? > thanks again! > On Mon, Jul 6, 2009 at 5:31 PM, Jonathan Ellis wrote: >> >> This is a known problem in trunk.  It's fixed by the patch in issue >> 272, which should be applied tonight or tomorrow. >>

Re: problems with python client

2009-07-07 Thread Jonathan Ellis
you want start='' finish='' offset=0 On Tue, Jul 7, 2009 at 8:01 AM, Kevin Castiglione wrote: > i have inserted a row into the table Table1 and Standard1 column family. And > this works with the cassandra-cli > > cassandra> get Table1.Standard1['1'] > COLUMN_TIMESTAMP = 1246942866; COLUMN_VALUE =

Re: problems with python client

2009-07-07 Thread Jonathan Ellis
On Tue, Jul 7, 2009 at 8:19 AM, Kevin Castiglione wrote: > thanks a lot for this! it works. > can you pl. explain what start, finish, isAscending are? start = column name to start with finish = " " to stop with ascending = order to return columns in > also the value i pass to offset gets passed t

Re: problems with python client

2009-07-07 Thread Jonathan Ellis
On Tue, Jul 7, 2009 at 8:31 AM, Kevin Castiglione wrote: > you can see that i passed the value -1 to offset and in the cassandra server > log, it is received as the argument limit. > offset and limit mean different things right? is this a problem in python > client? or am i missing something here?

Re: Up and Running with Cassandra

2009-07-07 Thread Jonathan Ellis
Before 0.4 is released. :) The user-facing API is more of an immediate pain point (tickets 139, 185, 240), but the disk format change would be next in my mind. -Jonathan On Tue, Jul 7, 2009 at 1:06 PM, Kevin Castiglione wrote: > any ideas when this will happen? > thanks > > On Tue, Jul 7, 2009 a

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
what version are you trying to run? on what platform? On Thu, Jul 9, 2009 at 12:04 PM, wrote: > I did set it up as the readme file instructed but i encountered this error, > Can you please suggest how i fix this > thanks > > cassandra]$ bin/cassandra -f > Listening for transport dt_socket at add

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
tup cassandra > Thanks a lot > > On Thu, Jul 9, 2009 at 10:27 AM, Jonathan Ellis wrote: >> >> what version are you trying to run?  on what platform? >> >> On Thu, Jul 9, 2009 at 12:04 PM, wrote: >> > I did set it up as the readme file instructed but i

Re: problem running cassandra

2009-07-09 Thread Jonathan Ellis
, 2009 at 3:20 PM, wrote: > why was the web interface removed? > Is there a simple python client for cassandra like python-couchdb > thanks a lot > > On Thu, Jul 9, 2009 at 12:25 PM, Jonathan Ellis wrote: >> >> for 0.3 you can connect to the web interface on port 7002 (co

Re: How to answer queries of form "Give me the top 10 messages"

2009-07-10 Thread Jonathan Ellis
Have you read this? http://blog.evanweaver.com/articles/2009/07/06/up-and-running-with-cassandra/ On Fri, Jul 10, 2009 at 4:43 PM, wrote: > Hey guys > how do we answer queries of type - give me the top 10 messages > or top 10 users and so on > thanks > > Example: SuperColumns for Search Apps > >

Re: Can we connect to every node in cassandra ?

2009-07-12 Thread Jonathan Ellis
Every node assumes each other node listens on the same ports. (This might seem inflexible but it is actually a good policy to enforce.) So just make sure those numbers are consistent across the cluster. On Sun, Jul 12, 2009 at 5:31 PM, wrote: > Yes. There are more ports than just '9160' to consi

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
On Mon, Jul 13, 2009 at 12:37 AM, Sandeep Tata wrote: > What hardware are you running one? How long does the slowdown last ? >  There are a few reasons for temporary slowdowns ... perhaps the JVM > started GCing? Every time someone has reported this symptom, that has been the problem. The object

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
See the wiki page I linked. On Mon, Jul 13, 2009 at 8:06 AM, rkmr...@gmail.com wrote: > > how do i find out if JVM is GCing? > > On Sun, Jul 12, 2009 at 10:37 PM, Sandeep Tata > wrote: >> >> What hardware are you running one? > >  dual quadcore intel xeon 2.0 ghz, 32GB ram, and hardware raid conf

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
Cassandra is replaying the transaction log and preloading SSTable indexes. This is normal. On Mon, Jul 13, 2009 at 8:10 AM, rkmr...@gmail.com wrote: > when i stop cassandra and start it again, this is what is printed. it takes > just a couple of seconds for this to run. > and after that it become

Re: cassandra slows down after inserts

2009-07-13 Thread Jonathan Ellis
decrease On Mon, Jul 13, 2009 at 8:53 AM, rkmr...@gmail.com wrote: > On Mon, Jul 13, 2009 at 6:03 AM, Jonathan Ellis wrote: >> >> On Mon, Jul 13, 2009 at 12:37 AM, Sandeep Tata >> wrote: >> > What hardware are you running one? How long does the slowdown last ? >

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-14 Thread Jonathan Ellis
On Tue, Jul 14, 2009 at 8:33 AM, Mark Robson wrote: > Cassandra doesn't provide the guarantees about the latest changes being > available from any given node, so you can't really use it in such an > application. > > I don't know if the "blocking" variants of the write operations make any > more gua

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-14 Thread Jonathan Ellis
trade write latency and availability (since the write will fail if one of the target nodes is down) for 100% consistency and reduced read latency -Jonathan On Tue, Jul 14, 2009 at 9:18 AM, Mark Robson wrote: > > > 2009/7/14 Jonathan Ellis >> >> On Tue, Jul 14, 2009 at 8:33

Re: one server or more servers?

2009-07-14 Thread Jonathan Ellis
gossip distributes the cluster status. the seeds are there to be an initial contact point. On Tue, Jul 14, 2009 at 10:04 AM, wrote: > Hey mark > thanks for the detailed reply explaining the example of Seeds > > How do we add servers other than Seeds as there is no such place in conf > file > > t

Re: one server or more servers?

2009-07-14 Thread Jonathan Ellis
the new servers contact the seeds, not the other way around On Tue, Jul 14, 2009 at 10:10 AM, wrote: > Mark and Jonathan > I m lost here > Dont we need to specify atleast the server ip address in  the conf file. How > would cassandra know which ips they are running in ie the other servers. > I ca

Re: replica on in the beginning or added later

2009-07-14 Thread Jonathan Ellis
although the repair code Stu is working on (https://issues.apache.org/jira/browse/CASSANDRA-193) could handle increasing the replica count, IMO there's little sense in relying any more on features that don't yet exist than necessary. :) On Tue, Jul 14, 2009 at 10:17 AM, wrote: > as a followup que

Re: replica on in the beginning or added later

2009-07-14 Thread Jonathan Ellis
t; On Tue, Jul 14, 2009 at 8:21 AM, Jonathan Ellis wrote: >> >> although the repair code Stu is working on >> (https://issues.apache.org/jira/browse/CASSANDRA-193) could handle >> increasing the replica count, IMO there's little sense in relying any >> more on

Re: problem running cassandra

2009-07-14 Thread Jonathan Ellis
the bind to port was successful; the ones to the messagingservice ports were not On Tue, Jul 14, 2009 at 10:59 PM, wrote: > http://pastie.org/546395 > > get this eror but > > cassandra]$  sudo netstat -apn | grep |wc -l > > is empty > > i wonder if this is a known issue > > thanks >

Re: Best way to use a Cassandra Client in a multi-threaded environment?

2009-07-15 Thread Jonathan Ellis
IIRC thrift makes no effort to generate threadsafe code. which makes sense in an rpc-oriented protocol really. On Wed, Jul 15, 2009 at 7:25 PM, Joel Meyer wrote: > Hello, > Are there any recommendations on how to use Cassandra Clients in a > multi-threaded front-end application (java)? Is the Cli

Re: Best way to use a Cassandra Client in a multi-threaded environment?

2009-07-15 Thread Jonathan Ellis
What I mean is, if you have client.rpc1() it doesn't really matter if you can do client.rpc2() from another thread or not, since it's dumb. :) On Wed, Jul 15, 2009 at 7:41 PM, Ian Holsman wrote: > > On 16/07/2009, at 10:35 AM, Jonathan Ellis wrote: > >> IIRC

Re: Best way to use a Cassandra Client in a multi-threaded environment?

2009-07-15 Thread Jonathan Ellis
On Wed, Jul 15, 2009 at 8:13 PM, Ian Holsman wrote: > ugh. > if this is a byproduct of thrift it is. > we should have another way of getting to > the backend. > serialization is *not* a desired feature for most people ;-0 maybe not, but that's how every single database client works that I can th

Re: one server or more servers?

2009-07-16 Thread Jonathan Ellis
the FAQ talks about using listenaddress: http://wiki.apache.org/cassandra/FAQ On Thu, Jul 16, 2009 at 1:49 AM, wrote: > if i make listenaddress blank > i get in oneserver > binding to 127.0.0.1 > in 2nd server > sometimes to the ip address of the server > in 3rd server > WARN - Exception was gene

Re: WARN - Unable to find a live Endpoint we might be out of live nodes , This is dangerous !!!!

2009-07-17 Thread Jonathan Ellis
Please don't repeat your question separately on -user, -dev, and irc. If nobody answers it's either because we're busy or we don't know the answer. In this case it's probably a bit of both. :) I've never heard of anyone running into this before so my guess is it's something weird with your networ

Re: Concurrent updates

2009-07-17 Thread Jonathan Ellis
This is the kind of inconsistency that vector clocks can handle but the more simplistic timestamp-based resolution cannot. Of test-and-set vs vector clocks, vector clocks fits cassandra much better. -Jonathan On Fri, Jul 17, 2009 at 9:59 AM, Jun Rao wrote: > This is a case where a test-and-set f

Re: WARN - Unable to find a live Endpoint we might be out of live nodes , This is dangerous !!!!

2009-07-17 Thread Jonathan Ellis
t remove the following line, or various programs > # that require network functionality will fail. > 127.0.0.1 localhost.localdomain localhost localhost > ::1 localhost6.localdomain6 localhost6 > > On Fri, Jul 17, 2009 at 6:09 AM, Jonathan Ellis wrote: >> >> Please

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
own" > I thought Hinted handoff will take care of this Right? Write will never fail > insted it will write to another node right? > > correct me if i am wrong. > > Thanks and Regards, > > > > > > On Tue, Jul 14, 2009 at 7:26 AM, Jonathan Ellis wrote: >>

Re: Scaling from 1 to x (was: one server or more servers?)

2009-07-17 Thread Jonathan Ellis
s w/o losing availability.) -Jonathan > During query it will fail if i only have block for to be 3? > > Regards, > > > > > > On Fri, Jul 17, 2009 at 1:36 PM, Jonathan Ellis wrote: >> >> ck_for to zero, then writes will indeed never fail >> (unless the node the client is ta >

Re: python thrift cassandra: get_slice_super vs get_slice_super_by_names

2009-07-19 Thread Jonathan Ellis
I would guess because kw != 'tofu' On Sun, Jul 19, 2009 at 12:24 AM, wrote: > Why doesnt res return ColumnFamily Related whereas res2 works just fine > thanks? > > timestamp = time.time() > res = client.get_slice_super('Table1', kw, 'Super1','','',True,0,1000) > print res > [] > res2 = client.get

Re: ì¤ ìì§ ì¤ ì thrift.Thrift.TApplicationEx ception: Internal error processing insert

2009-07-19 Thread Jonathan Ellis
That should be partially solved in trunk now that 139 is committed, and more solved when we commit 185 soon. On Sun, Jul 19, 2009 at 3:43 AM, wrote: > Any utf-8 keyword causes cassandra to crash! >

Re: how to delete an entire column family

2009-07-19 Thread Jonathan Ellis
iterate through the keys with get_key_range, and delete the row associated with each key On Sun, Jul 19, 2009 at 3:51 AM, wrote: > In Super-column family Super1 there is a column family Related > How do i delete the entire related column family > thanks

Re: python thrift cassandra: get_slice_super vs get_slice_super_by_names

2009-07-19 Thread Jonathan Ellis
Strange. If you can post a script showing how to reproduce the problem from a fresh database then I can debug it. On Sun, Jul 19, 2009 at 11:23 AM, wrote: > Jon i should have mntioned kw is 'tofu' > that is why it looks quite not right > > On Sun, Jul 19, 2009 at 6:08 AM,

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
Don't run trunk if you're not going to read "svn log." The api changed with the commit of the 139 patches (and it will change again with the 185 ones). look at interface/cassandra.thrift to see what arguments are expected. On Sun, Jul 19, 2009 at 3:31 PM, wrote: > Hey Gasol wu > i regenerated t

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
, mobiledream...@gmail.com wrote: > >> ok >> so which is the version where cassandra python thrift works out of the box >> thanks >> >> On 7/19/09, Jonathan Ellis wrote: Don't run trunk if >> you're not going to read "svn log." >> >>

Re: New cassandra in trunk - breaks python thrift interface (was AttributeError: 'str' object has no attribute 'write')

2009-07-19 Thread Jonathan Ellis
It works fine, it's just not the same as it was two weeks ago. On Sun, Jul 19, 2009 at 3:36 PM, wrote: > ok > so which is the version where cassandra python thrift works out of the box > thanks > > On 7/19/09, Jonathan Ellis wrote: >> >> Don't run trunk

Re: AttributeError: 'str' object has no attribute 'write'

2009-07-19 Thread Jonathan Ellis
Building the java interface is part of the build, but ant has no way to guess which additional client interfaces you want to use, if any. On Sun, Jul 19, 2009 at 6:46 PM, Ian Holsman wrote: > hi Gasol. > shouldn't regeneration of the interface be part of the build process? > > On 20/07/2009, at 3:

Re: a talk on building an email app on Cassandra

2009-07-20 Thread Jonathan Ellis
Nice! On Mon, Jul 20, 2009 at 12:43 PM, Jun Rao wrote: > Last Friday, I gave an IEEE talk on an email app that we built on top of > Cassandra. Below is the link to the slides. I thought some of the people > here might find this interesting. > > http://ewh.ieee.org/r6/scv/computer//nfic/2009/IBM-Ju

Fwd: thrift API changes

2009-07-20 Thread Jonathan Ellis
Oops, I sent this to the old google -user list by mistake the first time. Now that that's gone, I realized the error. -- Forwarded message -- From: Jonathan Ellis Date: Mon, Jul 20, 2009 at 10:10 PM Subject: Re: thrift API changes To: cassandra-u...@googlegroups.com, cass

Re: trunk

2009-07-21 Thread Jonathan Ellis
the internals should be solid but we are in the middle (towards the end of, actually) changing the thrift api pretty drastically. (the colons had to go, and the sooner we bit the bullet, the better. :) see this thread -- http://mail-archives.apache.org/mod_mbox/incubator-cassandra-user/200907.mbo

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
did you read the new section in the config xml explaining how to use a UTF8 comparator? also: thrift itself is just plain broken for unicode support in some languages; see THRIFT-395 I think the short version is that when you have a java server, unicode will work with java or C# clients but not w

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
On Tue, Jul 21, 2009 at 4:06 PM, Jonathan Ellis wrote: > (so if you are using a python client for instance switching to jython > might be a workaround) that is, using the java thrift client, not the python ones.

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
On Tue, Jul 21, 2009 at 4:18 PM, wrote: > Hey jonathan > this is not in the wiki or any documentation. this is trunk. i wrote it a couple days ago. feel free to step in and update the wiki. > does this work in python thrift probably not, given the thrift utf8 bugs. (but you could use BytesTy

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
On Tue, Jul 21, 2009 at 4:21 PM, Jonathan Ellis wrote: >> does this work in python thrift > > probably not, given the thrift utf8 bugs. to correct myself: now that we are using binary data in the thrift api it can't screw us over. so yes, UTF8Type should be fine.

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
you may also want to specify CompareSubcolumnsWith. On Tue, Jul 21, 2009 at 4:27 PM, wrote: > thanks jonathan > trying this > > > On Tue, Jul 21, 2009 at 2:24 PM, Jonathan Ellis wrote: >> >> On Tue, Jul 21, 2009 at 4:21 PM, Jonathan Ellis wrote: >> &

Re: keys and column names cannot be utf-8

2009-07-21 Thread Jonathan Ellis
8Type" FlushPeriodInMinutes="60"/> > > ColumnSort="Time"  CompareWith="UTF8Type" Name="StandardByTime1"/> > CompareSubcolumnsWith="UTF8Type" Name="Super1"/> > Jonathan can you clarify if this will guarantee proper p

  1   2   3   4   5   6   7   8   9   >