Re: Can't Run: "Address already in use" Error

2010-01-08 Thread Jonathan Ellis
stop whatever is using port 8080 or tell cassandra to use a different one (in cassandra.in.sh); see also http://wiki.apache.org/cassandra/RunningCassandra On Fri, Jan 8, 2010 at 7:13 PM, Dusty Reagan wrote: > Hi! > I'm trying to get Casandra running on Ubuntu 8.04.2 LTS (hardy). I've got it > ins

Can't Run: "Address already in use" Error

2010-01-08 Thread Dusty Reagan
Hi! I'm trying to get Casandra running on Ubuntu 8.04.2 LTS (hardy). I've got it installed but when I try and run it with "casandra -f" I get some lines of debug info then the following error twice in a row: ERROR - Exception encountered during startup. java.net.BindException: Address already in

Re: Data Model Index Text

2010-01-08 Thread Ian Holsman
Hi ML. this sounds more like a job for SOLR, but if you want to do this with cassandra, you should look at Jake's Lucandra http://github.com/tjake/Lucandra you should also look at http://nicklothian.com/blog/2009/10/27/solr-cassandra-solandra/ I wouldn't recommend you building your own IR eng

Re: How to UUID in .Net

2010-01-08 Thread Jonathan Ellis
I didn't see any C# libraries that generate type 1 UUIDs. You might have to port this one from java: http://johannburkard.de/software/uuid/ 2010/1/8 Nguyễn Minh Kha : > Hi, I'm writing Cassandra in .Net (C Sharp) but I have a problem on gen a > UUID for my project. > I used Guid to gen UUID Versi

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Jonathan Ellis
On Fri, Jan 8, 2010 at 6:08 PM, Anthony Molinaro wrote: > How soon before 0.5 out, it'll take some time to package test and launch > so I have to figure out if it's worth it to upgrade now or wait a bit > (assuming 0.5 is going to be soon). We have had a couple bugs reported against rc3 (https://

Re: Data Model Index Text

2010-01-08 Thread Drew Schleck
I think I am reading this right, basically you want to query for a word and find all of the documents that contain it? While there may be a better way to do this, the way the people at Facebook do it is with supercolumns. Inside the supercolumn column family they have columns for every word, such a

Re: How to UUID in .Net

2010-01-08 Thread Tatu Saloranta
2010/1/8 Nguyễn Minh Kha : > Hi, I'm writing Cassandra in .Net (C Sharp) but I have a problem on gen a > UUID for my project. > I used Guid to gen UUID Version 1 but when I add to Cassandra thow an > exception "TimeUUID only makes sense with version 1 UUIDs" > > I used uuidgen.exe (Windows SDK) to

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Anthony Molinaro
On Fri, Jan 08, 2010 at 04:59:01PM -0600, Jonathan Ellis wrote: > MSP going up and down doesn't seem like a big deal. is load > abnormally high on that box? % uptime 23:50:57 up 93 days, 2:56, 1 user, load average: 3.27, 3.12, 3.12 So not too high. Jconsole shows about 35% CPU Usage > did

Re: How to UUID in .Net

2010-01-08 Thread JKnight JKnight
Cassandra server use UUID version 1. Maybe the uuidgen.exe does not generator correct UUID version. 2010/1/9 Nguyễn Minh Kha > Hi, I'm writing Cassandra in .Net (C Sharp) but I have a problem on gen a > UUID for my project. > I used Guid to gen UUID Version 1 but when I add to Cassandra thow an

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Jonathan Ellis
MSP going up and down doesn't seem like a big deal. is load abnormally high on that box? did you check the log? (maybe it is sending hinted data to the node that was down 12h?) "loads of timeouts" isn't much to go on, again, what is your telemetry showing? high load? large work queues? compa

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Anthony Molinaro
So it seems to correlate with writes, the machines with pending tasks in their MESSAGE-SERIALIZER-POOL also have a high number of write counts, so my keyspace is probably out of balance. Hopefully the tools available in 0.5 will allow me to move keys around to make things a little more evenly dist

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Anthony Molinaro
So I restarted the node with the large number of ROW-READ-STAGE pending tasks, the timeouts are still occuring somewhat randomly, and now MESSAGE-SERIALIZER-POOL seems to be growing on one of the nodes % for h in 02 03 04 05 06 07 08 09 ; do echo "xtr-$h.mkt"; cassandra-nodeprobe -host xtr-$h.mk

Data Model Index Text

2010-01-08 Thread ML_Seda
Hey, I've been reading up on the Cassandra data model a bit, and would like to get some input from this forum on different techniques for a particular problem. Assume I need to index millions of text docs (e.g. research papers), and allow the ability to query them by a given word inside or aroun

Re: Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Jonathan Ellis
if the queued reads is increasing then you're going to OOM eventually, and it will probably freeze (to the clients' perspective) first while it desperately tries to GC enough to continue. i would restart the affected nodes. On Fri, Jan 8, 2010 at 3:15 PM, Anthony Molinaro wrote: > Hi, I had one

Large number of ROW-READ-STAGE pending tasks?

2010-01-08 Thread Anthony Molinaro
Hi, I had one of my machines fail last night (OOM), and upon restarting it about 12 hours later (have to get me some monitoring so I can restart it faster), I've noticed lots of errors like ERROR [pool-1-thread-6915] 2010-01-08 21:10:59,902 Cassandra.java (line 739) Internal error processing mult

Re: Can fix corrupt file? (Compaction step)

2010-01-08 Thread Jonathan Ellis
Can you gzip the sstable that OOMs and send it to me off-list? On Fri, Jan 8, 2010 at 11:26 AM, JKnight JKnight wrote: > Dear Mr Jonathan, > With the larger sstable, I don't have any problem. So I think that the error > does not related to the heap size. And my data model does not use > SuperColu

Re: Advise for choice

2010-01-08 Thread scott w
Good point although there has been very recent work integrating solr with katta so you can have your cake and eat it too: http://developer.yahoo.net/blogs/theater/archives/2009/12/hadoop_bay_area_user_group_session_1.html On Fri, Jan 8, 2010 at 1:09 AM, Erich Nachbar wrote: > I can give you a

How to UUID in .Net

2010-01-08 Thread Nguyễn Minh Kha
Hi, I'm writing Cassandra in .Net (C Sharp) but I have a problem on gen a UUID for my project. I used Guid to gen UUID Version 1 but when I add to Cassandra thow an exception "TimeUUID only makes sense with version 1 UUIDs" I used uuidgen.exe (Windows SDK) to gen this Guid. Pls help me resolve th

cassandra gem 0.7 released

2010-01-08 Thread Ryan King
I just pushed an updated version of our gem that's compatible with 0.5. http://gemcutter.org/gems/cassandra -ryan

Re: Error in Compact step - java.io.UTFDataFormatException: malformed input around byte 13

2010-01-08 Thread JKnight JKnight
Thanks Jonathan. I tried 0.5 rc2, 0.5 rc3 and the error still exists. On Thu, Jan 7, 2010 at 8:58 AM, Jonathan Ellis wrote: > Try upgrading to 0.5 rc2. > > On Wed, Jan 6, 2010 at 7:48 PM, JKnight JKnight > wrote: > > Thanks Mr Jonathan. > > > > But I did not change anything. > > > > On Wed, Ja

Re: Can fix corrupt file? (Compaction step)

2010-01-08 Thread JKnight JKnight
Dear Mr Jonathan, With the larger sstable, I don't have any problem. So I think that the error does not related to the heap size. And my data model does not use SuperColumn, so I think the the number of columns in row is not the problem. I have tried to delete error row and accept data lost. On

Re: Graduation

2010-01-08 Thread Ryan King
+1 from me, for sure. -ryan On Thu, Jan 7, 2010 at 10:56 AM, Eric Evans wrote: > > Greetings, > > It seems hard to believe, but it's been a little over a year now since > Cassandra entered the Apache Incubator. The project has made an > impressive amount of progress in that time, from expanding

Re: Advise for choice

2010-01-08 Thread Erich Nachbar
I can give you a few more data points. For one of my last projects, I built the search index of one of the largest IM aggregators. I got around 2.5k chat msg/s, keeping 400M messages in my index. I looked at Solr and while it is very convenient/luxurious, there was no way in hell I could scale it