Hi Kaz, I'm using my University box and the results are ten times better. The single core 1 GHz processor on my netbook really struggles with psudo dist hadoop, Cassandra server and goraci running over the top. The University box is 3.2GiB Mem Processor 0: Intel Core 2 Duo CPU E8400 @ 3.00GHz Processor 1: Intel Core 2 Duo CPU E8400 @ 3.00GHz
Anyway. I wonder if you can share you gora-cassandra-mapping configuration as well as which partitioner you use within the cassandra.yaml? I am getting a class cast exception when doing CassandraStore.flush() as my mapping config is like this [0]. Do you suggest any improvements/corrections to the mapping? Also I noticed before that we configured yaml to use a different partitioner than RandomPartitioner. Do you also use this with your standalone Cassandra server? If this is the case then I will also change but so far I haven't had any problems. Thanks Lewis [0] https://github.com/keith-turner/goraci/blob/master/src/main/resources/gora-cassandra-mapping.xml On Mon, Jul 23, 2012 at 8:19 PM, Kazuomi Kashii <kazu...@kashii.net> wrote: > Hi Lewis, > > I used Mac with Core2Quad and 8GB memory yesterday. > A single node Cassandra server is running, and Goraci/GORA/Cassandra used > that server. > " goraci.sh Generator 1 25000000" took about 4 hours to complete. > > I saw the message on every 1M nodes written (flushed). > Since gora-cassandra does not support delete() yet, "goraci.sh Delete" did > nothing. > "goraci.sh Verify" took a few dozens of minutes. > > In my understanding, gora-cassandra flushes its buffer only when flush() or > close() is explicitly called. > I have not checked the detail of gora-hbase or gora-accumulo, > but if they flush the buffer more intelligently, we may want gora-cassandra > to support such feature. > > Thanks, > -Kaz > > > > On 7/23/12 11:40 AM, Lewis John Mcgibbney wrote: >> >> Hi Kaz, >> >> On Mon, Jul 23, 2012 at 5:47 PM, Kazuomi Kashii <kazu...@kashii.net> >> wrote: >>> >>> I tried Goraci last night, and I had had some dependency problems. >> >> How did you get on with gora-cassandra and the goraci suite? I've >> shared some of my early experiences with Keith [0]. Unfortunately the >> hardware I'm running the test on in pretty primitive to say the last >> (small notebook) therefore I fear this is limiting the execution of >> the tests and Hadoop jobs are timing out and being killed. Also I have >> a few questions which I would like to reach out on. >> >> 1) When we use this test suite is the cassandra system swapping? How >> can I even find this out? Having spoken to Keith he clarified to me >> that the test writes in multiples of 1M nodes so if this is done in >> swap there will be problems. >> >> 2) How does gora-cassandra handle buffering? Keith also mentioned that >> Goraci will write 1000000 nodes and then call flush. Accumulo and >> Hbase handle this ok. If >> gora-cassandra actually buffered all 1000000 in memory until flush was >> called, then this could be bad with my small amount of memory. >> >> I'm keen to get some documentation on the execution of gora-cassandra >> with this test suite to understand more about the internals an of >> course the limitations of gora-cassandra. >> >> Any comments you have at this stage would be excellent. >> >>> For my case, I added some dependencies to Goraci's pom.xml, and it >>> worked, >>> but I am not sure that it is the same or similar issue to yours. >>> I used a standalone Cassandra server, not an embedded one, so I did not >>> include cassandra-all. >> >> I'm the same as you here. I suppose this dep can maybe be dropped from >> the goraci pom,xml in this instance then. >> >> Best >> Lewis >> >> [0] https://github.com/keith-turner/goraci/pull/7 > > -- Lewis