have you put your commit log on a disk by itself? not a logical
partition shared by oracle or cassandra "data". this will make a
difference, as you don't want the cassandra commit logs competing with
other OS and oracle I/O. look in storage-conf.xml and see if you can
move this.
also check "MemtableThroughputInMB". if you are doing a _lot_ of writes
you probably want to jack this up a bunch to get through the migration,
then put it back down for normal operation. the default out of the box
is too low i believe.
On 05/10/2010 02:05 AM, David Boxenhorn wrote:
I read something like 80,000 rows from Oracle and write them to
Cassandra in chunks of 1000 rows - so I'm supposedly working to
Cassandra's strength and Oracle's weakness.
Reading 1000 rows from Oracle is "instantaneous", writing them takes
maybe 30 seconds. Not too much data per row, maybe 1K.
On Mon, May 10, 2010 at 11:48 AM, Ran Tavory <ran...@gmail.com
<mailto:ran...@gmail.com>> wrote:
Hector uses tsocket. not sure what you mean by "buffered" - is
that framed? Hector by default does not use framed.
The code is here if you'd like to have a look
http://github.com/rantav/hector/blob/master/src/main/java/me/prettyprint/cassandra/service/CassandraClientFactory.java#L77
However, I find it hard to believe that the actual connection is
the slowing factor.
Roughly speaking, cassandra is fast on writes and slow on reads.
Exact numbers are per-scenario so it's hard to say, but if you
only write and objects are small then from my experience you
should expect a few k writes per second on a single host. How much
do you see?
There are many configuration factors and they all depend on
expected usage and available h/w.
On Mon, May 10, 2010 at 11:27 AM, vd <vineetdan...@gmail.com
<mailto:vineetdan...@gmail.com>> wrote:
What is the complete code string you are using to connect with
cassandra from Java code
On Mon, May 10, 2010 at 1:49 PM, David Boxenhorn
<da...@lookin2.com <mailto:da...@lookin2.com>> wrote:
I don't know what "TSocket or the buffered one" means.
Maybe I should know?
I'm using Hector. Does that explain anything?
On Mon, May 10, 2010 at 11:15 AM, vd
<vineetdan...@gmail.com <mailto:vineetdan...@gmail.com>>
wrote:
Hi
what is it that you are using to connect with
cassnadra TSocket or the buffered one ?
____________________________________
_______________________________________
On Mon, May 10, 2010 at 1:29 PM, David Boxenhorn
<da...@lookin2.com <mailto:da...@lookin2.com>> wrote:
I'm running Java on the client, jdbc queries on
Oracle, Hector on Cassandra.
The Cassandra and Oracle database designs are
radically different, as you might guess.
I have no doubt that Cassandra can be tuned, in a
multiple-server cluster, to have superior
throughput (that's why I'm doing it!). But for
now, it's really frustrating my development effort
that Cassandra is so slow. Can't I get it up to
twice as slow as Oracle in my configuration?
On Mon, May 10, 2010 at 10:47 AM, vd
<vineetdan...@gmail.com
<mailto:vineetdan...@gmail.com>> wrote:
Hi David
If I may ask...how do you plan to import data
from oracle to cassandra ?
As answer AFAIK cassandra's true ability comes
into play when running on more than one
machine...and please share how you are making
comparisons like on writes or reads from
cassandra.
_______________________________________
_______________________________________
On Mon, May 10, 2010 at 1:04 PM, David
Boxenhorn <da...@lookin2.com
<mailto:da...@lookin2.com>> wrote:
I'm running Oracle and Cassandra on my
machine, trying to import my data to
Cassandra from Oracle.
In my configuration Oracle is about ten
times faster than Cassandra. Cassandra has
out-of-the-box tuning.
I am new to Cassandra. How do I begin
trying to tune it?
Thanks.