Re: GCInspector info messages in cassandra log

2012-08-16 Thread Tamar Fraenkel
Thank you very much! *Tamar Fraenkel* Senior Software Engineer, TOK Media ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On Thu, Aug 16, 2012 at 12:11 AM, aaron morton aa...@thelastpickle.com wrote: Is there anything to do

SSTable Index and Metadata - are they cached in RAM?

2012-08-16 Thread Maciej Miklas
Hi all, the bloom filter for row keys is always in RAM. What about the SSTable index and metadata? Are they cached by Cassandra, or does it rely on memory-mapped files? Thanks, Maciej

Re: Migrating to a new cluster (using SSTableLoader or other approaches)

2012-08-16 Thread Filippo Diotalevi
ERROR 09:02:38,614 Error in ThreadPoolExecutor java.lang.RuntimeException: java.io.EOFException: unable to seek to position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db (65737276 bytes) in read-only mode This one looks like an error. Can you run nodetool with

Re: indexing question related to playOrm on github

2012-08-16 Thread Hiller, Dean
Yes, the synch may work, and no, I do not want a transaction…I want a different kind of eventually consistent That might work. Let's say server 1 sends a mutation (65 is the pk) Remove: bill65 Add tim65 Server 2 also sends a mutation (65 is the pk) Remove: bill65 Add mike65 What everyone does
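The two mutations above can be modeled with a small sketch. This is a simplified illustration in Python, not playOrm or Cassandra code: the index row is treated as a plain set of column names, and the helper `apply_mutation` is hypothetical. It assumes, as in the example, that each server removes the old entry and adds its own new one.

```python
def apply_mutation(index_row, remove, add):
    """Return a new index row with `remove` deleted and `add` inserted.

    Hypothetical helper: the row is modeled as a plain set of column names.
    """
    updated = set(index_row)
    updated.discard(remove)  # removing a column that is already gone is a no-op
    updated.add(add)
    return updated


start = {"bill65"}
server1 = ("bill65", "tim65")   # Remove: bill65, Add: tim65
server2 = ("bill65", "mike65")  # Remove: bill65, Add: mike65

# Apply the two mutations in both possible orders.
one_then_two = apply_mutation(apply_mutation(start, *server1), *server2)
two_then_one = apply_mutation(apply_mutation(start, *server2), *server1)

print(sorted(one_then_two))
print(sorted(two_then_one))
```

Under this model both orders leave two index entries for pk 65 (tim65 and mike65), so the index no longer points at a single value for that key. That is the conflict being discussed in this thread.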

wild card on query

2012-08-16 Thread Swathi Vikas
Hi, I am trying to run a query on a Cassandra cluster with a predicate on the row key. I have a column family called Users and rows with row keys like projectid_userid_photos. Each user within a project can have rows like projectid_userid_blog, projectid_userid_status and so on. I want to retrieve all

Re: indexing question related to playOrm on github

2012-08-16 Thread Hiller, Dean
Maybe this would be a special type of column family that could contain these as my other tables definitely don't want the feature below by the way. Dean On 8/16/12 6:29 AM, Hiller, Dean dean.hil...@nrel.gov wrote: Yes, the synch may work, and no, I do not want a transaction…I want a different

nodetool repair uses insane amount of disk space

2012-08-16 Thread Michael Morris
Occasionally as I'm doing my regular anti-entropy repair I end up with a node that uses an exceptional amount of disk space (node should have about 5-6 GB of data on it, but ends up with 25+GB, and consumes the limited amount of disk space I have available) How come a node would consume 5x its

Many ParNew collections

2012-08-16 Thread Rene Kochen
Hi I have a cluster of 7 nodes: - Windows Server 2008 - Cassandra 0.7.10 - The nodes are identical (hardware, configuration and client request load) - Standard batch file with 8GB heap - I use disk_access_mode = standard - Random partitioner - TP stats shows no problems - Ring command shows no

Opscenter 2.1 vs 1.3

2012-08-16 Thread Robin Verlangen
Hi there, I just upgraded to opscenter 2.1 (from 1.3). It appears that my writes have tripled. Is this a change in the display/measuring of opscenter? Best regards, Robin Verlangen *Software engineer* W http://www.robinverlangen.nl E ro...@us2.nl Disclaimer: The information contained in

C++ Bulk loader and Result set streaming.

2012-08-16 Thread Swathi Vikas
Hi All, I am using the C++ client libQtCassandra. I have two questions. 1) I want to bulk load data into Cassandra through the C++ interface. It is required by my group, where I am doing an internship. I could bulk load using sstableloader as specified in Datastax

'WHERE' with several indexed columns

2012-08-16 Thread A J
Hi, if I have a WHERE clause in CQL with several 'AND's and each column is indexed, which index(es) are used? Just the first field in the WHERE clause, or all the indexes involved in the clause? Also, is an index used only with an equality operator, or also with a greater-than/less-than comparator

Why the StageManager thread pools have 60 seconds keepalive time?

2012-08-16 Thread Guillermo Winkler
Hi, I have a Cassandra cluster where I'm seeing a lot of thread thrashing from the mutation pool. MutationStage:72031 Threads get created and disposed of in batches of 100 every few minutes. Since it's a 16-core server, concurrent_writes is set to 100 in cassandra.yaml. concurrent_writes:

Re: SSTable Index and Metadata - are they cached in RAM?

2012-08-16 Thread aaron morton
What about SSTable index, Not sure what you are referring to there. Each row in an SSTable has a bloom filter and may have an index of columns. This is not cached. See http://thelastpickle.com/2011/07/04/Cassandra-Query-Plans/ or

Re: Migrating to a new cluster (using SSTableLoader or other approaches)

2012-08-16 Thread aaron morton
Which nodetool command are you referring to? (info, cfstats, ring, …) My bad, I meant to write sstableloader. Do I modify the log4j-tools.properties in $CASSANDRA_HOME/conf to set the nodetool logs to DEBUG? You can use the --debug option with sstableloader to get a better exception message.

Re: wild card on query

2012-08-16 Thread aaron morton
I want to retrieve all the photos from all the users of a certain project. My SQL-like query will be select projectid * photos from Users. How can I run this kind of row key predicate while executing a query on Cassandra? You cannot / should not do that using the data model you have. (i.e. you

Re: indexing question related to playOrm on github

2012-08-16 Thread aaron morton
I am not sure synchronization fixes that… It would be kind of nice if the column bill65 would not actually be removed until after all servers are eventually consistent... Not sure that's possible. You can either serialise updating your custom secondary index on the client side or resolve

Re: nodetool repair uses insane amount of disk space

2012-08-16 Thread aaron morton
What version are you using? There were issues with repair using lots-o-space in 0.8.X; it's fixed in 1.X. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/08/2012, at 2:56 AM, Michael Morris michael.m.mor...@gmail.com wrote: Occasionally

Re: Opscenter 2.1 vs 1.3

2012-08-16 Thread aaron morton
You may have better luck on the DataStax forums http://www.datastax.com/support-forums/ Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/08/2012, at 4:36 AM, Robin Verlangen ro...@us2.nl wrote: Hi there, I just upgraded to

Re: C++ Bulk loader and Result set streaming.

2012-08-16 Thread aaron morton
But I couldn't find any information on bulk loading using the C++ client interface. You cannot. To bulk load data use the sstableloader, otherwise you need to use the RPC / CQL API. 2) I want to retrieve all the results of the query (not just the first 100 results) using the C++ client. Is there

Omitting empty columns from CQL SELECT

2012-08-16 Thread Mat Brown
Hello all, I've noticed that when performing a SELECT statement with a list of columns specified, Cassandra returns all columns in the resulting row(s) even if they have no value. This creates an apparently considerable amount of transport and deserialization overhead, particularly in one use

Re: 'WHERE' with several indexed columns

2012-08-16 Thread aaron morton
If I have a WHERE clause in CQL with several 'AND' and each column is indexed, which index(es) is(are) used ? The most selective based on the average number of columns per row https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/db/index/keys/KeysSearcher.java Also is
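As a rough illustration of "most selective based on the average number of columns per row", the sketch below picks, among the indexed columns in a WHERE clause, the one whose index rows are narrowest on average. It is Python rather than the Java in KeysSearcher, and the names `pick_most_selective` and `stats`, along with the sample numbers, are assumptions made up for illustration.

```python
def pick_most_selective(mean_columns):
    """Return the indexed column whose index rows have the fewest columns on average.

    `mean_columns` maps an indexed column name to the average number of
    columns per row in its index (hypothetical statistics).
    """
    if not mean_columns:
        raise ValueError("no indexed columns in the WHERE clause")
    return min(mean_columns, key=mean_columns.get)


# Made-up statistics: average number of columns per index row.
stats = {"state": 50000.0, "city": 1200.0, "zip": 35.0}
chosen = pick_most_selective(stats)
print(chosen)
```

A narrower index row means fewer candidate rows per indexed value, so only that one index is read. The remaining clauses would then presumably be checked against the rows it returns.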

Re: nodetool repair uses insane amount of disk space

2012-08-16 Thread Michael Morris
Upgraded to 1.1.3 from 1.0.8 about 2 weeks ago. On Thu, Aug 16, 2012 at 5:57 PM, aaron morton aa...@thelastpickle.com wrote: What version are you using? There were issues with repair using lots-o-space in 0.8.X, it's fixed in 1.X Cheers - Aaron Morton Freelance Developer

Re: Why the StageManager thread pools have 60 seconds keepalive time?

2012-08-16 Thread aaron morton
That's some pretty old code. I would guess it was done that way to conserve resources. And _I think_ thread creation is pretty lightweight. Jonathan / Brandon / others - opinions? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On

Cassandra 1.0 row deletion

2012-08-16 Thread Terry Cumaranatunge
Hi, We have a Cassandra 1.0 cluster that we run with RF=3 and perform operations using a consistency level of quorum. We use batch_mutate for all inserts and updates for atomicity across column families with the same row key, but use the thrift interface remove API call in C++ to delete a row so
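For reference, the quorum arithmetic behind "RF=3 at quorum" can be written out as a short sketch. The function name `quorum` is just an illustrative helper, not a client API. It uses the usual definition of a quorum as a majority of replicas.

```python
def quorum(replication_factor):
    """Number of replicas that must respond for a QUORUM operation (a majority)."""
    return replication_factor // 2 + 1


rf = 3
needed = quorum(rf)
tolerated_down = rf - needed  # replicas that can be unavailable

print(needed, tolerated_down)
```

With RF=3 a quorum is 2 replicas, so quorum reads and writes can still succeed with one replica down, and any quorum read overlaps any quorum write in at least one replica.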

Re: Cassandra 1.0 row deletion

2012-08-16 Thread Derek Williams
On Thu, Aug 16, 2012 at 9:08 PM, Terry Cumaranatunge cumar...@gmail.com wrote: We have a Cassandra 1.0 cluster that we run with RF=3 and perform operations using a consistency level of quorum. We use batch_mutate for all inserts and updates for atomicity across column families with the same row