client.get_count query

2010-03-11 Thread Sonny Heer
What does this query return? Is there a way to do a range query and get the row count? (e.g. row start = TOW' row end = 'TOWZ') Thanks

Re: client.get_count query

2010-03-11 Thread Sonny Heer
Evans eev...@rackspace.com wrote: On Thu, 2010-03-11 at 11:44 -0800, Sonny Heer wrote: Thanks.  Are there plans to implement a row count feature? Not that I'm aware of. I have a model which doesn't store any columns since I could potentially have a large # of columns.  So all the valuable

compaction threshold

2010-03-01 Thread Sonny Heer
* We recommend adjusting the compaction threshold to 0, while the import is running. After the import, you need * to run `nodeprobe -host IP flush_binary Keyspace` on every node, as this will flush the remaining data still left * in memory to disk. Then it's recommended to adjust the

Re: compaction threshold

2010-03-01 Thread Sonny Heer
I get: Min threshold must be at least 2 On Mon, Mar 1, 2010 at 8:55 AM, Brandon Williams dri...@gmail.com wrote: On Mon, Mar 1, 2010 at 10:53 AM, Sonny Heer sonnyh...@gmail.com wrote:  * We recommend adjusting the compaction threshold to 0, while the import is running. After the import, you

Binary memory table flush question

2010-02-26 Thread Sonny Heer
Hey, I have an application which is iterating over a directory with text files in it. For each document it is ingesting words as keys, and the docid as the column name with column value empty (no super columns). Below is the code I'm using to construct a key and column: ColumnFamily cf =

Re: Bulk Ingestion Issues

2010-02-25 Thread Sonny Heer
On Wed, Feb 24, 2010 at 11:36 AM, Jonathan Ellis jbel...@gmail.com wrote: the exception is unrelated, it's from the network layer (and is gone in 0.6) Thanks. How is the bulk loader suppose to be setup? I start Cassandra using a given storage file with the local IP as the seed and thrift IP.

Re: Bulk Ingestion Issues

2010-02-25 Thread Sonny Heer
On Wed, Feb 24, 2010 at 11:36 AM, Jonathan Ellis jbel...@gmail.com wrote: the exception is unrelated, it's from the network layer (and is gone in 0.6) Any other ideas as to what could be causing this? I'm getting inconsistent results between ingests. The sendOneWay method is called a lot

Bulk Ingestion Issues

2010-02-24 Thread Sonny Heer
I have a single box, and trying to ingest some data into a single keyspace and 5 CFs. Basically it reads from a directory text files, and inserts into Cassandra. I've set the BinaryMemtableSizeInMB to 64. For some reason I'm not getting all my data into cassandra. I get some ingested, but very

Re: Bulk Ingestion Issues

2010-02-24 Thread Sonny Heer
, that wasn't 100% clear. With BMT you need to manually flush when you are done loading, the data isn't live until it's been converted to sstable. On Wed, Feb 24, 2010 at 11:45 AM, Sonny Heer sonnyh...@gmail.com wrote: On what symptom are you basing that conclusion? I've ingested the same data using

Re: Cassandra paging, gathering stats

2010-02-23 Thread Sonny Heer
Columns can easily be paginated via the 'start' and 'finish' parameters.  You can't jump to a random page, but you can provide next/previous behavior. Do you have an example of this? From a client, they can pass in the last key, which can then be used as the start with some predefined count.

Re: Cassandra paging, gathering stats

2010-02-22 Thread Sonny Heer
Is this a bug? ColumnParent columnParent = new ColumnParent(cp, null); SlicePredicate slicePredicate = new SlicePredicate(); // Get all columns SliceRange sliceRange = new SliceRange(new byte[] {}, new byte[] {}, false,