Can I retrieve specific key range from a table in RandomPartitioner?

2010-08-12 Thread ChingShen
Hi all, Can I retrieve specific key range from a table in RandomPartitioner? Because I always got below exception: Exception in thread main InvalidRequestException(why:start key's md5 sorts after end key's md5. this is not allowed; you probably should not specify end key at all, under

Re: Can I retrieve specific key range from a table in RandomPartitioner?

2010-08-12 Thread ChingShen
I have a key range that between 00 and 001000, and my code as below: SlicePredicate predicate = new SlicePredicate(); predicate.setColumn_names(columns); ColumnParent parent = new ColumnParent(columnFamily); KeyRange k = new KeyRange(1000); k.setStart_key(key[0]);

Filesystem for Cassandra

2010-08-12 Thread Michael Widmann
Hi out there ... Without starting an OT Thread or an Evangelists war it would be interesting what filesystems most cassandra installation uses, which performs best in which cases Actually we use Cassandra on ZFS (OpenSolaris) - fine tuned for our need. no Raidcontroller used. What are the

Re: Reload/Update schema 0.7

2010-08-12 Thread Gary Dusbabek
You should take a look at http://wiki.apache.org/cassandra/LiveSchemaUpdates loadSchemaFromYaml() is intended to initialize the schema on a seed node in a new cluster (or one that has been upgraded from 0.6). It is an operation that should only be performed one time *per cluster.* Gary On Wed,

how to retrieve data from supercolumns by phpcassa ?

2010-08-12 Thread lisek
Hi all, I've got cassandra superlcolumn looking like that: ColumnFamily Name=users CompareWith=TimeUUIDType ColumnType=Super CompareSubcolumnsWith=BytesType / now in this columnfamily I've inserted something like that: [client] = array(1) { [2a3909c0-a612-11df-b27e-346336336631]=

Re: 0.7 CLI w/TSocket

2010-08-12 Thread Mark
On 8/11/10 10:11 PM, Jonathan Ellis wrote: you have to use an up to date CLI, the old one used broken options w/ its framed mode On Wed, Aug 11, 2010 at 6:39 PM, Markstatic.void@gmail.com wrote: org.apache.thrift.protocol.TProtocolException: Missing version in readMessageBegin, old

Data Distribution / Replication

2010-08-12 Thread Stefan Kaufmann
Hello again, last day's I started several tests with Cassandra and learned quite some facts. However, of course, there are still enough things I need to understand. One thing is, how the data replication works. For my Testing: 1. I set the replication Factor to 3, started with 1 active node (the

How does cfstats calculate Row Size?

2010-08-12 Thread Julie
I am chasing down a row size discrepancy and am confused. I populated a single node Cassandra cluster with 10,000 rows of data, using numeric keys 1-10,000, where each row is a little over 100kB in length and has a single column in it. When I perform a cfstats on the node immediately after

Re: Post on experiences with Cassandra for Twitter retweet analysis

2010-08-12 Thread Eric Evans
On Thu, 2010-08-12 at 11:29 +0200, Mikio Braun wrote: So far, we're very pleased with Cassandra performance, but we've also had to overcome some issues on which I report in the blog and which are hopefully interesting for other users of Cassandra. The blog post can be found here:

Re: Filesystem for Cassandra

2010-08-12 Thread Peter Schuller
Actually we use Cassandra on ZFS (OpenSolaris) - fine tuned for our need. no Raidcontroller used. An interesting property of ZFS is the use of the ARC for caching. Contrary to the traditional behavior of buffer caches, the ARC should theoretically not evict all interesting data as a result of

Re: Data Distribution / Replication

2010-08-12 Thread Benjamin Black
On Thu, Aug 12, 2010 at 8:30 AM, Stefan Kaufmann sta...@gmail.com wrote: Hello again, last day's I started several tests with Cassandra and learned quite some facts. However, of course, there are still enough things I need to understand. One thing is, how the data replication works. For

Re: How does cfstats calculate Row Size?

2010-08-12 Thread Ryan King
On Thu, Aug 12, 2010 at 9:08 AM, Julie julie.su...@nextcentury.com wrote: I am chasing down a row size discrepancy and am confused. I populated a single node Cassandra cluster with 10,000 rows of data, using numeric keys 1-10,000, where each row is a little over 100kB in length and has a

Re: Can I retrieve specific key range from a table in RandomPartitioner?

2010-08-12 Thread Aaron Morton
Try setting the end key to an empty string, and then set the number of rows to something sane and make multiple calls if needed. Or you may be able to make your own secondary index another CF, so you do two reads: one on the secondary index then one of the rows you want.There has been some

Re: 0.7 CLI w/TSocket

2010-08-12 Thread Mark
On 8/12/10 8:29 AM, Mark wrote: On 8/11/10 10:11 PM, Jonathan Ellis wrote: you have to use an up to date CLI, the old one used broken options w/ its framed mode On Wed, Aug 11, 2010 at 6:39 PM, Markstatic.void@gmail.com wrote: org.apache.thrift.protocol.TProtocolException: Missing

RE: error using get_range_slice with random partitioner

2010-08-12 Thread David McIntosh
I'm also seeing an issue with not being able to iterate over all keys in Cassandra 0.6.4. In my unit test I create 20 keys (0-19) and iterate with a batch size of 6. This is what I get. Cassandra 0.6.4 start key: 9, 14, 4, 15, 11, 18 start key: 18 18, 7, 17, 7, 17 start key:17 17

Re: How does cfstats calculate Row Size?

2010-08-12 Thread Jonathan Ellis
Right, row stats in 0.6 are just what I've seen during the compactions that happened to run since this node restarted last. 0.7 has persistent (and more fine-grained) statistics. On Thu, Aug 12, 2010 at 1:28 PM, Ryan King r...@twitter.com wrote: On Thu, Aug 12, 2010 at 9:08 AM, Julie

Re: 0.7 CLI w/TSocket

2010-08-12 Thread Jonathan Ellis
Works fine here. bin/cassandra-cli --host localhost --port 9160 Connected to: Test Cluster on localhost/9160 Welcome to cassandra CLI. On Thu, Aug 12, 2010 at 2:18 PM, Mark static.void@gmail.com wrote: On 8/12/10 8:29 AM, Mark wrote: On 8/11/10 10:11 PM, Jonathan Ellis wrote: you have

Re: 0.7 CLI w/TSocket

2010-08-12 Thread Mark
On 8/12/10 9:14 PM, Jonathan Ellis wrote: Works fine here. bin/cassandra-cli --host localhost --port 9160 Connected to: Test Cluster on localhost/9160 Welcome to cassandra CLI. On Thu, Aug 12, 2010 at 2:18 PM, Markstatic.void@gmail.com wrote: On 8/12/10 8:29 AM, Mark wrote: On

Re: 0.7 CLI w/TSocket

2010-08-12 Thread Mark
On 8/12/10 10:20 PM, Mark wrote: On 8/12/10 9:14 PM, Jonathan Ellis wrote: Works fine here. bin/cassandra-cli --host localhost --port 9160 Connected to: Test Cluster on localhost/9160 Welcome to cassandra CLI. On Thu, Aug 12, 2010 at 2:18 PM, Markstatic.void@gmail.com wrote: On 8/12/10