Re: 5 node cluster - Recommended seed configuration.

2011-08-09 Thread Evgeniy Ryabitskiy
Rule of thumb, you should identify two servers in the cluster to be your seed nodes. Is this rule same for N node cluster? Any common practice/formula for seed number? Going to use about 10 nodes and extend it in a future. -- Evgeniy Ryabitskiy

Re: 5 node cluster - Recommended seed configuration.

2011-08-09 Thread Evgeniy Ryabitskiy
Thanks a lot! Maybe this should be placed at Cassandra FAQ. http://wiki.apache.org/cassandra/FAQ#seed has less information. -- Evgeniy Ryabitskiy

Re: Is Cassandra suitable for this use case?

2011-08-25 Thread Evgeniy Ryabitskiy
Hi, If you want to store files with partition/replication, you could use Distributed File System(DFS). Like http://hadoop.apache.org/hdfs/ or any other: http://en.wikipedia.org/wiki/Distributed_file_system Still you could use Cassandra to store any metadata and filepath in DFS. So: Cassandra +

NPT while get_range_slices in 0.8.1

2011-08-26 Thread Evgeniy Ryabitskiy
Hi, we have 4 node Cassandra (version 0.8.1) cluster. 2 CF inside. While first CF is working properly (read/store), get_range_slices query on second CF return NPE error. Any idea why it happen? Maybe some known bug and fixed in 0.8.3 ? ERROR [pool-2-thread-51] 2011-08-25 15:02:04,360

Re: Trying to understand QUORUM and Strategies

2011-08-31 Thread Evgeniy Ryabitskiy
Hi Actually you can use LOCAL_QUORUM and EACH_QUORUM policy everywhere on DEV/QA/Prod. Even it would be better for integration tests to use same Consistency level as on production. For production with multiple DC you usually need to chouse between 2 common solutions: Geographical Distribution or

Re: Trying to understand QUORUM and Strategies

2011-09-02 Thread Evgeniy Ryabitskiy
So. You have created keyspace with SimpleStrategy. If you want to use *LOCAL_QUORUM, *you should create keyspace (or change existing) with NetworkTopologyStrategy. I have provided CLI examples on how to do it. If you are creating keyspace from Hector, you have to do same via Java API. Evgeny.

UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-05 Thread Evgeniy Ryabitskiy
Hi, I'am trying to store record with EACH_QUORUM consistency and RF=3. While same thing with RF=2 is working. Could some one tell me why EACH_QUORUM is working with RF=2 but not with RF =3 I have 7 nodes cluster. All nodes are UP. Here is simple CLI script: create keyspace kspace3 with

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-05 Thread Evgeniy Ryabitskiy
One more thing, Cassandra version is 0.8.4. And if I try same thing from Pelops(thrift), I get UnavailableException.

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-05 Thread Evgeniy Ryabitskiy
great thanks! Evgeny.

Index search in provided list of rows (list of rowKeys).

2011-09-12 Thread Evgeniy Ryabitskiy
Hi, We have an issue to search over Cassandra and we are using Sphinx for indexing. Because of Sphinx architecture we can't use range queries over all fields that we need to. So we have to run Sphinx Query first to get List of rowKeys and perform additional range filtering over column values.

Re: Index search in provided list of rows (list of rowKeys).

2011-09-12 Thread Evgeniy Ryabitskiy
Something like this. Actually I think it's better to extend get_indexed_slice() API instead of creating new one thrift method. I wish to have something like this: //here we run query to external search engine Listbyte[] keys = performSphinxQuery(someFullTextSearchQuery); IndexClause indexClause

Re: Index search in provided list of rows (list of rowKeys).

2011-09-14 Thread Evgeniy Ryabitskiy
Why it's radically? It will be same get_indexes_slices search but in specified set of rows. So mostly it will be one more Search Expression over rowIDs not only column values. Usually the more restrictions you could specify in search query, the faster search it can be (not slower at least).

Setting java heap size for Cassandra process

2011-09-20 Thread Evgeniy Ryabitskiy
Hi, I am running Cassandra over Linux VMs, each VM is: 2GB RAM, 4 core CPU. Using RPM distribution. I have set -Xmx to 512M in cassandra-env.sh After day of running I see that Cassandra process is utilizing over 80% of memory that is 3 times more then 512M. In result after 2 days of running,

Re: Setting java heap size for Cassandra process

2011-09-20 Thread Evgeniy Ryabitskiy
Thanks for reply. Now it looks much more clear. Top shows this: PID USER PR NI VIRT *RES* SHR 67 cassandr 18 0 6267m *1.6g * 805m S 0.3 79.0 24:35.80 java It's Ok with huge VIRT memory since it's 64 bit architecture. But RES is keep growing. And I still have questions: 1) If

Re: Setting java heap size for Cassandra process

2011-09-21 Thread Evgeniy Ryabitskiy
Looks like I have same problem as here: https://issues.apache.org/jira/browse/CASSANDRA-2868 But, it's been fixed in 0.8.5 and I'm using 0.8.5 ... Evgeny.

Re: 1.0.3 CLI oddities

2011-11-28 Thread Evgeniy Ryabitskiy
Hi, Just now migrated to 1.0.3 and got same error. I did folowing; 1) Create CF with compression 2) Update cf metadata on new created CF. update failed with same Exception about Caused by: java.util.concurrent. ExecutionException: java.io.IOException:

Re: [RELEASE] Apache Cassandra 1.0.5 released

2011-12-01 Thread Evgeniy Ryabitskiy
+1 After upgrade to 1.0.5 also have Timeout exception on Secondary Index search (get_indexed_slices API) .