[jira] [Updated] (CASSANDRA-2156) Compaction Throttling

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2156:


Attachment: (was: 
0006-Throttle-total-compaction-to-a-configurable-throughput.txt)

 Compaction Throttling
 -

 Key: CASSANDRA-2156
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2156
 Project: Cassandra
  Issue Type: New Feature
Reporter: Stu Hood
Assignee: Stu Hood
 Fix For: 0.8

 Attachments: 
 for-0.6-0001-Throttle-compaction-to-a-fixed-throughput.txt, 
 for-0.6-0002-Make-compaction-throttling-configurable.txt


 Compaction is currently relatively bursty: we compact as fast as we can, and 
 then we wait for the next compaction to be possible (hurry up and wait).
 Instead, to properly amortize compaction, you'd like to compact exactly as 
 fast as you need to in order to keep the sstable count under control.
 For every new level of compaction, you need to increase the rate at which you 
 compact: a rule of thumb that we're testing on our clusters is to 
 determine the maximum number of buckets a node can support (i.e., if the 15th 
 bucket holds 750 GB, we're not going to have more than 15 buckets), and then 
 multiply the flush throughput by the number of buckets to get a minimum 
 compaction throughput to maintain your sstable count.
 Full explanation: for a min compaction threshold of {{T}}, the bucket at 
 level {{N}} can contain {{SsubN = T^N}} 'units' (unit == a memtable's worth of 
 data on disk). Every time a new unit is added, it has a {{1/SsubN}} chance of 
 causing the bucket at level N to fill. If the bucket at level N fills, it 
 causes {{SsubN}} units to be compacted. So, for each active level in your 
 system you have {{SsubN * 1/SsubN}}, or {{1}} amortized unit to compact any 
 time a new unit is added.
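
To make the rule of thumb concrete, here is a small illustrative sketch; the class
name, threshold, bucket count and flush rate below are hypothetical, not taken from
the patch:

{code}
// Illustrative only: numbers and names are hypothetical, not part of the patch.
public class CompactionRuleOfThumb
{
    public static void main(String[] args)
    {
        int T = 4;                    // min compaction threshold
        int activeLevels = 15;        // max buckets the node can support
        double flushMBPerSec = 2.0;   // observed memtable flush throughput

        // The bucket at level N holds SsubN = T^N "units" (memtable-sized sstables).
        for (int n = 0; n <= 3; n++)
            System.out.printf("level %d holds up to %.0f units%n", n, Math.pow(T, n));

        // Each new unit costs ~1 amortized unit of compaction per active level, so the
        // minimum sustained compaction throughput is flush throughput * active levels.
        double minCompactionMBPerSec = flushMBPerSec * activeLevels;
        System.out.printf("minimum compaction throughput: %.1f MB/s%n", minCompactionMBPerSec);
    }
}
{code}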

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2156) Compaction Throttling

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2156:


Attachment: 0007-Throttle-total-compaction-to-a-configurable-throughput.txt

 Compaction Throttling
 -

 Key: CASSANDRA-2156
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2156
 Project: Cassandra
  Issue Type: New Feature
Reporter: Stu Hood
Assignee: Stu Hood
 Fix For: 0.8

 Attachments: 
 0007-Throttle-total-compaction-to-a-configurable-throughput.txt, 
 for-0.6-0001-Throttle-compaction-to-a-fixed-throughput.txt, 
 for-0.6-0002-Make-compaction-throttling-configurable.txt


 Compaction is currently relatively bursty: we compact as fast as we can, and 
 then we wait for the next compaction to be possible (hurry up and wait).
 Instead, to properly amortize compaction, you'd like to compact exactly as 
 fast as you need to in order to keep the sstable count under control.
 For every new level of compaction, you need to increase the rate at which you 
 compact: a rule of thumb that we're testing on our clusters is to 
 determine the maximum number of buckets a node can support (i.e., if the 15th 
 bucket holds 750 GB, we're not going to have more than 15 buckets), and then 
 multiply the flush throughput by the number of buckets to get a minimum 
 compaction throughput to maintain your sstable count.
 Full explanation: for a min compaction threshold of {{T}}, the bucket at 
 level {{N}} can contain {{SsubN = T^N}} 'units' (unit == a memtable's worth of 
 data on disk). Every time a new unit is added, it has a {{1/SsubN}} chance of 
 causing the bucket at level N to fill. If the bucket at level N fills, it 
 causes {{SsubN}} units to be compacted. So, for each active level in your 
 system you have {{SsubN * 1/SsubN}}, or {{1}} amortized unit to compact any 
 time a new unit is added.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: (was: 
0005-Add-a-harness-to-allow-compaction-tasks-that-need-to-a.txt)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.
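
One way to realize this, suggested by the attachment names on this ticket (a
"compacting" set tracked by DataTracker), is to let each compaction task atomically
claim the sstables it will work on and skip any bucket that overlaps a running task.
A minimal sketch of that idea, using hypothetical names rather than the actual
DataTracker API:

{code}
// Hypothetical sketch of the "compacting set" idea; not the actual DataTracker code.
import java.util.Collection;
import java.util.HashSet;
import java.util.Set;

public class CompactingSet<T>
{
    private final Set<T> compacting = new HashSet<T>();

    // Atomically claim a candidate bucket; returns false if any sstable in it
    // is already being compacted by another thread.
    public synchronized boolean tryClaim(Collection<T> bucket)
    {
        for (T sstable : bucket)
            if (compacting.contains(sstable))
                return false;
        compacting.addAll(bucket);
        return true;
    }

    // Release the sstables once the compaction task finishes (or fails).
    public synchronized void release(Collection<T> bucket)
    {
        compacting.removeAll(bucket);
    }
}
{code}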

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: (was: 0001-Add-a-compacting-set-to-DataTracker.txt)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: (was: 0004-Allow-multithread-compaction-to-be-disabled.txt)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: (was: 
0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: (was: 
0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2191:


Attachment: 0006-Prevent-cache-saves-from-occuring-concurrently.txt
0005-Acquire-the-writeLock-for-major-cleanup-scrub-in-order.txt
0004-Allow-multithread-compaction-to-be-disabled.txt
0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt
0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt
0001-Add-a-compacting-set-to-DataTracker.txt

* Inlined stopTheWorld in 0005. Yes, I agree that the name sucked, but whether 
it is possible for a lock acquisition to fail on a server that is not already 
screwed, and whether an abstraction is in order here, are still up for debate.
* Removed the 'forceMajor' parameter: will open a ticket post-commit to allow 
for guaranteeing that a manually triggered compaction is major
* Moved ksname/cfname into getters. I didn't do this initially because the CFS 
is sometimes null, but I guess you'd get the NPE in either case
* Added an AtomicBoolean to AutoSavingCache in 0006. I reeeally think this 
should go to the flush stage, since the tasks have almost identical lifetimes, 
and we don't really need progress for either of them (a minimal sketch of this 
kind of guard follows below)
* Wrapped the IdentityHashMap into an IdentityHashSet
* Returned printCompactionStats to its former glory
* Removed OperationType from SSTableWriter.Builder's task type

Thanks! CASSANDRA-2156 has been rebased as well.
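
As a reference for the AtomicBoolean bullet above, a guard of that kind could look
roughly like the following minimal sketch; the names here are hypothetical and the
real change is the one in patch 0006:

{code}
// Rough sketch of an "only one cache save at a time" guard; names are hypothetical.
import java.util.concurrent.atomic.AtomicBoolean;

public class AutoSavingCacheSketch
{
    private final AtomicBoolean savingInProgress = new AtomicBoolean(false);

    public void submitSave()
    {
        // compareAndSet ensures that concurrent callers cannot start two saves.
        if (!savingInProgress.compareAndSet(false, true))
            return; // a save is already running; skip this request

        try
        {
            saveCacheToDisk();
        }
        finally
        {
            savingInProgress.set(false);
        }
    }

    private void saveCacheToDisk()
    {
        // serialize keys/rows to the saved_caches directory (elided)
    }
}
{code}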

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8

 Attachments: 0001-Add-a-compacting-set-to-DataTracker.txt, 
 0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt, 
 0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt, 
 0004-Allow-multithread-compaction-to-be-disabled.txt, 
 0005-Acquire-the-writeLock-for-major-cleanup-scrub-in-order.txt, 
 0006-Prevent-cache-saves-from-occuring-concurrently.txt


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[Cassandra Wiki] Update of API07 by AhmetEkremSaban

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The API07 page has been changed by AhmetEkremSaban.
The comment on this change is: Started a new page for API 0.7.
http://wiki.apache.org/cassandra/API07

--

New page:
## page was copied from API
== Overview ==
The Cassandra Thrift API changed between [[API03|0.3]], [[API04|0.4]], 
[[API|0.5]] and [[API|0.6]]; this document explains the 0.7 version.

Cassandra's client API is built entirely on top of Thrift. It should be noted 
that these documents mention default values, but these are not generated in all 
of the languages that Thrift supports.  Full examples of using Cassandra from 
Thrift, including setup boilerplate, are found on ThriftExamples.  Higher-level 
clients are linked from ClientOptions.

'''WARNING:''' Some SQL/RDBMS terms are used in this documentation for analogy 
purposes. They should be thought of as just that: analogies. There are few 
similarities between how data is managed in a traditional RDBMS and Cassandra. 
Please see DataModel for more information.

'''This article is a stub for the Apache Cassandra API 0.7'''


[Cassandra Wiki] Trivial Update of ClientOptions by AhmetEkremSaban

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The ClientOptions page has been changed by AhmetEkremSaban.
The comment on this change is: Ordered entries alphabetically.
http://wiki.apache.org/cassandra/ClientOptions?action=diff&rev1=124&rev2=125

--

  If no high-level client exists for your environment, you may be able to 
update an [[ClientOptions06|older client]]; failing that, you'll have to use 
the raw Thrift [[API]].
  
   * Python:
+   * Pycassa: http://github.com/pycassa/pycassa
* Telephus: http://github.com/driftx/Telephus (Twisted)
-   * Pycassa: http://github.com/pycassa/pycassa
   * Java:
+   * Datanucleus JDO: http://github.com/tnine/Datanucleus-Cassandra-Plugin
* Hector: http://github.com/rantav/hector (Examples 
https://github.com/zznate/hector-examples )
+   * Kundera http://code.google.com/p/kundera/
* Pelops: http://github.com/s7/scale7-pelops
-   * Kundera http://code.google.com/p/kundera/
-   * Datanucleus JDO: http://github.com/tnine/Datanucleus-Cassandra-Plugin
   * Grails:
* grails-cassandra: https://github.com/wolpert/grails-cassandra
   * .NET:
+   * Aquiles: http://aquiles.codeplex.com/
* FluentCassandra: http://github.com/managedfusion/fluentcassandra
-   * Aquiles: http://aquiles.codeplex.com/
   * Ruby:
* Cassandra: http://github.com/fauna/cassandra
   * PHP:


svn commit: r1090979 - in /cassandra/trunk: ./ conf/ src/java/org/apache/cassandra/config/ src/java/org/apache/cassandra/db/ src/java/org/apache/cassandra/io/ src/java/org/apache/cassandra/service/ sr

2011-04-11 Thread slebresne
Author: slebresne
Date: Mon Apr 11 08:45:07 2011
New Revision: 1090979

URL: http://svn.apache.org/viewvc?rev=1090979&view=rev
Log:
Compaction throttling
patch by stuhood; reviewed by slebresne for CASSANDRA-2156

Modified:
cassandra/trunk/CHANGES.txt
cassandra/trunk/conf/cassandra.yaml
cassandra/trunk/src/java/org/apache/cassandra/config/Config.java
cassandra/trunk/src/java/org/apache/cassandra/config/DatabaseDescriptor.java
cassandra/trunk/src/java/org/apache/cassandra/db/CompactionManager.java
cassandra/trunk/src/java/org/apache/cassandra/io/CompactionIterator.java
cassandra/trunk/src/java/org/apache/cassandra/service/StorageService.java

cassandra/trunk/src/java/org/apache/cassandra/service/StorageServiceMBean.java
cassandra/trunk/src/java/org/apache/cassandra/tools/NodeCmd.java
cassandra/trunk/src/java/org/apache/cassandra/tools/NodeProbe.java

Modified: cassandra/trunk/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/trunk/CHANGES.txt?rev=1090979&r1=1090978&r2=1090979&view=diff
==
--- cassandra/trunk/CHANGES.txt (original)
+++ cassandra/trunk/CHANGES.txt Mon Apr 11 08:45:07 2011
@@ -20,7 +20,7 @@
  * push replication_factor into strategy_options (CASSANDRA-1263)
  * give snapshots the same name on each node (CASSANDRA-1791)
  * multithreaded compaction (CASSANDRA-2191)
-
+ * compaction throttling (CASSANDRA-2156)
 
 0.7.5
  * Avoid seeking when sstable2json exports the entire file (CASSANDRA-2318)

Modified: cassandra/trunk/conf/cassandra.yaml
URL: 
http://svn.apache.org/viewvc/cassandra/trunk/conf/cassandra.yaml?rev=1090979&r1=1090978&r2=1090979&view=diff
==
--- cassandra/trunk/conf/cassandra.yaml (original)
+++ cassandra/trunk/conf/cassandra.yaml Mon Apr 11 08:45:07 2011
@@ -250,9 +250,17 @@ column_index_size_in_kb: 64
 in_memory_compaction_limit_in_mb: 64
 
 # Enables multiple compactions to execute at once. This is highly recommended
-# for preserving read performance in a mixed read/write workload.
+# for preserving read performance in a mixed read/write workload, as this
+# prevents sstables from accumulating during long-running compactions.
 compaction_multithreading: true
 
+# Throttles compaction to the given total throughput across the entire
+# system. The faster you insert data, the faster you need to compact in
+# order to keep the sstable count down, but in general, setting this to
+# 16 to 32 times the rate you are inserting data is more than sufficient.
+# Setting this to 0 disables throttling.
+compaction_throughput_mb_per_sec: 16
+
 # Track cached row keys during compaction, and re-cache their new
 # positions in the compacted sstable.  Disable if you use really large
 # key caches.
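
For context, throttling to a fixed MB/s can be implemented by tracking the bytes
processed and sleeping whenever the observed rate runs ahead of the target. The
sketch below is illustrative only, with hypothetical names; it is not the code this
commit adds to CompactionIterator:

{code}
// Illustrative byte-rate throttle; hypothetical names, not the committed code.
public class RateThrottle
{
    private final long targetBytesPerSec;
    private long bytesInWindow = 0;
    private long windowStartNanos = System.nanoTime();

    public RateThrottle(int mbPerSec)
    {
        // mirrors compaction_throughput_mb_per_sec; 0 disables throttling
        this.targetBytesPerSec = mbPerSec * 1024L * 1024L;
    }

    // Call after processing 'bytes'; sleeps if we are ahead of the target rate.
    public void throttle(long bytes) throws InterruptedException
    {
        if (targetBytesPerSec <= 0)
            return;

        bytesInWindow += bytes;
        double elapsedSec = (System.nanoTime() - windowStartNanos) / 1e9;
        double expectedSec = (double) bytesInWindow / targetBytesPerSec;
        if (expectedSec > elapsedSec)
            Thread.sleep((long) ((expectedSec - elapsedSec) * 1000));

        // Reset the window periodically so old history does not dominate.
        if (elapsedSec > 1.0)
        {
            bytesInWindow = 0;
            windowStartNanos = System.nanoTime();
        }
    }
}
{code}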

Modified: cassandra/trunk/src/java/org/apache/cassandra/config/Config.java
URL: 
http://svn.apache.org/viewvc/cassandra/trunk/src/java/org/apache/cassandra/config/Config.java?rev=1090979&r1=1090978&r2=1090979&view=diff
==
--- cassandra/trunk/src/java/org/apache/cassandra/config/Config.java (original)
+++ cassandra/trunk/src/java/org/apache/cassandra/config/Config.java Mon Apr 11 
08:45:07 2011
@@ -83,6 +83,7 @@ public class Config
 public Integer column_index_size_in_kb = 64;
 public Integer in_memory_compaction_limit_in_mb = 256;
 public Boolean compaction_multithreading = true;
+public Integer compaction_throughput_mb_per_sec = 16;
 
 public String[] data_file_directories;
 

Modified: 
cassandra/trunk/src/java/org/apache/cassandra/config/DatabaseDescriptor.java
URL: 
http://svn.apache.org/viewvc/cassandra/trunk/src/java/org/apache/cassandra/config/DatabaseDescriptor.java?rev=1090979&r1=1090978&r2=1090979&view=diff
==
--- 
cassandra/trunk/src/java/org/apache/cassandra/config/DatabaseDescriptor.java 
(original)
+++ 
cassandra/trunk/src/java/org/apache/cassandra/config/DatabaseDescriptor.java 
Mon Apr 11 08:45:07 2011
@@ -344,6 +344,9 @@ public class DatabaseDescriptor
 if (conf.compaction_multithreading == null)
 conf.compaction_multithreading = true;
 
+if (conf.compaction_throughput_mb_per_sec == null)
+conf.compaction_throughput_mb_per_sec = 16;
+
 /* data file and commit log directories. they get created later, 
when they're needed. */
if (conf.commitlog_directory != null && conf.data_file_directories 
!= null && conf.saved_caches_directory != null)
 {
@@ -731,6 +734,16 @@ public class DatabaseDescriptor
 return conf.compaction_multithreading;
 }
 
+public static int getCompactionThroughputMbPerSec()
+{
+return conf.compaction_throughput_mb_per_sec;
+}
+
+public static void setCompactionThroughputMbPerSec(int value)
+

[jira] [Commented] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018251#comment-13018251
 ] 

Sylvain Lebresne commented on CASSANDRA-2191:
-

For the record:

bq. Inlined stopTheWorld in 0005. Yes, I agree that the name sucked, but 
whether it is possible for a lock acquisition to fail on a server that is not 
already screwed, and whether an abstraction is in order here, are still up for 
debate

I do like the inlined version much more. I did not pretend that the previous 
version wasn't working; it was just hard to check that the unmarking was 
happening correctly, and even though I agree lock acquisition is unlikely to 
fail, it would have been easy for someone else to add lines inside stopTheWorld 
at the wrong place that could fail. And the name sucked :)

bq. Added an AtomicBoolean to AutoSavingCache in 0006. I reeeally think this 
should go to the flush stage, since the tasks have almost identical lifetimes, 
and we don't really need progress for either of them

I just don't want cache saving to block flush for too long. So I'm not saying 
it should never go to the flush stage, but I'm uncomfortable putting it there 
without some proper testing of its impact. We could make the flush stage 
multithreaded (with throttling); then I would have no problem with moving cache 
saving there (but we would still have to make sure only one save happens 
at a time).

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8

 Attachments: 0001-Add-a-compacting-set-to-DataTracker.txt, 
 0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt, 
 0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt, 
 0004-Allow-multithread-compaction-to-be-disabled.txt, 
 0005-Acquire-the-writeLock-for-major-cleanup-scrub-in-order.txt, 
 0006-Prevent-cache-saves-from-occuring-concurrently.txt


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2156) Compaction Throttling

2011-04-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018255#comment-13018255
 ] 

Hudson commented on CASSANDRA-2156:
---

Integrated in Cassandra #848 (See 
[https://hudson.apache.org/hudson/job/Cassandra/848/])
Compaction throttling
patch by stuhood; reviewed by slebresne for CASSANDRA-2156


 Compaction Throttling
 -

 Key: CASSANDRA-2156
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2156
 Project: Cassandra
  Issue Type: New Feature
Reporter: Stu Hood
Assignee: Stu Hood
 Fix For: 0.8

 Attachments: 
 0007-Throttle-total-compaction-to-a-configurable-throughput.txt, 
 for-0.6-0001-Throttle-compaction-to-a-fixed-throughput.txt, 
 for-0.6-0002-Make-compaction-throttling-configurable.txt


 Compaction is currently relatively bursty: we compact as fast as we can, and 
 then we wait for the next compaction to be possible (hurry up and wait).
 Instead, to properly amortize compaction, you'd like to compact exactly as 
 fast as you need to in order to keep the sstable count under control.
 For every new level of compaction, you need to increase the rate at which you 
 compact: a rule of thumb that we're testing on our clusters is to 
 determine the maximum number of buckets a node can support (i.e., if the 15th 
 bucket holds 750 GB, we're not going to have more than 15 buckets), and then 
 multiply the flush throughput by the number of buckets to get a minimum 
 compaction throughput to maintain your sstable count.
 Full explanation: for a min compaction threshold of {{T}}, the bucket at 
 level {{N}} can contain {{SsubN = T^N}} 'units' (unit == a memtable's worth of 
 data on disk). Every time a new unit is added, it has a {{1/SsubN}} chance of 
 causing the bucket at level N to fill. If the bucket at level N fills, it 
 causes {{SsubN}} units to be compacted. So, for each active level in your 
 system you have {{SsubN * 1/SsubN}}, or {{1}} amortized unit to compact any 
 time a new unit is added.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018254#comment-13018254
 ] 

Hudson commented on CASSANDRA-2191:
---

Integrated in Cassandra #848 (See 
[https://hudson.apache.org/hudson/job/Cassandra/848/])
Multithreaded compactions
patch by stuhood; reviewed by slebresne for CASSANDRA-2191


 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8

 Attachments: 0001-Add-a-compacting-set-to-DataTracker.txt, 
 0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt, 
 0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt, 
 0004-Allow-multithread-compaction-to-be-disabled.txt, 
 0005-Acquire-the-writeLock-for-major-cleanup-scrub-in-order.txt, 
 0006-Prevent-cache-saves-from-occuring-concurrently.txt


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018271#comment-13018271
 ] 

Pavel Yaskevich commented on CASSANDRA-2441:


+1

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical

 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did was clone git://git.apache.org/cassandra.git and git reset to 
 each commit, running `ant clean && ant && ./bin/cassandra -f` until I got 
 Cassandra started.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Assigned] (CASSANDRA-2420) row cache / streaming aren't aware of each other

2011-04-11 Thread Sylvain Lebresne (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne reassigned CASSANDRA-2420:
---

Assignee: Sylvain Lebresne

 row cache / streaming aren't aware of each other
 

 Key: CASSANDRA-2420
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2420
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.6
Reporter: Matthew F. Dennis
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.5


 SSTableWriter.Builder.build() takes tables that resulted from streaming, 
 repair, bootstrapping, et cetera and builds the indexes and bloom filters 
 before adding them, so that the current node is aware of them.
 However, if there is data present in the cache for a row that is also present 
 in the streamed table, the row cache can overshadow the data in the newly 
 built table.  In other words, until the row in the row cache is removed from 
 the cache (e.g. because it's pushed out due to size, the node is restarted, or 
 the cache is manually cleared), the data in the newly built table will never 
 be returned to clients.
 The solution that seems most reasonable at this point is to have 
 SSTableWriter.Builder.build() (or something below it) update the row cache if 
 the row key in the table being built is also present in the cache.
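
A minimal sketch of the proposed fix, using hypothetical names (the real change
would live in or below SSTableWriter.Builder.build()): after building the streamed
sstable, refresh any row that is already present in the row cache so stale cached
data cannot shadow the newly written rows.

{code}
// Hypothetical illustration of refreshing cached rows after building a streamed sstable.
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public class RowCacheRefreshSketch<K, V>
{
    private final ConcurrentMap<K, V> rowCache = new ConcurrentHashMap<K, V>();

    // Called for each key found in the newly built sstable.
    public void onStreamedRow(K key, V freshRow)
    {
        // Only touch keys that are already cached; leave everything else alone.
        rowCache.replace(key, freshRow);
        // Alternatively, simply invalidate: rowCache.remove(key);
    }
}
{code}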

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2326) stress.java indexed range slicing is broken

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018276#comment-13018276
 ] 

Pavel Yaskevich commented on CASSANDRA-2326:


We could perhaps let users provide a list of values for indexed range 
slices; I don't have any other solution right now...

 stress.java indexed range slicing is broken
 ---

 Key: CASSANDRA-2326
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2326
 Project: Cassandra
  Issue Type: Bug
  Components: Contrib
Reporter: Brandon Williams
Assignee: Pavel Yaskevich
Priority: Trivial

 I probably broke it when I fixed the build that CASSANDRA-2312 broke.  Now it 
 compiles, but never works.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1740) Nodetool commands to query and stop compaction, repair, cleanup and scrub

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018303#comment-13018303
 ] 

Pavel Yaskevich commented on CASSANDRA-1740:


Stu Hood: do you want to take charge of this one, given your changes to the 
compaction mechanism?

 Nodetool commands to query and stop compaction, repair, cleanup and scrub
 -

 Key: CASSANDRA-1740
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1740
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Chip Salzenberg
Assignee: Pavel Yaskevich
Priority: Minor
 Fix For: 0.7.5

 Attachments: CASSANDRA-1740.patch

   Original Estimate: 24h
  Remaining Estimate: 24h

 The only way to stop compaction, repair, cleanup, or scrub in progress is to 
 stop and restart the entire Cassandra server.  Please provide nodetool 
 commands to query whether such things are running, and stop them if they are.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1740) Nodetool commands to query and stop compaction, repair, cleanup and scrub

2011-04-11 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018310#comment-13018310
 ] 

Sylvain Lebresne commented on CASSANDRA-1740:
-

I committed CASSANDRA-2191, so as Stu said, this will need some rebasing to 
handle it.

For repair, let's just create another ticket. If Stu wants to and has time to 
do it, fine; otherwise I may do it.

 Nodetool commands to query and stop compaction, repair, cleanup and scrub
 -

 Key: CASSANDRA-1740
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1740
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Chip Salzenberg
Assignee: Pavel Yaskevich
Priority: Minor
 Fix For: 0.7.5

 Attachments: CASSANDRA-1740.patch

   Original Estimate: 24h
  Remaining Estimate: 24h

 The only way to stop compaction, repair, cleanup, or scrub in progress is to 
 stop and restart the entire Cassandra server.  Please provide nodetool 
 commands to query whether such things are running, and stop them if they are.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1740) Nodetool commands to query and stop compaction, repair, cleanup and scrub

2011-04-11 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018315#comment-13018315
 ] 

Sylvain Lebresne commented on CASSANDRA-1740:
-

As for handling multithreaded compactions, the hashcode would more or less 
work, but since it's not totally safe I would prefer assigning a name to each 
compaction, which could be:
  * a uuid assigned to each compaction when it is created
  * a simple (atomically) increasing number
  * a simple (atomically) increasing number for each type of compaction, the 
name being something like major-42 or minor-3012. The nice thing is it tells you 
how many minor, major, validation, ... compactions you have run already (see the 
sketch below).
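
A small sketch of the last option, with hypothetical names (not a committed API):
keep an atomically increasing counter per compaction type and derive names like
major-42 or minor-3012 from it.

{code}
// Hypothetical per-type compaction naming, as sketched in the comment above.
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.AtomicLong;

public class CompactionNames
{
    private final ConcurrentMap<String, AtomicLong> counters =
            new ConcurrentHashMap<String, AtomicLong>();

    public String next(String type) // e.g. "major", "minor", "validation"
    {
        counters.putIfAbsent(type, new AtomicLong());
        long id = counters.get(type).incrementAndGet();
        return type + "-" + id;   // e.g. "major-42", "minor-3012"
    }
}
{code}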

 Nodetool commands to query and stop compaction, repair, cleanup and scrub
 -

 Key: CASSANDRA-1740
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1740
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Chip Salzenberg
Assignee: Pavel Yaskevich
Priority: Minor
 Fix For: 0.7.5

 Attachments: CASSANDRA-1740.patch

   Original Estimate: 24h
  Remaining Estimate: 24h

 The only way to stop compaction, repair, cleanup, or scrub in progress is to 
 stop and restart the entire Cassandra server.  Please provide nodetool 
 commands to query whether such things are running, and stop them if they are.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1740) Nodetool commands to query and stop compaction, repair, cleanup and scrub

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018316#comment-13018316
 ] 

Pavel Yaskevich commented on CASSANDRA-1740:


I like the last option!

 Nodetool commands to query and stop compaction, repair, cleanup and scrub
 -

 Key: CASSANDRA-1740
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1740
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Chip Salzenberg
Assignee: Pavel Yaskevich
Priority: Minor
 Fix For: 0.7.5

 Attachments: CASSANDRA-1740.patch

   Original Estimate: 24h
  Remaining Estimate: 24h

 The only way to stop compaction, repair, cleanup, or scrub in progress is to 
 stop and restart the entire Cassandra server.  Please provide nodetool 
 commands to query whether such things are running, and stop them if they are.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2056) Need a way of flattening schemas.

2011-04-11 Thread Gary Dusbabek (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gary Dusbabek updated CASSANDRA-2056:
-

Attachment: 
v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt

v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt
v2-0001-convert-MigrationManager-into-a-singleton.txt

 Need a way of flattening schemas.
 -

 Key: CASSANDRA-2056
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2056
 Project: Cassandra
  Issue Type: Improvement
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
 Fix For: 0.8

 Attachments: v2-0001-convert-MigrationManager-into-a-singleton.txt, 
 v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt, 
 v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt


 For all of our trying not to, we still managed to screw this up.  Schema 
 updates currently contain a serialized RowMutation stored as a column value.  
 When a node needs updated schema, it requests these values, deserializes them 
 and applies them.  As the serialization scheme for RowMutation changes over 
 time (this is inevitable), those old migrations will become incompatible with 
 newer implementations of the RowMutation deserializer.  This means that when 
 new nodes come online, they'll get migration messages that they have trouble 
 deserializing.  (Remember, we've only made the promise that we'll be 
 backwards compatible for one version--see CASSANDRA-1015--even though we'd 
 eventually have this problem without that guarantee.)
 What I propose is a cluster command to flatten the schema prior to upgrading. 
  This would basically purge the old schema updates and replace them with a 
 single serialized migration (serialized in the current protocol version).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2056) Need a way of flattening schemas.

2011-04-11 Thread Gary Dusbabek (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018339#comment-13018339
 ] 

Gary Dusbabek commented on CASSANDRA-2056:
--

Attached rebased v2. CliTest fails though, so not committing yet.

 Need a way of flattening schemas.
 -

 Key: CASSANDRA-2056
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2056
 Project: Cassandra
  Issue Type: Improvement
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
 Fix For: 0.8

 Attachments: v2-0001-convert-MigrationManager-into-a-singleton.txt, 
 v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt, 
 v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt


 For all of our trying not to, we still managed to screw this up.  Schema 
 updates currently contain a serialized RowMutation stored as a column value.  
 When a node needs updated schema, it requests these values, deserializes them 
 and applies them.  As the serialization scheme for RowMutation changes over 
 time (this is inevitable), those old migrations will become incompatible with 
 newer implementations of the RowMutation deserializer.  This means that when 
 new nodes come online, they'll get migration messages that they have trouble 
 deserializing.  (Remember, we've only made the promise that we'll be 
 backwards compatible for one version--see CASSANDRA-1015--even though we'd 
 eventually have this problem without that guarantee.)
 What I propose is a cluster command to flatten the schema prior to upgrading. 
  This would basically purge the old schema updates and replace them with a 
 single serialized migration (serialized in the current protocol version).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2056) Need a way of flattening schemas.

2011-04-11 Thread Gary Dusbabek (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gary Dusbabek updated CASSANDRA-2056:
-

Attachment: (was: v1-0001-convert-MigrationManager-into-a-singleton.txt)

 Need a way of flattening schemas.
 -

 Key: CASSANDRA-2056
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2056
 Project: Cassandra
  Issue Type: Improvement
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
 Fix For: 0.8

 Attachments: v2-0001-convert-MigrationManager-into-a-singleton.txt, 
 v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt, 
 v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt


 For all of our trying not to, we still managed to screw this up.  Schema 
 updates currently contain a serialized RowMutation stored as a column value.  
 When a node needs updated schema, it requests these values, deserializes them 
 and applies them.  As the serialization scheme for RowMutation changes over 
 time (this is inevitable), those old migrations will become incompatible with 
 newer implementations of the RowMutation deserializer.  This means that when 
 new nodes come online, they'll get migration messages that they have trouble 
 deserializing.  (Remember, we've only made the promise that we'll be 
 backwards compatible for one version--see CASSANDRA-1015--even though we'd 
 eventually have this problem without that guarantee.)
 What I propose is a cluster command to flatten the schema prior to upgrading. 
  This would basically purge the old schema updates and replace them with a 
 single serialized migration (serialized in the current protocol version).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2056) Need a way of flattening schemas.

2011-04-11 Thread Gary Dusbabek (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gary Dusbabek updated CASSANDRA-2056:
-

Attachment: (was: 
v1-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt)

 Need a way of flattening schemas.
 -

 Key: CASSANDRA-2056
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2056
 Project: Cassandra
  Issue Type: Improvement
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
 Fix For: 0.8

 Attachments: v2-0001-convert-MigrationManager-into-a-singleton.txt, 
 v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt, 
 v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt


 For all of our trying not to, we still managed to screw this up.  Schema 
 updates currently contain a serialized RowMutation stored as a column value.  
 When a node needs updated schema, it requests these values, deserializes them 
 and applies them.  As the serialization scheme for RowMutation changes over 
 time (this is inevitable), those old migrations will become incompatible with 
 newer implementations of the RowMutation deserializer.  This means that when 
 new nodes come online, they'll get migration messages that they have trouble 
 deserializing.  (Remember, we've only made the promise that we'll be 
 backwards compatible for one version--see CASSANDRA-1015--even though we'd 
 eventually have this problem without that guarantee.)
 What I propose is a cluster command to flatten the schema prior to upgrading. 
  This would basically purge the old schema updates and replace them with a 
 single serialized migration (serialized in the current protocol version).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2056) Need a way of flattening schemas.

2011-04-11 Thread Gary Dusbabek (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gary Dusbabek updated CASSANDRA-2056:
-

Attachment: (was: 
v1-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt)

 Need a way of flattening schemas.
 -

 Key: CASSANDRA-2056
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2056
 Project: Cassandra
  Issue Type: Improvement
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
 Fix For: 0.8

 Attachments: v2-0001-convert-MigrationManager-into-a-singleton.txt, 
 v2-0002-bail-on-migrations-originating-from-newer-protocol-ver.txt, 
 v2-0003-a-way-to-upgrade-schema-when-protocol-version-changes.txt


 For all of our trying not to, we still managed to screw this up.  Schema 
 updates currently contain a serialized RowMutation stored as a column value.  
 When a node needs updated schema, it requests these values, deserializes them 
 and applies them.  As the serialization scheme for RowMutation changes over 
 time (this is inevitable), those old migrations will become incompatible with 
 newer implementations of the RowMutation deserializer.  This means that when 
 new nodes come online, they'll get migration messages that they have trouble 
 deserializing.  (Remember, we've only made the promise that we'll be 
 backwards compatible for one version--see CASSANDRA-1015--even though we'd 
 eventually have this problem without that guarantee.)
 What I propose is a cluster command to flatten the schema prior to upgrading. 
  This would basically purge the old schema updates and replace them with a 
 single serialized migration (serialized in the current protocol version).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[Cassandra Wiki] Trivial Update of Operations_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The Operations_JP page has been changed by MakiWatanabe.
The comment on this change is: fix typo.
http://wiki.apache.org/cassandra/Operations_JP?action=diff&rev1=104&rev2=105

--

  Reducing the replication factor is easy. After reducing it, run cleanup to delete the surplus replica data.
  
  === Network Topology ===
- 
The replication strategy lets you control replica placement across datacenters, but in addition you can, but, you can make Cassandra aware of which nodes within a datacenter are installed in the same rack. Cassandra uses this information when reading and when moving data for token range changes, so that the closest replica is used. The proximity-detection behaviour can be changed via the !EndpointSnitch class, which is pluggable in the configuration file.
+ 
The replication strategy lets you control replica placement across datacenters, but in addition you can make Cassandra aware of which nodes within a datacenter are installed in the same rack. Cassandra uses this information when reading and when moving data for token range changes, so that the closest replica is used. The proximity-detection behaviour can be changed via the !EndpointSnitch class, which is pluggable in the configuration file.
  
  
!EndpointSnitch is related to the replication strategy, but it is not the replication strategy itself. !RackAwareStrategy needs a correctly configured snitch
  to place replicas properly; however, even when you do not use a datacenter-aware replication strategy, Cassandra still needs proximity information between nodes.


svn commit: r1091080 - in /cassandra/branches/cassandra-0.8: ./ interface/ interface/thrift/gen-java/org/apache/cassandra/thrift/ src/avro/ src/java/org/apache/cassandra/config/ src/java/org/apache/ca

2011-04-11 Thread jbellis
Author: jbellis
Date: Mon Apr 11 14:11:23 2011
New Revision: 1091080

URL: http://svn.apache.org/viewvc?rev=1091080&view=rev
Log:
add optional key alias to CFMetaData
patch by jhermes; reviewed by jbellis for CASSANDRA-2396

Modified:
cassandra/branches/cassandra-0.8/CHANGES.txt
cassandra/branches/cassandra-0.8/interface/cassandra.thrift

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/CfDef.java

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Constants.java
cassandra/branches/cassandra-0.8/src/avro/internode.genavro

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/config/CFMetaData.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1091080&r1=1091079&r2=1091080&view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Mon Apr 11 14:11:23 2011
@@ -18,7 +18,7 @@
  * purge tombstones from row cache (CASSANDRA-2305)
  * push replication_factor into strategy_options (CASSANDRA-1263)
  * give snapshots the same name on each node (CASSANDRA-1791)
- * add key type information (CASSANDRA-2311)
+ * add key type information and alias (CASSANDRA-2311, 2396)
 
 
 0.7.5

Modified: cassandra/branches/cassandra-0.8/interface/cassandra.thrift
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/interface/cassandra.thrift?rev=1091080&r1=1091079&r2=1091080&view=diff
==
--- cassandra/branches/cassandra-0.8/interface/cassandra.thrift (original)
+++ cassandra/branches/cassandra-0.8/interface/cassandra.thrift Mon Apr 11 
14:11:23 2011
@@ -46,7 +46,7 @@ namespace rb CassandraThrift
 #   for every edit that doesn't result in a change to major/minor.
 #
 # See the Semantic Versioning Specification (SemVer) http://semver.org.
-const string VERSION = "20.0.0"
+const string VERSION = "20.1.0"
 
 
 #
@@ -394,6 +394,7 @@ struct CfDef {
 25: optional double merge_shards_chance,
 26: optional string key_validation_class,
27: optional string 
row_cache_provider="org.apache.cassandra.cache.ConcurrentLinkedHashCacheProvider",
+28: optional binary key_alias,
 }
 
 /* describes a keyspace. */

Modified: 
cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/CfDef.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/CfDef.java?rev=1091080&r1=1091079&r2=1091080&view=diff
==
--- 
cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/CfDef.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/CfDef.java
 Mon Apr 11 14:11:23 2011
@@ -69,6 +69,7 @@ public class CfDef implements org.apache
   private static final org.apache.thrift.protocol.TField 
MERGE_SHARDS_CHANCE_FIELD_DESC = new 
org.apache.thrift.protocol.TField("merge_shards_chance", 
org.apache.thrift.protocol.TType.DOUBLE, (short)25);
   private static final org.apache.thrift.protocol.TField 
KEY_VALIDATION_CLASS_FIELD_DESC = new 
org.apache.thrift.protocol.TField("key_validation_class", 
org.apache.thrift.protocol.TType.STRING, (short)26);
   private static final org.apache.thrift.protocol.TField 
ROW_CACHE_PROVIDER_FIELD_DESC = new 
org.apache.thrift.protocol.TField("row_cache_provider", 
org.apache.thrift.protocol.TType.STRING, (short)27);
+  private static final org.apache.thrift.protocol.TField KEY_ALIAS_FIELD_DESC 
= new org.apache.thrift.protocol.TField("key_alias", 
org.apache.thrift.protocol.TType.STRING, (short)28);
 
   public String keyspace;
   public String name;
@@ -94,6 +95,7 @@ public class CfDef implements org.apache
   public double merge_shards_chance;
   public String key_validation_class;
   public String row_cache_provider;
+  public ByteBuffer key_alias;
 
   /** The set of fields this struct contains, along with convenience methods 
for finding and manipulating them. */
   public enum _Fields implements org.apache.thrift.TFieldIdEnum {
@@ -120,7 +122,8 @@ public class CfDef implements org.apache
REPLICATE_ON_WRITE((short)24, "replicate_on_write"),
MERGE_SHARDS_CHANCE((short)25, "merge_shards_chance"),
KEY_VALIDATION_CLASS((short)26, "key_validation_class"),
-ROW_CACHE_PROVIDER((short)27, "row_cache_provider");
+ROW_CACHE_PROVIDER((short)27, "row_cache_provider"),
+KEY_ALIAS((short)28, "key_alias");
 
private static final Map<String, _Fields> byName = new HashMap<String, 
_Fields>();
 
@@ -183,6 +186,8 @@ public class CfDef implements org.apache
  

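For illustration only (this is not part of the patch): once bindings are regenerated from the interface above, a Thrift client could populate the new key_alias field roughly as in the sketch below. The host, port, keyspace and column family names are assumptions.

    # Sketch, assuming Python bindings regenerated from the 0.8 cassandra.thrift above.
    from thrift.transport import TSocket, TTransport
    from thrift.protocol import TBinaryProtocol
    from cassandra import Cassandra
    from cassandra.ttypes import CfDef

    socket = TSocket.TSocket('localhost', 9160)
    transport = TTransport.TFramedTransport(socket)
    client = Cassandra.Client(TBinaryProtocol.TBinaryProtocolAccelerated(transport))
    transport.open()

    client.set_keyspace('Keyspace1')                  # assumed keyspace name
    cf = CfDef(keyspace='Keyspace1', name='Users',    # assumed column family name
               key_validation_class='org.apache.cassandra.db.marshal.UTF8Type',
               key_alias='userid')                    # new optional field 28
    client.system_add_column_family(cf)
    transport.close()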
svn commit: r1091087 [1/3] - in /cassandra/branches/cassandra-0.8/drivers: py/cql/cassandra/constants.py py/cql/cassandra/ttypes.py txpy/txcql/cassandra/Cassandra.py txpy/txcql/cassandra/constants.py

2011-04-11 Thread jbellis
Author: jbellis
Date: Mon Apr 11 14:23:30 2011
New Revision: 1091087

URL: http://svn.apache.org/viewvc?rev=1091087view=rev
Log:
update generated .py code

Modified:
cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/constants.py
cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/ttypes.py
cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/Cassandra.py
cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/constants.py
cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/ttypes.py

Modified: cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/constants.py
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/constants.py?rev=1091087&r1=1091086&r2=1091087&view=diff
==
--- cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/constants.py 
(original)
+++ cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/constants.py Mon 
Apr 11 14:23:30 2011
@@ -7,4 +7,4 @@
 from thrift.Thrift import *
 from ttypes import *
 
-VERSION = "20.0.0"
+VERSION = "20.1.0"

Modified: cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/ttypes.py
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/ttypes.py?rev=1091087&r1=1091086&r2=1091087&view=diff
==
--- cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/ttypes.py 
(original)
+++ cassandra/branches/cassandra-0.8/drivers/py/cql/cassandra/ttypes.py Mon Apr 
11 14:23:30 2011
@@ -2324,6 +2324,7 @@ class CfDef:
- merge_shards_chance
- key_validation_class
- row_cache_provider
+   - key_alias
   
 
   thrift_spec = (
@@ -2355,9 +2356,10 @@ class CfDef:
 (25, TType.DOUBLE, 'merge_shards_chance', None, None, ), # 25
 (26, TType.STRING, 'key_validation_class', None, None, ), # 26
     (27, TType.STRING, 'row_cache_provider', None, "org.apache.cassandra.cache.ConcurrentLinkedHashCacheProvider", ), # 27
+(28, TType.STRING, 'key_alias', None, None, ), # 28
   )
 
-  def __init__(self, keyspace=None, name=None, column_type=thrift_spec[3][4], 
comparator_type=thrift_spec[5][4], subcomparator_type=None, comment=None, 
row_cache_size=thrift_spec[9][4], key_cache_size=thrift_spec[11][4], 
read_repair_chance=thrift_spec[12][4], column_metadata=None, 
gc_grace_seconds=None, default_validation_class=None, id=None, 
min_compaction_threshold=None, max_compaction_threshold=None, 
row_cache_save_period_in_seconds=None, key_cache_save_period_in_seconds=None, 
memtable_flush_after_mins=None, memtable_throughput_in_mb=None, 
memtable_operations_in_millions=None, replicate_on_write=None, 
merge_shards_chance=None, key_validation_class=None, 
row_cache_provider=thrift_spec[27][4],):
+  def __init__(self, keyspace=None, name=None, column_type=thrift_spec[3][4], 
comparator_type=thrift_spec[5][4], subcomparator_type=None, comment=None, 
row_cache_size=thrift_spec[9][4], key_cache_size=thrift_spec[11][4], 
read_repair_chance=thrift_spec[12][4], column_metadata=None, 
gc_grace_seconds=None, default_validation_class=None, id=None, 
min_compaction_threshold=None, max_compaction_threshold=None, 
row_cache_save_period_in_seconds=None, key_cache_save_period_in_seconds=None, 
memtable_flush_after_mins=None, memtable_throughput_in_mb=None, 
memtable_operations_in_millions=None, replicate_on_write=None, 
merge_shards_chance=None, key_validation_class=None, 
row_cache_provider=thrift_spec[27][4], key_alias=None,):
 self.keyspace = keyspace
 self.name = name
 self.column_type = column_type
@@ -2382,6 +2384,7 @@ class CfDef:
 self.merge_shards_chance = merge_shards_chance
 self.key_validation_class = key_validation_class
 self.row_cache_provider = row_cache_provider
+self.key_alias = key_alias
 
   def read(self, iprot):
 if iprot.__class__ == TBinaryProtocol.TBinaryProtocolAccelerated and 
isinstance(iprot.trans, TTransport.CReadableTransport) and self.thrift_spec is 
not None and fastbinary is not None:
@@ -2518,6 +2521,11 @@ class CfDef:
   self.row_cache_provider = iprot.readString();
 else:
   iprot.skip(ftype)
+  elif fid == 28:
+if ftype == TType.STRING:
+  self.key_alias = iprot.readString();
+else:
+  iprot.skip(ftype)
   else:
 iprot.skip(ftype)
   iprot.readFieldEnd()
@@ -2627,6 +2635,10 @@ class CfDef:
   oprot.writeFieldBegin('row_cache_provider', TType.STRING, 27)
   oprot.writeString(self.row_cache_provider)
   oprot.writeFieldEnd()
+if self.key_alias != None:
+  oprot.writeFieldBegin('key_alias', TType.STRING, 28)
+  oprot.writeString(self.key_alias)
+  oprot.writeFieldEnd()
 oprot.writeFieldStop()
 oprot.writeStructEnd()
 def validate(self):




svn commit: r1091087 [3/3] - in /cassandra/branches/cassandra-0.8/drivers: py/cql/cassandra/constants.py py/cql/cassandra/ttypes.py txpy/txcql/cassandra/Cassandra.py txpy/txcql/cassandra/constants.py

2011-04-11 Thread jbellis
Modified: 
cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/constants.py
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/constants.py?rev=1091087&r1=1091086&r2=1091087&view=diff
==
--- cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/constants.py 
(original)
+++ cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/constants.py 
Mon Apr 11 14:23:30 2011
@@ -7,4 +7,4 @@
 from thrift.Thrift import *
 from ttypes import *
 
-VERSION = "20.0.0"
+VERSION = "20.1.0"

Modified: 
cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/ttypes.py
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/ttypes.py?rev=1091087&r1=1091086&r2=1091087&view=diff
==
--- cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/ttypes.py 
(original)
+++ cassandra/branches/cassandra-0.8/drivers/txpy/txcql/cassandra/ttypes.py Mon 
Apr 11 14:23:30 2011
@@ -2324,6 +2324,7 @@ class CfDef:
- merge_shards_chance
- key_validation_class
- row_cache_provider
+   - key_alias
   
 
   thrift_spec = (
@@ -2355,9 +2356,10 @@ class CfDef:
 (25, TType.DOUBLE, 'merge_shards_chance', None, None, ), # 25
 (26, TType.STRING, 'key_validation_class', None, None, ), # 26
     (27, TType.STRING, 'row_cache_provider', None, "org.apache.cassandra.cache.ConcurrentLinkedHashCacheProvider", ), # 27
+(28, TType.STRING, 'key_alias', None, None, ), # 28
   )
 
-  def __init__(self, keyspace=None, name=None, column_type=thrift_spec[3][4], 
comparator_type=thrift_spec[5][4], subcomparator_type=None, comment=None, 
row_cache_size=thrift_spec[9][4], key_cache_size=thrift_spec[11][4], 
read_repair_chance=thrift_spec[12][4], column_metadata=None, 
gc_grace_seconds=None, default_validation_class=None, id=None, 
min_compaction_threshold=None, max_compaction_threshold=None, 
row_cache_save_period_in_seconds=None, key_cache_save_period_in_seconds=None, 
memtable_flush_after_mins=None, memtable_throughput_in_mb=None, 
memtable_operations_in_millions=None, replicate_on_write=None, 
merge_shards_chance=None, key_validation_class=None, 
row_cache_provider=thrift_spec[27][4],):
+  def __init__(self, keyspace=None, name=None, column_type=thrift_spec[3][4], 
comparator_type=thrift_spec[5][4], subcomparator_type=None, comment=None, 
row_cache_size=thrift_spec[9][4], key_cache_size=thrift_spec[11][4], 
read_repair_chance=thrift_spec[12][4], column_metadata=None, 
gc_grace_seconds=None, default_validation_class=None, id=None, 
min_compaction_threshold=None, max_compaction_threshold=None, 
row_cache_save_period_in_seconds=None, key_cache_save_period_in_seconds=None, 
memtable_flush_after_mins=None, memtable_throughput_in_mb=None, 
memtable_operations_in_millions=None, replicate_on_write=None, 
merge_shards_chance=None, key_validation_class=None, 
row_cache_provider=thrift_spec[27][4], key_alias=None,):
 self.keyspace = keyspace
 self.name = name
 self.column_type = column_type
@@ -2382,6 +2384,7 @@ class CfDef:
 self.merge_shards_chance = merge_shards_chance
 self.key_validation_class = key_validation_class
 self.row_cache_provider = row_cache_provider
+self.key_alias = key_alias
 
   def read(self, iprot):
 if iprot.__class__ == TBinaryProtocol.TBinaryProtocolAccelerated and 
isinstance(iprot.trans, TTransport.CReadableTransport) and self.thrift_spec is 
not None and fastbinary is not None:
@@ -2518,6 +2521,11 @@ class CfDef:
   self.row_cache_provider = iprot.readString();
 else:
   iprot.skip(ftype)
+  elif fid == 28:
+if ftype == TType.STRING:
+  self.key_alias = iprot.readString();
+else:
+  iprot.skip(ftype)
   else:
 iprot.skip(ftype)
   iprot.readFieldEnd()
@@ -2627,6 +2635,10 @@ class CfDef:
   oprot.writeFieldBegin('row_cache_provider', TType.STRING, 27)
   oprot.writeString(self.row_cache_provider)
   oprot.writeFieldEnd()
+if self.key_alias != None:
+  oprot.writeFieldBegin('key_alias', TType.STRING, 28)
+  oprot.writeString(self.key_alias)
+  oprot.writeFieldEnd()
 oprot.writeFieldStop()
 oprot.writeStructEnd()
 def validate(self):




[Cassandra Wiki] Update of MultinodeCluster by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The MultinodeCluster page has been changed by MakiWatanabe.
The comment on this change is: update for 0.7 format.
http://wiki.apache.org/cassandra/MultinodeCluster?action=diff&rev1=5&rev2=6

--

+ Prior to the 0.7 release, Cassandra storage configuration was described by the 
''conf/storage-conf.xml'' file. As of 0.7, it is described by the 
''conf/cassandra.yaml'' file. Please refer to MultinodeCluster06 for the pre-0.7 
configuration.
+ 
  = Creating a multinode cluster =
  
- The default storage-conf.xml provided with cassandra is great for getting up 
and running on a single node.  However, it is inappropriate for use in a 
multi-node cluster.  The configuration and process here are the ''simplest'' 
way to create a multi-node cluster, but may not be the ''best'' way in 
production deployments.
+ The default cassandra.yaml provided with cassandra is great for getting up 
and running on a single node.  However, it is inappropriate for use in a 
multi-node cluster.  The configuration and process here are the ''simplest'' 
way to create a multi-node cluster, but may not be the ''best'' way in 
production deployments.
  
  == Preparing the first node ==
  
- The default storage-conf.xml uses the local, loopback address as its listen 
(inter-node) and Thrift (client access) addresses:
+ The default cassandra.yaml uses the local, loopback address as its listen 
(inter-node) and Thrift (client access) addresses:
  
  {{{
- <ListenAddress>localhost</ListenAddress>
+ listen_address: localhost
 
- <ThriftAddress>localhost</ThriftAddress>
+ rpc_address: localhost
  }}}
  
  As the listen address is used for intra-cluster communication, it must be 
changed to a routable address so the other nodes can reach it.  For example, 
assuming you have an Ethernet interface with address 192.168.1.1, you would 
change the listen address like so:
  
  {{{
- <ListenAddress>192.168.1.1</ListenAddress>
+ listen_address: 192.168.1.1
  }}}
  
  The Thrift interface can be configured using either a specified address, like 
the listen address, or using the wildcard 0.0.0.0, which causes cassandra to 
listen for clients on all available interfaces.  Update it as either:
  
  {{{
- <ThriftAddress>192.168.1.1</ThriftAddress>
+ rpc_address: 192.168.1.1
  }}}
  
  Or:
  
  {{{
- <ThriftAddress>0.0.0.0</ThriftAddress>
+ rpc_address: 0.0.0.0
  }}}
  
  If the DNS entry for your host is correct, it is safe to use a hostname 
instead of an IP address.  Similarly, the seed information should be changed 
from the loopback address:
  
  {{{
- <Seeds>
-   <Seed>127.0.0.1</Seed>
- </Seeds>
+ seeds:
+   - 127.0.0.1
+ 
  }}}
  
  Becomes:
  
  {{{
- <Seeds>
-   <Seed>192.168.1.1</Seed>
- </Seeds>
+ seeds:
+   - 192.168.1.1
+ 
  }}}
  
  Once these changes are made, simply restart cassandra on this node.  Use 
netstat to verify cassandra is listening on the right address.  Look for a line 
like this:
  
  {{{tcp4   0  0  192.168.1.1.7000 *.*
LISTEN}}}
  
- If netstat still shows cassandra listening on 127.0.0.1.7000, then either the 
previous cassandra process was not properly killed or you are not editing the 
storage-conf.xml file cassandra is actually using.
+ If netstat still shows cassandra listening on 127.0.0.1.7000, then either the 
previous cassandra process was not properly killed or you are not editing the 
cassandra.yaml file cassandra is actually using.
  
  
  == Preparing the rest of the nodes ==
  
- The other nodes in the ring will use a storage-conf.xml almost identical to 
the one on your first node, so use that configuration as the base for these 
changes rather than the default storage-conf.xml.  The first change is to turn 
on automatic bootstrapping.  This will cause the node to join the ring and 
attempt to take control of a range of the token space:
+ The other nodes in the ring will use a cassandra.yaml almost identical to the 
one on your first node, so use that configuration as the base for these changes 
rather than the default cassandra.yaml.  The first change is to turn on 
automatic bootstrapping.  This will cause the node to join the ring and attempt 
to take control of a range of the token space:
  
  {{{
- <AutoBootstrap>true</AutoBootstrap>
+ auto_bootstrap: true
  }}}
  
  The second change is to the listen address, as it must also not be the 
loopback and cannot be the same as any other node.  Assuming your second node 
has an Ethernet interface with the address 192.168.2.34, set its listen address 
with:
  
  {{{
- <ListenAddress>192.168.2.34</ListenAddress>
+ listen_address: 192.168.2.34
  }}}
  
  Finally, update the Thrift address to accept client connections, as with 
the first node, either with a specific address or the wildcard:
  
  {{{
- <ThriftAddress>192.168.2.34</ThriftAddress>
+ rpc_address: 192.168.2.34
  }}}
  
  Or:
  
  {{{
- 

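As a quick sanity check of the edits described above, a few lines of Python can confirm a node is no longer configured to sit on the loopback address. This is a sketch only; it assumes PyYAML is available and the default conf/cassandra.yaml path.

    # Sketch: warn if a cassandra.yaml still points at loopback addresses.
    import yaml

    with open('conf/cassandra.yaml') as f:
        conf = yaml.safe_load(f)

    for key in ('listen_address', 'rpc_address'):
        if conf.get(key) in ('localhost', '127.0.0.1'):
            print('%s is still %s; other nodes/clients cannot reach it' % (key, conf.get(key)))

    # 0.0.0.0 is only meaningful for rpc_address; seeds must list reachable nodes.
    if '127.0.0.1' in (conf.get('seeds') or []):
        print('seeds still contains 127.0.0.1; point it at a reachable node instead')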
[Cassandra Wiki] Update of MultinodeCluster06_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The MultinodeCluster06_JP page has been changed by MakiWatanabe.
http://wiki.apache.org/cassandra/MultinodeCluster06_JP?action=diff&rev1=12&rev2=13

--

  ## page was copied from MultinodeCluster_JP
  ## page was copied from MultinodeCluster
  
- 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See StorageConfiguration for details.'''
+ 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See StorageConfiguration_JP for details.'''
  
  
  = Creating a multinode cluster =


svn commit: r1091090 - in /cassandra/branches/cassandra-0.8: ./ contrib/ interface/thrift/gen-java/org/apache/cassandra/thrift/ src/java/org/apache/cassandra/service/

2011-04-11 Thread jbellis
Author: jbellis
Date: Mon Apr 11 14:32:45 2011
New Revision: 1091090

URL: http://svn.apache.org/viewvc?rev=1091090view=rev
Log:
merge from 0.7

Modified:
cassandra/branches/cassandra-0.8/   (props changed)
cassandra/branches/cassandra-0.8/CHANGES.txt
cassandra/branches/cassandra-0.8/contrib/   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/InvalidRequestException.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/NotFoundException.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/SuperColumn.java
   (props changed)

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java

Propchange: cassandra/branches/cassandra-0.8/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 14:32:45 2011
@@ -1,5 +1,5 @@
 
/cassandra/branches/cassandra-0.6:922689-1052356,1052358-1053452,1053454,1053456-1081914,1083000
-/cassandra/branches/cassandra-0.7:1026516-1090647
+/cassandra/branches/cassandra-0.7:1026516-1091087
 /cassandra/branches/cassandra-0.7.0:1053690-1055654
 /cassandra/tags/cassandra-0.7.0-rc3:1051699-1053689
 /incubator/cassandra/branches/cassandra-0.3:774578-796573

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1091090&r1=1091089&r2=1091090&view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Mon Apr 11 14:32:45 2011
@@ -45,7 +45,9 @@
index (CASSANDRA-2376)
  * fix race condition that could leave orphaned data files when
dropping CF or KS (CASSANDRA-2381)
+ * convert mmap assertion to if/throw so scrub can catch it (CASSANDRA-2417)
  * Try harder to close files after compaction (CASSANDRA-2431)
+ * re-set bootstrapped flag after move finishes (CASSANDRA-2435)
 
 
 0.7.4

Propchange: cassandra/branches/cassandra-0.8/contrib/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 14:32:45 2011
@@ -1,5 +1,5 @@
 
/cassandra/branches/cassandra-0.6/contrib:922689-1052356,1052358-1053452,1053454,1053456-1068009
-/cassandra/branches/cassandra-0.7/contrib:1026516-1090647
+/cassandra/branches/cassandra-0.7/contrib:1026516-1091087
 /cassandra/branches/cassandra-0.7.0/contrib:1053690-1055654
 /cassandra/tags/cassandra-0.7.0-rc3/contrib:1051699-1053689
 /incubator/cassandra/branches/cassandra-0.3/contrib:774578-796573

Propchange: 
cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 14:32:45 2011
@@ -1,5 +1,5 @@
 
/cassandra/branches/cassandra-0.6/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:922689-1052356,1052358-1053452,1053454,1053456-1081914,1083000
-/cassandra/branches/cassandra-0.7/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1026516-1090647
+/cassandra/branches/cassandra-0.7/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1026516-1091087
 
/cassandra/branches/cassandra-0.7.0/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1053690-1055654
 
/cassandra/tags/cassandra-0.7.0-rc3/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1051699-1053689
 
/incubator/cassandra/branches/cassandra-0.3/interface/gen-java/org/apache/cassandra/service/Cassandra.java:774578-796573

Propchange: 
cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 14:32:45 2011
@@ -1,5 +1,5 @@
 
/cassandra/branches/cassandra-0.6/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java:922689-1052356,1052358-1053452,1053454,1053456-1081914,1083000
-/cassandra/branches/cassandra-0.7/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java:1026516-1090647
+/cassandra/branches/cassandra-0.7/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java:1026516-1091087
 
/cassandra/branches/cassandra-0.7.0/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java:1053690-1055654
 

[Cassandra Wiki] Update of MultinodeCluster06_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The MultinodeCluster06_JP page has been changed by MakiWatanabe.
http://wiki.apache.org/cassandra/MultinodeCluster06_JP?action=diff&rev1=13&rev2=14

--

  ## page was copied from MultinodeCluster_JP
  ## page was copied from MultinodeCluster
  
- 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See StorageConfiguration_JP for details.'''
+ 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See [[StorageConfiguration|StorageConfiguration_JP]] for details.'''
  
  
  = Creating a multinode cluster =
@@ -14, +14 @@

  
  
The default storage-conf.xml uses the loopback address as its listen (inter-node) and Thrift (client access) addresses:
  
- Versions before 0.7
  {{{
  <ListenAddress>localhost</ListenAddress>
  <ThriftAddress>localhost</ThriftAddress>
  }}}
  
- 0.7
- {{{
- listen_address: localhost
- rpc_address: localhost
- }}}
  
  Since the listen address is used for inter-node communication, it must be changed to an address that the other nodes can reach.
  For example, if the node has an Ethernet interface at 192.168.1.1, change the listen address like so:
  
- Versions before 0.7
  {{{
  <ListenAddress>192.168.1.1</ListenAddress>
- }}}
- 
- 0.7
- {{{
- listen_address: 192.168.1.1
  }}}
  
  
  
  The Thrift interface can be given a specific IP address or the wildcard address 0.0.0.0, in which case cassandra accepts client requests on all available interfaces. Set the Thrift address as follows:
  
- Versions before 0.7
  {{{
  <ThriftAddress>192.168.1.1</ThriftAddress>
  }}}
  
- 0.7
- {{{
- rpc_address: 192.168.1.1
- }}}
- 
  Or:
  
- Versions before 0.7
  {{{
  <ThriftAddress>0.0.0.0</ThriftAddress>
  }}}
  
- 0.7
- {{{
- rpc_address: 0.0.0.0
- }}}
- 
  
  If the DNS entry for the host is correct, it is safer to use a hostname than an IP address. Similarly, the seed information must also be changed from the loopback address:
  
- Versions before 0.7
  {{{
  <Seeds>
    <Seed>127.0.0.1</Seed>
  </Seeds>
  }}}
  
- 0.7
- {{{
- seeds:
- - 127.0.0.1
- }}}
- 
  Becomes:
  
- Versions before 0.7
  {{{
  <Seeds>
    <Seed>192.168.1.1</Seed>
  </Seeds>
  }}}
  
- 0.7
- {{{
- seeds:
- - 192.168.1.1
- }}}
  
  
Once these changes are made, restart cassandra on this node and use netstat to verify that cassandra is listening on the right address.
  If it is configured correctly you should see a line like this:
@@ -105, +68 @@

  
  
The other nodes in the ring use a storage-conf.xml almost identical to the one configured on the first node, so base your changes on the storage-conf.xml you edited there. The first change is to enable automatic bootstrapping, which makes the node join the ring and try to take responsibility for a range of the token space:
  
- Versions before 0.7
  {{{
  <AutoBootstrap>true</AutoBootstrap>
  }}}
  
- 0.7
- {{{
- auto_bootstrap: true
- }}}
- 
  
  The second change is the listen address: it must not be the loopback address and must not be the same as any other node's. If the second node has an Ethernet interface at 192.168.2.34, set its listen address as follows:
  
- Versions before 0.7
  {{{
  <ListenAddress>192.168.2.34</ListenAddress>
  }}}
  
- 0.7
- {{{
- listen_address: 192.168.2.34
- }}}
- 
  Finally, update the Thrift address so the node accepts client connections; as with the first node, use either a specific address or the wildcard:
  
- Versions before 0.7
  {{{
  <ThriftAddress>192.168.2.34</ThriftAddress>
  }}}
  
- 0.7
- {{{
- rpc_address: 192.168.2.34
- }}}
- 
  Or:
  
- Versions before 0.7
  {{{
  <ThriftAddress>0.0.0.0</ThriftAddress>
  }}}
  
- 0.7
- {{{
- rpc_address: 0.0.0.0
- }}}
  
  
Note that the Seeds section of the configuration must be left in place; the node being added needs it to find the first node while bootstrapping. Once these changes are made, start cassandra on the new node. It will join the ring automatically, assign itself an initial token, and begin accepting requests.
  


[Cassandra Wiki] Trivial Update of MultinodeCluster06_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The MultinodeCluster06_JP page has been changed by MakiWatanabe.
http://wiki.apache.org/cassandra/MultinodeCluster06_JP?action=diff&rev1=14&rev2=15

--

  ## page was copied from MultinodeCluster_JP
  ## page was copied from MultinodeCluster
  
- 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See [[StorageConfiguration|StorageConfiguration_JP]] for details.'''
+ 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See [[StorageConfiguration_JP|StorageConfiguration]] for details.'''
  
  
  = Creating a multinode cluster =


[Cassandra Wiki] Update of MultinodeCluster_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The MultinodeCluster_JP page has been changed by MakiWatanabe.
The comment on this change is: update for 0.7 format.
http://wiki.apache.org/cassandra/MultinodeCluster_JP?action=diff&rev1=11&rev2=12

--

  ## page was copied from MultinodeCluster
  
- 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml. See StorageConfiguration for details.'''
+ 
'''Before 0.7, storage configuration was described in the conf/storage-conf.xml file; as of 0.7 it is described in conf/cassandra.yaml.
+ See [[MultinodeCluster06_JP|MultinodeCluster06]] for how versions 0.6 and earlier are configured.
+ See [[StorageConfiguration_JP]] for details about each parameter.'''
  
  
  = Creating a multinode cluster =
  
- 
The default storage-conf.xml included with the Cassandra package is convenient for setting up a single-node environment, but it is not appropriate for a multi-node cluster. This page describes the simplest procedure and configuration for building a multi-node cluster, though not necessarily the best one for a production deployment.
+ 
The default cassandra.yaml included with the Cassandra package is convenient for setting up a single-node environment, but it is not appropriate for a multi-node cluster. This page describes the simplest procedure and configuration for building a multi-node cluster, though not necessarily the best one for a production deployment.
  
  == Preparing the first node ==
  
- 
The default storage-conf.xml uses the loopback address as its listen (inter-node) and Thrift (client access) addresses:
+ 
The default cassandra.yaml uses the loopback address as its listen (inter-node) and Thrift (client access) addresses:
  
- Versions before 0.7
- {{{
- <ListenAddress>localhost</ListenAddress>
- <ThriftAddress>localhost</ThriftAddress>
- }}}
- 
- 0.7
  {{{
  listen_address: localhost
  rpc_address: localhost
@@ -28, +23 @@

  Since the listen address is used for inter-node communication, it must be changed to an address that the other nodes can reach.
  For example, if the node has an Ethernet interface at 192.168.1.1, change the listen address like so:
  
- Versions before 0.7
- {{{
- <ListenAddress>192.168.1.1</ListenAddress>
- }}}
- 
- 0.7
  {{{
  listen_address: 192.168.1.1
  }}}
@@ -41, +30 @@

  
  
  The Thrift interface can be given a specific IP address or the wildcard address 0.0.0.0, in which case cassandra accepts client requests on all available interfaces. Set the Thrift address as follows:
  
- Versions before 0.7
- {{{
- <ThriftAddress>192.168.1.1</ThriftAddress>
- }}}
- 
- 0.7
  {{{
  rpc_address: 192.168.1.1
  }}}
  
  Or:
  
- Versions before 0.7
- {{{
- <ThriftAddress>0.0.0.0</ThriftAddress>
- }}}
- 
- 0.7
  {{{
  rpc_address: 0.0.0.0
  }}}
  
  
  If the DNS entry for the host is correct, it is safer to use a hostname than an IP address. Similarly, the seed information must also be changed from the loopback address:
  
- Versions before 0.7
- {{{
- <Seeds>
-   <Seed>127.0.0.1</Seed>
- </Seeds>
- }}}
- 
- 0.7
  {{{
  seeds:
  - 127.0.0.1
@@ -80, +49 @@

  
  Becomes:
  
- Versions before 0.7
- {{{
- <Seeds>
-   <Seed>192.168.1.1</Seed>
- </Seeds>
- }}}
- 
- 0.7
  {{{
  seeds:
  - 192.168.1.1
@@ -98, +59 @@

  
  {{{tcp4   0  0  192.168.1.1.7000 *.*
LISTEN}}}
  
- 
If cassandra is still listening on 127.0.0.1.7000, then either the previous cassandra process was not killed properly or cassandra is not reading the storage-conf.xml you edited.
+ 
If cassandra is still listening on 127.0.0.1.7000, then either the previous cassandra process was not killed properly or cassandra is not reading the cassandra.yaml you edited.
  
  == Preparing the rest of the nodes ==
  
- 
The other nodes in the ring use a storage-conf.xml almost identical to the one configured on the first node, so base your changes on the storage-conf.xml you edited there. The first change is to enable automatic bootstrapping, which makes the node join the ring and try to take responsibility for a range of the token space:
+ 
The other nodes in the ring use a cassandra.yaml almost identical to the one configured on the first node, so base your changes on the cassandra.yaml you edited there. The first change is to enable automatic bootstrapping, which makes the node join the ring and try to take responsibility for a range of the token space:
  
- Versions before 0.7
- {{{
- <AutoBootstrap>true</AutoBootstrap>
- }}}
- 
- 0.7
  {{{
  auto_bootstrap: true
  }}}
  
  
  The second change is the listen address: it must not be the loopback address and must not be the same as any other node's. If the second node has an Ethernet interface at 192.168.2.34, set its listen address as follows:
  
- Versions before 0.7
- {{{
- <ListenAddress>192.168.2.34</ListenAddress>
- }}}
- 
- 0.7
  {{{
  listen_address: 192.168.2.34
  }}}
  
  Finally, update the Thrift address so the node accepts client connections; as with the first node, use either a specific address or the wildcard:
  
- Versions before 0.7
- {{{
- <ThriftAddress>192.168.2.34</ThriftAddress>
- }}}
- 
- 0.7
  {{{
  rpc_address: 192.168.2.34
  }}}
  
  Or:
  
- Versions before 0.7
- {{{
- <ThriftAddress>0.0.0.0</ThriftAddress>
- }}}
- 
- 0.7
  {{{
  rpc_address: 0.0.0.0
  }}}


[jira] [Updated] (CASSANDRA-2342) Add range slice support for counters

2011-04-11 Thread Sylvain Lebresne (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-2342:


Attachment: 0001-Range-slice-support-for-counters.patch

CASSANDRA-2440 almost fixed that; this patch really only removes the check that 
rejects range_slice queries on counter CFs (and fixes the cli and adds a system test).

 Add range slice support for counters
 

 Key: CASSANDRA-2342
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2342
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.8
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Range-slice-support-for-counters.patch


 There is no equivalent for get_range_slice() for counters right now.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

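For context, a hedged sketch of what such a query could look like through the 0.8 Thrift API once the check is removed; the column family name and the open `client` connection are assumptions, as in the CfDef sketch earlier in this digest.

    from cassandra.ttypes import (ColumnParent, SlicePredicate, SliceRange,
                                  KeyRange, ConsistencyLevel)

    parent = ColumnParent(column_family='Counter1')   # assumed counter column family
    predicate = SlicePredicate(slice_range=SliceRange(start='', finish='', reversed=False, count=100))
    key_range = KeyRange(start_key='', end_key='', count=100)

    for key_slice in client.get_range_slices(parent, predicate, key_range, ConsistencyLevel.ONE):
        for cosc in key_slice.columns:                # counter_column is set for counter CFs
            print('%s %s=%d' % (key_slice.key, cosc.counter_column.name, cosc.counter_column.value))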

[Cassandra Wiki] Trivial Update of Operations_JP by MakiWatanabe

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The Operations_JP page has been changed by MakiWatanabe.
The comment on this change is: typo.
http://wiki.apache.org/cassandra/Operations_JP?action=diff&rev1=105&rev2=106

--

  
  * !RackUnawareStrategy: replicas are placed on the "next N-1 nodes" when the nodes are ordered by ascending token.
  
- * !RackAwareStrategy: the second replica is placed on the first node found when when walking the ring that is located in a different data center. The remaining N-2 replicas are placed, as far as possible, on nodes in the same rack as the node holding the first replica.
+ * !RackAwareStrategy: the second replica is placed on the first node found when walking the ring that is located in a different data center. The remaining N-2 replicas are placed, as far as possible, on nodes in the same rack as the node holding the first replica.
  
  
When using !RackAwareStrategy, note that nodes that are adjacent on the ring should be placed in different data centers in order to avoid skewed data placement. For example, suppose the ring consists of nodes A, B, C and D with tokens assigned in that order, and that A and B are in data center 1 while C and D are in data center 2. Then A and C are always the "first node found in the other data center", so A and C will accumulate more data than B and D.
  


[jira] [Created] (CASSANDRA-2447) Remove auto-bootstrap option

2011-04-11 Thread Jonathan Ellis (JIRA)
Remove auto-bootstrap option


 Key: CASSANDRA-2447
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2447
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Jonathan Ellis
Priority: Minor
 Fix For: 0.8


We already optimize auto-bootstrap to be a no-op if there are no non-system 
tables.

Given that, the only penalty imposed by autobootstrap is a 30s sleep waiting 
for gossip.  Feels worth it to avoid the confusion this option causes, and the 
problems that arise when it isn't turned on but should be.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2191) Multithread across compaction buckets

2011-04-11 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018379#comment-13018379
 ] 

Sylvain Lebresne commented on CASSANDRA-2191:
-

Forgot the magical +1 (better late than never)

 Multithread across compaction buckets
 -

 Key: CASSANDRA-2191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2191
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
Assignee: Stu Hood
Priority: Critical
  Labels: compaction
 Fix For: 0.8

 Attachments: 0001-Add-a-compacting-set-to-DataTracker.txt, 
 0002-Use-the-compacting-set-of-sstables-to-schedule-multith.txt, 
 0003-Expose-multiple-compactions-via-JMX-and-a-concrete-ser.txt, 
 0004-Allow-multithread-compaction-to-be-disabled.txt, 
 0005-Acquire-the-writeLock-for-major-cleanup-scrub-in-order.txt, 
 0006-Prevent-cache-saves-from-occuring-concurrently.txt


 This ticket overlaps with CASSANDRA-1876 to a degree, but the approaches and 
 reasoning are different enough to open a separate issue.
 The problem with compactions currently is that they compact the set of 
 sstables that existed the moment the compaction started. This means that for 
 longer running compactions (even when running as fast as possible on the 
 hardware), a very large number of new sstables might be created in the 
 meantime. We have observed this proliferation of sstables killing performance 
 during major/high-bucketed compactions.
 One approach would be to pause compactions in upper buckets (containing 
 larger files) when compactions in lower buckets become possible. While this 
 would likely solve the problem with read performance, it does not actually 
 help us perform compaction any faster, which is a reasonable requirement for 
 other situations.
 Instead, we need to be able to perform any compactions that are currently 
 required in parallel, independent of what bucket they might be in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

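A toy sketch of the scheduling idea (this is not the attached patch; every name below is an illustrative stand-in): keep a set of sstables that are currently being compacted, so buckets that do not overlap it can be compacted concurrently rather than one bucket at a time.

    import threading
    from concurrent.futures import ThreadPoolExecutor

    MIN_COMPACTION_THRESHOLD = 4   # stand-in for the per-CF min_compaction_threshold
    compacting = set()             # sstables currently owned by some running compaction
    lock = threading.Lock()

    def try_claim(bucket):
        # Atomically claim a bucket; skip it if it overlaps a running compaction.
        with lock:
            if compacting.isdisjoint(bucket):
                compacting.update(bucket)
                return True
            return False

    def compact(bucket):
        try:
            print('compacting %d sstables' % len(bucket))   # stand-in for the real merge
        finally:
            with lock:
                compacting.difference_update(bucket)

    def submit_compactions(buckets, pool):
        for bucket in buckets:
            if len(bucket) >= MIN_COMPACTION_THRESHOLD and try_claim(bucket):
                pool.submit(compact, bucket)

    pool = ThreadPoolExecutor(max_workers=2)
    submit_compactions([{'a-1', 'a-2', 'a-3', 'a-4'}, {'b-1', 'b-2', 'b-3', 'b-4'}], pool)
    pool.shutdown()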

[Cassandra Wiki] Update of FAQ by jab_doa

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The FAQ page has been changed by jab_doa.
The comment on this change is: big commit log.
http://wiki.apache.org/cassandra/FAQ?action=diff&rev1=112&rev2=113

--

   * [[#mmap|Why does top report that Cassandra is using a lot more memory than 
the Java heap max?]]
   * [[#jna|I'm getting java.io.IOException: Cannot run program ln when 
trying to snapshot or update a keyspace]]
   * [[#replicaplacement|How does Cassandra decide which nodes have what data?]]
-  * [[#cachehitrateunits| I have a row or key cache hit rate of 0.XX123456789. 
 Is that XX% or 0.XX% ?]]
+  * [[#cachehitrateunits|I have a row or key cache hit rate of 0.XX123456789.  
Is that XX% or 0.XX% ?]]
+  * [[#bigcommitlog|Commit Log gets very big. Cassandra does not delete old 
commit logs. Why?]]
  
  Anchor(cant_listen_on_ip_any)
  
@@ -408, +409 @@

  Anchor(replicaplacement)
  
  == How does Cassandra decide which nodes have what data? ==
- 
  The set of nodes (a single node, or several) responsible for any given piece 
of data is determined by:
  
   * The row key (data is partitioned on row key)
@@ -422, +422 @@

  Anchor(cachehitrateunits)
  
  == I have a row or key cache hit rate of 0.XX123456789 reported by JMX.  Is 
that XX% or 0.XX% ? ==
- 
  XX%
  
+ Anchor(bigcommitlog)
+ 
+ == Commit Log gets very big. Cassandra does not delete old commit logs. 
Why? ==
+ You probably have a few Column Families with very low throughput compared to the 
others. They will not get flushed very frequently, so the commit log doesn't get 
deleted. There is one option to work around the problem: you can set the per-CF 
option memtable_flush_after_mins, which flushes the CF at least every x minutes. 
60 minutes is a good value to start with. To set this via cassandra-cli use:
+ 
+ update column family XXX with memtable_flush_after=60;
+ 


[Cassandra Wiki] Update of FAQ by jab_doa

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The FAQ page has been changed by jab_doa.
http://wiki.apache.org/cassandra/FAQ?action=diff&rev1=113&rev2=114

--

  Anchor(bigcommitlog)
  
  == Commit Log gets very big. Cassandra does not delete old commit logs. 
Why? ==
- You probably have a few Column Families with very low throughput compared to the 
others. They will not get flushed very frequently, so the commit log doesn't get 
deleted. There is one option to work around the problem: you can set the per-CF 
option memtable_flush_after_mins, which flushes the CF at least every x minutes. 
60 minutes is a good value to start with. To set this via cassandra-cli use:
+ You probably have a few Column Families with very low throughput. They will not 
get flushed very frequently, so the commit log doesn't get deleted. There is one 
option to work around the problem: you can set the per-CF option 
memtable_flush_after_mins, which flushes the CF at least every x minutes. 60 
minutes is a good value to start with. To set this via cassandra-cli use:
  
  update column family XXX with memtable_flush_after=60;
  

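The same work-around can be applied through the Thrift API. The sketch below is illustrative only; it assumes an open Thrift `client` connection (as in the CfDef sketch earlier in this digest) and a column family named LowTrafficCF.

    ks_def = client.describe_keyspace('Keyspace1')     # assumed keyspace name
    cf = [c for c in ks_def.cf_defs if c.name == 'LowTrafficCF'][0]
    cf.memtable_flush_after_mins = 60   # flush at least hourly so old commit log segments can be freed
    client.system_update_column_family(cf)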

[jira] [Resolved] (CASSANDRA-1585) Support renaming columnfamilies and keyspaces

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1585.
---

Resolution: Later

 Support renaming columnfamilies and keyspaces
 -

 Key: CASSANDRA-1585
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1585
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Stu Hood
Priority: Minor

 Renames were briefly supported but were race-prone.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-785) Improve load balancing when using rack aware or DC strategy

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-785.
--

   Resolution: Won't Fix
Fix Version/s: (was: 0.8)

there is pretty general agreement now that node-at-a-time loadbalance is a dead 
end.

 Improve load balancing when using rack aware or DC strategy
 ---

 Key: CASSANDRA-785
 URL: https://issues.apache.org/jira/browse/CASSANDRA-785
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.5
Reporter: Jaakko Laine
 Attachments: 785.patch


 Current load balancing functionality does not consider data centers. This may 
 result in new nodes bootstrapping in wrong place if most loaded node is in 
 another DC than the bootstrapping node. Explore possibilities to make load 
 balance work better in multi-DC configuration without making load balancing 
 complicated.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-974) SASL authentication support

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-974.
--

   Resolution: Later
Fix Version/s: (was: 0.8)

Tabling this for now. If CQL is the future, and we write a new transport for 
it, we'll need to retarget for that.

 SASL authentication support
 ---

 Key: CASSANDRA-974
 URL: https://issues.apache.org/jira/browse/CASSANDRA-974
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Jonathan Ellis

 Looking at 
 http://java.sun.com/j2se/1.5.0/docs/guide/security/sasl/sasl-refguide.html I 
 am skeptical that we can make this work with Thrift.
 Someone please tell me how I am wrong. :)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[Cassandra Wiki] Trivial Update of FrontPage_ZH by JavenWang

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The FrontPage_ZH page has been changed by JavenWang.
The comment on this change is: "Threshold" should be translated as 阈值; as far as I know, 阀值 is not a word in Chinese.
http://wiki.apache.org/cassandra/FrontPage_ZH?action=diff&rev1=8&rev2=9

--

  
   * [[GettingStarted_ZH| Cassandra起步]]
   * [[RunningCassandra|运行Cassandra]]
-  * [[MemtableThresholds|内存表阀值]]
+  * [[MemtableThresholds|内存表阈值]]
   * [[StorageConfiguration|存储配置]]
   * [[ArchitectureOverview|架构总览]]
   * [[FAQ|常见问题]]


[jira] [Resolved] (CASSANDRA-1825) Separation of Data (Cached/Non-Cached)

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1825.
---

   Resolution: Later
Fix Version/s: (was: 0.8)

 Separation of Data (Cached/Non-Cached)
 --

 Key: CASSANDRA-1825
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1825
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Chris Goffinet

 At the moment Cassandra goes through the ROW-READ stage to fetch data from 
 the page cache, and if it's not in the page cache, it goes to disk.
 Data that is currently hot (in page cache) will block if all I/O threads are 
 busy reading from disk. We should seriously look at implementing a buffer 
 pool similar to MySQL for storing data in-memory, and our I/O threads be 
 dedicated to just going to disk.  I suggest studying how InnoDB does 
 scheduling as well, they have good lessons to learn from.
 Scaling I/O by threads isn't going to be a good solution here either. I 
 would argue that going past 64 threads for I/O is just going to hurt overall 
 performance because of context switching.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-1980) Add support for client-side encryption

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1980.
---

   Resolution: Later
Fix Version/s: (was: 0.8)

as with CASSANDRA-974, let's table this until we see how the CQL transport 
evolves

 Add support for client-side encryption
 --

 Key: CASSANDRA-1980
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1980
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Jeremy Hanna

 With thrift-106 in place coming in thrift version 0.6, it will be necessary 
 to add a way for making the server aware of that.  We could allow the option 
 in the server configuration (cassandra.yaml) to use encrypted or 
 non-encrypted client connections.  However if they choose encrypted, the only 
 clients able to connect would be java clients at this point.  It might be 
 better to take some more time and look at how to enable accepting both 
 encrypted and unencrypted connections at the same time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1473) Implement a Cassandra aware Hadoop mapreduce.Partitioner

2011-04-11 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018396#comment-13018396
 ] 

Jeremy Hanna commented on CASSANDRA-1473:
-

This still seems relevant in order to optimize output for Hadoop-related 
tasks.  If people are seriously using a portion of their cluster for analytics, 
this seems like a good cost/benefit type of ticket so that their cluster isn't 
as saturated just because of inefficiencies in output.  It would be nice to 
know the magnitude of the gain, but personally I think this is worthwhile.

 Implement a Cassandra aware Hadoop mapreduce.Partitioner
 

 Key: CASSANDRA-1473
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1473
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Stu Hood
 Fix For: 0.8


 When using an IPartitioner that does not sort data in byte order 
 (RandomPartitioner for example) with Cassandra's Hadoop integration, Hadoop 
 is unaware of the output order of the data.
 We can make Hadoop aware of the proper order of the output data by 
 implementing Hadoop's mapreduce.Partitioner interface: then Hadoop will 
 handle sorting all of the data according to Cassandra's IPartitioner, and the 
 writing clients will be able to connect to smaller numbers of Cassandra nodes.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

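To make the intent concrete, here is a small conceptual sketch (not the proposed Java mapreduce.Partitioner; the names are illustrative): with RandomPartitioner, rows sort by their md5 token, so routing keys to reducers by token means each reducer ends up writing to a narrow, contiguous slice of the ring and therefore to a small number of Cassandra nodes.

    from hashlib import md5

    RING_MAX = 2 ** 127                        # RandomPartitioner token space

    def token(row_key):
        return int(md5(row_key).hexdigest(), 16) % RING_MAX

    def reducer_for(row_key, num_reducers):
        # Split the token space evenly; keys destined for nearby tokens share a reducer.
        return token(row_key) * num_reducers // RING_MAX

    print(reducer_for(b'user:42', 16))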

[Cassandra Wiki] Update of HowToPublishToMavenCentral by StephenConnolly

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The HowToPublishToMavenCentral page has been changed by StephenConnolly.
http://wiki.apache.org/cassandra/HowToPublishToMavenCentral?action=diff&rev1=7&rev2=8

--

  
  == Prerequisites ==
  
-  1. You need to have a GPG signature set-up. (Open issue: Is there a place in 
Cassandra SVN for storing the official GPG keys of committers to allow third 
parties to verify that the releases are officical?  I would suggest somewhere 
like http://svn.apache.org/repos/asf/cassandra/site/src/content/keys.html so 
that it can be part of the Cassandra website)
+  1. You need to have a GPG private key set up and listed in the list of 
official keys: http://www.apache.org/dist/cassandra/KEYS
   1. You have a Subversion 1.5+ client installed and on your shell's path. See 
http://subversion.apache.org/. (Note: Ideally you would use a Subversion 1.6+ 
client but the minimum is 1.5+)
   1. You have JDK 6 installed and on your shell's path.
   1. If you receive an OutOfMemoryError during the build, make sure to have 
set the environment variable ANT_OPTS=-Xmx512m
   1. Follow the environment configuration steps outlined at: 
[[http://www.apache.org/dev/publishing-maven-artifacts.html#dev-env|Publishing 
Maven Artifacts]].
+  1. You need to have a minimal {{{~/.m2/settings.xml}}} file which at least 
defines the following 
+   {{{
+ <settings>
+   <servers>
+     <server>
+       <id>apache.snapshots.https</id>
+       <username><!-- your apache ldap username --></username>
+       <password><!-- your apache ldap password --></password>
+     </server>
+     <server>
+       <id>apache.releases.https</id>
+       <username><!-- your apache ldap username --></username>
+       <password><!-- your apache ldap password --></password>
+     </server>
+   </servers>
+   <profiles>
+     <profile>
+       <id>apache-release</id>
+       <properties>
+         <gpg.keyname><!-- enough of the key id to id it --></gpg.keyname>
+         <!-- either you feel comfortable with the passphrase on disk -->
+         <gpg.passphrase><!-- your passphrase for your gpg key goes here --></gpg.passphrase>
+         <!-- or you use an agent -->
+         <gpg.useagent>true</gpg.useagent>
+       </properties>
+     </profile>
+   </profiles>
+ </settings>
+ }}}
  
  == Using repository.apache.org ==
  


[jira] [Resolved] (CASSANDRA-2103) expiring counter columns

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-2103.
---

   Resolution: Won't Fix
Fix Version/s: (was: 0.8)
 Assignee: (was: Ryan King)

 expiring counter columns
 

 Key: CASSANDRA-2103
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2103
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Affects Versions: 0.8
Reporter: Kelvin Kakugawa
 Attachments: 0001-CASSANDRA-2103-expiring-counters-logic-tests.patch


 add ttl functionality to counter columns.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[Cassandra Wiki] Update of HowToPublishToMavenCentral by StephenConnolly

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The HowToPublishToMavenCentral page has been changed by StephenConnolly.
http://wiki.apache.org/cassandra/HowToPublishToMavenCentral?action=diff&rev1=8&rev2=9

--

{{{
  <settings>
    <servers>
+     <!-- once you have got things working you can encrypt the passwords for the servers -->
      <server>
        <id>apache.snapshots.https</id>
        <username><!-- your apache ldap username --></username>
@@ -33, +34 @@

        <id>apache-release</id>
        <properties>
          <gpg.keyname><!-- enough of the key id to id it --></gpg.keyname>
+ 
+         <!-- pick one of the following -->
+ 
          <!-- either you feel comfortable with the passphrase on disk -->
          <gpg.passphrase><!-- your passphrase for your gpg key goes here --></gpg.passphrase>
+ 
          <!-- or you use an agent -->
          <gpg.useagent>true</gpg.useagent>
+ 
        </properties>
      </profile>
    </profiles>


[jira] [Updated] (CASSANDRA-2220) It would be nice to be able to rollback to a specific schema version.

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2220:
--

Priority: Minor  (was: Major)
  Labels: ponies  (was: )

 It would be nice to be able to rollback to a specific schema version.
 -

 Key: CASSANDRA-2220
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2220
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
Priority: Minor
  Labels: ponies
 Fix For: 0.8




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-815) Generalize RackAwareStrategy to support writing to N of M datacenters

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-815.
--

   Resolution: Duplicate
Fix Version/s: (was: 0.8)

This feels like a ConsistencyLevel problem, not a Strategy one. (CASSANDRA-2338)

 Generalize RackAwareStrategy to support writing to N of M datacenters
 -

 Key: CASSANDRA-815
 URL: https://issues.apache.org/jira/browse/CASSANDRA-815
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Jonathan Ellis
Priority: Minor

 As requested on the mailing list, and at PyCon.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-1037) Improve load balancing to take into account load in terms of operations

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1037.
---

   Resolution: Won't Fix
Fix Version/s: (was: 0.8)

node-at-a-time loadbalance feels like a dead end.

 Improve load balancing to take into account load in terms of operations
 ---

 Key: CASSANDRA-1037
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1037
 Project: Cassandra
  Issue Type: Improvement
Reporter: Jeremy Hanna
Priority: Minor

 Currently in cassandra, the load balancing takes into account disk space.  
 When using an order-preserving partitioner, there can be hot spots in the 
 various ranges of tokens in terms of operations.  We would like to propose 
 improving the load balancing so that it takes the number of operations 
 into account.
 There are two places where this can be handled:
 1. when the cluster decides on which nodes need to be balanced out.
 2. how to balance an individual node - where to split
 For number 1, the number of operations that a node performed could be 
 factored in to how important it is to balance that node.
 For number 2, we are already using a midpoint in the node when trying to load 
 balance with respect to space.  We would propose adding a weight to the 
 midpoint to lean towards splitting so that the operational load could be 
 better handled, not just space.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2013) Add CL.TWO, CL.THREE; tweak CL documentation

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018404#comment-13018404
 ] 

Jonathan Ellis commented on CASSANDRA-2013:
---

Planning to supersede these in CASSANDRA-2338

 Add CL.TWO, CL.THREE; tweak CL documentation
 

 Key: CASSANDRA-2013
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2013
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Peter Schuller
Assignee: Peter Schuller
Priority: Minor
 Fix For: 0.7.4

 Attachments: 2013-assert.txt, 2013.txt


 Attaching draft patch to add CL.TWO and CL.THREE.
 Motivation for adding is that having to select between either ONE or QUORUM 
 is too narrow a choice for clusters with RF > 3. In such a case, it makes 
 particular sense to want to do writes at e.g. CL.TWO for durability purposes 
 even though you are not looking to get strong consistency with QUORUM. 
 CL.THREE is the same argument. TWO and THREE felt reasonable; there is no 
 objective reason why stopping at THREE is the obvious choice.
 Technically one would want to specify an arbitrary number, but that is a much 
 more significant change. 
 Two open questions:
 (1) I adjusted the documentation of ConsistencyLevel to be more consistent 
 and also to reflect what I believe to be reality (for example, as far as I 
 can tell QUORUM doesn't send requests to all nodes as claimed in the .thrift 
 file). I'm not terribly confident that I have not missed something though.
 (2) There is at least one unresolved issue, which is this assertion check 
 WriteResponseHandler:
 assert 1 <= blockFor && blockFor <= 2 * Table.open(table).getReplicationStrategy().getReplicationFactor()
     : String.format("invalid response count %d for replication factor %d",
                     blockFor, Table.open(table).getReplicationStrategy().getReplicationFactor());
 At THREE, this causes an assertion failure on keyspace with RF=1. I would, as 
 a user, expect UnavailableException. However I am uncertain as to what to do 
 about this assertion. I think this highlights how TWO/THREE are different 
 from previously existing CLs, in that they essentially hard-code replicate 
 counts rather than expressing them in terms that can by definition be served 
 by the cluster at any RF.
 Given that THREE (and not TWO, but that is only due to the 
 implementation detail that bootstrapping is involved) implies a replicate 
 count that is independent of the replication factor, there is essentially a 
 new failure mode. It is suddenly possible for a consistency level to be 
 fundamentally incompatible with the RF. My gut reaction is to want 
 UnavailableException still, and that the assertion check can essentially be 
 removed (other than the = 1 part).
 If a different failure mode is desired, presumably it would not be an 
 assertion failure (which should indicate a Cassandra bug).  Maybe 
 UnsatisfiableConsistencyLevel? I propose just adjusting the assertion (which 
 has no equivalent in ReadCallback btw); giving a friendlier error message in 
 case of a CL/RF mismatch would be good, but doesn't feel worth introducing 
 extra complexity to deal with it.
 'ant test' passes. I have tested w/ py_stress with a three-node cluster and 
 an RF=3 keyspace and with 1 and 2 nodes down, and get expected behavior 
 (available or unavailable as a function of nodes that are up).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

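A small worked example of the CL/RF mismatch being discussed (illustration only, not Cassandra code): QUORUM and ALL scale with RF, while TWO and THREE hard-code the replica count, so THREE with RF=1 can never be satisfied and should surface as UnavailableException rather than an assertion failure.

    def block_for(cl, rf):
        return {'ONE': 1, 'TWO': 2, 'THREE': 3, 'QUORUM': rf // 2 + 1, 'ALL': rf}[cl]

    def check(cl, rf, live_replicas):
        needed = block_for(cl, rf)
        if needed > rf:
            raise Exception('UnavailableException: %s needs %d replicas but RF is %d' % (cl, needed, rf))
        if live_replicas < needed:
            raise Exception('UnavailableException: only %d of %d required replicas alive' % (live_replicas, needed))

    check('TWO', 3, 2)      # fine: blockFor=2 <= RF=3 and two replicas are up
    check('THREE', 1, 1)    # raises: blockFor=3 can never be satisfied when RF=1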

[jira] [Resolved] (CASSANDRA-1159) For contrib modules that use Java, have a consistent build mechanism

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1159.
---

   Resolution: Duplicate
Fix Version/s: (was: 0.8)

See CASSANDRA-1805 for refactoring contrib.

 For contrib modules that use Java, have a consistent build mechanism
 

 Key: CASSANDRA-1159
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1159
 Project: Cassandra
  Issue Type: Improvement
  Components: Contrib
Reporter: Jeremy Hanna
Priority: Minor

 Contrib modules have a habit of periodically not working for some reason.  To 
 some extent that's expected - they are optional contrib modules.  However, I 
 think it's reasonable to at least have some way to perform a periodic sanity 
 check on them if we can.
 This improvement would make sure there is a consistent build mechanism - 
 build.xml - for each of the contrib modules that use Java.  That way, there 
 could be a hudson build perhaps nightly or weekly, that could inform the devs 
 if the contrib modules are not even compiling.  It's not like it would be a 
 huge priority to fix immediately, but they would at least be aware that 
 changes in the code/config have broken a contrib module.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-1283) Make the row cache pluggable

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-1283.
---

   Resolution: Duplicate
Fix Version/s: (was: 0.8)

Done in CASSANDRA-1969

 Make the row cache pluggable 
 -

 Key: CASSANDRA-1283
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1283
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Daniel Kluesing
Priority: Minor
 Attachments: trunk-pluggableCache.txt




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[Cassandra Wiki] Update of 首页 by JavenWang

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The 首页 page has been changed by JavenWang.
The comment on this change is: should use 首页 as the FrontPage rather than 
FrontPage_ZH.
http://wiki.apache.org/cassandra/%E9%A6%96%E9%A1%B5?action=diff&rev1=1&rev2=2

--

  ## Please edit system and help pages ONLY in the moinmaster wiki! For more
  ## information, please see MoinMaster:MoinPagesEditorGroup.
  ##master-page:FrontPage
- ##master-date:2004-11-21 15:15:01
  #format wiki
  #language zh
  #pragma section-numbers off
+ = Cassandra Wiki =
  
- = WikiLinkName Wiki =
+ Cassandra is a highly scalable, eventually consistent, distributed, structured key-value store. Cassandra combines [[http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf|Dynamo]]'s distribution technology with Google's [[http://labs.google.com/papers/bigtable-osdi06.pdf|BigTable]] data model. Cassandra offers Dynamo's [[http://www.allthingsdistributed.com/2008/12/eventually_consistent.html|eventual consistency]]; at the same time, its data model is richer than a typical key/value store's, providing a ColumnFamily-based data model similar to Google's Bigtable.
  
+ Cassandra was originally designed and developed at Facebook by Avinash Lakshman (one of the authors of Amazon's Dynamo) and Prashant Malik (a Facebook engineer); in 2008 Facebook contributed it to the open-source community. In many ways you can think of Cassandra as Dynamo 2.0, or as a marriage of Dynamo and !BigTable. Cassandra runs in production at Facebook, but is still under heavy development.
- What is this wiki about?
- 
- You might want to start with these links:
-  * [最新改动]: who changed what recently
-  * [维基沙盘演练]: a sandbox you can edit freely, to warm up
-  * [查找网页]: search and browse this site in several ways
-  * [语法参考]: a quick reference for wiki syntax
-  * [站点导航]: an overview of this site's content
  
  
- == How to Use This Site ==
+ == General Information ==
  
+  * [[http://cassandra.apache.org/|Cassandra official site]] downloads, issue tracking, mailing lists, and more
+  * [[ArticlesAndPresentations|Articles and presentations about Cassandra]]
+  * [[DataModel|Data model]]
+  * [[CassandraLimitations|Cassandra's limitations]]: when not to use Cassandra
- A wiki is a collaborative website: anyone can help build, edit, and maintain the site and share its content:
-  * Click '''GetText(Edit)''' in the header or footer of any page to edit that page.
-  * Creating a link could not be simpler: join words together with each word capitalized and no spaces between them (for example WikiSandBox), or use {{{[quoted words in brackets]}}}. Simplified Chinese links can use the latter, for example {{{[维基沙盘演练]}}}.
-  * The search box in each page's header can search page titles or run a full-text search.
-  * Newcomers can start with [帮助-新手入门]; for detailed help, see [帮助目录].
  
- For more information about [维基网], see [维基好坏说] and MoinMoin:WikiNature (in English). Also see MoinMoin:WikiWikiWebFaq (in English).
+ == User Documentation ==
  
- This wiki runs on [简体中文MoinMoin], the Simplified Chinese edition of MoinMoin.
+  * [[GettingStarted_ZH|Getting started with Cassandra]]
+  * [[RunningCassandra|Running Cassandra]]
+  * [[MemtableThresholds|Memtable thresholds]]
+  * [[StorageConfiguration|Storage configuration]]
+  * [[ArchitectureOverview|Architecture overview]]
+  * [[FAQ|Frequently asked questions]]
+  * [[API|Thrift API documentation]] (in progress)
+  * [[Operations|Operating Cassandra]]
+  * [[Embedding|Embedded Cassandra]]
+  * [[CassandraHardware_ZH|Cassandra hardware]]
  
- English version of this page: FrontPage
+ == Developer Documentation ==
  
+  * [[ArchitectureInternals_ZH|Architecture internals]]
+  * [[CLI Design|Client application (CLI) design]]
+  * [[HowToContribute|How to contribute to Cassandra]]
+  * [[HowToCommit|How patches are committed]]
+ 
+ == Mailing Lists ==
+  * Users: u...@cassandra.apache.org [[mailto:user-subscr...@cassandra.apache.org|(subscribe)]] [[http://mail-archives.apache.org/mod_mbox/incubator-cassandra-user/|(archives)]]
+  * Developers: d...@cassandra.apache.org [[mailto:dev-subscr...@cassandra.apache.org|(subscribe)]] [[http://mail-archives.apache.org/mod_mbox/incubator-cassandra-dev/|(archives)]]
+  * Committers: commits@cassandra.apache.org [[mailto:commits-subscr...@cassandra.apache.org|(subscribe)]]
+ 
+ == Related Information ==
+ 
+  * [[http://incubator.apache.org/thrift|The Thrift project, which Cassandra clients use to connect]]
+ 
+ == Google SoC 2010 Page ==
+  * [[GoogleSoc2010|Google SoC]]
+ 
+ This wiki is powered by MoinMoin.  With the exception of a few immutable 
pages, anyone
+ can edit it. Try SyntaxReference if you need help on wiki markup, and
+ FindPage or SiteNavigation to search for existing pages before creating a
+ new one. If you aren't sure where to begin, checkout RecentChanges to see
+ what others have been working on, or RandomPage if you are feeling lucky.
+ 
+ == Other Languages ==
+  * [[FrontPage|English]]
+  * [[FrontPage_JP|Japanese]]
+ 


[jira] [Commented] (CASSANDRA-2371) Removed/Dead Node keeps reappearing

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018412#comment-13018412
 ] 

Jonathan Ellis commented on CASSANDRA-2371:
---

bq. The problem with adding it, however, is that SS.handleStateLeft gets called 
every gossip round and runs through the hint removal process

But it only runs hint removal once per node, right?  Or is onChange not an 
accurate method name?

bq. One option may be to check if the node is locally persisted and if not, 
just ignore the message since we never knew about it anyway.

Locally persisted... with a token in SystemTable?  Ignore the ... hint message?

bq. Another is to just not remove hints when we see the LEFT state

We should issue a delete for the hints row, but we should not force a major 
compaction. The rule of thumb is: avoiding a performance hit on the cluster 
trumps immediate disk-space cleanup.
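
To make the suggestion concrete, a rough sketch of that shape (names and 
signatures approximate, not the committed change): tombstone the departed 
endpoint's hints row and let normal compaction reclaim the space, rather than 
forcing a major compaction.

{noformat}
import java.io.IOException;
import java.nio.ByteBuffer;

import org.apache.cassandra.db.RowMutation;
import org.apache.cassandra.db.filter.QueryPath;

public final class HintCleanupSketch
{
    // Hypothetical sketch: delete the hints row for a node that has left the ring,
    // but deliberately skip CompactionManager.submitMajorCompaction(...) so the
    // cluster does not pay an immediate performance hit for the disk-space cleanup.
    public static void deleteHintsFor(String systemTable, String hintsColumnFamily, ByteBuffer endpointToken)
        throws IOException
    {
        RowMutation rm = new RowMutation(systemTable, endpointToken);
        rm.delete(new QueryPath(hintsColumnFamily), System.currentTimeMillis());
        rm.apply();
        // the tombstone will be purged by regular (minor) compaction later
    }
}
{noformat}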

 Removed/Dead Node keeps reappearing
 ---

 Key: CASSANDRA-2371
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2371
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 0.7.3, 0.7.4
 Environment: Large Amazon EC2 instances. Ubuntu 10.04.2 
Reporter: techlabs
Assignee: Brandon Williams
Priority: Minor
 Fix For: 0.7.5

 Attachments: 2371.txt


 The removetoken option does not seem to work. The original node 10.240.50.63 
 comes back into the ring, even after the EC2 instance is no longer in 
 existence. Originally I tried to add a new node 10.214.103.224 with the same 
 token, but there were some complications with that. I have pasted below all 
 the INFO log entries found with greping the system log files.
 Seems to be a similar issue seen with 
 http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Ghost-node-showing-up-in-the-ring-td6198180.html
  
 INFO [GossipStage:1] 2011-03-16 00:54:31,590 StorageService.java (line 745) 
 Nodes /10.214.103.224 and /10.240.50.63 have the same token 
 957044156965139000.  /10.214.103.224 is the new owner
  INFO [GossipStage:1] 2011-03-16 17:26:51,083 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.214.103.224
  INFO [GossipStage:1] 2011-03-19 17:27:24,767 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.214.103.224
  INFO [GossipStage:1] 2011-03-19 17:29:30,191 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.214.103.224
  INFO [GossipStage:1] 2011-03-19 17:31:35,609 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.214.103.224
  INFO [GossipStage:1] 2011-03-19 17:33:39,440 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.214.103.224
  INFO [GossipStage:1] 2011-03-23 17:22:55,520 StorageService.java (line 865) 
 Removing token 957044156965139000 for /10.240.50.63
  INFO [GossipStage:1] 2011-03-10 03:52:37,299 Gossiper.java (line 608) Node 
 /10.240.50.63 is now part of the cluster
  INFO [GossipStage:1] 2011-03-10 03:52:37,545 Gossiper.java (line 600) 
 InetAddress /10.240.50.63 is now UP
  INFO [HintedHandoff:1] 2011-03-10 03:53:36,168 HintedHandOffManager.java 
 (line 304) Started hinted handoff for endpoint /10.240.50.63
  INFO [HintedHandoff:1] 2011-03-10 03:53:36,169 HintedHandOffManager.java 
 (line 360) Finished hinted handoff of 0 rows to endpoint /10.240.50.63
  INFO [GossipStage:1] 2011-03-15 23:23:43,770 Gossiper.java (line 623) Node 
 /10.240.50.63 has restarted, now UP again
  INFO [GossipStage:1] 2011-03-15 23:23:43,771 StorageService.java (line 726) 
 Node /10.240.50.63 state jump to normal
  INFO [HintedHandoff:1] 2011-03-15 23:28:48,957 HintedHandOffManager.java 
 (line 304) Started hinted handoff for endpoint /10.240.50.63
  INFO [HintedHandoff:1] 2011-03-15 23:28:48,958 HintedHandOffManager.java 
 (line 360) Finished hinted handoff of 0 rows to endpoint /10.240.50.63
  INFO [ScheduledTasks:1] 2011-03-15 23:37:25,071 Gossiper.java (line 226) 
 InetAddress /10.240.50.63 is now dead.
  INFO [GossipStage:1] 2011-03-16 00:54:31,590 StorageService.java (line 745) 
 Nodes /10.214.103.224 and /10.240.50.63 have the same token 
 957044156965139000.  /10.214.103.224 is the new owner
  WARN [GossipStage:1] 2011-03-16 00:54:31,590 TokenMetadata.java (line 115) 
 Token 957044156965139000 changing ownership from 
 /10.240.50.63 to /10.214.103.224
  INFO [GossipStage:1] 2011-03-18 23:37:09,158 Gossiper.java (line 610) Node 
 /10.240.50.63 is now part of the cluster
  INFO [GossipStage:1] 2011-03-21 23:37:10,421 Gossiper.java (line 610) Node 
 /10.240.50.63 is now part of the cluster
  

[Cassandra Wiki] Update of FrontPage by JavenWang

2011-04-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The FrontPage page has been changed by JavenWang.
The comment on this change is: should use 首页 as the FrontPage rather than 
FrontPage_ZH.
http://wiki.apache.org/cassandra/FrontPage?action=diff&rev1=59&rev2=60

--

  This wiki is powered by MoinMoin.  With the exception of a few immutable 
pages, anyone can edit it. Try SyntaxReference if you need help on wiki markup, 
and FindPage or SiteNavigation to search for existing pages before creating a 
new one. If you aren't sure where to begin, checkout RecentChanges to see what 
others have been working on, or RandomPage if you are feeling lucky.
  
  == Other Languages ==
-  * [[FrontPage_ZH|SimpleChinese 简体中文]]
+  * [[首页|SimpleChinese 简体中文]]
   * [[FrontPage_JP|Japanese 日本語]]
  


svn commit: r1091113 - in /cassandra/branches/cassandra-0.8: ./ conf/ contrib/ interface/thrift/gen-java/org/apache/cassandra/thrift/ src/java/org/apache/cassandra/config/ src/java/org/apache/cassandr

2011-04-11 Thread slebresne
Author: slebresne
Date: Mon Apr 11 16:28:38 2011
New Revision: 1091113

URL: http://svn.apache.org/viewvc?rev=1091113view=rev
Log:
Merge CASSANDRA-2156 from trunk

Modified:
cassandra/branches/cassandra-0.8/   (props changed)
cassandra/branches/cassandra-0.8/CHANGES.txt
cassandra/branches/cassandra-0.8/conf/cassandra.yaml
cassandra/branches/cassandra-0.8/contrib/   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/InvalidRequestException.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/NotFoundException.java
   (props changed)

cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/SuperColumn.java
   (props changed)

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/config/Config.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/config/DatabaseDescriptor.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/CompactionManager.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/io/CompactionIterator.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/tools/NodeCmd.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/tools/NodeProbe.java

Propchange: cassandra/branches/cassandra-0.8/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 16:28:38 2011
@@ -2,7 +2,7 @@
 /cassandra/branches/cassandra-0.7:1026516-1091087
 /cassandra/branches/cassandra-0.7.0:1053690-1055654
 /cassandra/tags/cassandra-0.7.0-rc3:1051699-1053689
-/cassandra/trunk:1090978
+/cassandra/trunk:1090978-1090979
 /incubator/cassandra/branches/cassandra-0.3:774578-796573
 /incubator/cassandra/branches/cassandra-0.4:810145-834239,834349-834350
 /incubator/cassandra/branches/cassandra-0.5:72-915439

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1091113r1=1091112r2=1091113view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Mon Apr 11 16:28:38 2011
@@ -20,7 +20,7 @@
  * give snapshots the same name on each node (CASSANDRA-1791)
  * add key type information and alias (CASSANDRA-2311, 2396)
  * multithreaded compaction (CASSANDRA-2191)
-
+ * compaction throttling (CASSANDRA-2156)
 
 0.7.5
  * Avoid seeking when sstable2json exports the entire file (CASSANDRA-2318)

Modified: cassandra/branches/cassandra-0.8/conf/cassandra.yaml
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/conf/cassandra.yaml?rev=1091113r1=1091112r2=1091113view=diff
==
--- cassandra/branches/cassandra-0.8/conf/cassandra.yaml (original)
+++ cassandra/branches/cassandra-0.8/conf/cassandra.yaml Mon Apr 11 16:28:38 
2011
@@ -250,9 +250,17 @@ column_index_size_in_kb: 64
 in_memory_compaction_limit_in_mb: 64
 
 # Enables multiple compactions to execute at once. This is highly recommended
-# for preserving read performance in a mixed read/write workload.
+# for preserving read performance in a mixed read/write workload as this
+# avoids sstables from accumulating during long running compactions.
 compaction_multithreading: true
 
+# Throttles compaction to the given total throughput across the entire
+# system. The faster you insert data, the faster you need to compact in
+# order to keep the sstable count down, but in general, setting this to
+# 16 to 32 times the rate you are inserting data is more than sufficient.
+# Setting this to 0 disables throttling.
+compaction_throughput_mb_per_sec: 16
+
 # Track cached row keys during compaction, and re-cache their new
 # positions in the compacted sstable.  Disable if you use really large
 # key caches.
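
For readers curious how a throughput cap like this can be enforced, here is a 
simplified sketch of the general technique (illustrative only, not the 
committed CompactionIterator change): periodically compare bytes written 
against the configured rate and sleep off any surplus.

{noformat}
public final class ThrottleSketch
{
    private final long targetBytesPerSec;
    private long lastCheckNanos = System.nanoTime();

    public ThrottleSketch(int throughputMbPerSec)
    {
        // mirrors compaction_throughput_mb_per_sec; 0 (or less) disables throttling
        this.targetBytesPerSec = throughputMbPerSec * 1024L * 1024L;
    }

    // Call after each chunk of compacted output; sleeps just long enough to keep
    // the long-run rate at or below the configured throughput.
    public void throttle(long bytesWritten) throws InterruptedException
    {
        if (targetBytesPerSec <= 0)
            return;

        long elapsedNanos = System.nanoTime() - lastCheckNanos;
        long expectedNanos = bytesWritten * 1000000000L / targetBytesPerSec;
        if (expectedNanos > elapsedNanos)
            Thread.sleep((expectedNanos - elapsedNanos) / 1000000L);
        lastCheckNanos = System.nanoTime();
    }
}
{noformat}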

Propchange: cassandra/branches/cassandra-0.8/contrib/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 16:28:38 2011
@@ -2,7 +2,7 @@
 /cassandra/branches/cassandra-0.7/contrib:1026516-1091087
 /cassandra/branches/cassandra-0.7.0/contrib:1053690-1055654
 /cassandra/tags/cassandra-0.7.0-rc3/contrib:1051699-1053689
-/cassandra/trunk/contrib:1090978
+/cassandra/trunk/contrib:1090978-1090979
 

[jira] [Updated] (CASSANDRA-2448) Remove loadbalance command

2011-04-11 Thread Nick Bailey (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Bailey updated CASSANDRA-2448:
---

Attachment: 0001-Remove-loadbalance-command-from-nodetool.patch

 Remove loadbalance command
 --

 Key: CASSANDRA-2448
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2448
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 0.8
Reporter: Nick Bailey
Assignee: Nick Bailey
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Remove-loadbalance-command-from-nodetool.patch


 With the update to how the move command works, the loadbalance command is 
 even less useful than it was previously.  The loadbalance command now 
 calculates the token it is going to move to before it leaves which means it 
 isn't considering the load it is giving away. Given that, I think we should 
 just remove the loadbalance command entirely. Anyone who wants to do an old 
 style loadbalance can just do decommission then bootstrap.
 This is a minor change, and honestly I think it might count as a 'bug' so I 
 think we should squeeze it into 0.8, post-freeze. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


svn commit: r1091139 - in /cassandra/branches/cassandra-0.8: ./ src/java/org/apache/cassandra/service/ src/java/org/apache/cassandra/tools/

2011-04-11 Thread jbellis
Author: jbellis
Date: Mon Apr 11 17:35:42 2011
New Revision: 1091139

URL: http://svn.apache.org/viewvc?rev=1091139view=rev
Log:
r/m nodetool loadbalance
patch by Nick Bailey; reviewed by jbellis for CASSANDRA-2448

Modified:
cassandra/branches/cassandra-0.8/CHANGES.txt
cassandra/branches/cassandra-0.8/NEWS.txt

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/tools/NodeCmd.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/tools/NodeProbe.java

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1091139r1=1091138r2=1091139view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Mon Apr 11 17:35:42 2011
@@ -19,9 +19,11 @@
  * push replication_factor into strategy_options (CASSANDRA-1263)
  * give snapshots the same name on each node (CASSANDRA-1791)
  * add key type information and alias (CASSANDRA-2311, 2396)
+ * remove nodetool loadbalance (CASSANDRA-2448)
  * multithreaded compaction (CASSANDRA-2191)
  * compaction throttling (CASSANDRA-2156)
 
+
 0.7.5
  * Avoid seeking when sstable2json exports the entire file (CASSANDRA-2318)
  * fix tombstone handling in repair and sstable2json (CASSANDRA-2279)

Modified: cassandra/branches/cassandra-0.8/NEWS.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/NEWS.txt?rev=1091139r1=1091138r2=1091139view=diff
==
--- cassandra/branches/cassandra-0.8/NEWS.txt (original)
+++ cassandra/branches/cassandra-0.8/NEWS.txt Mon Apr 11 17:35:42 2011
@@ -1,5 +1,5 @@
-Whatever
-
+0.8
+===
 
 Upgrading
 -
@@ -10,6 +10,9 @@ Upgrading
Upgrading from version 0.7.1 or later can be done with a rolling 
restart,
one node at a time.  You do not need to bring down the whole cluster.
 
+The loadbalance command has been removed from nodetool.  For similar
+behavior, decommission then rebootstrap with empty initial_token.
+
 Other
 -
 In the past, sstable2json would write column names and values as hex

Modified: 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java?rev=1091139r1=1091138r2=1091139view=diff
==
--- 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java
 Mon Apr 11 17:35:42 2011
@@ -815,10 +815,9 @@ public class StorageService implements I
 }
 
 /**
- * Handle node leaving the ring. This can be either because of 
decommission or loadbalance
+ * Handle node leaving the ring. This will happen when a node is 
decommissioned
  *
- * @param endpoint If reason for leaving is decommission or loadbalance
- * endpoint is the leaving node.
+ * @param endpoint If reason for leaving is decommission, endpoint is the 
leaving node.
  * @param pieces STATE_LEFT,token
  */
 private void handleStateLeft(InetAddress endpoint, String[] pieces)
@@ -1800,15 +1799,6 @@ public class StorageService implements I
 }
 
 /**
- * Generates balanced token and calls load balance operation to move 
current node to that token
- * @throws IOException on any I/O operation error
- */
-public void loadBalance() throws IOException
-{
-move(BootStrapper.getBalancedToken(tokenMetadata_, 
StorageLoadBalancer.instance.getLoadInfo()));
-}
-
-/**
  * move the node to new token or find a new token to boot to according to 
load
  *
  * @param newToken new token to boot to, or if null, find balanced token 
to boot to

Modified: 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java?rev=1091139r1=1091138r2=1091139view=diff
==
--- 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java
 Mon Apr 11 17:35:42 2011
@@ -198,13 +198,6 @@ public interface StorageServiceMBean
 public void move(String newToken) throws IOException, InterruptedException;
 

[jira] [Commented] (CASSANDRA-2167) Add a deserialize(ByteBuffer) method to ColumnFamilySerializer

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018453#comment-13018453
 ] 

Jonathan Ellis commented on CASSANDRA-2167:
---

Revisiting this, I don't think it's actually a valid optimization -- if we free 
the backing Memory object before shipping the read buffers off over the 
network, we'll segfault. No?

 Add a deserialize(ByteBuffer) method to ColumnFamilySerializer
 --

 Key: CASSANDRA-2167
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2167
 Project: Cassandra
  Issue Type: Sub-task
  Components: Core
Affects Versions: 0.8
Reporter: Vijay
Assignee: Vijay
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Reduce-Byte-Copy-Streams.txt, 
 0002-Reduce-Byte-Copy-BBUtil.txt, 0003-Reduce-BB-Copy.txt


 by adding deserialize bytebuffer in the columnFamilySerializer we might be 
 able to avoid copying of the bytes[] while deserializing the CF.
 This can be done using
 ByteBuffer buff = ByteBuffer.wrap(bb.array(), bb.position(), length);
 This is an improvement on 
 https://issues.apache.org/jira/browse/CASSANDRA-1969; Plz see jonathan's 
 comment.
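
As a self-contained illustration of why the wrap avoids a copy (plain JDK code, 
unrelated to the attached patches): the wrapped buffer shares the backing 
byte[], while an explicit copy does not.

{noformat}
import java.nio.ByteBuffer;

public final class WrapVsCopySketch
{
    public static void main(String[] args)
    {
        ByteBuffer original = ByteBuffer.wrap("hello".getBytes());

        // Shares the backing byte[]: no copy, so changes are visible through both views.
        ByteBuffer wrapped = ByteBuffer.wrap(original.array(), original.position(), original.remaining());

        // An explicit copy allocates and fills a fresh buffer.
        ByteBuffer copied = ByteBuffer.allocate(original.remaining());
        copied.put(original.duplicate());
        copied.flip();

        original.array()[0] = (byte) 'H';
        System.out.println((char) wrapped.get(0)); // H - sees the change
        System.out.println((char) copied.get(0));  // h - unaffected
    }
}
{noformat}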

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2342) Add range slice support for counters

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018457#comment-13018457
 ] 

Jonathan Ellis commented on CASSANDRA-2342:
---

+1

 Add range slice support for counters
 

 Key: CASSANDRA-2342
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2342
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.8
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Range-slice-support-for-counters.patch


 There is no equivalent for get_range_slice() for counters right now.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


svn commit: r1091148 - in /cassandra/branches/cassandra-0.8: CHANGES.txt src/java/org/apache/cassandra/cli/CliClient.java src/java/org/apache/cassandra/thrift/CassandraServer.java test/system/test_thr

2011-04-11 Thread slebresne
Author: slebresne
Date: Mon Apr 11 17:52:13 2011
New Revision: 1091148

URL: http://svn.apache.org/viewvc?rev=1091148view=rev
Log:
Fix range slice for counters
patch by slebresne; reviewed by jbellis for CASSANDRA-2342

Modified:
cassandra/branches/cassandra-0.8/CHANGES.txt

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/cli/CliClient.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java
cassandra/branches/cassandra-0.8/test/system/test_thrift_server.py

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1091148r1=1091147r2=1091148view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Mon Apr 11 17:52:13 2011
@@ -1,7 +1,7 @@
 0.8-dev
  * remove Avro RPC support (CASSANDRA-926)
  * adds support for columns that act as incr/decr counters 
-   (CASSANDRA-1072, 1937, 1944, 1936, 2101, 2093, 2288, 2105, 2384, 2236)
+   (CASSANDRA-1072, 1937, 1944, 1936, 2101, 2093, 2288, 2105, 2384, 2236, 2342)
  * CQL (CASSANDRA-1703, 1704, 1705, 1706, 1707, 1708, 1710, 1711, 1940, 
2124, 2302, 2277)
  * avoid double RowMutation serialization on write path (CASSANDRA-1800)

Modified: 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/cli/CliClient.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/cli/CliClient.java?rev=1091148r1=1091147r2=1091148view=diff
==
--- 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/cli/CliClient.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/cli/CliClient.java
 Mon Apr 11 17:52:13 2011
@@ -2206,6 +2206,24 @@ public class CliClient extends CliUserHe
 
 sessionState.out.println(")");
 }
+else if (columnOrSuperColumn.counter_column != null)
+{
+CounterColumn col = columnOrSuperColumn.counter_column;
+
+sessionState.out.printf("=> (counter=%s, value=%s)%n", formatColumnName(keySpace, columnFamilyName, col.name), col.value);
+}
+else if (columnOrSuperColumn.counter_super_column != null)
+{
+CounterSuperColumn superCol = columnOrSuperColumn.counter_super_column;
+sessionState.out.printf("=> (super_column=%s,", formatColumnName(keySpace, columnFamilyName, superCol.name));
+
+for (CounterColumn col : superCol.columns)
+{
+sessionState.out.printf("%n     (counter=%s, value=%s)", formatSubcolumnName(keySpace, columnFamilyName, col.name), col.value);
+}
+
+sessionState.out.println(")");
+}
 }
 }
 

Modified: 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java?rev=1091148r1=1091147r2=1091148view=diff
==
--- 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/thrift/CassandraServer.java
 Mon Apr 11 17:52:13 2011
@@ -575,7 +575,7 @@ public class CassandraServer implements 
 String keyspace = state().getKeyspace();
 state().hasColumnFamilyAccess(column_parent.column_family, 
Permission.READ);
 
-CFMetaData metadata = ThriftValidation.validateColumnFamily(keyspace, 
column_parent.column_family, false);
+CFMetaData metadata = ThriftValidation.validateColumnFamily(keyspace, 
column_parent.column_family);
 ThriftValidation.validateColumnParent(metadata, column_parent);
 ThriftValidation.validatePredicate(metadata, column_parent, predicate);
 ThriftValidation.validateKeyRange(range);

Modified: cassandra/branches/cassandra-0.8/test/system/test_thrift_server.py
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/test/system/test_thrift_server.py?rev=1091148r1=1091147r2=1091148view=diff
==
--- cassandra/branches/cassandra-0.8/test/system/test_thrift_server.py 
(original)
+++ cassandra/branches/cassandra-0.8/test/system/test_thrift_server.py Mon Apr 
11 17:52:13 2011
@@ -105,6 +105,12 @@ def _insert_range():
 client.insert('key1', ColumnParent('Standard1'), Column('c3', 'value3', 
0), ConsistencyLevel.ONE)
 time.sleep(0.1)
 
+def _insert_counter_range():
+client.add('key1', 

svn commit: r1091151 - in /cassandra/trunk: ./ contrib/ interface/thrift/gen-java/org/apache/cassandra/thrift/ src/java/org/apache/cassandra/cli/ src/java/org/apache/cassandra/thrift/ test/system/

2011-04-11 Thread slebresne
Author: slebresne
Date: Mon Apr 11 18:00:23 2011
New Revision: 1091151

URL: http://svn.apache.org/viewvc?rev=1091151view=rev
Log:
Merge CASSANDRA-2342 from 0.8.

Modified:
cassandra/trunk/   (props changed)
cassandra/trunk/CHANGES.txt
cassandra/trunk/contrib/   (props changed)

cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java
   (props changed)

cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/Column.java
   (props changed)

cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/InvalidRequestException.java
   (props changed)

cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/NotFoundException.java
   (props changed)

cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/SuperColumn.java
   (props changed)
cassandra/trunk/src/java/org/apache/cassandra/cli/CliClient.java
cassandra/trunk/src/java/org/apache/cassandra/thrift/CassandraServer.java
cassandra/trunk/test/system/test_thrift_server.py

Propchange: cassandra/trunk/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 18:00:23 2011
@@ -1,6 +1,7 @@
 
/cassandra/branches/cassandra-0.6:922689-1052356,1052358-1053452,1053454,1053456-1081914,1083000
 /cassandra/branches/cassandra-0.7:1026516-1090647
 /cassandra/branches/cassandra-0.7.0:1053690-1055654
+/cassandra/branches/cassandra-0.8:1091148
 /cassandra/tags/cassandra-0.7.0-rc3:1051699-1053689
 /incubator/cassandra/branches/cassandra-0.3:774578-796573
 /incubator/cassandra/branches/cassandra-0.4:810145-834239,834349-834350

Modified: cassandra/trunk/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/cassandra/trunk/CHANGES.txt?rev=1091151r1=1091150r2=1091151view=diff
==
--- cassandra/trunk/CHANGES.txt (original)
+++ cassandra/trunk/CHANGES.txt Mon Apr 11 18:00:23 2011
@@ -1,16 +1,16 @@
 0.8-dev
  * remove Avro RPC support (CASSANDRA-926)
- * avoid double RowMutation serialization on write path (CASSANDRA-1800)
  * adds support for columns that act as incr/decr counters 
-   (CASSANDRA-1072, 1937, 1944, 1936, 2101, 2093, 2288, 2105, 2384)
+   (CASSANDRA-1072, 1937, 1944, 1936, 2101, 2093, 2288, 2105, 2384, 2236, 2342)
+ * CQL (CASSANDRA-1703, 1704, 1705, 1706, 1707, 1708, 1710, 1711, 1940, 
+   2124, 2302, 2277)
+ * avoid double RowMutation serialization on write path (CASSANDRA-1800)
  * make NetworkTopologyStrategy the default (CASSANDRA-1960)
  * configurable internode encryption (CASSANDRA-1567)
  * human readable column names in sstable2json output (CASSANDRA-1933)
  * change default JMX port to 7199 (CASSANDRA-2027)
  * backwards compatible internal messaging (CASSANDRA-1015)
  * check for null encryption in MessagingService (CASSANDRA-2152)
- * Fix for Cli to support updating replicate_on_write (CASSANDRA-2236)
- * JDBC driver for CQL (CASSANDRA-2124, 2302, 2277)
  * atomic switch of memtables and sstables (CASSANDRA-2284)
  * add pluggable SeedProvider (CASSANDRA-1669)
  * Fix clustertool to not throw exception when calling get_endpoints 
(CASSANDRA-2437)

Propchange: cassandra/trunk/contrib/
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 18:00:23 2011
@@ -1,6 +1,7 @@
 
/cassandra/branches/cassandra-0.6/contrib:922689-1052356,1052358-1053452,1053454,1053456-1068009
 /cassandra/branches/cassandra-0.7/contrib:1026516-1090647
 /cassandra/branches/cassandra-0.7.0/contrib:1053690-1055654
+/cassandra/branches/cassandra-0.8/contrib:1091148
 /cassandra/tags/cassandra-0.7.0-rc3/contrib:1051699-1053689
 /incubator/cassandra/branches/cassandra-0.3/contrib:774578-796573
 /incubator/cassandra/branches/cassandra-0.4/contrib:810145-834239,834349-834350

Propchange: 
cassandra/trunk/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java
--
--- svn:mergeinfo (original)
+++ svn:mergeinfo Mon Apr 11 18:00:23 2011
@@ -1,6 +1,7 @@
 
/cassandra/branches/cassandra-0.6/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:922689-1052356,1052358-1053452,1053454,1053456-1081914,1083000
 
/cassandra/branches/cassandra-0.7/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1026516-1090647
 
/cassandra/branches/cassandra-0.7.0/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1053690-1055654
+/cassandra/branches/cassandra-0.8/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1091148
 
/cassandra/tags/cassandra-0.7.0-rc3/interface/thrift/gen-java/org/apache/cassandra/thrift/Cassandra.java:1051699-1053689
 
/incubator/cassandra/branches/cassandra-0.3/interface/gen-java/org/apache/cassandra/service/Cassandra.java:774578-796573
 

buildbot failure in ASF Buildbot on cassandra-trunk

2011-04-11 Thread buildbot
The Buildbot has detected a new failure on builder cassandra-trunk while 
building ASF Buildbot.
Full details are available at:
 http://ci.apache.org/builders/cassandra-trunk/builds/1263

Buildbot URL: http://ci.apache.org/

Buildslave for this Build: isis_ubuntu

Build Reason: scheduler
Build Source Stamp: [branch cassandra/trunk] 1091151
Blamelist: slebresne

BUILD FAILED: failed compile

sincerely,
 -The Buildbot



[jira] [Created] (CASSANDRA-2449) Deprecate or modify per-cf memtable sizes in favor of the global threshold

2011-04-11 Thread Stu Hood (JIRA)
Deprecate or modify per-cf memtable sizes in favor of the global threshold
--

 Key: CASSANDRA-2449
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2449
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
 Fix For: 0.8


The new memtable_total_space_in_mb setting is an excellent way to cap memory 
usage for memtables, and one could argue that it should replace the per-cf 
memtable sizes entirely. On the other hand, people may still want a knob to 
tune to flush certain cfs less frequently.

I think a best of both worlds approach might be to deprecate the 
memtable_(throughput|operations) settings, and replace them with a preference 
value, which controls the relative memory usage of one CF versus another (all 
CFs at 1 would mean equal preference). For backwards compatibility, we could 
continue to read from the _throughput value and treat it as the preference 
value, while logging a warning.
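
As a back-of-the-envelope illustration of how such a preference value could 
translate into per-CF shares of memtable_total_space_in_mb (purely 
hypothetical; no such setting exists today):

{noformat}
import java.util.LinkedHashMap;
import java.util.Map;

public final class PreferenceSketch
{
    // Hypothetical: divide the global memtable space among column families in
    // proportion to their preference values (all CFs at 1 means equal shares).
    public static Map<String, Double> sharesInMb(Map<String, Double> preferenceByCf, double totalSpaceInMb)
    {
        double sum = 0;
        for (double preference : preferenceByCf.values())
            sum += preference;

        Map<String, Double> shares = new LinkedHashMap<String, Double>();
        for (Map.Entry<String, Double> entry : preferenceByCf.entrySet())
            shares.put(entry.getKey(), totalSpaceInMb * entry.getValue() / sum);
        return shares;
    }
}
{noformat}

For example, two CFs with preferences 3 and 1 under a 1024 MB global cap would 
get roughly 768 MB and 256 MB respectively.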

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2448) Remove loadbalance command

2011-04-11 Thread Nick Bailey (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018514#comment-13018514
 ] 

Nick Bailey commented on CASSANDRA-2448:


this should also be applied to trunk

 Remove loadbalance command
 --

 Key: CASSANDRA-2448
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2448
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 0.8
Reporter: Nick Bailey
Assignee: Nick Bailey
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Remove-loadbalance-command-from-nodetool.patch


 With the update to how the move command works, the loadbalance command is 
 even less useful than it was previously.  The loadbalance command now 
 calculates the token it is going to move to before it leaves which means it 
 isn't considering the load it is giving away. Given that, I think we should 
 just remove the loadbalance command entirely. Anyone who wants to do an old 
 style loadbalance can just do decommission then bootstrap.
 This is a minor change, and honestly I think it might count as a 'bug' so I 
 think we should squeeze it into 0.8, post-freeze. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2252) off-heap memtables

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2252:


Attachment: (was: merged-2252.tgz)

 off-heap memtables
 --

 Key: CASSANDRA-2252
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2252
 Project: Cassandra
  Issue Type: Improvement
Reporter: Jonathan Ellis
Assignee: Jonathan Ellis
 Fix For: 1.0

 Attachments: 0001-add-MemtableAllocator.txt, 
 0002-add-off-heap-MemtableAllocator-support.txt

   Original Estimate: 0.4h
  Remaining Estimate: 0.4h

 The memtable design practically actively fights Java's GC design.  Todd 
 Lipcon gave a good explanation over on HBASE-3455.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2252) off-heap memtables

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2252:


Attachment: (was: 2252-alternate-v2.tgz)

 off-heap memtables
 --

 Key: CASSANDRA-2252
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2252
 Project: Cassandra
  Issue Type: Improvement
Reporter: Jonathan Ellis
Assignee: Jonathan Ellis
 Fix For: 1.0

 Attachments: 0001-add-MemtableAllocator.txt, 
 0002-add-off-heap-MemtableAllocator-support.txt

   Original Estimate: 0.4h
  Remaining Estimate: 0.4h

 The memtable design practically actively fights Java's GC design.  Todd 
 Lipcon gave a good explanation over on HBASE-3455.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2252) off-heap memtables

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2252:


Attachment: merged-2252.tgz

Rebased for trunk. I took the easy way out and just included the Allocator's 
allocation count in the memtable live size: this will overcount, but it 
accounts for fragmentation due to updates. I've started a patch for JAMM to add 
a mode that will allow us to ignore shared buffers: see 
[jamm/buffer-behavior|https://github.com/stuhood/jamm/commits/buffer-behavior].

 off-heap memtables
 --

 Key: CASSANDRA-2252
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2252
 Project: Cassandra
  Issue Type: Improvement
Reporter: Jonathan Ellis
Assignee: Jonathan Ellis
 Fix For: 1.0

 Attachments: 0001-add-MemtableAllocator.txt, 
 0002-add-off-heap-MemtableAllocator-support.txt, merged-2252.tgz

   Original Estimate: 0.4h
  Remaining Estimate: 0.4h

 The memtable design practically actively fights Java's GC design.  Todd 
 Lipcon gave a good explanation over on HBASE-3455.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


svn commit: r1091176 - /cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java

2011-04-11 Thread jbellis
Author: jbellis
Date: Mon Apr 11 19:27:03 2011
New Revision: 1091176

URL: http://svn.apache.org/viewvc?rev=1091176view=rev
Log:
update stress.java for replication_factor-to-strategy_options change

Modified:

cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java

Modified: 
cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java
URL: 
http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java?rev=1091176r1=1091175r2=1091176view=diff
==
--- 
cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java
 (original)
+++ 
cassandra/branches/cassandra-0.8/tools/stress/src/org/apache/cassandra/stress/Session.java
 Mon Apr 11 19:27:03 2011
@@ -414,7 +414,9 @@ public class Session
 
 keyspace.setName(Keyspace1);
 keyspace.setStrategy_class(replicationStrategy);
-keyspace.setReplication_factor(replicationFactor);
+Map<String, String> options = new HashMap<String, String>();
+options.put("replication_factor", String.valueOf(replicationFactor));
+keyspace.setStrategy_options(options);
 
 if (!replicationStrategyOptions.isEmpty())
 {




[jira] [Updated] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2441:
--

Attachment: 2441.txt

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Attachments: 2441.txt


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did is cloned git://git.apache.org/cassandra.git and did git reset 
 each commit with `ant clean && ant && ./bin/cassandra -f` until I got 
 cassandra started

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2441:
--

Attachment: jamm-0.2.1.jar

attached.  (requires jamm 0.2.1 in lib/, also attached.)

(most of the patch is svn deleting the 0.2 jar.  silly svn.)

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Attachments: 2441.txt, jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did is cloned git://git.apache.org/cassandra.git and did git reset 
 each commit with `ant clean && ant && ./bin/cassandra -f` until I got 
 cassandra started

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-2450) incompatibility w/ 0.7 schemas

2011-04-11 Thread Jonathan Ellis (JIRA)
incompatibility w/ 0.7 schemas
--

 Key: CASSANDRA-2450
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2450
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
Reporter: Jonathan Ellis
Assignee: Jon Hermes
 Fix For: 0.8


If you create a SimpleStrategy keyspace under 0.7, then switch to 0.8, you will 
get this error on startup:

{noformat}
ERROR 14:31:41,725 Exception encountered during startup.
java.lang.RuntimeException: org.apache.cassandra.config.ConfigurationException: 
SimpleStrategy requires a replication_factor strategy option.
at org.apache.cassandra.db.Table.init(Table.java:277)
at org.apache.cassandra.db.Table.open(Table.java:109)
at 
org.apache.cassandra.service.AbstractCassandraDaemon.setup(AbstractCassandraDaemon.java:160)
at 
org.apache.cassandra.service.AbstractCassandraDaemon.activate(AbstractCassandraDaemon.java:314)
at 
org.apache.cassandra.thrift.CassandraDaemon.main(CassandraDaemon.java:80)
Caused by: org.apache.cassandra.config.ConfigurationException: SimpleStrategy 
requires a replication_factor strategy option.
at 
org.apache.cassandra.locator.SimpleStrategy.validateOptions(SimpleStrategy.java:75)
at 
org.apache.cassandra.locator.AbstractReplicationStrategy.createReplicationStrategy(AbstractReplicationStrategy.java:262)
at 
org.apache.cassandra.db.Table.createReplicationStrategy(Table.java:327)
at org.apache.cassandra.db.Table.init(Table.java:273)
... 4 more
{noformat}
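
One possible shape for a backwards-compatible fix (a sketch with assumed names; 
not necessarily the patch that will land on this ticket): when loading a 
0.7-era schema, fall back to the deprecated per-keyspace replication_factor if 
strategy_options does not define one.

{noformat}
import java.util.HashMap;
import java.util.Map;

public final class LegacyReplicationFactorSketch
{
    // Hypothetical sketch: fill in strategy_options.replication_factor from the
    // keyspace's legacy replication_factor field so SimpleStrategy validation passes.
    public static Map<String, String> withLegacyReplicationFactor(Map<String, String> strategyOptions,
                                                                  Integer legacyReplicationFactor)
    {
        Map<String, String> options = strategyOptions == null
                                    ? new HashMap<String, String>()
                                    : new HashMap<String, String>(strategyOptions);
        if (!options.containsKey("replication_factor") && legacyReplicationFactor != null)
            options.put("replication_factor", legacyReplicationFactor.toString());
        return options;
    }
}
{noformat}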

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2449) Deprecate or modify per-cf memtable sizes in favor of the global threshold

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018528#comment-13018528
 ] 

Jonathan Ellis commented on CASSANDRA-2449:
---

bq. replace them with a preference value, which controls the relative memory 
usage of one CF versus another

I'm not a fan of adding additional complexity here.  At best, you'll get 
substantially the current behavior; at worst (when you tell it to prefer 
keeping the largest CF in memory), you'll create a flush storm of smaller CFs.

 Deprecate or modify per-cf memtable sizes in favor of the global threshold
 --

 Key: CASSANDRA-2449
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2449
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Stu Hood
 Fix For: 0.8


 The new memtable_total_space_in_mb setting is an excellent way to cap memory 
 usage for memtables, and one could argue that it should replace the per-cf 
 memtable sizes entirely. On the other hand, people may still want a knob to 
 tune to flush certain cfs less frequently.
 I think a best of both worlds approach might be to deprecate the 
 memtable_(throughput|operations) settings, and replace them with a preference 
 value, which controls the relative memory usage of one CF versus another (all 
 CFs at 1 would mean equal preference). For backwards compatibility, we could 
 continue to read from the _throughput value and treat it as the preference 
 value, while logging a warning.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2448) Remove loadbalance command

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018530#comment-13018530
 ] 

Jonathan Ellis commented on CASSANDRA-2448:
---

merging to trunk is asynchronous but will happen

 Remove loadbalance command
 --

 Key: CASSANDRA-2448
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2448
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 0.8
Reporter: Nick Bailey
Assignee: Nick Bailey
Priority: Minor
 Fix For: 0.8

 Attachments: 0001-Remove-loadbalance-command-from-nodetool.patch


 With the update to how the move command works, the loadbalance command is 
 even less useful than it was previously.  The loadbalance command now 
 calculates the token it is going to move to before it leaves which means it 
 isn't considering the load it is giving away. Given that, I think we should 
 just remove the loadbalance command entirely. Anyone who wants to do an old 
 style loadbalance can just do decommission then bootstrap.
 This is a minor change, and honestly I think it might count as a 'bug' so I 
 think we should squeeze it into 0.8, post-freeze. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2167) Add a deserialize(ByteBuffer) method to ColumnFamilySerializer

2011-04-11 Thread Vijay (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018537#comment-13018537
 ] 

Vijay commented on CASSANDRA-2167:
--

The initial assumption was that it would be handled by Memory.free; it seems it 
is not, so should we add some kind of lock before it frees up the memory? As a 
separate thread (mem.free)? Or is it too much overhead? If we agree on a lock, 
I can add some code to do so.
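
For discussion, a minimal reference-counting sketch (hypothetical names; not a 
description of how the current Memory class behaves) showing one way to ensure 
the native region is freed only after every reader that wrapped it has 
finished, without taking a lock on the read path:

{noformat}
import java.util.concurrent.atomic.AtomicInteger;

public final class RefCountedRegionSketch
{
    // 1 = the owning memtable's reference
    private final AtomicInteger references = new AtomicInteger(1);

    // Each reader that wraps the region in a ByteBuffer takes a reference first.
    public void retain()
    {
        references.incrementAndGet();
    }

    // Called by the memtable when it is discarded, and by each reader once its
    // buffer is no longer reachable from the read/network path.
    public void release()
    {
        if (references.decrementAndGet() == 0)
            freeNative(); // safe: no outstanding views remain, so no segfault
    }

    private void freeNative()
    {
        // Memory.free(...) or equivalent would go here in a real implementation.
    }
}
{noformat}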

 Add a deserialize(ByteBuffer) method to ColumnFamilySerializer
 --

 Key: CASSANDRA-2167
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2167
 Project: Cassandra
  Issue Type: Sub-task
  Components: Core
Affects Versions: 0.8
Reporter: Vijay
Assignee: Vijay
Priority: Minor
 Fix For: 1.0

 Attachments: 0001-Reduce-Byte-Copy-Streams.txt, 
 0002-Reduce-Byte-Copy-BBUtil.txt, 0003-Reduce-BB-Copy.txt


 by adding deserialize bytebuffer in the columnFamilySerializer we might be 
 able to avoid copying of the bytes[] while deserializing the CF.
 This can be done using
 ByteBuffer buff = ByteBuffer.wrap(bb.array(), bb.position(), length);
 This is an improvement on 
 https://issues.apache.org/jira/browse/CASSANDRA-1969; Plz see jonathan's 
 comment.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2420) row cache / streaming aren't aware of each other

2011-04-11 Thread Sylvain Lebresne (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-2420:


Attachment: 0001-Handle-the-row-cache-for-streamed-row.patch

There is a very simple fix for this issue: invalidate the cache for each key we 
index. The downside is that this invalidates every key that gets repaired, but 
updating the cache (instead of invalidating it) implies reading from disk, so 
whether that happens during indexing or at the next read may not matter much. 
In any case, it is better than the current situation.

I have however attached a patch (against trunk for now) that 'does the right 
thing' and updates the cache in the case of repair instead of invalidating it. 
I mention the first solution in case we consider the 'right one' too disruptive 
for 0.7, for instance (not that the patch is very complicated).

Note that the patch also fixes a tiny unrelated issue: the write stats are not 
updated during a write if the cache in use has 'isPutCopying' (this could be 
fixed separately).
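
For readers following along, a rough sketch of the two options described above 
(method names approximate to the ColumnFamilyStore API; this is not the 
attached patch itself):

{noformat}
import org.apache.cassandra.db.ColumnFamily;
import org.apache.cassandra.db.ColumnFamilyStore;
import org.apache.cassandra.db.DecoratedKey;

public final class StreamedRowCacheSketch
{
    // Option 1 (simple): drop any cached copy of a key we just rebuilt from a streamed
    // sstable, so a stale cached row can no longer shadow the streamed data.
    public static void invalidate(ColumnFamilyStore cfs, DecoratedKey key)
    {
        cfs.invalidateCachedRow(key);
    }

    // Option 2 ("the right thing"): merge the streamed columns into the cached row,
    // keeping the cache warm instead of forcing a disk read on the next request.
    public static void update(ColumnFamilyStore cfs, DecoratedKey key, ColumnFamily streamed)
    {
        ColumnFamily cached = cfs.getRawCachedRow(key);
        if (cached != null)
            cached.addAll(streamed);
    }
}
{noformat}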


 row cache / streaming aren't aware of each other
 

 Key: CASSANDRA-2420
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2420
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.6
Reporter: Matthew F. Dennis
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.5

 Attachments: 0001-Handle-the-row-cache-for-streamed-row.patch


 SSTableWriter.Builder.build() takes tables that resulted from streaming, 
 repair, bootstrapping, et cetera and builds the indexes and bloom filters 
 before adding it so the current node is aware of it.
 However, if there is data present in the cache for a row that is also present 
 in the streamed table the row cache can over shadow the data in the newly 
 built table.  In other words, until the row in row cache is removed from the 
 cache (e.g. because it's pushed out because of size, the node is restarted, 
 the cache is manually cleared) the data in the newly built table will never 
 be returned to clients.
 The solution that seems most reasonable at this point is to have 
 SSTableWriter.Builder.build() (or something below it) update the row cache if 
 the row key in the table being built is also present in the cache.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-2451) Make clean compactions cleanup the row cache

2011-04-11 Thread Sylvain Lebresne (JIRA)
Make clean compactions cleanup the row cache


 Key: CASSANDRA-2451
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2451
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.7.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor


We uselessly keep keys that have been cleaned up in the cache. This is not a 
big deal because they will get expunged eventually, but there is no point in 
wasting the memory in the meantime.
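
A minimal sketch of what this could look like during a cleanup compaction 
(names approximate; purely illustrative):

{noformat}
import java.util.Collection;

import org.apache.cassandra.db.ColumnFamilyStore;
import org.apache.cassandra.db.DecoratedKey;
import org.apache.cassandra.dht.Range;

public final class CleanupRowCacheSketch
{
    // Hypothetical sketch: as cleanup walks the sstable, any key that no longer
    // falls in one of the node's ranges is being discarded, so evict it from the
    // row cache as well instead of waiting for it to age out.
    public static void maybeEvict(ColumnFamilyStore cfs, Collection<Range> localRanges, DecoratedKey key)
    {
        for (Range range : localRanges)
            if (range.contains(key.token))
                return; // key still belongs to this node; leave the cache alone

        cfs.invalidateCachedRow(key);
    }
}
{noformat}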

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-2452) EC2 Snitch to be friendly to public IP and natively support EC2 multi-region.

2011-04-11 Thread Vijay (JIRA)
EC2 Snitch to be friendly to public IP and natively support EC2 multi-region.
-

 Key: CASSANDRA-2452
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2452
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.8
 Environment: JVM
Reporter: Vijay
Assignee: Vijay
Priority: Minor


Make Cassandra identify itself using the public IP (to avoid any future 
conflicts with private IPs).

1) Split the logic of identification vs. listen address in the code.
2) Move the logic that assigns the node's IP address into EndPointSnitch.
3) Make the EC2 snitch query for its public IP and use it for identification 
(see the sketch below).
4) Make the EC2 snitch use InetAddress.getLocal to listen on the private IP.
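
A small sketch of step 3; the metadata URL is the standard EC2 metadata service 
endpoint, everything else (class and method names) is illustrative:

{noformat}
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.InetAddress;
import java.net.URL;

public final class Ec2PublicIpSketch
{
    // Illustrative only: ask the EC2 metadata service for this instance's public IPv4
    // address, which the snitch could then use for identification/broadcast.
    public static InetAddress lookupPublicIp() throws Exception
    {
        URL url = new URL("http://169.254.169.254/latest/meta-data/public-ipv4");
        HttpURLConnection connection = (HttpURLConnection) url.openConnection();
        connection.setRequestMethod("GET");

        BufferedReader reader = new BufferedReader(new InputStreamReader(connection.getInputStream()));
        try
        {
            return InetAddress.getByName(reader.readLine().trim());
        }
        finally
        {
            reader.close();
            connection.disconnect();
        }
    }
}
{noformat}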

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2441:
--

Attachment: (was: 2441.txt)

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Fix For: 0.8

 Attachments: jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did is cloned git://git.apache.org/cassandra.git and did git reset 
 each commit with `ant clean && ant && ./bin/cassandra -f` until I got 
 cassandra started

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2451) Make clean compactions cleanup the row cache

2011-04-11 Thread Sylvain Lebresne (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-2451:


Attachment: 0001-Cleanup-cache-during-cleanup-compaction.patch

Patch is against 0.7.

 Make clean compactions cleanup the row cache
 

 Key: CASSANDRA-2451
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2451
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.7.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.5

 Attachments: 0001-Cleanup-cache-during-cleanup-compaction.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 We uselessly keep keys that have been cleaned up in the cache. This is not a 
 big deal because they will get expunged eventually, but there is no point in 
 wasting the memory in the meantime.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2441:
--

Attachment: 2441.txt

On Brandon's advice I moved the entire warning into the log4j call, even though 
this makes it unwieldy and we don't have perfect information as to what the 
cause is at that point.

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Fix For: 0.8

 Attachments: 2441.txt, jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did was clone git://git.apache.org/cassandra.git and git reset to 
 each commit, running `ant clean && ant && ./bin/cassandra -f`, until I got 
 Cassandra to start.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-1278) Make bulk loading into Cassandra less crappy, more pluggable

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-1278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018554#comment-13018554
 ] 

Jonathan Ellis commented on CASSANDRA-1278:
---

Go ahead and use git to split this up into logical pieces (e.g., internals 
changes, proxy server, tests).

Where does throughput max out in rows/s as you add clients (proxy nodes), vs. 
stress against plain Cassandra?

 Make bulk loading into Cassandra less crappy, more pluggable
 

 Key: CASSANDRA-1278
 URL: https://issues.apache.org/jira/browse/CASSANDRA-1278
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Jeremy Hanna
Assignee: Matthew F. Dennis
 Fix For: 0.7.5

 Attachments: 1278-cassandra-0.7-v2.txt, 1278-cassandra-0.7.1.txt, 
 1278-cassandra-0.7.txt

   Original Estimate: 40h
  Time Spent: 40h 40m
  Remaining Estimate: 0h

 Currently bulk loading into Cassandra is a black art.  People are either 
 directed to just do it responsibly with thrift or a higher-level client, or 
 they have to explore the contrib/bmt example - 
 http://wiki.apache.org/cassandra/BinaryMemtable.  That contrib module requires 
 delving into the code to find out how it works and then applying it to the 
 given problem.  Using either method, the user also needs to keep in mind that 
 overloading the cluster is possible - which will hopefully be addressed in 
 CASSANDRA-685.
 This improvement would be to create a contrib module or set of documents 
 dealing with bulk loading.  Perhaps it could include code in the Core to make 
 it more pluggable for external clients of different types.
 It is just that this is something that many who are new to Cassandra need to 
 do - bulk load their data into Cassandra.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018557#comment-13018557
 ] 

Pavel Yaskevich commented on CASSANDRA-2441:


Can you please re-attach? git apply and patch both say that the patch is 
corrupted at line 90.

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Fix For: 0.8

 Attachments: 2441.txt, jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did was clone git://git.apache.org/cassandra.git and git reset to 
 each commit, running `ant clean && ant && ./bin/cassandra -f`, until I got 
 Cassandra to start.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2420) row cache / streaming aren't aware of each other

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018560#comment-13018560
 ] 

Jonathan Ellis commented on CASSANDRA-2420:
---

I would be more comfortable having LCR throw UnsupportedOperation if asked for 
a full row, since You Shouldn't Do That.

Would prefer the updateCache case to be {{AES: ... default: invalidate and break}}; 
it's more obvious at a glance what the point is, and unnecessary invalidate 
calls will be harmless.
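
A hypothetical sketch of that shape (the enum values and cache API are stand-ins, and what the AES branch should actually do is elided above, so the put() below is only a placeholder):

{code}
import java.util.Map;

// Hypothetical sketch of the suggested updateCache switch.
class UpdateCacheSketch {
    enum OperationType { AES, BOOTSTRAP, UNBOOTSTRAP, RESTORE_REPLICA_COUNT }

    void updateCache(OperationType op, String key, byte[] streamedRow, Map<String, byte[]> rowCache) {
        switch (op) {
            case AES:
                // anti-entropy/repair: keep the cached row in step with the streamed data
                rowCache.put(key, streamedRow);
                break;
            default:
                // every other operation: just invalidate; an unnecessary
                // invalidate is harmless and the intent is obvious at a glance
                rowCache.remove(key);
                break;
        }
    }
}
{code}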

 row cache / streaming aren't aware of each other
 

 Key: CASSANDRA-2420
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2420
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.6
Reporter: Matthew F. Dennis
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.5

 Attachments: 0001-Handle-the-row-cache-for-streamed-row.patch


 SSTableWriter.Builder.build() takes sstables that result from streaming, 
 repair, bootstrapping, et cetera and builds their indexes and bloom filters 
 before adding them, so the current node is aware of them.
 However, if there is data present in the cache for a row that is also present 
 in the streamed table, the row cache can overshadow the data in the newly 
 built table.  In other words, until the row in the row cache is removed from 
 the cache (e.g. because it's pushed out due to size, the node is restarted, or 
 the cache is manually cleared), the data in the newly built table will never 
 be returned to clients.
 The solution that seems most reasonable at this point is to have 
 SSTableWriter.Builder.build() (or something below it) update the row cache if 
 the row key in the table being built is also present in the cache.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-2441:
--

Attachment: 2441.txt

Manually ripped the binary portion out of the patch.

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Fix For: 0.8

 Attachments: 2441.txt, 2441.txt, jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did was clone git://git.apache.org/cassandra.git and git reset to 
 each commit, running `ant clean && ant && ./bin/cassandra -f`, until I got 
 Cassandra to start.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2451) Make clean compactions cleanup the row cache

2011-04-11 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018562#comment-13018562
 ] 

Jonathan Ellis commented on CASSANDRA-2451:
---

+1

 Make clean compactions cleanup the row cache
 

 Key: CASSANDRA-2451
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2451
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.7.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.5

 Attachments: 0001-Cleanup-cache-during-cleanup-compaction.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 We uselessly keep in the cache keys that have been cleaned up. This is not a 
 big deal because they will get expunged eventually, but there is no point in 
 wasting the memory in the meantime.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-2441) Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10

2011-04-11 Thread Pavel Yaskevich (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018566#comment-13018566
 ] 

Pavel Yaskevich commented on CASSANDRA-2441:


+1

 Cassandra crashes with segmentation fault on Debian 5.0 and Ubuntu 10.10
 

 Key: CASSANDRA-2441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2441
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.8
 Environment: Both servers have identical hardware configuration: 
 Quad-Core AMD Opteron(tm) Processor 2374 HE, 4 GB RAM (rackspace servers)
 Java version 1.6.0_20
 OpenJDK Runtime Environment (IcedTea6 1.9.7) (6b20-1.9.7-0ubuntu1)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Pavel Yaskevich
Assignee: Jonathan Ellis
Priority: Critical
 Fix For: 0.8

 Attachments: 2441.txt, 2441.txt, jamm-0.2.1.jar


 Last working commit is c8d1984bf17cab58f40069e522d074c7b0077bc1 (merge from 
 0.7), branch: trunk.
 What I did was clone git://git.apache.org/cassandra.git and git reset to 
 each commit, running `ant clean && ant && ./bin/cassandra -f`, until I got 
 Cassandra to start.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-2453) node.js cql driver

2011-04-11 Thread Gary Dusbabek (JIRA)
node.js cql driver
--

 Key: CASSANDRA-2453
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2453
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Gary Dusbabek
Assignee: Gary Dusbabek
Priority: Minor
 Fix For: 0.8




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-2454) Possible deadlock for counter mutations

2011-04-11 Thread Stu Hood (JIRA)
Possible deadlock for counter mutations
---

 Key: CASSANDRA-2454
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2454
 Project: Cassandra
  Issue Type: Bug
Reporter: Stu Hood
 Fix For: 0.8


{{StorageProxy.applyCounterMutation}} is executed on the mutation stage, but it 
also submits tasks to the mutation stage, and then blocks for them. If there 
are more than a few concurrent mutations, this can lead to deadlock.
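
This is the classic pattern of a task blocking on work submitted to its own bounded executor. A minimal, self-contained illustration of the failure mode (not Cassandra code):

{code}
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Demonstrates the deadlock shape described above: every worker thread of a
// bounded pool submits a sub-task to the same pool and then blocks on it, so
// no thread is left to run the sub-tasks and the pool wedges permanently.
public class StageDeadlockDemo {
    public static void main(String[] args) {
        ExecutorService mutationStage = Executors.newFixedThreadPool(2);

        for (int i = 0; i < 2; i++) {
            mutationStage.submit(() -> {
                // re-submit to the stage we are already running on...
                Future<?> subTask = mutationStage.submit(() -> { /* e.g. replicate-on-write work */ });
                try {
                    subTask.get(); // ...and block for it: with all workers here, this never returns
                } catch (Exception e) {
                    throw new RuntimeException(e);
                }
                return null;
            });
        }
        // the main thread carries on, but the pool threads above are now stuck forever
        System.out.println("submitted; the mutation stage is deadlocked");
    }
}
{code}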

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-2454) Possible deadlock for counter mutations

2011-04-11 Thread Stu Hood (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-2454:


Attachment: 0001-Don-t-re-submit-to-the-mutation-stage.txt

Patch. (credit to Kelvin Kakugawa)
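
One way to avoid the re-submission, consistent with the patch title though not necessarily what the attached patch does: if the caller is already running on the mutation stage, apply the work inline instead of submitting it back and blocking. A hypothetical sketch:

{code}
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Future;

// Hypothetical sketch only: run counter-mutation work inline when we are
// already on the mutation stage, and only submit-and-wait when called from
// another thread, so the bounded stage never waits on itself.
class CounterMutationSketch {
    private static final ThreadLocal<Boolean> ON_MUTATION_STAGE =
            ThreadLocal.withInitial(() -> Boolean.FALSE);

    static <T> T applyOnMutationStage(ExecutorService mutationStage, Callable<T> work) throws Exception {
        if (ON_MUTATION_STAGE.get()) {
            return work.call(); // already on the stage: no re-submission, no blocking
        }
        Future<T> f = mutationStage.submit(() -> {
            ON_MUTATION_STAGE.set(Boolean.TRUE);
            try {
                return work.call();
            } finally {
                ON_MUTATION_STAGE.set(Boolean.FALSE);
            }
        });
        return f.get();
    }
}
{code}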

 Possible deadlock for counter mutations
 ---

 Key: CASSANDRA-2454
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2454
 Project: Cassandra
  Issue Type: Bug
Reporter: Stu Hood
 Fix For: 0.8

 Attachments: 0001-Don-t-re-submit-to-the-mutation-stage.txt


 {{StorageProxy.applyCounterMutation}} is executed on the mutation stage, but 
 it also submits tasks to the mutation stage, and then blocks for them. If 
 there are more than a few concurrent mutations, this can lead to deadlock.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-494) add remove_slice to the api

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-494.
--

Resolution: Later

 add remove_slice to the api
 ---

 Key: CASSANDRA-494
 URL: https://issues.apache.org/jira/browse/CASSANDRA-494
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Dan Di Spaltro
Priority: Minor

 It would be nice to mimic how get_slice works for removing values.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-571) API for requesting sub-slices of a range of supercolumns

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-571.
--

Resolution: Won't Fix

As we've figured out best practices around supercolumns, this isn't really 
necessary.  (Supercolumns are best used for denormalizing data that is accessed 
at the same time.)

 API for requesting sub-slices of a range of supercolumns
 

 Key: CASSANDRA-571
 URL: https://issues.apache.org/jira/browse/CASSANDRA-571
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Jonathan Ellis
Priority: Minor

 Suhail Doshi wrote in a comment to CASSANDRA-570 (a different issue):
 Ability to slice a column and specify an exact super column key, for example:
 column_1 {
sc1: {}
 }
 column_2 {
sc1: {}
sc2: {}
 }
 Be able to slice by column_1 to column_2 but instead of grabbing every 
 column, grab only super column sc1 from each? The reasoning is that it's 
 terrible to have to slice by column and get *every* super column and have it 
 held in memory for the client application.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-940) Allow multiple fat clients to connect from the same host

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-940.
--

Resolution: Later

 Allow multiple fat clients to connect from the same host
 --

 Key: CASSANDRA-940
 URL: https://issues.apache.org/jira/browse/CASSANDRA-940
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Johan Oskarsson
Priority: Minor

 Currently no more than one so-called fat client can access Cassandra from 
 the same machine, due to assumptions being made about the port used for 
 communication.
 It would be useful in many scenarios, such as reading data using Hadoop, to 
 allow many clients to connect from the same node, even from the same machine 
 that Cassandra is running on.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-941) Replace InetAddress usage with EndPoint

2011-04-11 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-941.
--

Resolution: Later

 Replace InetAddress usage with EndPoint
 ---

 Key: CASSANDRA-941
 URL: https://issues.apache.org/jira/browse/CASSANDRA-941
 Project: Cassandra
  Issue Type: Sub-task
  Components: Core
Reporter: Johan Oskarsson
 Attachments: CASSANDRA-941.patch, CASSANDRA-941.patch, 
 CASSANDRA-941.patch, CASSANDRA-941.patch


 An endpoint should be represented by both address and port, instead of just 
 the address.  Create an EndPoint class that takes an InetAddress and a port, 
 and use it where needed.
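
A minimal sketch of what such a class could look like (field and method names are illustrative, not taken from the attached patches):

{code}
import java.net.InetAddress;

// Illustrative sketch: an endpoint identified by address *and* port, usable
// as a map key (value-based equals/hashCode), instead of a bare InetAddress.
public final class EndPoint {
    private final InetAddress address;
    private final int port;

    public EndPoint(InetAddress address, int port) {
        this.address = address;
        this.port = port;
    }

    public InetAddress getAddress() { return address; }
    public int getPort() { return port; }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof EndPoint)) return false;
        EndPoint other = (EndPoint) o;
        return port == other.port && address.equals(other.address);
    }

    @Override
    public int hashCode() {
        return 31 * address.hashCode() + port;
    }

    @Override
    public String toString() {
        return address.getHostAddress() + ":" + port;
    }
}
{code}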

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

