[jira] [Commented] (CASSANDRA-4538) Strange CorruptedBlockException when massive insert binary data

2012-08-30 Thread Christian Schnidrig (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13444765#comment-13444765
 ] 

Christian Schnidrig commented on CASSANDRA-4538:


I'm afraid I ran into the same bug with version 1.1.4:

INFO [CompactionExecutor:1137] 2012-08-29 16:24:14,005 CompactionTask.java (line 109) Compacting [SSTableReader(path='/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6698-Data.db'), SSTableReader(path='/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6697-Data.db'), SSTableReader(path='/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6696-Data.db'), SSTableReader(path='/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6889-Data.db'), SSTableReader(path='/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-7053-Data.db')]

ERROR [CompactionExecutor:1137] 2012-08-29 16:24:14,712 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[CompactionExecutor:1137,1,main]
java.io.IOError: org.apache.cassandra.io.compress.CorruptedBlockException: (/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6889-Data.db): corruption detected, chunk at 262155 of length 65545.
    at org.apache.cassandra.db.compaction.PrecompactedRow.merge(PrecompactedRow.java:116)
    at org.apache.cassandra.db.compaction.PrecompactedRow.<init>(PrecompactedRow.java:99)
    at org.apache.cassandra.db.compaction.CompactionController.getCompactedRow(CompactionController.java:176)
    at org.apache.cassandra.db.compaction.CompactionIterable$Reducer.getReduced(CompactionIterable.java:83)
    at org.apache.cassandra.db.compaction.CompactionIterable$Reducer.getReduced(CompactionIterable.java:68)
    at org.apache.cassandra.utils.MergeIterator$ManyToOne.consume(MergeIterator.java:118)
    at org.apache.cassandra.utils.MergeIterator$ManyToOne.computeNext(MergeIterator.java:101)
    at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
    at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
    at com.google.common.collect.Iterators$7.computeNext(Iterators.java:614)
    at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
    at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
    at org.apache.cassandra.db.compaction.CompactionTask.execute(CompactionTask.java:173)
    at org.apache.cassandra.db.compaction.LeveledCompactionTask.execute(LeveledCompactionTask.java:50)
    at org.apache.cassandra.db.compaction.CompactionManager$1.runMayThrow(CompactionManager.java:154)
    at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
    at java.util.concurrent.FutureTask.run(FutureTask.java:166)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
    at java.lang.Thread.run(Thread.java:636)
Caused by: org.apache.cassandra.io.compress.CorruptedBlockException: (/mnt/md0/cassandra/data/content/oneChunkFileData/content-oneChunkFileData-he-6889-Data.db): corruption detected, chunk at 262155 of length 65545.
    at org.apache.cassandra.io.compress.CompressedRandomAccessReader.decompressChunk(CompressedRandomAccessReader.java:98)
    at org.apache.cassandra.io.compress.CompressedRandomAccessReader.reBuffer(CompressedRandomAccessReader.java:77)
    at org.apache.cassandra.io.util.RandomAccessReader.read(RandomAccessReader.java:302)
    at java.io.RandomAccessFile.readFully(RandomAccessFile.java:414)
    at java.io.RandomAccessFile.readFully(RandomAccessFile.java:394)
    at org.apache.cassandra.utils.BytesReadTracker.readFully(BytesReadTracker.java:95)
    at org.apache.cassandra.utils.ByteBufferUtil.read(ByteBufferUtil.java:401)
    at org.apache.cassandra.utils.ByteBufferUtil.readWithLength(ByteBufferUtil.java:363)
    at org.apache.cassandra.db.ColumnSerializer.deserialize(ColumnSerializer.java:119)
    at org.apache.cassandra.db.ColumnSerializer.deserialize(ColumnSerializer.java:36)
    at org.apache.cassandra.db.ColumnFamilySerializer.deserializeColumns(ColumnFamilySerializer.java:144)
    at org.apache.cassandra.io.sstable.SSTableIdentityIterator.getColumnFamilyWithColumns(SSTableIdentityIterator.java:234)
    at org.apache.cassandra.db.compaction.PrecompactedRow.merge(PrecompactedRow.java:112)
    ... 21 more

-
This happened on a CF with binary 
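
For context on what "corruption detected" means here: compressed sstable chunks are stored alongside a per-chunk checksum, and the reader raises CorruptedBlockException when the checksum of the bytes read back does not match the stored one. A minimal sketch of that kind of check (illustrative only, not Cassandra's actual implementation; zlib.crc32 stands in for whatever checksum the on-disk format uses):

```python
import zlib

def chunk_is_valid(chunk: bytes, stored_checksum: int) -> bool:
    """Return True if the chunk's checksum matches the one stored on disk."""
    return zlib.crc32(chunk) == stored_checksum

chunk = b"compressed chunk bytes"
good = zlib.crc32(chunk)

print(chunk_is_valid(chunk, good))                 # intact chunk passes
print(chunk_is_valid(chunk[:-1] + b"\x00", good))  # a single flipped byte fails
```

Changing even one byte of the chunk changes the CRC, which is why a bad disk or a write bug surfaces as this exception at compaction time, when every chunk gets re-read.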

[jira] [Created] (CASSANDRA-4587) StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext

2012-08-30 Thread Christian Schnidrig (JIRA)
Christian Schnidrig created CASSANDRA-4587:
--

 Summary: StackOverflowError in 
LeveledCompactionStrategy$LeveledScanner.computeNext
 Key: CASSANDRA-4587
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4587
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.4
 Environment: debian
OpenJDK 64-Bit Server VM/1.6.0_18
Heap size: 8341422080/8342470656
Reporter: Christian Schnidrig


While running nodetool repair, the following was logged in system.log:


ERROR [ValidationExecutor:2] 2012-08-30 10:58:19,490 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[ValidationExecutor:2,1,main]
java.lang.StackOverflowError
    at sun.nio.cs.UTF_8.updatePositions(UTF_8.java:76)
    at sun.nio.cs.UTF_8$Encoder.encodeArrayLoop(UTF_8.java:411)
    at sun.nio.cs.UTF_8$Encoder.encodeLoop(UTF_8.java:466)
    at java.nio.charset.CharsetEncoder.encode(CharsetEncoder.java:561)
    at java.lang.StringCoding$StringEncoder.encode(StringCoding.java:258)
    at java.lang.StringCoding.encode(StringCoding.java:290)
    at java.lang.String.getBytes(String.java:954)
    at java.io.RandomAccessFile.open(Native Method)
    at java.io.RandomAccessFile.<init>(RandomAccessFile.java:233)
    at org.apache.cassandra.io.util.RandomAccessReader.<init>(RandomAccessReader.java:67)
    at org.apache.cassandra.io.compress.CompressedRandomAccessReader.<init>(CompressedRandomAccessReader.java:64)
    at org.apache.cassandra.io.compress.CompressedRandomAccessReader.open(CompressedRandomAccessReader.java:46)
    at org.apache.cassandra.io.sstable.SSTableReader.openDataReader(SSTableReader.java:1007)
    at org.apache.cassandra.io.sstable.SSTableScanner.<init>(SSTableScanner.java:56)
    at org.apache.cassandra.io.sstable.SSTableBoundedScanner.<init>(SSTableBoundedScanner.java:41)
    at org.apache.cassandra.io.sstable.SSTableReader.getDirectScanner(SSTableReader.java:869)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:247)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
    ...
    (about 900 lines deleted)
    ...
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
    at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:202)
    at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
    at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
    at org.apache.cassandra.utils.MergeIterator$Candidate.advance(MergeIterator.java:147)
    at org.apache.cassandra.utils.MergeIterator$ManyToOne.<init>(MergeIterator.java:90)
    at org.apache.cassandra.utils.MergeIterator.get(MergeIterator.java:47)
    at org.apache.cassandra.db.compaction.CompactionIterable.iterator(CompactionIterable.java:60)
    at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:703)
    at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:69)
    at org.apache.cassandra.db.compaction.CompactionManager$8.call(CompactionManager.java:442)
    at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
    at java.util.concurrent.FutureTask.run(FutureTask.java:166)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
    at
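
The hundreds of computeNext frames alternating between two line numbers are the signature of the scanner advancing past each exhausted sstable with a recursive self-call: one stack frame per sstable, so a validation compaction over enough sstables overflows the stack. A toy illustration of the pattern (hypothetical code, not the LeveledScanner itself), alongside the loop-based shape that avoids it:

```python
import sys

def next_row_recursive(scanners, i=0):
    """Advance by recursion: one stack frame per scanner that yields nothing."""
    if i >= len(scanners):
        return None
    if scanners[i]:
        return scanners[i].pop(0)
    return next_row_recursive(scanners, i + 1)

def next_row_iterative(scanners):
    """Same traversal with constant stack depth."""
    for scanner in scanners:
        if scanner:
            return scanner.pop(0)
    return None

# A handful of scanners is fine either way...
print(next_row_recursive([[], [], ["row-a"]]))   # row-a
print(next_row_iterative([[], [], ["row-a"]]))   # row-a

# ...but past the interpreter's recursion limit, only the loop survives.
many = [[] for _ in range(sys.getrecursionlimit() + 100)] + [["row-b"]]
print(next_row_iterative(many))                  # row-b
```

Python raises RecursionError instead of the JVM's StackOverflowError, but the failure mode is the same: depth proportional to input size where it should be constant.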

[Cassandra Wiki] Trivial Update of ZLeoma by ZLeoma

2012-08-30 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The ZLeoma page has been changed by ZLeoma:
http://wiki.apache.org/cassandra/ZLeoma

New page:
Nothing to write about myself at all.
Yes! I'm a member of apache.org.
I just hope I'm useful at all

Look at my blog 
[[http://www.bootland.nl/tercoo-perago-roterende-straler.html|Informatie]]


[jira] [Created] (CASSANDRA-4588) CQL COPY ... FROM command is slow

2012-08-30 Thread JIRA
Piotr Kołaczkowski created CASSANDRA-4588:
-

 Summary: CQL COPY ... FROM command is slow
 Key: CASSANDRA-4588
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4588
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 1.1.4
 Environment: Ubuntu Linux 12.04, kernel 3.4.0
Reporter: Piotr Kołaczkowski


1. Created a CSV file with 10,000,000 rows and two integer columns; saved it 
to an SSD, which took a few seconds. The file is 184 MB.
2. Started a single local Cassandra node with fresh, empty data and commit log 
directories.
3. Created a keyspace with SimpleStrategy and RF=1.
4. Started loading the file with the COPY ... FROM command - it has been over 
15 minutes now and it is still loading.

top reports about 50% CPU usage for java (cassandra) and 50% for python.
I/O is almost idle, iowait < 0.1%.
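
Back-of-the-envelope numbers for the report above show why this looks CPU-bound rather than I/O-bound (taking the stated 15 minutes as a lower bound on total time, so these are upper bounds on throughput):

```python
rows = 10_000_000
csv_mb = 184
elapsed_s = 15 * 60          # at least 15 minutes and still running

print(round(rows / elapsed_s))      # ~11111 rows/s at best
print(round(csv_mb / elapsed_s, 2)) # ~0.2 MB/s of CSV; an SSD is idle at this rate
```

At roughly 0.2 MB/s of input against a disk capable of hundreds of MB/s, the bottleneck is per-row processing in the python driver and server, consistent with the 50%/50% CPU split reported by top.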



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-4588) CQL COPY ... FROM command is slow

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams resolved CASSANDRA-4588.
-

Resolution: Won't Fix

Wontfixing, since performance was never a goal, as outlined in CASSANDRA-4012.

 CQL COPY ... FROM command is slow
 -

 Key: CASSANDRA-4588
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4588
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Affects Versions: 1.1.4
 Environment: Ubuntu Linux 12.04, kernel 3.4.0
Reporter: Piotr Kołaczkowski

 1. Created a CSV file with 10,000,000 rows and two integer columns; saved it 
 to an SSD, which took a few seconds. The file is 184 MB.
 2. Started a single local Cassandra node with fresh, empty data and commit log 
 directories.
 3. Created a keyspace with SimpleStrategy and RF=1.
 4. Started loading the file with the COPY ... FROM command - it has been over 
 15 minutes now and it is still loading.
 top reports about 50% CPU usage for java (cassandra) and 50% for python.
 I/O is almost idle, iowait < 0.1%.



[3/3] git commit: Fix writing sstables to wrong directory when compacting

2012-08-30 Thread yukim
Fix writing sstables to wrong directory when compacting


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/4e6167da
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/4e6167da
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/4e6167da

Branch: refs/heads/trunk
Commit: 4e6167da57915e803946f35f039d7a33680f4693
Parents: 0525ae2
Author: Yuki Morishita yu...@apache.org
Authored: Wed Aug 29 10:20:11 2012 -0500
Committer: Yuki Morishita yu...@apache.org
Committed: Thu Aug 30 07:19:01 2012 -0500

--
 .../cassandra/db/compaction/CompactionTask.java|4 ++--
 1 files changed, 2 insertions(+), 2 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/4e6167da/src/java/org/apache/cassandra/db/compaction/CompactionTask.java
--
diff --git a/src/java/org/apache/cassandra/db/compaction/CompactionTask.java b/src/java/org/apache/cassandra/db/compaction/CompactionTask.java
index 4d2a90f..ff08b61 100644
--- a/src/java/org/apache/cassandra/db/compaction/CompactionTask.java
+++ b/src/java/org/apache/cassandra/db/compaction/CompactionTask.java
@@ -155,7 +155,7 @@ public class CompactionTask extends AbstractCompactionTask
                 return;
             }
 
-            SSTableWriter writer = cfs.createCompactionWriter(keysPerSSTable, dataDirectory, toCompact);
+            SSTableWriter writer = cfs.createCompactionWriter(keysPerSSTable, cfs.directories.getLocationForDisk(dataDirectory), toCompact);
             writers.add(writer);
             while (nni.hasNext())
             {
@@ -187,7 +187,7 @@ public class CompactionTask extends AbstractCompactionTask
                     sstables.add(toIndex);
                     if (nni.hasNext())
                     {
-                        writer = cfs.createCompactionWriter(keysPerSSTable, dataDirectory, toCompact);
+                        writer = cfs.createCompactionWriter(keysPerSSTable, cfs.directories.getLocationForDisk(dataDirectory), toCompact);
                         writers.add(writer);
                         cachedKeys = new HashMap<DecoratedKey, RowIndexEntry>();
                     }
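
The substance of the fix: the task held a reference to a data directory (a disk root), but the writer needs the column-family-specific location under that disk. A hypothetical illustration of the distinction (the path layout matches the sstable paths seen elsewhere in this digest; the function name mirrors, but is not, the actual Directories API):

```python
import os.path

def location_for_disk(disk_root: str, keyspace: str, cf: str) -> str:
    """Resolve the per-column-family directory under a given disk root."""
    return os.path.join(disk_root, keyspace, cf)

disk = "/mnt/md0/cassandra/data"
print(location_for_disk(disk, "content", "oneChunkFileData"))
# /mnt/md0/cassandra/data/content/oneChunkFileData
```

Passing the bare disk root instead of the resolved location is what made compaction write sstables to the wrong directory.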



[jira] [Resolved] (CASSANDRA-4292) Improve JBOD loadbalancing and reduce contention

2012-08-30 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita resolved CASSANDRA-4292.
---

Resolution: Fixed

Committed.

 Improve JBOD loadbalancing and reduce contention
 

 Key: CASSANDRA-4292
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4292
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Jonathan Ellis
Assignee: Yuki Morishita
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Fix-writing-sstables-to-wrong-directory-when-compact.patch, 4292.txt, 
 4292-v2.txt, 4292-v3.txt, 4292-v4.txt


 As noted in CASSANDRA-809, we have a certain number of flush (and compaction) 
 threads, which mix and match disk volumes indiscriminately. It may be worth 
 creating a tight thread-to-disk affinity, to prevent unnecessary conflict at 
 that level.
 OTOH, as SSDs become more prevalent this becomes a non-issue. It is unclear how 
 much pain this actually causes in practice in the meantime.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 0002-Fix-tests.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.
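
Rough numbers behind the savings claim (assumptions: 256 vnodes per node, and RandomPartitioner tokens printed as up to 39 decimal digits plus a separator in the text form):

```python
num_tokens = 256
binary_bytes = num_tokens * 16        # fixed-length 16-byte encoding per token
text_bytes = num_tokens * (39 + 1)    # worst-case decimal digits + separator

print(binary_bytes)                   # 4096
print(text_bytes)                     # 10240
print(text_bytes / binary_bytes)      # text form is 2.5x larger
```

Per node per gossip round that is several kilobytes saved, which adds up quickly across a large cluster.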



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 0003-Add-tokens-and-status-atomically.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 
0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: 0002-Fix-tests.txt
0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 
0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Commented] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13444935#comment-13444935
 ] 

Brandon Williams commented on CASSANDRA-4383:
-

bq. The approach taken here continues to make me a little hesitant simply 
because, I think, it introduces for the first time a need for proper ordering 
of STATE transmission/reception. I don't have a clear-enough understanding of 
how the underlying messaging works to know if we can firmly rely on that or not

In this case, it's ok, because SS only reacts to STATUS, so we can change any 
other gossip state before changing that one and, regardless of how many gossip 
events fire in the meantime, be guaranteed that other hosts react to the STATUS 
update after all other gossip state changes are received.
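
A toy model of that argument (hypothetical code; the assumption, taken from the comment above, is that a node's state updates are applied in the order sent and peers only act on STATUS):

```python
def deliver(peer_view, updates):
    """Apply gossip state updates in order; snapshot what a peer sees on STATUS."""
    reactions = []
    for key, value in updates:
        peer_view[key] = value
        if key == "STATUS":                    # peers only react to STATUS
            reactions.append(dict(peer_view))  # snapshot at reaction time
    return reactions

# TOKENS is changed before STATUS, so by the time any peer reacts,
# the new tokens are already visible in its view of the node's state.
seen = deliver({}, [("TOKENS", "binary-token-blob"), ("STATUS", "NORMAL")])
print(seen[0]["TOKENS"])    # binary-token-blob
```

However many gossip rounds elapse between the two updates, a reaction to STATUS can never observe the state as it was before TOKENS was set.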

bq. What I had in mind was (at least) something like a 
sendState{Normal,Bootstrap,...}(Collection<Token> tokens) to encapsulate those 
operations sensitive to ordering. Gossiper.addLocalApplicationStates(...) still 
makes it too easy to do the wrong thing.

That's basically syntactic sugar over 
{{Gossiper.addLocalApplicationStates(...)}}, which we'd still need as a building 
block.  I've given up on my previous attempt, which actually did not provide 
atomicity; it looks like doing that would require rewriting EndpointState to 
use the Reference-and-SnapTree approach similar to AtomicSortedColumns, and 
that is way, way out of scope for this ticket (especially since we don't 
actually need it).

bq. And in addition to getHostId, usesHostId also seems better suited to the 
Gossiper.

You're right; the updated patch moves those methods to Gossiper.  I kept the 
name {{getHostId}} there, though, since by asking the gossiper you should know 
you're getting the hostId as it is *in gossip*, which may be necessary instead 
of tMD for non-ring members (or old removed members, or whatever).

bq. But, mostly I meant that it doesn't read as well as the old code that 
clearly did one thing when the version was < X, and another when it was >= Y.

That makes sense; I brought the old NET_VERSION checks back, but relocated them 
to the gossiper.

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 
0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: 0002-Fix-tests.txt
0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Updated] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4383:


Attachment: (was: 0002-Fix-tests.txt)

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Commented] (CASSANDRA-4383) Binary encoding of vnode tokens

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13444944#comment-13444944
 ] 

Brandon Williams commented on CASSANDRA-4383:
-

Quoting myself:

bq. The last wrinkle is the case of bootstrapping a node without using vnodes, 
and without specifying a token. This technically still works, but produces a 
harmless NPE on the existing nodes when they first see the state and look for 
TOKENS. I'm hesitant to allow this to pass through though since it could paper 
over a truly serious bug, and bootstrapping in this fashion is already 
deprecated by vnodes. 

I fixed that in this patch too, by simply confirming that TOKENS is actually 
present for the host in handleStateBootstrap; if not, we just handle things 
the old way, since it's a legacy-style bootstrap.

 Binary encoding of vnode tokens
 ---

 Key: CASSANDRA-4383
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4383
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Brandon Williams
Assignee: Brandon Williams
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-Add-HOST_ID-and-TOKENS-app-states-binary-serialization.txt, 
 0002-Fix-tests.txt


 Since after CASSANDRA-4317 we can know which version a remote node is using 
 (that is, whether it is vnode-aware or not), this is a good opportunity to 
 change the token encoding to binary: with a default of 256 tokens per node, 
 even a fixed-length 16-byte encoding per token provides a great deal of 
 savings in gossip traffic over a text representation.



[jira] [Commented] (CASSANDRA-2897) Secondary indexes without read-before-write

2012-08-30 Thread Sam Tunnicliffe (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13444955#comment-13444955
 ] 

Sam Tunnicliffe commented on CASSANDRA-2897:


Looks like 0525ae25 introduced that test failure.

 Secondary indexes without read-before-write
 ---

 Key: CASSANDRA-2897
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2897
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 0.7.0
Reporter: Sylvain Lebresne
Assignee: Sam Tunnicliffe
Priority: Minor
  Labels: secondary_index
 Fix For: 1.2.0 beta 1

 Attachments: 
 0001-CASSANDRA-2897-Secondary-indexes-without-read-before-w.txt, 
 0002-CASSANDRA-2897-Secondary-indexes-without-read-before-w.txt, 
 0003-CASSANDRA-2897.txt, 2897-apply-cleanup.txt, 2897-v4.txt, 41ec9fc-2897.txt


 Currently, secondary index updates require a read-before-write to maintain 
 the index consistency. Keeping the index consistent at all times is not 
 strictly necessary, however. We could let the (secondary) index get 
 inconsistent on writes and repair it on reads. This would be easy because on 
 reads we make sure to request the indexed columns anyway, so we can just skip 
 the rows that are not needed and repair the index at the same time.
 This trades work on writes for work on reads. However, read-before-write 
 is sufficiently costly that it will likely be a win overall.
 There are (at least) two small technical difficulties here though:
 # If we repair on read, this will be racy with writes, so we'll probably have 
 to synchronize there.
 # We probably shouldn't rely only on reads to repair; we should also have a 
 task to repair the index for things that are rarely read. It's unclear how to 
 make that low-impact though.
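The repair-on-read scheme can be illustrated with plain maps. This is only a sketch of the idea under simplified assumptions (a single indexed value per row, strings instead of ByteBuffers), not Cassandra's implementation:

```java
import java.util.*;

public class RepairOnReadIndex
{
    public final Map<String, String> base = new HashMap<>();        // rowKey -> current value
    public final Map<String, Set<String>> index = new HashMap<>();  // value  -> rowKeys

    // Write path: no read-before-write, so we add the new index entry
    // but never look up (and hence never delete) the previous one.
    public void write(String rowKey, String value)
    {
        base.put(rowKey, value);
        index.computeIfAbsent(value, v -> new HashSet<>()).add(rowKey);
    }

    // Read path: since we fetch the indexed column anyway, we can skip
    // rows whose value no longer matches and purge the stale entry.
    public List<String> search(String value)
    {
        List<String> hits = new ArrayList<>();
        Set<String> candidates = index.getOrDefault(value, Collections.emptySet());
        for (Iterator<String> it = candidates.iterator(); it.hasNext(); )
        {
            String rowKey = it.next();
            if (value.equals(base.get(rowKey)))
                hits.add(rowKey);
            else
                it.remove(); // repair: drop the inconsistent index entry
        }
        return hits;
    }
}
```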



[jira] [Reopened] (CASSANDRA-4567) Error in log related to Murmur3Partitioner

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis reopened CASSANDRA-4567:
---


looks like this broke 
SSTableReaderTest.testPersistantStatisticsWithSecondaryIndex

 Error in log related to Murmur3Partitioner
 --

 Key: CASSANDRA-4567
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4567
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.2.0 beta 1
 Environment: Using ccm on ubuntu
Reporter: Tyler Patterson
Assignee: Vijay
 Fix For: 1.2.0 beta 1

 Attachments: 0001-CASSANDRA-4567.patch, 0001-CASSANDRA-4567-v2.patch, 
 0001-CASSANDRA-4567-v3.patch


 Start a 2-node cluster on cassandra-1.1. Bring down one node, upgrade it to 
 trunk, start it back up. The following error shows up in the log:
 {code}
 ...
  INFO [main] 2012-08-22 10:44:40,012 CacheService.java (line 170) Scheduling 
 row cache save to each 0 seconds (going to save all keys).
  INFO [SSTableBatchOpen:1] 2012-08-22 10:44:40,106 SSTableReader.java (line 
 164) Opening 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-2
  (148 bytes)
  INFO [SSTableBatchOpen:2] 2012-08-22 10:44:40,106 SSTableReader.java (line 
 164) Opening 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-1
  (226 bytes)
  INFO [SSTableBatchOpen:3] 2012-08-22 10:44:40,106 SSTableReader.java (line 
 164) Opening 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-3
  (89 bytes)
 ERROR [SSTableBatchOpen:3] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 
 131) Exception in thread Thread[SSTableBatchOpen:3,5,main]
 java.lang.RuntimeException: Cannot open 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-3
  because partitioner does not match 
 org.apache.cassandra.dht.Murmur3Partitioner
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at 
 org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 ERROR [SSTableBatchOpen:2] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 
 131) Exception in thread Thread[SSTableBatchOpen:2,5,main]
 java.lang.RuntimeException: Cannot open 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-1
  because partitioner does not match 
 org.apache.cassandra.dht.Murmur3Partitioner
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at 
 org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 ERROR [SSTableBatchOpen:1] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 
 131) Exception in thread Thread[SSTableBatchOpen:1,5,main]
 java.lang.RuntimeException: Cannot open 
 /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-2
  because partitioner does not match 
 org.apache.cassandra.dht.Murmur3Partitioner
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at 
 org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at 

[jira] [Comment Edited] (CASSANDRA-2897) Secondary indexes without read-before-write

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13444965#comment-13444965
 ] 

Jonathan Ellis edited comment on CASSANDRA-2897 at 8/31/12 12:59 AM:
-

bq. the test was definitely failing until I got dummyColumn figured out 

FTR, the problem was

{{IColumn dummyColumn = new Column(liveColumn.name(), column.value(), 
column.timestamp());}}

Here, {{column}} is the column from the index row, so column.value is always 
empty.  To actually delete the old index entry we need to use the column value 
that was indexed:

{{IColumn dummyColumn = new Column(baseColumnName, indexedValue, 
column.timestamp());}}

There was a corresponding bug for the non-composite case as well.

(Edit: liveColumn.name() == baseColumnName, just thought the latter was a bit 
more clear.)




[jira] [Commented] (CASSANDRA-2897) Secondary indexes without read-before-write

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13444965#comment-13444965
 ] 

Jonathan Ellis commented on CASSANDRA-2897:
---

bq. the test was definitely failing until I got dummyColumn figured out 

FTR, the problem was

{{IColumn dummyColumn = new Column(liveColumn.name(), column.value(), 
column.timestamp());}}

Here, {{column}} is the column from the index row, so column.value is always 
empty.  To actually delete the old index entry we need to use the column value 
that was indexed:

{{IColumn dummyColumn = new Column(baseColumnName, indexedValue, 
column.timestamp());}}

There was a corresponding bug for the non-composite case as well.




[jira] [Commented] (CASSANDRA-2897) Secondary indexes without read-before-write

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13444970#comment-13444970
 ] 

Jonathan Ellis commented on CASSANDRA-2897:
---

bq. failing test in SSTableReaderTest

reopened CASSANDRA-4567 for that.

fixed javadoc typo and committed.




[2/2] redesign KEYS indexes to avoid read-before-write patch by Sam Tunnicliffe, jbellis, and Philip Jenvey for CASSANDRA-2897

2012-08-30 Thread jbellis
http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a1b93d7/test/unit/org/apache/cassandra/db/ColumnFamilyStoreTest.java
--
diff --git a/test/unit/org/apache/cassandra/db/ColumnFamilyStoreTest.java 
b/test/unit/org/apache/cassandra/db/ColumnFamilyStoreTest.java
index 8fd5807..090aad3 100644
--- a/test/unit/org/apache/cassandra/db/ColumnFamilyStoreTest.java
+++ b/test/unit/org/apache/cassandra/db/ColumnFamilyStoreTest.java
@@ -60,6 +60,7 @@ import org.apache.cassandra.config.ConfigurationException;
 import org.apache.cassandra.db.columniterator.IdentityQueryFilter;
 import org.apache.cassandra.db.filter.*;
 import org.apache.cassandra.db.index.SecondaryIndex;
+import org.apache.cassandra.db.marshal.CompositeType;
 import org.apache.cassandra.db.marshal.LexicalUUIDType;
 import org.apache.cassandra.db.marshal.LongType;
 import org.apache.cassandra.dht.Bounds;
@@ -424,6 +425,143 @@ public class ColumnFamilyStoreTest extends SchemaLoader
 assert k1.equals( key );
 
 }
+
+    @Test
+    public void testDeleteOfInconsistentValuesInKeysIndex() throws Exception
+    {
+        String keySpace = "Keyspace2";
+        String cfName = "Indexed1";
+
+        Table table = Table.open(keySpace);
+        ColumnFamilyStore cfs = table.getColumnFamilyStore(cfName);
+        cfs.truncate().get();
+
+        ByteBuffer rowKey = ByteBufferUtil.bytes("k1");
+        ByteBuffer colName = ByteBufferUtil.bytes("birthdate");
+        ByteBuffer val1 = ByteBufferUtil.bytes(1L);
+        ByteBuffer val2 = ByteBufferUtil.bytes(2L);
+
+        // create a row and update the birthdate value, test that the index query fetches this version
+        RowMutation rm;
+        rm = new RowMutation(keySpace, rowKey);
+        rm.add(new QueryPath(cfName, null, colName), val1, 0);
+        rm.apply();
+        IndexExpression expr = new IndexExpression(colName, IndexOperator.EQ, val1);
+        List<IndexExpression> clause = Arrays.asList(expr);
+        IFilter filter = new IdentityQueryFilter();
+        Range<RowPosition> range = Util.range("", "");
+        List<Row> rows = table.getColumnFamilyStore(cfName).search(clause, range, 100, filter);
+        assertEquals(1, rows.size());
+
+        // force a flush, so our index isn't being read from a memtable
+        table.getColumnFamilyStore(cfName).forceBlockingFlush();
+
+        // now apply another update, but force the index update to be skipped
+        rm = new RowMutation(keySpace, rowKey);
+        rm.add(new QueryPath(cfName, null, colName), val2, 1);
+        table.apply(rm, true, false);
+
+        // Now searching the index for either the old or new value should return 0 rows
+        // because the new value was not indexed and the old value should be ignored
+        // (and in fact purged from the index cf).
+        // first check for the old value
+        rows = table.getColumnFamilyStore(cfName).search(clause, range, 100, filter);
+        assertEquals(0, rows.size());
+        // now check for the updated value
+        expr = new IndexExpression(colName, IndexOperator.EQ, val2);
+        clause = Arrays.asList(expr);
+        filter = new IdentityQueryFilter();
+        range = Util.range("", "");
+        rows = table.getColumnFamilyStore(cfName).search(clause, range, 100, filter);
+        assertEquals(0, rows.size());
+
+        // now, reset back to the original value, still skipping the index update, to
+        // make sure the value was expunged from the index when it was discovered to be inconsistent
+        rm = new RowMutation(keySpace, rowKey);
+        rm.add(new QueryPath(cfName, null, colName), ByteBufferUtil.bytes(1L), 3);
+        table.apply(rm, true, false);
+
+        expr = new IndexExpression(colName, IndexOperator.EQ, ByteBufferUtil.bytes(1L));
+        clause = Arrays.asList(expr);
+        filter = new IdentityQueryFilter();
+        range = Util.range("", "");
+        rows = table.getColumnFamilyStore(cfName).search(clause, range, 100, filter);
+        assertEquals(0, rows.size());
+    }
+
+    @Test
+    public void testDeleteOfInconsistentValuesFromCompositeIndex() throws Exception
+    {
+        String keySpace = "Keyspace2";
+        String cfName = "Indexed2";
+
+        Table table = Table.open(keySpace);
+        ColumnFamilyStore cfs = table.getColumnFamilyStore(cfName);
+        cfs.truncate().get();
+
+        ByteBuffer rowKey = ByteBufferUtil.bytes("k1");
+        ByteBuffer clusterKey = ByteBufferUtil.bytes("ck1");
+        ByteBuffer colName = ByteBufferUtil.bytes("col1");
+        CompositeType baseComparator = (CompositeType) cfs.getComparator();
+        CompositeType.Builder builder = baseComparator.builder();
+        builder.add(clusterKey);
+        builder.add(colName);
+        ByteBuffer compositeName = builder.build();
+
+        ByteBuffer val1 = ByteBufferUtil.bytes("v1");
+        ByteBuffer val2 = ByteBufferUtil.bytes("v2");
+
+        // create a 

[jira] [Commented] (CASSANDRA-4542) add atomic_batch_mutate method

2012-08-30 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445002#comment-13445002
 ] 

Aleksey Yeschenko commented on CASSANDRA-4542:
--

Is it safe to replay counter mutations? If not, should they even be allowed as 
part of the batch?
I currently don't serialize them to the batchlog, only saving RowMutations. Am 
I right about this?

 add atomic_batch_mutate method
 --

 Key: CASSANDRA-4542
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4542
 Project: Cassandra
  Issue Type: Sub-task
  Components: API, Core
Reporter: Jonathan Ellis
Assignee: Aleksey Yeschenko
 Fix For: 1.2.1


 atomic_batch_mutate will have the same parameters as batch_mutate, but will 
 write to the batchlog before attempting distribution to the batch rows' 
 replicas.
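The flow described above (log first, distribute second, clear on success) can be sketched with an in-memory stand-in. The names `AtomicBatchSketch`, `atomicBatchMutate`, and the boolean delivery flag are illustrative only, not Cassandra's API; the real batchlog is a durable system table with a background replay task.

```java
import java.util.*;

public class AtomicBatchSketch
{
    public final Map<UUID, List<String>> batchlog = new HashMap<>(); // toy stand-in for the batchlog table
    public final List<String> delivered = new ArrayList<>();         // toy stand-in for replica writes

    public void atomicBatchMutate(List<String> mutations, boolean replicasReachable)
    {
        UUID id = UUID.randomUUID();
        batchlog.put(id, mutations);         // 1. record the whole batch before distribution
        if (replicasReachable)
        {
            delivered.addAll(mutations);     // 2. distribute to the batch rows' replicas
            batchlog.remove(id);             // 3. success: the log entry is no longer needed
        }
        // on failure the entry remains and a replay task retries it later
    }
}
```

This is why the batch is atomic from the client's view: either the batchlog entry is eventually replayed in full, or delivery succeeded and the entry was discarded.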



[jira] [Commented] (CASSANDRA-4542) add atomic_batch_mutate method

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445012#comment-13445012
 ] 

Jonathan Ellis commented on CASSANDRA-4542:
---

It is not safe, so they should not be allowed in the batch.




[Cassandra Wiki] Update of Metrics by yukim

2012-08-30 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The Metrics page has been changed by yukim:
http://wiki.apache.org/cassandra/Metrics

New page:
= Cassandra Metrics =

This page describes the new metrics 
([[https://issues.apache.org/jira/browse/CASSANDRA-4009|CASSANDRA-4009]]) 
planned for the upcoming version 1.2.

== Overview ==

Apache Cassandra version 1.1 introduced metrics using 
[[https://github.com/codahale/metrics|Codahale's Metrics library]]. The library 
enables easier exposure of metrics and integration with other systems.
The metrics you can get are basically the same as in 1.1, but they have been 
reimplemented and organized using the Metrics library. You can still query 
using the old JMX paths, but they are deprecated and may be removed in a 
future version.

As of version 1.2, Cassandra exposes the following groups of metrics.

 * [[#cache|Cache]]
 * [[#clientrequest|Client Request]]
 * [[#columnfamily|ColumnFamily]]
 * [[#commitlog|Commit Log]]
 * [[#compaction|Compaction]]
 * [[#connection|Connection]]
 * [[#droppedmessage|Dropped Message]]
 * [[#streaming|Streaming]]
 * [[#storage|Storage]]
 * [[#threadpool|Thread Pool]]

Anchor(cache)
== Cache (5 metrics/cache) ==

Cache metrics are created per cache type (key cache, row cache).

=== Codahale Metric Name ===

This section shows defined !MetricName properties.

|| group || org.apache.cassandra.metrics ||
|| type  || Cache||
|| scope || !KeyCache or !RowCache   ||
|| name  || name of metric   ||

=== JMX Object Name ===

This section shows the JMX !ObjectName for this category.

{{{ org.apache.cassandra.metrics:type=Cache,scope=(Cache type),name=(Metric name) }}}

=== Metrics ===

!CapacityInBytes
: Cache capacity in bytes.

Hits
: Cache hit count.

!HitRate
: Cache hit rate.

Requests
: Cache request count.

Size
: Cache size in bytes.
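For example, the key cache's Hits metric above is addressable with an ObjectName built from that pattern. Constructing and inspecting the name needs only the JDK; actually reading the value would additionally require a JMX connection to a running node (typically via MBeanServerConnection, where a Metrics counter exposes a Count attribute).

```java
import javax.management.ObjectName;

public class CacheMetricName
{
    // Builds the name matching: type=Cache, scope=(Cache type), name=(Metric name)
    public static ObjectName keyCacheHits()
    {
        try
        {
            return new ObjectName("org.apache.cassandra.metrics:type=Cache,scope=KeyCache,name=Hits");
        }
        catch (javax.management.MalformedObjectNameException e)
        {
            throw new RuntimeException(e); // the literal above is well-formed
        }
    }
}
```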

Anchor(clientrequest)
== Client Request (4 metrics/request type) ==

Metrics for read/range slice/write client request.

=== Codahale Metric Name ===

|| group || org.apache.cassandra.metrics ||
|| type  || !ClientRequest   ||
|| scope || Read or Write or !RangeSlice ||
|| name  || name of metric   ||

=== JMX Object Name ===

{{{ org.apache.cassandra.metrics:type=ClientRequest,scope=(Read|Write|RangeSlice),name=(Metric name) }}}

=== Metrics ===

Latency
: Latency statistics.

!TotalLatency
: Total latency in microseconds.

Timeouts
: Total number of timeout requests. More precisely, total number of 
!TimeoutException thrown.

Unavailables
: Total number of unavailable requests. More precisely, total number of 
!UnavailableException thrown.

Anchor(columnfamily)
== ColumnFamily (23 metrics/ColumnFamily) ==

!ColumnFamily metrics are created per !ColumnFamily.

=== Codahale Metric Name ===

|| group || org.apache.cassandra.metrics ||
|| type  || !ColumnFamily or !IndexColumnFamily  ||
|| scope || (Keyspace name).(!ColumnFamily name) ||
|| name  || name of metric   ||

If !ColumnFamily is for secondary index, then ''type'' will be 
''IndexColumnFamily''.

=== JMX Object Name ===

{{{ org.apache.cassandra.metrics:type=(ColumnFamily|IndexColumnFamily),keyspace=(Keyspace name),scope=(ColumnFamily Name),name=(Metric name) }}}

=== Metrics ===

!BloomFilterDiskSpaceUsed
: Disk space used by bloom filter.

!BloomFilterFalsePositives
: Number of false positives for bloom filter.

!BloomFilterFalseRatio
: False positive ratio of bloom filter.

!CompressionRatio
: Current compression ratio for all SSTables.

!EstimatedRowSizeHistogram
: Histogram of estimated row size (in bytes).

!EstimatedColumnCountHistogram
: Histogram of estimated number of columns.

!LiveDiskSpaceUsed
: Disk space used by 'live' SSTables.

LiveSSTableCount
: Number of 'live' SSTables.

!MaxRowSize
: Size of the largest compacted row.

!MeanRowSize
: Mean size of compacted rows.

!MemtableColumnsCount
: Total number of columns present in memtable.

!MemtableDataSize
: Total amount of data stored in memtable, including column related overhead.

!MemtableSwitchCount
: Number of times flushing has resulted in memtable being switched out.

!MinRowSize
: Size of the smallest compacted row.

!PendingTasks
: Estimated number of tasks pending.

!ReadLatency
: Read latency statistics.

!ReadTotalLatency
: Total latency in microseconds for reads.

!RecentBloomFilterFalsePositives
: Number of false positives since last check.

!RecentBloomFilterFalseRatio
: False positive ratio since last check.

SSTablesPerReadHistogram
: Histogram of the number of SSTables accessed per read.

!TotalDiskSpaceUsed
: Total disk space used by SSTables including obsolete ones waiting to be GC'd.

!WriteLatency
: Write latency statistics.

!WriteTotalLatency
: Total latency in microseconds for writes.

Anchor(commitlog)
== Commit Log (3 metrics) ==

=== Codahale Metric Name ===

|| group || org.apache.cassandra.metrics ||
|| type  || !CommitLog   ||

[jira] [Updated] (CASSANDRA-4497) Update CQL pseudo-maps to real maps

2012-08-30 Thread Pavel Yaskevich (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pavel Yaskevich updated CASSANDRA-4497:
---

Attachment: CASSANDRA-4497.patch

{replication, compression, compaction}_parameters can now be set using the 
key = { k : v, ... } syntax.

 Update CQL pseudo-maps to real maps
 ---

 Key: CASSANDRA-4497
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4497
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.2.0 beta 1
Reporter: Jonathan Ellis
Assignee: Pavel Yaskevich
 Fix For: 1.2.0 beta 1

 Attachments: CASSANDRA-4497.patch


 - compression_parameters
 - replication_parameters (combine strategy + options like we did compression)
 - anything else?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4587) StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4587:
--

Attachment: 4587.txt

patch to simplify LeveledScanner.computeNext and avoid recursion
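The overflow in the trace below arises because each exhausted per-sstable scanner triggers a recursive call back into computeNext, costing one stack frame per sstable. The general shape of the fix is to advance to the next scanner in a loop instead of recursing. A minimal stand-alone illustration of that transformation (not the actual patch, and using a plain Iterator rather than Cassandra's scanner types):

```java
import java.util.Iterator;
import java.util.List;

public class FlattenLoop
{
    // Chains a list of iterators, advancing past empty ones with a loop
    // instead of calling itself recursively whenever the current one runs dry.
    public static <T> Iterator<T> flatten(List<Iterator<T>> parts)
    {
        return new Iterator<T>()
        {
            int current = 0;

            public boolean hasNext()
            {
                // loop, not recursion: exhausting thousands of empty parts
                // costs iterations, never stack frames
                while (current < parts.size() && !parts.get(current).hasNext())
                    current++;
                return current < parts.size();
            }

            public T next()
            {
                hasNext(); // position on a non-empty part
                return parts.get(current).next();
            }
        };
    }
}
```

A recursive version of hasNext would overflow on the 10,000 empty parts below; the loop handles them in constant stack space.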

 StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext
 --

 Key: CASSANDRA-4587
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4587
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.4
 Environment: debian
 OpenJDK 64-Bit Server VM/1.6.0_18
 Heap size: 8341422080/8342470656
Reporter: Christian Schnidrig
Assignee: Jonathan Ellis
Priority: Minor
  Labels: compaction
 Fix For: 1.1.5

 Attachments: 4587.txt


 while running nodetool repair, the following was logged in system.log:
 ERROR [ValidationExecutor:2] 2012-08-30 10:58:19,490 
 AbstractCassandraDaemon.java (line 134) Exception in thread 
 Thread[ValidationExecutor:2,1,main]
 java.lang.StackOverflowError
 at sun.nio.cs.UTF_8.updatePositions(UTF_8.java:76)
 at sun.nio.cs.UTF_8$Encoder.encodeArrayLoop(UTF_8.java:411)
 at sun.nio.cs.UTF_8$Encoder.encodeLoop(UTF_8.java:466)
 at java.nio.charset.CharsetEncoder.encode(CharsetEncoder.java:561)
 at java.lang.StringCoding$StringEncoder.encode(StringCoding.java:258)
 at java.lang.StringCoding.encode(StringCoding.java:290)
 at java.lang.String.getBytes(String.java:954)
 at java.io.RandomAccessFile.open(Native Method)
 at java.io.RandomAccessFile.init(RandomAccessFile.java:233)
 at 
 org.apache.cassandra.io.util.RandomAccessReader.init(RandomAccessReader.java:67)
 at 
 org.apache.cassandra.io.compress.CompressedRandomAccessReader.init(CompressedRandomAccessReader.java:64)
 at 
 org.apache.cassandra.io.compress.CompressedRandomAccessReader.open(CompressedRandomAccessReader.java:46)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.openDataReader(SSTableReader.java:1007)
 at 
 org.apache.cassandra.io.sstable.SSTableScanner.init(SSTableScanner.java:56)
 at 
 org.apache.cassandra.io.sstable.SSTableBoundedScanner.init(SSTableBoundedScanner.java:41)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.getDirectScanner(SSTableReader.java:869)
 at 
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:247)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
.
(about 900 lines deleted)
.
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:202)
at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
at org.apache.cassandra.utils.MergeIterator$Candidate.advance(MergeIterator.java:147)
at org.apache.cassandra.utils.MergeIterator$ManyToOne.<init>(MergeIterator.java:90)
at org.apache.cassandra.utils.MergeIterator.get(MergeIterator.java:47)
at org.apache.cassandra.db.compaction.CompactionIterable.iterator(CompactionIterable.java:60)
at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:703)
at 
 

[jira] [Updated] (CASSANDRA-4587) StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4587:
--

 Reviewer: yukim
 Priority: Minor  (was: Major)
Fix Version/s: 1.1.5
 Assignee: Jonathan Ellis
   Labels: compaction  (was: )

 StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext
 --

 Key: CASSANDRA-4587
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4587
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.4
 Environment: debian
 OpenJDK 64-Bit Server VM/1.6.0_18
 Heap size: 8341422080/8342470656
Reporter: Christian Schnidrig
Assignee: Jonathan Ellis
Priority: Minor
  Labels: compaction
 Fix For: 1.1.5

 Attachments: 4587.txt


 while running nodetool repair, the following was logged in system.log:
 ERROR [ValidationExecutor:2] 2012-08-30 10:58:19,490 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[ValidationExecutor:2,1,main]
 java.lang.StackOverflowError
 at sun.nio.cs.UTF_8.updatePositions(UTF_8.java:76)
 at sun.nio.cs.UTF_8$Encoder.encodeArrayLoop(UTF_8.java:411)
 at sun.nio.cs.UTF_8$Encoder.encodeLoop(UTF_8.java:466)
 at java.nio.charset.CharsetEncoder.encode(CharsetEncoder.java:561)
 at java.lang.StringCoding$StringEncoder.encode(StringCoding.java:258)
 at java.lang.StringCoding.encode(StringCoding.java:290)
 at java.lang.String.getBytes(String.java:954)
 at java.io.RandomAccessFile.open(Native Method)
 at java.io.RandomAccessFile.<init>(RandomAccessFile.java:233)
 at org.apache.cassandra.io.util.RandomAccessReader.<init>(RandomAccessReader.java:67)
 at org.apache.cassandra.io.compress.CompressedRandomAccessReader.<init>(CompressedRandomAccessReader.java:64)
 at org.apache.cassandra.io.compress.CompressedRandomAccessReader.open(CompressedRandomAccessReader.java:46)
 at org.apache.cassandra.io.sstable.SSTableReader.openDataReader(SSTableReader.java:1007)
 at org.apache.cassandra.io.sstable.SSTableScanner.<init>(SSTableScanner.java:56)
 at org.apache.cassandra.io.sstable.SSTableBoundedScanner.<init>(SSTableBoundedScanner.java:41)
 at org.apache.cassandra.io.sstable.SSTableReader.getDirectScanner(SSTableReader.java:869)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:247)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 .
 (about 900 lines deleted)
 .
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:202)
 at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
 at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
 at org.apache.cassandra.utils.MergeIterator$Candidate.advance(MergeIterator.java:147)
 at org.apache.cassandra.utils.MergeIterator$ManyToOne.<init>(MergeIterator.java:90)
 at org.apache.cassandra.utils.MergeIterator.get(MergeIterator.java:47)
 at org.apache.cassandra.db.compaction.CompactionIterable.iterator(CompactionIterable.java:60)
 at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:703)
 at 
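Editor's note: the alternating computeNext frames at lines 240/247/248 suggest the scanner recurses once for every exhausted per-sstable scanner, so a validation pass over thousands of small leveled sstables burns one stack frame per sstable. The sketch below models that failure shape and its constant-stack iterative counterpart; it is illustrative only, not Cassandra's actual code, and the names are invented.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ScannerSketch {
    // Hypothetical model of the bug: when the current scanner is empty,
    // computeNext() calls itself to try the next one, adding one stack
    // frame per sstable that has to be skipped.
    static Integer nextRecursive(List<Iterator<Integer>> scanners, int i) {
        if (i >= scanners.size())
            return null;                        // every scanner exhausted
        if (scanners.get(i).hasNext())
            return scanners.get(i).next();
        return nextRecursive(scanners, i + 1);  // one extra frame per skip
    }

    // The same traversal as a loop: empty scanners are skipped with
    // constant stack depth, regardless of how many there are.
    static Integer nextIterative(List<Iterator<Integer>> scanners, int i) {
        for (; i < scanners.size(); i++)
            if (scanners.get(i).hasNext())
                return scanners.get(i).next();
        return null;
    }

    public static void main(String[] args) {
        List<Iterator<Integer>> scanners = Arrays.asList(
                Arrays.<Integer>asList().iterator(),  // empty sstable range
                Arrays.<Integer>asList().iterator(),  // empty sstable range
                Arrays.asList(42).iterator());
        System.out.println(nextIterative(scanners, 0)); // prints 42
    }
}
```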
 

[jira] [Created] (CASSANDRA-4589) writes are allowed when RF > N

2012-08-30 Thread Brandon Williams (JIRA)
Brandon Williams created CASSANDRA-4589:
---

 Summary: writes are allowed when RF > N
 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
 Fix For: 1.2.0


Easily repro'd by running stress with a ridiculous rf:

{{ # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456 }}

We're supposed to allow creation of a ks where the rf exceeds the amount of 
nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4589) writes are allowed when RF > N

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4589:


Description: 
Easily repro'd by running stress with a ridiculous rf:

# tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456

We're supposed to allow creation of a ks where the rf exceeds the amount of 
nodes, but we shouldn't be able to write to it.

  was:
Easily repro'd by running stress with a ridiculous rf:

{{ # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456 }}

We're supposed to allow creation of a ks where the rf exceeds the amount of 
nodes, but we shouldn't be able to write to it.


 writes are allowed when RF > N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-4567) Error in log related to Murmur3Partitioner

2012-08-30 Thread Vijay (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vijay resolved CASSANDRA-4567.
--

Resolution: Fixed

Fixed in 9cd53fba648ae5a30a181f8a06786f33db95a0fe

 Error in log related to Murmur3Partitioner
 --

 Key: CASSANDRA-4567
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4567
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.2.0 beta 1
 Environment: Using ccm on ubuntu
Reporter: Tyler Patterson
Assignee: Vijay
 Fix For: 1.2.0 beta 1

 Attachments: 0001-CASSANDRA-4567.patch, 0001-CASSANDRA-4567-v2.patch, 
 0001-CASSANDRA-4567-v3.patch


 Start a 2-node cluster on cassandra-1.1. Bring down one node, upgrade it to 
 trunk, start it back up. The following error shows up in the log:
 {code}
 ...
  INFO [main] 2012-08-22 10:44:40,012 CacheService.java (line 170) Scheduling row cache save to each 0 seconds (going to save all keys).
  INFO [SSTableBatchOpen:1] 2012-08-22 10:44:40,106 SSTableReader.java (line 164) Opening /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-2 (148 bytes)
  INFO [SSTableBatchOpen:2] 2012-08-22 10:44:40,106 SSTableReader.java (line 164) Opening /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-1 (226 bytes)
  INFO [SSTableBatchOpen:3] 2012-08-22 10:44:40,106 SSTableReader.java (line 164) Opening /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-3 (89 bytes)
 ERROR [SSTableBatchOpen:3] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 131) Exception in thread Thread[SSTableBatchOpen:3,5,main]
 java.lang.RuntimeException: Cannot open /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-3 because partitioner does not match org.apache.cassandra.dht.Murmur3Partitioner
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 ERROR [SSTableBatchOpen:2] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 131) Exception in thread Thread[SSTableBatchOpen:2,5,main]
 java.lang.RuntimeException: Cannot open /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-1 because partitioner does not match org.apache.cassandra.dht.Murmur3Partitioner
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 ERROR [SSTableBatchOpen:1] 2012-08-22 10:44:40,114 CassandraDaemon.java (line 131) Exception in thread Thread[SSTableBatchOpen:1,5,main]
 java.lang.RuntimeException: Cannot open /tmp/dtest-IYHWfV/test/node1/data/system/LocationInfo/system-LocationInfo-he-2 because partitioner does not match org.apache.cassandra.dht.Murmur3Partitioner
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:175)
 at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:149)
 at org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:236)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
 at java.util.concurrent.FutureTask.run(FutureTask.java:138)
 at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
  INFO 
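Editor's note: the refusal in the log above is deliberate. An sstable's keys are laid out in the token order of the partitioner that wrote them, so opening old RandomPartitioner data under a trunk default of Murmur3Partitioner would misplace every key; the safe response is to refuse the open, and on upgrade the operator must keep the original partitioner in cassandra.yaml. A hedged sketch of that validation (illustrative only, not Cassandra's API):

```java
public class PartitionerCheckSketch {
    // Hypothetical check: compare the partitioner recorded with the
    // sstable against the node's configured partitioner and refuse to
    // open on mismatch, mirroring the RuntimeException in the log.
    static void validate(String sstablePartitioner, String configuredPartitioner) {
        if (!configuredPartitioner.equals(sstablePartitioner))
            throw new RuntimeException(
                "Cannot open sstable because partitioner does not match "
                + configuredPartitioner);
    }

    public static void main(String[] args) {
        // Same partitioner: the open proceeds.
        validate("org.apache.cassandra.dht.RandomPartitioner",
                 "org.apache.cassandra.dht.RandomPartitioner");
        try {
            // Old data under a new default: refused rather than corrupted.
            validate("org.apache.cassandra.dht.RandomPartitioner",
                     "org.apache.cassandra.dht.Murmur3Partitioner");
        } catch (RuntimeException expected) {
            System.out.println("refused mismatched sstable");
        }
    }
}
```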

[jira] [Commented] (CASSANDRA-4572) lost+found directory in the data dir causes problems again

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445129#comment-13445129
 ] 

Jonathan Ellis commented on CASSANDRA-4572:
---

+1

 lost+found directory in the data dir causes problems again
 --

 Key: CASSANDRA-4572
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4572
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.0
Reporter: Brandon Williams
Assignee: Yuki Morishita
 Fix For: 1.1.5

 Attachments: 4572-1.1.txt


 Looks like we've regressed from CASSANDRA-1547 and mounting a fs directly on 
 the data dir is a problem again.
 {noformat}
  INFO [main] 2012-08-22 23:30:03,710 Directories.java (line 475) Upgrade from pre-1.1 version detected: migrating sstables to new directory layout
 ERROR [main] 2012-08-22 23:30:03,712 AbstractCassandraDaemon.java (line 370) Exception encountered during startup
 java.lang.NullPointerException
 at org.apache.cassandra.db.Directories.migrateSSTables(Directories.java:487)
 {noformat}
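Editor's note: a plausible reconstruction of this NPE, assuming the migration code iterates File.listFiles() without a null check. listFiles() returns null for entries that are not listable directories, such as a lost+found mount the process cannot read, and dereferencing that null aborts startup. The sketch below (invented names, not Cassandra's actual code) shows the guard that skips such entries instead:

```java
import java.io.File;

public class MigrationSketch {
    // Count the subdirectories of dataDir whose contents can be listed,
    // skipping anything File.listFiles() reports as null (plain files,
    // unreadable directories like lost+found) instead of crashing.
    static int listableSubdirs(File dataDir) {
        File[] entries = dataDir.listFiles();
        if (entries == null)
            return 0;              // the data dir itself is not listable
        int count = 0;
        for (File entry : entries) {
            File[] contents = entry.listFiles();
            if (contents == null)
                continue;          // skip lost+found-style entries
            count++;
        }
        return count;
    }

    public static void main(String[] args) throws Exception {
        File root = new File(System.getProperty("java.io.tmpdir"), "migration-sketch");
        new File(root, "Keyspace1").mkdirs();
        new File(root, "not-a-directory").createNewFile();
        // The plain file yields listFiles() == null and is skipped.
        System.out.println(listableSubdirs(root));
    }
}
```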

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF > N

2012-08-30 Thread Vijay (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445165#comment-13445165
 ] 

Vijay commented on CASSANDRA-4589:
--

I think it might be efficient to check this when we create a KS and remove a 
node? 
Else we need to check this for every write/read.

 writes are allowed when RF > N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF > N

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445169#comment-13445169
 ] 

Brandon Williams commented on CASSANDRA-4589:
-

It shouldn't be any different than checking if there are enough live nodes on 
every read/write.
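Editor's note: the point of the exchange above can be sketched as a per-request availability test. When RF exceeds the cluster size, some replicas simply do not exist, so at most min(RF, N) nodes can ever acknowledge a write; comparing that against the consistency level's requirement is the same O(1) check already done per read/write. Illustrative only, not Cassandra's actual code:

```java
public class AvailabilitySketch {
    // Hypothetical check: can a write gather requiredAcks acknowledgements
    // given the replication factor, cluster size, and live node count?
    static boolean sufficientLiveNodes(int rf, int clusterSize,
                                       int liveNodes, int requiredAcks) {
        // Replicas beyond the cluster size are phantoms and can never ack.
        int existingReplicas = Math.min(rf, clusterSize);
        int liveReplicas = Math.min(existingReplicas, liveNodes);
        return liveReplicas >= requiredAcks;
    }

    public static void main(String[] args) {
        // rf=123456 on a one-node cluster (the stress run in the ticket):
        // a QUORUM of 61729 acks is unattainable.
        int rf = 123456;
        System.out.println(sufficientLiveNodes(rf, 1, 1, rf / 2 + 1)); // prints false
        // rf=3 on three nodes with two alive still satisfies QUORUM(2).
        System.out.println(sufficientLiveNodes(3, 3, 2, 2));           // prints true
    }
}
```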

 writes are allowed when RF > N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4587) StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext

2012-08-30 Thread Yuki Morishita (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445173#comment-13445173
 ] 

Yuki Morishita commented on CASSANDRA-4587:
---

+1

 StackOverflowError in LeveledCompactionStrategy$LeveledScanner.computeNext
 --

 Key: CASSANDRA-4587
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4587
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.4
 Environment: debian
 OpenJDK 64-Bit Server VM/1.6.0_18
 Heap size: 8341422080/8342470656
Reporter: Christian Schnidrig
Assignee: Jonathan Ellis
Priority: Minor
  Labels: compaction
 Fix For: 1.1.5

 Attachments: 4587.txt


 while running nodetool repair, the following was logged in system.log:
 ERROR [ValidationExecutor:2] 2012-08-30 10:58:19,490 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[ValidationExecutor:2,1,main]
 java.lang.StackOverflowError
 at sun.nio.cs.UTF_8.updatePositions(UTF_8.java:76)
 at sun.nio.cs.UTF_8$Encoder.encodeArrayLoop(UTF_8.java:411)
 at sun.nio.cs.UTF_8$Encoder.encodeLoop(UTF_8.java:466)
 at java.nio.charset.CharsetEncoder.encode(CharsetEncoder.java:561)
 at java.lang.StringCoding$StringEncoder.encode(StringCoding.java:258)
 at java.lang.StringCoding.encode(StringCoding.java:290)
 at java.lang.String.getBytes(String.java:954)
 at java.io.RandomAccessFile.open(Native Method)
 at java.io.RandomAccessFile.<init>(RandomAccessFile.java:233)
 at org.apache.cassandra.io.util.RandomAccessReader.<init>(RandomAccessReader.java:67)
 at org.apache.cassandra.io.compress.CompressedRandomAccessReader.<init>(CompressedRandomAccessReader.java:64)
 at org.apache.cassandra.io.compress.CompressedRandomAccessReader.open(CompressedRandomAccessReader.java:46)
 at org.apache.cassandra.io.sstable.SSTableReader.openDataReader(SSTableReader.java:1007)
 at org.apache.cassandra.io.sstable.SSTableScanner.<init>(SSTableScanner.java:56)
 at org.apache.cassandra.io.sstable.SSTableBoundedScanner.<init>(SSTableBoundedScanner.java:41)
 at org.apache.cassandra.io.sstable.SSTableReader.getDirectScanner(SSTableReader.java:869)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:247)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 .
 (about 900 lines deleted)
 .
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:240)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:248)
 at org.apache.cassandra.db.compaction.LeveledCompactionStrategy$LeveledScanner.computeNext(LeveledCompactionStrategy.java:202)
 at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
 at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
 at org.apache.cassandra.utils.MergeIterator$Candidate.advance(MergeIterator.java:147)
 at org.apache.cassandra.utils.MergeIterator$ManyToOne.<init>(MergeIterator.java:90)
 at org.apache.cassandra.utils.MergeIterator.get(MergeIterator.java:47)
 at org.apache.cassandra.db.compaction.CompactionIterable.iterator(CompactionIterable.java:60)
 at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:703)
 at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:69)
   

[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF > N

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445176#comment-13445176
 ] 

Brandon Williams commented on CASSANDRA-4589:
-

This is a regression from CASSANDRA-2129

 writes are allowed when RF > N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[1/3] git commit: Merge branch 'cassandra-1.1' into trunk

2012-08-30 Thread jbellis
Updated Branches:
  refs/heads/cassandra-1.1 de039598f -> 60dadc5dc
  refs/heads/trunk 9cd53fba6 -> f471b927b


Merge branch 'cassandra-1.1' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f471b927
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f471b927
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f471b927

Branch: refs/heads/trunk
Commit: f471b927bbe1f2e7b30ea005b3331be951449c4d
Parents: 9cd53fb 60dadc5
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:43:24 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:43:24 2012 -0500

--
 conf/cassandra-env.sh |   10 +-
 1 files changed, 5 insertions(+), 5 deletions(-)
--




[2/3] git commit: heap defaults are pretty good now, change language to may wish to adjust

2012-08-30 Thread jbellis
heap defaults are pretty good now, change language to may wish to adjust


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/60dadc5d
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/60dadc5d
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/60dadc5d

Branch: refs/heads/trunk
Commit: 60dadc5dcbe470804f2dad153d3e1da64cac7540
Parents: de03959
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:40:22 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:40:22 2012 -0500

--
 conf/cassandra-env.sh |   10 +-
 1 files changed, 5 insertions(+), 5 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/60dadc5d/conf/cassandra-env.sh
--
diff --git a/conf/cassandra-env.sh b/conf/cassandra-env.sh
index ff3fc86..c2a1078 100644
--- a/conf/cassandra-env.sh
+++ b/conf/cassandra-env.sh
@@ -110,11 +110,11 @@ esac
 
 
 # Override these to set the amount of memory to allocate to the JVM at
-# start-up. For production use you almost certainly want to adjust
-# this for your environment. MAX_HEAP_SIZE is the total amount of
-# memory dedicated to the Java heap; HEAP_NEWSIZE refers to the size
-# of the young generation. Both MAX_HEAP_SIZE and HEAP_NEWSIZE should
-# be either set or not (if you set one, set the other).
+# start-up. For production use you may wish to adjust this for your
+# environment. MAX_HEAP_SIZE is the total amount of memory dedicated
+# to the Java heap; HEAP_NEWSIZE refers to the size of the young
+# generation. Both MAX_HEAP_SIZE and HEAP_NEWSIZE should be either set
+# or not (if you set one, set the other).
 #
 # The main trade-off for the young generation is that the larger it
 # is, the longer GC pause times will be. The shorter it is, the more



[3/3] git commit: heap defaults are pretty good now, change language to may wish to adjust

2012-08-30 Thread jbellis
heap defaults are pretty good now, change language to may wish to adjust


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/60dadc5d
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/60dadc5d
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/60dadc5d

Branch: refs/heads/cassandra-1.1
Commit: 60dadc5dcbe470804f2dad153d3e1da64cac7540
Parents: de03959
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:40:22 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:40:22 2012 -0500

--
 conf/cassandra-env.sh |   10 +-
 1 files changed, 5 insertions(+), 5 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/60dadc5d/conf/cassandra-env.sh
--
diff --git a/conf/cassandra-env.sh b/conf/cassandra-env.sh
index ff3fc86..c2a1078 100644
--- a/conf/cassandra-env.sh
+++ b/conf/cassandra-env.sh
@@ -110,11 +110,11 @@ esac
 
 
 # Override these to set the amount of memory to allocate to the JVM at
-# start-up. For production use you almost certainly want to adjust
-# this for your environment. MAX_HEAP_SIZE is the total amount of
-# memory dedicated to the Java heap; HEAP_NEWSIZE refers to the size
-# of the young generation. Both MAX_HEAP_SIZE and HEAP_NEWSIZE should
-# be either set or not (if you set one, set the other).
+# start-up. For production use you may wish to adjust this for your
+# environment. MAX_HEAP_SIZE is the total amount of memory dedicated
+# to the Java heap; HEAP_NEWSIZE refers to the size of the young
+# generation. Both MAX_HEAP_SIZE and HEAP_NEWSIZE should be either set
+# or not (if you set one, set the other).
 #
 # The main trade-off for the young generation is that the larger it
 # is, the longer GC pause times will be. The shorter it is, the more



[jira] [Created] (CASSANDRA-4590) The system cannot find the path specified when creating hard link on Windows

2012-08-30 Thread Allen Servedio (JIRA)
Allen Servedio created CASSANDRA-4590:
-

 Summary: The system cannot find the path specified when creating 
hard link on Windows
 Key: CASSANDRA-4590
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4590
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.3
 Environment: Windows 7 - 64 bit
Reporter: Allen Servedio
Priority: Minor


When upgrading from Cassandra 1.0.5 to 1.1.3, we have a test case (uses 
embedded Cassandra) that started failing as shown below. Other than the 
upgrade, no changes were made to the code or config. I believe this MAY be 
related to the change made in CASSANDRA-3101.

We verified that the file it is trying to create the hard link to does exist - 
so it is purely the creation of the link that is failing.

Here is the basic failure:

# [11:31:00.307] [ERROR] [o.a.c.u.CLibrary] [createHardLinkWithExec] [Unable to create hard link]
java.io.IOException: Exception while executing the command: cmd /c mklink /H C:\XYZ\source_code\s7-t1\crs-inventory\inventory-core\target\test\cassandra\data\RevKeyspace\PropertyProductDefaultInventoryCounts\snapshots\1346340659980-PropertyProductDefaultInventoryCounts\RevKeyspace-PropertyProductDefaultInventoryCounts-he-1-CompressionInfo.db C:\XYZ\source_code\s7-t1\crs-inventory\inventory-core\target\test\cassandra\data\RevKeyspace\PropertyProductDefaultInventoryCounts\RevKeyspace-PropertyProductDefaultInventoryCounts-he-1-CompressionInfo.db, command error Code: 1, command output: The system cannot find the path specified.
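Editor's note: a hedged reconstruction of the exec-based fallback the log shows (`cmd /c mklink /H <link> <existing>`). Method and class names below are invented for illustration. Passing each path as its own argv element, rather than splicing them into one command string, lets the runtime quote paths containing spaces; very deep snapshot paths can also exceed Windows' legacy 260-character path limit, which is one plausible reading of "cannot find the path specified".

```java
import java.io.File;
import java.util.Arrays;
import java.util.List;

public class HardLinkSketch {
    // Build the mklink hard-link command as an argv list. mklink takes the
    // new link name first, then the existing file, mirroring the log above.
    static List<String> mklinkCommand(File link, File existing) {
        return Arrays.asList("cmd", "/c", "mklink", "/H",
                             link.getAbsolutePath(), existing.getAbsolutePath());
    }

    public static void main(String[] args) {
        List<String> cmd = mklinkCommand(
                new File("snapshots/1/Data.db"), new File("Data.db"));
        // On Windows this list would be handed to ProcessBuilder(cmd).start();
        // here we only inspect the fixed prefix of the command line.
        System.out.println(cmd.subList(0, 4)); // prints [cmd, /c, mklink, /H]
    }
}
```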


Here is a more complete log output:


# [11:30:59.975] [DEBUG] [o.a.c.d.CollationController] [collectAllData] [collectAllData]
# [11:30:59.976] [DEBUG] [o.a.c.i.u.FileUtils] [deleteWithConfirm] [Deleting system-schema_columnfamilies-he-4-Digest.sha1]
# [11:30:59.977] [DEBUG] [o.a.c.i.u.FileUtils] [deleteWithConfirm] [Deleting system-schema_columnfamilies-he-4-Index.db]
# [11:30:59.978] [DEBUG] [o.a.c.i.u.FileUtils] [deleteWithConfirm] [Deleting system-schema_columnfamilies-he-4-Filter.db]
# [11:30:59.978] [DEBUG] [o.a.c.d.CollationController] [collectAllData] [collectAllData]
# [11:30:59.979] [DEBUG] [o.a.c.d.CollationController] [collectAllData] [collectAllData]
# [11:30:59.979] [DEBUG] [o.a.c.i.u.FileUtils] [deleteWithConfirm] [Deleting system-schema_columnfamilies-he-4-Statistics.db]
# [11:30:59.979] [DEBUG] [o.a.c.d.CollationController] [collectAllData] [collectAllData]
# [11:30:59.980] [DEBUG] [o.a.c.d.CollationController] [collectAllData] [collectAllData]
# [11:30:59.980] [DEBUG] [o.a.c.i.s.SSTable] [delete] [Deleted target\test\cassandra\data\system\schema_columnfamilies\system-schema_columnfamilies-he-4]
# [11:30:59.981] [INFO ] [o.a.c.d.ColumnFamilyStore] [maybeSwitchMemtable] [Enqueuing flush of Memtable-PropertyProductDefaultInventoryCounts@2002512083(74/92 serialized/live bytes, 1 ops)]
# [11:30:59.981] [INFO ] [o.a.c.d.Memtable] [writeSortedContents] [Writing Memtable-PropertyProductDefaultInventoryCounts@2002512083(74/92 serialized/live bytes, 1 ops)]
# [11:30:59.992] [DEBUG] [o.a.c.d.Directories] [getLocationWithMaximumAvailableSpace] [expected data files size is 134; largest free partition (target\test\cassandra\data\RevKeyspace\PropertyProductDefaultInventoryCounts) has 82645161984 bytes free]
# [11:31:00.012] [INFO ] [o.a.c.d.Memtable] [writeSortedContents] [Completed flushing target\test\cassandra\data\RevKeyspace\PropertyProductDefaultInventoryCounts\RevKeyspace-PropertyProductDefaultInventoryCounts-he-1-Data.db (123 bytes) for commitlog position ReplayPosition(segmentId=592725621297887, position=6701)]
# [11:31:00.013] [DEBUG] [o.a.c.u.I.IntervalNode] [init] [Creating IntervalNode from [Interval(DecoratedKey(70791399548943621833439300945136455431, 50726f706572747950726f6475637431323334), DecoratedKey(70791399548943621833439300945136455431, 50726f706572747950726f6475637431323334))]]
# [11:31:00.013] [DEBUG] [o.a.c.d.DataTracker] [addNewSSTablesSize] [adding target\test\cassandra\data\RevKeyspace\PropertyProductDefaultInventoryCounts\RevKeyspace-PropertyProductDefaultInventoryCounts-he-1 to list of files tracked for RevKeyspace.PropertyProductDefaultInventoryCounts]
# [11:31:00.014] [DEBUG] [o.a.c.d.c.CompactionManager] [submitBackground] [Scheduling a background task check for RevKeyspace.PropertyProductDefaultInventoryCounts with SizeTieredCompactionStrategy]
# [11:31:00.014] [DEBUG] [o.a.c.d.c.CompactionManager] [runMayThrow] [Checking RevKeyspace.PropertyProductDefaultInventoryCounts]
# [11:31:00.014] [DEBUG] [o.a.c.d.c.CommitLog] [call] [discard completed log segments for ReplayPosition(segmentId=592725621297887, position=6701), column family 1001]
# [11:31:00.014] [DEBUG] [o.a.c.d.c.SizeTieredCompactionStrategy] [getNextBackgroundTask] [Compaction buckets are 

[1/3] git commit: Merge branch 'cassandra-1.1' into trunk

2012-08-30 Thread jbellis
Updated Branches:
  refs/heads/cassandra-1.1 60dadc5dc -> b0342978a
  refs/heads/trunk f471b927b -> aff58e8ee


Merge branch 'cassandra-1.1' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/aff58e8e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/aff58e8e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/aff58e8e

Branch: refs/heads/trunk
Commit: aff58e8ee4415815766e10b842baf39eec209ef9
Parents: f471b92 b034297
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:46:06 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:46:06 2012 -0500

--
 CHANGES.txt|1 +
 .../db/compaction/LeveledCompactionStrategy.java   |   26 +-
 2 files changed, 10 insertions(+), 17 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/aff58e8e/CHANGES.txt
--
diff --cc CHANGES.txt
index ce7ec50,d3eafe4..5019369
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -1,53 -1,5 +1,54 @@@
 +1.2-dev
 + * new metrics (CASSANDRA-4009)
 + * redesign KEYS indexes to avoid read-before-write (CASSANDRA-2897)
 + * debug tracing (CASSANDRA-1123)
 + * parallelize row cache loading (CASSANDRA-4282)
 + * Make compaction, flush JBOD-aware (CASSANDRA-4292)
 + * run local range scans on the read stage (CASSANDRA-3687)
 + * clean up ioexceptions (CASSANDRA-2116)
 + * add disk_failure_policy (CASSANDRA-2118)
 + * Introduce new json format with row level deletion (CASSANDRA-4054)
 + * remove redundant name column from schema_keyspaces (CASSANDRA-4433)
 + * improve nodetool ring handling of multi-dc clusters (CASSANDRA-3047)
 + * update NTS calculateNaturalEndpoints to be O(N log N) (CASSANDRA-3881)
 + * add UseCondCardMark XX jvm settings on jdk 1.7 (CASSANDRA-4366)
 + * split up rpc timeout by operation type (CASSANDRA-2819)
 + * rewrite key cache save/load to use only sequential i/o (CASSANDRA-3762)
 + * update MS protocol with a version handshake + broadcast address id
 +   (CASSANDRA-4311)
 + * multithreaded hint replay (CASSANDRA-4189)
 + * add inter-node message compression (CASSANDRA-3127)
 + * remove COPP (CASSANDRA-2479)
 + * Track tombstone expiration and compact when tombstone content is
 +   higher than a configurable threshold, default 20% (CASSANDRA-3442, 4234)
 + * update MurmurHash to version 3 (CASSANDRA-2975)
 + * (CLI) track elapsed time for `delete' operation (CASSANDRA-4060)
 + * (CLI) jline version is bumped to 1.0 to properly  support
 +   'delete' key function (CASSANDRA-4132)
 + * Save IndexSummary into new SSTable 'Summary' component (CASSANDRA-2392, 
4289)
 + * Add support for range tombstones (CASSANDRA-3708)
 + * Improve MessagingService efficiency (CASSANDRA-3617)
 + * Avoid ID conflicts from concurrent schema changes (CASSANDRA-3794)
 + * Set thrift HSHA server thread limit to unlimited by default 
(CASSANDRA-4277)
 + * Avoids double serialization of CF id in RowMutation messages
 +   (CASSANDRA-4293)
 + * stream compressed sstables directly with java nio (CASSANDRA-4297)
 + * Support multiple ranges in SliceQueryFilter (CASSANDRA-3885)
 + * Add column metadata to system column families (CASSANDRA-4018)
 + * (cql3) Always use composite types by default (CASSANDRA-4329)
 + * (cql3) Add support for set, map and list (CASSANDRA-3647)
 + * Validate date type correctly (CASSANDRA-4441)
 + * (cql3) Allow definitions with only a PK (CASSANDRA-4361)
 + * (cql3) Add support for row key composites (CASSANDRA-4179)
 + * improve DynamicEndpointSnitch by using reservoir sampling (CASSANDRA-4038)
 + * (cql3) Add support for 2ndary indexes (CASSANDRA-3680)
 + * (cql3) fix defining more than one PK to be invalid (CASSANDRA-4477)
 + * remove schema agreement checking from all external APIs (Thrift, CQL and 
CQL3) (CASSANDRA-4487)
 + * add Murmur3Partitioner and make it default for new installations 
(CASSANDRA-3772)
 +
 +
  1.1.5
+  * avoid recursion in leveled compaction (CASSANDRA-4587)
   * increase stack size under Java7 to 180K
   * Log(info) schema changes (CASSANDRA-4547)
   * Change nodetool setcachecapcity to manipulate global caches 
(CASSANDRA-4563)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/aff58e8e/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
--



[3/3] git commit: avoid recursion in leveled compaction patch by jbellis; reviewed by yukim for CASSANDRA-4587

2012-08-30 Thread jbellis
avoid recursion in leveled compaction
patch by jbellis; reviewed by yukim for CASSANDRA-4587


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/b0342978
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/b0342978
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/b0342978

Branch: refs/heads/cassandra-1.1
Commit: b0342978a0a444b067fae25f4bf9a2f7e5dca0e3
Parents: 60dadc5
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:45:58 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:45:58 2012 -0500

--
 CHANGES.txt|1 +
 .../db/compaction/LeveledCompactionStrategy.java   |   26 +-
 2 files changed, 10 insertions(+), 17 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/b0342978/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index e8f277f..d3eafe4 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.5
+ * avoid recursion in leveled compaction (CASSANDRA-4587)
  * increase stack size under Java7 to 180K
  * Log(info) schema changes (CASSANDRA-4547)
  * Change nodetool setcachecapcity to manipulate global caches (CASSANDRA-4563)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/b0342978/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
--
diff --git a/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java b/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
index 7fb58be..d14545c 100644
--- a/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
+++ b/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
@@ -214,7 +214,8 @@ public class LeveledCompactionStrategy extends AbstractCompactionStrategy implem
 this.range = range;
 this.sstables = new ArrayList<SSTableReader>(sstables);
 Collections.sort(this.sstables, SSTable.sstableComparator);
-this.sstableIterator = this.sstables.iterator();
+sstableIterator = this.sstables.iterator();
+currentScanner = sstableIterator.next().getDirectScanner(range);
 
 long length = 0;
 for (SSTableReader sstable : sstables)
@@ -226,26 +227,17 @@ public class LeveledCompactionStrategy extends AbstractCompactionStrategy implem
 {
 try
 {
-if (currentScanner != null)
+while (true)
 {
 if (currentScanner.hasNext())
-{
 return currentScanner.next();
-}
-else
-{
-positionOffset += currentScanner.getLengthInBytes();
-currentScanner.close();
-currentScanner = null;
-return computeNext();
-}
-}
-
-if (!sstableIterator.hasNext())
-return endOfData();
 
-currentScanner = sstableIterator.next().getDirectScanner(range);
-return computeNext();
+positionOffset += currentScanner.getLengthInBytes();
+currentScanner.close();
+if (!sstableIterator.hasNext())
+return endOfData();
+currentScanner = sstableIterator.next().getDirectScanner(range);
+}
 }
 catch (IOException e)
 {

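The change above swaps the self-call in computeNext() for a while(true) loop:
with one recursive call per exhausted scanner, a compaction spanning many
near-empty sstables could grow the stack without bound. A minimal sketch of
the same recursion-to-loop pattern, using simplified stand-in types rather
than the actual Cassandra classes:

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Illustrative stand-in for the CASSANDRA-4587 fix: advance through a chain
// of sub-iterators ("scanners") iteratively instead of recursing once per
// exhausted sub-iterator, so stack depth stays at a single frame.
public class FlattenDemo {
    // Iterative "compute next" over a chain of sub-iterators. 'current' is a
    // one-element array used as a mutable holder for the active sub-iterator.
    static <T> T computeNext(Iterator<Iterator<T>> chain, Iterator<T>[] current) {
        while (true) {                          // loop instead of recursing
            if (current[0].hasNext())
                return current[0].next();
            if (!chain.hasNext())
                throw new NoSuchElementException(); // stand-in for endOfData()
            current[0] = chain.next();          // advance to the next scanner
        }
    }

    public static void main(String[] args) {
        List<Iterator<Integer>> parts = Arrays.asList(
            Arrays.<Integer>asList().iterator(),   // empty "scanner"
            Arrays.asList(1, 2).iterator(),
            Arrays.<Integer>asList().iterator(),
            Arrays.asList(3).iterator());
        Iterator<Iterator<Integer>> chain = parts.iterator();
        @SuppressWarnings("unchecked")
        Iterator<Integer>[] current = new Iterator[] { chain.next() };
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < 3; i++)
            out.append(computeNext(chain, current));
        System.out.println(out); // prints 123
    }
}
```

Either form visits the same elements; the loop just removes the per-scanner
stack frame the recursive version paid for each exhausted scanner.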


[2/3] git commit: avoid recursion in leveled compaction patch by jbellis; reviewed by yukim for CASSANDRA-4587

2012-08-30 Thread jbellis
avoid recursion in leveled compaction
patch by jbellis; reviewed by yukim for CASSANDRA-4587


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/b0342978
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/b0342978
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/b0342978

Branch: refs/heads/trunk
Commit: b0342978a0a444b067fae25f4bf9a2f7e5dca0e3
Parents: 60dadc5
Author: Jonathan Ellis jbel...@apache.org
Authored: Thu Aug 30 13:45:58 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Thu Aug 30 13:45:58 2012 -0500

--
 CHANGES.txt|1 +
 .../db/compaction/LeveledCompactionStrategy.java   |   26 +-
 2 files changed, 10 insertions(+), 17 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/b0342978/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index e8f277f..d3eafe4 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.5
+ * avoid recursion in leveled compaction (CASSANDRA-4587)
  * increase stack size under Java7 to 180K
  * Log(info) schema changes (CASSANDRA-4547)
  * Change nodetool setcachecapcity to manipulate global caches (CASSANDRA-4563)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/b0342978/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
--
diff --git a/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java b/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
index 7fb58be..d14545c 100644
--- a/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
+++ b/src/java/org/apache/cassandra/db/compaction/LeveledCompactionStrategy.java
@@ -214,7 +214,8 @@ public class LeveledCompactionStrategy extends AbstractCompactionStrategy implem
 this.range = range;
 this.sstables = new ArrayList<SSTableReader>(sstables);
 Collections.sort(this.sstables, SSTable.sstableComparator);
-this.sstableIterator = this.sstables.iterator();
+sstableIterator = this.sstables.iterator();
+currentScanner = sstableIterator.next().getDirectScanner(range);
 
 long length = 0;
 for (SSTableReader sstable : sstables)
@@ -226,26 +227,17 @@ public class LeveledCompactionStrategy extends AbstractCompactionStrategy implem
 {
 try
 {
-if (currentScanner != null)
+while (true)
 {
 if (currentScanner.hasNext())
-{
 return currentScanner.next();
-}
-else
-{
-positionOffset += currentScanner.getLengthInBytes();
-currentScanner.close();
-currentScanner = null;
-return computeNext();
-}
-}
-
-if (!sstableIterator.hasNext())
-return endOfData();
 
-currentScanner = sstableIterator.next().getDirectScanner(range);
-return computeNext();
+positionOffset += currentScanner.getLengthInBytes();
+currentScanner.close();
+if (!sstableIterator.hasNext())
+return endOfData();
+currentScanner = sstableIterator.next().getDirectScanner(range);
+}
 }
 catch (IOException e)
 {



[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445184#comment-13445184
 ] 

Brandon Williams commented on CASSANDRA-4589:
-

bq.  if we cannot Read or write to a KS why allow creating it?

Because that's how you add them for new DCs, and you might want to add this 
keyspace only to a new DC that hasn't bootstrapped in yet.

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Comment Edited] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445184#comment-13445184
 ] 

Brandon Williams edited comment on CASSANDRA-4589 at 8/31/12 5:49 AM:
--

bq.  if we cannot Read or write to a KS why allow creating it?

Because that's how you add them for new DCs, with an rf of zero until you have 
the nodes in place.

  was (Author: brandon.williams):
bq.  if we cannot Read or write to a KS why allow creating it?

Because that's how you add them for new DCs, and you might want to add this 
keyspace only to a new DC that hasn't bootstrapped in yet.
  
 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445188#comment-13445188
 ] 

Brandon Williams commented on CASSANDRA-4589:
-

Also, you can have the situation where RF=N, but then a node dies and you have 
to remove it (again, CASSANDRA-2129).  Now you can't write, obviously, but at a 
CL of ONE you should still be able to read.

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[1/3] git commit: Merge branch 'cassandra-1.1' into trunk

2012-08-30 Thread yukim
Updated Branches:
  refs/heads/cassandra-1.1 b0342978a -> 6e1f3a019
  refs/heads/trunk aff58e8ee -> 39fdebfd4


Merge branch 'cassandra-1.1' into trunk

Conflicts:
src/java/org/apache/cassandra/db/Directories.java


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/39fdebfd
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/39fdebfd
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/39fdebfd

Branch: refs/heads/trunk
Commit: 39fdebfd4917c40647c5f7db834d351d235562a0
Parents: aff58e8 6e1f3a0
Author: Yuki Morishita yu...@apache.org
Authored: Thu Aug 30 13:59:58 2012 -0500
Committer: Yuki Morishita yu...@apache.org
Committed: Thu Aug 30 13:59:58 2012 -0500

--
 src/java/org/apache/cassandra/db/Directories.java |   54 ++-
 1 files changed, 36 insertions(+), 18 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/39fdebfd/src/java/org/apache/cassandra/db/Directories.java
--
diff --cc src/java/org/apache/cassandra/db/Directories.java
index 895f931,7ee2823..714b363
--- a/src/java/org/apache/cassandra/db/Directories.java
+++ b/src/java/org/apache/cassandra/db/Directories.java
@@@ -532,21 -473,29 +532,29 @@@ public class Directorie
  {
  logger.info("Upgrade from pre-1.1 version detected: migrating sstables to new directory layout");
  
 -for (File location : dataFileLocations)
 +for (DataDirectory dir : dataFileLocations)
  {
 -if (!location.exists() || !location.isDirectory())
 +if (!dir.location.exists() || !dir.location.isDirectory())
  continue;
  
- for (File ksDir : dir.location.listFiles())
 -File[] ksDirs = location.listFiles();
++File[] ksDirs = dir.location.listFiles();
+ if (ksDirs != null)
  {
- if (!ksDir.isDirectory())
- continue;
+ for (File ksDir : ksDirs)
+ {
+ if (!ksDir.isDirectory())
+ continue;
  
- for (File file : ksDir.listFiles())
- migrateFile(file, ksDir, null);
+ File[] files = ksDir.listFiles();
+ if (files != null)
+ {
+ for (File file : files)
+ migrateFile(file, ksDir, null);
+ }
  
- migrateSnapshots(ksDir);
- migrateBackups(ksDir);
+ migrateSnapshots(ksDir);
+ migrateBackups(ksDir);
+ }
  }
  }
  }



[2/3] git commit: Fix NPE when listing directory; patch by yukim reviewed by jbellis for CASSANDRA-4572

2012-08-30 Thread yukim
Fix NPE when listing directory; patch by yukim reviewed by jbellis for
CASSANDRA-4572


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/6e1f3a01
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/6e1f3a01
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/6e1f3a01

Branch: refs/heads/trunk
Commit: 6e1f3a0195b777c9ae79ab89230b67ca20c1adc4
Parents: b034297
Author: Yuki Morishita yu...@apache.org
Authored: Wed Aug 29 11:05:51 2012 -0500
Committer: Yuki Morishita yu...@apache.org
Committed: Thu Aug 30 13:54:56 2012 -0500

--
 src/java/org/apache/cassandra/db/Directories.java |   55 ++--
 1 files changed, 36 insertions(+), 19 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/6e1f3a01/src/java/org/apache/cassandra/db/Directories.java
--
diff --git a/src/java/org/apache/cassandra/db/Directories.java b/src/java/org/apache/cassandra/db/Directories.java
index 9c9f9b8..7ee2823 100644
--- a/src/java/org/apache/cassandra/db/Directories.java
+++ b/src/java/org/apache/cassandra/db/Directories.java
@@ -31,7 +31,6 @@ import org.slf4j.LoggerFactory;
 import org.apache.cassandra.config.*;
 import org.apache.cassandra.db.compaction.LeveledManifest;
 import org.apache.cassandra.io.util.FileUtils;
-import org.apache.cassandra.io.util.MmappedSegmentedFile;
 import org.apache.cassandra.io.sstable.*;
 import org.apache.cassandra.service.StorageService;
 import org.apache.cassandra.utils.CLibrary;
@@ -479,16 +478,24 @@ public class Directories
 if (!location.exists() || !location.isDirectory())
 continue;
 
-for (File ksDir : location.listFiles())
+File[] ksDirs = location.listFiles();
+if (ksDirs != null)
 {
-if (!ksDir.isDirectory())
-continue;
+for (File ksDir : ksDirs)
+{
+if (!ksDir.isDirectory())
+continue;
 
-for (File file : ksDir.listFiles())
-migrateFile(file, ksDir, null);
+File[] files = ksDir.listFiles();
+if (files != null)
+{
+for (File file : files)
+migrateFile(file, ksDir, null);
+}
 
-migrateSnapshots(ksDir);
-migrateBackups(ksDir);
+migrateSnapshots(ksDir);
+migrateBackups(ksDir);
+}
 }
 }
 }
@@ -499,16 +506,23 @@ public class Directories
 if (!snapshotDir.exists())
 return;
 
-for (File snapshot : snapshotDir.listFiles())
+File[] snapshots = snapshotDir.listFiles();
+if (snapshots != null)
 {
-if (!snapshot.isDirectory())
-continue;
-
-for (File f : snapshot.listFiles())
-migrateFile(f, ksDir, join(SNAPSHOT_SUBDIR, snapshot.getName()));
+for (File snapshot : snapshots)
+{
+if (!snapshot.isDirectory())
+continue;
 
-if (!snapshot.delete())
-logger.info("Old snapsot directory {} not deleted by migraation as it is not empty", snapshot);
+File[] files = snapshot.listFiles();
+if (files != null)
+{
+for (File f : files)
+migrateFile(f, ksDir, join(SNAPSHOT_SUBDIR, snapshot.getName()));
+}
+if (!snapshot.delete())
+logger.info("Old snapsot directory {} not deleted by migraation as it is not empty", snapshot);
+}
 }
 if (!snapshotDir.delete())
 logger.info("Old directory {} not deleted by migration as it is not empty", snapshotDir);
@@ -520,9 +534,12 @@ public class Directories
 if (!backupDir.exists())
 return;
 
-for (File f : backupDir.listFiles())
-migrateFile(f, ksDir, BACKUPS_SUBDIR);
-
+File[] files = backupDir.listFiles();
+if (files != null)
+{
+for (File f : files)
+migrateFile(f, ksDir, BACKUPS_SUBDIR);
+}
 if (!backupDir.delete())
 logger.info("Old directory {} not deleted by migration as it is not empty", backupDir);
 }
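The root cause the patch above addresses is that File.listFiles() returns
null, not an empty array, when the path does not exist, is not a directory,
or an I/O error occurs. A minimal sketch of the guard pattern applied
throughout the patch, using a hypothetical helper rather than the real
Directories code:

```java
import java.io.File;

// Sketch of the CASSANDRA-4572 guard (hypothetical helper, not the actual
// Cassandra class): iterating listFiles() without a null check throws a
// NullPointerException whenever the listing fails.
public class ListFilesDemo {
    // Null-safe wrapper: callers always get an array they can iterate.
    static File[] listSafely(File dir) {
        File[] children = dir.listFiles();
        return children != null ? children : new File[0];
    }

    public static void main(String[] args) {
        // listFiles() yields null for a path that does not exist.
        File bogus = new File("no-such-directory-hopefully");
        System.out.println(listSafely(bogus).length); // prints 0
    }
}
```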



[3/3] git commit: Fix NPE when listing directory; patch by yukim reviewed by jbellis for CASSANDRA-4572

2012-08-30 Thread yukim
Fix NPE when listing directory; patch by yukim reviewed by jbellis for
CASSANDRA-4572


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/6e1f3a01
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/6e1f3a01
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/6e1f3a01

Branch: refs/heads/cassandra-1.1
Commit: 6e1f3a0195b777c9ae79ab89230b67ca20c1adc4
Parents: b034297
Author: Yuki Morishita yu...@apache.org
Authored: Wed Aug 29 11:05:51 2012 -0500
Committer: Yuki Morishita yu...@apache.org
Committed: Thu Aug 30 13:54:56 2012 -0500

--
 src/java/org/apache/cassandra/db/Directories.java |   55 ++--
 1 files changed, 36 insertions(+), 19 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/6e1f3a01/src/java/org/apache/cassandra/db/Directories.java
--
diff --git a/src/java/org/apache/cassandra/db/Directories.java b/src/java/org/apache/cassandra/db/Directories.java
index 9c9f9b8..7ee2823 100644
--- a/src/java/org/apache/cassandra/db/Directories.java
+++ b/src/java/org/apache/cassandra/db/Directories.java
@@ -31,7 +31,6 @@ import org.slf4j.LoggerFactory;
 import org.apache.cassandra.config.*;
 import org.apache.cassandra.db.compaction.LeveledManifest;
 import org.apache.cassandra.io.util.FileUtils;
-import org.apache.cassandra.io.util.MmappedSegmentedFile;
 import org.apache.cassandra.io.sstable.*;
 import org.apache.cassandra.service.StorageService;
 import org.apache.cassandra.utils.CLibrary;
@@ -479,16 +478,24 @@ public class Directories
 if (!location.exists() || !location.isDirectory())
 continue;
 
-for (File ksDir : location.listFiles())
+File[] ksDirs = location.listFiles();
+if (ksDirs != null)
 {
-if (!ksDir.isDirectory())
-continue;
+for (File ksDir : ksDirs)
+{
+if (!ksDir.isDirectory())
+continue;
 
-for (File file : ksDir.listFiles())
-migrateFile(file, ksDir, null);
+File[] files = ksDir.listFiles();
+if (files != null)
+{
+for (File file : files)
+migrateFile(file, ksDir, null);
+}
 
-migrateSnapshots(ksDir);
-migrateBackups(ksDir);
+migrateSnapshots(ksDir);
+migrateBackups(ksDir);
+}
 }
 }
 }
@@ -499,16 +506,23 @@ public class Directories
 if (!snapshotDir.exists())
 return;
 
-for (File snapshot : snapshotDir.listFiles())
+File[] snapshots = snapshotDir.listFiles();
+if (snapshots != null)
 {
-if (!snapshot.isDirectory())
-continue;
-
-for (File f : snapshot.listFiles())
-migrateFile(f, ksDir, join(SNAPSHOT_SUBDIR, snapshot.getName()));
+for (File snapshot : snapshots)
+{
+if (!snapshot.isDirectory())
+continue;
 
-if (!snapshot.delete())
-logger.info("Old snapsot directory {} not deleted by migraation as it is not empty", snapshot);
+File[] files = snapshot.listFiles();
+if (files != null)
+{
+for (File f : files)
+migrateFile(f, ksDir, join(SNAPSHOT_SUBDIR, snapshot.getName()));
+}
+if (!snapshot.delete())
+logger.info("Old snapsot directory {} not deleted by migraation as it is not empty", snapshot);
+}
 }
 if (!snapshotDir.delete())
 logger.info("Old directory {} not deleted by migration as it is not empty", snapshotDir);
@@ -520,9 +534,12 @@ public class Directories
 if (!backupDir.exists())
 return;
 
-for (File f : backupDir.listFiles())
-migrateFile(f, ksDir, BACKUPS_SUBDIR);
-
+File[] files = backupDir.listFiles();
+if (files != null)
+{
+for (File f : files)
+migrateFile(f, ksDir, BACKUPS_SUBDIR);
+}
 if (!backupDir.delete())
 logger.info("Old directory {} not deleted by migration as it is not empty", backupDir);
 }



[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445214#comment-13445214
 ] 

Jonathan Ellis commented on CASSANDRA-4589:
---

{{assureSufficientLiveNodes}} used to be what would fail the write if you asked 
for RF=1 and CL.QUORUM, for instance.

Did you actually try a higher CL?  I kinda think that if you are asking for 
CL.ONE then it is doing the Right Thing.

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams resolved CASSANDRA-4589.
-

Resolution: Not A Problem

bq. Did you actually try a higher CL? I kinda think that if you are asking for 
CL.ONE then it is doing the Right Thing.

Nope :/  It's doing the right thing and throwing UE at higher CLs.
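The rule settled on in this thread amounts to a simple availability check: a
request proceeds only when the number of live replicas is at least the number
the consistency level blocks for, regardless of how large RF is relative to
the cluster. An illustrative sketch with made-up names, not the actual
assureSufficientLiveNodes implementation:

```java
// Hedged sketch of the availability rule discussed in CASSANDRA-4589
// (hypothetical names, not Cassandra's real code): CL.ONE succeeds with a
// single live replica even when RF wildly exceeds the node count, while
// CL.QUORUM correctly fails (UnavailableException in the real system).
public class ClCheckDemo {
    enum CL { ONE, QUORUM, ALL }

    // Replicas the coordinator must hear from for the given level.
    static int blockFor(CL cl, int rf) {
        switch (cl) {
            case ONE:    return 1;
            case QUORUM: return rf / 2 + 1;
            default:     return rf;      // ALL
        }
    }

    static boolean sufficientLiveNodes(CL cl, int rf, int liveReplicas) {
        return liveReplicas >= blockFor(cl, rf);
    }

    public static void main(String[] args) {
        int liveNodes = 1, rf = 123456;  // the stress repro: RF >> N
        System.out.println(sufficientLiveNodes(CL.ONE, rf, liveNodes));    // true
        System.out.println(sufficientLiveNodes(CL.QUORUM, rf, liveNodes)); // false
    }
}
```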

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445242#comment-13445242
 ] 

Jonathan Ellis commented on CASSANDRA-4589:
---

Accepting a CL.ONE write, when there is >= 1 replica available, sounds like 
correct behavior to me.

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Vijay (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445261#comment-13445261
 ] 

Vijay commented on CASSANDRA-4589:
--

Just realized that i was commenting on a resolved ticket :) Thanks!

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Closed] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams closed CASSANDRA-4589.
---


 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4589) writes are allowed when RF>N

2012-08-30 Thread Vijay (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445260#comment-13445260
 ] 

Vijay commented on CASSANDRA-4589:
--

{quote}
that's how you add them for new DCs, with an rf of zero until you have the 
nodes in place
{quote}
if we do this patch then we will start failing reads and writes (because 
writeEndpoints will be > actual end points for the whole cluster) when the user 
is expanding the clusters to other DC's. 

{quote}
CL.ONE write, when there is >= 1 replica available, sounds like correct 
behavior to me.
{quote}
+1

 writes are allowed when RF>N
 

 Key: CASSANDRA-4589
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4589
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.2
Reporter: Brandon Williams
Assignee: Vijay
Priority: Minor
 Fix For: 1.2.0


 Easily repro'd by running stress with a ridiculous rf:
 # tools/bin/cassandra-stress -n 10 -i 1 -o insert -l123456
 We're supposed to allow creation of a ks where the rf exceeds the amount of 
 nodes, but we shouldn't be able to write to it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (CASSANDRA-4591) Pass -umask option to jsvc

2012-08-30 Thread Nick Bailey (JIRA)
Nick Bailey created CASSANDRA-4591:
--

 Summary: Pass -umask option to jsvc
 Key: CASSANDRA-4591
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4591
 Project: Cassandra
  Issue Type: Improvement
  Components: Packaging
Affects Versions: 1.1.4
Reporter: Nick Bailey
Priority: Minor
 Fix For: 1.1.5, 1.2.0


Currently jsvc defaults to a very restrictive umask. This makes it hard for 
external tools to work with cassandra data files (snapshots). It would be 
useful to pass in a -umask option to jsvc with slightly less restrictive 
permissions (just adding group read permissions).

It should just be passing 'umask 0037' (u=rwx,g=r,o=), in the debian init 
script.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-4591) Pass -umask option to jsvc

2012-08-30 Thread Nick Bailey (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445292#comment-13445292
 ] 

Nick Bailey commented on CASSANDRA-4591:


Correction, should be 0027, which gives group execute permissions, since 
directories can't be read without them.
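The arithmetic behind that correction, as a quick sketch (plain bit math,
nothing Cassandra-specific): a directory created with the default mode 0777
under umask 0037 ends up 0740, dropping the group execute bit the group needs
to traverse it, while umask 0027 yields 0750:

```java
// Illustrates why the umask correction matters: group execute (0010) on a
// directory is what allows group members to enter it, and umask 0037 masks
// that bit away while 0027 keeps it.
public class UmaskDemo {
    // Mode of a newly created directory: default 0777 masked by the umask.
    static int dirMode(int umask) {
        return 0777 & ~umask;
    }

    public static void main(String[] args) {
        System.out.println(Integer.toOctalString(dirMode(0037))); // prints 740
        System.out.println(Integer.toOctalString(dirMode(0027))); // prints 750
        // group execute bit present only under umask 0027
        System.out.println((dirMode(0037) & 0010) != 0); // prints false
        System.out.println((dirMode(0027) & 0010) != 0); // prints true
    }
}
```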




[jira] [Created] (CASSANDRA-4592) CQL COPY TO command doesn't work with python cassandra-dbapi2

2012-08-30 Thread Tyler Patterson (JIRA)
Tyler Patterson created CASSANDRA-4592:
--

 Summary: CQL COPY TO command doesn't work with python 
cassandra-dbapi2
 Key: CASSANDRA-4592
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4592
 Project: Cassandra
  Issue Type: Bug
 Environment: ubuntu, dtest, ccm cluster, cassandra 1.1.4
Reporter: Tyler Patterson


Running the COPY TO command produces the below error, though running the same 
COPY TO command in cqlsh in the exact same cluster works fine:
{code}
==
ERROR: copyto_test.TestCopyTo.copy_to_test
--
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/home/tahooie/datastax/cassandra-dtest/tools.py", line 132, in wrapped
    f(obj)
  File "/home/tahooie/datastax/cassandra-dtest/copyto_test.py", line 33, in copy_to_test
    cursor_v2.execute(copy_from_cql)
  File "/usr/local/lib/python2.7/dist-packages/cql/cursor.py", line 117, in execute
    response = self.handle_cql_execution_errors(doquery, prepared_q, compress)
  File "/usr/local/lib/python2.7/dist-packages/cql/cursor.py", line 134, in handle_cql_execution_errors
    raise cql.ProgrammingError("Bad Request: %s" % ire.why)
ProgrammingError: Bad Request: line 1:0 no viable alternative at input 'COPY'

--
Ran 1 test in 8.239s
{code}

Here is an example of the COPY TO command: 
{code}
COPY airplanes TO '/tmp/tmpOCdbFt';
{code}

This is the dtest used to produce the error:
{code}
import os
import tempfile
import time

from dtest import Tester, debug
from tools import *

class TestCopyTo(Tester):

    @since('1.1')
    def copy_to_test(self):
        self.num_rows = 0
        cluster = self.cluster

        cluster.populate(2).start()
        time.sleep(1)
        node1 = cluster.nodelist()[0]

        cursor_v2 = self.cql_connection(node1).cursor()
        self.create_ks(cursor_v2, 'ks', 2)
        cursor_v2.execute("""
            CREATE TABLE airplanes (
                name text PRIMARY KEY,
                manufacturer ascii,
                year int,
                mach float
            );
        """)
        cursor_v2.execute("INSERT INTO airplanes (name, manufacturer, year, "
                          "mach) VALUES ('P38-Lightning', 'Lockheed', 1937, '.7')")

        fn = tempfile.NamedTemporaryFile().name

        copy_from_cql = "COPY airplanes TO '%s'" % fn
        cursor_v2.execute(copy_from_cql)
{code}



[jira] [Resolved] (CASSANDRA-4592) CQL COPY TO command doesn't work with python cassandra-dbapi2

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams resolved CASSANDRA-4592.
-

Resolution: Invalid

COPY is cqlsh sugar, not a cql command.
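Since COPY is implemented inside cqlsh, a client using the python driver has to shell out instead of calling execute(). A minimal sketch (flags, host, and keyspace are illustrative; assumes cqlsh is on PATH and a node is reachable):

```python
import subprocess

def cqlsh_argv(host="127.0.0.1", keyspace="ks"):
    """Build a cqlsh invocation; the statement itself is fed on stdin."""
    return ["cqlsh", "-k", keyspace, host]

def run_copy(statement, host="127.0.0.1", keyspace="ks"):
    # cqlsh reads statements from stdin when it is not a tty;
    # this requires a running node, so it is not executed here.
    proc = subprocess.Popen(cqlsh_argv(host, keyspace),
                            stdin=subprocess.PIPE)
    proc.communicate(statement.encode())
    return proc.returncode
```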




[jira] [Commented] (CASSANDRA-4592) CQL COPY TO command doesn't work with python cassandra-dbapi2

2012-08-30 Thread Tyler Patterson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445319#comment-13445319
 ] 

Tyler Patterson commented on CASSANDRA-4592:


doh!




[jira] [Commented] (CASSANDRA-4544) fix acknowledge_by for batch_mutate

2012-08-30 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445339#comment-13445339
 ] 

Aleksey Yeschenko commented on CASSANDRA-4544:
--

I think it also makes sense to differentiate between an UnavailableException 
thrown during the batchlog write and a UE thrown after it.

 fix acknowledge_by for batch_mutate
 ---

 Key: CASSANDRA-4544
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4544
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Jonathan Ellis
Assignee: Aleksey Yeschenko
Priority: Minor

 CASSANDRA-4414 added TimedOutException.acknowledged_by, but for a batch write 
 it will send back the acknowledged_by for a random row, which usually does not 
 reflect the status of the batch as a whole.  We should override this to -1 in 
 that case.



[jira] [Commented] (CASSANDRA-4544) fix acknowledge_by for batch_mutate

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445370#comment-13445370
 ] 

Jonathan Ellis commented on CASSANDRA-4544:
---

But if the batchlog write is successful, it should return TOE, not UE.

UE means "I stopped because I knew the replicas were down before I wrote 
anything."

TOE means "the write is in progress, but isn't complete yet."

Once the batchlog write starts, the write is in progress, so we can't return UE 
anymore.
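The rule can be sketched as a toy model; the exception names match Thrift's, but everything else here is hypothetical scaffolding, not StorageProxy code:

```python
class UnavailableException(Exception): pass
class TimedOutException(Exception): pass

def batch_mutate(live_replicas, required, acks_received):
    # Refuse up front if we already know too few replicas are alive.
    if live_replicas < required:
        raise UnavailableException()   # nothing was written -> UE
    # Past this point the batchlog write has started: the batch is
    # in flight, so a failure must surface as a timeout, never UE.
    if acks_received < required:
        raise TimedOutException()      # in progress, not complete -> TOE
    return "acknowledged"
```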




[jira] [Updated] (CASSANDRA-4548) Mutation response(WriteResponse.java) could be smaller and not contain keyspace and key

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4548:
--

 Reviewer: brandon.williams
 Priority: Minor  (was: Major)
Affects Version/s: (was: 1.2.0 beta 1)
Fix Version/s: 1.2.0 beta 1
 Assignee: Jonathan Ellis

 Mutation response(WriteResponse.java) could be smaller and not contain 
 keyspace and key
 ---

 Key: CASSANDRA-4548
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4548
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: sankalp kohli
Assignee: Jonathan Ellis
Priority: Minor
  Labels: network
 Fix For: 1.2.0 beta 1


 In the mutation response, a WriteResponse.java object is sent back to the 
 co-ordinator. This object contains the keyspace and key, which are not 
 required and are not used at the co-ordinator. 
 This wastes I/O, especially over WAN links between DCs. Since the response 
 from each node in a multi-DC deployment goes back to a co-ordinator in 
 another DC, it is even worse. 
 It is worse still if the keyspace and key are large and the data is small: 
 on a node that is not the co-ordinator and is purely receiving mutations, 
 the outbound network bandwidth could be half of the incoming bandwidth.
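A back-of-envelope model of that half-bandwidth claim (byte counts are illustrative, not Cassandra's actual wire format):

```python
# Illustrative sizes: long keyspace name, 64-byte key, small 32-byte value.
keyspace, key, value = b"a" * 17, b"k" * 64, b"v" * 32

incoming = len(keyspace) + len(key) + len(value)   # mutation received
outgoing = len(keyspace) + len(key) + 1            # response echoing ks + key

# The reply alone is already well over half the size of the mutation;
# dropping keyspace and key shrinks it to a bare acknowledgement.
assert outgoing > incoming // 2
```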



[jira] [Updated] (CASSANDRA-4548) Mutation response(WriteResponse.java) could be smaller and not contain keyspace and key

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4548:
--

Attachment: 4548.txt

Attached.  (The boolean isn't used either; we don't bother sending responses on 
failure, letting them time out.)




[jira] [Updated] (CASSANDRA-4537) We should emit number of sstables in each level from JMX

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4537:
--

 Reviewer: jbellis
Affects Version/s: (was: 1.2.0 beta 1)
   1.0.0
Fix Version/s: 1.2.1
 Assignee: Yuki Morishita

 We should emit number of sstables in each level from JMX
 

 Key: CASSANDRA-4537
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4537
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.0.0
Reporter: sankalp kohli
Assignee: Yuki Morishita
Priority: Minor
  Labels: compaction, leveled
 Fix For: 1.2.1

   Original Estimate: 12h
  Remaining Estimate: 12h

 We should add methods to this MBean: 
 org.apache.cassandra.db.ColumnFamilyStoreMBean
 These metrics will be helpful to see how sstables are distributed across 
 different levels and how they move to higher levels over time. 
 Currently we can see this by looking at the json file, but with JMX we can 
 monitor the historic values over time using any monitoring tool.  
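Until those MBean methods exist, a per-level count can be derived from the leveled manifest on disk. A rough sketch; the JSON shape shown is an assumption and may differ across versions:

```python
import json

def sstables_per_level(manifest_text):
    """Count sstable members per generation in a leveled-manifest-style doc."""
    manifest = json.loads(manifest_text)
    return {g["generation"]: len(g["members"])
            for g in manifest.get("generations", [])}

# Hypothetical manifest content for illustration.
sample = ('{"generations": [{"generation": 0, "members": [10, 11]},'
          ' {"generation": 1, "members": [12]}]}')
```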



[jira] [Updated] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Ben Frank (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ben Frank updated CASSANDRA-4593:
-

Priority: Major  (was: Critical)

 Reading the ByteBuffer key from a map job causes an infinite fetch loop
 ---

 Key: CASSANDRA-4593
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4593
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.1.2
Reporter: Ben Frank

 Reading the ByteBuffer key from a map job empties the buffer. One of these 
 key buffers is later used in ColumnFamilyRecordReader to figure out the last 
 token that was received, which is then used as the start point to fetch more 
 rows. With a now-empty buffer, the token defaults to the start of the range, 
 and thus the end of the data is never reached.



[jira] [Created] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Ben Frank (JIRA)
Ben Frank created CASSANDRA-4593:


 Summary: Reading the ByteBuffer key from a map job causes an 
infinite fetch loop
 Key: CASSANDRA-4593
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4593
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.1.2
Reporter: Ben Frank
Priority: Critical


Reading the ByteBuffer key from a map job empties the buffer. One of these key 
buffers is later used in ColumnFamilyRecordReader to figure out the last token 
that was received, which is then used as the start point to fetch more rows. 
With a now-empty buffer, the token defaults to the start of the range, and thus 
the end of the data is never reached.



[jira] [Updated] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Ben Frank (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ben Frank updated CASSANDRA-4593:
-

Attachment: cassandra-1.1-4593.txt

patch against the cassandra-1.1 branch




[jira] [Resolved] (CASSANDRA-4516) OutboundTcpConnection could drop outgoing messages and not log it.

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-4516.
---

Resolution: Not A Problem

added to ConnectionMetrics in CASSANDRA-4009

 OutboundTcpConnection could drop outgoing messages and not log it. 
 ---

 Key: CASSANDRA-4516
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4516
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.4
Reporter: sankalp kohli
Priority: Minor

 Since there is one connection between two nodes and all writes are handled by 
 a single thread, there is a chance that a message gets old enough that it is 
 dropped in OutboundTcpConnection. These dropped messages do not get logged 
 by MessagingService. 
 We should definitely log these. 



[jira] [Comment Edited] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Ben Frank (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445404#comment-13445404
 ] 

Ben Frank edited comment on CASSANDRA-4593 at 8/31/12 10:12 AM:


patch against the cassandra-1.1 branch attached.
This does a mark on the buffer, saves off the token value, and then resets the 
buffer back to the mark. Downstream users then aren't able to affect the 
operation of the iterator. 



  was (Author: airlust):
patch against the cassandra-1.1 branch
  



[jira] [Created] (CASSANDRA-4594) COPY TO and COPY FROM don't default to consistent ordering of columns

2012-08-30 Thread Tyler Patterson (JIRA)
Tyler Patterson created CASSANDRA-4594:
--

 Summary: COPY TO and COPY FROM don't default to consistent 
ordering of columns
 Key: CASSANDRA-4594
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4594
 Project: Cassandra
  Issue Type: Bug
 Environment: Happens in CQLSH 2, may or may not happen in CQLSH 3
Reporter: Tyler Patterson
Assignee: Brandon Williams
Priority: Minor


Here is the input:
{code} 
CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND 
strategy_options:replication_factor = 1;
USE test;   

CREATE TABLE airplanes (
name text PRIMARY KEY,  
manufacturer ascii, 
year int,   
mach float  
);  

INSERT INTO airplanes (name, manufacturer, year, mach) VALUES ('P38-Lightning', 
'Lockheed', 1937, '.7');

COPY airplanes TO 'temp.cfg' WITH HEADER=true;  

TRUNCATE airplanes;

COPY airplanes FROM 'temp.cfg' WITH HEADER=true;

SELECT * FROM airplanes;
{code}

Here is what happens when executed. Note how it tried to import the float into 
the int column:
{code}
cqlsh:test> DROP KEYSPACE test;
cqlsh:test> CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND
        ... strategy_options:replication_factor = 1;
cqlsh:test> USE test;
cqlsh:test>
cqlsh:test> CREATE TABLE airplanes (
        ... name text PRIMARY KEY,
        ... manufacturer ascii,
        ... year int,
        ... mach float
        ... );
cqlsh:test>
cqlsh:test> INSERT INTO airplanes (name, manufacturer, year, mach) VALUES
        ... ('P38-Lightning', 'Lockheed', 1937, '.7');
cqlsh:test>
cqlsh:test> COPY airplanes TO 'temp.cfg' WITH HEADER=true;
1 rows exported in 0.003 seconds.
cqlsh:test> TRUNCATE airplanes;
cqlsh:test>
cqlsh:test> COPY airplanes FROM 'temp.cfg' WITH HEADER=true;
Bad Request: unable to make int from '0.7'
Aborting import at record #0 (line 1). Previously-inserted values still present.
0 rows imported in 0.002 seconds.
{code}
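Pairing values with column names via the header, rather than by position, is the behavior HEADER=true should guarantee. A sketch of that round trip with Python's csv module (illustrative of the desired behavior, not cqlsh's implementation):

```python
import csv
import io

row = {"name": "P38-Lightning", "manufacturer": "Lockheed",
       "year": "1937", "mach": "0.7"}

buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=["name", "manufacturer", "year", "mach"])
writer.writeheader()
writer.writerow(row)

buf.seek(0)
# DictReader pairs each value with its header, so column order is irrelevant.
restored = next(csv.DictReader(buf))
assert restored["year"] == "1937" and restored["mach"] == "0.7"
```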



[jira] [Resolved] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-4593.
---

Resolution: Invalid

You're not free to destructively mutate the key buffer.  Use positional reads 
or duplicate() it.
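For illustration only, Python's BytesIO shows the same destructive-read pitfall and the mark/save/reset remedy (BytesIO stands in for Java's ByteBuffer here; the mapping to mark()/reset() is an analogy, not Cassandra code):

```python
import io

key = io.BytesIO(b"last-token")

mark = key.tell()        # ByteBuffer.mark()
token = key.read()       # destructive: consumes the stream, like get()
key.seek(mark)           # ByteBuffer.reset(): later readers are unaffected

assert token == b"last-token"
assert key.read() == b"last-token"   # the iterator still sees the full key
```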




[jira] [Created] (CASSANDRA-4595) Nodetool commands like scrub uses hard coded data directory path to be /var/lib/cassandra

2012-08-30 Thread sankalp kohli (JIRA)
sankalp kohli created CASSANDRA-4595:


 Summary: Nodetool commands like scrub uses hard coded data 
directory path to be /var/lib/cassandra
 Key: CASSANDRA-4595
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4595
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.4
Reporter: sankalp kohli
Priority: Minor


If your data directory is not /var/lib/cassandra, scrub and other nodetool 
commands won't work. This should not be hard-coded, but rather picked up from 
the yaml. 

We have to create sym links to get around it. 
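Any external tool needing the data directories should pick them up from cassandra.yaml. A deliberately naive parse of the standard data_file_directories list (assumes the plain "- /path" form; a real tool would use a yaml library):

```python
def data_dirs_from_yaml(text):
    """Extract data_file_directories entries from cassandra.yaml text."""
    dirs, in_section = [], False
    for line in text.splitlines():
        if line.startswith("data_file_directories:"):
            in_section = True
        elif in_section and line.strip().startswith("- "):
            dirs.append(line.strip()[2:])
        elif in_section and line.strip():
            break  # any other non-blank line ends the list
    return dirs

sample = ("data_file_directories:\n"
          "    - /mnt/md0/cassandra/data\n"
          "commitlog_directory: /mnt/md0/commitlog\n")
```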



[jira] [Updated] (CASSANDRA-4594) COPY TO and COPY FROM don't default to consistent ordering of columns

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4594:


Reviewer: brandon.williams




[jira] [Assigned] (CASSANDRA-4594) COPY TO and COPY FROM don't default to consistent ordering of columns

2012-08-30 Thread Brandon Williams (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams reassigned CASSANDRA-4594:
---

Assignee: paul cannon  (was: Brandon Williams)




[jira] [Commented] (CASSANDRA-4595) Nodetool commands like scrub uses hard coded data directory path to be /var/lib/cassandra

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445421#comment-13445421
 ] 

Jonathan Ellis commented on CASSANDRA-4595:
---

Nodetool just sends JMX commands to the server, it never tries to access data 
directories itself.

I further note that I don't see /var/lib/cassandra hardcoded anywhere in the 
source tree.


 Nodetool commands like scrub uses hard coded data directory path to be 
 /var/lib/cassandra
 -

 Key: CASSANDRA-4595
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4595
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.4
Reporter: sankalp kohli
Priority: Minor
  Labels: nodetool
   Original Estimate: 24h
  Remaining Estimate: 24h

 If your data directory is not /var/lib/cassandra, scrub and other nodetool
 commands won't work. The path should not be hard-coded, but rather picked up
 from the yaml.
 We have to create symlinks to get around it.



[jira] [Commented] (CASSANDRA-4544) fix acknowledge_by for batch_mutate

2012-08-30 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445423#comment-13445423
 ] 

Aleksey Yeschenko commented on CASSANDRA-4544:
--

You are right.
It's just that I called responseHandler.assureSufficientLiveNodes() for each
mutation before writing to the batchlog, but had another check after the
batchlog write (in sendToHintedEndpoints, which throws UnavailableException).
Now both checks happen before the batchlog write (for atomic_batch_mutate), so
the question is irrelevant.
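As a schematic sketch of the ordering described above (hypothetical names, not the actual StorageProxy code): every availability check now precedes the batchlog write, so UnavailableException can only surface before anything has been logged.

```java
import java.util.ArrayList;
import java.util.List;

public class AtomicBatchOrderSketch {
    static class UnavailableException extends RuntimeException {}

    // Schematic only: check liveness for every mutation first, then write the
    // batchlog, then send to replicas. No availability check remains after
    // the batchlog write, so a logged batch cannot fail as unavailable.
    static List<String> atomicBatchMutate(List<String> mutations, int live, int required) {
        List<String> order = new ArrayList<>();
        for (String m : mutations)
            if (live < required)              // stand-in for assureSufficientLiveNodes()
                throw new UnavailableException();
        order.add("batchlog-write");          // stand-in for the batchlog write
        for (String m : mutations)
            order.add("send:" + m);           // stand-in for sendToHintedEndpoints()
        return order;
    }

    public static void main(String[] args) {
        System.out.println(atomicBatchMutate(List.of("m1", "m2"), 3, 2));
        // [batchlog-write, send:m1, send:m2]
    }
}
```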

 fix acknowledge_by for batch_mutate
 ---

 Key: CASSANDRA-4544
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4544
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Jonathan Ellis
Assignee: Aleksey Yeschenko
Priority: Minor

 CASSANDRA-4414 added TimedOutException.acknowledged_by, but a batch write
 sends back the acknowledged_by for a random row, which usually does not
 reflect the status of the batch as a whole. We should override this to -1 in
 that case.



[jira] [Commented] (CASSANDRA-4548) Mutation response(WriteResponse.java) could be smaller and not contain keyspace and key

2012-08-30 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445428#comment-13445428
 ] 

Brandon Williams commented on CASSANDRA-4548:
-

+1

 Mutation response(WriteResponse.java) could be smaller and not contain 
 keyspace and key
 ---

 Key: CASSANDRA-4548
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4548
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: sankalp kohli
Assignee: Jonathan Ellis
Priority: Minor
  Labels: network
 Fix For: 1.2.0 beta 1

 Attachments: 4548.txt


 In the mutation response, a WriteResponse.java object is sent back to the
 coordinator. This object contains the keyspace and key, which are not
 required and are not used at the coordinator.
 This wastes I/O, especially over WAN links between data centers; it is even
 worse in multi-DC deployments, where the response from each node travels back
 to a coordinator in another DC.
 It is also worse when the keyspace and key are large and the data is small:
 in that case, a node that is not the coordinator and is purely receiving
 mutations could see outbound network bandwidth equal to half its incoming
 bandwidth.



[jira] [Commented] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Ben Frank (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445432#comment-13445432
 ] 

Ben Frank commented on CASSANDRA-4593:
--

Fair enough, I was just doing a byteBuffer.getLong(), which I knew advances
the position, but I didn't really consider it destructive. I can't believe I'm
the only person caught out by this. Is there some documentation I've missed,
or a relevant wiki page I should update with this information?
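For anyone hitting the same trap: reading a primitive from a ByteBuffer advances its position, so a later consumer of the same buffer sees no remaining bytes. A minimal sketch of the non-destructive alternative, using duplicate() (shared content, independent position):

```java
import java.nio.ByteBuffer;

public class KeyReadDemo {
    public static void main(String[] args) {
        ByteBuffer key = ByteBuffer.allocate(8);
        key.putLong(42L);
        key.flip();                        // position=0, limit=8: ready to read

        // Non-destructive read: duplicate() shares the bytes but keeps its
        // own position, leaving the original buffer intact for later
        // consumers (e.g. code extracting the last token seen).
        long safe = key.duplicate().getLong();
        System.out.println(safe + " remaining=" + key.remaining());  // 42 remaining=8

        // Destructive read: getLong() advances this buffer's position to 8,
        // so anything inspecting it afterwards finds it empty.
        long destructive = key.getLong();
        System.out.println(destructive + " remaining=" + key.remaining());  // 42 remaining=0
    }
}
```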

 Reading the ByteBuffer key from a map job causes an infinite fetch loop
 ---

 Key: CASSANDRA-4593
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4593
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.1.2
Reporter: Ben Frank
 Attachments: cassandra-1.1-4593.txt


 Reading the ByteBuffer key from a map job empties the buffer. One of these 
 key buffers is later used in ColumnFamilyRecordReader to figure out the last 
 token that was received, then using that as a start point to fetch more rows. 
 With a now empty buffer, the token defaults to the start of the range and 
 thus the end of the data is never reached.



[jira] [Commented] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445438#comment-13445438
 ] 

Jonathan Ellis commented on CASSANDRA-4593:
---

Not really.  We mostly intend CFRR to be used by Pig and Hive and not manually.

 Reading the ByteBuffer key from a map job causes an infinite fetch loop
 ---

 Key: CASSANDRA-4593
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4593
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.1.2
Reporter: Ben Frank
 Attachments: cassandra-1.1-4593.txt


 Reading the ByteBuffer key from a map job empties the buffer. One of these 
 key buffers is later used in ColumnFamilyRecordReader to figure out the last 
 token that was received, then using that as a start point to fetch more rows. 
 With a now empty buffer, the token defaults to the start of the range and 
 thus the end of the data is never reached.



[jira] [Created] (CASSANDRA-4596) thrift call to get_paged_slice() hangs if end token is 0

2012-08-30 Thread Normen Seemann (JIRA)
Normen Seemann created CASSANDRA-4596:
-

 Summary: thrift call to get_paged_slice() hangs if end token is 0
 Key: CASSANDRA-4596
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4596
 Project: Cassandra
  Issue Type: Bug
  Components: API, Core, Hadoop
Affects Versions: 1.1.3
 Environment: linux
Reporter: Normen Seemann


I am using get_paged_slice() for range scans driven from within Hadoop
mappers. The mapper that scans the last segment with get_paged_slice(), where
the start key is set *and* end_token is set to the 0 token, hangs inside the
Thrift call. The client shows the following jstack:

 - java.net.SocketInputStream.socketRead0(java.io.FileDescriptor, byte[], int, 
int, int) @bci=0 (Interpreted frame)
 - java.net.SocketInputStream.read(byte[], int, int) @bci=84, line=129 
(Interpreted frame)
 - org.apache.thrift.transport.TIOStreamTransport.read(byte[], int, int) 
@bci=25, line=127 (Interpreted frame)
 - org.apache.thrift.transport.TTransport.readAll(byte[], int, int) @bci=22, 
line=84 (Interpreted frame)
 - org.apache.thrift.transport.TFramedTransport.readFrame() @bci=10, line=129 
(Interpreted frame)
 - org.apache.thrift.transport.TFramedTransport.read(byte[], int, int) @bci=28, 
line=101 (Interpreted frame)
 - org.apache.thrift.transport.TTransport.readAll(byte[], int, int) @bci=22, 
line=84 (Interpreted frame)
 - org.apache.thrift.protocol.TBinaryProtocol.readAll(byte[], int, int) 
@bci=12, line=378 (Interpreted frame)
 - org.apache.thrift.protocol.TBinaryProtocol.readI32() @bci=52, line=297 
(Interpreted frame)
 - org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin() @bci=1, 
line=204 (Interpreted frame)
 - org.apache.thrift.TServiceClient.receiveBase(org.apache.thrift.TBase, 
java.lang.String) @bci=4, line=69 (Interpreted frame)
 - org.apache.cassandra.thrift.Cassandra$Client.recv_get_paged_slice() @bci=12, 
line=727 (Interpreted frame)
 - 
org.apache.cassandra.thrift.Cassandra$Client.get_paged_slice(java.lang.String, 
org.apache.cassandra.thrift.KeyRange, java.nio.ByteBuffer, 
org.apache.cassandra.thrift.ConsistencyLevel) @bci=10, line=711 (Interpreted 
frame)

Changing the end_token from 0 to 2**127-1 fixes the problem; however, I would
only consider this a workaround. There are actually two issues here:

1.) Is the call to get_paged_slice() as I described it supported at all?
2.) If it is not supported, please fail with a reasonable error instead of
just hanging.
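For reference, the value in the workaround above is the largest RandomPartitioner token (tokens lie in [0, 2^127)); a quick way to compute the exact string, should anyone need it:

```java
import java.math.BigInteger;

public class MaxRandomPartitionerToken {
    public static void main(String[] args) {
        // 2**127 - 1, the end_token value the workaround substitutes for "0".
        BigInteger maxToken = BigInteger.ONE.shiftLeft(127).subtract(BigInteger.ONE);
        System.out.println(maxToken); // 170141183460469231731687303715884105727
    }
}
```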



[jira] [Commented] (CASSANDRA-4206) AssertionError: originally calculated column size of 629444349 but now it is 588008950

2012-08-30 Thread Tyler Hobbs (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445572#comment-13445572
 ] 

Tyler Hobbs commented on CASSANDRA-4206:


I'm not sure how useful it is at this point, but just for the record I'm seeing 
what appears to be the same issue with 1.0.7 (which means it shouldn't be 
CASSANDRA-3579):

{noformat}
 INFO [GossipTasks:1] 2012-08-27 14:22:50,242 Gossiper.java (line 818) 
InetAddress /xx.xx.178.59 is now dead.
 INFO [GossipStage:1] 2012-08-27 14:22:59,090 Gossiper.java (line 804) 
InetAddress /xx.xx.178.59 is now UP
 INFO [HintedHandoff:1] 2012-08-27 14:23:41,548 HintedHandOffManager.java (line 
296) Started hinted handoff for token: 132332031580364958013534569556798748899 
with IP: /xx.xx.178.59
 INFO [HintedHandoff:1] 2012-08-27 14:23:41,870 ColumnFamilyStore.java (line 
704) Enqueuing flush of Memtable-HintsColumnFamily@2081164539(597050/47764000 
serialized/live bytes, 857 ops)
 INFO [FlushWriter:181] 2012-08-27 14:23:41,870 Memtable.java (line 246) 
Writing Memtable-HintsColumnFamily@2081164539(597050/47764000 serialized/live 
bytes, 857 ops)
 INFO [FlushWriter:181] 2012-08-27 14:23:41,959 Memtable.java (line 283) 
Completed flushing 
/xx/xx/xx/cassandra/datafile/system/HintsColumnFamily-hc-6730-Data.db (624946 
bytes)
 INFO [CompactionExecutor:884] 2012-08-27 14:23:41,961 CompactionTask.java 
(line 113) Compacting 
[SSTableReader(path='/xx/xx/xx/cassandra/datafile/system/HintsColumnFamily-hc-6729-Data.db'),
 
SSTableReader(path='/ngs/app/xcardp/cassandra/datafile/system/HintsColumnFamily-hc-6730-Data.db')]
 INFO [CompactionExecutor:884] 2012-08-27 14:23:41,987 
CompactionController.java (line 133) Compacting large row 
system/HintsColumnFamily:31372e33342e3137382e3534 (274816343 bytes) 
incrementally
ERROR [CompactionExecutor:884] 2012-08-27 14:23:56,322 
AbstractCassandraDaemon.java (line 139) Fatal exception in thread 
Thread[CompactionExecutor:884,1,main]
java.lang.AssertionError: originally calculated column size of 197713629 but 
now it is 197711561
at 
org.apache.cassandra.db.compaction.LazilyCompactedRow.write(LazilyCompactedRow.java:124)
at 
org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:160)
at 
org.apache.cassandra.db.compaction.CompactionTask.execute(CompactionTask.java:159)
at 
org.apache.cassandra.db.compaction.CompactionManager$6.call(CompactionManager.java:275)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
ERROR [HintedHandoff:1] 2012-08-27 14:23:56,323 AbstractCassandraDaemon.java 
(line 139) Fatal exception in thread Thread[HintedHandoff:1,1,main]
java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
java.lang.AssertionError: originally calculated column size of 197713629 but 
now it is 197711561
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpointInternal(HintedHandOffManager.java:369)
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:248)
at 
org.apache.cassandra.db.HintedHandOffManager.access$200(HintedHandOffManager.java:84)
at 
org.apache.cassandra.db.HintedHandOffManager$3.runMayThrow(HintedHandOffManager.java:416)
at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: java.util.concurrent.ExecutionException: java.lang.AssertionError: 
originally calculated column size of 197713629 but now it is 197711561
at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
at java.util.concurrent.FutureTask.get(FutureTask.java:83)
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpointInternal(HintedHandOffManager.java:365)
... 7 more
Caused by: java.lang.AssertionError: originally calculated column size of 
197713629 but now it is 197711561
at 
org.apache.cassandra.db.compaction.LazilyCompactedRow.write(LazilyCompactedRow.java:124)
at 
org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:160)
at 
org.apache.cassandra.db.compaction.CompactionTask.execute(CompactionTask.java:159)
at 
org.apache.cassandra.db.compaction.CompactionManager$6.call(CompactionManager.java:275)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
... 3 more
ERROR [HintedHandoff:1] 2012-08-27 14:23:56,345
{noformat}

[jira] [Commented] (CASSANDRA-4594) COPY TO and COPY FROM don't default to consistent ordering of columns

2012-08-30 Thread paul cannon (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445584#comment-13445584
 ] 

paul cannon commented on CASSANDRA-4594:


This is because a {{select * from airplanes;}} does not give the columns in the 
order they were defined. I'm not sure why not; if that's a bug in C*, then we 
should fix that. If there isn't supposed to be any expectation of order, then 
cqlsh should be inspecting the columns and specifying them explicitly.
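One way to remove the dependence on SELECT * ordering is for cqlsh to name the columns explicitly in the statements it generates. A rough sketch under that assumption (the metadata source here is a plain list and the helper name is illustrative, not cqlsh's internals):

```java
import java.util.List;

public class ExplicitCopyColumns {
    // Build a COPY statement that names its columns in the order the table
    // defined them, so export and import agree no matter how the server
    // orders a SELECT *.
    static String copyTo(String table, List<String> columns, String file) {
        return "COPY " + table + " (" + String.join(", ", columns)
                + ") TO '" + file + "' WITH HEADER=true;";
    }

    public static void main(String[] args) {
        List<String> cols = List.of("name", "manufacturer", "year", "mach");
        System.out.println(copyTo("airplanes", cols, "temp.cfg"));
        // COPY airplanes (name, manufacturer, year, mach) TO 'temp.cfg' WITH HEADER=true;
    }
}
```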

 COPY TO and COPY FROM don't default to consistent ordering of columns
 -

 Key: CASSANDRA-4594
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4594
 Project: Cassandra
  Issue Type: Bug
 Environment: Happens in CQLSH 2, may or may not happen in CQLSH 3
Reporter: Tyler Patterson
Assignee: paul cannon
Priority: Minor

 Here is the input:
 {code}
 CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND
 strategy_options:replication_factor = 1;
 USE test;
 CREATE TABLE airplanes (
 name text PRIMARY KEY,
 manufacturer ascii,
 year int,
 mach float
 );
 INSERT INTO airplanes (name, manufacturer, year, mach) VALUES
 ('P38-Lightning', 'Lockheed', 1937, '.7');
 COPY airplanes TO 'temp.cfg' WITH HEADER=true;
 TRUNCATE airplanes;
 COPY airplanes FROM 'temp.cfg' WITH HEADER=true;
 SELECT * FROM airplanes;
 {code}
 Here is what happens when executed. Note how it tried to import the float 
 into the int column:
 {code}
 cqlsh:test> DROP KEYSPACE test;
 cqlsh:test> CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND
         ... strategy_options:replication_factor = 1;
 cqlsh:test> USE test;
 cqlsh:test> CREATE TABLE airplanes (
         ... name text PRIMARY KEY,
         ... manufacturer ascii,
         ... year int,
         ... mach float
         ... );
 cqlsh:test> INSERT INTO airplanes (name, manufacturer, year, mach) VALUES
         ... ('P38-Lightning', 'Lockheed', 1937, '.7');
 cqlsh:test> COPY airplanes TO 'temp.cfg' WITH HEADER=true;
 1 rows exported in 0.003 seconds.
 cqlsh:test> TRUNCATE airplanes;
 cqlsh:test> COPY airplanes FROM 'temp.cfg' WITH HEADER=true;
 Bad Request: unable to make int from '0.7'
 Aborting import at record #0 (line 1). Previously-inserted values still present.
 0 rows imported in 0.002 seconds.
 {code}



[jira] [Updated] (CASSANDRA-4594) COPY TO and COPY FROM don't default to consistent ordering of columns

2012-08-30 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4594:
--

Fix Version/s: 1.2.0

 COPY TO and COPY FROM don't default to consistent ordering of columns
 -

 Key: CASSANDRA-4594
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4594
 Project: Cassandra
  Issue Type: Bug
 Environment: Happens in CQLSH 2, may or may not happen in CQLSH 3
Reporter: Tyler Patterson
Assignee: paul cannon
Priority: Minor
 Fix For: 1.2.0


 Here is the input:
 {code}
 CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND
 strategy_options:replication_factor = 1;
 USE test;
 CREATE TABLE airplanes (
 name text PRIMARY KEY,
 manufacturer ascii,
 year int,
 mach float
 );
 INSERT INTO airplanes (name, manufacturer, year, mach) VALUES
 ('P38-Lightning', 'Lockheed', 1937, '.7');
 COPY airplanes TO 'temp.cfg' WITH HEADER=true;
 TRUNCATE airplanes;
 COPY airplanes FROM 'temp.cfg' WITH HEADER=true;
 SELECT * FROM airplanes;
 {code}
 Here is what happens when executed. Note how it tried to import the float 
 into the int column:
 {code}
 cqlsh:test> DROP KEYSPACE test;
 cqlsh:test> CREATE KEYSPACE test WITH strategy_class = 'SimpleStrategy' AND
         ... strategy_options:replication_factor = 1;
 cqlsh:test> USE test;
 cqlsh:test> CREATE TABLE airplanes (
         ... name text PRIMARY KEY,
         ... manufacturer ascii,
         ... year int,
         ... mach float
         ... );
 cqlsh:test> INSERT INTO airplanes (name, manufacturer, year, mach) VALUES
         ... ('P38-Lightning', 'Lockheed', 1937, '.7');
 cqlsh:test> COPY airplanes TO 'temp.cfg' WITH HEADER=true;
 1 rows exported in 0.003 seconds.
 cqlsh:test> TRUNCATE airplanes;
 cqlsh:test> COPY airplanes FROM 'temp.cfg' WITH HEADER=true;
 Bad Request: unable to make int from '0.7'
 Aborting import at record #0 (line 1). Previously-inserted values still present.
 0 rows imported in 0.002 seconds.
 {code}



[jira] [Commented] (CASSANDRA-4593) Reading the ByteBuffer key from a map job causes an infinite fetch loop

2012-08-30 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13445627#comment-13445627
 ] 

Jeremy Hanna commented on CASSANDRA-4593:
-

It may be worth adding to the MapReduce or Troubleshooting section of 
http://wiki.apache.org/cassandra/HadoopSupport.  We were bitten by something 
like this at a previous job and it was hard to track down.

 Reading the ByteBuffer key from a map job causes an infinite fetch loop
 ---

 Key: CASSANDRA-4593
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4593
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.1.2
Reporter: Ben Frank
 Attachments: cassandra-1.1-4593.txt


 Reading the ByteBuffer key from a map job empties the buffer. One of these 
 key buffers is later used in ColumnFamilyRecordReader to figure out the last 
 token that was received, then using that as a start point to fetch more rows. 
 With a now empty buffer, the token defaults to the start of the range and 
 thus the end of the data is never reached.
