[jira] [Commented] (CASSANDRA-6431) Prevent same CF from being enqueued to flush more than once

2013-12-03 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837439#comment-13837439
 ] 

Sylvain Lebresne commented on CASSANDRA-6431:
-

I might be misunderstanding the suggestion, but I would say that blocking 
writes when they come in faster than we are able to flush is a feature (to 
avoid OOM), even if all writes go to the same sstable. That is, it could be 
that we're too aggressive in blocking writes in some cases because our 
heuristic for "writes are faster than we can flush" is not good enough, but 
it's not entirely clear to me what not queuing 2 memtables for the same CF 
achieves (outside of potentially having the memtable we don't queue grow 
unbounded and OOM us, that is).

 Prevent same CF from being enqueued to flush more than once
 ---

 Key: CASSANDRA-6431
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6431
 Project: Cassandra
  Issue Type: Bug
Reporter: Benedict
Assignee: Benedict
Priority: Minor

 As things stand we can, in certain circumstances, fill up the flush queue 
 with multiple requests to flush the same CF, which will lead to all writes 
 blocking until the CF is flushed. Ideally we would only enqueue each 
 CF/Memtable once and, if it is asked to flush whilst already enqueued, 
 mark it to be requeued once the outstanding flush completes.
 On a related note, a single table can already block writes if it has 
 flush-queue-size or more secondary indexes. While we're at it, it might be 
 worth deciding if this is also a problem and addressing it.
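The requeue-once behaviour described above amounts to a small per-CF state machine. A minimal sketch of the idea (class and method names are illustrative, not Cassandra's actual internals):
{noformat}
import java.util.concurrent.atomic.AtomicReference;

// Tracks whether a CF's memtable flush is already enqueued, and whether
// another flush request arrived while one was outstanding.
public class FlushGate
{
    private enum State { IDLE, ENQUEUED, ENQUEUED_AND_REQUEUE }
    private final AtomicReference<State> state = new AtomicReference<>(State.IDLE);

    // Returns true if the caller should actually submit a flush task.
    public boolean tryEnqueue()
    {
        while (true)
        {
            State s = state.get();
            if (s == State.IDLE)
            {
                if (state.compareAndSet(s, State.ENQUEUED))
                    return true;  // first request: submit the flush
            }
            else if (s == State.ENQUEUED)
            {
                if (state.compareAndSet(s, State.ENQUEUED_AND_REQUEUE))
                    return false; // remember the request instead of enqueueing twice
            }
            else
            {
                return false;     // already marked for requeue
            }
        }
    }

    // Called when the outstanding flush finishes; returns true if a new
    // flush must now be submitted on behalf of the remembered request.
    public boolean onFlushComplete()
    {
        if (state.compareAndSet(State.ENQUEUED, State.IDLE))
            return false;
        return state.compareAndSet(State.ENQUEUED_AND_REQUEUE, State.ENQUEUED);
    }
}
{noformat}
With this, the flush queue holds at most one entry per CF, and a CF asked to flush while already queued is flushed again exactly once after the outstanding flush completes.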



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6218) Reduce WAN traffic while doing repairs

2013-12-03 Thread sankalp kohli (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837447#comment-13837447
 ] 

sankalp kohli commented on CASSANDRA-6218:
--

As per my first comment, we need a way to run repair among specified endpoints. 
I think with this change you can specify data centers, so it will help people 
who have 3 or more DCs, but not those with 2 DCs. 

 Reduce WAN traffic while doing repairs
 --

 Key: CASSANDRA-6218
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6218
 Project: Cassandra
  Issue Type: New Feature
  Components: Core, Tools
Reporter: sankalp kohli
Assignee: Jimmy Mårdell
Priority: Minor
 Fix For: 2.0.4

 Attachments: trunk-6218-v2.txt, trunk-6218-v3.patch, trunk-6218.txt


 The way we send out data that does not match over the WAN can be improved. 
 Example: say there are four nodes (A,B,C,D) which are replicas of a range we 
 are repairing. A,B are in DC1 and C,D are in DC2. If A does not have data 
 which the other replicas have, then we will have the following streams:
 1) A to B and back
 2) A to C and back (goes over the WAN)
 3) A to D and back (goes over the WAN)
 One way of doing it that reduces WAN traffic is this:
 1) Repair A and B only with each other, and C and D with each other, starting 
 at the same time t. 
 2) Once these repairs have finished, A,B and C,D are in sync with respect to 
 time t. 
 3) Now run a repair between A and C; the streams which are exchanged as a 
 result of the diff will also be streamed to B and D via A and C (A and C 
 behave like proxies for the streams).
 For a replication of DC1:2,DC2:2, the WAN traffic will be reduced by 50%, and 
 even more for higher replication factors. 
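 To make that figure concrete: in the example above, A's missing data crosses 
 the WAN twice (once to C, once to D); with the proxying scheme only the single 
 A-C exchange crosses the WAN and the data is forwarded to D locally, so 2 WAN 
 streams become 1, a 50% reduction. With DC1:3,DC2:3, roughly 3 WAN exchanges 
 would collapse to 1, about a 67% reduction.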
  



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6435) nodetool outputs xss and jamm errors in 1.2.12

2013-12-03 Thread Sam Tunnicliffe (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837453#comment-13837453
 ] 

Sam Tunnicliffe commented on CASSANDRA-6435:


This is a partial duplicate of CASSANDRA-6404 (partial because that ticket only 
addresses the jamm ERRORs; the echo of the xss... string is a separate issue).

 nodetool outputs xss and jamm errors in 1.2.12
 --

 Key: CASSANDRA-6435
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6435
 Project: Cassandra
  Issue Type: Bug
Reporter: Karl Mueller
Assignee: Brandon Williams
Priority: Minor

 Since 1.2.12, just running nodetool produces this output. This is probably 
 related to CASSANDRA-6273.
 It's unclear to me whether jamm is actually not being loaded, but nodetool 
 clearly should not be producing this output, which likely comes from 
 cassandra-env.sh:
 [cassandra@dev-cass00 cassandra]$ /data2/cassandra/bin/nodetool ring
 xss =  -ea -javaagent:/data2/cassandra/bin/../lib/jamm-0.2.5.jar 
 -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms14G -Xmx14G -Xmn1G 
 -XX:+HeapDumpOnOutOfMemoryError -Xss256k
 Note: Ownership information does not include topology; for complete 
 information, specify a keyspace
 Datacenter: datacenter1
 ==
 Address      Rack    Status  State   Load        Owns    Token
                                                          170141183460469231731687303715884105727
 10.93.15.10  rack1   Up      Normal  123.82 GB   20.00%  34028236692093846346337460743176821145
 10.93.15.11  rack1   Up      Normal  124 GB      20.00%  68056473384187692692674921486353642290
 10.93.15.12  rack1   Up      Normal  123.97 GB   20.00%  102084710076281539039012382229530463436
 10.93.15.13  rack1   Up      Normal  124.03 GB   20.00%  136112946768375385385349842972707284581
 10.93.15.14  rack1   Up      Normal  123.93 GB   20.00%  170141183460469231731687303715884105727
 ERROR 16:20:01,408 Unable to initialize MemoryMeter (jamm not specified as 
 javaagent).  This means Cassandra will be unable to measure object sizes 
 accurately and may consequently OOM.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (CASSANDRA-5074) Add an official way to disable compaction

2013-12-03 Thread Ngoc Minh Vo (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837004#comment-13837004
 ] 

Ngoc Minh Vo edited comment on CASSANDRA-5074 at 12/3/13 9:23 AM:
--

Thanks a lot for your quick answer.
It is indeed very weird. I downloaded the binary for Windows from this address 
last week:
http://archive.apache.org/dist/cassandra/2.0.3/

I will recheck it tomorrow ...
(edit)
The cqlsh shows this:
{quote}
[cqlsh 4.1.0 | Cassandra 2.0.3 | CQL spec 3.1.1 | Thrift protocol 19.38.0]
{quote}




was (Author: vongocminh):
Thanks a lot for your quick answer.
It is indeed very weird. I downloaded the binary for Windows from this address 
last week:
http://archive.apache.org/dist/cassandra/2.0.3/

I will recheck it tomorrow ...

 Add an official way to disable compaction
 -

 Key: CASSANDRA-5074
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5074
 Project: Cassandra
  Issue Type: Bug
Reporter: Jonathan Ellis
Assignee: Marcus Eriksson
Priority: Minor
 Fix For: 2.0 beta 1

 Attachments: 
 0001-CASSANDRA-5074-make-it-possible-to-disable-autocompa.patch, 
 0001-CASSANDRA-5074-v2.patch


 We've traditionally used min or max compaction threshold = 0 to disable 
 compaction, but this isn't exactly intuitive and it's inconsistently 
 implemented -- allowed from jmx, not allowed from cli.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6431) Prevent same CF from being enqueued to flush more than once

2013-12-03 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837531#comment-13837531
 ] 

Benedict commented on CASSANDRA-6431:
-

Yeah, I've been thinking about this and realise we probably need to split 
cfs.forceFlush() into cfs.forceFlushNow() and cfs.forceEnqueueFlush(), the 
former used only for flushes that are due to memory pressure, and the latter 
for any other reason.

I don't want to go too crazy on this, as I'll need to change it altogether 
very soon for CASSANDRA-5549 (perhaps split off into another ticket), but 
this should be reasonably manageable.

 Prevent same CF from being enqueued to flush more than once
 ---

 Key: CASSANDRA-6431
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6431
 Project: Cassandra
  Issue Type: Bug
Reporter: Benedict
Assignee: Benedict
Priority: Minor

 As things stand we can, in certain circumstances, fill up the flush queue 
 with multiple requests to flush the same CF, which will lead to all writes 
 blocking until the CF is flushed. Ideally we would only enqueue each 
 CF/Memtable once and, if it is asked to flush whilst already enqueued, 
 mark it to be requeued once the outstanding flush completes.
 On a related note, a single table can already block writes if it has 
 flush-queue-size or more secondary indexes. While we're at it, it might be 
 worth deciding if this is also a problem and addressing it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-4880) Endless loop flushing+compacting system/schema_keyspaces and system/schema_columnfamilies

2013-12-03 Thread Wei-dun Teng (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837624#comment-13837624
 ] 

Wei-dun Teng commented on CASSANDRA-4880:
-

I have encountered a similar bug after upgrading one of my 1.2.2 nodes to 1.2.12.
I was using FreeBSD 8.2 + diablo-jre-1.6.0.07.02_18.

Before upgrading the node to 1.2.12, I changed the JVM to openjdk-7.25.15_2 (in 
retrospect probably not a good idea ...), saw flipping Memtable flushes, 
changed the JVM back to diablo-jre-1.6.0.07.02_18, and still saw rapid flushes. 
Now I've taken the 1.2.12 node offline (but not decommissioned it).

After that I saw tens of Memtable flushes per second on the 1.2.12 node, while 
only once or twice a second on the nodes running 1.2.2.

Attached files show the flipping schema versions.

 Endless loop flushing+compacting system/schema_keyspaces and 
 system/schema_columnfamilies
 -

 Key: CASSANDRA-4880
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4880
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 1.1.6, 1.2.0 beta 1
 Environment: Linux x86_64 3.4.9, sun-jdk 1.6.0_33
Reporter: Mina Naguib
Assignee: Pavel Yaskevich
 Fix For: 1.1.7, 1.2.0 beta 3

 Attachments: 131203-schema-1.txt, 131203-schema-2.txt, 
 CASSANDRA-4880-fix.patch, CASSANDRA-4880.patch


 After upgrading a node from 1.1.2 to 1.1.6, the startup sequence entered a 
 loop as seen here:
 http://mina.naguib.ca/misc/cassandra_116_startup_loop.txt
 Stopping and starting the node entered the same loop.
 Reverting back to 1.1.2 started successfully.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


git commit: Secondary index support for collections

2013-12-03 Thread slebresne
Updated Branches:
  refs/heads/trunk 57516e082 -> d12a0d7b0


Secondary index support for collections

patch by slebresne; reviewed by iamaleksey for CASSANDRA-4511
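
For context, the kind of query this commit enables looks roughly like this 
(illustrative schema, not taken from the patch):
{noformat}
CREATE TABLE users (id int PRIMARY KEY, emails set<text>);
CREATE INDEX ON users (emails);

-- the new CONTAINS relation added by this commit:
SELECT * FROM users WHERE emails CONTAINS 'foo@bar.com';
{noformat}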


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/d12a0d7b
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/d12a0d7b
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/d12a0d7b

Branch: refs/heads/trunk
Commit: d12a0d7b0299786bf1d0484f3770bae6a94cb0c9
Parents: 57516e0
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Thu Nov 14 09:17:51 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:49:02 2013 +0100

--
 CHANGES.txt |   1 +
 src/java/org/apache/cassandra/cql3/Cql.g|   4 +
 .../org/apache/cassandra/cql3/Relation.java |  15 ++-
 .../cql3/statements/CreateIndexStatement.java   |  21 +++-
 .../cassandra/cql3/statements/Restriction.java  | 111 +-
 .../cql3/statements/SelectStatement.java|  71 ++--
 .../apache/cassandra/db/ColumnFamilyStore.java  |   2 +-
 .../apache/cassandra/db/IndexExpression.java|  19 +++-
 .../cassandra/db/filter/ExtendedFilter.java |  47 ++--
 .../AbstractSimplePerColumnSecondaryIndex.java  |  13 ++-
 .../db/index/SecondaryIndexSearcher.java|   2 +-
 .../db/index/composites/CompositesIndex.java|  50 -
 .../CompositesIndexOnCollectionKey.java | 112 +++
 .../CompositesIndexOnCollectionValue.java   | 110 ++
 .../db/index/composites/CompositesSearcher.java |  21 +++-
 15 files changed, 566 insertions(+), 33 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/d12a0d7b/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 3bc50ac..08c3a67 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -14,6 +14,7 @@
  * User-defined types for CQL3 (CASSANDRA-5590)
  * Use of o.a.c.metrics in nodetool (CASSANDRA-5871, 6406)
  * Batch read from OTC's queue and cleanup (CASSANDRA-1632)
+ * Secondary index support for collections (CASSANDRA-4511)
 
 
 2.0.4

http://git-wip-us.apache.org/repos/asf/cassandra/blob/d12a0d7b/src/java/org/apache/cassandra/cql3/Cql.g
--
diff --git a/src/java/org/apache/cassandra/cql3/Cql.g 
b/src/java/org/apache/cassandra/cql3/Cql.g
index 325d6f6..fb0054d 100644
--- a/src/java/org/apache/cassandra/cql3/Cql.g
+++ b/src/java/org/apache/cassandra/cql3/Cql.g
@@ -947,6 +947,8 @@ relation[List<Relation> clauses]
         { $clauses.add(new Relation(name, Relation.Type.IN, marker)); }
     | name=cident K_IN { Relation rel = Relation.createInRelation($name.id); }
        '(' ( f1=term { rel.addInValue(f1); } (',' fN=term { rel.addInValue(fN); } )* )? ')' { $clauses.add(rel); }
+    | name=cident K_CONTAINS { Relation.Type rt = Relation.Type.CONTAINS; } /* (K_KEY { rt = Relation.Type.CONTAINS_KEY })? */
+      t=term { $clauses.add(new Relation(name, rt, t)); }
     | '(' relation[$clauses] ')'
     ;
 
@@ -1045,6 +1047,7 @@ basic_unreserved_keyword returns [String str]
 | K_CUSTOM
 | K_TRIGGER
 | K_DISTINCT
+| K_CONTAINS
 ) { $str = $k.text; }
 ;
 
@@ -1101,6 +1104,7 @@ K_DESC:D E S C;
 K_ALLOW:   A L L O W;
 K_FILTERING:   F I L T E R I N G;
 K_IF:  I F;
+K_CONTAINS:C O N T A I N S;
 
 K_GRANT:   G R A N T;
 K_ALL: A L L;

http://git-wip-us.apache.org/repos/asf/cassandra/blob/d12a0d7b/src/java/org/apache/cassandra/cql3/Relation.java
--
diff --git a/src/java/org/apache/cassandra/cql3/Relation.java 
b/src/java/org/apache/cassandra/cql3/Relation.java
index 15ed540..cfcdd54 100644
--- a/src/java/org/apache/cassandra/cql3/Relation.java
+++ b/src/java/org/apache/cassandra/cql3/Relation.java
@@ -35,7 +35,20 @@ public class Relation
 
     public static enum Type
     {
-        EQ, LT, LTE, GTE, GT, IN;
+        EQ, LT, LTE, GTE, GT, IN, CONTAINS, CONTAINS_KEY;
+
+        public boolean allowsIndexQuery()
+        {
+            switch (this)
+            {
+                case EQ:
+                case CONTAINS:
+                case CONTAINS_KEY:
+                    return true;
+                default:
+                    return false;
+            }
+        }
     }
 
     private Relation(ColumnIdentifier entity, Type type, Term.Raw value, List<Term.Raw> inValues, boolean onToken)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/d12a0d7b/src/java/org/apache/cassandra/cql3/statements/CreateIndexStatement.java
--
diff --git 

git commit: Warn when a read collection has > 64k elements

2013-12-03 Thread slebresne
Updated Branches:
  refs/heads/cassandra-1.2 ecd94221a -> f634ac7ea


Warn when a read collection has > 64k elements

patch by slebresne; reviewed by iamaleksey for CASSANDRA-5428


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f634ac7e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f634ac7e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f634ac7e

Branch: refs/heads/cassandra-1.2
Commit: f634ac7eae468b944d22951fc7c9d05aa6c7f447
Parents: ecd9422
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:53:33 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:53:33 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index c80a00a..8e6cffa 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -8,6 +8,7 @@
  * Throw IRE if a prepared has more markers than supported (CASSANDRA-5598)
  * Expose Thread metrics for the native protocol server (CASSANDRA-6234)
  * Change snapshot response message verb (CASSANDRA-6415)
+ * Warn when collection read has > 65K elements (CASSANDRA-5428)
 
 
 1.2.12

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/CollectionType.java 
b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
index ad2ea67..a34a2b7 100644
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@ -20,6 +20,9 @@ package org.apache.cassandra.db.marshal;
 import java.nio.ByteBuffer;
 import java.util.List;
 
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
 import org.apache.cassandra.cql3.CQL3Type;
 import org.apache.cassandra.db.IColumn;
 import org.apache.cassandra.utils.ByteBufferUtil;
@@ -33,6 +36,10 @@ import org.apache.cassandra.utils.Pair;
  */
 public abstract class CollectionType<T> extends AbstractType<T>
 {
+    private static final Logger logger = LoggerFactory.getLogger(CollectionType.class);
+
+    public static final int MAX_ELEMENTS = 65535;
+
     public enum Kind
     {
         MAP, SET, LIST
@@ -105,6 +112,16 @@ public abstract class CollectionType<T> extends AbstractType<T>
         return (ByteBuffer)result.flip();
     }
 
+    protected List<Pair<ByteBuffer, IColumn>> enforceLimit(List<Pair<ByteBuffer, IColumn>> columns)
+    {
+        if (columns.size() <= MAX_ELEMENTS)
+            return columns;
+
+        logger.error("Detected collection with {} elements, more than the {} limit. Only the first {} elements will be returned to the client. "
+                   + "Please see http://cassandra.apache.org/doc/cql3/CQL.html#collections for more details.", columns.size(), MAX_ELEMENTS, MAX_ELEMENTS);
+        return columns.subList(0, MAX_ELEMENTS);
+    }
+
     public static ByteBuffer pack(List<ByteBuffer> buffers, int elements)
     {
         int size = 0;

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/ListType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/ListType.java 
b/src/java/org/apache/cassandra/db/marshal/ListType.java
index b6613ae..b219af1 100644
--- a/src/java/org/apache/cassandra/db/marshal/ListType.java
+++ b/src/java/org/apache/cassandra/db/marshal/ListType.java
@@ -120,6 +120,8 @@ public class ListType<T> extends CollectionType<List<T>>
 
     public ByteBuffer serialize(List<Pair<ByteBuffer, IColumn>> columns)
     {
+        columns = enforceLimit(columns);
+
         List<ByteBuffer> bbs = new ArrayList<ByteBuffer>(columns.size());
         int size = 0;
         for (Pair<ByteBuffer, IColumn> p : columns)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/MapType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/MapType.java 
b/src/java/org/apache/cassandra/db/marshal/MapType.java
index 19310df..750851e 100644
--- a/src/java/org/apache/cassandra/db/marshal/MapType.java
+++ b/src/java/org/apache/cassandra/db/marshal/MapType.java
@@ -137,6 +137,8 @@ public class 

[2/3] git commit: Merge branch 'cassandra-1.2' into cassandra-2.0

2013-12-03 Thread slebresne
Merge branch 'cassandra-1.2' into cassandra-2.0


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/b2da839f
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/b2da839f
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/b2da839f

Branch: refs/heads/cassandra-2.0
Commit: b2da839f076f14f35c5591b39736c8d7241974ee
Parents: 6724964 f634ac7
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:54:41 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:54:41 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/b2da839f/CHANGES.txt
--
diff --cc CHANGES.txt
index 11f4c09,8e6cffa..a7ab215
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -13,44 -8,10 +13,45 @@@ Merged from 1.2
   * Throw IRE if a prepared has more markers than supported (CASSANDRA-5598)
   * Expose Thread metrics for the native protocol server (CASSANDRA-6234)
   * Change snapshot response message verb (CASSANDRA-6415)
+  * Warn when collection read has > 65K elements (CASSANDRA-5428)
  
  
 -1.2.12
 +2.0.3
 + * Fix FD leak on slice read path (CASSANDRA-6275)
 + * Cancel read meter task when closing SSTR (CASSANDRA-6358)
 + * free off-heap IndexSummary during bulk (CASSANDRA-6359)
 + * Recover from IOException in accept() thread (CASSANDRA-6349)
 + * Improve Gossip tolerance of abnormally slow tasks (CASSANDRA-6338)
 + * Fix trying to hint timed out counter writes (CASSANDRA-6322)
 + * Allow restoring specific columnfamilies from archived CL (CASSANDRA-4809)
 + * Avoid flushing compaction_history after each operation (CASSANDRA-6287)
 + * Fix repair assertion error when tombstones expire (CASSANDRA-6277)
 + * Skip loading corrupt key cache (CASSANDRA-6260)
 + * Fixes for compacting larger-than-memory rows (CASSANDRA-6274)
 + * Compact hottest sstables first and optionally omit coldest from
 +   compaction entirely (CASSANDRA-6109)
 + * Fix modifying column_metadata from thrift (CASSANDRA-6182)
 + * cqlsh: fix LIST USERS output (CASSANDRA-6242)
 + * Add IRequestSink interface (CASSANDRA-6248)
 + * Update memtable size while flushing (CASSANDRA-6249)
 + * Provide hooks around CQL2/CQL3 statement execution (CASSANDRA-6252)
 + * Require Permission.SELECT for CAS updates (CASSANDRA-6247)
 + * New CQL-aware SSTableWriter (CASSANDRA-5894)
 + * Reject CAS operation when the protocol v1 is used (CASSANDRA-6270)
 + * Correctly throw error when frame too large (CASSANDRA-5981)
 + * Fix serialization bug in PagedRange with 2ndary indexes (CASSANDRA-6299)
 + * Fix CQL3 table validation in Thrift (CASSANDRA-6140)
 + * Fix bug missing results with IN clauses (CASSANDRA-6327)
 + * Fix paging with reversed slices (CASSANDRA-6343)
 + * Set minTimestamp correctly to be able to drop expired sstables 
(CASSANDRA-6337)
 + * Support NaN and Infinity as float literals (CASSANDRA-6003)
 + * Remove RF from nodetool ring output (CASSANDRA-6289)
 + * Fix attempting to flush empty rows (CASSANDRA-6374)
 + * Fix potential out of bounds exception when paging (CASSANDRA-6333)
 +Merged from 1.2:
 + * Optimize FD phi calculation (CASSANDRA-6386)
 + * Improve initial FD phi estimate when starting up (CASSANDRA-6385)
 + * Don't list CQL3 table in CLI describe even if named explicitely 
(CASSANDRA-5750)
   * Invalidate row cache when dropping CF (CASSANDRA-6351)
   * add non-jamm path for cached statements (CASSANDRA-6293)
   * (Hadoop) Require CFRR batchSize to be at least 2 (CASSANDRA-6114)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/b2da839f/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --cc src/java/org/apache/cassandra/db/marshal/CollectionType.java
index f922d56,a34a2b7..9408980
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@@ -20,9 -20,11 +20,12 @@@ package org.apache.cassandra.db.marshal
  import java.nio.ByteBuffer;
  import java.util.List;
  
+ import org.slf4j.Logger;
+ import org.slf4j.LoggerFactory;
+ 
  import org.apache.cassandra.cql3.CQL3Type;
 -import org.apache.cassandra.db.IColumn;
 +import org.apache.cassandra.db.Column;
 +import org.apache.cassandra.serializers.MarshalException;
  import org.apache.cassandra.utils.ByteBufferUtil;
  import org.apache.cassandra.utils.Pair;
  


[3/3] git commit: Fix merge

2013-12-03 Thread slebresne
Fix merge


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/1334f94e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/1334f94e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/1334f94e

Branch: refs/heads/cassandra-2.0
Commit: 1334f94e40ce5dbed7270808abb2330ea6d37c51
Parents: b2da839
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:56:19 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:56:19 2013 +0100

--
 src/java/org/apache/cassandra/db/marshal/CollectionType.java | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/1334f94e/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/CollectionType.java 
b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
index 9408980..07c86e0 100644
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@ -113,7 +113,7 @@ public abstract class CollectionType<T> extends AbstractType<T>
         return (ByteBuffer)result.flip();
     }
 
-    protected List<Pair<ByteBuffer, IColumn>> enforceLimit(List<Pair<ByteBuffer, IColumn>> columns)
+    protected List<Pair<ByteBuffer, Column>> enforceLimit(List<Pair<ByteBuffer, Column>> columns)
     {
         if (columns.size() <= MAX_ELEMENTS)
             return columns;



[1/3] git commit: Warn when a read collection has > 64k elements

2013-12-03 Thread slebresne
Updated Branches:
  refs/heads/cassandra-2.0 672496430 -> 1334f94e4


Warn when a read collection has > 64k elements

patch by slebresne; reviewed by iamaleksey for CASSANDRA-5428


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f634ac7e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f634ac7e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f634ac7e

Branch: refs/heads/cassandra-2.0
Commit: f634ac7eae468b944d22951fc7c9d05aa6c7f447
Parents: ecd9422
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:53:33 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:53:33 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index c80a00a..8e6cffa 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -8,6 +8,7 @@
  * Throw IRE if a prepared has more markers than supported (CASSANDRA-5598)
  * Expose Thread metrics for the native protocol server (CASSANDRA-6234)
  * Change snapshot response message verb (CASSANDRA-6415)
+ * Warn when collection read has > 65K elements (CASSANDRA-5428)
 
 
 1.2.12

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/CollectionType.java 
b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
index ad2ea67..a34a2b7 100644
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@ -20,6 +20,9 @@ package org.apache.cassandra.db.marshal;
 import java.nio.ByteBuffer;
 import java.util.List;
 
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
 import org.apache.cassandra.cql3.CQL3Type;
 import org.apache.cassandra.db.IColumn;
 import org.apache.cassandra.utils.ByteBufferUtil;
@@ -33,6 +36,10 @@ import org.apache.cassandra.utils.Pair;
  */
 public abstract class CollectionType<T> extends AbstractType<T>
 {
+    private static final Logger logger = LoggerFactory.getLogger(CollectionType.class);
+
+    public static final int MAX_ELEMENTS = 65535;
+
     public enum Kind
     {
         MAP, SET, LIST
@@ -105,6 +112,16 @@ public abstract class CollectionType<T> extends AbstractType<T>
         return (ByteBuffer)result.flip();
     }
 
+    protected List<Pair<ByteBuffer, IColumn>> enforceLimit(List<Pair<ByteBuffer, IColumn>> columns)
+    {
+        if (columns.size() <= MAX_ELEMENTS)
+            return columns;
+
+        logger.error("Detected collection with {} elements, more than the {} limit. Only the first {} elements will be returned to the client. "
+                   + "Please see http://cassandra.apache.org/doc/cql3/CQL.html#collections for more details.", columns.size(), MAX_ELEMENTS, MAX_ELEMENTS);
+        return columns.subList(0, MAX_ELEMENTS);
+    }
+
     public static ByteBuffer pack(List<ByteBuffer> buffers, int elements)
     {
         int size = 0;

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/ListType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/ListType.java 
b/src/java/org/apache/cassandra/db/marshal/ListType.java
index b6613ae..b219af1 100644
--- a/src/java/org/apache/cassandra/db/marshal/ListType.java
+++ b/src/java/org/apache/cassandra/db/marshal/ListType.java
@@ -120,6 +120,8 @@ public class ListType<T> extends CollectionType<List<T>>
 
     public ByteBuffer serialize(List<Pair<ByteBuffer, IColumn>> columns)
     {
+        columns = enforceLimit(columns);
+
         List<ByteBuffer> bbs = new ArrayList<ByteBuffer>(columns.size());
         int size = 0;
         for (Pair<ByteBuffer, IColumn> p : columns)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/MapType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/MapType.java 
b/src/java/org/apache/cassandra/db/marshal/MapType.java
index 19310df..750851e 100644
--- a/src/java/org/apache/cassandra/db/marshal/MapType.java
+++ b/src/java/org/apache/cassandra/db/marshal/MapType.java
@@ -137,6 +137,8 @@ public class 

[4/4] git commit: Merge branch 'cassandra-2.0' into trunk

2013-12-03 Thread slebresne
Merge branch 'cassandra-2.0' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/b34d43f9
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/b34d43f9
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/b34d43f9

Branch: refs/heads/trunk
Commit: b34d43f9747d2ebc1feb516d9675801bcd293d8b
Parents: d12a0d7 1334f94
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:57:06 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:57:06 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/b34d43f9/CHANGES.txt
--



[1/4] git commit: Warn when a read collection has > 64k elements

2013-12-03 Thread slebresne
Updated Branches:
  refs/heads/trunk d12a0d7b0 -> b34d43f97


Warn when a read collection has > 64k elements

patch by slebresne; reviewed by iamaleksey for CASSANDRA-5428


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f634ac7e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f634ac7e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f634ac7e

Branch: refs/heads/trunk
Commit: f634ac7eae468b944d22951fc7c9d05aa6c7f447
Parents: ecd9422
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:53:33 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:53:33 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index c80a00a..8e6cffa 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -8,6 +8,7 @@
  * Throw IRE if a prepared has more markers than supported (CASSANDRA-5598)
  * Expose Thread metrics for the native protocol server (CASSANDRA-6234)
  * Change snapshot response message verb (CASSANDRA-6415)
+ * Warn when collection read has > 65K elements (CASSANDRA-5428)
 
 
 1.2.12

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/CollectionType.java 
b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
index ad2ea67..a34a2b7 100644
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@ -20,6 +20,9 @@ package org.apache.cassandra.db.marshal;
 import java.nio.ByteBuffer;
 import java.util.List;
 
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
 import org.apache.cassandra.cql3.CQL3Type;
 import org.apache.cassandra.db.IColumn;
 import org.apache.cassandra.utils.ByteBufferUtil;
@@ -33,6 +36,10 @@ import org.apache.cassandra.utils.Pair;
  */
 public abstract class CollectionType<T> extends AbstractType<T>
 {
+    private static final Logger logger = LoggerFactory.getLogger(CollectionType.class);
+
+    public static final int MAX_ELEMENTS = 65535;
+
     public enum Kind
     {
         MAP, SET, LIST
@@ -105,6 +112,16 @@ public abstract class CollectionType<T> extends AbstractType<T>
         return (ByteBuffer)result.flip();
     }
 
+    protected List<Pair<ByteBuffer, IColumn>> enforceLimit(List<Pair<ByteBuffer, IColumn>> columns)
+    {
+        if (columns.size() <= MAX_ELEMENTS)
+            return columns;
+
+        logger.error("Detected collection with {} elements, more than the {} limit. Only the first {} elements will be returned to the client. "
+                   + "Please see http://cassandra.apache.org/doc/cql3/CQL.html#collections for more details.", columns.size(), MAX_ELEMENTS, MAX_ELEMENTS);
+        return columns.subList(0, MAX_ELEMENTS);
+    }
+
     public static ByteBuffer pack(List<ByteBuffer> buffers, int elements)
     {
         int size = 0;

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/ListType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/ListType.java 
b/src/java/org/apache/cassandra/db/marshal/ListType.java
index b6613ae..b219af1 100644
--- a/src/java/org/apache/cassandra/db/marshal/ListType.java
+++ b/src/java/org/apache/cassandra/db/marshal/ListType.java
@@ -120,6 +120,8 @@ public class ListType<T> extends CollectionType<List<T>>
 
     public ByteBuffer serialize(List<Pair<ByteBuffer, IColumn>> columns)
     {
+        columns = enforceLimit(columns);
+
         List<ByteBuffer> bbs = new ArrayList<ByteBuffer>(columns.size());
         int size = 0;
         for (Pair<ByteBuffer, IColumn> p : columns)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f634ac7e/src/java/org/apache/cassandra/db/marshal/MapType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/MapType.java 
b/src/java/org/apache/cassandra/db/marshal/MapType.java
index 19310df..750851e 100644
--- a/src/java/org/apache/cassandra/db/marshal/MapType.java
+++ b/src/java/org/apache/cassandra/db/marshal/MapType.java
@@ -137,6 +137,8 @@ public class MapType<K, V> extends 

[2/4] git commit: Merge branch 'cassandra-1.2' into cassandra-2.0

2013-12-03 Thread slebresne
Merge branch 'cassandra-1.2' into cassandra-2.0


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/b2da839f
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/b2da839f
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/b2da839f

Branch: refs/heads/trunk
Commit: b2da839f076f14f35c5591b39736c8d7241974ee
Parents: 6724964 f634ac7
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:54:41 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:54:41 2013 +0100

--
 CHANGES.txt|  1 +
 .../cassandra/db/marshal/CollectionType.java   | 17 +
 .../org/apache/cassandra/db/marshal/ListType.java  |  2 ++
 .../org/apache/cassandra/db/marshal/MapType.java   |  2 ++
 .../org/apache/cassandra/db/marshal/SetType.java   |  2 ++
 5 files changed, 24 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/b2da839f/CHANGES.txt
--
diff --cc CHANGES.txt
index 11f4c09,8e6cffa..a7ab215
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -13,44 -8,10 +13,45 @@@ Merged from 1.2
   * Throw IRE if a prepared has more markers than supported (CASSANDRA-5598)
   * Expose Thread metrics for the native protocol server (CASSANDRA-6234)
   * Change snapshot response message verb (CASSANDRA-6415)
+  * Warn when collection read has > 65K elements (CASSANDRA-5428)
  
  
 -1.2.12
 +2.0.3
 + * Fix FD leak on slice read path (CASSANDRA-6275)
 + * Cancel read meter task when closing SSTR (CASSANDRA-6358)
 + * free off-heap IndexSummary during bulk (CASSANDRA-6359)
 + * Recover from IOException in accept() thread (CASSANDRA-6349)
 + * Improve Gossip tolerance of abnormally slow tasks (CASSANDRA-6338)
 + * Fix trying to hint timed out counter writes (CASSANDRA-6322)
 + * Allow restoring specific columnfamilies from archived CL (CASSANDRA-4809)
 + * Avoid flushing compaction_history after each operation (CASSANDRA-6287)
 + * Fix repair assertion error when tombstones expire (CASSANDRA-6277)
 + * Skip loading corrupt key cache (CASSANDRA-6260)
 + * Fixes for compacting larger-than-memory rows (CASSANDRA-6274)
 + * Compact hottest sstables first and optionally omit coldest from
 +   compaction entirely (CASSANDRA-6109)
 + * Fix modifying column_metadata from thrift (CASSANDRA-6182)
 + * cqlsh: fix LIST USERS output (CASSANDRA-6242)
 + * Add IRequestSink interface (CASSANDRA-6248)
 + * Update memtable size while flushing (CASSANDRA-6249)
 + * Provide hooks around CQL2/CQL3 statement execution (CASSANDRA-6252)
 + * Require Permission.SELECT for CAS updates (CASSANDRA-6247)
 + * New CQL-aware SSTableWriter (CASSANDRA-5894)
 + * Reject CAS operation when the protocol v1 is used (CASSANDRA-6270)
 + * Correctly throw error when frame too large (CASSANDRA-5981)
 + * Fix serialization bug in PagedRange with 2ndary indexes (CASSANDRA-6299)
 + * Fix CQL3 table validation in Thrift (CASSANDRA-6140)
 + * Fix bug missing results with IN clauses (CASSANDRA-6327)
 + * Fix paging with reversed slices (CASSANDRA-6343)
 + * Set minTimestamp correctly to be able to drop expired sstables 
(CASSANDRA-6337)
 + * Support NaN and Infinity as float literals (CASSANDRA-6003)
 + * Remove RF from nodetool ring output (CASSANDRA-6289)
 + * Fix attempting to flush empty rows (CASSANDRA-6374)
 + * Fix potential out of bounds exception when paging (CASSANDRA-6333)
 +Merged from 1.2:
 + * Optimize FD phi calculation (CASSANDRA-6386)
 + * Improve initial FD phi estimate when starting up (CASSANDRA-6385)
 + * Don't list CQL3 table in CLI describe even if named explicitely 
(CASSANDRA-5750)
   * Invalidate row cache when dropping CF (CASSANDRA-6351)
   * add non-jamm path for cached statements (CASSANDRA-6293)
   * (Hadoop) Require CFRR batchSize to be at least 2 (CASSANDRA-6114)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/b2da839f/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --cc src/java/org/apache/cassandra/db/marshal/CollectionType.java
index f922d56,a34a2b7..9408980
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@@ -20,9 -20,11 +20,12 @@@ package org.apache.cassandra.db.marshal
  import java.nio.ByteBuffer;
  import java.util.List;
  
+ import org.slf4j.Logger;
+ import org.slf4j.LoggerFactory;
+ 
  import org.apache.cassandra.cql3.CQL3Type;
 -import org.apache.cassandra.db.IColumn;
 +import org.apache.cassandra.db.Column;
 +import org.apache.cassandra.serializers.MarshalException;
  import org.apache.cassandra.utils.ByteBufferUtil;
  import org.apache.cassandra.utils.Pair;
  


[3/4] git commit: Fix merge

2013-12-03 Thread slebresne
Fix merge


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/1334f94e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/1334f94e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/1334f94e

Branch: refs/heads/trunk
Commit: 1334f94e40ce5dbed7270808abb2330ea6d37c51
Parents: b2da839
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Tue Dec 3 14:56:19 2013 +0100
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Dec 3 14:56:19 2013 +0100

--
 src/java/org/apache/cassandra/db/marshal/CollectionType.java | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/1334f94e/src/java/org/apache/cassandra/db/marshal/CollectionType.java
--
diff --git a/src/java/org/apache/cassandra/db/marshal/CollectionType.java 
b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
index 9408980..07c86e0 100644
--- a/src/java/org/apache/cassandra/db/marshal/CollectionType.java
+++ b/src/java/org/apache/cassandra/db/marshal/CollectionType.java
@@ -113,7 +113,7 @@ public abstract class CollectionType<T> extends AbstractType<T>
         return (ByteBuffer)result.flip();
     }
 
-    protected List<Pair<ByteBuffer, IColumn>> enforceLimit(List<Pair<ByteBuffer, IColumn>> columns)
+    protected List<Pair<ByteBuffer, Column>> enforceLimit(List<Pair<ByteBuffer, Column>> columns)
    {
         if (columns.size() <= MAX_ELEMENTS)
             return columns;



[jira] [Created] (CASSANDRA-6436) AbstractColumnFamilyInputFormat does not use start and end tokens configured via ConfigHelper.setInputRange()

2013-12-03 Thread Paulo Ricardo Motta Gomes (JIRA)
Paulo Ricardo Motta Gomes created CASSANDRA-6436:


 Summary: AbstractColumnFamilyInputFormat does not use start and 
end tokens configured via ConfigHelper.setInputRange()
 Key: CASSANDRA-6436
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6436
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Reporter: Paulo Ricardo Motta Gomes
 Fix For: 1.2.6


ConfigHelper allows setting a token input range via the setInputRange(conf, 
startToken, endToken) call (ConfigHelper:254).

We used this feature to limit a Hadoop job's range to a single Cassandra node's 
range, or even to a single row key, mostly for testing purposes. 

This worked before the fix for CASSANDRA-5536 
(https://github.com/apache/cassandra/commit/aaf18bd08af50bbaae0954d78d5e6cbb684aded9),
 but since that change ColumnFamilyInputFormat never uses the value of 
KeyRange.start_token when defining the input splits 
(AbstractColumnFamilyInputFormat:142-160), only KeyRange.start_key, which 
needs an order-preserving partitioner to work.

I propose the attached fix in order to allow defining Cassandra token ranges 
for a given Hadoop job even when using a non-order-preserving partitioner.

Example use of ConfigHelper.setInputRange(conf, startToken, endToken) to limit 
the range to a single Cassandra key with RandomPartitioner: 

IPartitioner part = ConfigHelper.getInputPartitioner(job.getConfiguration());
Token token = part.getToken(ByteBufferUtil.bytes("Cassandra Key"));
BigInteger endToken = (BigInteger) new BigIntegerConverter().convert(BigInteger.class, part.getTokenFactory().toString(token));
BigInteger startToken = endToken.subtract(new BigInteger("1"));
ConfigHelper.setInputRange(job.getConfiguration(), startToken.toString(), endToken.toString());



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6436) AbstractColumnFamilyInputFormat does not use start and end tokens configured via ConfigHelper.setInputRange()

2013-12-03 Thread Paulo Ricardo Motta Gomes (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Paulo Ricardo Motta Gomes updated CASSANDRA-6436:
-

Attachment: cassandra-1.2-6436.txt

Fix patch attached.

 AbstractColumnFamilyInputFormat does not use start and end tokens configured 
 via ConfigHelper.setInputRange()
 -

 Key: CASSANDRA-6436
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6436
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Reporter: Paulo Ricardo Motta Gomes
  Labels: hadoop, patch
 Fix For: 1.2.6

 Attachments: cassandra-1.2-6436.txt, cassandra-1.2-6436.txt


 ConfigHelper allows setting a token input range via the setInputRange(conf, 
 startToken, endToken) call (ConfigHelper:254).
 We used this feature to limit a Hadoop job's range to a single Cassandra node's 
 range, or even to a single row key, mostly for testing purposes. 
 This worked before the fix for CASSANDRA-5536 
 (https://github.com/apache/cassandra/commit/aaf18bd08af50bbaae0954d78d5e6cbb684aded9),
  but since that change ColumnFamilyInputFormat never uses the value of 
 KeyRange.start_token when defining the input splits 
 (AbstractColumnFamilyInputFormat:142-160), only KeyRange.start_key, which 
 needs an order-preserving partitioner to work.
 I propose the attached fix in order to allow defining Cassandra token ranges 
 for a given Hadoop job even when using a non-order-preserving partitioner.
 Example use of ConfigHelper.setInputRange(conf, startToken, endToken) to 
 limit the range to a single Cassandra key with RandomPartitioner: 
 IPartitioner part = ConfigHelper.getInputPartitioner(job.getConfiguration());
 Token token = part.getToken(ByteBufferUtil.bytes("Cassandra Key"));
 BigInteger endToken = (BigInteger) new BigIntegerConverter().convert(BigInteger.class, part.getTokenFactory().toString(token));
 BigInteger startToken = endToken.subtract(new BigInteger("1"));
 ConfigHelper.setInputRange(job.getConfiguration(), startToken.toString(), 
 endToken.toString());



--
This message was sent by Atlassian JIRA
(v6.1#6144)



[jira] [Assigned] (CASSANDRA-5864) Scrub should discard the columns from CFMetaData.droppedColumns map (if they are old enough)

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-5864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis reassigned CASSANDRA-5864:
-

Assignee: Tyler Hobbs  (was: Jonathan Ellis)

WDYT [~thobbs]? ^

 Scrub should discard the columns from CFMetaData.droppedColumns map (if they 
 are old enough)
 

 Key: CASSANDRA-5864
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5864
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Assignee: Tyler Hobbs
Priority: Minor
  Labels: scrub

 CASSANDRA-3919 restored ALTER TABLE DROP support in CQL3 and it would be nice 
 to make scrub dropped-columns-aware.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6127) vnodes don't scale to hundreds of nodes

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6127.
---

   Resolution: Fixed
Reproduced In: 1.2.9, 1.2.6  (was: 1.2.6, 1.2.9)

I think we've addressed the major problems in the related tickets above.  No 
single culprit.

 vnodes don't scale to hundreds of nodes
 ---

 Key: CASSANDRA-6127
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6127
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: Any cluster that has vnodes and consists of hundreds of 
 physical nodes.
Reporter: Tupshin Harper
Assignee: Jonathan Ellis
 Attachments: 2013-11-05_18-04-03_no_compression_cpu_time.png, 
 2013-11-05_18-09-38_compression_on_cpu_time.png, 6000vnodes.patch, 
 AdjustableGossipPeriod.patch, cpu-vs-token-graph.png, 
 delayEstimatorUntilStatisticallyValid.patch, flaps-vs-tokens.png, vnodes  
 gossip flaps.png


 There are a lot of gossip-related issues related to very wide clusters that 
 also have vnodes enabled. Let's use this ticket as a master in case there are 
 sub-tickets.
 The most obvious symptom I've seen is with 1000 nodes in EC2 with m1.xlarge 
 instances. Each node configured with 32 vnodes.
 Without vnodes, cluster spins up fine and is ready to handle requests within 
 30 minutes or less. 
 With vnodes, nodes are reporting constant up/down flapping messages with no 
 external load on the cluster. After a couple of hours, they were still 
 flapping, had very high cpu load, and the cluster never looked like it was 
 going to stabilize or be useful for traffic.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6428) Inconsistency in CQL native protocol

2013-12-03 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837745#comment-13837745
 ] 

Sylvain Lebresne commented on CASSANDRA-6428:
-

As said in CASSANDRA-5428, the use of a short in the native protocol is, to a 
large extent, an oversight. We'll fix it in the next iteration of the binary 
protocol, but we can't break every driver for that in the current version (for 
hopefully obvious reasons).

Collections are not meant to be large, because that just doesn't work well from 
a language point of view. A CQL collection is, from an API point of view, just 
one CQL column in one CQL row. This is not the right place for large things, 
where by "large" I mean something that is not meant to be queried in its 
entirety. CQL provides the notion of clustering columns that allow rows to be 
kept sorted and ranges of them to be queried. That is the right place for 
large things.

Now there is some wiggle room in what "large" (again, in the sense of "always 
fetched entirely") means, and it depends on the use case. So maybe a collection 
would actually be a great fit for you and the 64k limit is just a little too 
low. Sorry if that's the case; we'll lift that limitation at some point, but 
again, we just can't break everyone by changing the collection format in the 
current version of the protocol.

As for alternatives, you can split into 2 tables. Or maybe, if the problem is 
just that the current limit is slightly too low, you have an easy way to 
distribute your set values into 5-10 separate set columns. Or otherwise, you 
can always do it like you would have in Thrift, and use a table like:
{noformat}
CREATE TABLE t (
  pk text,
  name text,
  value blob,
  PRIMARY KEY (pk, name)
)
{noformat}
where name would be both the name of your static properties (so 'val1', 
'val2', etc.) and your set elements (maybe with a prefix character to make 
sure they don't conflict with the other properties), and value would be the 
values for the static properties and just an empty blob for the set elements. 
It does mean you need to handle the encoding/decoding of your property values 
client side, but it's not *that* hard either.
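
To illustrate that encoding (hypothetical values; the 's:' prefix is just one 
way to keep set elements from clashing with property names):
{noformat}
-- static properties: encoded value stored in the blob
INSERT INTO t (pk, name, value) VALUES ('key1', 'val1', 0x736f6d652076616c7565);
-- set elements: membership is the column itself, value left empty
INSERT INTO t (pk, name, value) VALUES ('key1', 's:element1', 0x);
INSERT INTO t (pk, name, value) VALUES ('key1', 's:element2', 0x);
-- read back just the set via a slice on the clustering column:
SELECT name FROM t WHERE pk = 'key1' AND name >= 's:' AND name < 't';
{noformat}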

And I'm sure there can be other ideas, but I don't know the details of your use 
case (and tbh, this is not the right place to discuss modeling questions). 

 Inconsistency in CQL native protocol
 

 Key: CASSANDRA-6428
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6428
 Project: Cassandra
  Issue Type: Bug
Reporter: Jan Chochol

 We are trying to use Cassandra CQL3 collections (sets and maps) for 
 denormalizing data.
 The problem arises when the size of these collections goes above some limit. 
 We found that the current limitation is 64k - 1 (65535) items in a collection.
 We found that there is an inconsistency in the CQL binary protocol (all 
 currently available versions). 
 In the protocol (for a set) there are these fields:
 {noformat}
 [value size: int] [items count: short] [items] ...
 {noformat}
 One example in our case (a collection with 65536 elements):
 {noformat}
 00 21 ff ee 00 00 00 20 30 30 30 30 35 63 38 69 65 33 67 37 73 61 ...
 {noformat}
 So the decoded {{value size}} is 1245166 bytes and the {{items count}} is 0 
 (65536 wraps around to 0 in an unsigned short).
 This is wrong - you cannot have a collection with 0 items occupying more than 
 1MB.
 I understand that you cannot store more than 65535 in an unsigned short, but I 
 do not understand why there is such a limitation in the protocol, when all the 
 data is currently sent.
 In this case we have several possibilities:
 * ignore the {{items count}} field and read all bytes specified in {{value size}}
 ** the problem is that we cannot be sure this behaviour will be kept in 
 future versions of Cassandra, as it is quite strange
 * refactor our code to use only small collections (this seems quite odd, as 
 Cassandra has no problems with wide rows)
 * do not use collections, and fall back to wide rows
 * wait for a change in the protocol removing the unnecessary limitation
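
The 0 items count reported above is just 16-bit wraparound: the protocol writes 
the count as an unsigned short, so 65536 mod 65536 = 0. A minimal standalone 
sketch of the decode arithmetic (not driver code, just the wraparound):
{noformat}
import java.nio.ByteBuffer;

public class ShortWrapDemo
{
    public static void main(String[] args)
    {
        int elements = 65536;
        ByteBuffer buf = ByteBuffer.allocate(2);
        buf.putShort((short) elements);        // encoder truncates to 16 bits
        buf.flip();
        int decoded = buf.getShort() & 0xFFFF; // decoder reads back an unsigned short
        System.out.println(decoded);           // prints 0
    }
}
{noformat}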



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-2338) C* consistency level needs to be pluggable

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-2338.
---

Resolution: Won't Fix

I don't think this is actually feasible.

 C* consistency level needs to be pluggable
 --

 Key: CASSANDRA-2338
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2338
 Project: Cassandra
  Issue Type: New Feature
Reporter: Matthew F. Dennis
Priority: Minor

 for cases where people want to run C* across multiple DCs for disaster 
 recovery et cetera, where normal operations only happen in the first DC (e.g. 
 no writes/reads happen in the remote DC under normal operation), neither 
 LOCAL_QUORUM nor EACH_QUORUM really suffices.  
 Consider the case with an RF of DC1:3 DC2:2.
 LOCAL_QUORUM doesn't provide any guarantee that data is in the remote DC.
 EACH_QUORUM requires that both nodes in the remote DC are up (a quorum of 
 RF 2 is 2 nodes, so no DC2 failure can be tolerated).
 It would be useful in some situations to be able to specify a strategy where 
 LOCAL_QUORUM is used for the local DC, plus at least one node in a remote DC 
 (and/or at least one in *each* remote DC).



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6259) Cassandra 2.0.1 server has too many tcp connections in CLOSE_WAIT

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6259.
---

Resolution: Cannot Reproduce

 Cassandra 2.0.1 server has too many tcp connections in CLOSE_WAIT
 -

 Key: CASSANDRA-6259
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6259
 Project: Cassandra
  Issue Type: Bug
Reporter: Prateek
Assignee: Sylvain Lebresne

 We are using cassandra 2.0.1 server with cascading client. The cassandra tap 
 used is https://github.com/ifesdjeen/cascading-cassandra (1.0.0-rc6). The 
 problem arises after the server is running for a few days. The server has 
 100,000+ connections in tcp CLOSE_WAIT state and cannot accept any more 
 connections. All map reduce jobs start failing. This seems to be a bug with 
 cassandra 2.0.1 server not closing connections properly.
 [(bloomreach-ami) ubuntu@ip-10-91-15-6 :/mnt/cassandra/data]# lsof -n | grep 
 java | grep CLOSE_WAIT | wc -l
 116321
 java  25427  ubuntu *537u IPv4 9337512 0t0  
   TCP 10.91.15.6:9042->10.171.11.168:34217 (CLOSE_WAIT)
 java  25427  ubuntu *540u IPv4 9107933 0t0  
   TCP 10.91.15.6:9042->10.92.99.19:45820 (CLOSE_WAIT)
 java  25427  ubuntu *543u IPv4 9110100 0t0  
   TCP 10.91.15.6:9042->10.86.106.249:47585 (CLOSE_WAIT)
 java  25427  ubuntu *544u IPv4 9110072 0t0  
   TCP 10.91.15.6:9042->10.86.106.249:47364 (CLOSE_WAIT)
 java  25427  ubuntu *546u IPv4 9110110 0t0  
   TCP 10.91.15.6:9042->10.92.99.19:46162 (CLOSE_WAIT)
 java  25427  ubuntu *547u IPv4 9110093 0t0  
   TCP 10.91.15.6:9042->10.86.106.249:47518 (CLOSE_WAIT)
 java  25427  ubuntu *548u IPv4 9337583 0t0  
   TCP 10.91.15.6:9042->10.171.11.168:34361 (CLOSE_WAIT)
 java  25427  ubuntu *549u IPv4 9110114 0t0  
   TCP 10.91.15.6:9042->10.92.99.19:46212 (CLOSE_WAIT)
 java  25427  ubuntu *551u IPv4 9110117 0t0  



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6437) Datastax C# driver not able to execute CAS after upgrade 2.0.2 -> 2.0.3

2013-12-03 Thread JIRA
Michał Ziemski created CASSANDRA-6437:
-

 Summary: Datastax C# driver not able to execute CAS after upgrade 
2.0.2 -> 2.0.3
 Key: CASSANDRA-6437
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6437
 Project: Cassandra
  Issue Type: Bug
  Components: Core, Drivers (now out of tree)
 Environment: 4 node CentOS 6.4 x64 cassandra 2.0.3 (datastax community)
Reporter: Michał Ziemski


The following code:
  var cl = 
Cluster.Builder().WithConnectionString(ConfigurationManager.ConnectionStrings["Cassandra"].ConnectionString).Build();
  var ses = cl.Connect();
  ses.Execute("INSERT INTO appsrv(id) values ('abc') IF NOT EXISTS", 
ConsistencyLevel.Quorum);

Worked fine with cassandra 2.0.2.
After upgrading to 2.0.3 I get an error stating that conditional updates are 
not supported by the protocol version and that I should upgrade to v2.

I'm not really sure if it's a problem with C* or the Datastax C# Driver.
The error appeared after upgrading C*, so I decided to post it here.




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6438) Decide if we want to make user types keyspace scoped

2013-12-03 Thread Sylvain Lebresne (JIRA)
Sylvain Lebresne created CASSANDRA-6438:
---

 Summary: Decide if we want to make user types keyspace scoped
 Key: CASSANDRA-6438
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6438
 Project: Cassandra
  Issue Type: Improvement
Reporter: Sylvain Lebresne


Currently, user types are declared at the top level. I wonder however if we 
might not want to make them scoped to a given keyspace. It was not done in the 
initial patch for simplicity and because I was not sure of the advantages of 
doing so. However, if we ever want to use user types in system tables, having 
them scoped by keyspace means we won't have to care about the new type 
conflicting with another existing type.

Besides, having user types be part of a keyspace would allow for slightly more 
fine grained permissions on them. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6438) Decide if we want to make user types keyspace scoped

2013-12-03 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837785#comment-13837785
 ] 

Jonathan Ellis commented on CASSANDRA-6438:
---

Makes sense to me.  (And I note that postgresql user types are schema-scoped.)

 Decide if we want to make user types keyspace scoped
 

 Key: CASSANDRA-6438
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6438
 Project: Cassandra
  Issue Type: Improvement
Reporter: Sylvain Lebresne

 Currently, user types are declared at the top level. I wonder however if we 
 might not want to make them scoped to a given keyspace. It was not done in 
 the initial patch for simplicity and because I was not sure of the advantages 
 of doing so. However, if we ever want to use user types in system tables, 
 having them scoped by keyspace means we won't have to care about the new type 
 conflicting with another existing type.
 Besides, having user types be part of a keyspace would allow for slightly 
 more fine grained permissions on them. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6433) snapshot race with compaction causes missing link error

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6433.
---

Resolution: Duplicate

If it's from drop/recreate, it will be fixed by CASSANDRA-5202.

 snapshot race with compaction causes missing link error
 ---

 Key: CASSANDRA-6433
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6433
 Project: Cassandra
  Issue Type: Bug
 Environment: EL6
 Oracle Java 1.7.40
Reporter: Karl Mueller
Priority: Minor

 Cassandra 1.2.11
 When trying to snapshot, I encountered this error. It appears that snapshot 
 doesn't lock the sstable list in a keyspace, which can cause a race condition 
 with compaction. (I think it's compaction, at least.)
 [cassandra@dev-cass00 ~]$ cas cluster snap pre-1.2.12
 *** dev-cass01 (1) ***
  
 Nodetool command snapshot -t pre-1.2.12 failed!
  
 Output:
  
 Requested creating snapshot for: all keyspaces
 Exception in thread "main" java.lang.RuntimeException: Tried to hard link to 
 file that does not exist 
 /data2/data-cassandra/csprocessor/csprocessor/csprocessor-csprocessor-ic-4-Summary.db
 at 
 org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:72)
 at 
 org.apache.cassandra.io.sstable.SSTableReader.createLinks(SSTableReader.java:1095)
 at 
 org.apache.cassandra.db.ColumnFamilyStore.snapshotWithoutFlush(ColumnFamilyStore.java:1567)
 at 
 org.apache.cassandra.db.ColumnFamilyStore.snapshot(ColumnFamilyStore.java:1612)
 at org.apache.cassandra.db.Table.snapshot(Table.java:194)
 at 
 org.apache.cassandra.service.StorageService.takeSnapshot(StorageService.java:2233)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at sun.reflect.misc.Trampoline.invoke(MethodUtil.java:75)
 at sun.reflect.GeneratedMethodAccessor13.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at sun.reflect.misc.MethodUtil.invoke(MethodUtil.java:279)
 at 
 com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:112)
 at 
 com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:46)
 at 
 com.sun.jmx.mbeanserver.MBeanIntrospector.invokeM(MBeanIntrospector.java:237)
 at com.sun.jmx.mbeanserver.PerInterface.invoke(PerInterface.java:138)
 at com.sun.jmx.mbeanserver.MBeanSupport.invoke(MBeanSupport.java:252)
 at 
 com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.invoke(DefaultMBeanServerInterceptor.java:819)
 at 
 com.sun.jmx.mbeanserver.JmxMBeanServer.invoke(JmxMBeanServer.java:801)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.doOperation(RMIConnectionImpl.java:1487)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.access$300(RMIConnectionImpl.java:97)
 at 
 javax.management.remote.rmi.RMIConnectionImpl$PrivilegedOperation.run(RMIConnectionImpl.java:1328)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.doPrivilegedOperation(RMIConnectionImpl.java:1420)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.invoke(RMIConnectionImpl.java:848)
 at sun.reflect.GeneratedMethodAccessor18.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:322)
 at sun.rmi.transport.Transport$1.run(Transport.java:177)
 at sun.rmi.transport.Transport$1.run(Transport.java:174)
 at java.security.AccessController.doPrivileged(Native Method)
 at sun.rmi.transport.Transport.serviceCall(Transport.java:173)
 at 
 sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:556)
 at 
 sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport.java:811)
 at 
 sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.java:670)
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
 at java.lang.Thread.run(Thread.java:724)
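A minimal reproduction of the suspected race shape (hypothetical files and 
paths; this is not Cassandra's snapshot code): the snapshot iterates a file 
list captured before another actor removes a component, and the hard link then 
fails just like the trace above.

{code:title=Stale-listing hard-link race (sketch)}
import os
import tempfile

src_dir, snap_dir = tempfile.mkdtemp(), tempfile.mkdtemp()
path = os.path.join(src_dir, "example-ic-4-Summary.db")
open(path, "w").close()

listing = [path]   # "snapshot" captures the sstable list...
os.remove(path)    # ..."compaction" deletes a component concurrently

for f in listing:  # hard-linking from the stale list now fails
    os.link(f, os.path.join(snap_dir, os.path.basename(f)))
{code}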



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6218) Reduce WAN traffic while doing repairs

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6218.
---

Resolution: Fixed

Go ahead and create a new ticket then because it will probably target a 
different release than this one.

 Reduce WAN traffic while doing repairs
 --

 Key: CASSANDRA-6218
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6218
 Project: Cassandra
  Issue Type: New Feature
  Components: Core, Tools
Reporter: sankalp kohli
Assignee: Jimmy Mårdell
Priority: Minor
 Fix For: 2.0.4

 Attachments: trunk-6218-v2.txt, trunk-6218-v3.patch, trunk-6218.txt


 The way we send out data that does not match over WAN can be improved. 
 Example: Say there are four nodes(A,B,C,D) which are replica of a range we 
 are repairing. A, B is in DC1 and C,D is in DC2. If A does not have the data 
 which other replicas have, then we will have following streams
 1) A to B and back
 2) A to C and back(Goes over WAN)
 3) A to D and back(Goes over WAN)
 One of the ways of doing it to reduce WAN traffic is this.
 1) Repair A and B only with each other and C and D with each other starting 
 at same time t. 
 2) Once these repairs have finished, A,B and C,D are in sync with respect to 
 time t. 
 3) Now run a repair between A and C, the streams which are exchanged as a 
 result of the diff will also be streamed to B and D via A and C(C and D 
 behaves like a proxy to the streams).
 For a replication of DC1:2,DC2:2, the WAN traffic will get reduced by 50% and 
 even more for higher replication factors. 
  



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6435) nodetool outputs xss and jamm errors in 1.2.12

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6435.
---

Resolution: Duplicate

Already fixed the echo so that just leaves the duplicate part.

 nodetool outputs xss and jamm errors in 1.2.12
 --

 Key: CASSANDRA-6435
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6435
 Project: Cassandra
  Issue Type: Bug
Reporter: Karl Mueller
Assignee: Brandon Williams
Priority: Minor

 Since 1.2.12, just running nodetool produces this output. This is probably 
 related to CASSANDRA-6273.
 It's unclear to me whether jamm is actually not being loaded, but nodetool 
 clearly should not be producing this output, which likely comes from 
 cassandra-env.sh.
 [cassandra@dev-cass00 cassandra]$ /data2/cassandra/bin/nodetool ring
 xss =  -ea -javaagent:/data2/cassandra/bin/../lib/jamm-0.2.5.jar 
 -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms14G -Xmx14G -Xmn1G 
 -XX:+HeapDumpOnOutOfMemoryError -Xss256k
 Note: Ownership information does not include topology; for complete 
 information, specify a keyspace
 Datacenter: datacenter1
 ==========
 Address      Rack   Status  State   Load       Owns    Token
                                                        170141183460469231731687303715884105727
 10.93.15.10  rack1  Up      Normal  123.82 GB  20.00%  34028236692093846346337460743176821145
 10.93.15.11  rack1  Up      Normal  124 GB     20.00%  68056473384187692692674921486353642290
 10.93.15.12  rack1  Up      Normal  123.97 GB  20.00%  102084710076281539039012382229530463436
 10.93.15.13  rack1  Up      Normal  124.03 GB  20.00%  136112946768375385385349842972707284581
 10.93.15.14  rack1  Up      Normal  123.93 GB  20.00%  170141183460469231731687303715884105727
 ERROR 16:20:01,408 Unable to initialize MemoryMeter (jamm not specified as 
 javaagent).  This means Cassandra will be unable to measure object sizes 
 accurately and may consequently OOM.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6431) Prevent same CF from being enqueued to flush more than once

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6431.
---

Resolution: Won't Fix

Let's just worry about 5549 then.  Both fixes are 2.1-scoped so let's only 
bother with this if we can't finish 5549 in time.

 Prevent same CF from being enqueued to flush more than once
 ---

 Key: CASSANDRA-6431
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6431
 Project: Cassandra
  Issue Type: Bug
Reporter: Benedict
Assignee: Benedict
Priority: Minor

 As things stand we can, in certain circumstances, fill up the flush queue 
 with multiple requests to flush the same CF, which will lead to all writes 
 blocking until the CF is flushed. Ideally we would only enqueue each 
 CF/Memtable once and, if requested to be flushed whilst already enqueued, 
 mark it to be requeued once the outstanding flush completes.
 On a related note, a single table can already block writes if it has flush 
 queue size or more secondary indexes. At the same time it might be worth 
 deciding if this is also a problem and address it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6437) Datastax C# driver not able to execute CAS after upgrade 2.0.2 -> 2.0.3

2013-12-03 Thread JIRA

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837804#comment-13837804
 ] 

Michał Ziemski commented on CASSANDRA-6437:
---

Isn't DataStax C# Driver release 1.0.2 a v2 driver?

 Datastax C# driver not able to execute CAS after upgrade 2.0.2 -> 2.0.3
 ---

 Key: CASSANDRA-6437
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6437
 Project: Cassandra
  Issue Type: Bug
  Components: Core, Drivers (now out of tree)
 Environment: 4 node CentOS 6.4 x64 cassandra 2.0.3 (datastax community)
Reporter: Michał Ziemski

 The following code:
   var cl = 
 Cluster.Builder().WithConnectionString(ConfigurationManager.ConnectionStrings["Cassandra"].ConnectionString).Build();
   var ses = cl.Connect();
   ses.Execute("INSERT INTO appsrv(id) values ('abc') IF NOT EXISTS", 
 ConsistencyLevel.Quorum);
 Worked fine with cassandra 2.0.2.
 After upgrading to 2.0.3 I get an error stating that conditional updates are 
 not supported by the protocol version and that I should upgrade to v2.
 I'm not really sure if it's a problem with C* or the Datastax C# Driver.
 The error appeared after upgrading C*, so I decided to post it here.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6437) Datastax C# driver not able to execute CAS after upgrade 2.0.2 -> 2.0.3

2013-12-03 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837810#comment-13837810
 ] 

Jonathan Ellis commented on CASSANDRA-6437:
---

No.  Driver 2.x is a v2 driver.

 Datastax C# driver not able to execute CAS after upgrade 2.0.2 -> 2.0.3
 ---

 Key: CASSANDRA-6437
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6437
 Project: Cassandra
  Issue Type: Bug
  Components: Core, Drivers (now out of tree)
 Environment: 4 node CentOS 6.4 x64 cassandra 2.0.3 (datastax community)
Reporter: Michał Ziemski

 The following code:
   var cl = 
 Cluster.Builder().WithConnectionString(ConfigurationManager.ConnectionStrings["Cassandra"].ConnectionString).Build();
   var ses = cl.Connect();
   ses.Execute("INSERT INTO appsrv(id) values ('abc') IF NOT EXISTS", 
 ConsistencyLevel.Quorum);
 Worked fine with cassandra 2.0.2.
 After upgrading to 2.0.3 I get an error stating that conditional updates are 
 not supported by the protocol version and that I should upgrade to v2.
 I'm not really sure if it's a problem with C* or the Datastax C# Driver.
 The error appeared after upgrading C*, so I decided to post it here.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6428) Use 4 bytes to encode collection size in next native protocol version

2013-12-03 Thread Sylvain Lebresne (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-6428:


Summary: Use 4 bytes to encode collection size in next native protocol 
version  (was: Inconsistency in CQL native protocol)

 Use 4 bytes to encode collection size in next native protocol version
 -

 Key: CASSANDRA-6428
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6428
 Project: Cassandra
  Issue Type: Bug
Reporter: Jan Chochol

 We are trying to use Cassandra CQL3 collections (sets and maps) for 
 denormalizing data.
 The problem arises when the size of these collections goes above some limit. 
 We found that the current limitation is 64k - 1 (65535) items in a collection.
 We found that there is an inconsistency in the CQL binary protocol (all 
 currently available versions). 
 In protocol (for set) there are these fields:
 {noformat}
 [value size: int] [items count: short] [items] ...
 {noformat}
 One example in our case (collection with 65536 elements):
 {noformat}
 00 21 ff ee 00 00 00 20 30 30 30 30 35 63 38 69 65 33 67 37 73 61 ...
 {noformat}
 So the decoded {{value size}} is 1245166 bytes and the {{items count}} is 0.
 This is wrong - you cannot have a collection with 0 items occupying more than 
 1 MB.
 I understand that an unsigned short cannot hold more than 65535, but I do not 
 understand why the protocol has such a limitation when all the data is 
 currently sent.
 In this case we have several possibilities:
 * ignore {{items count}} field and read all bytes specified in {{value size}}
 ** the problem is that we cannot be sure this behaviour will be kept in 
 future versions of Cassandra, as it is quite strange
 * refactor our code to use only small collections (this seems quite odd, as 
 Cassandra has no problems with wide rows)
 * do not use collections, and fall back to wide rows
 * wait for change in protocol for removing unnecessary limitation



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6313) Refactor dtests to use python driver instead of cassandra-dbapi2

2013-12-03 Thread Ryan McGuire (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McGuire updated CASSANDRA-6313:


Priority: Major  (was: Minor)

 Refactor dtests to use python driver instead of cassandra-dbapi2
 

 Key: CASSANDRA-6313
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6313
 Project: Cassandra
  Issue Type: Test
Reporter: Ryan McGuire
Assignee: Ryan McGuire

 cassandra-dbapi2 is effectively deprecated. The python driver is the future, 
 we should refactor our dtests to use it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6313) Refactor dtests to use python driver instead of cassandra-dbapi2

2013-12-03 Thread Ryan McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837843#comment-13837843
 ] 

Ryan McGuire commented on CASSANDRA-6313:
-

You may be right; the correct approach for this is probably to assume that it 
will work in most cases and then see where it breaks (if it even does). I 
can't really remember what I thought wasn't supported.

 Refactor dtests to use python driver instead of cassandra-dbapi2
 

 Key: CASSANDRA-6313
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6313
 Project: Cassandra
  Issue Type: Test
Reporter: Ryan McGuire
Assignee: Ryan McGuire
Priority: Minor

 cassandra-dbapi2 is effectively deprecated. The python driver is the future, 
 we should refactor our dtests to use it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6428) Use 4 bytes to encode collection size in next native protocol version

2013-12-03 Thread JIRA

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837864#comment-13837864
 ] 

Ondřej Černoš commented on CASSANDRA-6428:
--

Thanks a lot Sylvain for your time and answers. It is really appreciated.

I think the whole thing boils down to two issues:

* the size of collections in the native protocol, which can be worked around 
now by just ignoring the field in the protocol (the data is fetched from the 
storage now; only the value of the field is incorrect if the size is bigger 
than 64k)
* the usage of collections for mixed cql3 rows (mixing static and dynamic 
content, i.e. mixing narrow-row and wide-row in underlying storage terminology).

We shall probably need to split the above described table (having 20 or so 
static columns and a set hundreds of thousands of elements long) into two 
tables, one for the static columns and the other for the wide row. So instead 
of using:

{noformat}
CREATE TABLE test (
  id text PRIMARY KEY,
  val1 text,
  val2 int,
  val3 timestamp,
  valN text,
  some_set set<text>
)
{noformat}

we will have to have two tables:

{noformat}
CREATE TABLE test_narrow (
  id text PRIMARY KEY,
  val1 text,
  val2 int,
  val3 timestamp,
  valN text
)

CREATE TABLE test_wide (
  id text,
  val text,
  PRIMARY KEY (id, val)
)
{noformat}

The reason is not a modelling one (the first approach is much more comfortable 
and more compliant with the _denormalize everything_ approach), but a 
performance one. The problem is cassandra always performs a range query over 
all the columns of the underlying row if the table is not created with compact 
storage. So a query like {{select val1, val2 from test where id='some_key'}} 
performs poorly if the {{set}} in the table is big (~400 ms primary key lookup 
on a table having roughly 150k records, on a row with a set of roughly 150k 
elements, on a 2 CPU machine with enough memory and the DB all mapped into RAM 
- no disk ops involved), even though we don't fetch the set in the select.

The question is: is this behaviour by design and is this the reason behind the 
recommendation not to use big collections?

I know and agree this is not the best place for modelling questions, but again 
- maybe this is useful for you as the designer of the feature to see how it is 
perceived by users and what issues we run into (by the way, we are new 
cassandra users and we started on cql3 from scratch - we are not thrift 
old-timers). I may take this whole topic to the user list if you wish.

 Use 4 bytes to encode collection size in next native protocol version
 -

 Key: CASSANDRA-6428
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6428
 Project: Cassandra
  Issue Type: Bug
Reporter: Jan Chochol

 We are trying to use Cassandra CQL3 collections (sets and maps) for 
 denormalizing data.
 The problem arises when the size of these collections goes above some limit. 
 We found that the current limitation is 64k - 1 (65535) items in a collection.
 We found that there is an inconsistency in the CQL binary protocol (all 
 currently available versions). 
 In protocol (for set) there are these fields:
 {noformat}
 [value size: int] [items count: short] [items] ...
 {noformat}
 One example in our case (collection with 65536 elements):
 {noformat}
 00 21 ff ee 00 00 00 20 30 30 30 30 35 63 38 69 65 33 67 37 73 61 ...
 {noformat}
 So the decoded {{value size}} is 1245166 bytes and the {{items count}} is 0.
 This is wrong - you cannot have a collection with 0 items occupying more than 
 1 MB.
 I understand that an unsigned short cannot hold more than 65535, but I do not 
 understand why the protocol has such a limitation when all the data is 
 currently sent.
 In this case we have several possibilities:
 * ignore {{items count}} field and read all bytes specified in {{value size}}
 ** the problem is that we cannot be sure this behaviour will be kept in 
 future versions of Cassandra, as it is quite strange
 * refactor our code to use only small collections (this seems quite odd, as 
 Cassandra has no problems with wide rows)
 * do not use collections, and fall back to wide rows
 * wait for change in protocol for removing unnecessary limitation



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)
Francesco Piccinno created CASSANDRA-6439:
-

 Summary: Token ranges are erroneously swapped
 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
resides in the fact that Murmur3 RandomPartitioner is used.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k Crawler
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}
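For what it's worth, the single start > end range above is what a ring's 
wrap-around range looks like (the last range wraps from the maximum token back 
past the minimum), rather than a pair of swapped values. A minimal sketch of 
scanning such a range by splitting it, assuming Murmur3Partitioner token 
bounds:

{code:title=Splitting a wrapping ring range (sketch)}
# Illustrative only; MIN_TOKEN/MAX_TOKEN assume Murmur3Partitioner.
MIN_TOKEN = -(2 ** 63)
MAX_TOKEN = 2 ** 63 - 1

def split_wrapping(start, end):
    """Return non-wrapping sub-ranges covering the ring range (start, end]."""
    if start < end:
        return [(start, end)]
    # start > end: the range wraps around the ring; scan it in two pieces.
    return [(start, MAX_TOKEN), (MIN_TOKEN, end)]

print(split_wrapping(9207458196362321348, -9182599474778206823))
{code}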



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Piccinno updated CASSANDRA-6439:
--

Description: 
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k Crawler
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

  was:
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
resides in the fact that Murmur3 RandomPartitioner is used.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k Crawler
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}


 Token ranges are erroneously swapped
 

 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


 I am trying to achieve a linear scan on the data contained in a cassandra 
 node by exploiting tokens. The idea behind my approach is to request through 
 the pycassa SystemManager a list of tokens that the cluster is responsible 
 for, and then for each token, issue a {{key_range}} command specifying a 
 {{start}} and {{end}} interval. The problem is that apparently some tokens 
 returned by the server do not respect the property {{start}} < {{end}}. I 
 think that the bug is due to the fact that {{Murmur3RandomPartitioner}} is 
 used, but I am just guessing here.
 Anyway here are the steps to reproduce the bug:
 {code:title=Triggering the bug}
 $ pycassaShell -k Crawler
 In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
 SYSTEM_MANAGER.describe_ring('Crawler'))
 In [2]: len(filter(lambda x: x[0] < x[1], tokens))
 Out[2]: 255
 In [3]: len(filter(lambda x: x[0] > x[1], tokens))
 Out[3]: 1
 In [4]: filter(lambda x: x[0] > x[1], tokens)
 Out[4]: [(9207458196362321348, -9182599474778206823)]
 In [5]: for i in CF.get_range(start_token=9207458196362321348, 
 finish_token=-9182599474778206823): print i
 # ...
 # after some objects are printed
 # ...
 InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
 after end token")
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Piccinno updated CASSANDRA-6439:
--

Description: 
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

  was:
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k Crawler
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}


 Token ranges are erroneously swapped
 

 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


 I am trying to achieve a linear scan on the data contained in a cassandra 
 node by exploiting tokens. The idea behind my approach is to request through 
 the pycassa SystemManager a list of tokens that the cluster is responsible 
 for, and then for each token, issue a {{key_range}} command specifying a 
 {{start}} and {{end}} interval. The problem is that apparently some tokens 
 returned by the server do not respect the property {{start}} < {{end}}. I 
 think that the bug is due to the fact that {{Murmur3RandomPartitioner}} is 
 used, but I am just guessing here.
 Anyway here are the steps to reproduce the bug:
 {code:title=Triggering the bug}
 $ pycassaShell -k KEYSPACE
 In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
 SYSTEM_MANAGER.describe_ring('Crawler'))
 In [2]: len(filter(lambda x: x[0] < x[1], tokens))
 Out[2]: 255
 In [3]: len(filter(lambda x: x[0] > x[1], tokens))
 Out[3]: 1
 In [4]: filter(lambda x: x[0] > x[1], tokens)
 Out[4]: [(9207458196362321348, -9182599474778206823)]
 In [5]: for i in CF.get_range(start_token=9207458196362321348, 
 finish_token=-9182599474778206823): print i
 # ...
 # after some objects are printed
 # ...
 InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
 after end token")
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Piccinno updated CASSANDRA-6439:
--

Description: 
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('KEYSPACE'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

  was:
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('Crawler'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}


 Token ranges are erroneously swapped
 

 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


 I am trying to achieve a linear scan on the data contained in a cassandra 
 node by exploiting tokens. The idea behind my approach is to request through 
 the pycassa SystemManager a list of tokens that the cluster is responsible 
 for, and then for each token, issue a {{key_range}} command specifying a 
 {{start}} and {{end}} interval. The problem is that apparently some tokens 
 returned by the server do not respect the property {{start}} < {{end}}. I 
 think that the bug is due to the fact that {{Murmur3RandomPartitioner}} is 
 used, but I am just guessing here.
 Anyway here are the steps to reproduce the bug:
 {code:title=Triggering the bug}
 $ pycassaShell -k KEYSPACE
 In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
 SYSTEM_MANAGER.describe_ring('KEYSPACE'))
 In [2]: len(filter(lambda x: x[0] < x[1], tokens))
 Out[2]: 255
 In [3]: len(filter(lambda x: x[0] > x[1], tokens))
 Out[3]: 1
 In [4]: filter(lambda x: x[0] > x[1], tokens)
 Out[4]: [(9207458196362321348, -9182599474778206823)]
 In [5]: for i in CF.get_range(start_token=9207458196362321348, 
 finish_token=-9182599474778206823): print i
 # ...
 # after some objects are printed
 # ...
 InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
 after end token")
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Piccinno updated CASSANDRA-6439:
--

Description: 
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('KEYSPACE'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

I gue

  was:
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('KEYSPACE'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}


 Token ranges are erroneously swapped
 

 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


 I am trying to achieve a linear scan on the data contained in a cassandra 
 node by exploiting tokens. The idea behind my approach is to request through 
 the pycassa SystemManager a list of tokens that the cluster is responsible 
 for, and then for each token, issue a {{key_range}} command specifying 
 a {{start}} and {{end}} interval. The problem is that apparently some tokens 
 returned by the server do not respect the property {{start}} < {{end}}. I 
 think that the bug is due to the fact that {{Murmur3RandomPartitioner}} is 
 used, but I am just guessing here.
 Anyway here are the steps to reproduce the bug:
 {code:title=Triggering the bug}
 $ pycassaShell -k KEYSPACE
 In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
 SYSTEM_MANAGER.describe_ring('KEYSPACE'))
 In [2]: len(filter(lambda x: x[0] < x[1], tokens))
 Out[2]: 255
 In [3]: len(filter(lambda x: x[0] > x[1], tokens))
 Out[3]: 1
 In [4]: filter(lambda x: x[0] > x[1], tokens)
 Out[4]: [(9207458196362321348, -9182599474778206823)]
 In [5]: for i in CF.get_range(start_token=9207458196362321348, 
 finish_token=-9182599474778206823): print i
 # ...
 # after some objects are printed
 # ...
 InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
 after end token")
 {code}
 I gue



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6439) Token ranges are erroneously swapped

2013-12-03 Thread Francesco Piccinno (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Piccinno updated CASSANDRA-6439:
--

Description: 
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('KEYSPACE'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

  was:
I am trying to achieve a linear scan on the data contained in a cassandra node 
by exploiting tokens. The idea behind my approach is to request through the 
pycassa SystemManager a list of tokens that the cluster is responsible for, and 
then for each token, issue a {{key_range}} command specifying a {{start}} and 
{{end}} interval. The problem is that apparently some tokens returned by the 
server do not respect the property {{start}} < {{end}}. I think that the bug 
is due to the fact that {{Murmur3RandomPartitioner}} is used, but I am just 
guessing here.

Anyway here are the steps to reproduce the bug:

{code:title=Triggering the bug}
$ pycassaShell -k KEYSPACE
In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
SYSTEM_MANAGER.describe_ring('KEYSPACE'))

In [2]: len(filter(lambda x: x[0] < x[1], tokens))
Out[2]: 255

In [3]: len(filter(lambda x: x[0] > x[1], tokens))
Out[3]: 1

In [4]: filter(lambda x: x[0] > x[1], tokens)
Out[4]: [(9207458196362321348, -9182599474778206823)]

In [5]: for i in CF.get_range(start_token=9207458196362321348, 
finish_token=-9182599474778206823): print i
# ...
# after some objects are printed
# ...
InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
after end token")
{code}

I gue


 Token ranges are erroneously swapped
 

 Key: CASSANDRA-6439
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6439
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Francesco Piccinno
Priority: Critical
 Fix For: 2.0.0


 I am trying to achieve a linear scan on the data contained in a cassandra 
 node by exploiting tokens. The idea behind my approach is to request through 
 the pycassa SystemManager a list of tokens that the cluster is responsible 
 for, and then for each token, issue a {{key_range}} command specifying 
 a {{start}} and {{end}} interval. The problem is that apparently some tokens 
 returned by the server do not respect the property {{start}} < {{end}}. I 
 think that the bug is due to the fact that {{Murmur3RandomPartitioner}} is 
 used, but I am just guessing here.
 Anyway here are the steps to reproduce the bug:
 {code:title=Triggering the bug}
 $ pycassaShell -k KEYSPACE
 In [1]: tokens = map(lambda x: (int(x.start_token), int(x.end_token)), 
 SYSTEM_MANAGER.describe_ring('KEYSPACE'))
 In [2]: len(filter(lambda x: x[0] < x[1], tokens))
 Out[2]: 255
 In [3]: len(filter(lambda x: x[0] > x[1], tokens))
 Out[3]: 1
 In [4]: filter(lambda x: x[0] > x[1], tokens)
 Out[4]: [(9207458196362321348, -9182599474778206823)]
 In [5]: for i in CF.get_range(start_token=9207458196362321348, 
 finish_token=-9182599474778206823): print i
 # ...
 # after some objects are printed
 # ...
 InvalidRequestException: InvalidRequestException(why="Start key's token sorts 
 after end token")
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5864) Scrub should discard the columns from CFMetaData.droppedColumns map (if they are old enough)

2013-12-03 Thread Tyler Hobbs (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837956#comment-13837956
 ] 

Tyler Hobbs commented on CASSANDRA-5864:


I feel like scrub is only for attempting to repair damaged/problematic 
sstables.  If dropped columns are causing problems somehow, I would prefer to 
fix whatever is breaking.  If that's not a good option for some reason, then it 
might make sense to add this to scrub.

 Scrub should discard the columns from CFMetaData.droppedColumns map (if they 
 are old enough)
 

 Key: CASSANDRA-5864
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5864
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Assignee: Tyler Hobbs
Priority: Minor
  Labels: scrub

 CASSANDRA-3919 restored ALTER TABLE DROP support in CQL3 and it would be nice 
 to make scrub dropped-columns-aware.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-3578) Multithreaded commitlog

2013-12-03 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838087#comment-13838087
 ] 

Benedict commented on CASSANDRA-3578:
-

As discussed on IRC, with Batch CL in particular you could see a lot of warning 
messages in the log about the sync lagging. Whilst this was working as 
intended, it was a bit much for BatchCL with a small window.

I've uploaded a patch to 
[https://github.com/belliottsmith/cassandra/tree/iss-3578-4] that should 
address this by aggregating lags into a 5m period, for which we only report if 
the lag exceeds 5% of the wall time of the period. This means it should only 
crop up when it's really appreciably affecting performance.
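For illustration, a minimal sketch of that reporting heuristic (the names and 
structure here are illustrative, not the patch's actual code):

{code:title=Windowed lag reporting (sketch)}
WINDOW_SECONDS = 5 * 60          # the 5m aggregation period
THRESHOLD = 0.05                 # report only above 5% of wall time

class LagReporter(object):
    def __init__(self):
        self.window_start = 0.0
        self.lag_total = 0.0

    def record(self, now, lag):
        self.lag_total += lag
        elapsed = now - self.window_start
        if elapsed >= WINDOW_SECONDS:
            if self.lag_total > THRESHOLD * elapsed:
                print("commitlog sync lagging: %.1fs over %.0fs" %
                      (self.lag_total, elapsed))
            self.window_start, self.lag_total = now, 0.0

r = LagReporter()
r.record(10.0, 1.0)    # within the window: accumulate silently
r.record(400.0, 25.0)  # 26s of lag over 400s (6.5%) -> reported
{code}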

 Multithreaded commitlog
 ---

 Key: CASSANDRA-3578
 URL: https://issues.apache.org/jira/browse/CASSANDRA-3578
 Project: Cassandra
  Issue Type: Improvement
Reporter: Jonathan Ellis
Assignee: Benedict
Priority: Minor
  Labels: performance
 Fix For: 2.1

 Attachments: 0001-CASSANDRA-3578.patch, ComitlogStress.java, 
 Current-CL.png, Multi-Threded-CL.png, TestEA.java, latency.svg, oprate.svg, 
 parallel_commit_log_2.patch


 Brian Aker pointed out a while ago that allowing multiple threads to modify 
 the commitlog simultaneously (reserving space for each with a CAS first, the 
 way we do in the SlabAllocator.Region.allocate) can improve performance, 
 since you're not bottlenecking on a single thread to do all the copying and 
 CRC computation.
 Now that we use mmap'd CommitLog segments (CASSANDRA-3411) this becomes 
 doable.
 (moved from CASSANDRA-622, which was getting a bit muddled.)



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6218) Repair should allow repairing particular data centers to reduce WAN usage

2013-12-03 Thread sankalp kohli (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sankalp kohli updated CASSANDRA-6218:
-

Summary: Repair should allow repairing particular data centers to reduce 
WAN usage  (was: Reduce WAN traffic while doing repairs)

 Repair should allow repairing particular data centers to reduce WAN usage
 -

 Key: CASSANDRA-6218
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6218
 Project: Cassandra
  Issue Type: New Feature
  Components: Core, Tools
Reporter: sankalp kohli
Assignee: Jimmy Mårdell
Priority: Minor
 Fix For: 2.0.4

 Attachments: trunk-6218-v2.txt, trunk-6218-v3.patch, trunk-6218.txt


 The way we send out data that does not match over WAN can be improved. 
 Example: Say there are four nodes(A,B,C,D) which are replica of a range we 
 are repairing. A, B is in DC1 and C,D is in DC2. If A does not have the data 
 which other replicas have, then we will have following streams
 1) A to B and back
 2) A to C and back(Goes over WAN)
 3) A to D and back(Goes over WAN)
 One of the ways of doing it to reduce WAN traffic is this.
 1) Repair A and B only with each other and C and D with each other starting 
 at same time t. 
 2) Once these repairs have finished, A,B and C,D are in sync with respect to 
 time t. 
 3) Now run a repair between A and C, the streams which are exchanged as a 
 result of the diff will also be streamed to B and D via A and C(C and D 
 behaves like a proxy to the streams).
 For a replication of DC1:2,DC2:2, the WAN traffic will get reduced by 50% and 
 even more for higher replication factors. 
  



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6440) Repair should allow repairing particular endpoints to reduce WAN usage.

2013-12-03 Thread sankalp kohli (JIRA)
sankalp kohli created CASSANDRA-6440:


 Summary: Repair should allow repairing particular endpoints to 
reduce WAN usage. 
 Key: CASSANDRA-6440
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6440
 Project: Cassandra
  Issue Type: New Feature
Reporter: sankalp kohli
Priority: Minor


The way we send out data that does not match over WAN can be improved. 
Example: Say there are four nodes(A,B,C,D) which are replica of a range we are 
repairing. A, B is in DC1 and C,D is in DC2. If A does not have the data which 
other replicas have, then we will have following streams
1) A to B and back
2) A to C and back(Goes over WAN)
3) A to D and back(Goes over WAN)
One of the ways of doing it to reduce WAN traffic is this.
1) Repair A and B only with each other and C and D with each other starting at 
same time t. 
2) Once these repairs have finished, A,B and C,D are in sync with respect to 
time t. 
3) Now run a repair between A and C, the streams which are exchanged as a 
result of the diff will also be streamed to B and D via A and C(C and D behaves 
like a proxy to the streams).
For a replication of DC1:2,DC2:2, the WAN traffic will get reduced by 50% (see 
the sketch below) and even more for higher replication factors.

Another easy way to do this is to have the repair command take the nodes with 
which you want to repair. Then we can do something like this:
1) Run repair between (A and B) and (C and D)
2) Run repair between (A and C)
3) Run repair between (A and B) and (C and D)
But this will increase the traffic inside the DC, as we won't be doing the 
proxying.
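A back-of-envelope check of the 50% figure (an assumed stream-counting model, 
not repair internals: the coordinator otherwise streams with each remote 
replica directly, versus a single proxied cross-DC stream):

{code:title=WAN stream count (sketch)}
def wan_streams(remote_replicas, proxied):
    # proxied: only the two designated nodes talk across the WAN
    return 1 if proxied else remote_replicas

naive = wan_streams(2, proxied=False)   # A<->C and A<->D over WAN
proxy = wan_streams(2, proxied=True)    # only A<->C over WAN
print(1 - float(proxy) / naive)         # 0.5, i.e. 50% for DC1:2,DC2:2
{code}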



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6440) Repair should allow repairing particular endpoints to reduce WAN usage.

2013-12-03 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838110#comment-13838110
 ] 

Jonathan Ellis commented on CASSANDRA-6440:
---

(See discussion on CASSANDRA-6218.)

 Repair should allow repairing particular endpoints to reduce WAN usage. 
 

 Key: CASSANDRA-6440
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6440
 Project: Cassandra
  Issue Type: New Feature
Reporter: sankalp kohli
Priority: Minor

 The way we send out data that does not match over the WAN can be improved. 
 Example: say there are four nodes (A, B, C, D) which are replicas of a range we 
 are repairing. A and B are in DC1, and C and D are in DC2. If A does not have 
 the data which the other replicas have, then we will have the following streams:
 1) A to B and back
 2) A to C and back (goes over WAN)
 3) A to D and back (goes over WAN)
 One way to reduce WAN traffic is this:
 1) Repair A and B only with each other, and C and D with each other, starting 
 at the same time t. 
 2) Once these repairs have finished, A,B and C,D are in sync with respect to 
 time t. 
 3) Now run a repair between A and C; the streams which are exchanged as a 
 result of the diff will also be streamed to B and D via A and C (C and D 
 behave like proxies for the streams).
 For a replication of DC1:2,DC2:2, the WAN traffic will get reduced by 50%, and 
 even more for higher replication factors.
 Another easy way to do this is to have the repair command take the nodes you 
 want to repair with. Then we can do something like this:
 1) Run repair between (A and B) and (C and D)
 2) Run repair between (A and C)
 3) Run repair between (A and B) and (C and D)
 But this will increase the traffic inside the DC, as we won't be doing the 
 proxying.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5864) Scrub should discard the columns from CFMetaData.droppedColumns map (if they are old enough)

2013-12-03 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838140#comment-13838140
 ] 

Sylvain Lebresne commented on CASSANDRA-5864:
-

Are we sure it doesn't do it already? My reading of the code is that dropped 
columns are removed by CFS.removeDeleted(), and this regardless of gcBefore 
(as it should be). Since LazilyCompactedRow calls CFS.removeDeleted() and scrub 
uses LazilyCompactedRow... am I missing something?

As for whether scrub should or should not drop them, I'd go with "it doesn't 
really matter", so let's do whatever is easier.

 Scrub should discard the columns from CFMetaData.droppedColumns map (if they 
 are old enough)
 

 Key: CASSANDRA-5864
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5864
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Assignee: Tyler Hobbs
Priority: Minor
  Labels: scrub

 CASSANDRA-3919 restored ALTER TABLE DROP support in CQL3 and it would be nice 
 to make scrub dropped-columns-aware.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6441) Explore merging memtables directly with L1

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-6441:
--

Fix Version/s: 3.0

 Explore merging memtables directly with L1
 --

 Key: CASSANDRA-6441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6441
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Jonathan Ellis
Priority: Minor
  Labels: compaction
 Fix For: 3.0


 Currently, memtables flush to L0 and are then compacted with L1, so you 
 automatically have 100% write amplification for unique cells right off the 
 bat.
 http://dl.acm.org/citation.cfm?id=2213862 suggests splitting the memtable 
 into pieces corresponding to the ranges of the sstables in L1 and turning the 
 flush + compact into a single write -- that is, we'd compact the data in 
 the L1 sstable with the corresponding data in the memtable.
 This would add some complexity around blocking memtable sections until the 
 corresponding L1 piece is no longer involved in its own compaction with L2, 
 and probably a panic dump to the old L0 behavior if we run low on memory.  
 But in theory it sounds like a promising optimization.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6441) Explore merging memtables directly with L1

2013-12-03 Thread Jonathan Ellis (JIRA)
Jonathan Ellis created CASSANDRA-6441:
-

 Summary: Explore merging memtables directly with L1
 Key: CASSANDRA-6441
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6441
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Jonathan Ellis
Priority: Minor


Currently, memtables flush to L0 and are then compacted with L1, so you 
automatically have 100% write amplification for unique cells right off the bat.

http://dl.acm.org/citation.cfm?id=2213862 suggests splitting the memtable into 
pieces corresponding to the ranges of the sstables in L1 and turning the flush 
+ compact into a single write -- that is, we'd compact the data in the L1 
sstable with the corresponding data in the memtable.

This would add some complexity around blocking memtable sections until the 
corresponding L1 piece is no longer involved in its own compaction with L2, and 
probably a panic dump to the old L0 behavior if we run low on memory.  But in 
theory it sounds like a promising optimization.
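
To illustrate the splitting step, here is a minimal sketch under simplifying 
assumptions (plain longs stand in for decorated partition keys, and each L1 
sstable is represented only by its last key; all names are hypothetical):

{code}
import java.util.*;

class MemtableSplitSketch
{
    // memtableKeys: sorted partition keys currently in the memtable.
    // l1LastKeys:   sorted last key of each (non-overlapping) L1 sstable.
    // Returns one piece per L1 sstable, so that each piece can be merged with
    // exactly one L1 sstable in a single flush-plus-compact write.
    static List<List<Long>> splitByL1Ranges(SortedSet<Long> memtableKeys, long[] l1LastKeys)
    {
        List<List<Long>> pieces = new ArrayList<>();
        for (int i = 0; i < l1LastKeys.length; i++)
            pieces.add(new ArrayList<>());

        int sstable = 0;
        for (long key : memtableKeys)
        {
            // advance to the L1 sstable whose range covers this key
            while (sstable < l1LastKeys.length - 1 && key > l1LastKeys[sstable])
                sstable++;
            pieces.get(sstable).add(key);
        }
        return pieces;
    }
}
{code}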



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6442) CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking autocomplete

2013-12-03 Thread Russell Alexander Spitzer (JIRA)
Russell Alexander Spitzer created CASSANDRA-6442:


 Summary: CQLSH in CQL2 Mode (-2) lowercases keyspace after use 
statements breaking autocomplete
 Key: CASSANDRA-6442
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6442
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Russell Alexander Spitzer
Priority: Minor


When running Cqlsh in cql2 mode, using a keyspace lowercases it. 

{code}
cqlsh> CREATE KEYSPACE MixedCase WITH strategy_class = 'SimpleStrategy' AND 
strategy_options:replication_factor = 1;
cqlsh> use MixedCase ;
cqlsh:mixedcase> 
{code}

This is slightly annoying since cqlsh cannot autocomplete correctly with the 
wrong keyspace name. 
{code}
cqlsh:mixedcase> CREATE TABLE test (a int PRIMARY KEY , b int ) ;
cqlsh:mixedcase> SELECT * FROM [TAB PRESSED]
HiveMetaStore.  MixedCase.      cfs.            cfs_archive.    cql3ks.
dse_security.   dse_system.     fun.            system.         system_traces.
testKS.
{code}
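
For what it's worth, in CQL3 mode the case can be preserved by double-quoting 
the identifier; a sketch of the workaround, using the keyspace from the report 
(whether CQL2 mode honors the same quoting is exactly what's at issue here):

{code}
cqlsh> use "MixedCase";
cqlsh:MixedCase> 
{code}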



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Russell Alexander Spitzer (JIRA)
Russell Alexander Spitzer created CASSANDRA-6443:


 Summary: CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
Reporter: Russell Alexander Spitzer


Attempting to create a table in CQLSH -2 will always result in 
SizeTieredCompaction regardless of specified option.

{code}
cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;

CREATE TABLE asciiCFLVL (
  'key' ascii PRIMARY KEY,
  asciiB ascii,
  asciiA ascii
) WITH
  comment='' AND
  comparator=text AND
  read_repair_chance=0.10 AND
  gc_grace_seconds=864000 AND
  default_validation=text AND
  min_compaction_threshold=4 AND
  max_compaction_threshold=32 AND
  replicate_on_write='true' AND
  compaction_strategy_class='SizeTieredCompactionStrategy' AND
  compression_parameters:sstable_compression='LZ4Compressor';
{code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6442) CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking autocomplete

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6442:
-

Reproduced In: 2.0.2, 1.2.12
Since Version: 1.2.12

 CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking 
 autocomplete
 --

 Key: CASSANDRA-6442
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6442
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Russell Alexander Spitzer
Priority: Minor

 When running Cqlsh in cql2 mode, using a keyspace lowercases it. 
 {code}
 cqlsh> CREATE KEYSPACE MixedCase WITH strategy_class = 'SimpleStrategy' AND 
 strategy_options:replication_factor = 1;
 cqlsh> use MixedCase ;
 cqlsh:mixedcase> 
 {code}
 This is slightly annoying since cqlsh cannot autocomplete correctly with the 
 wrong keyspace name. 
 {code}
 cqlsh:mixedcase> CREATE TABLE test (a int PRIMARY KEY , b int ) ;
 cqlsh:mixedcase> SELECT * FROM [TAB PRESSED]
 HiveMetaStore.  MixedCase.      cfs.            cfs_archive.    cql3ks.
 dse_security.   dse_system.     fun.            system.         system_traces.
 testKS.
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6442) CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking autocomplete

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6442:
-

Since Version: 1.2.11  (was: 1.2.12)

 CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking 
 autocomplete
 --

 Key: CASSANDRA-6442
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6442
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Russell Alexander Spitzer
Priority: Minor

 When running Cqlsh in cql2 mode, using a keyspace lowercases it. 
 {code}
 cqlsh> CREATE KEYSPACE MixedCase WITH strategy_class = 'SimpleStrategy' AND 
 strategy_options:replication_factor = 1;
 cqlsh> use MixedCase ;
 cqlsh:mixedcase> 
 {code}
 This is slightly annoying since cqlsh cannot autocomplete correctly with the 
 wrong keyspace name. 
 {code}
 cqlsh:mixedcase> CREATE TABLE test (a int PRIMARY KEY , b int ) ;
 cqlsh:mixedcase> SELECT * FROM [TAB PRESSED]
 HiveMetaStore.  MixedCase.      cfs.            cfs_archive.    cql3ks.
 dse_security.   dse_system.     fun.            system.         system_traces.
 testKS.
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6442) CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking autocomplete

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6442:
-

Reproduced In: 2.0.2, 1.2.11  (was: 1.2.12, 2.0.2)

 CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking 
 autocomplete
 --

 Key: CASSANDRA-6442
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6442
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Russell Alexander Spitzer
Priority: Minor

 When running Cqlsh in cql2 mode, using a keyspace lowercases it. 
 {code}
 cqlsh> CREATE KEYSPACE MixedCase WITH strategy_class = 'SimpleStrategy' AND 
 strategy_options:replication_factor = 1;
 cqlsh> use MixedCase ;
 cqlsh:mixedcase> 
 {code}
 This is slightly annoying since cqlsh cannot autocomplete correctly with the 
 wrong keyspace name. 
 {code}
 cqlsh:mixedcase> CREATE TABLE test (a int PRIMARY KEY , b int ) ;
 cqlsh:mixedcase> SELECT * FROM [TAB PRESSED]
 HiveMetaStore.  MixedCase.      cfs.            cfs_archive.    cql3ks.
 dse_security.   dse_system.     fun.            system.         system_traces.
 testKS.
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6443:
-

Since Version: 1.2.11  (was: 1.2.12)

 CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 --

 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
Reporter: Russell Alexander Spitzer

 Attempting to create a table in CQLSH -2 will always result in 
 SizeTieredCompaction regardless of specified option.
 {code}
 cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
 asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
 cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;
 CREATE TABLE asciiCFLVL (
   'key' ascii PRIMARY KEY,
   asciiB ascii,
   asciiA ascii
 ) WITH
   comment='' AND
   comparator=text AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write='true' AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   compression_parameters:sstable_compression='LZ4Compressor';
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6443:
-

Reproduced In: 2.0.2, 1.2.11  (was: 1.2.12, 2.0.2)

 CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 --

 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
Reporter: Russell Alexander Spitzer

 Attempting to create a table in CQLSH -2 will always result in 
 SizeTieredCompaction regardless of specified option.
 {code}
 cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
 asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
 cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;
 CREATE TABLE asciiCFLVL (
   'key' ascii PRIMARY KEY,
   asciiB ascii,
   asciiA ascii
 ) WITH
   comment='' AND
   comparator=text AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write='true' AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   compression_parameters:sstable_compression='LZ4Compressor';
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6442) CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking autocomplete

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-6442.
---

   Resolution: Won't Fix
Reproduced In: 2.0.2, 1.2.11  (was: 1.2.11, 2.0.2)

cql 2 mode is purely legacy code at this point; we're not expending effort on 
enhancements.

 CQLSH in CQL2 Mode (-2) lowercases keyspace after use statements breaking 
 autocomplete
 --

 Key: CASSANDRA-6442
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6442
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Russell Alexander Spitzer
Priority: Minor

 When running Cqlsh in cql2 mode, using a keyspace lowercases it. 
 {code}
 cqlsh> CREATE KEYSPACE MixedCase WITH strategy_class = 'SimpleStrategy' AND 
 strategy_options:replication_factor = 1;
 cqlsh> use MixedCase ;
 cqlsh:mixedcase> 
 {code}
 This is slightly annoying since cqlsh cannot autocomplete correctly with the 
 wrong keyspace name. 
 {code}
 cqlsh:mixedcase> CREATE TABLE test (a int PRIMARY KEY , b int ) ;
 cqlsh:mixedcase> SELECT * FROM [TAB PRESSED]
 HiveMetaStore.  MixedCase.      cfs.            cfs_archive.    cql3ks.
 dse_security.   dse_system.     fun.            system.         system_traces.
 testKS.
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6435) nodetool outputs xss and jamm errors in 1.2.12

2013-12-03 Thread Mikhail Stepura (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838234#comment-13838234
 ] 

Mikhail Stepura commented on CASSANDRA-6435:


For reference: 
https://github.com/apache/cassandra/commit/3f66fbfc63c728778325e3be958019a0da1b47d5

 nodetool outputs xss and jamm errors in 1.2.12
 --

 Key: CASSANDRA-6435
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6435
 Project: Cassandra
  Issue Type: Bug
Reporter: Karl Mueller
Assignee: Brandon Williams
Priority: Minor

 Since 1.2.12, just running nodetool is producing this output. Probably this 
 is related to CASSANDRA-6273.
 it's unclear to me whether jamm is actually not being loaded, but clearly 
 nodetool should not be having this output, which is likely from 
 cassandra-env.sh
 [cassandra@dev-cass00 cassandra]$ /data2/cassandra/bin/nodetool ring
 xss =  -ea -javaagent:/data2/cassandra/bin/../lib/jamm-0.2.5.jar 
 -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms14G -Xmx14G -Xmn1G 
 -XX:+HeapDumpOnOutOfMemoryError -Xss256k
 Note: Ownership information does not include topology; for complete 
 information, specify a keyspace
 Datacenter: datacenter1
 ==
 Address      Rack    Status  State   Load        Owns    
 Token
 
 170141183460469231731687303715884105727
 10.93.15.10  rack1   Up Normal  123.82 GB   20.00%  
 34028236692093846346337460743176821145
 10.93.15.11  rack1   Up Normal  124 GB  20.00%  
 68056473384187692692674921486353642290
 10.93.15.12  rack1   Up Normal  123.97 GB   20.00%  
 102084710076281539039012382229530463436
 10.93.15.13  rack1   Up Normal  124.03 GB   20.00%  
 136112946768375385385349842972707284581
 10.93.15.14  rack1   Up Normal  123.93 GB   20.00%  
 170141183460469231731687303715884105727
 ERROR 16:20:01,408 Unable to initialize MemoryMeter (jamm not specified as 
 javaagent).  This means Cassandra will be unable to measure object sizes 
 accurately and may consequently OOM.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Assigned] (CASSANDRA-5879) cqlsh shouldn't lower case keyspace or column family names

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer reassigned CASSANDRA-5879:


Assignee: Russell Alexander Spitzer

 cqlsh shouldn't lower case keyspace or column family names
 --

 Key: CASSANDRA-5879
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5879
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Richard Low
Assignee: Russell Alexander Spitzer
Priority: Minor

 Keyspace and column family names appear to be case sensitive.  But cqlsh 
 converts them to lower case.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-5879) cqlsh shouldn't lower case keyspace or column family names

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-5879:
-

Assignee: (was: Russell Alexander Spitzer)

 cqlsh shouldn't lower case keyspace or column family names
 --

 Key: CASSANDRA-5879
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5879
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Richard Low
Priority: Minor

 Keyspace and column family names appear to be case sensitive.  But cqlsh 
 converts them to lower case.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Russell Alexander Spitzer updated CASSANDRA-6443:
-

Component/s: Tools
 Drivers (now out of tree)

 CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 --

 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
  Components: Drivers (now out of tree), Tools
Reporter: Russell Alexander Spitzer

 Attempting to create a table in CQLSH -2 will always result in 
 SizeTieredCompaction regardless of specified option.
 {code}
 cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
 asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
 cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;
 CREATE TABLE asciiCFLVL (
   'key' ascii PRIMARY KEY,
   asciiB ascii,
   asciiA ascii
 ) WITH
   comment='' AND
   comparator=text AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write='true' AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   compression_parameters:sstable_compression='LZ4Compressor';
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Russell Alexander Spitzer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838276#comment-13838276
 ] 

Russell Alexander Spitzer edited comment on CASSANDRA-6443 at 12/3/13 10:52 PM:


The actual bug seems to be in the python cql module. 

{code}
>>> import cql
>>> con=cql.connect('127.0.0.1')
>>> cur=con.cursor()
>>> cur.execute('use testKS')
True
>>> cur.execute("CREATE TABLE tb (a int PRIMARY KEY, b int) with compaction_strategy_class = 'LeveledCompactionStrategy'")
True
>>> quit()
{code}
In cqlsh
{code}
cqlsh> DESCRIBE table testKS.tb;

CREATE TABLE tb (
  a int PRIMARY KEY,
  b int
) WITH COMPACT STORAGE AND
  bloom_filter_fp_chance=0.01 AND
  caching='KEYS_ONLY' AND
  comment='' AND
  dclocal_read_repair_chance=0.00 AND
  gc_grace_seconds=864000 AND
  read_repair_chance=0.10 AND
  replicate_on_write='true' AND
  populate_io_cache_on_flush='false' AND
  compaction={'class': 'SizeTieredCompactionStrategy'} AND
  compression={'sstable_compression': 'LZ4Compressor'};
{code}


was (Author: rspitzer):
The actual bug seems to be in the python cql module. 

{code}
>>> import cql
>>> con=cql.connect('127.0.0.1')
>>> cur=con.cursor()
>>> cur.execute('use testKS')
True
>>> cur.execute("CREATE TABLE tb (a int PRIMARY KEY, b int) with compaction_strategy_class = 'MemoryOnlyStrategy'")
True
>>> quit()
{code}
In cqlsh
{code}
cqlsh> DESCRIBE table testKS.tb;

CREATE TABLE tb (
  a int PRIMARY KEY,
  b int
) WITH COMPACT STORAGE AND
  bloom_filter_fp_chance=0.01 AND
  caching='KEYS_ONLY' AND
  comment='' AND
  dclocal_read_repair_chance=0.00 AND
  gc_grace_seconds=864000 AND
  read_repair_chance=0.10 AND
  replicate_on_write='true' AND
  populate_io_cache_on_flush='false' AND
  compaction={'class': 'SizeTieredCompactionStrategy'} AND
  compression={'sstable_compression': 'LZ4Compressor'};
{code}

 CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 --

 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
  Components: Drivers (now out of tree), Tools
Reporter: Russell Alexander Spitzer

 Attempting to create a table in CQLSH -2 will always result in 
 SizeTieredCompaction regardless of specified option.
 {code}
 cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
 asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
 cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;
 CREATE TABLE asciiCFLVL (
   'key' ascii PRIMARY KEY,
   asciiB ascii,
   asciiA ascii
 ) WITH
   comment='' AND
   comparator=text AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write='true' AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   compression_parameters:sstable_compression='LZ4Compressor';
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-5201) Cassandra/Hadoop does not support current Hadoop releases

2013-12-03 Thread Benjamin Coverston (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-5201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Coverston updated CASSANDRA-5201:
--

Attachment: hadoopCompat.patch

Poking around at other projects, this generally gets solved in one of two ways: 
ship two versions of the Hadoop integration (one compiled for the old API, and 
one compiled for the new), or use a little reflection to make things work 
across the board.

I'm attaching a patch that uses the hadoop-compat subproject of elephant-bird. 
This will allow us to compile a single binary and run with both the new and old 
context objects.

I've tested this patch with HDP 2.0 and Apache Hadoop 1.0.4, and it works fine 
with both (including Hive in DSE). With Pig I needed to compile our (optional) 
Pig dependency with:

bq. ant clean jar-withouthadoop -Dhadoopversion=23

That's only really needed if you're using one of the current versions of thrift 
with the new JobContext.
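
For the record, a minimal sketch of the reflection trick (illustrative only; the 
attached patch uses elephant-bird's hadoop-compat rather than this exact code). 
The IncompatibleClassChangeError comes from the linkage baked in at the call 
site, so resolving JobContext.getConfiguration() at runtime works whether 
JobContext is the old class or the new interface:

{code}
import java.lang.reflect.Method;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.JobContext;

class HadoopCompatSketch
{
    static Configuration getConfiguration(JobContext context)
    {
        try
        {
            // Reflection defers method resolution to runtime, so the same
            // binary runs against both the class and the interface version.
            Method m = context.getClass().getMethod("getConfiguration");
            return (Configuration) m.invoke(context);
        }
        catch (Exception e)
        {
            throw new RuntimeException("Unable to invoke getConfiguration", e);
        }
    }
}
{code}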


 Cassandra/Hadoop does not support current Hadoop releases
 -

 Key: CASSANDRA-5201
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5201
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.2.0
Reporter: Brian Jeltema
Assignee: Dave Brosius
 Attachments: 5201_a.txt, hadoopCompat.patch


 Using Hadoop 0.22.0 with Cassandra results in the stack trace below.
 It appears that version 0.21+ changed org.apache.hadoop.mapreduce.JobContext
 from a class to an interface.
 Exception in thread "main" java.lang.IncompatibleClassChangeError: Found 
 interface org.apache.hadoop.mapreduce.JobContext, but class was expected
   at 
 org.apache.cassandra.hadoop.ColumnFamilyInputFormat.getSplits(ColumnFamilyInputFormat.java:103)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.writeNewSplits(JobSubmitter.java:445)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.writeSplits(JobSubmitter.java:462)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:357)
   at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1045)
   at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1042)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1153)
   at org.apache.hadoop.mapreduce.Job.submit(Job.java:1042)
   at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1062)
   at MyHadoopApp.run(MyHadoopApp.java:163)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
   at MyHadoopApp.main(MyHadoopApp.java:82)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:192)



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6432) Calculate estimated Cql row count per token range

2013-12-03 Thread Alex Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838301#comment-13838301
 ] 

Alex Liu commented on CASSANDRA-6432:
-

SSTableMetadata.estimatedColumnCount collects column counts per SSTable, but 
there are no column counts per key, so we can't use the current statistics to 
calculate the columns per token range.

The same column can be distributed across multiple sstables, so we would need 
to merge the columns to count the unique ones, which is not practical at scale. 

SELECT count(*) FROM cf scans all the rows, so it's not useful for big data.
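
A small sketch of why summing the per-sstable statistics over-counts (purely 
illustrative; plain string sets stand in for per-sstable column statistics):

{code}
import java.util.*;

class ColumnCountSketch
{
    // Each set holds the column names one sstable stores for the token range.
    static long naiveSum(List<Set<String>> sstables)
    {
        long sum = 0;
        for (Set<String> s : sstables)
            sum += s.size();       // a column present in N sstables counts N times
        return sum;
    }

    static long uniqueCount(List<Set<String>> sstables)
    {
        Set<String> merged = new HashSet<>();
        for (Set<String> s : sstables)
            merged.addAll(s);      // this merge is exactly what we can't afford
        return merged.size();
    }
}
{code}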

 Calculate estimated Cql row count per token range
 -

 Key: CASSANDRA-6432
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6432
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Reporter: Alex Liu

 CASSANDRA-6311 uses the client side to calculate the actual CF row count for the 
 hadoop job. We need to fix it by using the CQL row count, which needs an 
 estimated CQL row count per token range.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-6364) There should be different disk_failure_policies for data and commit volumes or commit volume failure should always cause node exit

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-6364:
--

  Component/s: Core
Fix Version/s: 2.0.4
 Assignee: Benedict  (was: Aleksey Yeschenko)

Would like to fix this in 2.0.x as well as 2.1 but I will settle for 2.1 if 
that's onerous.

 There should be different disk_failure_policies for data and commit volumes 
 or commit volume failure should always cause node exit
 --

 Key: CASSANDRA-6364
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6364
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
 Environment: JBOD, single dedicated commit disk
Reporter: J. Ryan Earl
Assignee: Benedict
 Fix For: 2.0.4


 We're doing fault testing on a pre-production Cassandra cluster.  One of the 
 tests was to simulation failure of the commit volume/disk, which in our case 
 is on a dedicated disk.  We expected failure of the commit volume to be 
 handled somehow, but what we found was that no action was taken by Cassandra 
 when the commit volume failed.  We simulated this simply by pulling the 
 physical disk that backed the commit volume, which resulted in filesystem I/O 
 errors on the mount point.
 What then happened was that the Cassandra Heap filled up to the point that it 
 was spending 90% of its time doing garbage collection.  No errors were logged 
 in regards to the failed commit volume.  Gossip on other nodes in the cluster 
 eventually flagged the node as down.  Gossip on the local node showed itself 
 as up, and all other nodes as down.
 The most serious problem was that connections to the coordinator on this node 
 became very slow due to the on-going GC, as I assume uncommitted writes piled 
 up on the JVM heap.  What we believe should have happened is that Cassandra 
 should have caught the I/O error and exited with a useful log message, or 
 otherwise done some sort of useful cleanup.  Otherwise the node goes into a 
 sort of Zombie state, spending most of its time in GC, and thus slowing down 
 any transactions that happen to use the coordinator on said node.
 A limit on in-memory, unflushed writes before refusing requests may also 
 work.  Point being, something should be done to handle the commit volume 
 dying as doing nothing results in affecting the entire cluster.  I should 
 note, we are using: disk_failure_policy: best_effort



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5201) Cassandra/Hadoop does not support current Hadoop releases

2013-12-03 Thread Benjamin Coverston (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838325#comment-13838325
 ] 

Benjamin Coverston commented on CASSANDRA-5201:
---

These changes also depend on CASSANDRA-6309 for anything to work.

 Cassandra/Hadoop does not support current Hadoop releases
 -

 Key: CASSANDRA-5201
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5201
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 1.2.0
Reporter: Brian Jeltema
Assignee: Dave Brosius
 Attachments: 5201_a.txt, hadoopCompat.patch


 Using Hadoop 0.22.0 with Cassandra results in the stack trace below.
 It appears that version 0.21+ changed org.apache.hadoop.mapreduce.JobContext
 from a class to an interface.
 Exception in thread "main" java.lang.IncompatibleClassChangeError: Found 
 interface org.apache.hadoop.mapreduce.JobContext, but class was expected
   at 
 org.apache.cassandra.hadoop.ColumnFamilyInputFormat.getSplits(ColumnFamilyInputFormat.java:103)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.writeNewSplits(JobSubmitter.java:445)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.writeSplits(JobSubmitter.java:462)
   at 
 org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:357)
   at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1045)
   at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1042)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1153)
   at org.apache.hadoop.mapreduce.Job.submit(Job.java:1042)
   at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1062)
   at MyHadoopApp.run(MyHadoopApp.java:163)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
   at MyHadoopApp.main(MyHadoopApp.java:82)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:192)



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Resolved] (CASSANDRA-6443) CQLSH in CQL2 mode (-2) Cannot set compaction_strategy

2013-12-03 Thread Aleksey Yeschenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aleksey Yeschenko resolved CASSANDRA-6443.
--

   Resolution: Won't Fix
Reproduced In: 2.0.2, 1.2.11  (was: 1.2.11, 2.0.2)

CQL2 is dead. Use CQL3.

 CQLSH in CQL2 mode (-2) Cannot set compaction_strategy
 --

 Key: CASSANDRA-6443
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6443
 Project: Cassandra
  Issue Type: Bug
  Components: Drivers (now out of tree), Tools
Reporter: Russell Alexander Spitzer

 Attempting to create a table in CQLSH -2 will always result in 
 SizeTieredCompaction regardless of specified option.
 {code}
 cqlsh:fun> CREATE TABLE asciiCFLVL (  key ascii PRIMARY KEY,  asciiA ascii,  
 asciiB ascii  ) with compaction_strategy_class = 'LeveledCompactionStrategy' ;
 cqlsh:fun> DESCRIBE TABLE asciiCFLVL ;
 CREATE TABLE asciiCFLVL (
   'key' ascii PRIMARY KEY,
   asciiB ascii,
   asciiA ascii
 ) WITH
   comment='' AND
   comparator=text AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write='true' AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   compression_parameters:sstable_compression='LZ4Compressor';
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (CASSANDRA-4476) Support 2ndary index queries with only non-EQ clauses

2013-12-03 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4476:
--

Assignee: (was: Marcus Eriksson)

[~slebresne] how much more complicated does CASSANDRA-4511 make this?

 Support 2ndary index queries with only non-EQ clauses
 -

 Key: CASSANDRA-4476
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4476
 Project: Cassandra
  Issue Type: Improvement
  Components: API, Core
Reporter: Sylvain Lebresne
Priority: Minor
 Fix For: 2.1


 Currently, a query that uses 2ndary indexes must have at least one EQ clause 
 (on an indexed column). Given that indexed CFs are local (and use 
 LocalPartitioner that order the row by the type of the indexed column), we 
 should extend 2ndary indexes to allow querying indexed columns even when no 
 EQ clause is provided.
 As far as I can tell, the main problem to solve for this is to update 
 KeysSearcher.highestSelectivityPredicate(). I.e. how do we estimate the 
 selectivity of non-EQ clauses? I note however that if we can do that estimate 
 reasonably accurately, this might provide better performance even for index 
 queries that have both EQ and non-EQ clauses, because some non-EQ clauses may 
 have a much better selectivity than EQ ones (say you index both the user country 
 and birth date; for SELECT * FROM users WHERE country = 'US' AND birthdate > 
 'Jan 2009' AND birthdate < 'July 2009', you'd better use the birthdate index 
 first).



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6410) gossip memory usage improvement

2013-12-03 Thread Quentin Conner (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838372#comment-13838372
 ] 

Quentin Conner commented on CASSANDRA-6410:
---

Chris, the Heap memory usage numbers are from a single node.  No aggregation 
across the cluster.

 gossip memory usage improvement
 ---

 Key: CASSANDRA-6410
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6410
 Project: Cassandra
  Issue Type: Improvement
Reporter: Quentin Conner
Assignee: Jonathan Ellis
Priority: Minor
 Fix For: 2.0.4

 Attachments: 6410-EnumMap.txt, gossip-intern.txt


 It looks to me that any given node will need ~2 MB of Java VM heap for each 
 other node in the ring.  This was observed with num_tokens=512 but still 
 seems excessive.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (CASSANDRA-6444) Have a nodetool command which emits true data size

2013-12-03 Thread sankalp kohli (JIRA)
sankalp kohli created CASSANDRA-6444:


 Summary: Have a nodetool command which emits true data size
 Key: CASSANDRA-6444
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6444
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: sankalp kohli
Priority: Minor


Sometimes we have an unbalanced cluster, and it is difficult to know whether 
some nodes are taking more space because updates have not yet been compacted 
away or because of the distribution of data.
So we need to know the true, fully compacted data size. 
We can compute this with a validation compaction, summing up the size of all rows. 
We should also emit such a sum during repair, when the Merkle tree is being 
generated. 
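
A minimal sketch of the idea, assuming a hypothetical iterator over merged 
(i.e. fully compacted) rows such as a validation compaction already produces:

{code}
interface MergedRowIterator
{
    boolean hasNext();
    long nextRowSerializedSize();  // size of the row after merging all versions
}

class TrueDataSizeSketch
{
    static long trueDataSize(MergedRowIterator rows)
    {
        long total = 0;
        while (rows.hasNext())
            total += rows.nextRowSerializedSize();  // overwrites already merged away
        return total;
    }
}
{code}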




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5839) Save repair data to system table

2013-12-03 Thread Jason Brown (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838626#comment-13838626
 ] 

Jason Brown commented on CASSANDRA-5839:


Also, feel free to make me the reviewer for any patches when you are ready.

 Save repair data to system table
 

 Key: CASSANDRA-5839
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5839
 Project: Cassandra
  Issue Type: New Feature
  Components: Core, Tools
Reporter: Jonathan Ellis
Assignee: Jimmy Mårdell
Priority: Minor
 Fix For: 2.0.4


 As noted in CASSANDRA-2405, it would be useful to store repair results, 
 particularly with sub-range repair available (CASSANDRA-5280).



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5839) Save repair data to system table

2013-12-03 Thread Jason Brown (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838625#comment-13838625
 ] 

Jason Brown commented on CASSANDRA-5839:


[~yarin] Not a problem if you want to knock it out. If you are interested, I 
got probably about half way through this ticket a few months ago, and I've 
pushed my work here: https://github.com/jasobrown/cassandra/tree/5839. Might be 
useful, maybe not, but at least it's there if you want to compare notes.

 Save repair data to system table
 

 Key: CASSANDRA-5839
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5839
 Project: Cassandra
  Issue Type: New Feature
  Components: Core, Tools
Reporter: Jonathan Ellis
Assignee: Jimmy Mårdell
Priority: Minor
 Fix For: 2.0.4


 As noted in CASSANDRA-2405, it would be useful to store repair results, 
 particularly with sub-range repair available (CASSANDRA-5280).



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-5549) Remove Table.switchLock

2013-12-03 Thread Vijay (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-5549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838627#comment-13838627
 ] 

Vijay commented on CASSANDRA-5549:
--

{quote}
Without switch lock, we won't have anything preventing writes coming through 
when we're over-burdened with memory use by memtables.
{quote}
I must be missing something: how does switching the RW lock to a kind of CAS 
operation change these semantics?
Are we talking about an additional requirement/enhancement for this ticket?

{quote}
 When we flush a memtable we release permits equal to the estimated size of 
each RM
{quote}
IMHO, that might not be good enough, since Java's memory overhead is not 
considered. And calculating the object size is not cheap either.
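
For clarity, the permit scheme under discussion could look roughly like this (a 
hypothetical sketch, not the actual patch):

{code}
import java.util.concurrent.Semaphore;

class MemtableMemoryThrottle
{
    private final Semaphore permits;

    MemtableMemoryThrottle(int memtableBudgetBytes)
    {
        permits = new Semaphore(memtableBudgetBytes);
    }

    void beforeWrite(int estimatedMutationBytes) throws InterruptedException
    {
        permits.acquire(estimatedMutationBytes);  // blocks writers once the budget is spent
    }

    void afterFlush(int flushedBytes)
    {
        permits.release(flushedBytes);  // note: the estimate ignores JVM object overhead
    }
}
{code}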

 Remove Table.switchLock
 ---

 Key: CASSANDRA-5549
 URL: https://issues.apache.org/jira/browse/CASSANDRA-5549
 Project: Cassandra
  Issue Type: Bug
Reporter: Jonathan Ellis
Assignee: Vijay
  Labels: performance
 Fix For: 2.1

 Attachments: 5549-removed-switchlock.png, 5549-sunnyvale.png


 As discussed in CASSANDRA-5422, Table.switchLock is a bottleneck on the write 
 path.  ReentrantReadWriteLock is not lightweight, even if there is no 
 contention per se between readers and writers of the lock (in Cassandra, 
 memtable updates and switches).



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6413) Saved KeyCache prints success to log; but no file present

2013-12-03 Thread Mikhail Stepura (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838642#comment-13838642
 ] 

Mikhail Stepura commented on CASSANDRA-6413:


Silly me :) The bug is so obvious, but only manifests itself if both Key cache 
AND row cache are enabled

{code:title=org.apache.cassandra.cache.AutoSavingCache.java|borderStyle=solid}
private void deleteOldCacheFiles()
{
    File savedCachesDir = new File(DatabaseDescriptor.getSavedCachesLocation());

    if (savedCachesDir.exists() && savedCachesDir.isDirectory())
    {
        for (File file : savedCachesDir.listFiles())
        {
            if (file.isFile() && file.getName().endsWith(cacheType.toString()))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }

            if (file.isFile() && file.getName().endsWith(CURRENT_VERSION + ".db"))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }
        }
    }
}
{code}

So, each cache deletes FILES FROM ALL CACHES from the {{saved_caches}} directory 
and then happily writes its own single file.
The last save wins.
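
The fix direction would be to match only this cache's own files; a hypothetical 
sketch of such a filter (not the exact code of any patch):

{code}
import java.io.File;

class OwnCacheFilesOnly
{
    // Keep only files that belong to this cache type AND carry the version
    // suffix, instead of deleting any file that matches either condition.
    static boolean belongsToThisCache(File file, String cacheType, String currentVersion)
    {
        String name = file.getName();
        return file.isFile()
            && name.contains(cacheType)
            && name.endsWith(currentVersion + ".db");
    }
}
{code}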

 Saved KeyCache prints success to log; but no file present
 -

 Key: CASSANDRA-6413
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6413
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: 1.2.11
Reporter: Chris Burroughs
Assignee: Mikhail Stepura

 Cluster has a single keyspace with 3 CFs.  All used to have ROWS_ONLY, two 
 were switched to KEYS_ONLY about 2 days ago.  Row cache continues to save 
 fine, but there is no saved key cache file present on any node in the cluster.
 {noformat}
 6925: INFO [CompactionExecutor:12] 2013-11-27 10:12:02,284 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 118 ms
 6941:DEBUG [CompactionExecutor:14] 2013-11-27 10:17:02,163 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 6942: INFO [CompactionExecutor:14] 2013-11-27 10:17:02,310 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 146 ms
 8745:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,140 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8746: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 143 ms
 8747:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 8748: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 289) Saved KeyCache (21181 items) in 342 ms
 8749:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8750: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 134 ms
 8751:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8752: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 8753:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8754: INFO [CompactionExecutor:6] 2013-11-27 10:37:26,026 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 9915:DEBUG [CompactionExecutor:18] 2013-11-27 10:42:01,851 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 9916: INFO [CompactionExecutor:18] 2013-11-27 10:42:02,185 
 AutoSavingCache.java (line 289) Saved KeyCache (22067 items) in 334 ms
 9917:DEBUG [CompactionExecutor:17] 2013-11-27 10:42:02,279 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 9918: INFO [CompactionExecutor:17] 2013-11-27 10:42:02,411 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 131 ms
 {noformat}
 {noformat}
 $ ll ~/shared/saved_caches/
 total 3472
 -rw-rw-r-- 1 cassandra cassandra 3551608 Nov 27 10:42 Foo-Bar-RowCache-b.db
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (CASSANDRA-6413) Saved KeyCache prints success to log; but no file present

2013-12-03 Thread Mikhail Stepura (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838642#comment-13838642
 ] 

Mikhail Stepura edited comment on CASSANDRA-6413 at 12/4/13 6:29 AM:
-

Silly me :) The bug is so obvious, but only manifests itself if both Key cache 
AND row cache are enabled

{code:title=org.apache.cassandra.cache.AutoSavingCache.java|borderStyle=solid}
private void deleteOldCacheFiles()
{
    File savedCachesDir = new File(DatabaseDescriptor.getSavedCachesLocation());

    if (savedCachesDir.exists() && savedCachesDir.isDirectory())
    {
        for (File file : savedCachesDir.listFiles())
        {
            if (file.isFile() && file.getName().endsWith(cacheType.toString()))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }

            if (file.isFile() && file.getName().endsWith(CURRENT_VERSION + ".db"))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }
        }
    }
}
{code}

So, each cache deletes FILES FROM ALL CACHES from the {{saved_caches}} directory 
and then happily writes its files.
The last save wins.


was (Author: mishail):
Silly me :) The bug is so obvious, but only manifests itself if both Key cache 
AND row cache are enabled

{code:title=org.apache.cassandra.cache.AutoSavingCache.java|borderStyle=solid}
private void deleteOldCacheFiles()
{
    File savedCachesDir = new File(DatabaseDescriptor.getSavedCachesLocation());

    if (savedCachesDir.exists() && savedCachesDir.isDirectory())
    {
        for (File file : savedCachesDir.listFiles())
        {
            if (file.isFile() && file.getName().endsWith(cacheType.toString()))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }

            if (file.isFile() && file.getName().endsWith(CURRENT_VERSION + ".db"))
            {
                if (!file.delete())
                    logger.warn("Failed to delete {}", file.getAbsolutePath());
            }
        }
    }
}
{code}

So, each cache deletes FILES FROM ALL CACHES from the {{saved_caches}} directory 
and then happily writes its own single file.
The last save wins.

 Saved KeyCache prints success to log; but no file present
 -

 Key: CASSANDRA-6413
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6413
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: 1.2.11
Reporter: Chris Burroughs
Assignee: Mikhail Stepura

 Cluster has a single keyspace with 3 CFs.  All used to have ROWS_ONLY, two 
 were switched to KEYS_ONLY about 2 days ago.  Row cache continues to save 
 fine, but there is no saved key cache file present on any node in the cluster.
 {noformat}
 6925: INFO [CompactionExecutor:12] 2013-11-27 10:12:02,284 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 118 ms
 6941:DEBUG [CompactionExecutor:14] 2013-11-27 10:17:02,163 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 6942: INFO [CompactionExecutor:14] 2013-11-27 10:17:02,310 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 146 ms
 8745:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,140 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8746: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 143 ms
 8747:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 8748: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 289) Saved KeyCache (21181 items) in 342 ms
 8749:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8750: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 134 ms
 8751:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8752: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 8753:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8754: INFO [CompactionExecutor:6] 2013-11-27 

[jira] [Updated] (CASSANDRA-6413) Saved KeyCache prints success to log; but no file present

2013-12-03 Thread Mikhail Stepura (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-6413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Stepura updated CASSANDRA-6413:
---

Attachment: CASSANDRA-1.2-6413.patch

Patch: each cache should delete only its own files

 Saved KeyCache prints success to log; but no file present
 -

 Key: CASSANDRA-6413
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6413
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: 1.2.11
Reporter: Chris Burroughs
Assignee: Mikhail Stepura
 Attachments: CASSANDRA-1.2-6413.patch


 Cluster has a single keyspace with 3 CFs.  All used to have ROWS_ONLY, two 
 were switched to KEYS_ONLY about 2 days ago.  Row cache continues to save 
 fine, but there is no saved key cache file present on any node in the cluster.
 {noformat}
 6925: INFO [CompactionExecutor:12] 2013-11-27 10:12:02,284 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 118 ms
 6941:DEBUG [CompactionExecutor:14] 2013-11-27 10:17:02,163 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 6942: INFO [CompactionExecutor:14] 2013-11-27 10:17:02,310 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 146 ms
 8745:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,140 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8746: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 143 ms
 8747:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 8748: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 289) Saved KeyCache (21181 items) in 342 ms
 8749:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8750: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 134 ms
 8751:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8752: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 8753:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8754: INFO [CompactionExecutor:6] 2013-11-27 10:37:26,026 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 9915:DEBUG [CompactionExecutor:18] 2013-11-27 10:42:01,851 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 9916: INFO [CompactionExecutor:18] 2013-11-27 10:42:02,185 
 AutoSavingCache.java (line 289) Saved KeyCache (22067 items) in 334 ms
 9917:DEBUG [CompactionExecutor:17] 2013-11-27 10:42:02,279 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 9918: INFO [CompactionExecutor:17] 2013-11-27 10:42:02,411 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 131 ms
 {noformat}
 {noformat}
 $ ll ~/shared/saved_caches/
 total 3472
 -rw-rw-r-- 1 cassandra cassandra 3551608 Nov 27 10:42 Foo-Bar-RowCache-b.db
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6413) Saved KeyCache prints success to log; but no file present

2013-12-03 Thread Mikhail Stepura (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838670#comment-13838670
 ] 

Mikhail Stepura commented on CASSANDRA-6413:


Looks like it was introduced in 
https://github.com/apache/cassandra/commit/cfe585c2c420c6e8445eb4c3309b09db8cf134ac
 for CASSANDRA-3762

 Saved KeyCache prints success to log; but no file present
 -

 Key: CASSANDRA-6413
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6413
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: 1.2.11
Reporter: Chris Burroughs
Assignee: Mikhail Stepura
 Attachments: CASSANDRA-1.2-6413.patch


 Cluster has a single keyspace with 3 CFs.  All used to have ROWS_ONLY, two 
 were switched to KEYS_ONLY about 2 days ago.  Row cache continues to save 
 fine, but there is no saved key cache file present on any node in the cluster.
 {noformat}
 6925: INFO [CompactionExecutor:12] 2013-11-27 10:12:02,284 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 118 ms
 6941:DEBUG [CompactionExecutor:14] 2013-11-27 10:17:02,163 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 6942: INFO [CompactionExecutor:14] 2013-11-27 10:17:02,310 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 146 ms
 8745:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,140 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8746: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 143 ms
 8747:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,283 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 8748: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 289) Saved KeyCache (21181 items) in 342 ms
 8749:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,625 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8750: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 134 ms
 8751:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,759 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8752: INFO [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 8753:DEBUG [CompactionExecutor:6] 2013-11-27 10:37:25,893 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 8754: INFO [CompactionExecutor:6] 2013-11-27 10:37:26,026 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 133 ms
 9915:DEBUG [CompactionExecutor:18] 2013-11-27 10:42:01,851 
 AutoSavingCache.java (line 233) Deleting old KeyCache files.
 9916: INFO [CompactionExecutor:18] 2013-11-27 10:42:02,185 
 AutoSavingCache.java (line 289) Saved KeyCache (22067 items) in 334 ms
 9917:DEBUG [CompactionExecutor:17] 2013-11-27 10:42:02,279 
 AutoSavingCache.java (line 233) Deleting old RowCache files.
 9918: INFO [CompactionExecutor:17] 2013-11-27 10:42:02,411 
 AutoSavingCache.java (line 289) Saved RowCache (5 items) in 131 ms
 {noformat}
 {noformat}
 $ ll ~/shared/saved_caches/
 total 3472
 -rw-rw-r-- 1 cassandra cassandra 3551608 Nov 27 10:42 Foo-Bar-RowCache-b.db
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)