date:20110628

[jira] [Commented] (CASSANDRA-2753) Capture the max client timestamp for an SSTable

2011-06-28 Thread Alan Liang (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056362#comment-13056362
 ] 

Alan Liang commented on CASSANDRA-2753:
---

bq. No support for supercolumns?

Wow. Good catch. I've added tests for this as well.

bq. it would be more clear if observeColumnsInSSTable took a CFMetaData object 
instead of a CF, to get a serializer from.

I've added a helper method CFMetaData.getColumnSerializer() to do this.

bq. nit: SSTMC.setMaxTimestamp would be more accurately named updateMaxTimestamp

Makes sense.

bq. IMO SSTM deserialize versioning logic would be clearer if it were all in 
SSTMSerializer instead of split between that and openFromDescriptor.

Makes sense.

bq. Suggest adding a comment that SSTableWriter.append(AbstractCompactedRow 
row) deliberately avoids calling updateMaxTimestamp b/c otherwise we'd have to 
deserialize EchoedRow.

Sounds good.

bq. where is the max-timestamp-of-compacted-sstables logic? I didn't notice it.

I put this in ColumnFamilyStore.createCompactionWriter():

{code}
public SSTableWriter createCompactionWriter(long estimatedRows, String location, Collection<SSTableReader> sstables) throws IOException
{
    ReplayPosition rp = ReplayPosition.getReplayPosition(sstables);
    SSTableMetadata.Collector sstableMetadataCollector = SSTableMetadata.createCollector().replayPosition(rp);

    // get the max timestamp of the precompacted sstables
    for (SSTableReader sstable : sstables)
        sstableMetadataCollector.updateMaxTimestamp(sstable.getMaxTimestamp());

    return new SSTableWriter(getTempSSTablePath(location), estimatedRows, metadata, partitioner, sstableMetadataCollector);
}
{code}
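
For illustration only (this is not the code in the patch): a minimal sketch of the max-timestamp folding described above. MaxTimestampCollector is a hypothetical stand-in for SSTableMetadata.Collector, and the timestamps used with it are made up. It assumes each input sstable already knows its own maximum, so the compacted output's maximum can be computed without deserializing any rows.

```java
import java.util.List;

// Hypothetical stand-in for SSTableMetadata.Collector; only the max-timestamp part is modeled.
final class MaxTimestampCollector
{
    // Long.MIN_VALUE means "no timestamp observed yet".
    private long maxTimestamp = Long.MIN_VALUE;

    // Keeps the larger of the current maximum and the candidate,
    // which is why "update" describes it better than "set".
    void updateMaxTimestamp(long candidate)
    {
        maxTimestamp = Math.max(maxTimestamp, candidate);
    }

    long maxTimestamp()
    {
        return maxTimestamp;
    }

    // Mirrors the loop in createCompactionWriter: fold the per-sstable maxima into one collector.
    static long maxOf(List<Long> perSSTableMaxima)
    {
        MaxTimestampCollector collector = new MaxTimestampCollector();
        for (long ts : perSSTableMaxima)
            collector.updateMaxTimestamp(ts);
        return collector.maxTimestamp();
    }
}
```

Because the fold only ever raises the stored value, calling updateMaxTimestamp with an older timestamp is harmless.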

bq. nit: renaming SSTableWriter.writeMetadata feels gratuitous

I renamed it back to writeMetadata.

bq. nit: prefer initializing fields that don't need constructor parameters, at 
declaration time (looking at RowIndexer.sstMC)

Makes sense.


 Capture the max client timestamp for an SSTable
 ---

 Key: CASSANDRA-2753
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2753
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Alan Liang
Assignee: Alan Liang
Priority: Minor
 Attachments: 
 0001-capture-max-timestamp-and-created-SSTableMetadata-to.patch, 
 0003-capture-max-timestamp-for-sstable-and-introduced-SST.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-2753) Capture the max client timestamp for an SSTable

2011-06-28 Thread Alan Liang (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alan Liang updated CASSANDRA-2753:
--

Attachment: 
0001-capture-max-timestamp-and-created-SSTableMetadata-to-V2.patch

V2 patch based on jbellis' comments

 Capture the max client timestamp for an SSTable
 ---

 Key: CASSANDRA-2753
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2753
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Alan Liang
Assignee: Alan Liang
Priority: Minor
 Attachments: 
 0001-capture-max-timestamp-and-created-SSTableMetadata-to-V2.patch, 
 0001-capture-max-timestamp-and-created-SSTableMetadata-to.patch, 
 0003-capture-max-timestamp-for-sstable-and-introduced-SST.patch





svn commit: r1140470 - in /cassandra/branches/cassandra-0.8: CHANGES.txt src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java src/java/org/apache/cassandra/service/RowRepairResolver.j

2011-06-28 Thread slebresne

Author: slebresne
Date: Tue Jun 28 07:53:08 2011
New Revision: 1140470

URL: http://svn.apache.org/viewvc?rev=1140470&view=rev
Log:
Fix potential NPE in range slice read repair
patch by slebresne; reviewed by jbellis for CASSANDRA-2823

Modified:
cassandra/branches/cassandra-0.8/CHANGES.txt

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1140470&r1=1140469&r2=1140470&view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Tue Jun 28 07:53:08 2011
@@ -7,6 +7,8 @@
  * add ability to return endpoints to nodetool (CASSANDRA-2776)
  * Add support for multiple (comma-delimited) coordinator addresses
to ColumnFamilyInputFormat (CASSANDRA-2807)
+ * fix potential NPE while scheduling read repair for range slice
+   (CASSANDRA-2823)
 
 
 0.8.1

Modified: cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java
URL: http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java?rev=1140470&r1=1140469&r2=1140470&view=diff
==
--- cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java (original)
+++ cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java Tue Jun 28 07:53:08 2011
@@ -117,7 +117,9 @@ public class RangeSliceResponseResolver 
 }
 }
 }
-RowRepairResolver.maybeScheduleRepairs(resolved, table, key, versions, versionSources);
+// resolved can be null even if versions doesn't have all nulls because of the call to removeDeleted in resolveSuperSet
+if (resolved != null)
+    RowRepairResolver.maybeScheduleRepairs(resolved, table, key, versions, versionSources);
 versions.clear();
 versionSources.clear();
 return new Row(key, resolved);

Modified: cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java
URL: http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java?rev=1140470&r1=1140469&r2=1140470&view=diff
==
--- cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java (original)
+++ cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java Tue Jun 28 07:53:08 2011
@@ -80,7 +80,9 @@ public class RowRepairResolver extends A
 resolved = resolveSuperset(versions);
 if (logger.isDebugEnabled())
 logger.debug("versions merged");
-maybeScheduleRepairs(resolved, table, key, versions, endpoints);
+// resolved can be null even if versions doesn't have all nulls because of the call to removeDeleted in resolveSuperSet
+if (resolved != null)
+    maybeScheduleRepairs(resolved, table, key, versions, endpoints);
 }
 else
 {
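
To illustrate why the guard in this commit is needed, here is a simplified sketch under stated assumptions. NullSafeResolveSketch and its Cell type are hypothetical models, not Cassandra classes: a row version is a map from column name to cell, the merge keeps the newest cell per column, and tombstones are dropped afterwards. Under that model the merged result can be null even though every input version was non-null.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Simplified, hypothetical model of the resolver: a "row version" maps column name -> cell.
final class NullSafeResolveSketch
{
    static final class Cell
    {
        final long timestamp;
        final boolean tombstone;

        Cell(long timestamp, boolean tombstone)
        {
            this.timestamp = timestamp;
            this.tombstone = tombstone;
        }
    }

    // Merge all versions (newest cell wins), then drop tombstones.
    // Returns null when nothing live remains, even if every input version was non-null.
    static Map<String, Cell> resolveSuperset(List<Map<String, Cell>> versions)
    {
        Map<String, Cell> merged = new HashMap<>();
        for (Map<String, Cell> version : versions)
        {
            if (version == null)
                continue;
            for (Map.Entry<String, Cell> e : version.entrySet())
            {
                Cell existing = merged.get(e.getKey());
                if (existing == null || e.getValue().timestamp > existing.timestamp)
                    merged.put(e.getKey(), e.getValue());
            }
        }
        Map<String, Cell> live = new HashMap<>();
        for (Map.Entry<String, Cell> e : merged.entrySet())
            if (!e.getValue().tombstone)
                live.put(e.getKey(), e.getValue());
        return live.isEmpty() ? null : live;
    }

    // The guard: only look for repairs when there is a resolved row to compare against.
    // Counts the versions whose column set differs from the resolved row.
    static int repairsScheduled(List<Map<String, Cell>> versions)
    {
        Map<String, Cell> resolved = resolveSuperset(versions);
        if (resolved == null)
            return 0;
        int repairs = 0;
        for (Map<String, Cell> version : versions)
            if (version == null || !version.keySet().equals(resolved.keySet()))
                repairs++;
        return repairs;
    }
}
```

With one replica holding a live column and another holding a newer tombstone for it, resolveSuperset returns null, and without the null check the comparison loop would dereference it.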

[jira] [Resolved] (CASSANDRA-2823) NPE during range slices with rowrepairs

2011-06-28 Thread Sylvain Lebresne (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne resolved CASSANDRA-2823.
-

   Resolution: Fixed
Fix Version/s: 0.8.2
 Reviewer: jbellis

Committed, thanks

 NPE during range slices with rowrepairs
 ---

 Key: CASSANDRA-2823
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2823
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.8.2
 Environment: This is a trunk build with 2521 and 2433
 I somewhat doubt that is related however.
Reporter: Terje Marthinussen
Assignee: Sylvain Lebresne
 Fix For: 0.8.2

 Attachments: 2823.patch


 Doing some heavy testing of relatively fast feeding (5000+ mutations/sec) + 
 repair on all nodes + range slices.
 Then occasionally killing a node here and there and restarting it.
 Triggers the following NPE
  ERROR [pool-2-thread-3] 2011-06-24 20:56:27,289 Cassandra.java (line 3210) 
 Internal error processing get_range_slices
 java.lang.NullPointerException
   at 
 org.apache.cassandra.service.RowRepairResolver.maybeScheduleRepairs(RowRepairResolver.java:109)
   at 
 org.apache.cassandra.service.RangeSliceResponseResolver$2.getReduced(RangeSliceResponseResolver.java:112)
   at 
 org.apache.cassandra.service.RangeSliceResponseResolver$2.getReduced(RangeSliceResponseResolver.java:83)
   at 
 org.apache.cassandra.utils.MergeIterator$ManyToOne.consume(MergeIterator.java:161)
   at 
 org.apache.cassandra.utils.MergeIterator.computeNext(MergeIterator.java:88)
   at 
 com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:140)
   at 
 com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:135)
   at 
 org.apache.cassandra.service.RangeSliceResponseResolver.resolve(RangeSliceResponseResolver.java:120)
   at 
 org.apache.cassandra.service.RangeSliceResponseResolver.resolve(RangeSliceResponseResolver.java:43)
 Looking at the code in getReduced:
 {noformat}
 ColumnFamily resolved = versions.size() > 1
                       ? RowRepairResolver.resolveSuperset(versions)
                       : versions.get(0);
 {noformat}
 It seems that resolved becomes null when this happens and versions.size() is 
 larger than 1.
 RowRepairResolver.resolveSuperset() does actually return null if it cannot 
 resolve anything, so there is definitely a case here which can occur and is 
 not handled.
 It may also be an interesting question if it is guaranteed that   
  
 versions.add(current.left.cf);
 can never return null?
 Jonathan suggested on IRC that maybe 
 {noformat}
 ColumnFamily resolved = versions.size() > 1
                       ? RowRepairResolver.resolveSuperset(versions)
                       : versions.get(0);
 if (resolved == null)
   return new Row(key, resolved);
 {noformat}
 could be a fix.


svn commit: r1140472 - in /cassandra/branches/cassandra-0.8: CHANGES.txt src/java/org/apache/cassandra/db/SystemTable.java

2011-06-28 Thread slebresne

Author: slebresne
Date: Tue Jun 28 07:58:56 2011
New Revision: 1140472

URL: http://svn.apache.org/viewvc?rev=1140472&view=rev
Log:
Avoids race in SystemTable.getCurrentLocalNodeId
patch by slebresne; reviewed by jbellis for CASSANDRA-2824

Modified:
cassandra/branches/cassandra-0.8/CHANGES.txt

cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java

Modified: cassandra/branches/cassandra-0.8/CHANGES.txt
URL: http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/CHANGES.txt?rev=1140472&r1=1140471&r2=1140472&view=diff
==
--- cassandra/branches/cassandra-0.8/CHANGES.txt (original)
+++ cassandra/branches/cassandra-0.8/CHANGES.txt Tue Jun 28 07:58:56 2011
@@ -9,6 +9,7 @@
to ColumnFamilyInputFormat (CASSANDRA-2807)
  * fix potential NPE while scheduling read repair for range slice
(CASSANDRA-2823)
+ * Fix race in SystemTable.getCurrentLocalNodeId (CASSANDRA-2824)
 
 
 0.8.1

Modified: cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java
URL: http://svn.apache.org/viewvc/cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java?rev=1140472&r1=1140471&r2=1140472&view=diff
==
--- cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java (original)
+++ cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java Tue Jun 28 07:58:56 2011
@@ -380,6 +380,8 @@ public class SystemTable
 ColumnFamily cf = table.getColumnFamilyStore(NODE_ID_CF).getColumnFamily(filter);
 if (cf != null)
 {
+// Even though gc_grace==0 on System table, we can have a race where we get back tombstones (see CASSANDRA-2824)
+cf = ColumnFamilyStore.removeDeleted(cf, 0);
 assert cf.getColumnCount() <= 1;
 if (cf.getColumnCount() > 0)
     id = cf.iterator().next().name();
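
As a rough illustration of the fix above (a sketch, not the SystemTable code): CurrentNodeIdSketch and its Column type are hypothetical. The point is that tombstones are purged before the "at most one column" check, so a read that races with a delete and returns both the old (deleted) id and the new one no longer trips the check.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model: a column is just a name plus a "deleted" marker.
final class CurrentNodeIdSketch
{
    static final class Column
    {
        final String name;
        final boolean tombstone;

        Column(String name, boolean tombstone)
        {
            this.name = name;
            this.tombstone = tombstone;
        }
    }

    // Keeps only live columns, in their original order.
    static List<Column> removeDeleted(List<Column> columns)
    {
        List<Column> live = new ArrayList<>();
        for (Column c : columns)
            if (!c.tombstone)
                live.add(c);
        return live;
    }

    // Purge tombstones first, then require at most one live node id.
    // Returns null when no live id exists.
    static String currentNodeId(List<Column> columnsRead)
    {
        List<Column> live = removeDeleted(columnsRead);
        if (live.size() > 1)
            throw new IllegalStateException("expected at most one live node id, found " + live.size());
        return live.isEmpty() ? null : live.get(0).name;
    }
}
```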

[jira] [Updated] (CASSANDRA-2653) index scan errors out when zero columns are requested

2011-06-28 Thread Sylvain Lebresne (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-2653:


Attachment: 2653_v3.patch

 index scan errors out when zero columns are requested
 -

 Key: CASSANDRA-2653
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2653
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.7.6, 0.8.0 beta 2
Reporter: Jonathan Ellis
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.7, 0.8.1

 Attachments: 
 0001-Handle-data-get-returning-null-in-secondary-indexes.patch, 
 0001-Handle-null-returns-in-data-index-query-v0.7.patch, 
 0001-Reset-SSTII-in-EchoedRow-constructor.patch, 2653_v2.patch, 
 2653_v3.patch, v1-0001-CASSANDRA-2653-reproduce-regression.txt


 As reported by Tyler Hobbs as an addendum to CASSANDRA-2401,
 {noformat}
 ERROR 16:13:38,864 Fatal exception in thread Thread[ReadStage:16,5,main]
 java.lang.AssertionError: No data found for 
 SliceQueryFilter(start=java.nio.HeapByteBuffer[pos=10 lim=10 cap=30], 
 finish=java.nio.HeapByteBuffer[pos=17 lim=17 cap=30], reversed=false, 
 count=0] in DecoratedKey(81509516161424251288255223397843705139, 
 6b657931):QueryPath(columnFamilyName='cf', superColumnName='null', 
 columnName='null') (original filter 
 SliceQueryFilter(start=java.nio.HeapByteBuffer[pos=10 lim=10 cap=30], 
 finish=java.nio.HeapByteBuffer[pos=17 lim=17 cap=30], reversed=false, 
 count=0]) from expression 'cf.626972746864617465 EQ 1'
   at 
 org.apache.cassandra.db.ColumnFamilyStore.scan(ColumnFamilyStore.java:1517)
   at 
 org.apache.cassandra.service.IndexScanVerbHandler.doVerb(IndexScanVerbHandler.java:42)
   at 
 org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:72)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}


[jira] [Commented] (CASSANDRA-2824) assert err on SystemTable.getCurrentLocalNodeId during a cleanup

2011-06-28 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056384#comment-13056384
 ] 

Hudson commented on CASSANDRA-2824:
---

Integrated in Cassandra-0.8 #195 (See 
[https://builds.apache.org/job/Cassandra-0.8/195/])
Avoids race in SystemTable.getCurrentLocalNodeId
patch by slebresne; reviewed by jbellis for CASSANDRA-2824

slebresne : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1140472
Files : 
* /cassandra/branches/cassandra-0.8/CHANGES.txt
* /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/db/SystemTable.java


 assert err on SystemTable.getCurrentLocalNodeId during a cleanup
 

 Key: CASSANDRA-2824
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2824
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Jackson Chung
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.8.2

 Attachments: 2824.patch, 2824_v2.patch


 when running nodetool cleanup the following happened:
 $ ./bin/nodetool cleanup --host localhost
 Exception in thread "main" java.lang.AssertionError
 at 
 org.apache.cassandra.db.SystemTable.getCurrentLocalNodeId(SystemTable.java:383)
 at 
 org.apache.cassandra.utils.NodeId$LocalNodeIdHistory.<init>(NodeId.java:179)
 at org.apache.cassandra.utils.NodeId.<clinit>(NodeId.java:38)
 at org.apache.cassandra.utils.NodeId$OneShotRenewer.<init>(NodeId.java:159)
 at 
 org.apache.cassandra.service.StorageService.forceTableCleanup(StorageService.java:1317)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:93)
 at 
 com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:27)
 at 
 com.sun.jmx.mbeanserver.MBeanIntrospector.invokeM(MBeanIntrospector.java:208)
 at com.sun.jmx.mbeanserver.PerInterface.invoke(PerInterface.java:120)
 at com.sun.jmx.mbeanserver.MBeanSupport.invoke(MBeanSupport.java:262)
 at 
 com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.invoke(DefaultMBeanServerInterceptor.java:836)
 at com.sun.jmx.mbeanserver.JmxMBeanServer.invoke(JmxMBeanServer.java:761)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.doOperation(RMIConnectionImpl.java:1427)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.access$200(RMIConnectionImpl.java:72)
 at 
 javax.management.remote.rmi.RMIConnectionImpl$PrivilegedOperation.run(RMIConnectionImpl.java:1265)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.doPrivilegedOperation(RMIConnectionImpl.java:1360)
 at 
 javax.management.remote.rmi.RMIConnectionImpl.invoke(RMIConnectionImpl.java:788)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:305)
 at sun.rmi.transport.Transport$1.run(Transport.java:159)
 at java.security.AccessController.doPrivileged(Native Method)
 at sun.rmi.transport.Transport.serviceCall(Transport.java:155)
 at sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:535)
 at 
 sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport.java:790)
 at 
 sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.java:649)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662) 


[jira] [Commented] (CASSANDRA-2823) NPE during range slices with rowrepairs

2011-06-28 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056383#comment-13056383
 ] 

Hudson commented on CASSANDRA-2823:
---

Integrated in Cassandra-0.8 #195 (See 
[https://builds.apache.org/job/Cassandra-0.8/195/])
Fix potential NPE in range slice read repair
patch by slebresne; reviewed by jbellis for CASSANDRA-2823

slebresne : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1140470
Files : 
* /cassandra/branches/cassandra-0.8/CHANGES.txt
* /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RowRepairResolver.java
* /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/RangeSliceResponseResolver.java


 NPE during range slices with rowrepairs
 ---

 Key: CASSANDRA-2823
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2823
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 0.8.2
 Environment: This is a trunk build with 2521 and 2433
 I somewhat doubt that is related however.
Reporter: Terje Marthinussen
Assignee: Sylvain Lebresne
 Fix For: 0.8.2

 Attachments: 2823.patch



[jira] [Commented] (CASSANDRA-2653) index scan errors out when zero columns are requested

2011-06-28 Thread Sylvain Lebresne (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056385#comment-13056385
 ] 

Sylvain Lebresne commented on CASSANDRA-2653:
-

bq. doesn't this assert still have the "the query to the index and the data is 
not atomic" problem?

No, you're right. I focused on adding back the assert, forgetting it wasn't safe 
in the first place. Attaching v3 based on v2, but instead of asserting that the 
returned row contains the primary clause column, it skips the row if it doesn't 
contain it. That is, instead of asserting the non-corruption of the index, it 
ignores any possible corruption. But more importantly (one could hope we don't 
have a bug that corrupts indexes), it will avoid returning incoherent results to 
the user in the event of a race between reads and writes.

Trying to prevent the race from happening would require synchronization with 
writes, which would be much harder and less efficient. And we probably need to 
have a fix for that out sooner rather than later (both for the error when zero 
columns are requested and for the possibility of wrongly throwing assertion errors).

In the longer term, I think we should explore the possibility of no longer 
caring whether our secondary indexes are coherent at all times and repairing them 
at read time, as this may allow us to get rid of the read-before-write. But it's a 
longer-term goal at best and work for another ticket.
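
The "skip instead of assert" approach described for v3 can be sketched roughly as follows. This is an illustration under assumptions, not the patch: IndexScanSketch is hypothetical, the index is modeled as a list of candidate row keys, and the data as a map from row key to column values. A candidate whose data row no longer carries the indexed value is silently dropped from the result.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical model: index hits are row keys; data maps row key -> (column name -> value).
final class IndexScanSketch
{
    static List<String> scan(List<String> indexHits,
                             Map<String, Map<String, String>> data,
                             String column,
                             String expected)
    {
        List<String> result = new ArrayList<>();
        for (String key : indexHits)
        {
            Map<String, String> row = data.get(key);
            // The index read and the data read are not atomic: a concurrent write may have
            // changed or removed the column in between. Skip such rows rather than assert.
            if (row == null || !expected.equals(row.get(column)))
                continue;
            result.add(key);
        }
        return result;
    }
}
```

The trade-off is the one stated in the comment: a genuinely corrupt index entry is ignored just like a benign race, but the caller never sees a row that fails the primary clause.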

 

 index scan errors out when zero columns are requested
 -

 Key: CASSANDRA-2653
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2653
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Affects Versions: 0.7.6, 0.8.0 beta 2
Reporter: Jonathan Ellis
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 0.7.7, 0.8.1

 Attachments: 
 0001-Handle-data-get-returning-null-in-secondary-indexes.patch, 
 0001-Handle-null-returns-in-data-index-query-v0.7.patch, 
 0001-Reset-SSTII-in-EchoedRow-constructor.patch, 2653_v2.patch, 
 2653_v3.patch, v1-0001-CASSANDRA-2653-reproduce-regression.txt



[jira] [Commented] (CASSANDRA-2834) Avoid repair getting started twice at the same time for the same CF

2011-06-28 Thread Sylvain Lebresne (JIRA)

[
https://issues.apache.org/jira/browse/CASSANDRA-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056399#comment-13056399
]

Sylvain Lebresne commented on CASSANDRA-2834:
-

bq. It may seem like it is possible to start repair twice at the same time on
the same CF?

It is possible. Right now the only case where we abort a repair quickly is when
some neighbors are dead.

Repairing twice on the same CF is indeed useless and we can try to avoid it.
This is however not totally trivial because the two repairs can be started on
different nodes, so we'll have to synchronize somehow. Rather, it's not hard per
se, but this will require some addition to the network protocol and is thus a
little longer term than one could hope.

Note that this may be made simpler by CASSANDRA-1740, in that it would propose
to have a way to abort a repair (which we don't have so far).
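
For the single-node half of the problem only, a guard could look roughly like the sketch below. LocalRepairRegistry is a hypothetical class, not part of Cassandra, and it deliberately does not address repairs started on different nodes, which is the part that needs the protocol change mentioned above.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical per-node registry of running repairs, keyed by keyspace and column family.
final class LocalRepairRegistry
{
    private final ConcurrentMap<String, Boolean> running = new ConcurrentHashMap<>();

    private static String key(String keyspace, String columnFamily)
    {
        return keyspace + "/" + columnFamily;
    }

    // Atomically claims the CF; returns false if a repair is already registered for it.
    boolean tryStart(String keyspace, String columnFamily)
    {
        return running.putIfAbsent(key(keyspace, columnFamily), Boolean.TRUE) == null;
    }

    // Must be called when the repair finishes or fails, or the CF stays blocked.
    void finish(String keyspace, String columnFamily)
    {
        running.remove(key(keyspace, columnFamily));
    }
}
```

putIfAbsent makes the check-and-claim a single atomic step, so two threads on the same node cannot both start a repair on the same CF.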

Avoid repair getting started twice at the same time for the same CF
---

Key: CASSANDRA-2834
URL: https://issues.apache.org/jira/browse/CASSANDRA-2834
Project: Cassandra
Issue Type: Improvement
Reporter: Terje Marthinussen

It may seem like it is possible to start repair twice at the same time on the
same CF?
Not 100% verified, but if this is indeed the case, we may want to consider
avoiding that, including making nodetool repair abort and return an error if
repair is attempted on the same CF as one which already has a repair running.


[jira] [Commented] (CASSANDRA-2521) Move away from Phantom References for Compaction/Memtable

2011-06-28 Thread Sylvain Lebresne (JIRA)

[
https://issues.apache.org/jira/browse/CASSANDRA-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056406#comment-13056406
]

Sylvain Lebresne commented on CASSANDRA-2521:
-

bq. but I guess these may be results of references acquired which are not freed
as the streaming fills up the disk and fails.

Yes, until we have CASSANDRA-2433 (rebased with this), we don't detect failing
streaming and thus never delete the files (until restart). Which btw makes me
say that we'd better have CASSANDRA-2433 if we have this ticket. But that was the
plan anyway.

bq. there are no less but 53 tmp files. A lot of concurrent streams here!

Though it is not related to this ticket, I'll note that CASSANDRA-2816 only
staggers the merkle tree creation, not the streaming. That is, the streaming
will be staggered to some extent, but if the streaming part is much longer
than the merkle tree creation one, you will still have lots of concurrent
streams going on. But -tmp files also include the compactions that are going on,
and failed repairs leave -tmp files around, which could also help explain
their large number. In any case, this is not related to the issue at hand :)

{quote}
However I noticed this in the log:
INFO [Thread-185] 2011-06-28 05:01:15,390 StorageService.java (line 2083)
requesting GC to free disk space
I guess we can get rid of that?
{quote}

In some cases (mmap with non-sun jvm at least) we are still relying on the GC
to free space.

Terje, if you can confirm that you didn't see anything utterly wrong with the
last patch (related to that patch, not repair), I'll commit it. I think having
it in trunk sooner will help with getting more testing sooner. And given that
we don't want to have bugs in our force unmapping, that'll be a good thing. In
particular, it would be good to have someone try that on windows.

Move away from Phantom References for Compaction/Memtable
-

Key: CASSANDRA-2521
URL: https://issues.apache.org/jira/browse/CASSANDRA-2521
Project: Cassandra
Issue Type: Improvement
Components: Core
Reporter: Chris Goffinet
Assignee: Sylvain Lebresne
Fix For: 1.0

Attachments:
0001-Use-reference-counting-to-decide-when-a-sstable-can-.patch,
0001-Use-reference-counting-to-decide-when-a-sstable-can-v2.patch,
0002-Force-unmapping-files-before-deletion-v2.patch, 2521-v3.txt, 2521-v4.txt

http://wiki.apache.org/cassandra/MemtableSSTable
Let's move to using reference counting instead of relying on GC to be called
in StorageService.
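
A minimal sketch of the reference-counting idea, for illustration only. RefCountedFile is a hypothetical class and the "delete" is just a flag; the attached patches may be structured quite differently. The assumption here is that the owner holds one reference from creation and that readers must acquire before use and release afterwards.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical reference-counted resource: cleanup runs when the last reference is released,
// instead of waiting for the garbage collector to notice a phantom reference.
final class RefCountedFile
{
    // Starts at 1: the owner's own reference.
    private final AtomicInteger references = new AtomicInteger(1);
    private volatile boolean deleted = false;

    // Returns false if the count has already dropped to zero (resource is gone).
    boolean acquire()
    {
        while (true)
        {
            int n = references.get();
            if (n <= 0)
                return false;
            if (references.compareAndSet(n, n + 1))
                return true;
        }
    }

    // The caller that drops the count to zero performs the cleanup.
    void release()
    {
        if (references.decrementAndGet() == 0)
            deleted = true; // a real implementation would unmap and delete the files here
    }

    boolean isDeleted()
    {
        return deleted;
    }
}
```

The compare-and-set loop in acquire prevents a reader from resurrecting a resource whose count has already reached zero.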


[jira] [Commented] (CASSANDRA-2833) CounterColumn should have an optional binary field so that double can be incremented/decremented along with long

2011-06-28 Thread Sylvain Lebresne (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056425#comment-13056425
 ] 

Sylvain Lebresne commented on CASSANDRA-2833:
-

Do people really still use double nowadays ?! :)

I mean I'll admit that I don't see right away why you can't use long to track 
durations or other values common to analytics. Don't get me wrong, that may 
require the client to multiply its values by some power of 10, but *not 
feasible* seems a bit of a strong word to me. I guess what I'm saying is that I 
think we should avoid being in the "let's have it because it sounds cool" land 
(a.k.a. feature creep) and make sure we stay in the "let's have it because it is 
generally useful and solves a problem that can't be solved otherwise easily" 
land.  And I'm not saying that adding double support is of the former kind, but 
I guess I'm not yet convinced it is completely of the latter kind either.

Initial ranting being done, this is technically doable and fairly easily so.  
This will add a bit of clutter internally (imho, you'd want to add a 
DoubleCounterType (or RealCounterType so people don't think it doubles the 
value each time) and subclass/refactor CounterContext somehow) but probably not 
too much.
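
The "multiply by some power of 10" workaround mentioned above can be sketched on the client side as follows. ScaledCounterSketch and its SCALE of three decimal places are assumptions chosen for illustration; the client would pick whatever precision its data needs and must use the same scale for every increment.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;

// Hypothetical client-side helper: store decimals in a long counter as fixed-point values.
final class ScaledCounterSketch
{
    // Number of decimal places kept; 3 means values are stored in thousandths.
    static final int SCALE = 3;

    // "1.25" -> 1250. Rounds half-up to SCALE places; throws ArithmeticException on overflow.
    static long toScaled(String decimal)
    {
        return new BigDecimal(decimal)
               .setScale(SCALE, RoundingMode.HALF_UP)
               .movePointRight(SCALE)
               .longValueExact();
    }

    // 1625 -> "1.625"
    static String fromScaled(long scaled)
    {
        return BigDecimal.valueOf(scaled, SCALE).toPlainString();
    }
}
```

Increments of toScaled("1.25") and toScaled("0.375") sum to 1625 in the long counter, which reads back as 1.625. Unlike repeated double additions, the sum is exact as long as each input fits the chosen scale.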

 CounterColumn should have an optional binary field so that double can be 
 incremented/decremented along with long
 

 Key: CASSANDRA-2833
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2833
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Joe Stein

 Currently CounterColumn only has a long, making it not feasible to track 
 increment/decrement of durations or other values common to analytics 
 represented as a double.
 The change I am proposing to implement, after some discussions/advice in the 
 irc to issues raised, is to add a new optional binary field to CounterColumn 
 (thrift).  I was thinking we could call it *operand*.
 Under the hood (src/java/org/apache/cassandra/db/CounterColumn.java) I would 
 handle things with byte[], moving between long and double in internal helper 
 functions with a case switch on the type of operand we are setting. We might also 
 need an optional enum for the type too, so the client can let the server 
 know how it should materialize the bytes when it += the value stored.
 The clients should continue to work as expected and folks looking to use this 
 can just do so.


[jira] [Commented] (CASSANDRA-2755) ColumnFamilyRecordWriter fails to throw a write exception encountered after the user begins to close the writer

2011-06-28 Thread Mck SembWever (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-2755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056446#comment-13056446
 ] 

Mck SembWever commented on CASSANDRA-2755:
--

Jonathan: Is your patch being applied?

 ColumnFamilyRecordWriter fails to throw a write exception encountered after 
 the user begins to close the writer
 ---

 Key: CASSANDRA-2755
 URL: https://issues.apache.org/jira/browse/CASSANDRA-2755
 Project: Cassandra
  Issue Type: Bug
  Components: Hadoop
Affects Versions: 0.8.0
Reporter: Greg Katz
Assignee: Mck SembWever
 Attachments: 2755-v2.txt, CASSANDRA-2755.patch


 There appears to be a race condition in {{ColumnFamilyRecordWriter}} that can 
 result in the loss of an exception. Here is how it can happen (W stands for 
 the {{RangeClient}}'s worker thread; U stands for the 
 {{ColumnFamilyRecordWriter}} user's thread):
 # W: {{RangeClient}}'s {{run}} method catches an exception originating in the 
 Thrift client/socket, but doesn't get a chance to set it on the 
 {{lastException}} field before the thread is preempted.
 # U: The user calls {{close}} which calls {{stopNicely}}. Because the 
 {{lastException}} field is null, {{stopNicely}} does not throw anything. 
 {{close}} then joins on the worker thread.
 # W: The {{RangeClient}}'s {{run}} method sets the {{lastException}} field 
 and exits.
 # U: Although the thread in {{close}} is waiting for the worker thread to 
 exit, it has already checked the {{lastException}} field so it doesn't detect 
 the presence of the last exception. Instead, {{close}} returns without 
 throwing anything.
 This race condition means that write failures will intermittently go 
 undetected.
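
One way to close the window described in the steps above is to check the exception field only after the worker thread has been joined. The sketch below illustrates that ordering; WriterCloseSketch is hypothetical and is not the attached patch, and the worker simply simulates a failure instead of talking to Thrift.

```java
import java.io.IOException;

// Hypothetical writer with one worker thread that may record a failure at any time before it exits.
final class WriterCloseSketch
{
    private volatile IOException lastException;
    private final Thread worker;

    private WriterCloseSketch(final boolean failInWorker)
    {
        worker = new Thread(new Runnable()
        {
            public void run()
            {
                if (failInWorker)
                    lastException = new IOException("simulated write failure");
            }
        });
    }

    static WriterCloseSketch open(boolean failInWorker)
    {
        WriterCloseSketch writer = new WriterCloseSketch(failInWorker);
        writer.worker.start();
        return writer;
    }

    void close() throws IOException
    {
        try
        {
            worker.join();
        }
        catch (InterruptedException e)
        {
            Thread.currentThread().interrupt();
            throw new IOException("interrupted while waiting for the worker");
        }
        // Checked only after the join, so a failure recorded at any point
        // before the worker exited is visible here.
        IOException failure = lastException;
        if (failure != null)
            throw failure;
    }

    // Convenience for trying the sketch: reports whether close() surfaced a failure.
    static boolean closeThrows(boolean failInWorker)
    {
        try
        {
            open(failInWorker).close();
            return false;
        }
        catch (IOException e)
        {
            return true;
        }
    }
}
```

Thread.join guarantees that everything the worker did happens-before the join returns, so the check after it cannot miss a late-set exception.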
