[VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Sylvain Lebresne
Cassandra 2.0 is coming along but we now need wider testing. So I propose
the
following artifacts for release as 2.0.0-beta1. Let it be clear that it is
only
a beta (and the first one at that), so we know it's not perfect, but the
current goal is first and foremost to get better testing.

sha1: fcdb39384e8570cf38c027f38b799181da06d56d
Git:
http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
Artifacts:
https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
Staging repository:
https://repository.apache.org/content/repositories/orgapachecassandra-117/

The artifacts as well as the debian package are also available here:
http://people.apache.org/~slebresne/

The vote will be open for 72 hours (longer if needed).

[1]: http://goo.gl/LorY5 (CHANGES.txt)
[2]: http://goo.gl/zEt5i (NEWS.txt)


org.apache.cassandra.cli.CliTest:testCli is failing

2013-07-09 Thread Suresh Sajja
Hi,

I checked out fresh sources from git. I am trying to run unit tests by
executing

$ant test

but the CliTest is failing.

The log trace is attached below. Can someone help me if i am missing any
configs.

Thank you.


ERROR [FlushWriter:1] 2013-07-09 15:03:28,428 SSTable.java (line 306)
Missing component:
build/test/cassandra/data/system/IndexInfo/system-IndexInfo-tmp-ja-6-Summary.db
ERROR [FlushWriter:1] 2013-07-09 15:03:28,429 SSTable.java (line 306)
Missing component:
build/test/cassandra/data/system/IndexInfo/system-IndexInfo-tmp-ja-6-Statistics.db
ERROR [FlushWriter:1] 2013-07-09 15:03:28,430 SSTable.java (line 306)
Missing component:
build/test/cassandra/data/system/IndexInfo/system-IndexInfo-tmp-ja-6-CompressionInfo.db
ERROR [FlushWriter:1] 2013-07-09 15:03:28,430 SSTable.java (line 306)
Missing component:
build/test/cassandra/data/system/IndexInfo/system-IndexInfo-tmp-ja-6-Data.db
DEBUG [FlushWriter:1] 2013-07-09 15:03:28,430 FileUtils.java (line 113)
Deleting system-IndexInfo-tmp-ja-6-TOC.txt
DEBUG [FlushWriter:1] 2013-07-09 15:03:28,430 FileUtils.java (line 113)
Deleting system-IndexInfo-tmp-ja-6-Index.db
DEBUG [FlushWriter:1] 2013-07-09 15:03:28,430 FileUtils.java (line 113)
Deleting system-IndexInfo-tmp-ja-6-Filter.db
DEBUG [FlushWriter:1] 2013-07-09 15:03:28,431 SSTable.java (line 153)
Deleted build/test/cassandra/data/system/IndexInfo/system-IndexInfo-tmp-ja-6
ERROR [FlushWriter:1] 2013-07-09 15:03:28,433 CassandraDaemon.java (line
196) Exception in thread Thread[FlushWriter:1,5,main]
java.lang.AssertionError
at
org.apache.cassandra.io.util.FileUtils.renameWithConfirm(FileUtils.java:130)
at
org.apache.cassandra.io.util.FileUtils.renameWithConfirm(FileUtils.java:125)
at
org.apache.cassandra.io.sstable.SSTableWriter.rename(SSTableWriter.java:363)
at
org.apache.cassandra.io.sstable.SSTableWriter.rename(SSTableWriter.java:355)
at
org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:314)
at
org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:296)
at
org.apache.cassandra.db.Memtable$FlushRunnable.writeSortedContents(Memtable.java:427)
at
org.apache.cassandra.db.Memtable$FlushRunnable.runWith(Memtable.java:381)
at
org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
at
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:724)


Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Murali
+1

On Tue, Jul 9, 2013 at 2:50 PM, Sylvain Lebresne sylv...@datastax.comwrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)




-- 
Thanks,
Murali
99025-5


Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Carl Yeksigian
+1


On Tue, Jul 9, 2013 at 5:20 AM, Sylvain Lebresne sylv...@datastax.comwrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)



Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Jason Brown
+1. Woo-hoo!


On Tue, Jul 9, 2013 at 4:45 AM, Carl Yeksigian c...@yeksigian.com wrote:

 +1


 On Tue, Jul 9, 2013 at 5:20 AM, Sylvain Lebresne sylv...@datastax.com
 wrote:

  Cassandra 2.0 is coming along but we now need wider testing. So I propose
  the
  following artifacts for release as 2.0.0-beta1. Let it be clear that it
 is
  only
  a beta (and the first one at that), so we know it's not perfect, but the
  current goal is first and foremost to get better testing.
 
  sha1: fcdb39384e8570cf38c027f38b799181da06d56d
  Git:
 
 
 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
  Artifacts:
 
 
 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
  Staging repository:
 
 https://repository.apache.org/content/repositories/orgapachecassandra-117/
 
  The artifacts as well as the debian package are also available here:
  http://people.apache.org/~slebresne/
 
  The vote will be open for 72 hours (longer if needed).
 
  [1]: http://goo.gl/LorY5 (CHANGES.txt)
  [2]: http://goo.gl/zEt5i (NEWS.txt)
 



Configuration of network connectors

2013-07-09 Thread Łukasz Dywicki
Hello,
First of all I would like to say hello to cassandra user and developer 
community. :)

I write because we are using Cassandra in our unit tests and we have some 
troubles with network connectivity. We ca not run multiple cassandra instances 
during tests because we would need to randomize configuration of port and so 
on. For now if we try to fork our tests we get address already in use on one 
from two ports - native or thrift. In other apache projects we can VM 
connectors (ActiveMQ, Camel, Mina) based on in-memory queue. I took some time 
to see how CassandraDaemon starts servers and it's kinda of hardcoded. I 
thought about changing configuration to be more like:

servers:
  - class org.apache.cassandra.thrift.ThriftServer
  - class org.apache.cassandra.transport.Server

Then we will be able to disable these servers for unit tests:
servers:
  - class org.apache.cassandra.vm.VmServer

This requires some small changes in daemon code and client libraries. I'm not 
really deeply involved in cassandra stuff so I don't know the internal 
architecture and implications thus I look forward for you to discuss this topic.

Cheers,
Łukasz Dywicki
--
l...@code-house.org
Twitter: ldywicki
Blog: http://dywicki.pl
Code-House - http://code-house.org



Re: Configuration of network connectors

2013-07-09 Thread Jeremy Hanna
Have you seen https://github.com/pcmanus/ccm as described in 
http://www.datastax.com/dev/blog/ccm-a-development-tool-for-creating-local-cassandra-clusters
 or does that not fit your use case?

On 9 Jul 2013, at 14:02, Łukasz Dywicki l...@code-house.org wrote:

 Hello,
 First of all I would like to say hello to cassandra user and developer 
 community. :)
 
 I write because we are using Cassandra in our unit tests and we have some 
 troubles with network connectivity. We ca not run multiple cassandra 
 instances during tests because we would need to randomize configuration of 
 port and so on. For now if we try to fork our tests we get address already 
 in use on one from two ports - native or thrift. In other apache projects we 
 can VM connectors (ActiveMQ, Camel, Mina) based on in-memory queue. I took 
 some time to see how CassandraDaemon starts servers and it's kinda of 
 hardcoded. I thought about changing configuration to be more like:
 
 servers:
  - class org.apache.cassandra.thrift.ThriftServer
  - class org.apache.cassandra.transport.Server
 
 Then we will be able to disable these servers for unit tests:
 servers:
  - class org.apache.cassandra.vm.VmServer
 
 This requires some small changes in daemon code and client libraries. I'm not 
 really deeply involved in cassandra stuff so I don't know the internal 
 architecture and implications thus I look forward for you to discuss this 
 topic.
 
 Cheers,
 Łukasz Dywicki
 --
 l...@code-house.org
 Twitter: ldywicki
 Blog: http://dywicki.pl
 Code-House - http://code-house.org
 



Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Jonathan Ellis
+1
On Jul 9, 2013 4:21 AM, Sylvain Lebresne sylv...@datastax.com wrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)



Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Eric Evans
[ Sylvain Lebresne ]
 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.
 
 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:
 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/
 
 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/
 
 The vote will be open for 72 hours (longer if needed).

+1

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)

-- 
Eric Evans
eev...@sym-link.com


Composite Key Overload

2013-07-09 Thread Darren Lee
Hi Everyone,

I have a CF which currently has a large composite key. Let's say it has 5 
components, A,B,C,D,E.

I know I need to be able to query and find all rows where A,B,C equal a certain 
value. And I also need to query when A,B,C,D are a certain value.

Traditionally what is the best approach for this?

* Composite key? Is it possible to query on components of the composite 
key?

* Duplicate the fields I want to query on and create a secondary index

* Expand the data into supercolumns for horizontal traversal

* Create lookup tables and suffer having to do reads before being able 
to write updates

Thanks


Re: Configuration of network connectors

2013-07-09 Thread Łukasz Dywicki
Jeremy,
Sadly it does not cover our case. We have unit tests and we want to test really 
basic things like mappings of data contained in cassandra to our model. For 
that we don't need cluster at all because in unit tests we don't want to test 
data distribution. We also would like to run everything in JVM, thus CCM 
written in Python is not really what we need.
What we are looking for is minimal cassandra set up which could be embedded and 
used concurrently multiple times. For example we now use CassandraUnit:

@Rule
public CassandraUnit unit = new CassandraUnit(new EmptyDataSet(), 
embedded-cassandra.yaml);

@Test
public void fistTest() {
// do something with data
}

@Test
public void secondTest() {
// do something else
}

In this set up JUnit will launch new CassandraDaemon for every test. If we set 
FORK_MODE per test then we may have two cassandra instances running at the same 
time. First test which launch CassandraDaemon will pass, second may fail due 
port usage conflict. That's why we thought about testing without network layer. 
This can save some time. It would be great because for some older hardware used 
by our developers it takes up to 9 minutes to run build with all unit tests. 
Some of this time is consumed by startup and shutdown of cassandra.

Cheers,
Łukasz Dywicki
--
l...@code-house.org
Twitter: ldywicki
Blog: http://dywicki.pl
Code-House - http://code-house.org

Wiadomość napisana przez Jeremy Hanna jeremy.hanna1...@gmail.com w dniu 9 lip 
2013, o godz. 15:22:

 Have you seen https://github.com/pcmanus/ccm as described in 
 http://www.datastax.com/dev/blog/ccm-a-development-tool-for-creating-local-cassandra-clusters
  or does that not fit your use case?
 
 On 9 Jul 2013, at 14:02, Łukasz Dywicki l...@code-house.org wrote:
 
 Hello,
 First of all I would like to say hello to cassandra user and developer 
 community. :)
 
 I write because we are using Cassandra in our unit tests and we have some 
 troubles with network connectivity. We ca not run multiple cassandra 
 instances during tests because we would need to randomize configuration of 
 port and so on. For now if we try to fork our tests we get address already 
 in use on one from two ports - native or thrift. In other apache projects 
 we can VM connectors (ActiveMQ, Camel, Mina) based on in-memory queue. I 
 took some time to see how CassandraDaemon starts servers and it's kinda of 
 hardcoded. I thought about changing configuration to be more like:
 
 servers:
 - class org.apache.cassandra.thrift.ThriftServer
 - class org.apache.cassandra.transport.Server
 
 Then we will be able to disable these servers for unit tests:
 servers:
 - class org.apache.cassandra.vm.VmServer
 
 This requires some small changes in daemon code and client libraries. I'm 
 not really deeply involved in cassandra stuff so I don't know the internal 
 architecture and implications thus I look forward for you to discuss this 
 topic.
 
 Cheers,
 Łukasz Dywicki
 --
 l...@code-house.org
 Twitter: ldywicki
 Blog: http://dywicki.pl
 Code-House - http://code-house.org
 
 



Re: Configuration of network connectors

2013-07-09 Thread Gary Dusbabek
You'd still end up needing to fork tests because of the singleton problem.
(Google that one.)

I say this without trying, but it shouldn't be terribly hard for you to
code up some fixture classes that allow you to test the storage layer, so
long as you can tolerate each test running in a forked VM. Same goes for
the ports, with the exception of JMX iirc.

Gary.


On Tuesday, July 9, 2013, Łukasz Dywicki wrote:

 Jeremy,
 Sadly it does not cover our case. We have unit tests and we want to test
 really basic things like mappings of data contained in cassandra to our
 model. For that we don't need cluster at all because in unit tests we don't
 want to test data distribution. We also would like to run everything in
 JVM, thus CCM written in Python is not really what we need.
 What we are looking for is minimal cassandra set up which could be
 embedded and used concurrently multiple times. For example we now use
 CassandraUnit:

 @Rule
 public CassandraUnit unit = new CassandraUnit(new EmptyDataSet(),
 embedded-cassandra.yaml);

 @Test
 public void fistTest() {
 // do something with data
 }

 @Test
 public void secondTest() {
 // do something else
 }

 In this set up JUnit will launch new CassandraDaemon for every test. If we
 set FORK_MODE per test then we may have two cassandra instances running at
 the same time. First test which launch CassandraDaemon will pass, second
 may fail due port usage conflict. That's why we thought about testing
 without network layer. This can save some time. It would be great because
 for some older hardware used by our developers it takes up to 9 minutes to
 run build with all unit tests. Some of this time is consumed by startup and
 shutdown of cassandra.

 Cheers,
 Łukasz Dywicki
 --
 l...@code-house.org javascript:;
 Twitter: ldywicki
 Blog: http://dywicki.pl
 Code-House - http://code-house.org

 Wiadomość napisana przez Jeremy Hanna 
 jeremy.hanna1...@gmail.comjavascript:;
 w dniu 9 lip 2013, o godz. 15:22:

  Have you seen https://github.com/pcmanus/ccm as described in
 http://www.datastax.com/dev/blog/ccm-a-development-tool-for-creating-local-cassandra-clustersor
  does that not fit your use case?
 
  On 9 Jul 2013, at 14:02, Łukasz Dywicki l...@code-house.orgjavascript:;
 wrote:
 
  Hello,
  First of all I would like to say hello to cassandra user and developer
 community. :)
 
  I write because we are using Cassandra in our unit tests and we have
 some troubles with network connectivity. We ca not run multiple cassandra
 instances during tests because we would need to randomize configuration of
 port and so on. For now if we try to fork our tests we get address already
 in use on one from two ports - native or thrift. In other apache projects
 we can VM connectors (ActiveMQ, Camel, Mina) based on in-memory queue. I
 took some time to see how CassandraDaemon starts servers and it's kinda of
 hardcoded. I thought about changing configuration to be more like:
 
  servers:
  - class org.apache.cassandra.thrift.ThriftServer
  - class org.apache.cassandra.transport.Server
 
  Then we will be able to disable these servers for unit tests:
  servers:
  - class org.apache.cassandra.vm.VmServer
 
  This requires some small changes in daemon code and client libraries.
 I'm not really deeply involved in cassandra stuff so I don't know the
 internal architecture and implications thus I look forward for you to
 discuss this topic.
 
  Cheers,
  Łukasz Dywicki
  --
  l...@code-house.org javascript:;
  Twitter: ldywicki
  Blog: http://dywicki.pl
  Code-House - http://code-house.org
 
 




Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Brandon Williams
+1

On Tue, Jul 9, 2013 at 4:20 AM, Sylvain Lebresne sylv...@datastax.comwrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)



Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Gary Dusbabek
+1
On Jul 9, 2013 3:21 AM, Sylvain Lebresne sylv...@datastax.com wrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)



Re: [VOTE] Release Apache Cassandra 2.0.0-beta1

2013-07-09 Thread Marcus Eriksson
+1


On Tue, Jul 9, 2013 at 11:20 AM, Sylvain Lebresne sylv...@datastax.comwrote:

 Cassandra 2.0 is coming along but we now need wider testing. So I propose
 the
 following artifacts for release as 2.0.0-beta1. Let it be clear that it is
 only
 a beta (and the first one at that), so we know it's not perfect, but the
 current goal is first and foremost to get better testing.

 sha1: fcdb39384e8570cf38c027f38b799181da06d56d
 Git:

 http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/2.0.0-beta1-tentative
 Artifacts:

 https://repository.apache.org/content/repositories/orgapachecassandra-117/org/apache/cassandra/apache-cassandra/2.0.0-beta1/
 Staging repository:
 https://repository.apache.org/content/repositories/orgapachecassandra-117/

 The artifacts as well as the debian package are also available here:
 http://people.apache.org/~slebresne/

 The vote will be open for 72 hours (longer if needed).

 [1]: http://goo.gl/LorY5 (CHANGES.txt)
 [2]: http://goo.gl/zEt5i (NEWS.txt)