Re: Mongo DB vs Cassandra

2018-05-31 Thread Russell Bateman

Sudhakar,

MongoDB will accommodate loading CSV without regard to schema while 
still creating identifiable "columns" in the database, but you'll have 
to predict or back-impose some schema later if you're going to create 
indices for fast searching of the data. You can perform searching of 
data without indexing in MongoDB, but it's slower.


Cassandra will require you to understand the schema, i.e.: what the 
columns are up front unless you're just going to store the data without 
schema and, therefore, without ability to search effectively.


As suggested already, you should share more detail if you want good 
advice. Both DBs are excellent. Both do different things in different ways.


Hope this helps,
Russ

On 05/31/2018 05:49 AM, Sudhakar Ganesan wrote:


Team,

I need to make a decision on Mongo DB vs Cassandra for loading the csv 
file data and store csv file as well. If any of you did such study in 
last couple of months, please share your analysis or observations.


Regards,

Sudhakar

Legal Disclaimer :
The information contained in this message may be privileged and 
confidential.
It is intended to be read only by the individual or entity to whom it 
is addressed
or by their designee. If the reader of this message is not the 
intended recipient,

you are on notice that any distribution of this message, in any form,
is strictly prohibited. If you have received this message in error,
please immediately notify the sender and delete or destroy any copy of 
this message!




Re: JVM Tuning post

2018-04-11 Thread Russell Bateman
Nice write-up. G1GC became the default garbage collection mechanism 
beginning in Java 9, right?



On 04/11/2018 09:05 AM, Joao Serrachinha wrote:

Many thanks to "The Last Pickle", also for TWCS advice's. Especially for
C* new features on version 3.11.1

Regards,
João

On 11/04/2018 16:00, Jon Haddad wrote:

Hey folks,

We (The Last Pickle) have helped a lot of teams with JVM tuning over
the years, finally managed to write some stuff down.  We’re hoping the
community finds it helpful.

http://thelastpickle.com/blog/2018/04/11/gc-tuning.html

Jon







Re: Cassandra Needs to Grow Up by Version Five!

2018-02-20 Thread Russell Bateman
I ask Cassandra to be a database that is high-performance, highly 
scalable with no single point of failure. Anything "cool" that's added 
beyond must be added only as a separate, optional ring around Cassandra 
and must not get in the way of my usage.


Yes, I would like some help with some of what's listed here, but you 
should understand that most shops adopting Cassandra are already going 
to have DevOps/database management personnel, expertise, methods, 
protocols and, in some instances, tools already in place. Even the small 
shop I work in has guys saddled with taking care of Cassandra (I'm a 
developer and not one of these guys) and seem not to share these 
concerns because they've already got it covered (like the specific YAML 
configuration complaint).


If there were an option or two I'd like to see, one would be the ability 
to duplicate data centers exactly (as part of what we stipulate when 
creating our KEYSPACE), but this is probably something I want because of 
what we were doing up until or what we wanted when we adopted Cassandra 
for our future product direction. I would also like to see an option in 
Cassandra configuration for absolutelylocking out access to certain 
commands (like DROP TABLE, DROP INDEXand DELETE).


From my point of view as a developer, I've had to do many of these 
things also for MongoDB, PostgreSQL, MySQL and other databases over my 
career.


I'm not criticizing these concerns and suggestions. I'm just pointing 
out that, in my opinion, not everything said here is in the realm of, 
"duh, Cassandra needs to grow up."


There's so much right about Cassandra, from the great, unequaled 
technology to the very liberal licensing model without which I could not 
be here.


Russ Bateman


On 02/18/2018 10:39 PM, Kenneth Brotman wrote:


Cassandra feels like an unfinished program to me.  The problem is not 
that it’s open source or cutting edge.  It’s an open source cutting 
edge program that lacks some of its basic functionality.  We are all 
stuck addressing fundamental mechanical tasks for Cassandra because 
the basic code that would do that part has not been contributed yet.


Ease of use issues need to be given much more attention.  For an 
administrator, the ease of use of Cassandra is very poor.


Furthermore, currently Cassandra is an idiot.  We have to do 
everything for Cassandra. Contrast that with the fact that we are in 
the dawn of artificial intelligence.


Software exists to automate tasks for humans, not mechanize humans to 
administer tasks for a database.  I’m an engineering type.  My job is 
to apply science and technology to solve real world problems.  And 
that’s where I need an organization’s I.T. talent to focus; not in 
crank starting an unfinished database.


For example, I should be able to go to any node, replace the 
Cassandra.yaml file and have a prompt on the display ask me if I want 
to update all the yaml files across the cluster.  I shouldn’t have to 
manually modify yaml files on each node or have to create a script for 
some third party automation tool to do it.


I should not have to turn off service, clear directories, restart 
service in coordination with the other nodes.  It’s already a computer 
system.  It can do those things on its own.


How about read repair.  First there is something wrong with the name.  
Maybe it should be called Consistency Repair.  An administrator 
shouldn’t have to do anything.  It should be a behavior of Cassandra 
that is programmed in. It should consider the GC setting of each node, 
calculate how often it has to run repair, when it should run it so all 
the nodes aren’t trying at the same time and when other circumstances 
indicate it should also run it.


Certificate management should be automated.

Cluster wide management should be a big theme in any next major 
release. What is a major release?  How many major releases could a 
program have before all the coding for basic stuff like installation, 
configuration and maintenance is included!


Finish the basic coding of Cassandra, make it easy to use for 
administrators, make is smart, add cluster wide management.  Keep 
Cassandra competitive or it will soon be the old Model T we all 
remember fondly.


I ask the Committee to compile a list of all such items, make a plan, 
and commit to including the completed and tested code as part of major 
release 5.0.  I further ask that release 4.0 not be delayed and then 
there be an unusually short skip to version 5.0.


Kenneth Brotman





Re: Policy on 3.11.0 packages...

2017-12-06 Thread Russell Bateman

Ah, thanks, this will do, I think.

On 12/06/2017 10:49 AM, Michael Shuler wrote:

Old packages are always in the deb repo pool, but reprepro-built
metadata lists only the latest with apt. `wget` and `dpkg` are your
friends here. RPM should "just work" for old packages, I think.

3.11.0 is still fetchable and installable:
(bintray redirect on deb repo)
http://www.apache.org/dist/cassandra/debian/pool/main/c/cassandra/
http://www.apache.org/dist/cassandra/redhat/311x/

Full release archives for every release ever - drill down under debian
and redhat directories:
http://archive.apache.org/dist/cassandra/





Policy on 3.11.0 packages...

2017-12-06 Thread Russell Bateman
We maintain a custom-index plug-in that we haven't ported to 3.11.1 
which is a non-zero effort. (Stratio's Lucene plug-in is in the same 
boat here.)


I would like to inquire as to the non-availability of Debian and RPM 
packages for Cassandra 3.11.0 now that 3.11.1 is the latest. What's the 
thinking on this? Why does package support move on so quickly without 
back-support for most recent, stable version, etc?


Thanks,

Russ


Re: [RELEASE] Apache Cassandra 3.11.1 released

2017-10-11 Thread Russell Bateman

I got it on Linux Mint via the Update Manager this morning already.


On 10/11/2017 07:33 AM, Lucas Benevides wrote:

Hello Michael Schuler,

When will this version become available for upgrade from apt-get? I 
visited the address http://www.apache.org/dist/cassandra/debian and 
there was no version 3111.


To me it is easier to upgrade the nodes this way as I am in a lab, not 
in a production site.


Thanks in advance,
Lucas Benevides


2017-10-10 18:14 GMT-03:00 Michael Shuler >:


The Cassandra team is pleased to announce the release of Apache
Cassandra version 3.11.1.

Apache Cassandra is a fully distributed database. It is the right
choice
when you need scalability and high availability without compromising
performance.

http://cassandra.apache.org/

Downloads of source and binary distributions are listed in our
download
section:

http://cassandra.apache.org/download/


This version is a bug fix release[1] on the 3.11 series. As always,
please pay attention to the release notes[2] and Let us know[3] if you
were to encounter any problem.

Enjoy!

[1]: (CHANGES.txt) https://goo.gl/QFBuPn
[2]: (NEWS.txt) https://goo.gl/vHd41x
[3]: https://issues.apache.org/jira/browse/CASSANDRA


-
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org

For additional commands, e-mail: dev-h...@cassandra.apache.org







Re: How do I install Cassandra on AWS

2017-10-03 Thread Russell Bateman

http://lmgtfy.com/?q=how+to+install+cassandra+on+aws


On 10/03/2017 05:28 AM, Lutaya Shafiq Holmes wrote:

  How do I install Cassandra on AWS- Amazon web services

The instructions are not listed there

On 10/2/17, Michael Shuler  wrote:

On 10/02/2017 10:53 AM, Lutaya Shafiq Holmes wrote:

How do I install Cassandra on AWS- Amazon web services

Follow the installation instructions on the following page, relevant to
your OS of choice:

   http://cassandra.apache.org/download/

Let the list know if you have any problems!

--
Kind regards,
Michael

-
To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
For additional commands, e-mail: user-h...@cassandra.apache.org








The future: Java 9 and the disappearance of CLASSPATH

2017-09-21 Thread Russell Bateman

We're very new to Cassandra.

We implement org.apache.index.Indexdropping a JAR containing our 
custom-index service into Cassandra's /lib/ subdirectory because this 
subdirectory is on the classpath. It's early days yet, but I thought I'd 
ask about the plans for Java 9 given that Jigsaw sort of closes the door 
on classpath (though it doesn't shut and lock it absolutely).


What are Cassandra's plans in this direction? Do I have anything to fear 
long-term? Given the importance of Stratio's Lucene index extension, 
which uses this mechanism too, I'd guess no one wants to do anything 
that would destroy that either, but I need to ask.


Many thanks,

Russ


Re: ExceptionInInitializerError encountered during startup

2017-08-22 Thread Russell Bateman

Reporting back...

I gave up and did this work in a stand-alone project where 
/EmbeddedCass//andraServ//erHelper.startEmbeddedCassandra()/ works fine. 
I think now that Cassandra's dependency upon /slf4j/ clashes with what 
we've had to do in our greater product to regulate which version of 
/slf4j/ is included by the myriad, disparate components (see error log 
in original post). Maybe what we're doing, mostly, requiring 1.7.25 and 
excluding (via Maven) any linked-in /slf4j/ from any of many components, 
is getting us into trouble with Cassandra. You can't mix and match 
/slf4j/ versions. There has been lots of hair-pulling over /slf4j/ as it 
is and this is not a welcome development.


Thanks.


On 08/22/2017 12:34 PM, Russell Bateman wrote:


Thanks, Myrle. This confirms what I've tried so far. The problem may 
be an assumed requirement, such as the YAML file and perhaps 
/log4j-embedded-cassandra.properties/. But, I'm supplying both of 
those. This has something to do with /slf4j/ logging, a logger that 
has no name when it goes to get it. It's unclear how it was supposed 
to get a name.



On 08/22/2017 08:48 AM, Myrle Krantz wrote:

On Tue, Aug 22, 2017 at 4:21 PM, Russell Bateman<r...@windofkeltia.com>  wrote:

As this was my first post to this forum, I wonder if someone would reply to
it if only to prove to myself that I've not posted to /dev/null as it were
even if there's no answer or the question is stupid, etc. (Note: I am
getting other forum posts, but maybe what I've posted didn't reach the
forum?)

Profuse thanks,

Russ

This will be my second post to this forum : o).  We're using embedded
Cassandra in our component tests as a junit ExternalResource, together
with datastax.  Here's some of what our start code looks like:
The original code can be found here:
https://github.com/mifosio/test/blob/develop/src/main/java/io/mifos/core/test/fixture/cassandra/CassandraInitializer.java

An example yaml file with the properties requested here can be found:
https://github.com/mifosio/portfolio/blob/develop/service/src/main/resources/application.yml

I use this hundreds of times a day and it works, but because our use
case is kind of special (multi-tenancy via keyspaces and multiple data
stores initialized as TestRules), you may have to noodle through what
we've done a bit to get your stuff working.

Greets,
Myrle

public final class CassandraInitializer {
   public void initialize() throws Exception {

 Builder clusterBuilder = (new
Builder()).withClusterName(System.getProperty("cassandra.clusterName"));
 ContactPointUtils.process(clusterBuilder,
System.getProperty("cassandra.contactPoints"));
 this.cluster = clusterBuilder.build();

 this.setup();
   }


   private void setup() throws Exception {
 if (!this.useExistingDB) {
   this.startEmbeddedCassandra();
   this.createKeyspaceSeshat();
 }

   }


   private void startEmbeddedCassandra() throws Exception {
 
EmbeddedCassandraServerHelper.startEmbeddedCassandra(TimeUnit.SECONDS.toMillis(30L));
   }
}



On 08/18/2017 05:49 PM, Russell Bateman wrote:

Cassandra version 3.9, -unit version 3.1.3.2.

In my (first ever) unit test, I've coded:

@BeforeClass public static void initFakeCassandra() throws
InterruptedException, IOException, TTransportException
{
 EmbeddedCassandraServerHelper.startEmbeddedCassandra( 2L );
}

Execution crashes down inside at

 at org.apache.cassandra.transport.Server.start(Server.java:128)
 at java.util.Collections$SingletonSet.forEach(Collections.java:4767)
 at
org.apache.cassandra.service.NativeTransportService.start(NativeTransportService.java:128)
 at
org.apache.cassandra.service.CassandraDaemon.startNativeTransport(CassandraDaemon.java:649)
 at
org.apache.cassandra.service.CassandraDaemon.start(CassandraDaemon.java:511)
 at
org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:616)
 at
org.cassandraunit.utils.EmbeddedCassandraServerHelper$1.run(EmbeddedCassandraServerHelper.java:129)
 at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException: name
 at
io.netty.util.internal.logging.AbstractInternalLogger.(AbstractInternalLogger.java:39)
 at
io.netty.util.internal.logging.Slf4JLogger.(Slf4JLogger.java:30)
 at
io.netty.util.internal.logging.Slf4JLoggerFactory.newInstance(Slf4JLoggerFactory.java:73)
 at
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:84)
 at
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:77)
 at io.netty.bootstrap.ServerBootstrap.(ServerBootstrap.java:46)
 ... 10 more

I am following the tutorial at Baeldung. Not sure where to go from here.
Stackoverflow response was not

Re: ExceptionInInitializerError encountered during startup

2017-08-22 Thread Russell Bateman
Thanks, Myrle. This confirms what I've tried so far. The problem may be 
an assumed requirement, such as the YAML file and perhaps 
/log4j-embedded-cassandra.properties/. But, I'm supplying both of those. 
This has something to do with /slf4j/ logging, a logger that has no name 
when it goes to get it. It's unclear how it was supposed to get a name.



On 08/22/2017 08:48 AM, Myrle Krantz wrote:

On Tue, Aug 22, 2017 at 4:21 PM, Russell Bateman <r...@windofkeltia.com> wrote:

As this was my first post to this forum, I wonder if someone would reply to
it if only to prove to myself that I've not posted to /dev/null as it were
even if there's no answer or the question is stupid, etc. (Note: I am
getting other forum posts, but maybe what I've posted didn't reach the
forum?)

Profuse thanks,

Russ

This will be my second post to this forum : o).  We're using embedded
Cassandra in our component tests as a junit ExternalResource, together
with datastax.  Here's some of what our start code looks like:
The original code can be found here:
https://github.com/mifosio/test/blob/develop/src/main/java/io/mifos/core/test/fixture/cassandra/CassandraInitializer.java

An example yaml file with the properties requested here can be found:
https://github.com/mifosio/portfolio/blob/develop/service/src/main/resources/application.yml

I use this hundreds of times a day and it works, but because our use
case is kind of special (multi-tenancy via keyspaces and multiple data
stores initialized as TestRules), you may have to noodle through what
we've done a bit to get your stuff working.

Greets,
Myrle

public final class CassandraInitializer {
   public void initialize() throws Exception {

 Builder clusterBuilder = (new
Builder()).withClusterName(System.getProperty("cassandra.clusterName"));
 ContactPointUtils.process(clusterBuilder,
System.getProperty("cassandra.contactPoints"));
 this.cluster = clusterBuilder.build();

 this.setup();
   }


   private void setup() throws Exception {
 if (!this.useExistingDB) {
   this.startEmbeddedCassandra();
   this.createKeyspaceSeshat();
 }

   }


   private void startEmbeddedCassandra() throws Exception {
 
EmbeddedCassandraServerHelper.startEmbeddedCassandra(TimeUnit.SECONDS.toMillis(30L));
   }
}



On 08/18/2017 05:49 PM, Russell Bateman wrote:

Cassandra version 3.9, -unit version 3.1.3.2.

In my (first ever) unit test, I've coded:

@BeforeClass public static void initFakeCassandra() throws
InterruptedException, IOException, TTransportException
{
 EmbeddedCassandraServerHelper.startEmbeddedCassandra( 2L );
}

Execution crashes down inside at

 at org.apache.cassandra.transport.Server.start(Server.java:128)
 at java.util.Collections$SingletonSet.forEach(Collections.java:4767)
 at
org.apache.cassandra.service.NativeTransportService.start(NativeTransportService.java:128)
 at
org.apache.cassandra.service.CassandraDaemon.startNativeTransport(CassandraDaemon.java:649)
 at
org.apache.cassandra.service.CassandraDaemon.start(CassandraDaemon.java:511)
 at
org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:616)
 at
org.cassandraunit.utils.EmbeddedCassandraServerHelper$1.run(EmbeddedCassandraServerHelper.java:129)
 at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException: name
 at
io.netty.util.internal.logging.AbstractInternalLogger.(AbstractInternalLogger.java:39)
 at
io.netty.util.internal.logging.Slf4JLogger.(Slf4JLogger.java:30)
 at
io.netty.util.internal.logging.Slf4JLoggerFactory.newInstance(Slf4JLoggerFactory.java:73)
 at
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:84)
 at
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:77)
 at io.netty.bootstrap.ServerBootstrap.(ServerBootstrap.java:46)
 ... 10 more

I am following the tutorial at Baeldung. Not sure where to go from here.
Stackoverflow response was not helpful to me, I probably don't know enough
yet.

Thanks.



-
To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
For additional commands, e-mail: user-h...@cassandra.apache.org





Re: ExceptionInInitializerError encountered during startup

2017-08-22 Thread Russell Bateman
As this was my first post to this forum, I wonder if someone would reply 
to it if only to prove to myself that I've not posted to //dev/null/ as 
it were even if there's no answer or the question is stupid, etc. (Note: 
I am getting other forum posts, but maybe what I've posted didn't reach 
the forum?)


Profuse thanks,

Russ


On 08/18/2017 05:49 PM, Russell Bateman wrote:


Cassandra version 3.9, -unit version 3.1.3.2.

In my (first ever) unit test, I've coded:

@BeforeClass public static void initFakeCassandra() throws 
InterruptedException, IOException, TTransportException

{
EmbeddedCassandraServerHelper.startEmbeddedCassandra( 2L );
}

Execution crashes down inside at

at org.apache.cassandra.transport.Server.start(Server.java:128)
at java.util.Collections$SingletonSet.forEach(Collections.java:4767)
at 
org.apache.cassandra.service.NativeTransportService.start(NativeTransportService.java:128)
at 
org.apache.cassandra.service.CassandraDaemon.startNativeTransport(CassandraDaemon.java:649)
at 
org.apache.cassandra.service.CassandraDaemon.start(CassandraDaemon.java:511)
at 
org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:616)
at 
org.cassandraunit.utils.EmbeddedCassandraServerHelper$1.run(EmbeddedCassandraServerHelper.java:129)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)

at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException: name
at 
io.netty.util.internal.logging.AbstractInternalLogger.(AbstractInternalLogger.java:39)
at 
io.netty.util.internal.logging.Slf4JLogger.(Slf4JLogger.java:30)
at 
io.netty.util.internal.logging.Slf4JLoggerFactory.newInstance(Slf4JLoggerFactory.java:73)
at 
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:84)
at 
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:77)
at 
io.netty.bootstrap.ServerBootstrap.(ServerBootstrap.java:46)

... 10 more

I am following the tutorial at Baeldung. Not sure where to go from 
here. Stackoverflow response 
<https://stackoverflow.com/questions/33514745/cassandra-startup-failed-with-exception-exception-encountered-during-startup> 
was not helpful to me, I probably don't know enough yet.


Thanks.





ExceptionInInitializerError encountered during startup

2017-08-18 Thread Russell Bateman

Cassandra version 3.9, -unit version 3.1.3.2.

In my (first ever) unit test, I've coded:

@BeforeClass public static void initFakeCassandra() throws 
InterruptedException, IOException, TTransportException

{
EmbeddedCassandraServerHelper.startEmbeddedCassandra( 2L );
}

Execution crashes down inside at

at org.apache.cassandra.transport.Server.start(Server.java:128)
at java.util.Collections$SingletonSet.forEach(Collections.java:4767)
at 
org.apache.cassandra.service.NativeTransportService.start(NativeTransportService.java:128)
at 
org.apache.cassandra.service.CassandraDaemon.startNativeTransport(CassandraDaemon.java:649)
at 
org.apache.cassandra.service.CassandraDaemon.start(CassandraDaemon.java:511)
at 
org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:616)
at 
org.cassandraunit.utils.EmbeddedCassandraServerHelper$1.run(EmbeddedCassandraServerHelper.java:129)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)

at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException: name
at 
io.netty.util.internal.logging.AbstractInternalLogger.(AbstractInternalLogger.java:39)
at 
io.netty.util.internal.logging.Slf4JLogger.(Slf4JLogger.java:30)
at 
io.netty.util.internal.logging.Slf4JLoggerFactory.newInstance(Slf4JLoggerFactory.java:73)
at 
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:84)
at 
io.netty.util.internal.logging.InternalLoggerFactory.getInstance(InternalLoggerFactory.java:77)

at io.netty.bootstrap.ServerBootstrap.(ServerBootstrap.java:46)
... 10 more

I am following the tutorial at Baeldung. Not sure where to go from here. 
Stackoverflow response 
 
was not helpful to me, I probably don't know enough yet.


Thanks.