Re: bandwidth limiting Cassandra's replication and access control

Jonathan Mischo Wed, 11 Nov 2009 16:54:30 -0800

Let me put this in a slightly different (and probably fairly tactless)context here, for those of us without an infosec background.

1) Per-keyspace authentication allows for accountability and accesscontrol, at least at a rudimentary level. Without this, there areseveral types of applications that it would be impossible to useCassandra for, due to data security restrictions. Not justimpractical, but probably illegal in some cases.

2) If you have an intrusion into your network or a malicious userinternally (yes, this happens), you're completely unprotected withoutper-keyspace authentication.

3) Per-keyspace authentication is done once-per-session; if you useconnection pooling or persistent connections of any sort in yourapplication, this has virtually zero overhead.

4) It's REALLY bad infosec practice to have your data accessible byanyone, anytime, with no restrictions, which is how Cassandracurrently operates. While good network and application security canminimize the risks, at present, a typo or missed change in a cut andpaste can potentially damage your data, especially if you have CFs/SCFs that have the same name in two keyspaces.

5) This is common practice, pretty much everywhere. Every SQLdatabase, and almost every NOSQL database employs some form of AAA(Authentication, Authorization, and Accountability) functionality.It's pretty much an expected feature, though the granularity variesdepending on the implementation. AAA isn't expensive, but it sure cansave your butt later (and/or help you prosecute that Evil NastyPerson). It can also provide you with nifty "Unauthorized" exceptionsthat let you know you forgot to change the keyspace in yourconfiguration and almost did a Really Bad Thing.

6) Your bosses will like it (and you) a lot better if they don't haveto ask you where your organization's data went.

Hope this helps drive the point home that authentication andauthorization (as proposed) has near-zero overhead and is actuallypretty bad (read: slightly horrifying to auditors and security people,but really exciting for evil people and typo gremlins) to not have.


-Jon

On Nov 11, 2009, at 4:18 PM, Coe, Robin wrote:

Do you mean that users interacting with Cassandra through the CLIshould be restricted based on a security service? I agree.However, I believe the more common case is to front the Cassandraservice with an application layer, as you would expose a relationalbackend. For that kind of service, the application should controlthe security.
Basically, a user request to Cassandra is not stateful; any requestshould be able to perform a transaction against any node in thecluster, using an appropriate consistency model for the request.Requiring something like real time token synchronization across allnodes in a cluster seems outside of Cassandra’s eventualconsistency model.
Securing the data is intrinsically application-specific. While Icould see adding a plugin that makes the CLI access point secured, Iwould still want that to be made in a pluggable fashion, so it couldbe swapped out with a custom implementation.
Of course, this is just my point of view, but I make it after havingwritten several security layers on J2EE apps over the years and noneof them have been the same. Besides that, I want the data layer tobe highly efficient and in my opinion, it isn’t the work of the dataservice to impose security.
Robin.

From: Brandon Williams [mailto:[email protected]]
Sent: November 11, 2009 4:44 PM
To: [email protected]
Subject: Re: Re: bandwidth limiting Cassandra's replication andaccess control
On Wed, Nov 11, 2009 at 9:40 AM, Coe, Robin <[email protected]>wrote:IMO, auth services should be left to the application layer thatinterfaces to Cassandra and not built into Cassandra. In thetutorial snippet included below, the access being granted is at thecodebase level, not the transaction level. Since users of Cassandrawill generally be fronted by a service layer, the java securitymanager isn’t going to suffice. What this snippet could do, though,and may be the rationale for the request, is to ensure thatunauthorized users cannot instantiate a new Cassandra server.However, if a user has physical access to the machine on whichCassandra is installed, they could easily bypass that layer ofsecurity.
What if Cassandra IS the application you're exposing? Imagine alarge company that creates one large internal Cassandra deployment,and has multiple departments it wants to create separate keyspacesfor. You can do that now, but there's nothing except a gentlemen'sagreement to prevent one department from trashing anotherdepartment's keyspace, and accidents do happen. You can front theservice with some kind of application layer, but then you haveanother API to maintain, and you'll lose some performance this way.
-Brandon

Re: bandwidth limiting Cassandra's replication and access control

Reply via email to