Re: bandwidth limiting Cassandra's replication and access control

Jonathan Mischo Wed, 11 Nov 2009 18:21:39 -0800

Ian,

All valid points, though using a common login is considered poor security. Usually you see a per-application login, which is generally just fine, and then anyone who might directly access the database has a named login for accountability reasons. Yes, some companies use a single common login, but that's just laziness. And yes, DBAs are people too, but if you limit who has access to your system, then you make your pool of potential security violators a lot smaller, and make investigations a lot faster (usually pointing to a connection from the wrong application...let's face it, developers aren't infallible, as much as some like to pretend otherwise).


-Jon

On Nov 11, 2009, at 7:40 PM, Ian Holsman wrote:

just one point on 5.
most places i've seen don't use DB auth anywhere. there is a common login, stored in a property file, sometimes stored in an internally-world-readable SVN repo. they usually use network ACLs to restrict access to good hosts (jump hosts). network ACLs have been tested for decades and they work. implementing your own auth is just asking for problems. It's too hard to do properly, and will probably never work well with the enterprise's existing auth systems.
If you have sensitive data being stored, ENCRYPT it, or use a 1-way hash instead of storing it.
Ideally with a user-supplied key which is not stored anywhere on disk.
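
A minimal sketch of the keyed one-way hash idea, in Python. It assumes the key is derived at run time from a passphrase the user supplies, so the key itself is never written to disk. The function names, the PBKDF2 iteration count, and the use of HMAC-SHA256 are illustrative assumptions, not anything taken from Cassandra.

```python
import hashlib
import hmac


def derive_key(passphrase: str, salt: bytes) -> bytes:
    """Derive a 32-byte key from a user-supplied passphrase.

    The passphrase is assumed to be entered at run time and kept only in
    memory. The salt is not secret and may be stored next to the data.
    """
    return hashlib.pbkdf2_hmac("sha256", passphrase.encode("utf-8"), salt, 200_000)


def one_way_token(key: bytes, sensitive_value: str) -> str:
    """Return a keyed one-way hash (HMAC-SHA256) to store instead of the raw value."""
    return hmac.new(key, sensitive_value.encode("utf-8"), hashlib.sha256).hexdigest()


def matches(key: bytes, candidate: str, stored_token: str) -> bool:
    """Check a candidate value against a stored token in constant time."""
    return hmac.compare_digest(one_way_token(key, candidate), stored_token)
```

With this shape, a DB dump contains only the tokens and the salt. Without the passphrase, a stored token cannot be checked against a guessed value, let alone reversed.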
sadly DBAs are people too, and it is pathetically easy for them to get all the data from a DB-dump.
On Nov 12, 2009, at 11:53 AM, Jonathan Mischo wrote:
Let me put this in a slightly different (and probably fairly tactless) context here, for those of us without an infosec background.
1) Per-keyspace authentication allows for accountability and access control, at least at a rudimentary level. Without this, there are several types of applications that it would be impossible to use Cassandra for, due to data security restrictions. Not just impractical, but probably illegal in some cases.
2) If you have an intrusion into your network or a malicious user internally (yes, this happens), you're completely unprotected without per-keyspace authentication.
3) Per-keyspace authentication is done once-per-session; if you use connection pooling or persistent connections of any sort in your application, this has virtually zero overhead.
4) It's REALLY bad infosec practice to have your data accessible by anyone, anytime, with no restrictions, which is how Cassandra currently operates. While good network and application security can minimize the risks, at present, a typo or missed change in a cut and paste can potentially damage your data, especially if you have CFs/SCFs that have the same name in two keyspaces.
5) This is common practice, pretty much everywhere. Every SQL database, and almost every NOSQL database employs some form of AAA (Authentication, Authorization, and Accountability) functionality. It's pretty much an expected feature, though the granularity varies depending on the implementation. AAA isn't expensive, but it sure can save your butt later (and/or help you prosecute that Evil Nasty Person). It can also provide you with nifty "Unauthorized" exceptions that let you know you forgot to change the keyspace in your configuration and almost did a Really Bad Thing.
6) Your bosses will like it (and you) a lot better if they don't have to ask you where your organization's data went.
Hope this helps drive the point home that authentication and authorization (as proposed) has near-zero overhead and is actually pretty bad (read: slightly horrifying to auditors and security people, but really exciting for evil people and typo gremlins) to not have.
-Jon
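
Points 3 and 5 in the list above (authenticate once per session, then get an "Unauthorized" error when the wrong keyspace is touched) can be sketched roughly as follows. This is a hypothetical illustration in Python, not Cassandra's API: `Gatekeeper`, `Session`, `Unauthorized`, the plain-text credential table, and the keyspace names are all assumptions made for the example.

```python
import hmac


class Unauthorized(Exception):
    """Raised on a failed login or on access to a keyspace that was not granted."""


class Session:
    """State kept for one authenticated connection (hypothetical)."""

    def __init__(self, user, keyspaces):
        self.user = user
        self.keyspaces = frozenset(keyspaces)

    def check(self, keyspace):
        # Per-request cost is a single set lookup; no credentials are re-checked.
        if keyspace not in self.keyspaces:
            raise Unauthorized(f"{self.user} may not access keyspace {keyspace!r}")


class Gatekeeper:
    """Authenticates once per session and hands back the granted keyspaces."""

    def __init__(self, credentials, grants):
        # Assumption for brevity: plain passwords; a real system would store hashes.
        self._credentials = credentials  # user -> password
        self._grants = grants            # user -> iterable of keyspace names
        self.auth_checks = 0             # counts how often credentials were verified

    def login(self, user, password):
        self.auth_checks += 1
        expected = self._credentials.get(user)
        if expected is None or not hmac.compare_digest(
            expected.encode("utf-8"), password.encode("utf-8")
        ):
            raise Unauthorized(f"login failed for {user!r}")
        return Session(user, self._grants.get(user, ()))
```

A pooled connection would call `login` once and then `session.check(keyspace)` on each request. A configuration that still points at another department's keyspace then fails with `Unauthorized` instead of silently writing to it.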

On Nov 11, 2009, at 4:18 PM, Coe, Robin wrote:
Do you mean that users interacting with Cassandra through the CLI should be restricted based on a security service? I agree. However, I believe the more common case is to front the Cassandra service with an application layer, as you would expose a relational backend. For that kind of service, the application should control the security.
Basically, a user request to Cassandra is not stateful; any request should be able to perform a transaction against any node in the cluster, using an appropriate consistency model for the request. Requiring something like real time token synchronization across all nodes in a cluster seems outside of Cassandra’s eventual consistency model.
Securing the data is intrinsically application-specific. While I could see adding a plugin that makes the CLI access point secured, I would still want that to be made in a pluggable fashion, so it could be swapped out with a custom implementation.
Of course, this is just my point of view, but I make it after having written several security layers on J2EE apps over the years and none of them have been the same. Besides that, I want the data layer to be highly efficient and in my opinion, it isn’t the work of the data service to impose security.
Robin.

From: Brandon Williams [mailto:[email protected]]
Sent: November 11, 2009 4:44 PM
To: [email protected]
Subject: Re: Re: bandwidth limiting Cassandra's replication and access control
On Wed, Nov 11, 2009 at 9:40 AM, Coe, Robin <[email protected]> wrote:
IMO, auth services should be left to the application layer that interfaces to Cassandra and not built into Cassandra. In the tutorial snippet included below, the access being granted is at the codebase level, not the transaction level. Since users of Cassandra will generally be fronted by a service layer, the java security manager isn’t going to suffice. What this snippet could do, though, and may be the rationale for the request, is to ensure that unauthorized users cannot instantiate a new Cassandra server. However, if a user has physical access to the machine on which Cassandra is installed, they could easily bypass that layer of security.
What if Cassandra IS the application you're exposing? Imagine a large company that creates one large internal Cassandra deployment, and has multiple departments it wants to create separate keyspaces for. You can do that now, but there's nothing except a gentlemen's agreement to prevent one department from trashing another department's keyspace, and accidents do happen. You can front the service with some kind of application layer, but then you have another API to maintain, and you'll lose some performance this way.
-Brandon
--
Ian Holsman
[email protected]
