Hi everyone.
This is very possibly a newb question (or series of questions), and if
so I apologize in advance. I scoured everywhere I could think of for the
last couple of days trying to find information on this and came up
empty, but maybe I just didn't know the right terms to search for.
--- Background ---
I was reverse-engineering a system recently and came across an issue
that I know from experience and training is pretty widespread when
developers without a strong cryptography background use cryptography
without thinking things through: although strong cryptography (AES) was
used, and the key was stored securely, the system itself unintentionally
provided a means for an attacker to decrypt arbitrary data without ever
knowing the key. I have a set of recommendations in mind for how to
avoid this type of vulnerability, but I'd like to sanity-check them with
people who actually do have a cryptography background.
What I'm hoping to avoid is being the security guy who makes a
recommendation for improving something, but unintentionally introduces a
different vulnerability as a result. I have gotten pretty good at
exploiting commonly-made mistakes in software that uses cryptography,
but I am not a cryptography expert, or even cryptography adept.
The system in question uses a fairly common mechanism where the state of
certain non-sensitive variables is maintained on the client by means of
encrypted data which the client doesn't have the key to decrypt. The
only reason for the encryption is to prevent the client from tampering
with the data. This allows multiple different load-balanced nodes on the
back-end to respond to requests from the same client without having to
sync their state. Think of ASP.NET's ViewState, except that here the
variables are broken out into individual components instead of there
being one giant encrypted blob that contains all of the data.
Like many systems that use this model, there is a flaw that would
(probably?) be trivial in the absence of other factors: some of those
non-sensitive values are displayed back to the user after being
decrypted. In other words, as I mentioned above, the system includes the
unintentional ability for users to decrypt arbitrary data, as long as
that data was encrypted using the same key as the data it actually expects.
Unfortunately, there is other - sensitive - data in the system which is
also encrypted using the same key. It's data that must be stored in a
reversibly-encrypted format, but which end users should not be able to
retrieve. For the sake of argument, let's say it's the password for a
service account that the system uses to execute batch jobs, or a stored
credit card number used to make purchases by a customer. In both cases,
the system needs the ability to obtain the original value, but end users
do not - they just refer to the value abstractly, such as "use this
service account to execute this task", or "I want to make a purchase
using the card whose number ends in 1234". I am using examples from
other systems that I've looked at in the past here as opposed to the
current one, so please don't get stuck on those two specific cases.
Assume that there is a requirement that the system be able to decrypt
the data, but that it should not be accessible to end users after it's
originally entered.
The combination of those two aspects of the system means that if a user
can obtain the encrypted version of the second type of data, they can
feed it into their cookie, and the system will happily display to them
the decrypted value, because it doesn't know any better and because the
same key is used for both types of data.
Now, normally users can't actually obtain this sensitive data, even in
encrypted format - there are OS- and database-level permissions that are
supposed to prevent that - but over time, people have a tendency to
forget why certain things were configured the way they were, someone
makes a configuration change, and people who shouldn't be able to get to
the encrypted data are suddenly able to.
--- Proposal/Question ---
Of course, one of my main recommendations is going to be "don't use the
same key for multiple types of data!!", but because my background is in
systems engineering, one of my interests is building redundant safety
features into a system design so that any one failure or human error
won't completely compromise the system.
Part 1 of my proposal is that encrypted values should be wrapped in some
kind of metadata that identifies their type and delimits where the
plaintext value starts and ends (to help prevent block-shuffling attacks
that change the length of the desired plaintext, e.g. if someone makes a
mistake and uses ECB mode instead of CBC). Some really basic examples of
the plaintext might be:
<password>12345? That's the same combination as on my luggage!</password>
versus
<customThemeName>Autumn</customThemeName>
...or...
[value&&type::password&&length::52]12345? That's the same combination as
on my luggage![/value]
versus
[value&&type::customThemeName&&length::6]Autumn[/value]
This is obviously going to involve an increase in storage size. For
example, using the "Autumn" example and XML-style wrapper, with a block
size of 128 bits, the ciphertext balloons from (size of IV + 16 bytes)
to (size of IV + 48 bytes). The benefit I see is that it allows the
application to check that the data it has just decrypted is actually of
the type it expects, to prevent other types of data from being returned
to the user, and possibly to generate an alert if it was expecting e.g.
the name of a custom webpage theme but found a service account password
instead. There is a whole side-topic here related to making sure that
mechanism isn't itself exploitable, but I will set that aside because
then the email would be even longer.
As I said, the application I'm asking about uses strong encryption for
which there are no known known-plaintext attacks. However, as soon as I
thought of the above concept, I realized that if a practical
known-plaintext attack were ever discovered for AES, that scheme would
be setting up the system for compromise, because all values of a certain
type would have at least their first block of plaintext be
highly-predictable.
So part 2 of my proposal is that the plaintext include a throwaway
section *before* the actual data of concern, which has a length of one
block, and is filled with random (or at least pseudo-random) data that
is uniquely generated for each encrypted value. As long as CBC mode is
used, it seems to me that it would act like a second IV (a
"reinitialization vector", I guess? :)), except that it would never be
stored outside of the ciphertext, would be immediately discarded upon
decryption, and would never intentionally be reused. In other words, while I see
it as serving a purpose somewhat related to an IV, I also see them as
being complementary to each other instead of redundant - the IV helps
ensure that identical plaintext encrypts to different ciphertext, and
the "RIV" helps guard against future known-plaintext attacks when used
with CBC encryption mode.
This is probably stating the obvious, but in the case of one of the
examples above, if the encryption used were AES or another algorithm
with a block size of 128 bits, the plaintext modified according to both
parts 1 and 2 of my proposal would look like this:
XXXXXXXXXXXXXXXX[value&&type::password&&length::52]12345? That's the
same combination as on my luggage![/value]
...where XXXXXXXXXXXXXXXX represents 16 bytes of random/pseudorandom
values from 0-255. This whole long set of plaintext would then be
encrypted, appended to the IV, and finally stored.
To hammer home the storage downside: what was originally going to be an
80-byte value (16-byte IV + 64-byte ciphertext) has swollen to 128 bytes
(16-byte IV + 112-byte ciphertext, since the 111 bytes of prefixed and
wrapped plaintext pad out to 112). Because the password in question is
unusually long, let's say the scheme generally doubles or triples the
size of the stored data, and of course it increases CPU time for
encryption and decryption.
However, at least superficially, I think it greatly reduces the
likelihood of sensitive data being obtained by people who shouldn't have
it, because it provides a means of allowing the application to perform
"output validation" before displaying values to the user, and (unless
I'm mistaken) it guards against future known-plaintext attacks on the
encryption algorithm. In combination with using different encryption
keys for different types of data (and of course using unique IVs for
each encrypted value), it seems to me that it makes it much less likely
for any one mistake to compromise the system.
--- Wrapping up ---
I can definitely see an argument that this is a bunch of
over-engineering, but the type of flaw I'm describing is ridiculously
widespread in commercial software. I'd already run into it myself, and
then when I went to a SANS advanced web pen-testing course there was an
entire day dedicated to it and related defects.
I feel like I need to be able to make some recommendations to developers
who aren't cryptography experts that will let them design and build
systems that have a degree of built-in redundancy so that the failure of
any one design element related to the encrypted data won't result in a
complete compromise of that system. I need to be able to come up with a
simple recipe for that, and it can't be any one silver bullet (like "use
different keys for different types of data"), because single mechanisms
will always fail at some point. It also can't be something unrealistic
like "become an awesome cryptographer before you design any system that
uses cryptographic algorithms", because I know that's not going to
happen and I have to account for the reality of the situation. I feel
like it needs to be 3-5 overlapping design philosophies/patterns that
are easy to remember, in addition to the ones that are well-known like
"use existing, well-vetted cryptographic algorithms instead of writing
your own".
From a cryptography perspective, is this a stupid idea? Are there
better ways to achieve my goal? Am I introducing any new weaknesses into
the system? Has any element of this topic been done to death and I just
didn't know what to search for?
In any case, if anyone got to the end of this rambling email, thank you.
- Ben Lincoln
_______________________________________________
cryptography mailing list
cryptography@randombit.net
http://lists.randombit.net/mailman/listinfo/cryptography