Re: KDF API review, round 2

Jamil Nimeh Wed, 29 Nov 2017 05:39:38 -0800


On 11/28/2017 9:34 AM, Michael StJohns wrote:

On 11/28/2017 1:04 AM, Jamil Nimeh wrote:
Hi Mike, I know I said you made arguments in favor of specifying the keys up front in init, but I'm still really uncomfortable with this. It's been bothering me all day. Comments below:
Before I get to those:
1) Do you know of any protocol using a KDF where the key production information is not known before you'd need to call the .init()?

I honestly don't. I think it's safe to say you probably don't need a KDF instance until you know at least the first object you want out of it. But for the protocols I know of, all the objects are known once a cipher suite or proposal is agreed upon.

2) If you do, couldn't you simply provide an empty or null list of key derivation specs to .init()?

You could, but that would end up necessitating two models of operation: one where we give a list of all objects up front and call derive actions with no parameters, and a second where you specify nothing and then provide object specs one by one. Each one has pros and cons, but trying to support both models would, I think, make the API even more confusing.

3) If you're doing a multi-object production from a single call to .init(), do you expect in all cases to NOT include the production data as mixins?

In all cases? I can't honestly say that. For the protocols I know of, the individual object attributes (like length) are not mixins. But you later go on to say that you know of a couple of protocols where they are. If we have real-world scenarios where individual object lengths or other attributes really affect the keystream, then I guess we need to take that into account.

My problem is that I have use cases where ALL of my key production information is used as mixins to the key stream. Now I could provide a List<DerivationParameterSpec> as part of the KDF init algorithm parameter spec (kdfParams), but that means that I have to provide a different APS for each different key schedule (consider TLS 1.3's various calls). If you take the List<DerivationParameterSpec> out of the .init() I'll end up having to do that and probably having to accept null values for the deriveKey calls.
More inline.
On 11/27/2017 10:09 AM, Michael StJohns wrote:
On 11/27/2017 1:03 AM, Jamil Nimeh wrote:
HKDF and SP800-108 only deal with the creation of the key stream and ignore the issues with assigning the key stream to cryptographic objects. In the TLS version of HKDF, the L value is mandatory and only a single object is assigned per init/call to the KDF. An HSM can look at the HKDF label information and set the appropriate policies for the assigned cryptographic object (because if any of the label data changes, the entire key stream changes). That's not the case for the raw HKDF nor for any KDF that allows for multiple objects to be extracted out of a single key stream. Hence the per-component length values.
So enforce a no-zero-length key policy in your provider code. You probably can't affect the internals of the HSM, but you should be able to prevent it in the provider code. I can't get away from the feeling that this could be dealt with in other ways besides specifying all this up front.
The best way to understand this is to look at the PKCS11 KDF stuff for TLS 1.2 and before. The key production schedule was for an encryption key, an integrity key and two IVs, all from the same key stream. It turns out that NOTHING the HSM could do could prevent the extraction of key material, because changing the boundaries between each object did not change the key stream. In the TLS case (and IPSec for that matter), it's a simple matter to move confidential key material into non-confidential IVs. However, even if you limit the production to only confidential items, you still have a problem in that using the same key material for different algorithms (e.g. using part of an AES key as a single DES key) can lead to vulnerabilities.
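A minimal Java sketch of the boundary-shifting problem described above. The class and method names are hypothetical, the 48-byte counter pattern only stands in for a real key stream, and the lengths (16, 16, 8, 8) are illustrative rather than any particular cipher suite's key block:

```java
import java.util.Arrays;

public class BoundaryShiftDemo {
    // Cuts one key stream into consecutive segments of the requested lengths.
    static byte[][] split(byte[] keyStream, int... lengths) {
        byte[][] parts = new byte[lengths.length][];
        int pos = 0;
        for (int i = 0; i < lengths.length; i++) {
            parts[i] = Arrays.copyOfRange(keyStream, pos, pos + lengths[i]);
            pos += lengths[i];
        }
        return parts;
    }

    public static void main(String[] args) {
        // Stand-in for the KDF output; a real stream would come from the PRF.
        byte[] keyStream = new byte[48];
        for (int i = 0; i < keyStream.length; i++) {
            keyStream[i] = (byte) i;
        }

        // Honest schedule: encryption key, integrity key, two IVs.
        byte[][] honest = split(keyStream, 16, 16, 8, 8);
        // Shifted schedule: zero-length keys, so everything lands in the "IVs".
        byte[][] shifted = split(keyStream, 0, 0, 24, 24);

        // The first "IV" of the shifted schedule starts with the encryption key bytes.
        byte[] leaked = Arrays.copyOfRange(shifted[2], 0, 16);
        System.out.println("IV exposes encryption key: "
                + Arrays.equals(honest[0], leaked));
    }
}
```

Because the lengths never feed into the stream itself, both schedules slice identical bytes; only the labels on the slices change.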
TLS 1.3 fixed this problem by only doing single key productions for each call to the KDF (and by adding the length of the production to the mixins). Because of this, an HSM can look at the mixin data and "do the right thing" with respect to policy. If TLS 1.3 had kept the multiple object production model, they would have included the per-object lengths in the KDF mixin data.
The HSM can do the right thing because the bits it can depend upon (in the TLS 1.3 case the label and the length) are included in the mixin and not simply as part of the added-on key creation stuff. Without this, there is nothing an HSM can do for enforcement because changing these inputs wouldn't change the key stream.
Ideally, there should be a complete object spec for each object to be generated that is part of the mixins (label and context) for any KDF. That allows an HSM to rely upon the object spec when setting policy controls for each generated object - and incidentally allows for a KDF to generate both public and non-public data in a secure way.
Between different generations of keystreams do you expect to have different sets of policy controls? The KDF API has no way for you to set those things so I would assume those would be pretty static, or at least controlled outside the KDF API. If so, why is the KDF API concerning itself with how some HSM sets its policy on objects it makes?
If I call a KDF with the same key but with different key productions, I *want* the key stream to be different. If I call it with the same key but with the same key productions, I *want* it to be the same. Say I call the KDF to produce two objects - an AES key of length 16 bytes and a HMAC-SHA256 key of also length 16 bytes. If I then call the same KDF with the same key to produce two AES keys of length 16 bytes (same overall length of the key stream, but different objects), I would *really* like it if the second object did not have the same key bytes as the HMAC-SHA256 key of the first call. The only way I can ensure this is to provide mixins that cause the entire key stream to change if anything changes in the key production data.

With the KDFs I know of I don't see how you're going to pull that off. If you call HKDF with the same key, same salt, same info, you're going to create the same keystream, no matter how you choose to segment it or what kinds of objects you wish to assign them to. I guess in your implementation of a KDF you can choose to go through the DPS objects and mix their attributes in.
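To make both sides of this concrete, here is a minimal Java sketch using HKDF-Expand (RFC 5869) over HMAC-SHA256. The hkdfExpand helper is hand-rolled for illustration and is not the API under review; the bind method and its textual "|AES:16|..." encoding are purely hypothetical stand-ins for mixing the production data into info, and a real scheme would need an unambiguous encoding:

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.util.Arrays;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

public class KeyStreamDemo {
    // HKDF-Expand (RFC 5869): T(i) = HMAC(PRK, T(i-1) || info || i).
    static byte[] hkdfExpand(byte[] prk, byte[] info, int length) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(prk, "HmacSHA256"));
            if (length > 255 * mac.getMacLength()) {
                throw new IllegalArgumentException("output too long");
            }
            byte[] out = new byte[length];
            byte[] t = new byte[0];
            int pos = 0;
            for (int i = 1; pos < length; i++) {
                mac.update(t);
                mac.update(info);
                mac.update((byte) i);
                t = mac.doFinal();
                int n = Math.min(t.length, length - pos);
                System.arraycopy(t, 0, out, pos, n);
                pos += n;
            }
            return out;
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException(e);
        }
    }

    // Hypothetical binding: append a description of the production to info.
    static byte[] bind(byte[] info, String production) {
        byte[] p = production.getBytes(StandardCharsets.UTF_8);
        byte[] out = Arrays.copyOf(info, info.length + p.length);
        System.arraycopy(p, 0, out, info.length, p.length);
        return out;
    }

    public static void main(String[] args) {
        byte[] prk = new byte[32];
        Arrays.fill(prk, (byte) 0x0b);
        byte[] info = "session context".getBytes(StandardCharsets.UTF_8);

        // Same key and info: the stream is identical however it is carved up,
        // so bytes 16..31 can be an HMAC key in one run and an AES key in the next.
        byte[] run1 = hkdfExpand(prk, info, 32);
        byte[] run2 = hkdfExpand(prk, info, 32);
        System.out.println("unbound streams equal: " + Arrays.equals(run1, run2));

        // With the production mixed in, a different production gives a different stream.
        byte[] bound1 = hkdfExpand(prk, bind(info, "|AES:16|HmacSHA256:16"), 32);
        byte[] bound2 = hkdfExpand(prk, bind(info, "|AES:16|AES:16"), 32);
        System.out.println("bound streams equal: " + Arrays.equals(bound1, bound2));
    }
}
```

Whether that binding lives in kdfParams or is drawn from the DPS objects is exactly the open design question; the sketch only shows that the keystream changes if, and only if, the production data reaches the PRF input.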

I had been working on the model that kdfParams provides the mixins (salt, context info, iteration count, whatever the KDF needs to make a keystream). That was based on how the KDFs I know of function. Even TLS 1.3 keys can be done via HKDF in this manner by just adding those label and length properties to the context info field. But if you want your implementation to draw it from the DPS, I guess you could do that. It just seems like two providers providing the same algorithm would come to different answers.

If the mixins include policy hints (key type, key length, label, etc.) then the HSM can rely upon those and set policy accordingly for the objects.

I think I alluded to that up above with TLS 1.3 key derivation using HKDF. The kdfParams APS for an HKDF-Expand operation would provide context-specific info in the form of an HkdfLabel. You'd have the key-specific info you're talking about already as part of the mixin. You don't need to get it from the DPS directly.
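For reference, a minimal Java sketch of how the HkdfLabel structure from RFC 8446 (section 7.1) serializes into the HKDF-Expand info field: a 2-byte output length, a length-prefixed label starting with "tls13 ", and a length-prefixed context. The class and method names here are hypothetical and not part of the proposed API:

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class HkdfLabelDemo {
    // Encodes: uint16 length; opaque label<7..255>; opaque context<0..255>.
    static byte[] hkdfLabel(int length, String label, byte[] context) {
        byte[] fullLabel = ("tls13 " + label).getBytes(StandardCharsets.US_ASCII);
        if (length < 0 || length > 0xFFFF
                || fullLabel.length < 7 || fullLabel.length > 255
                || context.length > 255) {
            throw new IllegalArgumentException("field out of range");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(length >>> 8);          // output length, high byte
        out.write(length & 0xFF);         // output length, low byte
        out.write(fullLabel.length);      // 1-byte label length
        out.write(fullLabel, 0, fullLabel.length);
        out.write(context.length);        // 1-byte context length
        out.write(context, 0, context.length);
        return out.toByteArray();
    }

    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // The same label with a different requested length gives different info,
        // and therefore a different key stream out of HKDF-Expand.
        System.out.println(hex(hkdfLabel(16, "key", new byte[0])));
        System.out.println(hex(hkdfLabel(32, "key", new byte[0])));
    }
}
```

Since both the label and the length sit inside info, a policy engine that inspects info sees exactly the values that shaped the output.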

So as long as you allow for the specification of all of the production objects as part of the .init() I'm good. A given KDF might not require this - but I can't see any way of fixing the current KDFs to work in HSMs without something like this.
As far as your (5) scenario goes, I can see how you can twiddle the lengths to get the keystream output with zero-length keys and large IV buffers. But that scenario really glosses over what should be a big hurdle and a major access control issue that stands outside the KDF API: that the attacker shouldn't have access to the input keying material in the first place. Protect the input keying material properly and their attack cannot be done.
Let me give you an example. I'm running an embedded HSM - to protect TLS keys and to do all of the crypto. An attacker compromises the TLS server and now has access to the HSM. No problem - I'm going to notice if the attacker starts exfiltrating large amounts of data from the server (e.g. copies of the TLS data stream, in the clear but possibly re-encrypted) so this isn't a threat, or is it? A smart attacker does an extraction attack on the TLS 1.2 and before KDF, turns all of the key stream material into IV material and exports it from the HSM. The attacker now has the much smaller key material, so he can send a few messages with those keys and allow for the passive external interception of the traffic and decryption thereof without the risk of detection of all that traffic being sent. Alternately, I can place the key material in a picture via steganography and publish it as part of the server data.
"If the attacker compromises a TLS server" is the part that gets me...we're using external software bugs/security holes as a justification to make the KDF API in ways that I think are less clear to the consumer, to cover one class of providers (HSMs).
This isn't a bug in the HSM - it's a bug in thinking about how KDFs work/should work. There are three parts to a KDF - extraction of entropy from the master secret, expansion of that entropy into a key stream and finally, assignment of that key stream to cryptographic objects. HKDF and SP800-108 talk about the first two, but don't consider the implications of the third. Because of this, neither TLS 1.2 nor IPSec provide a KDF with secure key production.

When I referred to "bug" I wasn't talking about the HSM, I was referring to the server that could be compromised, but no matter. I'm not sure there's any KDF API out there that talks about the third part. Seems like they're all concerned with providing the first two. I had envisioned our KDF API providing equivalent functionality.

The idea is to protect against extraction of the key material from an HSM _*even by authorized users of that key material*_.
That may well be a goal for the HSM, to be solved by the HSM or the provider that front-ends it. I do not see that as something to be solved by the KDF API.
It has to be solved by the KDF API because the only way this works is if the mixin data for all the productions is included prior to producing the first object.
KDFs don't currently do this well. Adding the overall length and per-component length stuff as well as a per-component spec to the data used to derive the key stream means that 1) changes to any of those change the entire key stream, 2) the per-component spec data may be used by the security module policy engine to enforce restrictions and 3) because of (1) and (2) calling the KDF a second time gets me exactly the same objects rather than just the same key stream. The last isn't very important in a software-based security domain, but turns out to have real implications for policy-enforcing security modules.
But there aren't KDFs that take individual component lengths as inputs, so alterations to individual key component lengths don't change the keystream (unless someone decides to write a KDF that does, but none that I've seen do). With the way the KDF API is taking shape, there's no enforcement that you get the same objects - none of that is locked to the instance. It can change between inits. If you reinitialize with the same key and KDF parameters, whether you specify all objects up front or one at a time in derive calls, you can still ask for a different set of output objects. And changing lengths on various objects won't matter because HKDF, Counter-mode KDF, Feedback-mode KDF...none of those care a whit about individual component lengths. All they care about is the total length of the keystream (and HKDF only cares about that to make sure it's not more than 255 * HMAC output length).
Yes but.
TLS 1.3 will NOT be an HKDF KDF instantiation; it will be a TLS 1.3 KDF instantiation (which uses the HKDF function internally) that will limit production to a single object per init, with a known set of labels and using L as a mixin. Because that's how TLS 1.3 dealt with the problem.
AND - there are KDFs that take individual component lengths as inputs - in at least two proprietary protocols that I know of. Mostly though, with the trend to AEAD algorithms, most of the protocols are tending to move to a single production per init (since they don't need both an integrity and confidentiality key nor an IV per se).
This gets worse when you realize that the KDF key is, under it all, either a HASH, HMAC or CMAC key, and all of those algorithms produce public data. Ideally you need a way of preventing a KDF key from calling the raw HASH/HMAC/CMAC functions directly (and vice versa).
I don't see how we'd prevent this in software. If I've got a key as input to a KDF (a SecretKey) there's no way to prevent it being used by anything else that takes a SecretKey. If you need to prevent that in hardware then that seems like a concern for your provider or the HSM itself.
If I tag a key as MasterSecret (where MasterSecret is not a subinterface of SecretKey, but is of Key) and use MasterSecret instead of Key in .init().....
The HSM (and the JVM) would both identify functions that can be used with that key and keep others away.
This is what I was talking about with cryptographic type safety in my last email - the idea that the Key objects be as strongly typed as possible to prevent them from being used inappropriately or in ways that mathematically bypass security. Take a KDF with a PRF of CMAC-AES-128. The KDF is meant to produce secret data (a key stream for the production of keys), but a CMAC-AES-128 is meant to produce public data (an integrity tag over a set of data). Given that the KDF algorithm is simply a wrapper around the PRF to allow for the production of multiple blocks of data, it's trivial - if you have access to *use* the KDF key - to use it with the CMAC function to extract the key stream.
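A minimal Java sketch of that bypass. Two assumptions to flag loudly: it uses HMAC-SHA256 as the PRF instead of CMAC-AES-128 (only so the example runs on a stock JDK without an extra provider), and counterKdf is a simplified counter-mode construction in the spirit of SP 800-108, not a conforming implementation. All names are hypothetical:

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.util.Arrays;
import javax.crypto.Mac;
import javax.crypto.SecretKey;
import javax.crypto.spec.SecretKeySpec;

public class PrfBypassDemo {
    // The raw PRF: an ordinary MAC whose output is normally treated as public.
    static byte[] prf(SecretKey key, byte[] data) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(key);
            return mac.doFinal(data);
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException(e);
        }
    }

    // PRF input for block i: 4-byte big-endian counter followed by the fixed input.
    static byte[] counterInput(int i, byte[] fixedInput) {
        return ByteBuffer.allocate(4 + fixedInput.length)
                .putInt(i).put(fixedInput).array();
    }

    // Simplified counter-mode KDF: key stream = PRF(key, 1 || fixed) || PRF(key, 2 || fixed) ...
    static byte[] counterKdf(SecretKey key, byte[] fixedInput, int blocks) {
        ByteBuffer out = ByteBuffer.allocate(blocks * 32);
        for (int i = 1; i <= blocks; i++) {
            out.put(prf(key, counterInput(i, fixedInput)));
        }
        return out.array();
    }

    public static void main(String[] args) {
        byte[] raw = new byte[32];
        Arrays.fill(raw, (byte) 0x42);
        SecretKey kdfKey = new SecretKeySpec(raw, "HmacSHA256");
        byte[] fixed = "label and context".getBytes(StandardCharsets.UTF_8);

        // What the KDF is supposed to keep secret.
        byte[] secretStream = counterKdf(kdfKey, fixed, 2);

        // A caller allowed only to *use* the key asks for an ordinary "integrity tag".
        byte[] tag = prf(kdfKey, counterInput(1, fixed));

        System.out.println("tag equals first key stream block: "
                + Arrays.equals(tag, Arrays.copyOf(secretStream, 32)));
    }
}
```

Nothing in the key's type stops the second call, which is the gap the stronger typing is meant to close.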
In the HSM I can *somewhat* combat this by (in PKCS11) attributing the key, but how do I get those attributes on the key in the first place if I'm using a Java front end?
In software this isn't a big thing as the confidential key material and the public CMAC integrity tag are both in the same software domain. But over the years we've tried to do the right thing (see javax.security.auth.Destroyable for example) by thinking about security past the limitations of what we can get in software.
For KDFs I'd add a javax.crypto.MasterSecret interface extending Key, Destroyable (and pretty much a clone of SecretKey) and a javax.crypto.spec.MasterSecretSpec implementing KeySpec and MasterSecret (and a clone of SecretKeySpec) to tag these secret keys as for use only with a KDF.
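A rough Java sketch of what those two types might look like. Neither exists in the JDK; they are written here as nested types of a hypothetical MasterSecretSketch class (rather than in javax.crypto) so the example compiles on its own, and initKdf is an invented stand-in for the KDF's .init():

```java
import java.security.Key;
import java.security.spec.KeySpec;
import java.util.Arrays;
import javax.security.auth.Destroyable;

public class MasterSecretSketch {
    // Hypothetical marker type: a Key, but deliberately NOT a SecretKey.
    interface MasterSecret extends Key, Destroyable { }

    // Hypothetical counterpart of SecretKeySpec for KDF-only keys.
    static final class MasterSecretSpec implements KeySpec, MasterSecret {
        private static final long serialVersionUID = 1L;
        private final byte[] key;
        private final String algorithm;
        private boolean destroyed;

        MasterSecretSpec(byte[] key, String algorithm) {
            this.key = key.clone();
            this.algorithm = algorithm;
        }

        @Override public String getAlgorithm() { return algorithm; }
        @Override public String getFormat() { return "RAW"; }

        @Override public byte[] getEncoded() {
            if (destroyed) {
                throw new IllegalStateException("key destroyed");
            }
            return key.clone();
        }

        @Override public void destroy() {
            Arrays.fill(key, (byte) 0);
            destroyed = true;
        }

        @Override public boolean isDestroyed() { return destroyed; }
    }

    // Stand-in for a KDF init that accepts only MasterSecret keys.
    static String initKdf(MasterSecret secret) {
        return "KDF initialised with " + secret.getAlgorithm() + " master secret";
    }

    public static void main(String[] args) {
        MasterSecretSpec ms = new MasterSecretSpec(new byte[] {1, 2, 3, 4}, "HKDF");
        System.out.println(initKdf(ms));
        // initKdf(new javax.crypto.spec.SecretKeySpec(new byte[16], "AES"));
        // The line above would not compile: SecretKeySpec is not a MasterSecret.
        ms.destroy();
        System.out.println("destroyed: " + ms.isDestroyed());
    }
}
```

The compile-time check only covers APIs typed on MasterSecret. Existing entry points such as Mac.init take a plain java.security.Key, so keeping a MasterSecret away from them would still fall to a runtime check in the provider or the HSM.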
Mike
