Re: KDF API review, round 2

Michael StJohns Wed, 29 Nov 2017 09:44:10 -0800

On 11/29/2017 8:38 AM, Jamil Nimeh wrote:

On 11/28/2017 9:34 AM, Michael StJohns wrote:
On 11/28/2017 1:04 AM, Jamil Nimeh wrote:
Hi Mike, I know I said you made arguments in favor of specifying thekeys up front in init, but I'm still really uncomfortable withthis. It's been bothering me all day. Comments below:
Before I get to those:
1) Do you know of any protocol using a KDF where the key productioninformation is not known before you'd need to call the .init()?
I honestly don't. I think it's safe to say you probably don't need aKDF instance until you know at least the first object you want out ofit. But for the protocols I know of all the objects are known once acipher suite or proposal is agreed upon.
2) If you do, couldn't you simply provide an empty or null list ofkey derivation spec's to .init()?
You could, but that would end up necessitating two models ofoperation. One where we give a list up front of all objects and callderive actions with no parameters, and a second model where youspecify nothing and then provide object specs one-by-one. Each one haspros and cons, but trying to support both models I think would makethe API even more confusing.

I'm tempted to say that this case (where you don't know what theproductions will be before the init) doesn't exist. This degeneratecase is simply a keyed PRNG and could probably be handled byderiveBytes(int length).

If you really want to produce keys from a keyed PRNG where the keyobjects are not a mixin then support both deriveKey() andderiveKey(params), but have the second throw a RuntimeException if youcall it with a KDF initialized with a set of production params.

3) If you're doing a multiobject production from a single call to.init() do you expect in all cases to NOT include the production dataas mixins?
In all cases? I can't honestly say that. For the protocols I knowof, the individual object attributes (like length) are not mixins. But you later go on to say that you know of a couple protocols wherethey do. If we have real-world scenarios where individual objectlengths or other attributes really affect the keystream then I guesswe need to take that into account.
My problem is that I have use cases where ALL of my key productioninformation is used as mixins to the key stream. Now I could providea List<DerivationParameterSpec> as part of the KDF init algorithmparameter spec (kdfParams), but that means that I have to provide adifferent APS for each different key schedule (consider TLS1.3svarious calls). If you take out the List<DerivationParameterSpec> outof the .init() I'll end up having to do that and probably having toaccept null values for the deriveKey calls.
More in line.
On 11/27/2017 10:09 AM, Michael StJohns wrote:
On 11/27/2017 1:03 AM, Jamil Nimeh wrote:
HKDF and SP800-108 only deal with the creation of the key streamand ignore the issues with assigning the key stream tocryptographic objects. In the TLS version of HDKF, the L value ismandatory and only a single object is assigned per init/call to theKDF. An HSM can look at the HKDF label information and set theappropriate policies for the assigned cryptographic object (becauseif any of the label data changes, the entire key stream changes). That's not the case for the raw HKDF nor for any KDF that allowsfor multiple objects to be extracted out of a single key stream. Hence the per-component length values.
So enforce a no-zero-length key policy in your provider code. Youprobably can't affect the internals of the HSM, but you should beable to prevent it in the provider code. I can't get away from thefeeling that this could be dealt with in other ways besidesspecifying all this up-front.
The best way to understand this is to look at the PKCS11 TLS1.2 andbefore KDF stuff. The key production schedule was for an encryptionkey, an integrity key and two IVs, all from the same key stream. Itturns out that NOTHING the HSM could do could prevent the extractionof key material because changing the boundaries between each objectdid not change the key stream. In the TLS case (and IPSec for thatmatter), it's a simple matter to move confidential key material intonon-confidential IVs. However, even if you limit the production toonly confidential items, you still have a problem in that using thesame key material for different algorithms (e.g. using part of an AESkey as a single DES key) can lead to vulnerabilities.
TLS 1.3 fixed this problem by only doing single key productions foreach call to the KDF (and by adding the length of the production tothe mixins). Because of this, an HSM can look at the mixin data and"do the right thing" with respect to policy. If TLS1.3 had kept themultiple object production model, they would have included theper-object lengths in the KDF mixin data.
The HSM can do the right thing because the bits it can depend upon(in the TLS 1.3 case the label and the length) are included in themixin and not simply as part of the added on key creation stuff. Without this, there is nothing an HSM can do for enforcement becausechanging these inputs wouldn't change the key stream.
Ideally, there should be a complete object spec for each object tobe generated that is part of the mixins (label and context) for anyKDF. That allows an HSM to rely upon the object spec when settingpolicy controls for each generated object - and incidentally allowsfor a KDF to generate both public and non-public data in a secure way.
Between different generations of keystreams do you expect to havedifferent sets of policy controls? The KDF API has no way for youto set those things so I would assume those would be pretty static,or at least controlled outside the KDF API. If so, why is the KDFAPI concerning itself with how some HSM sets its policy on objectsit makes?
If I call a KDF with the same key but with different key productions,I *want* the key stream to be different. If I call it with the samekey but with same key productions, I *want* it to be the same. SayI call the KDF to produce two objects - an AES key of length 16 bytesand a HMAC-SHA256 key of also length 16 bytes. If I then call thesame kdf with the same key to produce two AES keys of length 16 bytes(same overall length of the key stream, but different objects), Iwould *really* like it if the second object did not have the same keybytes as the HMAC-SHA256 key of the first call. The only way I canensure this is to provide mixins that cause the entire key stream tochange if anything changes in the key production data.
With the KDFs I know of I don't see how you're going to pull thatoff. If you call HKDF with the same key, same salt, same info, you'regoing to create the same keystream, no matter how you choose tosegment it or what kinds of objects you wish to assign them to. Iguess in your implementation of a KDF you can choose to go through theDPS objects and mix their attributes in.
I had been working on the model that kdfParams provides the mixins(salt, context info, iteration count, whatever the KDF needs to make akeystream). That was based on how the KDFs I know of function. EvenTLS 1.3 keys can be done via HDKF in this manner by just adding thoselabel and length properties to the context info field. But if youwant your implementation to draw it from the DPS, I guess you could dothat. It just seems like two providers providing the same algorithmwould come to different answers.

See my other email. This only works if the context field is formattedin a way that the KDF can parse it. If the context field is opaque -none of this works.


Let me give you an example case:

SP800-108 counter based KDF with L of 32 bits (equal to 32 or 0x20) andcounter of 32 bits. Label is [BE EF CA FE]. Context is [01 03 10 00 0010].

I'm producing a AES Key and an IV. Does the calling sequence givethe provider any information about how many keys and IVs are beingproduced or how much key stream to assign to each?

How about - Same underlying KDF, but the context is {{key (01), aes(03), 16 bytes}, {iv (0), generic (0), 16 bytes} which the providerreads and translates into [01 03 10 00 00 10] and calculates L from thesums of the lengths. The same key stream is produced in both of thesecases, but in the second case the key production has enforceable policybecause the calling sequence provides non-opaque information.

If the mixins include policy hints (key type, key length, label, etc)then the HSM can rely upon those and set policy accordingly for theobjects.
I think I alluded to that up above with TLS 1.3 key derivation usingHKDF. The kdfParams APS for an HKDF-Expand operation would providecontext specific info in the form of an HkdfLabel. You'd have thekey-specific info you're talking about already as part of the mixin. You don't need to get it from the DPS directly.
So as long as you allow for the specification of all of theproduction objects as part of the .init() I'm good. A given KDFmight not require this - but I can't see any way of fixing thecurrent KDFs to work in HSMs without something like this.
As far as your (5) scenario goes, I can see how you can twiddlethe lengths to get the keystream output with zero-length keys andlarge IV buffers. But that scenario really glosses over whatshould be a big hurdle and a major access control issue thatstands outside the KDF API: That the attacker shouldn't haveaccess to the input keying material in the first place. Protectthe input keying material properly and their attack cannot be done.
Let me give you an example. I'm running an embedded HSM - toprotect TLS keys and to do all of the crypto. An attackercompromises the TLS server and now has access to the HSM. Noproblem - I'm going to notice if the attacker starts extraditinglarge amounts of data from the server (e.g. copies of the TLS inthe clear but possibly reencrypted data stream) so this isn't athreat or is it? Smart attacker does an extraction attack on theTLS 1.2 and before KDF and turns all of the key stream materialinto IV material and exports it from the HSM. The attacker now hasthe much smaller key material so he can send a few messages withthose keys and allow for the passive external interception of thetraffic and decryption thereof without the risk of detection of allthat traffic being sent. Alternately, I can place the key materialin a picture via steganography and publish it as part of the serverdata.
"If the attacker compromises a TLS server" is the part that getsme...we're using external software bugs/security holes as ajustification to make the KDF API in ways that I think are lessclear to the consumer, to cover one class of providers (HSMs).
This isn't a bug in the HSM - its a bug in thinking about how KDFswork/should work. There are three parts to a KDF - extraction ofentropy from the master secret, expansion of that entropy into a keystream and finally, assignment of that key stream to cryptographicobjects. HKDF and SP800-108 talk about the first two, but don'tconsider the implications of the third. Because of this, neitherTLS1.2 nor IPSec provide a KDF with secure key production.
When I referred to "bug" I wasn't talking about the HSM, I wasreferring to the server that could be compromised, but no matter. I'mnot sure there's any KDF API out there that talks about the thirdclass. Seems like they're all concerned with providing the firsttwo. I had envisioned our KDF API providing equivalent functionality.

HKDF-Expand-Label (the TLS1.3 KDF) actually does mostly the rightthing. The length and type (e.g. "iv" "key" "traffic upd" etc) are partof the mixins for each key assignment. A policy aware provider canenforce appropriateness on the assignment of that key stream to anappropriate object. It's not as clean (or as general) as I would like,but it's a vast improvement on TLS1.2 and before.


I hope to get IPSec KDFs fixed at some point as well.

Later, Mike

Re: KDF API review, round 2

Reply via email to