On 3/9/2013 7:16 AM, Jason Gunthorpe wrote:
On Mon, Sep 02, 2013 at 12:26:06PM +0300, Matan Barak wrote:
The correct comp_mask should always match the fields that are
initialized by the caller. That is simple to explain and easy to audit
for correctness.
I agree, but the request could be rejected if an unsupported
comp_mask field was given. The user should query the device
No, that is the exact opposite of what I said.
You are trying to make comp_mask indicate if the command should use
the data, I only want the comp_mask to indicate that the data has been
initialized (all new members must have a no-action value), they are
completely different things.
If the kernel treats unprovided fields as "no action" by default (for
new kernel with old user space), why does it matter if the user
initialized them with "no action" or the kernel just assumes so ? Why do
we need a comp_mask for this ?
I think that comp_mask should be used only for fields with an "action"
value that the user requests the kernel to handle.
We already have commands that don't touch every member of an input
structure depending on context. eg commands acting on a UD QP don't
touch fields that are used only by RC QPs.
We use attr_mask and the QP type in order to know which fields should be
treated.
capabilities, see if the feature he wants is supported and set the
fields accordingly. It's simple, it's like every other stack and
it's robust.
It is certainly not robust. You are relying on userspace to set the
correct value for the kernel version being used, and there is no way
for developers to test this because the latest kernel will accept all
values in comp_mask.
If treating comp_mask as "action" fields (instead of "initialized"
fields), comp_mask = 0 is always valid. It means that all extended
fields have their default/"older" values. In other words, the kernel
always knows how to support "older" user space.
That is horrible. Our carefully designed ABI compatability is worth
nothing in practice if commands return EINVAL on old kerenls!
Old and new kernels must behave consistently.
The way to handle comp_mask is to have new kernels *VALIDATE* and old
kernels ignore. New kernels will require a *no action* value in all
situations where the feature is not supported (eg an older HCA,
feature not enabled, feature not present in device caps, etc).
This way the new kernel runtime tests the input, in a manner that is
consistent with the handling of old kernels (old kernel treat
unsupported members as no-action). Testing your app on a new
kernel, with a HCA that does not support the feature is sufficent to
show that it will work on past kernels that do not support the
feature.
Basically, I think I understand the ABI you suggest. I think it could
lead to a situation that an old kernel will ignore an "action" field and
process the command differently than expected.
Saying that, I'm opened to other opinions.
Jason
Matan
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html