On Thu, Oct 22, 2009 at 10:52:13AM -0700, Sean Hefty wrote:
> >> +static int ucma_set_ib_path(struct ucma_context *ctx,
> >> + struct ib_path_rec_data *path_data, size_t optlen)
> >> +{
> >> + struct ib_sa_path_rec sa_path;
> >> + struct rdma_cm_event event;
> >> + int ret;
> >> +
> >> + if (optlen != sizeof(*path_data))
> >> + return -EINVAL;
> >> +
> >> + if (path_data->flags != IB_PATH_GMP | IB_PATH_PRIMARY |
> >> + IB_PATH_OUTBOUND | IB_PATH_INBOUND)
> >> + return -EINVAL;
> >
> >This should accept an array here, to aid easing in APM support:
>
> I agree - I thought about this after sending the patch. This is more what I
> think is needed at this point:
>
> if ((optlen % sizeof(*path_data)) != 0) return -EINVAL;
>
> if (number of paths > 1)
> return -ENOSYS;
This is really no different than what you had at first.
My idea was to have the kernel search the array for the entries it
needs/supports. First one found wins. This provides future
API compatability.
Some future userspace, to support some future kernel APM, would pass in 2
entries:
IB_PATH_GMP | IB_PATH_PRIMARY | IB_PATH_OUTBOUND | IB_PATH_INBOUND
IB_PATH_GMP | IB_PATH_ALTERNATE | IB_PATH_OUTBOUND | IB_PATH_INBOUND
This shouldn't break old kernels.
Again, some even future kernel would support 6 entries, future user
space would send this:
IB_PATH_PRIMARY | IB_PATH_INBOUND
IB_PATH_PRIMARY | IB_PATH_OUTBOUND
IB_PATH_GMP | IB_PATH_PRIMARY | IB_PATH_OUTBOUND | IB_PATH_INBOUND
IB_PATH_ALTERNATE | IB_PATH_INBOUND
IB_PATH_ALTERNATE | IB_PATH_OUTBOUND
IB_PATH_GMP | IB_PATH_ALTERNATE | IB_PATH_OUTBOUND | IB_PATH_INBOUND
This shouldn't break old kernels either.
Maybe it is worth returning a postive integer from the syscall
indicating the number of paths the kernel accepted. Although I suspect
most use models of this will ignore that.
> >I like the PATH_PRIMARY/PATH_ALTERNATE idea,
> >
> >But I think IB_PATH_OUTBOUND_REV is still required.
>
> I'm not fond of the reverse idea. Why do you think it's needed?
Alignment with the SM. (Actually, it would be IB_PATH_INBOUND_REV)
You have a PR query where SGID=local, DGID=remote that gives you
IB_PATH_OUTBOUND. Then you do a query with SGID=remote, DGID=local,
that gives you IB_PATH_INBOUND. ie the SGID is different in each
result. This is necessary, non-reversible paths are uni-directional.
If you use a reversible path then the IB_PATH_OUTBOUND == IB_PATH_GMP
== IB_PATH_INBOUND_REV (ie the inbound side is always the one that has
SGID/DGID direction swapped)
Since this is going with the raw PR from the SA, I think it is wise to
not require any modification of the PR as it flows from the SA to the
kernel. Thus there are two PR representations for IB_PATH_INBOUND -
one that has SGID=local (_REV) and one that has SGID=remote.
So either IB_PATH_INBOUND_REV is explicitly part of the flags
structure, or IB_PATH_INBOUND | IB_PATH_OUTBOUND implicity means
IB_PATH_INBOUND_REV | IB_PATH_OUTBOUND
The implict behavior seems way to subtle to me, making it explicit
clears up which is which.
Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html