On Thu, Jan 29, 2015 at 11:14:30AM +0200, Or Gerlitz wrote:
> From: Majd Dibbiny <[email protected]>
> 
> The rq/sq->psn is 24 bits as defined in the IB spec, therefore ULPs and User
> space applications shouldn't use the 8 most significant bits in the 32 bits
> variables to avoid overflow in modify_qp.
> 
> Fixed the PSN usage for kernel ULPs and masked out the 8 most significant bits
> for User-space applications.
> 
> From now on, we check in ib_modify_qp_is_ok that the PSN doesn't overflow, and
> we fail if so.

So, I guess the actual problem here is this:

        get_random_bytes(&id_priv->seq_num, sizeof id_priv->seq_num);

?

So what we need is

        get_random_bytes(&id_priv->seq_num, sizeof id_priv->seq_num);
+       id_priv->seq_num &= 0xffffff;

>               if (qp_attr->qp_state == IB_QPS_RTR)
> -                     qp_attr->rq_psn = id_priv->seq_num;
> +                     qp_attr->rq_psn = id_priv->seq_num & 0xffffff;

All this duplicitive masking should be centralized at the seq_num
assignment, not sprinkled

>  int ib_modify_qp_is_ok(enum ib_qp_state cur_state, enum ib_qp_state 
> next_state,
> -                    enum ib_qp_type type, enum ib_qp_attr_mask mask,
> -                    enum rdma_link_layer ll)
> +                    enum ib_qp_type type, struct ib_qp_attr *attr,
> +                    enum ib_qp_attr_mask mask, enum rdma_link_layer ll)
>  {
>       enum ib_qp_attr_mask req_param, opt_param;
>  
> @@ -860,6 +860,12 @@ int ib_modify_qp_is_ok(enum ib_qp_state cur_state, enum 
> ib_qp_state next_state,
>       if (mask & ~(req_param | opt_param | IB_QP_STATE))
>               return 0;
>  
> +     if ((mask & IB_QP_SQ_PSN) && (attr->sq_psn & 0xff000000))
> +             return 0;
> +
> +     if ((mask & IB_QP_RQ_PSN) && (attr->rq_psn & 0xff000000))
> +             return 0;
> +

And since rdmacm has had this longstanding bug of generating > 24 bit
PSNs, this change seems really scary - very likely to break working
systems.

I think we'd be better off keeping the current behavior of masking
the PSN, not enforcing the limit.

> diff --git a/drivers/infiniband/ulp/ipoib/ipoib_cm.c 
> b/drivers/infiniband/ulp/ipoib/ipoib_cm.c
> index 56959ad..bdf35b0 100644
> +++ b/drivers/infiniband/ulp/ipoib/ipoib_cm.c
> @@ -294,7 +294,7 @@ static int ipoib_cm_modify_rx_qp(struct net_device *dev,
>               ipoib_warn(priv, "failed to init QP attr for RTR: %d\n", ret);
>               return ret;
>       }
> -     qp_attr.rq_psn = psn;
> +     qp_attr.rq_psn = psn & 0xffffff;
>       ret = ib_modify_qp(qp, &qp_attr, qp_attr_mask);

Please don't just sed/grep stuff without review. It was trivial for me
to find that this is unneeded:

        psn = prandom_u32() & 0xffffff;
        ret = ipoib_cm_modify_rx_qp(dev, cm_id, p->qp, psn);

> @@ -433,7 +433,7 @@ static int ipoib_cm_send_rep(struct net_device *dev, 
> struct ib_cm_id *cm_id,
>       rep.rnr_retry_count = req->rnr_retry_count;
>       rep.srq = ipoib_cm_has_srq(dev);
>       rep.qp_num = qp->qp_num;
> -     rep.starting_psn = psn;
> +     rep.starting_psn = psn & 0xffffff;

ditto:

        psn = prandom_u32() & 0xffffff;
[..]
        ret = ipoib_cm_send_rep(dev, cm_id, p->qp, &event->param.req_rcvd, psn);

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to