> >>> I'm not sure that this results in a single error case.
> 
> >> Sorry... I'm not sure to follow, can you elaborate a bit more?
> 
> > We don't know what type of device responds to the ARP query.  It could
> come from an ethernet device.
> 
> Yep, this issue is really nasty..., but wait, you mentioned Ethernet,
> well, if the fabric is IB we do know that the GID in the REQ belongs
> to an HCA of that server node, b/c the client did route (== path
> query) resolution based on this DGID and their CM REQ landed in our
> hands, right?
> 
> We could come and say that we adhere to the IP --> GID mapping as
> present in the CM REQ (GID in the path, IP in the CMA header) and
> associate the newly created CMA ID with the device/port where this GID
> resides, no matter what the local IP stack has to say. This would
> work, but for people that seek HA for their apps, multiple sessions
> can be created over the same server hca/port where they wanted them to
> be spreaded...  what we can when such inconsistency is observed by the
> rdma-cm is the following
> 
> 1. print warning to the system log
> 2. reject the connection request
> 3. send Gratuitous ARP to update client nodes IPoIB neighbour IP --> GID
> mapping
> 
> I suggest that we 1st debate/agree on something that makes sense with
> IB and later see how it would work for RoCE

I wasn't referring to RoCE.  I was simply saying that this problem may result 
in multiple errors.  It's possible that we may never reach the point of sending 
a CM REQ.  It seems best to try to detect this problem as early as possible.

If a CM REQ actually gets to the remote side, it could be rejected as an 
invalid GID, and the client could retry the request.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to