Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

Steve Wise Thu, 02 Aug 2007 07:22:27 -0700

Sean Hefty wrote:

Consider NFS and NFS-RDMA. The NFS gurus struggled with this very issue and concluded that the RDMA service needs to be on a separate port. Thus they are proposing a new netid/port number for doing RDMA mounts vs TCP/UDP mounts. IMO that is the correct way to go: RDMA services are different than TCP services. They use a different protocol on top of TCP and thus shouldn't be handled on the same TCP port. So, applications that want to service Sockets and RDMA services concurrently would do so by listening on different ports...
This is a good point, and a different view from what I've been taking. I was looking at it more like trying to provide the same service over UDP and TCP, where you use the same port number. I just can't come up with any solution that works for iWarp, and sharing the port space seems like the only way to fix things.
The iWARP protocols don't include a UDP based service, so it is not needed. But if you're calling it a UDP port space, maybe it should be the host's port space?
I think it should match what's done for TCP. IMO, there should be a connectionless RDMA service, along with multicast, over UDP/IP/Ethernet. :)

I think the winner would really be a reliable connectionless RDMA service with mcast.

Yes. The only exported interfaces into the host port allocation stuff require a socket struct. I didn't want to try and tackle exporting the port allocation services at a lower level. Even at the bottom level, I think it still assumes a socket struct...
I looked at this too at one point, and gave up as well. I don't know what other assumptions are made in the stack as a result of this. For example, if an app binds to an IP and port, and the IP address is removed and re-added, is the port still valid/reserved?

I just tried this and I believe the application is still listening/bound even though the address is no longer valid for the host:


[EMAIL PROTECTED] ~]# ifconfig eth1
eth1      Link encap:Ethernet  HWaddr 00:E0:81:33:67:D1
          BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
          Interrupt:29

[EMAIL PROTECTED] ~]# netserver -L 192.168.69.135 -p 2222 -4
Starting netserver at port 2222

set_up_server could not establish a listen endpoint for port 2222 with family AF_INET

[EMAIL PROTECTED] ~]# ifconfig eth1 192.168.69.135 up
[EMAIL PROTECTED] ~]# netserver -L 192.168.69.135 -p 2222 -4
Starting netserver at port 2222
Starting netserver at hostname 192.168.69.135 port 2222 and family AF_INET
[EMAIL PROTECTED] ~]# netstat -an|grep 2222

tcp 0 0 192.168.69.135:2222 0.0.0.0:* LISTEN

[EMAIL PROTECTED] ~]# ifconfig eth1 0.0.0.0
[EMAIL PROTECTED] ~]# netstat -an|grep 2222

tcp 0 0 192.168.69.135:2222 0.0.0.0:* LISTEN

[EMAIL PROTECTED] ~]# ifconfig eth1
eth1      Link encap:Ethernet  HWaddr 00:E0:81:33:67:D1
          inet6 addr: fe80::2e0:81ff:fe33:67d1/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:2 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 b)  TX bytes:176 (176.0 b)
          Interrupt:29

[EMAIL PROTECTED] ~]#

For iWarp, is using a struct socket essentially any different than transitioning an existing socket to RDMA mode?

In the RFC patch I posted, the socket is _just_ to allow binding to a port/addr. It's not used for anything else. From the native stack's perspective, it's a TCP socket in the CLOSED state (but bound), I guess.
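
As a rough userspace analogy only (this is not the kernel code from the RFC patch), the idea of holding a port with a socket that is bound but never listens or connects can be sketched in Python. The helper names reserve_tcp_port and port_is_taken are hypothetical, and the loopback address is just an assumption for the demonstration:

```python
import errno
import socket


def reserve_tcp_port(addr="127.0.0.1", port=0):
    """Bind a TCP socket without calling listen() or connect().

    The socket stays in the CLOSED state but keeps the addr/port
    reserved in the host TCP port space until it is closed.
    port=0 asks the host stack to pick a free port.
    """
    holder = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    holder.bind((addr, port))
    return holder, holder.getsockname()[1]


def port_is_taken(addr, port):
    """Return True if a fresh TCP socket cannot bind to addr/port."""
    probe = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        probe.bind((addr, port))
    except OSError as exc:
        if exc.errno == errno.EADDRINUSE:
            return True
        raise  # some other failure, e.g. address not configured
    else:
        return False
    finally:
        probe.close()


if __name__ == "__main__":
    holder, port = reserve_tcp_port()
    print("reserved", port, "taken:", port_is_taken("127.0.0.1", port))
    holder.close()
    print("released", port, "taken:", port_is_taken("127.0.0.1", port))
```

The holder socket never carries traffic; its only job is to make the host stack refuse the same addr/port to anyone else.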

You're just requiring it to be in a specific state. Are there problems around doing this? How much harder (technically, as opposed to politically) would it be to take this change a step farther and offload an active connection?


By active, do you mean in the ESTABLISHED state?

I left it all in to show the minimal changes needed to implement the functionality, and to keep the patch simple for initial consumption. But yes, the rdma-cm really doesn't need to track the port stuff for TCP since the host stack does.
Okay - for final patches, I think we want to remove the rdma_cm specific port spaces, along with changing the API to clarify that it uses the same port space as TCP/UDP.


What do you mean by changing the API? Adding a new port space enum?

I haven't looked in detail at the SDP code, but I would think it should want the TCP port space and not its own anyway, but I'm not sure. What is the point of the SDP port space anyway?
The rdma_cm needs to adjust its protocol for SDP over IB. I'm not too concerned with SDP, since it's not upstream yet, but I don't want to break it beyond repair either. The rdma_cm may not need to manage the SDP port space at all, and instead rely on SDP to ensure that it provides unique port numbers by itself.
- Sean

