Re: [openib-general] ib0 shows MAC address as 00-00-00.... is it normal??

2007-02-27 Thread Hal Rosenstock
On Tue, 2007-02-27 at 06:30, Bala wrote: Hi All, We have built and installed OFED-1.1 on a RHEL-4 machine. Using ipoib we set the IPs for the interfaces and are able to ping each other, but my ifconfig shows the ib0 MAC address as shown below 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00

Re: [openib-general] [RFC] [PATCH] ib_cache: do not mask upper bit when searching for a pkey

2007-02-27 Thread Hal Rosenstock
On Tue, 2007-02-27 at 12:06, Sean Hefty wrote: Sorry for jumping into that thread, but although this patch will make things more spec compliant, it will break functionality we depend on. I suggest that we first find an alternate way to enable usage of partial partition membership before

Re: [openib-general] [PATCH] osm: trivial data type change to remove compilation warning

2007-02-27 Thread Hal Rosenstock
On Mon, 2007-02-26 at 06:20, Yevgeny Kliteynik wrote: Hi Hal Trivial data type change to remove compilation warning. Please apply to the trunk and to the 1.2 branch. Thanks. Signed-off-by: Yevgeny Kliteynik [EMAIL PROTECTED] Thanks. Applied (to both master and ofed_1_2). -- Hal

Re: [openib-general] [PATCH] osm: Flushing log file after OSM_SYS_LOG message

2007-02-26 Thread Hal Rosenstock
On Sun, 2007-02-25 at 09:23, Yevgeny Kliteynik wrote: Hi Hal, OSM log should be flushed when OSM_SYS_LOG message is printed. We had this once, but somehow it has disappeared. This fix has to go both to trunk and to 1.2. Thanks, --Yevgeny Signed-off-by: Yevgeny Kliteynik [EMAIL

Re: [openib-general] [PATCH] opensm: updn performance improvements

2007-02-26 Thread Hal Rosenstock
On Sat, 2007-02-24 at 15:13, Sasha Khapyorsky wrote: There are various performance improvements for up/down routing engine: - updn_node object which is referenced by switch's priv pointer - ranking for switches only - replace time consuming cl_list by cl_qlist - reuse already collected

Re: [openib-general] ipoib the partial pkey

2007-02-26 Thread Hal Rosenstock
On Sun, 2007-02-25 at 05:48, Or Gerlitz wrote: Sean Hefty wrote: I looked into this more... RFC 4391 states (middle of page 5): For a node to join a partition, one of its ports must be assigned the relevant P_Key by the SM [RFC4392]. Jumping to RFC 4392 (top of page 4): Just to

Re: [openib-general] ipoib the partial pkey

2007-02-26 Thread Hal Rosenstock
On Mon, 2007-02-26 at 10:37, Or Gerlitz wrote: Hal Rosenstock wrote: On Sun, 2007-02-25 at 05:48, Or Gerlitz wrote: Just to have us agree on the quote, it is from section 4 of rfc 4392 (page 14) eg in http://www.ietf.org/rfc/rfc4392.txt at the time of creating an IB multicast group

Re: [openib-general] opensm issue

2007-02-26 Thread Hal Rosenstock
Hi Ashish, On Mon, 2007-02-26 at 16:04, Batwara, Ashish wrote: Hi, I am trying to bring up opensm, but it is not letting me. When I look at /var/log/messages, I see that it comes UP for a moment and then goes down again. Look for SUBNET UP in the logs below. Does anyone know what the

Re: [openib-general] [PATCH] opensm: faster min hops

2007-02-26 Thread Hal Rosenstock
On Sun, 2007-02-25 at 16:48, Sasha Khapyorsky wrote: After gprof output analyzing, I noticed that current lmx (switch's lid matrix) implementation is extremely slow. This simple hops matrix reimplementation makes lid matrices build process two times faster. Excellent! Signed-off-by: Sasha

Re: [openib-general] ib0 interface up but can't ping

2007-02-24 Thread Hal Rosenstock
On Sat, 2007-02-24 at 09:33, Bala wrote: Hi All, we are trying to set up IB on an HP c-class blade system (x86_64) having a 4X DDR IB Mezzanine HCA with CentOS-4. 1. Downloaded ofed-1.1 2. built the RPMS and installed on two machines 3. set the IP in ifcfg-ib0 and rebooted 4. can

Re: [openib-general] ipoib the partial pkey, was: librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-23 Thread Hal Rosenstock
On Thu, 2007-02-22 at 18:35, Sean Hefty wrote: Doesn't this allow ipoib to join a multicast group for which it may not be able to communicate with all members? For the broadcast group, this seems like an error to me. Can ipoib work in such a configuration? If all nodes were assigned a

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-23 Thread Hal Rosenstock
On Thu, 2007-02-22 at 17:18, Sean Hefty wrote: Can someone help my understanding here? Is ipoib joining a multicast group using the full membership PKey, even if the node that it joins from only has the limited membership PKey configured? And the code in ib_find_cached_pkey helps enable

Re: [openib-general] [ewg] anyone have OFED 1.2 alpha1 compiling on ppc64

2007-02-22 Thread Hal Rosenstock
On Thu, 2007-02-22 at 02:11, Scott Weitzenkamp (sweitzen) wrote: I tried both RHEL4 and SLES10 using install.sh, and get this. I filed bug 379, anyone else tried ppc64? gcc -DHAVE_CONFIG_H -I. -I. -I. -I./include/infiniband -I./../libibcommon/include/infiniband -Wall -m64 -g -O2 -MT

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-22 Thread Hal Rosenstock
On Thu, 2007-02-22 at 02:28, Or Gerlitz wrote: Hal Rosenstock wrote: On Wed, 2007-02-21 at 15:45, Or Gerlitz wrote: On 21 Feb 2007 08:20:23 -0500, Hal Rosenstock [EMAIL PROTECTED] wrote: If the IPoIB spec does not allow both partial and full members of a partition to share a broadcast

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-22 Thread Hal Rosenstock
On Thu, 2007-02-22 at 03:04, Or Gerlitz wrote: Sean Hefty wrote: Note that since the HCA validates the pkey in the incoming packet, no matter what the IB SW would do, partial members of a partition can't talk to each other. So the approach taken by the core/ipoib code was to just ignore

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 01:43, Or Gerlitz wrote: Hal Rosenstock wrote: On Tue, 2007-02-20 at 10:38, Or Gerlitz wrote: Yes. It's a little bit confusing: partial and full members of an IPoIB IB partition use the same MGID. When an IPoIB MGID is constructed, the pkey placed by the driver

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 07:35, Or Gerlitz wrote: However, no matter what the SM configures, the core ipoib code acts as if the full pkey is there. This is a nice simplification and it works well. I believe it is a spec (compliance) violation for the port to be a partial member and join as a

Re: [openib-general] [ofw] [Fwd: Re: [Fwd: Re: winrelated[was:Re:[PATCH 1/2] opensm: sigusr1: syslog() fixes]]]

2007-02-21 Thread Hal Rosenstock
Rosenstock Cc: ofw@lists.openfabrics.org; Gilad Shainer; OPENIB Subject: RE: [ofw] [Fwd: Re: [openib-general] [Fwd: Re: winrelated[was:Re:[PATCH 1/2] opensm: sigusr1: syslog() fixes]]] -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Hal Rosenstock

Re: [openib-general] [ofw] [Fwd: Re: [Fwd: Re: winrelated[was:Re:[PATCH 1/2] opensm: sigusr1: syslog() fixes]]]

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 09:31, Hal Rosenstock wrote: Tzachi, On Wed, 2007-02-21 at 03:47, Tzachi Dar wrote: OK, Hal let's try to close this. Thanks. The windows openib project was agreed by everyone to be BSD only. The fact that it is BSD means that any partner (or non partner

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 15:45, Or Gerlitz wrote: On 21 Feb 2007 08:20:23 -0500, Hal Rosenstock [EMAIL PROTECTED] wrote: On Wed, 2007-02-21 at 07:35, Or Gerlitz wrote: I believe it is a spec (compliance) violation for the port to be a partial member and join as a full member. Since

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 17:36, Sean Hefty wrote: It does this since it makes life simple and robust. Is an SM prevented from loading two PKeys into an HCA's PKey table that differ by only the membership bit? Nope. I can't think of any reason to do such a thing, Me neither. It would be

Re: [openib-general] GetTable path record query not returningDGID=SGID paths

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 18:05, Sean Hefty wrote: We haven't looked into this in more detail yet. This was our observation while testing on a larger (64 node) cluster this morning that we don't have access to at the moment. With the local SA cache running, we were surprised to see any

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-21 Thread Hal Rosenstock
On Wed, 2007-02-21 at 17:53, Hal Rosenstock wrote: On Wed, 2007-02-21 at 17:36, Sean Hefty wrote: It does this since it makes life simple and robust. Is an SM prevented from loading two PKeys into an HCA's PKey table that differ by only the membership bit? Nope. I can't

Re: [openib-general] Port error rate detection

2007-02-20 Thread Hal Rosenstock
On Mon, 2007-02-19 at 15:53, Steven Carter wrote: I have a Nagios module that alerts on connectivity, port errors, speed/width problems. I would like to give it the ability to change the severity of the alert depending on whether errors are just present or if they are increasing faster
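
The distinction drawn above, between errors that are merely present and errors that are increasing quickly, could be sketched roughly as follows. This is only an illustration and not the Nagios module discussed in the thread: the function name `classify_errors`, the severity levels, and the idea of comparing two samples of a single error counter against a caller-chosen rate threshold are all assumptions made for the example.

```c
#include <stdint.h>

/* Hypothetical severity levels for illustration only. */
enum severity { SEV_OK, SEV_WARNING, SEV_CRITICAL };

/*
 * Classify one error counter from two samples taken interval_s seconds
 * apart. max_rate is a caller-chosen threshold in errors per second.
 *  - no errors at all                 -> SEV_OK
 *  - errors present, rate unusable
 *    (bad interval or counter cleared) -> SEV_WARNING
 *  - errors present, rate <= max_rate -> SEV_WARNING
 *  - errors growing faster than that  -> SEV_CRITICAL
 */
static enum severity classify_errors(uint32_t prev, uint32_t curr,
                                     double interval_s, double max_rate)
{
    if (curr == 0)
        return SEV_OK;
    if (interval_s <= 0.0 || curr < prev)
        return SEV_WARNING;

    double rate = (double)(curr - prev) / interval_s;
    return rate > max_rate ? SEV_CRITICAL : SEV_WARNING;
}
```

With a threshold of one error per second, two samples of 5 and 125 taken 60 seconds apart would be reported as critical, while an unchanged count of 5 would stay a warning.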

Re: [openib-general] [PATCH] osm_vendor_ibumad: termination crash fix

2007-02-20 Thread Hal Rosenstock
On Mon, 2007-02-19 at 16:46, Sasha Khapyorsky wrote: When OpenSM is terminated the umad_receiver thread is still running even after the structures are destroyed and freed; this causes random (but easily reproducible) crashes. The reason is that osm_vendor_delete() does not care about thread

Re: [openib-general] Port error rate detection

2007-02-20 Thread Hal Rosenstock
On Tue, 2007-02-20 at 09:44, Steven Carter wrote: Hal Rosenstock wrote: On Mon, 2007-02-19 at 15:53, Steven Carter wrote: I have a Nagios module that alerts on connectivity, port errors, speed/width problems. I would like to give it the ability to change the severity of the alert

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-20 Thread Hal Rosenstock
On Mon, 2007-02-19 at 01:40, Or Gerlitz wrote: Hi Sean, this fixes a bug which did not allow to run librdmacm apps over a node which is partial member of a partition. The patch takes the approach of the kernel ib_find_cached_pkey implementation. If you approve this, i suggest pushing it

Re: [openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey

2007-02-20 Thread Hal Rosenstock
On Tue, 2007-02-20 at 10:38, Or Gerlitz wrote: Hal Rosenstock wrote: The pkey extracted by the RDMA CM from the IPoIB device hardware address always has the full membership bit set. However, when looking in the pkey table the search must mask out the full membership bit
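
A minimal sketch of the lookup described above, assuming the usual P_Key layout in which the top bit of the 16-bit value is the membership bit and the low 15 bits identify the partition. The function name `find_pkey_index` and the plain array used as a table are hypothetical; this is not the librdmacm patch or the kernel's ib_find_cached_pkey, only an illustration of masking out the membership bit during the search.

```c
#include <stddef.h>
#include <stdint.h>

#define PKEY_MEMBERSHIP_BIT 0x8000u /* set = full member, clear = limited */
#define PKEY_BASE_MASK      0x7fffu /* low 15 bits identify the partition */

/*
 * Return the index of the first table entry whose 15-bit base matches
 * pkey, ignoring the membership bit on both sides, or -1 if no entry
 * matches.
 */
static int find_pkey_index(const uint16_t *table, size_t len, uint16_t pkey)
{
    for (size_t i = 0; i < len; i++) {
        if ((table[i] & PKEY_BASE_MASK) == (pkey & PKEY_BASE_MASK))
            return (int)i;
    }
    return -1;
}
```

With this comparison, a search for 0x8001 (full membership bit set) finds a table entry of 0x0001 (limited member of the same partition), which is the case the thread is concerned with.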

Re: [openib-general] [Fwd: Re: win related [was: Re: [PATCH 1/2] opensm: sigusr1: syslog() fixes]]

2007-02-20 Thread Hal Rosenstock
Cc: Yossi Leybovich; Gilad Shainer; Yevgeny Kliteynik; OPENIB; Michael S. Tsirkin; Hal Rosenstock Subject: Re: [Fwd: Re: win related [was: Re: [PATCH 1/2] opensm: sigusr1: syslog() fixes]] On 20:31 Thu 08 Feb , Tzachi Dar wrote: The windows open IB has decided on using a BSD

Re: [openib-general] Port error rate detection

2007-02-20 Thread Hal Rosenstock
On Tue, 2007-02-20 at 10:25, Steven Carter wrote: Hal Rosenstock wrote: On Tue, 2007-02-20 at 09:44, Steven Carter wrote: Hal Rosenstock wrote: On Mon, 2007-02-19 at 15:53, Steven Carter wrote: I have a Nagios module that alerts on connectivity, port errors

[openib-general] [Fwd: Re: [Fwd: Re: win related [was: Re: [PATCH 1/2] opensm: sigusr1: syslog() fixes]]]

2007-02-20 Thread Hal Rosenstock
Also, looping in the OpenFabrics Windows email list on this. -- Hal -Forwarded Message- From: Hal Rosenstock [EMAIL PROTECTED] To: Tzachi Dar [EMAIL PROTECTED] Cc: OPENIB openib-general@openib.org, Gilad Shainer [EMAIL PROTECTED] Subject: Re: [openib-general] [Fwd: Re: win related

Re: [openib-general] Unknown SMP Recv

2007-02-17 Thread Hal Rosenstock
On Wed, 2007-02-14 at 21:12, Michael Arndt wrote: Hi, what I forgot was that the write function in umad_send returns with -1 if the error occurs. That looks like EPERM. Not sure why write would return this. The only thing I see that might return this is handle_outgoing_dr_smp on some

Re: [openib-general] IPv6oIB neighbour discover broken when MCGs overflow

2007-02-16 Thread Hal Rosenstock
On Fri, 2007-02-16 at 12:00, Roland Dreier wrote: We have a customer issue regarding IPv6oIB. In the subnet, there is a limited number of MCGs supported. So when multiple IPv6 addresses are assigned to one interface, each IPv6 address will have one unique solicited-node
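
Assuming the truncated sentence above refers to the solicited-node multicast address defined in RFC 4291 (ff02::1:ffXX:XXXX, where XX:XXXX is the low 24 bits of the unicast address), the mapping can be sketched as below. The helper name `solicited_node_addr` is made up for the example; it only shows why addresses that differ in their low 24 bits each need their own multicast group.

```c
#include <stdint.h>
#include <string.h>

/*
 * Build the IPv6 solicited-node multicast address for a unicast
 * address: the fixed 104-bit prefix ff02:0:0:0:0:1:ff00::/104 followed
 * by the low 24 bits of the unicast address. Both arguments are
 * 16-byte addresses in network byte order.
 */
static void solicited_node_addr(const uint8_t addr[16], uint8_t out[16])
{
    static const uint8_t prefix[13] = {
        0xff, 0x02, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0x01, 0xff
    };

    memcpy(out, prefix, sizeof(prefix)); /* bytes 0..12: fixed prefix   */
    memcpy(out + 13, addr + 13, 3);      /* bytes 13..15: low 24 bits   */
}
```

Two addresses on one interface share a solicited-node group only if their low 24 bits happen to match; otherwise each one adds another group to join.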

Re: [openib-general] IPv6oIB neighbour discover broken when MCGs overflow

2007-02-16 Thread Hal Rosenstock
On Fri, 2007-02-16 at 12:47, Roland Dreier wrote: I thought that mapping multiple MCGs to the same MLID requires that a set of the (group) parameters are the same. Is that the case for these IPv6 groups ? Is the only variable in those parameters the PKey ? I don't see why any group

Re: [openib-general] IPv6oIB neighbour discover broken when MCGs overflow

2007-02-16 Thread Hal Rosenstock
On Fri, 2007-02-16 at 13:07, Roland Dreier wrote: For starters, I think that rate, MTU, and SL (and maybe PKey too) need to be the same. There may be others too if I stare at the spec for a while... Can you expand on why? For example I definitely can send to the same MLID with

Re: [openib-general] IPv6oIB neighbour discover broken when MCGs overflow

2007-02-16 Thread Hal Rosenstock
On Fri, 2007-02-16 at 16:31, Roland Dreier wrote: Sure but I think this complicates the SL2VL tables in the subnet to accommodate this. I think a similar thing is true for PKeys. So to me this is an SM complexity issue when mapping multiple MGRPs to same MLID. I'm still confused.

Re: [openib-general] bad port physstate

2007-02-15 Thread Hal Rosenstock
On Thu, 2007-02-15 at 10:53, yipeeyipeeyipeeyipee wrote: Hi, It seems like I've stumbled into some sort of bug in the port info mad query. I have several pc's connected to an IB switch. On one of the machines I have an OpenIB installation, and on one pc I continuously run a management

Re: [openib-general] IB diagnostic tool : ibping

2007-02-15 Thread Hal Rosenstock
. the first port that is ACTIVE. .PP 2. if not found, the first port that is UP (physical link up). If a port and/or CA name is specified, the user request is attempted to be fulfilled, and will fail if it is not possible. .SH AUTHOR .TP Hal Rosenstock .RI [EMAIL PROTECTED

Re: [openib-general] sl2vl tables

2007-02-15 Thread Hal Rosenstock
On Thu, 2007-02-15 at 17:08, Dale Purdy wrote: We are experimenting with OFED 1.2 (alpha1) and have dumped the SL2VL tables for both a switch port and an HCA port using the smpquery command: switch: # SL2VL table: Lid 103 # SL: | 0| 1| 2| 3| 4| 5| 6| 7| 8|

Re: [openib-general] sl2vl tables

2007-02-15 Thread Hal Rosenstock
. You can run with -Q or set no_qos to FALSE in the opensm.opts file. I wonder whether we should tie LASH to this so this isn't needed. -- Hal Dale On Thu, 15 Feb 2007, Hal Rosenstock wrote: On Thu, 2007-02-15 at 17:08, Dale Purdy wrote: We are experimenting with OFED 1.2 (alpha1

Re: [openib-general] SM assigned GID addresses

2007-02-14 Thread Hal Rosenstock
Hi, On Wed, 2007-02-14 at 11:12, Batwara, Ashish wrote: Hi, I am referring to Section 4.1.1 of the IB Spec, which talks about GID Usage AND Properties. Does anyone know whether or not the SM uses item # 3 below for the address assignment, and which vendors support # 3? Can anybody point me

Re: [openib-general] SM assigned GID addresses

2007-02-14 Thread Hal Rosenstock
Message- From: Hal Rosenstock [mailto:[EMAIL PROTECTED] Sent: Wednesday, February 14, 2007 10:41 AM To: Batwara, Ashish Cc: openib-general@openib.org Subject: Re: [openib-general] SM assigned GID addresses Hi, On Wed, 2007-02-14 at 11:12, Batwara, Ashish wrote: Hi, I am referring

Re: [openib-general] GetTable path record query not returning DGID=SGID paths

2007-02-14 Thread Hal Rosenstock
On Wed, 2007-02-14 at 16:45, Sean Hefty wrote: We're seeing a situation where it appears that the response to a GetTable path record query is not returning paths where the DGID is the same as the SGID. Is this OpenSM or a vendor SM ? The query is setting the SGID and number of paths. Yes,

Re: [openib-general] GetTable path record query not returning DGID=SGID paths

2007-02-14 Thread Hal Rosenstock
On Wed, 2007-02-14 at 18:01, Sean Hefty wrote: We're seeing a situation where it appears that the response to a GetTable path record query is not returning paths where the DGID is the same as the SGID. Is this OpenSM or a vendor SM ? This is with opensm. When we're running with the

Re: [openib-general] GetTable path record query not returningDGID=SGID paths

2007-02-14 Thread Hal Rosenstock
On Wed, 2007-02-14 at 18:42, Sean Hefty wrote: What is the value of NumbPath and how large a subnet is this ? I'm pretty sure this works; at least it did the last I checked. By default, NumbPath should be 127, but I would have expected a path record even with it set to 1. Yes, you should

Re: [openib-general] OSM QoS policy file

2007-02-13 Thread Hal Rosenstock
Hi Yevgeny, Sorry for the slow response; I've been consumed getting ready for OFED 1.2 alpha. On Mon, 2007-02-05 at 07:37, Yevgeny Kliteynik wrote: Hi Hal. I added osm/doc/qos-policy.txt file with the description of the QoS policy file, and an example of such file (with more comments

Re: [openib-general] Problem with install.sh openib-diags OFED-1.2-20070208-1508.tgz

2007-02-12 Thread Hal Rosenstock
On Sun, 2007-02-11 at 13:44, Scott Weitzenkamp (sweitzen) wrote: I'm using install.sh on RHEL4 U3 x86_64 Preparing... ## kernel-ib-devel ## kernel-ib

Re: [openib-general] OFED 1.2 components list - for the meeting today

2007-02-12 Thread Hal Rosenstock
On Mon, 2007-02-12 at 11:42, Tziporet Koren wrote: This is the full OFED 1.2 components list that we will review in the meeting Tziporet # Kernel ib_verbs (core) ib_mthca ib_ipoib ib_ipath - currently works on 2.6.20 only. Backport patches cannot be applied ib_iser ib_sdp ib_srp

Re: [openib-general] OFED 1.2 components list - for the meeting today

2007-02-12 Thread Hal Rosenstock
On Mon, 2007-02-12 at 12:58, Woodruff, Robert J wrote: BTW. Is the ibdiagui code going to be part of this release. I did not see it in the list below or is it just part of the openib-diags ? It's part of ibutils. -- Hal I thought that we discussed this as an OFED 1.2 feature. I have

Re: [openib-general] patches to 2.6.19.1 kernel for switch Operation

2007-02-12 Thread Hal Rosenstock
Suri, On Mon, 2007-02-12 at 09:27, Suresh Shelvapille wrote: Hal: Ref: comment on mad.c (ib_mad_recv_done_handler(). Even if I make the relevant changes to smi.c functions how do I get the packet to get forwarded, without making additional changes in this function? Meaning,

Re: [openib-general] Unknown SMP Recv

2007-02-10 Thread Hal Rosenstock
On Fri, 2007-02-09 at 15:19, Michael Arndt wrote: Hi, It is strange, I did similar thing (you can see in management/diags/src/mcm_rereg_test.c) and it worked fine for me. What location is that? Which libibumad version you are using? Also I understand you did some changes in the

Re: [openib-general] Problem is routing CM REQ

2007-02-09 Thread Hal Rosenstock
On Thu, 2007-02-08 at 18:43, Sean Hefty wrote: Looking at the problem more, I think that the issue extends to the remote port LID as well. My expectation with a local path record query is that the SLID is the local port, and the DLID is the local router. This should be

Re: [openib-general] Problem is routing CM REQ

2007-02-09 Thread Hal Rosenstock
On Thu, 2007-02-08 at 23:37, Jason Gunthorpe wrote: On Thu, Feb 08, 2007 at 03:43:24PM -0800, Sean Hefty wrote: Looking at the problem more, I think that the issue extends to the remote port LID as well. My expectation with a local path record query is that the SLID is the

Re: [openib-general] [PATCH] IPOIB: Use a GRH when appropriate forunicast packets

2007-02-09 Thread Hal Rosenstock
phone: 781-768-5300 -Original Message- From: Hal Rosenstock [mailto:[EMAIL PROTECTED] Sent: Wednesday, February 07, 2007 7:20 PM To: Sean Hefty Cc: Jason Gunthorpe; Roland Dreier; openib-general@openib.org; Kanevsky, Arkady Subject: Re: [openib-general] [PATCH] IPOIB

Re: [openib-general] [PATCH] OpenSM/osm_ucast_lash.c: In osm_get_lash_sl, fix SL when CA ports on same switch

2007-02-09 Thread Hal Rosenstock
, Hal Rosenstock wrote: OpenSM/osm_ucast_lash.c: In osm_get_lash_sl, fix SL when CA ports on same switch This change resolves an issue with strange SL assignment when two HCAs communicate with other and are on the same switch. Since LASH is switch to switch routing, the get_lash_sl

Re: [openib-general] Problem is routing CM REQ

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 12:22, Sean Hefty wrote: SLID corresponding to SGID and a DLID for some IB router on the subnet which can route to the remote DGID. This was my assumption as well. An SM is free to choose SLID and DLID to supply to if there are multiple LIDs for the ports in

Re: [openib-general] Unknown SMP Recv

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 12:14, Michael Arndt wrote: Hi, umad_send takes the timeout in msec. 100 msec is too short. Try something on the order of seconds. Note also that negative 'timeout_ms' value makes the kernel wait for the reply forever. I have tried many values, but sooner or

Re: [openib-general] Unknown SMP Recv

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 13:38, Michael Arndt wrote: Hi, I have no clue; I don't really understand what you have changed so it is hard to know. For example: if I send ten SMPs like: for (i=0;i<10;i++){ umad_send(portid, agentid, msg, len, timeout, repeats); }

Re: [openib-general] Problem is routing CM REQ

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 14:20, Jason Gunthorpe wrote: On Fri, Feb 09, 2007 at 12:58:51PM -0500, Hal Rosenstock wrote: For simplicity, assume a single path. My assumption in this case was that the SLID/DLID values would be reversed. That is, the LIDs are relative to the local

Re: [openib-general] Unknown SMP Recv

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 15:19, Michael Arndt wrote: Hi, It is strange, I did similar thing (you can see in management/diags/src/mcm_rereg_test.c) and it worked fine for me. What location is that? Which libibumad version you are using? Also I understand you did some changes in the

Re: [openib-general] Problem is routing CM REQ

2007-02-09 Thread Hal Rosenstock
On Fri, 2007-02-09 at 15:34, Sean Hefty wrote: the /missing part (right now) is locating the SA on that remote subnet if this is a needed function. Maybe we can expose this to SA clients through a ServiceRecord? That might be one way if there were a standardized service name for SA and

[openib-general] [PATCH] OpenSM/osm_ucast_lash.c: In osm_get_lash_sl, fix SL when CA ports on same switch

2007-02-08 Thread Hal Rosenstock
(the value assigned to the variable NONE) to be a uint8_t when asked for an SL assignment in this case. This change resolves this issue. Signed-off-by: Thomas Sødring [EMAIL PROTECTED] Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] diff --git a/osm/opensm/osm_ucast_lash.c b/osm/opensm

Re: [openib-general] Problem is routing CM REQ was: Use a GRH when appropriate for unicast packets

2007-02-08 Thread Hal Rosenstock
On Thu, 2007-02-08 at 14:54, Sean Hefty wrote: Hum, you mean to meet the LID validation rules of 9.6.1.5? That is a huge PITA.. [IMHO, 9.6.1.5 C9-54 is a mistake, if there is a GRH then the LRH.SLID should not be validated against the QP context since it makes it extra hard for multipath

Re: [openib-general] Problem is routing CM REQ was: Use a GRH when appropriate for unicast packets

2007-02-08 Thread Hal Rosenstock
On Thu, 2007-02-08 at 17:02, Sean Hefty wrote: This requires that the passive side be able to issue path record queries, but I think that it could work for static routes. A point was made to me that the remote side could be a TCA without query capabilities. Are you referring to SA query

Re: [openib-general] Unknown SMP Recv

2007-02-08 Thread Hal Rosenstock
On Thu, 2007-02-08 at 19:39, Michael Arndt wrote: Hi, I think I have found the problem. It is the timeout parameter on the umad_send function. How exactly do I have to handle this parameter? It seems that it should be zero if there is no response expected. But what value should it be

Re: [openib-general] [PATCH] IPOIB: Use a GRH when appropriate for unicast packets

2007-02-07 Thread Hal Rosenstock
On Wed, 2007-02-07 at 15:24, Sean Hefty wrote: I didn't get too far on getting CMA to work. Beyond the bad HopLimit field I was seeing Hal pointed out a number of problems in IBA that would prevent it from working as is : I've started thinking about what it would take to get the rdma cm

Re: [openib-general] [PATCH] IPOIB: Use a GRH when appropriate for unicast packets

2007-02-07 Thread Hal Rosenstock
On Wed, 2007-02-07 at 16:31, Jason Gunthorpe wrote: On Wed, Feb 07, 2007 at 12:24:08PM -0800, Sean Hefty wrote: I didn't get too far on getting CMA to work. Beyond the bad HopLimit field I was seeing Hal pointed out a number of problems in IBA that would prevent it from working as is :

Re: [openib-general] [PATCH] IPOIB: Use a GRH when appropriate for unicast packets

2007-02-07 Thread Hal Rosenstock
On Wed, 2007-02-07 at 17:49, Jason Gunthorpe wrote: On Wed, Feb 07, 2007 at 02:40:51PM -0800, Sean Hefty wrote: Are you referring to the SLID in the CM REQ? If so, I've been looking at this issue as well. I simply cannot think of any way to come up with this LID, and my current solution

Re: [openib-general] openib diags installation issue

2007-02-06 Thread Hal Rosenstock
Hi Vlad, On Tue, 2007-02-06 at 03:19, Vladimir Sokolovsky wrote: Hi Hal, Please merge the following commit to the ofed_1_2 branch of the management.git: commit 6c819523a6a58e2ac4948327f256e49984dce9fb Diags/Makefile.am: Fix for executing 'make DESTDIR=/var/tmp/OFED install'

Re: [openib-general] QoS in opensm will not be part of OFED 1.2

2007-02-06 Thread Hal Rosenstock
On Tue, 2007-02-06 at 08:58, Michael S. Tsirkin wrote: Quoting Michael S. Tsirkin [EMAIL PROTECTED]: Subject: Re: QoS in opensm will not be part of OFED 1.2 I had an AI to check the QoS status with OSM. Conclusions are that QoS support in OpenSM will not be part of OFED

[openib-general] [PATCH 1/2] OpenSM: Add a printable node description to osm_node_t

2007-02-06 Thread Hal Rosenstock
OpenSM: Add a printable node description to osm_node_t Also, convert memcpy's to use this rather than temporary one Signed-off-by: Ira K. Weiny [EMAIL PROTECTED] Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] diff --git a/osm/include/opensm/osm_node.h b/osm/include/opensm/osm_node.h index

Re: [openib-general] Unknown SMP Recv

2007-02-06 Thread Hal Rosenstock
On Tue, 2007-02-06 at 16:14, Michael Arndt wrote: Sorry, there was a little mistake. The original packet has the initial path [0][1][1], return path [0][2][2], hop count and hop pointer are 2 (SubnGetResp), the Dr_DLID and DrSLID are permissive. Is this the response ? If so, what's the

Re: [openib-general] Unknown SMP Recv

2007-02-06 Thread Hal Rosenstock
On Tue, 2007-02-06 at 16:14, Michael Arndt wrote: Sorry, there was a little mistake. I think I understand what you are saying now. The below are the 2 responses you get. The original packet has the initial path [0][1][1], return path [0][2][2], hop count and hop pointer are 2 (SubnGetResp),

Re: [openib-general] QoS in opensm will not be part of OFED 1.2

2007-02-05 Thread Hal Rosenstock
Hi Tziporet, On Mon, 2007-02-05 at 07:04, Tziporet Koren wrote: Hi Hal, I had an AI to check the QoS status with OSM. Conclusions are that QoS support in OpenSM will not be part of OFED 1.2 (I updated the plan on the Wiki) The reasons for this are: 1. Code not ready at code freeze. 2.

Re: [openib-general] Unknown SMP Recv

2007-02-05 Thread Hal Rosenstock
On Mon, 2007-02-05 at 10:18, Michael Arndt wrote: Hi, I have changed the driver (smi) a little and have written a tool like a router or a bridge. It receives directed route SMPs on one port and sends them to another port. I use 3 nodes (sender on node 1, the router on node 2, normal node

Re: [openib-general] QoS in opensm will not be part of OFED 1.2

2007-02-05 Thread Hal Rosenstock
On Mon, 2007-02-05 at 10:38, Michael S. Tsirkin wrote: I had an AI to check the QoS status with OSM. Conclusions are that QoS support in OpenSM will not be part of OFED 1.2 (I updated the plan on the Wiki) The reasons for this are: 1. Code not ready at code freeze. 2. There

Re: [openib-general] Unknown SMP Recv

2007-02-05 Thread Hal Rosenstock
On Mon, 2007-02-05 at 11:56, Michael Arndt wrote: Hi, A router should not allow a SMP to cross a subnet boundary. SMPs are restricted to the local subnet. I work on a discovering mechanism for switchless InfiniBand Architectures like Rings, Tori or maybe Hyper-Cubes. There is just one

[openib-general] OpenIB management libraries release 1.0.2

2007-02-03 Thread Hal Rosenstock
http://www.openfabrics.org/~halr/ md5sum b9b4bdf899f1d0ff15e06915cd846a3a libibcommon-1.0.2.tar.gz 2af3ff7e38a1f49fb7514660a9991c89 libibmad-1.0.2.tar.gz 7d7690abfe9b08c8240fbf0157653b90 libibumad-1.0.2.tar.gz ___ openib-general mailing list

Re: [openib-general] dry-run mode for opensm ?

2007-02-02 Thread Hal Rosenstock
Hi Philippe, On Fri, 2007-02-02 at 05:10, Philippe Gregoire wrote: Hal Is there any way to run opensm in a dry-run mode just to make it dump the route tables it will generate ? Not that I'm aware of. We already have an embedded SM and I would like to compare the current route tables with

Re: [openib-general] components that have not opend the ofed_1_2 branch

2007-02-02 Thread Hal Rosenstock
On Thu, 2007-02-01 at 10:40, Tziporet Koren wrote: The following components have not opened ofed_1_2 branch: * libibverbs - Roland * libmthca - Roland * libipathverbs - Bryan * tvflash - Roland * srptools - Ishai * management - Hal Please open the branch today

Re: [openib-general] [PATCH] osm: some trivial chages in the osm_ucast_lash for compilation on windows

2007-02-01 Thread Hal Rosenstock
Hi Yevgeny, On Thu, 2007-02-01 at 06:48, Yevgeny Kliteynik wrote: Hi Hal, This patch has some trivial changes in the osm_ucast_lash.c for compilation on windows. In general, this file needs a major cosmetic (and not only) patch to fit better into the OSM code. There will shortly be

Re: [openib-general] [PATCH] osm: trivial casting for compilation on windows

2007-02-01 Thread Hal Rosenstock
On Thu, 2007-02-01 at 05:35, Yevgeny Kliteynik wrote: Trivial casting for compilation on windows Signed-off-by: Yevgeny Kliteynik [EMAIL PROTECTED] Thanks. Applied. -- Hal ___ openib-general mailing list openib-general@openib.org

Re: [openib-general] [PATCH 10/10] osm: QoS in OpenSM

2007-02-01 Thread Hal Rosenstock
Hi again Yevgeny, On Wed, 2007-01-31 at 17:19, Yevgeny Kliteynik wrote: [snip...] + for (i = 0; i < IB_MAX_NUM_VLS; i++) + { + if (valid_sls[i]) + { + vl = ib_slvl_table_get(p_slvl_tbl,i); + if (vl == IB_DROP_VL) Does vl Operational VLs need

Re: [openib-general] Bugzilla Bug 329: HCA_FATAL_EVENT cause to OpenSM to stop functioning

2007-01-31 Thread Hal Rosenstock
Hi Yevgeny, On Wed, 2007-01-31 at 05:16, Yevgeny Kliteynik wrote: Hi Hal. I noticed the following bug in Bugzilla: Bugzilla Bug 329: HCA_FATAL_EVENT cause to opensm to stop functioning https://bugs.openfabrics.org/show_bug.cgi?id=329 When there is a HCA fatal event

Re: [openib-general] [PATCH 10/10] osm: QoS in OpenSM

2007-01-31 Thread Hal Rosenstock
Hi Yevgeny, On Tue, 2007-01-30 at 10:33, Yevgeny Kliteynik wrote: Checking PathRecord query for QoS constraints The QoS-aware path selection logic is implemented in a separate function that is called only when QoS in OpenSM is on. It causes some code duplication, but the idea is to

[openib-general] Diags/ibtracert: Add switch-map option to ibtracert

2007-01-31 Thread Hal Rosenstock
Diags/ibtracert: Add switch-map option to ibtracert Signed-off-by: Ira K. Weiny [EMAIL PROTECTED] Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] diff --git a/diags/man/ibtracert.8 b/diags/man/ibtracert.8 index c1632ac..28f18b6 100644 --- a/diags/man/ibtracert.8 +++ b/diags/man/ibtracert.8

[openib-general] Diags/ibnetdiscover: Add switch-map option to ibnetdiscover

2007-01-31 Thread Hal Rosenstock
Diags/ibnetdiscover: Add switch-map option to ibnetdiscover Signed-off-by: Ira K. Weiny [EMAIL PROTECTED] Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] diff --git a/diags/man/ibnetdiscover.8 b/diags/man/ibnetdiscover.8 index 4960a8f..000edb5 100644 --- a/diags/man/ibnetdiscover.8 +++ b/diags

Re: [openib-general] [PATCH 0/10] osm: QoS in OpenSM

2007-01-31 Thread Hal Rosenstock
Hi Yevgeny, On Tue, 2007-01-30 at 09:51, Yevgeny Kliteynik wrote: Hi Hal. The following is a series of 10 patches: 1. QoS policy file parser Yacc file 2. QoS policy file parser Lex file 3. QoS policy file parser Yacc Lex generated files 4. QoS policy file parser header file 5. QoS

Re: [openib-general] [PATCH RFC 4 of 5] IB/mthca: QoS support

2007-01-31 Thread Hal Rosenstock
On Mon, 2007-01-22 at 09:50, Michael S. Tsirkin wrote: encode SL in sched_queue field to improve hardware QoS guarantees for connected QPs. Is UD already handled properly in terms of mthca ? -- Hal Signed-off-by: Michael S. Tsirkin [EMAIL PROTECTED] --- Index:

Re: [openib-general] [PATCH RFC 4 of 5] IB/mthca: QoS support

2007-01-31 Thread Hal Rosenstock
On Wed, 2007-01-31 at 14:09, Michael S. Tsirkin wrote: Quoting Hal Rosenstock [EMAIL PROTECTED]: Subject: Re: [PATCH RFC 4 of 5] IB/mthca: QoS support On Mon, 2007-01-22 at 09:50, Michael S. Tsirkin wrote: encode SL in sched_queue field to improve hardware QoS guarantees

Re: [openib-general] [PATCH TRIVIAL] opensm: remove unused p_subn->node_lid_tbl field

2007-01-29 Thread Hal Rosenstock
On Sun, 2007-01-28 at 19:50, Sasha Khapyorsky wrote: This removes unused node_lid_tbl field in osm_subn_t structure. Signed-off-by: Sasha Khapyorsky [EMAIL PROTECTED] Thanks. Applied. -- Hal

Re: [openib-general] [RFC] Performance Manager

2007-01-29 Thread Hal Rosenstock
On Fri, 2007-01-26 at 18:15, Sean Hefty wrote: There are numerous PerfManager models which can be supported: 1. Integrated as thread(s) with OpenSM (run only when SM is master) 2. Standby SM 3. Standalone PerfManager (not running with master or standby SM) 4. Distributed PerfManager (most

Re: [openib-general] OFED 1.2 release - to be reviewed in the meeting today

2007-01-29 Thread Hal Rosenstock
On Mon, 2007-01-29 at 09:05, Tziporet Koren wrote: Hi, This is the proposal for OFED 1.2 branching and tagging: Sources developed in OFA: 1. Each git owner will open a branch with the name ofed_1_2. This branch should be opened on 31-Jan (based on code readiness we will review today).

Re: [openib-general] ipoib, ipv6 and multicast groups

2007-01-29 Thread Hal Rosenstock
On Mon, 2007-01-29 at 13:17, chas williams - CONTRACTOR wrote: recently our sm started throwing the following errors: Jan 29 18:10:49 706710 [42003940] - __get_new_mlid: ERR 1B23: All available:32 mlids are taken Jan 29 18:10:49 706721 [42003940] - osm_mcmr_rcv_create_new_mgrp: ERR 1B19:

Re: [openib-general] QoS in OSM

2007-01-29 Thread Hal Rosenstock
let me know what you think about it. Thanks, -- Yevgeny Hal Rosenstock wrote: Hi again Yevgeny, On Thu, 2007-01-25 at 11:53, Yevgeny Kliteynik wrote: Hi Hal. Hal Rosenstock wrote: Hi Yevgeny, On Wed, 2007-01-24 at 09:10, Yevgeny Kliteynik wrote: Hi Hal, Sasha. Here's

Re: [openib-general] [PATCH MINOR] opensm: minor usage strings simplification

2007-01-26 Thread Hal Rosenstock
On Wed, 2007-01-24 at 16:11, Sasha Khapyorsky wrote: Minor usage string simplification - this helps to avoid warning with some version of vim c code analyzer. Signed-off-by: Sasha Khapyorsky [EMAIL PROTECTED] Thanks. Applied. -- Hal

Re: [openib-general] [PATCH] opensm: cleanup unused osm_req_ctrl

2007-01-26 Thread Hal Rosenstock
On Wed, 2007-01-24 at 17:19, Sasha Khapyorsky wrote: This cleans up unused osm_req_ctrl stuff and corresponding objects. Signed-off-by: Sasha Khapyorsky [EMAIL PROTECTED] Thanks. Applied. -- Hal

[openib-general] [PATCH] OpenSM: Move osm_mtl_bind.h from include/opensm to include/vendor

2007-01-26 Thread Hal Rosenstock
OpenSM: Move osm_mtl_bind.h from include/opensm to include/vendor where it belongs as it is vendor specific Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] osm/include/Makefile.am | 4 ++-- osm/include/{opensm => vendor}/osm_mtl_bind.h | 0 osm/libvendor

Re: [openib-general] [PATCH 0/6] osm: QoS policy parser

2007-01-26 Thread Hal Rosenstock
Hi again Yevgeny, On Thu, 2007-01-25 at 11:53, Yevgeny Kliteynik wrote: Hi Hal. Hal Rosenstock wrote: Hi Yevgeny, On Wed, 2007-01-24 at 09:10, Yevgeny Kliteynik wrote: Hi Hal, Sasha. Here's a description of the QoS policy file, and an example of such file (with more comments
