See inline..

On 02 Sep 2005 17:04:42 -0400, Hal Rosenstock <[EMAIL PROTECTED]> wrote:
On Fri, 2005-09-02 at 16:59, Viswanath Krishnamurthy wrote:
> Here is the setup..

Thanks. A couple more questions:

> #svn info
> Path: .
>
> URL: https://openib.org/svn/gen2/trunk
> Repository UUID: 21a7a0b7-18d7-0310-8e21-e8b31bdbf5cd
> Revision: 3295
> Node Kind: directory
> Schedule: normal
> Last Changed Author: halr
> Last Changed Rev: 3295
> Last Changed Date: 2005-09-01 12:07:54 -0700 (Thu, 01 Sep 2005)
>
>
> Patch applied to core/at.c and kernel 2.6.13 recompiled.
>
>
> Machine  A
> =========
> Running opensm
>
> Run ucmpost
>
> machine B
> =========
> ./ucmpost <ipaddr_of_machineA>

Are these back to back HCAs or is there a switch in between ?


There is a  switch in between.  A simple setup with 2 machines and a switch.  The machines are running
2.6.13. One of them is running opensm.

> The problem is reproducible when you *cannot* ping each other

over IPoIB ?


Yes..

> [EMAIL PROTECTED] ~]# ibv_devinfo
> hca_id: mthca0
>         fw_ver:                         1.0.1
>         node_guid:                      0002:c902:0040:0d00
>         sys_image_guid:                 0002:c902:0040:0d03
>         max_mr_size:                    0xffffffffffffffff
>         page_size_cap:                  0x0
>         vendor_id:                      0x02c9
>         vendor_part_id:                 25204
>         hw_ver:                         0x0
>         phys_port_cnt:                  1
>                 port:   1
>                         state:                  PORT_ACTIVE (4)
>                         max_mtu:                invalid MTU (0)  <
> What is this ??>
>                         active_mtu:             invalid MTU (0)

If the program is right and those are the real values, somehow max_mtu
is trashed which causes active_mtu to be invalid which could break all
sorts of things...

Is there some issue with the HCA ?

>                         sm_lid:                 1
>                         port_lid:               3
>                         port_lmc:               0x00

That's on the remote (from the SM) machine.

-- Hal


_______________________________________________
openib-general mailing list
[email protected]
http://openib.org/mailman/listinfo/openib-general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to