[openib-general] getting LOC_QP_OP_ERR with IPoIB

2006-09-05 Thread Or Gerlitz
Hi, While doing some work to have linux bonding driver be able to work on top of IPoIB i have run into LOC_QP_OP_ERR with vendor (mellanox PCIX HCA) error 62. ib0: failed send event (status=2, wrid=52 vend_err 62) What does this vendor error means? its the same system over which i saw

Re: [openib-general] getting LOC_QP_OP_ERR with IPoIB

2006-09-05 Thread Michael S. Tsirkin
Quoting r. Or Gerlitz [EMAIL PROTECTED]: Subject: getting LOC_QP_OP_ERR with IPoIB Hi, While doing some work to have linux bonding driver be able to work on top of IPoIB i have run into LOC_QP_OP_ERR with vendor (mellanox PCIX HCA) error 62. ib0: failed send event (status=2,

[openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Leonid Arsh
Hi list, I have a question regarding the guid2lid cache file. The file is read by OpenSM on the start up. OpenSM may reassign LIDs according to the LIDs saved in this file. It isn't always acceptable. Is it a right policy? Am I missing anything here? Is there a way to disable the file

Re: [openib-general] getting LOC_QP_OP_ERR with IPoIB

2006-09-05 Thread Or Gerlitz
Michael S. Tsirkin wrote: Quoting r. Or Gerlitz [EMAIL PROTECTED]: While doing some work to have linux bonding driver be able to work on top of IPoIB i have run into LOC_QP_OP_ERR with vendor (mellanox PCIX HCA) error 62. ib0: failed send event (status=2, wrid=52 vend_err 62) What

Re: [openib-general] getting LOC_QP_OP_ERR with IPoIB

2006-09-05 Thread Michael S. Tsirkin
Quoting r. Or Gerlitz [EMAIL PROTECTED]: Subject: Re: getting LOC_QP_OP_ERR with IPoIB Michael S. Tsirkin wrote: Quoting r. Or Gerlitz [EMAIL PROTECTED]: While doing some work to have linux bonding driver be able to work on top of IPoIB i have run into LOC_QP_OP_ERR with vendor

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Hal Rosenstock
Hi Leonid, On Tue, 2006-09-05 at 03:30, Leonid Arsh wrote: Hi list, I have a question regarding the guid2lid cache file. The file is read by OpenSM on the start up. OpenSM may reassign LIDs according to the LIDs saved in this file. It isn't always acceptable. Is it a right

Re: [openib-general] MPI Brodcast doubt

2006-09-05 Thread Hal Rosenstock
John, On Mon, 2006-09-04 at 08:56, john t wrote: Hi, I have 3 nodes connected via IB as shown below: node1 --- switch1 --- node2 |-- node3 If node1 sends a brodcast message to node2 and node3, I want to know if the message is delivered to the switch

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Leonid Arsh
Hi Hal, Thank you for your reply. Probably I wasn't clear. I have a problem when OpenSM, being started, reads an out-if-date guid2lid file. OpenSM changes LIDs in this case. I don't want the LIDs to be changed. As I understand it, the '-r' option, on the contrary, causes the SM to reassign

Re: [openib-general] MPI Brodcast doubt

2006-09-05 Thread Dotan Barak
Hal Rosenstock wrote: John, On Mon, 2006-09-04 at 08:56, john t wrote: Hi, I have 3 nodes connected via IB as shown below: node1 --- switch1 --- node2 |-- node3 If node1 sends a brodcast message to node2 and node3, I want to know if the message is

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Hal Rosenstock
Hi Leonid, On Tue, 2006-09-05 at 08:11, Leonid Arsh wrote: Hi Hal, Thank you for your reply. Probably I wasn't clear. I have a problem when OpenSM, being started, reads an out-if-date guid2lid file. OpenSM changes LIDs in this case. How do you know the file is out of date ? I

Re: [openib-general] getting LOC_QP_OP_ERR with IPoIB - mstflint question

2006-09-05 Thread Or Gerlitz
Michael S. Tsirkin wrote: Donnu, it looks really weird. Could you try firmware 3.5.0 please? I just noted that you can not work with mstflint if the mthca driver is not loaded, i think it was not the case in the gen1 tools, am i correct. Is this connected to this print ACPI: PCI

Re: [openib-general] problems to regiser memory as a reglar

2006-09-05 Thread Tziporet Koren
Dhabaleswar Panda wrote: Christian - Thanks for sending instructions for running mvapich2-0.9.5 to Tziporet. Tziporet - Thanks for looking into this problem on SLES9 environment. Please note that a detailed user guide for running and tuning MVAPICH2 0.9.5 is available from the following

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Leonid Arsh
Thanks, On 05 Sep 2006 08:46:22 -0400, Hal Rosenstock [EMAIL PROTECTED] wrote: I have a problem when OpenSM, being started, reads an out-if-date guid2lid file. OpenSM changes LIDs in this case. How do you know the file is out of date ? Actually, the LIDs were assigned by another SM.

[openib-general] [Bug 131] working with huge pages may crash the kernel on Suse10

2006-09-05 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=131 [EMAIL PROTECTED] changed: What|Removed |Added Status|NEW |RESOLVED Resolution|

[openib-general] [Bug 145] IB Core unable to communicate IPoIB on Fedora Core 4

2006-09-05 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=145 [EMAIL PROTECTED] changed: What|Removed |Added Status|NEW |RESOLVED Resolution|

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Hal Rosenstock
Leonid, On Tue, 2006-09-05 at 09:13, Leonid Arsh wrote: Thanks, On 05 Sep 2006 08:46:22 -0400, Hal Rosenstock [EMAIL PROTECTED] wrote: I have a problem when OpenSM, being started, reads an out-if-date guid2lid file. OpenSM changes LIDs in this case. How do you know the file is

Re: [openib-general] [PATCH] opensm: osm_log_init_v2() - new osm_log initializer

2006-09-05 Thread Hal Rosenstock
On Mon, 2006-09-04 at 13:20, Sasha Khapyorsky wrote: There is new osm_log initializer osm_log_init_v2(), this is wrapped by osm_log_init() in order to preserve existing API. Signed-off-by: Sasha Khapyorsky [EMAIL PROTECTED] Thanks. Applied (to trunk and 1.1). -- Hal

Re: [openib-general] OpenSM - guid2lid cache file questions

2006-09-05 Thread Eitan Zahavi
Hi Leonid, The best approach when switching from another vendor SM to OpenSM is to delete the /var/cache/osm/guid2lid file. -Original Message- From: [EMAIL PROTECTED] [mailto:openib-general- [EMAIL PROTECTED] On Behalf Of Hal Rosenstock Sent: Tuesday, September 05, 2006 4:18 PM To:

Re: [openib-general] getting LOC_QP_OP_ERR with IPoIB - mstflint question

2006-09-05 Thread Michael S. Tsirkin
Quoting r. Or Gerlitz [EMAIL PROTECTED]: Subject: Re: getting LOC_QP_OP_ERR with IPoIB - mstflint question Michael S. Tsirkin wrote: Donnu, it looks really weird. Could you try firmware 3.5.0 please? I just noted that you can not work with mstflint if the mthca driver is not loaded, i

[openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread Bub Thomas
Title: libibcm can't connect/talk to libicm on other machine. Im still in the process of migrating my gen1 application to gen2. Actually I CAN connect a gen2 application to a gen2 listener application on the same machine but NOT to a gen 2 listener on another machine. Any hints where to

Re: [openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread Dotan Barak
Hi bub. Bub Thomas wrote: I’m still in the process of migrating my gen1 application to gen2. Actually I CAN connect a gen2 application to a gen2 listener application on the same machine but NOT to a gen 2 listener on another machine. Any hints where to look at? Is there anything in

Re: [openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread Hal Rosenstock
Hi Bub, On Tue, 2006-09-05 at 10:22, Bub Thomas wrote: I’m still in the process of migrating my gen1 application to gen2. Actually I CAN connect a gen2 application to a gen2 listener application on the same machine but NOT to a gen 2 listener on another machine. Any hints where to look

Re: [openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread Bub Thomas
Dotan, the ibv_rc_pingpong example works for me so I can exclude the architecture. I never got the libibcm example compiled. Which is your example and which architecture x86 vs. x86_64 did you compile it for? Can you share your libibcm the example code? (if it is not the standard that I can't get

[openib-general] New development tool for boot-time drivers (FCode, IEE-1275, IBM/Sun)

2006-09-05 Thread David L Paktor
If anyone is interested in developing boot-time device drivers for plug-in devices, conformant to the IEEE-1275 (Open Firmware) specification, using FCode (tokenized Forth source), which is compatible with both IBM and Sun platforms (and is platform-independent, so that a driver written once is

Re: [openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread Sean Hefty
Bub Thomas wrote: Dotan, the ibv_rc_pingpong example works for me so I can exclude the architecture. I never got the libibcm example compiled. Which is your example and which architecture x86 vs. x86_64 did you compile it for? Can you share your libibcm the example code? (if it is not the

Re: [openib-general] libibcm can't connect/talk to libicm on other machine.

2006-09-05 Thread JWM
Title: libibcm can't connect/talk to libicm on other machine. I know this sounds simple, but have you checked the routing tables? JW - Original Message - From: Bub Thomas To: openib-general@openib.org Sent: Tuesday, September 05, 2006 9:22 AM Subject:

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Arlin Davis
Robert, Here is a slightly modified patch for your attributes issue. Can you give it a try? Signed-off by: Arlin Davis [EMAIL PROTECTED] Index: dapl/openib/dapl_ib_util.c === --- dapl/openib/dapl_ib_util.c (revision 9106) +++

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Arlin Davis wrote: Robert, Here is a slightly modified patch for your attributes issue. Can you give it a try? I'll give it a spin this afternoon: it looks quite a bit more comprehensive than the small patch I did. Regards, Robert.

[openib-general] Question about interrupt generation

2006-09-05 Thread harish
Hi All,I tried the following simple experiment and am not able to understand the results:Calcualted the number of interrupts generated by the infiniband [with little or no traffic to the NIC] over a period of 10seconds and saw around 10-20 interrupts/sec. Then ran a netperf test and saw around

Re: [openib-general] [-mm patch] drivers/infiniband/hw/amso1100/: possible cleanups

2006-09-05 Thread Roland Dreier
Thanks, I've rolled this up in the amso1100 patch I have queued up. - #if 0 the following unused global function: - c2_mq.c: c2_mq_count() Tom/Steve, any reason to keep c2_mq_count() at all? - R. ___ openib-general mailing list

Re: [openib-general] [PATCH] for-2.6.19 cma: protect against adding device during destruction

2006-09-05 Thread Roland Dreier
Thanks, queued for 2.6.19. ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Arlin Davis
Robert Walsh wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Arlin Davis wrote: Robert, Here is a slightly modified patch for your attributes issue. Can you give it a try? I'll give it a spin this afternoon: it looks quite a bit more comprehensive than the small patch I did.

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Just added all appropriate RDMA in/out fields and some code to zero out the structure to avoid uninitialized data fields. Yup. By comprehensive, I meant better :-) -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using

Re: [openib-general] [-mm patch] drivers/infiniband/hw/amso1100/: possible cleanups

2006-09-05 Thread Steve Wise
Its old debug code that isn't used anywhere. It would be nice to keep it around, but if you really don't want it, nuke it... On Tue, 2006-09-05 at 14:57 -0700, Roland Dreier wrote: Thanks, I've rolled this up in the amso1100 patch I have queued up. - #if 0 the following unused global

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Arlin Davis wrote: Robert, Here is a slightly modified patch for your attributes issue. Can you give it a try? Oddly enough, I'm back to the same problem with your new patch as I saw with the unpatched version: $ mpiexec -n 2 ./a.out

Re: [openib-general] [-mm patch] drivers/infiniband/hw/amso1100/: possible cleanups

2006-09-05 Thread Roland Dreier
Steve Its old debug code that isn't used anywhere. It would be Steve nice to keep it around, but if you really don't want it, Steve nuke it... No, that's fine, I'll leave it inside the #if 0. - R. ___ openib-general mailing list

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Arlin Davis
Oddly enough, I'm back to the same problem with your new patch as I saw with the unpatched version: Hmmm. We ran this with OFED 1.1 RC3 and MPI 3.0b on an EM64T server with your adapter and it worked. Did you ever pick up the Intel MPI 3.0 beta? $ mpiexec -n 2 ./a.out I_MPI: [1]

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Oddly enough, I'm back to the same problem with your new patch as I saw with the unpatched version: Hmmm. We ran this with OFED 1.1 RC3 and MPI 3.0b on an EM64T server with your adapter and it worked. Weird - it's not working for me at all.

[openib-general] [Bug 218] New: Call usage verifier is detecting reinitialization of spinlocks already in use

2006-09-05 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=218 Summary: Call usage verifier is detecting reinitialization of spinlocks already in use Product: OpenFabrics Windows Version: unspecified Platform: X86 OS/Version: Other

Re: [openib-general] Question about interrupt generation

2006-09-05 Thread harish
Hi,One more question. What kind of event mask helps mask the interrupts?thanksharishOn 9/5/06, harish [EMAIL PROTECTED] wrote:Hi All,I tried the following simple experiment and am not able to understand the results: Calcualted the number of interrupts generated by the infiniband [with little or

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Woodruff, Robert J wrote: Robert Walsh wrote, I'll give it a spin this afternoon: it looks quite a bit more comprehensive than the small patch I did. I also just tried running the ib_rdma_bw test and it seems to be flaky if you stress it. If

Re: [openib-general] [PATCH] OFED 1.1-rc3 is ready

2006-09-05 Thread Robert Walsh
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Here is a slightly modified patch for your attributes issue. Can you give it a try? I rebuilt OFED from scratch with the patch, and ran successfully on Intel MPI 2.0.1 with the refresh patch. I could not get it to run on Intel MPI 3.0b. If you

Re: [openib-general] [openfabrics-ewg] OFED 1.1-rc2 is ready (how do I enable madeye)?

2006-09-05 Thread Scott Weitzenkamp (sweitzen)
5. Added Madeye utility How do I build madeye? I don't see any reference to it to install.sh. Is there any documentation for madeye? Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems ___ openib-general