[ofa-general] How to get a thread out of the ib_cm_get_event?

2007-08-02 Thread Bub Thomas
Sean, our application is using OFED 1.1 under SLES10. It is a point to point IB connection from a high speed image scanner to a workstation. The application is out in the market for about a year. Now I get complains from the field that they do have dead pthreads hanging around that are

[ofa-general] ofa_1_2_kernel 20070802-0100 daily build status

2007-08-02 Thread Vladimir Sokolovsky
This email was generated automatically, please do not reply git_url: git://git.openfabrics.org/ofed_1_2/linux-2.6.git git_branch: ofed_1_2 Common build parameters: --with-ipoib-mod --with-sdp-mod --with-srp-mod --with-user_mad-mod --with-user_access-mod --with-mthca-mod --with-core-mod

[ofa-general] Reading for fun

2007-08-02 Thread fonda hunter
At long last amount you can smile about. DiscountPharmacy a Highest Canadian Worldwide Medicine Treatment Assistance Seller. From the time of opening in March of 2000, Discount-Pharmacy has earned many drugstore authorizations and turn into one of the mainly safety pharmaceutical on the World

[ofa-general] [PATCH] ehca: map 4k firmware context of cq, qp to user space

2007-08-02 Thread Hoang-Nam Nguyen
From: Hoang-Nam Nguyen hnguyen at de.ibm.com Date: Thu, 2 Aug 2007 10:08:30 +0200 Subject: [PATCH] ehca: map 4k firmware context of cq, qp to user space This patch utilizes remap_4k_pfn() as introduced by Paul M., for details see http://patchwork.ozlabs.org/linuxppc/patch?id=10281, to map ehca cq,

[ofa-general] ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Vladimir Sokolovsky
-2.6.9-55.ELsmp Passed on x86_64 with linux-2.6.9-22.ELsmp Passed on x86_64 with linux-2.6.9-34.ELsmp Passed on ia64 with linux-2.6.16.21-0.8-default Passed on ppc64 with linux-2.6.18-8.el5 Failed: Build failed on powerpc with linux-2.6.18 Log: /home/vlad/tmp/ofa_1_2_c_kernel-20070802-0201_linux

Re: [ofa-general] Re: IPoIB path caching

2007-08-02 Thread Or Gerlitz
Sean Hefty wrote: Indeed. The argument I was trying to make is that arp cache invalidation requires IPoIB PR cache invalidation, this handles 100% of the cases, including the 10% not covered by doing cache invalidation based only on IB events such as port up / sm lid change / sm reregister

[ofa-general] [RFH] IPoIB retransmission when sending multiple WR's to device

2007-08-02 Thread Krishna Kumar2
(Request For Help) Hi, On the same topic that I wrote about earlier, I put debugs in my code to store all skbs in bufferA when enqueing multiple skbs, and store all skbs to bufferB just before doing post. During post, I compare the two buffers to make sure that I am not posting in the wrong

[ofa-general] To be or not to be. To be...

2007-08-02 Thread Jack Cleveland
attachment: irauu.gif___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

[ofa-general] [PATCH] osm: cosmetics - removing trailing blanks

2007-08-02 Thread Yevgeny Kliteynik
Hi Sasha. This patch removes trailing blanks in bunch of files Signed-off-by: Yevgeny Kliteynik [EMAIL PROTECTED] --- infiniband-diags/configure.in |2 +- libibcommon/Makefile.am |2 +- libibmad/Makefile.am |2 +- opensm/Makefile.am|4 ++--

Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

2007-08-02 Thread Steve Wise
Sean Hefty wrote: Consider NFS and NFS-RDMA. The NFS gurus struggled with this very issue and concluded that the RDMA service needs to be on a separate port. Thus they are proposing a new netid/port number for doing RDMA mounts vs TCP/UDP mounts. IMO that is the correct way to go: RDMA

[ofa-general] Re: [PATCH] ehca: map 4k firmware context of cq, qp to user space

2007-08-02 Thread Arnd Bergmann
On Thursday 02 August 2007, Hoang-Nam Nguyen wrote: +#ifdef CONFIG_PPC_64K_PAGES +   /* make sure we map only 4k for fw context */ +   ret = remap_4k_pfn(vma, vma-vm_start, physical EHCA_PAGESHIFT, +      vma-vm_page_prot); +#else ret =

Re: [ofa-general] Re: IPoIB path caching

2007-08-02 Thread Sean Hefty
When I said requires I meant that I think that it is required, I agree that the current IPoIB code does not link them together. In the beginning on this thread Roland commented saying he agree with me, but since then he did not provide more input to the discussion... I think Jason said it

Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

2007-08-02 Thread Sean Hefty
In the RFC patch I posted, the socket is _just_ to allow binding to a port/addr. Its not used for anything else. From the native stack's perspective, its a TCP socket in the CLOSED state (but bound) I guess. For RDMA, I think we're somewhere in between binding to an address, versus mapping

[ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Michael S. Tsirkin
. -- Failed: Build failed on powerpc with linux-2.6.18 Log: /home/vlad/tmp/ofa_1_2_c_kernel-20070802-0201_linux-2.6.18_powerpc_check/drivers/infiniband/hw/ehca/ehca_main.c:831: error: invalid type argument of ‘-’ /home/vlad/tmp/ofa_1_2_c_kernel-20070802-0201_linux-2.6.18_powerpc_check/drivers/infiniband/hw

Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Steve Wise
=no CHECK_CROSS=no ~swise/git/ofabuild/build_ofa_kernel.sh mkdir -p /home/swise/tmp/ofa_1_2_c_kernel-20070802-0912 ~/tmp/ofa_1_2_c_kernel-20070802-0912 ~/git/ofabuild git clone -s --bare --reference /home/vlad/scm/ofed_1_2 /home/vlad/ofed_1_2/linux-2.6 .git git checkout ofed_1_2 ofed_scripts

Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Michael S. Tsirkin
Quoting Steve Wise [EMAIL PROTECTED]: Subject: Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status Also, Is something broken in the ofed_1_2 branch? I cannot even build against the local kernel on the ofa server using the ~vlad/ofed_1_2/linux-2.6 repository

Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

2007-08-02 Thread Steve Wise
Sean Hefty wrote: In the RFC patch I posted, the socket is _just_ to allow binding to a port/addr. Its not used for anything else. From the native stack's perspective, its a TCP socket in the CLOSED state (but bound) I guess. For RDMA, I think we're somewhere in between binding to an

Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Steve Wise
Michael S. Tsirkin wrote: Quoting Steve Wise [EMAIL PROTECTED]: Subject: Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status Also, Is something broken in the ofed_1_2 branch? I cannot even build against the local kernel on the ofa server using the ~vlad/ofed_1_2/linux-2.6

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Steve Wise
I'm havin' a bad day. Can you all help me? My normal process is to use the build_ofa_kernel.sh script from the ofabuild repository to build against all ofed kernels. But that scripts in the master branch of the ofabuild repository now assumes 1.2.c because it tries to configure in the

[ofa-general] Re: [RFH] IPoIB retransmission when sending multiple WR's to device

2007-08-02 Thread Roland Dreier
On the same topic that I wrote about earlier, I put debugs in my code to store all skbs in bufferA when enqueing multiple skbs, and store all skbs to bufferB just before doing post. During post, I compare the two buffers to make sure that I am not posting in the wrong order, and that

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Steve Wise
/vlad/scripts/ofed_1_2 Quoting Steve Wise [EMAIL PROTECTED]: Subject: Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status I'm havin' a bad day. Can you all help me? My normal process is to use the build_ofa_kernel.sh script from the ofabuild repository to build

[ofa-general] Re: [PATCH V2] IB/mlx4: enable MSI-X by default

2007-08-02 Thread Michael S. Tsirkin
Quoting Roland Dreier [EMAIL PROTECTED]: Subject: Re: [PATCH V2] IB/mlx4: enable MSI-X by default mlx4_err(dev, NOP command failed to generate interrupt (IRQ %d), aborting.\n, priv-eq_table.eq[MLX4_EQ_ASYNC].irq); - if

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Michael S. Tsirkin
Looke here: /home/vlad/scripts/ofed_1_2 Quoting Steve Wise [EMAIL PROTECTED]: Subject: Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status I'm havin' a bad day. Can you all help me? My normal process is to use the build_ofa_kernel.sh script from the ofabuild

Re: [ofa-general] PATCH] IB/core: ignore membership bit when looking for a P_Key in the table

2007-08-02 Thread Roland Dreier
A) what PKEY the ib_sa / ib_mad modules for the path query MAD B) what PKEY is being places in the path record used for the query. Neither really... just that the double call to ib_find_pkey() in ib_sa is not needed after this change. Never mind, I'll fix it up myself.

[ofa-general] Re: [PATCH V2] IB/mlx4: enable MSI-X by default

2007-08-02 Thread Roland Dreier
mlx4_err(dev, NOP command failed to generate interrupt (IRQ %d), aborting.\n, priv-eq_table.eq[MLX4_EQ_ASYNC].irq); -if (dev-flags MLX4_FLAG_MSI_X) -mlx4_err(dev, Try again with MSI-X disabled.\n);

Re: [ofa-general] PATCH] IB/core: ignore membership bit when looking for a P_Key in the table

2007-08-02 Thread Roland Dreier
thanks, I applied this. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] How to get a thread out of the ib_cm_get_event?

2007-08-02 Thread Sean Hefty
For your explanation I’m creating a new pthread for every new attempt to connect from the workstation to the image scanner. This will happen only when one does re-power the image scanner while having a connection established, this might happen often as once a week to once a month. The

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Steve Wise
Sean Hefty wrote: If I clone from my local system over the net, I _get_ all the branches! Anybody know why local clones on the ofa build server are not pulling all the branches? Maybe I'm abusing git? It sounds like a different between git versions. Older git versions brought in remote

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Sean Hefty
Yea, that's it. But how do I checkout the remote branch? The man page and 'git help' don't even show the -r option... I think you use the full name. Example: git checkout origin/ofed_1_2 - Sean ___ general mailing list

[ofa-general] Re: [PATCH V2] IB/mlx4: enable MSI-X by default

2007-08-02 Thread Roland Dreier
Why doesn't MSI-X work? Isn't this BIOS problem? Most likely not -- I guess there are a few cases on Opteron where the BIOS needs to enable HT mapping or something like that, but usually MSI-X fails because of a chipset problem. - R. ___ general

Re: [ewg] Re: [ofa-general] Re: ofa_1_2_c_kernel 20070802-0201 daily build status

2007-08-02 Thread Sean Hefty
If I clone from my local system over the net, I _get_ all the branches! Anybody know why local clones on the ofa build server are not pulling all the branches? Maybe I'm abusing git? It sounds like a different between git versions. Older git versions brought in remote branches such that

Re: [ofa-general] Re: IPoIB path caching

2007-08-02 Thread Sean Hefty
Well, I still think the simplest thing is to make a new netlink protocol to maintain a cache table in the kernel and then a simple user space program that uses the user space RMPP interface (and trap subscription..) to do GetTable queries and uses netlink to groom the kernel cache. Basically

[ofa-general] You can...

2007-08-02 Thread Dean Quintana
attachment: raroru.gif___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] [PATCH] mad.c: Fix memory leak in switch handling and improve error handling in ib_mad_recv_done_handler

2007-08-02 Thread Roland Dreier
thanks, I applied this as 3 separate patches. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] [PATCH] mad.c: Fix memory leak in switch handling and improve error handling in ib_mad_recv_done_handler

2007-08-02 Thread Roland Dreier
by the way, I had to apply this by hand, because it seems gmail is destroying the whitespace in the patches. I'm not sure if there's any way to send patches via gmail except as attachments unfortunately. ___ general mailing list

[ofa-general] Re: [PATCH] ib/mad: fix address handle leak in mad_rmpp

2007-08-02 Thread Roland Dreier
thanks, applied ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] [PATCH] mad.c: Fix memory leak in switch handling and improve error handling in ib_mad_recv_done_handler

2007-08-02 Thread Hal Rosenstock
Roland, On 8/2/07, Roland Dreier [EMAIL PROTECTED] wrote: by the way, I had to apply this by hand, because it seems gmail is destroying the whitespace in the patches. I'm not sure if there's any way to send patches via gmail except as attachments unfortunately. Sorry; I was afraid that was

Re: [ofa-general] [PATCH] mad.c: Fix memory leak in switch handling and improve error handling in ib_mad_recv_done_handler

2007-08-02 Thread Roland Dreier
Sorry; I was afraid that was happening :-( So would attachments be accepted ? If so, should the change still be sent in text as well ? Attachments are a pain -- the best is to find a way to send email without mangling patches. I think you can use a non-web client with gmail's smtp server to

[ofa-general] OpenSM stuck - user level MAD library seems to be timing out

2007-08-02 Thread lbt
Hi Sasha, I am hitting a problem where the user level MAD library seems to be timing out, causing the ports to be stuck in INIT state because the subnet has no Master SM available. The system is still in this state, so if there are any suggestions on what other type of debug info I could collect

Re: [ofa-general] Re: IPoIB path caching

2007-08-02 Thread Jason Gunthorpe
On Thu, Aug 02, 2007 at 11:37:27AM -0700, Sean Hefty wrote: Well, I still think the simplest thing is to make a new netlink protocol to maintain a cache table in the kernel and then a simple user space program that uses the user space RMPP interface (and trap subscription..) to do GetTable

[ofa-general] Re: [PATCH V2] core: move the macro IB_UMEM_MAX_PAGE_CHUNK to umem.c

2007-08-02 Thread Roland Dreier
thanks, applied all 3 of your header fixups. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

[ofa-general] Re: [PATCH] ehca: map 4k firmware context of cq, qp to user space

2007-08-02 Thread Roland Dreier
remap_4k_pfn is defined in terms of remap_pfn_range if the base page size if 4k, so you don't need this #ifdef afaics. Good point. I'll wait for an updated patch. ___ general mailing list general@lists.openfabrics.org

[ofa-general] Re: [PATCH 2.6.23 2/2] iw_cxgb3: Always call low level send function via cxgb3_ofld_send().

2007-08-02 Thread Roland Dreier
thanks, applied. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

[ofa-general] Re: [PATCH 2.6.23 1/2] Make the iw_cxgb3 module parameters writable.

2007-08-02 Thread Roland Dreier
thanks... I actually applied this for 2.6.24, since it's not really a fix for anything, and the 2.6.23 window is closed. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe,

Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

2007-08-02 Thread Steve Wise
Sean Hefty wrote: Okay - for final patches, I think we want to remove the rdma_cm specific port spaces, along with changing the API to clarify that it uses the same port space as TCP/UDP. If we get rid of the rdma_cm specific port spaces, do we then reduce the valid possible spaces to

Re: [ofa-general] QoS RFC

2007-08-02 Thread Yevgeny Kliteynik
Hi Hal, Hal Rosenstock wrote: Hi Yevgeny, On 7/21/07, Yevgeny Kliteynik [EMAIL PROTECTED] wrote: Hi All Please find the attached RFC describing how QoS policy support could be implemented in the OpenFabrics stack. Your comments are welcome. A couple of quick questions: How does this

Re: [ofa-general] Re: [PATCH RFC] RDMA/CMA: Allocate PS_TCP ports from the host TCP port space.

2007-08-02 Thread Sean Hefty
If we get rid of the rdma_cm specific port spaces, do we then reduce the valid possible spaces to just TCP and UDP? Or what? In the sockets paradigm, the socket is explicitly bound to a protocol space when its created (based on the protocol id). Do you think we need to change the

Re: [ofa-general] QoS RFC

2007-08-02 Thread Yevgeny Kliteynik
Sasha Khapyorsky wrote: Hi Yevgeny, On 15:39 Thu 26 Jul , Yevgeny Kliteynik wrote: * Comments may appear only in a separate line Why? What is wrong with: port-name: vs1/HCA-1/P1 # my best port I can use this too, but then the pound sign, wherever it will appear, would mean

[ofa-general] Re: [RFH] IPoIB retransmission when sending multiple WR's to device

2007-08-02 Thread Krishna Kumar2
Hi Roland, Roland Dreier [EMAIL PROTECTED] wrote on 08/02/2007 09:59:23 PM: On the same topic that I wrote about earlier, I put debugs in my code to store all skbs in bufferA when enqueing multiple skbs, and store all skbs to bufferB just before doing post. During post, I compare the

[ofa-general] Re: [RFH] IPoIB retransmission when sending multiple WR's to device

2007-08-02 Thread Krishna Kumar2
Hi Roland, I did one more test to check the out-of-order theory. I changed my new API to be: /* Original code, unmodified */ ipoib_start_xmit() { original code } /* Added new xmit which is identical to original code but doesn't get the lock */ ipoib_start_xmit_nolock() { original