So I did the rebuild with -g and start the debugger and after a few
hundred "n" I figured something was up. I started the server as normal
(no gdb) and it started just fine. I was sure I had done a "make clean"
before trying the IB stuff, but maybe not. Sorry about the false alarm
there and thanks for the quick answers!

My problem now seems to be on the client side. I do a "modprobe pvfs2",
start the client and then try to mount the filesystem. 


[EMAIL PROTECTED] tmp]# mount -t pvfs2 ib://compute-0-0:3334/pvfs2-fs
/mnt/pvfs2
mount: Connection refused

The client logs show

[EMAIL PROTECTED] tmp]# pvfs2-ping -m /mnt/pvfs2

(1) Parsing tab file...

(2) Initializing system interface...

(3) Initializing each file system found in tab file: /etc/pvfs2tab...

   PVFS2 servers: ib://compute-0-0:3334
   Storage name: pvfs2-fs
   Local mount point: /mnt/pvfs2
[E 15:24:28.563484] Warning: ib_tcp_client_connect: connect to server
compute-0-0:3334: Connection refused.
[E 15:24:28.563628] Receive immediately failed: Connection refused
[E 15:24:28.563725] msgpair failed, will retry: Connection refused
 
I'm throwing in my /etc/pvfs2-fs.conf file here

<Defaults>
        UnexpectedRequests 50
        EventLogging none
        LogStamp datetime
        BMIModules bmi_ib
        FlowModules flowproto_multiqueue
        PerfUpdateInterval 1000
        ServerJobBMITimeoutSecs 30
        ServerJobFlowTimeoutSecs 30
        ClientJobBMITimeoutSecs 300
        ClientJobFlowTimeoutSecs 300
        ClientRetryLimit 5
        ClientRetryDelayMilliSecs 2000
</Defaults>

<Aliases>
        Alias compute-0-0 ib://compute-0-0:3335
        Alias compute-0-1 ib://compute-0-1:3335
        Alias compute-0-2 ib://compute-0-2:3335
        Alias compute-0-3 ib://compute-0-3:3335
</Aliases>

<Filesystem>
        Name pvfs2-fs
        ID 1110091704
        RootHandle 1048576
        <MetaHandleRanges>
                Range compute-0-0 4-858993461
        </MetaHandleRanges>
        <DataHandleRanges>
                Range compute-0-0 858993462-1717986919
                Range compute-0-1 1717986920-2576980377
                Range compute-0-2 2576980378-3435973835
                Range compute-0-3 3435973836-4294967293
        </DataHandleRanges>
        <StorageHints>
                TroveSyncMeta yes
                TroveSyncData no
        </StorageHints>
</Filesystem>

-----Original Message-----
From: Pete Wyckoff [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, March 20, 2007 2:43 PM
To: Carlson, Timothy S
Cc: [email protected]
Subject: Re: [Pvfs2-users] error starting server compiled for
infiniband(topspin)

[EMAIL PROTECTED] wrote on Tue, 20 Mar 2007 13:59 -0700:
> I've built and installed pvfs-2.6.2 on x86_64 (RHEL 4) with 
> Cisco/Topspin cards and software stack.
> 
> topspin-ib-rhel4-3.2.0-118
> topspin-ib-mod-rhel4-2.6.9-42.ELsmp-3.2.0-118
> 
> ./configure --with-kernel=/usr/src/kernels/2.6.9-42.EL-smp-x86_64
> --with-ib=/usr/local/topspin
> --with-ib-includes=/usr/local/topspin/include/vapi
> 
> Everything builds fine and I install with make install make 
> kmod_install
> 
> However after configuring the metadata and IO nodes, I try to start 
> the server and get this in the server log.
> 
> [D 03/20 13:25] PVFS2 Server version 2.6.2 starting.
> [E 03/20 13:25] src/io/bmi/bmi.c line 1841: Error: no method available

> for bmi_ib.
> [E 03/20 13:25]         [bt] /usr/local/sbin/pvfs2-server [0x433a50]
> [E 03/20 13:25]         [bt]
> /usr/local/sbin/pvfs2-server(BMI_initialize+0x13e) [0x433f1e]
> [E 03/20 13:25]         [bt] /usr/local/sbin/pvfs2-server [0x40de0d]
> [E 03/20 13:25]         [bt] /usr/local/sbin/pvfs2-server(main+0xaa1)
> [0x40f6a1]
> [E 03/20 13:25]         [bt]
> /lib64/tls/libc.so.6(__libc_start_main+0xdb) [0x39ce01c3fb]
> [E 03/20 13:25]         [bt]
> /usr/local/sbin/pvfs2-server(aio_cancel64+0x16a) [0x40d4aa] [E 03/20 
> 13:25] Error: BMI_initialize: Protocol not available [E 03/20 13:25] 
> Error: Could not initialize server interfaces; aborting.
> [E 03/20 13:25] Error: Could not initialize server; aborting.

Recompile with debugging:
        make clean
        CFLAGS=-g ./configure ...
        make
        make install
and cd to where your server expects to run.  Then do:
        gdb --args pvfs2-server -d fs.conf server.conf
        b BMI_ib_initialize
        run
then use "n" to walk through the function and try to find out where it
is failing.  I would have expected an error message, but do not see one
above.

Your build and ldd output look okay.  Your fs.conf has lines like:
            BMIModules bmi_ib
            ...
            Alias myhost1 ib://myhost1:3335
or there would have been different messages, but check anyway.

If you get a bit closer, let us know and we'll figure out the problem.

                -- Pete

P.S.  Can you get exchange to turn off html email?

_______________________________________________
Pvfs2-users mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users

Reply via email to