Gang,

Wondering if anyone has run across this yet (I'm new to the list).  Am running
pvfs2 server v 2.7.0 on 3 meta/io servers which are RHEL5
2.6.18-53.1.6.el5.  These are dual/dual opteron machines with 8G ram per.
The clients are running the same version, only built on RHEL4 kernel.  The
filesystem is very large (3 x 8TB ext3s) with a fs.conf file:

<Defaults>
        UnexpectedRequests 50
        EventLogging all
        LogStamp datetime
        BMIModules bmi_tcp
        FlowModules flowproto_multiqueue
        PerfUpdateInterval 1000
        ServerJobBMITimeoutSecs 30
        ServerJobFlowTimeoutSecs 30
        ClientJobBMITimeoutSecs 300
        ClientJobFlowTimeoutSecs 300
        ClientRetryLimit 5
        ClientRetryDelayMilliSecs 2000

        StorageSpace /pvfs_storage
        LogFile /var/log/pvfs2-server.log
</Defaults>

<Aliases>
        Alias odinnfs2 tcp://odinnfs2:3334
        Alias odinnfs3 tcp://odinnfs3:3334
        Alias odinnfs4 tcp://odinnfs4:3334
</Aliases>

<Filesystem>
        Name pvfs2-fs
        ID 757469263
        RootHandle 1048576
        <MetaHandleRanges>
                Range odinnfs2 3-1537228672809129302
                Range odinnfs3 1537228672809129303-3074457345618258602
                Range odinnfs4 3074457345618258603-4611686018427387902
        </MetaHandleRanges>
        <DataHandleRanges>
                Range odinnfs2 4611686018427387903-6148914691236517202
                Range odinnfs3 6148914691236517203-7686143364045646502
                Range odinnfs4 7686143364045646503-9223372036854775802
        </DataHandleRanges>
        <StorageHints>
                TroveSyncMeta yes
                TroveSyncData no
        </StorageHints>
</Filesystem>

--

I have a user that consistently crashes one of the servers (seems random).
After enabling segv-backtrace, I get the following message:

[D 02/14 15:48] PVFS2 Server version 2.7.0 starting.
[E 02/14 15:55] PVFS2 server: signal 11, faulty address is 0x18, from 0x3cb366ee
f3
[E 02/14 15:55] [bt] /lib64/libc.so.6 [0x3cb366eef3]
[E 02/14 15:55] [bt] /lib64/libc.so.6 [0x3cb366eef3]
[E 02/14 15:55] [bt] /lib64/libc.so.6(cfree+0x8c) [0x3cb3672b1c]
[E 02/14 15:55] [bt] /usr/sbin/pvfs2-server(job_testcontext+0x13b) [0x432ecb]
[E 02/14 15:55] [bt] /usr/sbin/pvfs2-server(main+0xdc8) [0x4109f8]
[E 02/14 15:55] [bt] /lib64/libc.so.6(__libc_start_main+0xf4) [0x3cb361d8a4]
[E 02/14 15:55] [bt] /usr/sbin/pvfs2-server [0x40e369]

Does anyone have a clue about this one?

Thanks in advance,
jon


  +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  |  Jon Burgoyne                                      Indiana University |
  +  [EMAIL PROTECTED]                                 Computer Science  +
  |  Voice: US 812-855-6990                          Fax: US 812-855-4829 |
  +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
_______________________________________________
Pvfs2-users mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users

Reply via email to