This may be due to the code recently added to handle Lustre 2.7 changelogs.

To determine the cause of the crash, you can run robinhood in gdb:

 > gdb --args robinhood --read-log

 gdb> run
 ...
 SEGFAULT

 gdb> where

This will display the call stack at the point where the crash occurs.
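
If an interactive session is inconvenient, the same steps can be run
non-interactively. This is only a minimal sketch: it assumes gdb is
installed and robinhood is in your PATH, and the function name and the
default output file name are made up for illustration. Since robinhood
is multi-threaded, it asks for the backtrace of every thread:

```shell
# Hypothetical helper: run robinhood under gdb in batch mode and save
# the backtrace of all threads once the program stops (e.g. on SIGSEGV).
# Usage: rbh_backtrace [output_file] [extra robinhood options...]
rbh_backtrace() {
    out=${1:-robinhood-backtrace.txt}   # example file name, not a robinhood default
    if [ "$#" -gt 0 ]; then
        shift                           # remaining arguments go to robinhood
    fi
    # -batch: exit after the commands have run; -ex: run one gdb command
    gdb -batch -ex run -ex "thread apply all bt" \
        --args robinhood --read-log "$@" > "$out" 2>&1
    echo "backtrace written to $out"
}
```

For example, "rbh_backtrace bt.txt --log-level=DEBUG" would leave the
trace in bt.txt. If the daemon does not crash, gdb keeps running until
robinhood is stopped.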

Thanks for reporting!

Thomas


On 06/15/15 19:50, Frederik Ferner wrote:
> All,
>
> while upgrading our Lustre file system to Lustre 2.7, I also upgraded
> robinhood to the newly released 2.5.5. I did download the tar file and
> compiled it locally as the pre-built rpms on sourceforge have a
> dependency on lustre-modules but on our site the rpm provides
> lustre-client-modules.
>
> The RPM installed fine, the server is running Lustre 2.7 but with the
> same configuration that previously was running fine (on 2.5.4) the new
> version now segfaults on startup (called as robinhood --read-log). I'm
> currently not sure how to debug this further. Any pointers welcome,
> strace wasn't helpful in determining where it crashes, the log isn't
> that clear either, with normal options the following are the only lines
> in the logfile:
>
> <snip>
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] CheckFS |
> '/mnt/lustre03' matches mount point '/mnt/lustre03', type=lustre,
> fs=cs04r-sc-mds03-01-10ge@tcp:cs04r-sc-mds03-02-10ge@tcp:/lustre03
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr |
> Signals SIGTERM and SIGINT (daemon shutdown) are ready to be used
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr |
> Signal SIGHUP (config reloading) is ready to be used
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr |
> Signal SIGUSR1 (stats dump) is ready to be used
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No
> class defined in policies, disabling file class matching.
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No
> class defined in policies, disabling dir class matching.
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] Main | Daemon
> started (running modules: log_reader)
> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/3] ChangeLog |
> LU-1331 is fixed in this version of Lustre.
> </snip>
>
> With --log-level=DEBUG there are quite a few lines like this following
> before it just stops:
>
> <snip>
> 2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog |
> MDT0000: 3435143616 14SATTR 1434376871.895757017 0x14
> t=[0x20001026e:0x86f9:0x0]
> 2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog |
> MDT0000: 3435143617 08RENME 1434376871.897757063 0x1
> t=[0x20000fa28:0xb3ef:0x0] p=[0xecedd2c:0x52385992:0x0]
> LineScan$py.class s=[0x20001026e:0x86f9:0x0]
> sp=[0xecedd2c:0x52385992:0x0] .LineScan$py.class.3FMhca
> </snip>
>
>
> Cheers,
> Frederik

