Trying again - apparently the message is too long.
====

Joe Mervini
Sandia National Laboratories
High Performance Computing
505.844.6770
jame...@sandia.gov<mailto:jame...@sandia.gov>



On Aug 23, 2017, at 9:22 AM, Mervini, Joseph A 
<jame...@sandia.gov<mailto:jame...@sandia.gov>> wrote:

That is the weird thing about this problem. There is NO crash dump (even though 
it is enabled) and there is nothing in either the console or any system logs 
that point to why it’s rebooting. The machine just goes “Poof” and reboots as 
if power was switch off and then back on again. I would consider it a hardware 
problem but I am having identical problems on 3 different platforms, 2 being 
Dell R720s and one Penguin 1905E.

I have applied all the recommended tuning parameters for mariadb and lustre. 
The only thing that I haven’t messed with is the robinhood config file 
(modified from the basic.conf)  but I would suspect that if there were an issue 
there it would be exposed immediately. In any event this is what I am using:

[root@littlejohn log]# cat /etc/robinhood.d/scratch1.conf
General {
    fs_path = "/scratch1";
    # filesystem type, as displayed by 'mount' (e.g. ext4, xfs, lustre, ...)
    fs_type = lustre;
}

Log {
    log_file = "/var/log/robinhood.log";
    report_file = "/var/log/robinhood_actions.log";
    alert_file = "/var/log/robinhood_alerts.log";
    debug_level = full;
    stats_interval = 5min;
}

ListManager {
    MySQL {
        server = localhost;
        db = scratch1;
        user = robinhood;
        password_file = /etc/robinhood.d/.dbpassword;
    }
}

# Lustre 2.x only
ChangeLog {
    MDT {
        mdt_name = "MDT0000";
        reader_id = "cl1";
    }
}

Is there perhaps something I am missing?

Also I am building robinhood from the robinhood-master.zip archive (md5sum 
fbf96fddad156b69c3db5bbdf5e3840d). After unpacking the archive I change into 
the robinhood-master directory, initalize the git repo and commit everything, 
run autogen.sh, configure and make rpms and everything runs clean with the 
exception of the following warnings:

  CC       rbh_cfg.lo
conf_lex.c:1767:17: warning: 'yyunput' defined but not used [-Wunused-function]
     static void yyunput (int c, register char * yy_bp )
                 ^
conf_lex.c:1808:16: warning: 'input' defined but not used [-Wunused-function]
     static int input  (void)
                ^

make[3]: Entering directory 
`/root/robinhood-master/rpms/BUILD/robinhood-3.0/src/tools'
  CC       lhsmtool_cmd-lhsmtool_cmd.o
lhsmtool_cmd.c: In function 'ct_setup':
lhsmtool_cmd.c:891:2: warning: 'g_thread_init' is deprecated (declared at 
/usr/include/glib-2.0/glib/deprecated/gthread.h:265) [-Wdeprecated-declarations]
  g_thread_init(NULL);
  ^
lhsmtool_cmd.c:906:3: warning: 'g_thread_create' is deprecated (declared at 
/usr/include/glib-2.0/glib/deprecated/gthread.h:104): Use 'g_thread_new' 
instead [-Wdeprecated-declarations]
   g_thread_create(subproc_mgr_main, NULL, false, NULL);
   ^
  CCLD     lhsmtool_cmd


I am baffled. Any help would be greatly appreciated.

====




------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
robinhood-support mailing list
robinhood-support@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-support

Reply via email to