Trying again - apparently the message is too long.
====
Joe Mervini
Sandia National Laboratories
High Performance Computing
505.844.6770
jame...@sandia.gov<mailto:jame...@sandia.gov>
On Aug 23, 2017, at 9:22 AM, Mervini, Joseph A
<jame...@sandia.gov<mailto:jame...@sandia.gov>> wrote:
That is the weird thing about this problem. There is NO crash dump (even though
it is enabled) and there is nothing in either the console or any system logs
that point to why it’s rebooting. The machine just goes “Poof” and reboots as
if power was switch off and then back on again. I would consider it a hardware
problem but I am having identical problems on 3 different platforms, 2 being
Dell R720s and one Penguin 1905E.
I have applied all the recommended tuning parameters for mariadb and lustre.
The only thing that I haven’t messed with is the robinhood config file
(modified from the basic.conf) but I would suspect that if there were an issue
there it would be exposed immediately. In any event this is what I am using:
[root@littlejohn log]# cat /etc/robinhood.d/scratch1.conf
General {
fs_path = "/scratch1";
# filesystem type, as displayed by 'mount' (e.g. ext4, xfs, lustre, ...)
fs_type = lustre;
}
Log {
log_file = "/var/log/robinhood.log";
report_file = "/var/log/robinhood_actions.log";
alert_file = "/var/log/robinhood_alerts.log";
debug_level = full;
stats_interval = 5min;
}
ListManager {
MySQL {
server = localhost;
db = scratch1;
user = robinhood;
password_file = /etc/robinhood.d/.dbpassword;
}
}
# Lustre 2.x only
ChangeLog {
MDT {
mdt_name = "MDT0000";
reader_id = "cl1";
}
}
Is there perhaps something I am missing?
Also I am building robinhood from the robinhood-master.zip archive (md5sum
fbf96fddad156b69c3db5bbdf5e3840d). After unpacking the archive I change into
the robinhood-master directory, initalize the git repo and commit everything,
run autogen.sh, configure and make rpms and everything runs clean with the
exception of the following warnings:
CC rbh_cfg.lo
conf_lex.c:1767:17: warning: 'yyunput' defined but not used [-Wunused-function]
static void yyunput (int c, register char * yy_bp )
^
conf_lex.c:1808:16: warning: 'input' defined but not used [-Wunused-function]
static int input (void)
^
make[3]: Entering directory
`/root/robinhood-master/rpms/BUILD/robinhood-3.0/src/tools'
CC lhsmtool_cmd-lhsmtool_cmd.o
lhsmtool_cmd.c: In function 'ct_setup':
lhsmtool_cmd.c:891:2: warning: 'g_thread_init' is deprecated (declared at
/usr/include/glib-2.0/glib/deprecated/gthread.h:265) [-Wdeprecated-declarations]
g_thread_init(NULL);
^
lhsmtool_cmd.c:906:3: warning: 'g_thread_create' is deprecated (declared at
/usr/include/glib-2.0/glib/deprecated/gthread.h:104): Use 'g_thread_new'
instead [-Wdeprecated-declarations]
g_thread_create(subproc_mgr_main, NULL, false, NULL);
^
CCLD lhsmtool_cmd
I am baffled. Any help would be greatly appreciated.
====
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
robinhood-support mailing list
robinhood-support@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-support