[robinhood-support] Robinhood 3.0.1 rebooting unexpectedly on initial scan with Lustre 2.8.0.8

2017-07-11 Thread Mervini, Joseph A
Hello,

I have a problem similar to 
https://sourceforge.net/p/robinhood/mailman/message/35883907/ in which the 
robinhood server running mariadb-5.5.52-1.el7.x86_64 and lustre 2.8.0.8 client 
will reboot when the initial scan is run. I am running this in a testbed 
environment prior to deployment on our production system because I want to get 
a complete handle on it before I commit to the deployment. I have 2 separate 
lustre file systems that I am running against: One is a 408TB lustre 2.8 file 
system with ~16M inodes, the other is a 204TB lustre 2.5.5 file system with ~3M 
inodes.

The curious thing is that I had successfully scanned both file systems 
independently on the system with everything working (including web-gui) and 
then basically blew away the databases to get a datapoint on how the system 
performed and the time it took if I ran a scan on both file systems 
simultaneously. It appears that it is only impacting the 2.8 file system 
database. I just ran a fresh scan against the 2.5.5 file system without 
problem. I then started a new scan against the 2.8 file system and once 
again it rebooted.

Like the other support ticket above, when I ran the scan only on the 2.8 file 
system in debug mode it also reported messages similar to “2017/07/10 15:44:58 
[15191/6] FS_Scan | openat failed on 

Re: [robinhood-support] Updating access times in Robinhood

2017-07-11 Thread LEIBOVICI Thomas

On 07/11/17 16:35, Hoolihan, Joe (US) wrote:


All,

I am seeing an odd behavior with Robinhood and MySQL when tracking 
Lustre file system access times.


Here is my test: I created a file on my Lustre file system. It 
created a record in the robinhood database, in the ENTRIES table, and 
rbh-report --entry-info filename returned the info.


I then added a text line to the file (echo "ADDED STRING" >> 
filename). An ls -l of the file showed that it had been modified, but 
rbh-report --entry-info and a direct query of the database showed 
that the last_access and last_mod fields had not been updated. This did 
not change over time.




Hello Hoolihan,

You did not give many details about your robinhood configuration.
* Is a changelog reader properly registered to the filesystem? (cat 
/proc/fs/lustre/mdd/*/changelog_users on the MDS)
* Does the changelog mask include OPEN, CLOSE, MTIME? (cat 
/proc/fs/lustre/mdd/*/changelog_mask on the MDS)

* Do you have a robinhood daemon running with the --readlog option?

Depending on your configuration and the system load, there may be a 
delay between the action on the filesystem and the update of the 
robinhood DB. Posting your robinhood configuration may help troubleshooting.


When the file permissions were changed (chmod 640 filename) that 
forced an update to the database and both direct query and rbh-report 
--entry-info were correct.   Each time value was correct (last_access 
time was different from last_mod, reflecting the correct values).


It sounds like your changelog mask is incomplete, as robinhood seems to 
react to SETATTR but not to OPEN/MTIME/CLOSE.
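
One way to test that diagnosis is to compare the mask reported by the MDS 
with the events named in this thread. The sketch below is illustrative 
only: it assumes the changelog_mask file lists event names separated by 
whitespace, and the helper name missing_changelog_events and the sample 
mask string are hypothetical. Check the exact event names against the 
output of your own Lustre version.

```python
# Events asked about in this thread; adjust the tuple to your own needs.
REQUIRED_EVENTS = ("OPEN", "CLOSE", "MTIME")


def missing_changelog_events(mask_text, required=REQUIRED_EVENTS):
    """Return the required events that do not appear in mask_text.

    Assumption: mask_text is the content of a changelog_mask file and
    lists event names separated by whitespace.
    """
    present = {name.upper() for name in mask_text.split()}
    return [event for event in required if event not in present]


if __name__ == "__main__":
    # Made-up sample mask, not copied from a real system.
    sample = "MARK CREAT MKDIR UNLNK SATTR CLOSE"
    print(missing_changelog_events(sample))  # prints ['OPEN', 'MTIME']
```

An empty list means every event of interest is present in the mask; 
anything else names the events that would have to be added on the MDS.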


So, my question is: Is there something I did wrong that would cause 
the database not to get updated when a Lustre file changed (some 
config value or other setting), or is this a possible bug (or maybe a 
feature)?


Also note that access time (POSIX atime) is different from last 
modification time (mtime) and change time (ctime).
By default, the changelog mask includes events that modify mtime and 
ctime, but not atime.
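
The difference between the three timestamps can be observed on any POSIX 
file system, independently of Lustre and robinhood. The sketch below is 
a minimal illustration that repeats the two operations from the test 
above (appending a line, then chmod 640) on a scratch file and reports 
which timestamps changed. The helper names are hypothetical, and it says 
nothing about which changelog records Lustre emits for these operations.

```python
import os
import tempfile
import time


def append_line(path):
    """Equivalent of: echo "ADDED STRING" >> filename"""
    with open(path, "a") as handle:
        handle.write("ADDED STRING\n")


def change_mode(path):
    """Equivalent of: chmod 640 filename"""
    os.chmod(path, 0o640)


def timestamp_changes(action):
    """Run action(path) on a scratch file and report which of
    atime, mtime and ctime differ afterwards."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        # Push atime and mtime far into the past so any update is visible.
        os.utime(path, (1_000_000, 1_000_000))
        # Short pause so a ctime update is not hidden by coarse
        # timestamp granularity (may be too short on some file systems).
        time.sleep(0.05)
        before = os.stat(path)
        action(path)
        after = os.stat(path)
        return {
            "atime": after.st_atime != before.st_atime,
            "mtime": after.st_mtime != before.st_mtime,
            "ctime": after.st_ctime != before.st_ctime,
        }
    finally:
        os.remove(path)


if __name__ == "__main__":
    print("append:", timestamp_changes(append_line))
    print("chmod: ", timestamp_changes(change_mode))
```

On a typical Linux file system the append updates mtime and ctime, while 
chmod updates only ctime. This matches the point above that a 
permission change and a content change are different events.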


HTH
Thomas.



Joseph P. Hoolihan
BAE Systems, Inc.
443-654-9701 onsite number
301-974-9033 cell


--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot


___
robinhood-devel mailing list
robinhood-de...@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-devel



___
robinhood-support mailing list
robinhood-support@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-support


[robinhood-support] Robinhood User Group 2017

2017-07-11 Thread LEIBOVICI Thomas

Robinhood User Group 2017
=========================
This year's Robinhood User Group will take place the afternoon before 
LAD[1], on Tuesday, October 3rd, 2017, in Paris.

This meeting is the occasion for users, developers and vendors to share 
experiences and future plans about RobinHood Policy Engine.

For more details about the event, see: 
https://github.com/cea-hpc/robinhood/wiki/RUG2017


Registration
============
Although it is co-located with LAD, the RUG is organized independently,
so you need to register separately for the two meetings.
Registration lets us plan enough room and food for participants.

Registration form: 
https://rug2017.wufoo.com/forms/robinhood-user-group-registration-form/

Registration is free.

Call for presentations
======================
We invite attendees to submit a presentation when they register for the 
workshop (there is a dedicated field in the submission form).
Typical presentation length is 20 to 30 min.

Topics of interest include (but are not limited to):
- Specific usages and implementations, site reports
- Contributions, and annex tools
- Performance and tunings (robinhood, database, kernel, filesystem...)
- Dashboards, integration to site monitoring and accounting systems
- Vendor integration and support

Last year's presentations are available here: 
https://github.com/cea-hpc/robinhood/wiki/RUG2016#agenda--slide-decks



See you soon in Paris!

[1] https://www.eofs.eu/events/lad17

