> 23/pci-0000:00:1a.0-usb-0:1.6.1:1.2-event-mouse
> 23/platform-pcspkr-event-spkr
...

Can you find these entries in your filesystem and see what are their type: symlink, char/block device, ...?

As you host crashes, maybe some syscall made by robinhood to the lustre client wouldn't properly handle this kind of entry.

Regards,
Thomas

Langton wrote:
I am trying to install Robin Hood to manage a 4PB lustre filesystem.
The environment is as follows
IEEL Lustre 2.5
Robinhood 2.5.5-2
CentOS release 6.7
kernel - 2.6.32-573.8.1.el6.x86_64
4PB Lustre Filesystem
Robin Hood Host has 2TB RAM and 390GB Disk capacity
FDR Infiniband fabric network
A Failover setup on all lustre servers

After installing robinhood , I have faced a challenge when I kickstart a scan. For some reason the RBH host reboots just a few seconds after issuing the scan command. I have traced the robinhood logs but they give the following:

2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/pci-0000:00:1a.0-usb-0:1.6.1:1.2-event-mouse: Too many levels of symbolic links 2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/platform-pcspkr-event-spkr: Too many levels of symbolic links 2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/pci-0000:00:1a.0-usb-0:1.6.1:1.2-mouse: Too many levels of symbolic links 2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/pci-0000:00:1a.0-usb-0:1.2:1.0-event-mouse: Too many levels of symbolic links 2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/pci-0000:00:1a.0-usb-0:1.6.1:1.1-mouse: Too many levels of symbolic links 2017/06/08 16:19:38 [15616/21] FS_Scan | openat failed on 23/pci-0000:00:1a.0-usb-0:1.5.1:1.1-event: Too many levels of symbolic links

As a test i started the robinhood-lhsm service and it started fine without the initial scan. The command - rbh-lhsm-report --fs-info gives you some info but not much detailed. The command - rbh-lhsm-report -a says file storage has never been checked which means a scan is needed. Currently the filesystem is in production. Can this the main reason why it crashes.
The filesystem is sitting at 2.6PB of used capacity.

Regards

Langton

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
robinhood-support mailing list
robinhood-support@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-support



---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
https://www.avast.com/antivirus


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
robinhood-support mailing list
robinhood-support@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/robinhood-support

Reply via email to