Howdy, I just ran a disk usage report for our Lustre (v1.8.1.1) scratch space (6TB used of 40TB) that ended up taking 25 hours to complete. Yikes.
I'm wondering if anyone can suggest a more efficient way to get disk usage data?

The disk usage script in question works as follows:

1. Builds an array of directories to check (/scratch/user1, /scratch/user2, /scratch/user3, and so on).
2. Loops through the array, running the Linux 'du' command on each user directory and storing the results in a new array.
3. Once the loop completes, generates a report from the results array and emails it.

Steps 1 and 3 run very quickly, so step 2 is the long straw. Traditionally, this script has been used on our local home directories, where it completes in approximately 15 minutes (for ~1TB of usage).

I should mention that my 25 hour test was run from a Lustre client. We have one MDS and two OSSes (each serving 3 OSTs). Also, we do not have quotas enabled.

Thanks for any suggestions,

Mike

=================================
Mike Hanby
[email protected]
Information Systems Specialist II
IT HPCS / Research Computing

_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss
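For reference, the three steps described above can be sketched roughly as follows. This is only an illustrative Python outline, not the actual script (which is not shown here): the function names are made up for the example, `du -sk` is an assumed choice of flags, and the emailing part of step 3 is left out.

```python
import os
import subprocess


def list_user_dirs(root):
    """Step 1: build a sorted list of the directories directly under root."""
    return sorted(
        entry.path
        for entry in os.scandir(root)
        if entry.is_dir(follow_symlinks=False)
    )


def du_kb(path):
    """Step 2, per directory: run `du -sk` and return the total in KB."""
    result = subprocess.run(
        ["du", "-sk", path], capture_output=True, text=True, check=False
    )
    # du prints "<size>\t<path>". It can exit non-zero on unreadable
    # subdirectories while still printing a total, so parse stdout first.
    fields = result.stdout.split()
    if not fields:
        raise RuntimeError(
            f"du produced no output for {path}: {result.stderr.strip()}"
        )
    return int(fields[0])


def collect_usage(dirs):
    """Step 2: loop over the directories, storing each result."""
    return {d: du_kb(d) for d in dirs}


def build_report(usage):
    """Step 3: format the results, largest directory first."""
    ordered = sorted(usage.items(), key=lambda kv: kv[1], reverse=True)
    return "\n".join(f"{kb:>12} KB  {path}" for path, kb in ordered)
```

Calling `build_report(collect_usage(list_user_dirs("/scratch")))` would produce the report text. Each `du` call walks one user tree serially, stat-ing every file it finds, which matches step 2 being where the time goes.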
