Since I'm pretty proud of my awk one-liner, and maybe it's useful for this kind of charging, here's how to sum up how much data each user has in the filesystem (without regards to if the data blocks are offline, online, replicated or compressed):
# cat full-file-list.policy RULE EXTERNAL LIST 'files' EXEC '' RULE LIST 'files' SHOW( VARCHAR(USER_ID) || ' ' || VARCHAR(GROUP_ID) || ' ' || VARCHAR(FILESET_NAME) || ' ' || VARCHAR(FILE_SIZE) || ' ' || VARCHAR(KB_ALLOCATED) ) # mmapplypolicy gpfs0 -P /gpfs/gpfsmgt/etc/full-file-list.policy -I defer -f /tmp/full-file-list # awk '{a[$4] += $7} END{ print "# UID\t Bytes" ; for (i in a) print i, "\t", a[i]}' /tmp/full-file-list.list.files Takes ~15 minutes to run on a 60 million file filesystem. -jf On Thu, May 3, 2018 at 11:43 AM, Sobey, Richard A <r.so...@imperial.ac.uk> wrote: > Hi all, > > > > I’d be interested to talk to anyone that is using HSM to move data to > tape, (and stubbing the file(s)) specifically any strategies you’ve > employed to figure out how to charge your customers (where you do charge > anyway) based on usage. > > > > On-list or off is fine with me. > > > > Thanks > > Richard > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss