Since I'm pretty proud of my awk one-liner, and maybe it's useful for this
kind of charging, here's how to sum up how much data each user has in the
filesystem (without regards to if the data blocks are offline, online,
replicated or compressed):

# cat full-file-list.policy
RULE EXTERNAL LIST 'files' EXEC ''

RULE LIST 'files'
     SHOW( VARCHAR(USER_ID) || ' ' ||
          VARCHAR(GROUP_ID) || ' ' ||
          VARCHAR(FILESET_NAME) || ' ' ||
           VARCHAR(FILE_SIZE) || ' ' ||
           VARCHAR(KB_ALLOCATED) )

# mmapplypolicy gpfs0 -P /gpfs/gpfsmgt/etc/full-file-list.policy -I defer
-f /tmp/full-file-list
# awk '{a[$4] += $7} END{ print "# UID\t Bytes" ; for (i in a) print i,
"\t", a[i]}' /tmp/full-file-list.list.files

Takes ~15 minutes to run on a 60 million file filesystem.


  -jf


On Thu, May 3, 2018 at 11:43 AM, Sobey, Richard A <r.so...@imperial.ac.uk>
wrote:

> Hi all,
>
>
>
> I’d be interested to talk to anyone that is using HSM to move data to
> tape, (and stubbing the file(s)) specifically any strategies you’ve
> employed to figure out how to charge your customers (where you do charge
> anyway) based on usage.
>
>
>
> On-list or off is fine with me.
>
>
>
> Thanks
>
> Richard
>
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss
>
>
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

Reply via email to