Gautam Gopalakrishnan created HDFS-8986: -------------------------------------------
Summary: Add option to -du to calculate directory space usage excluding snapshots Key: HDFS-8986 URL: https://issues.apache.org/jira/browse/HDFS-8986 Project: Hadoop HDFS Issue Type: Improvement Components: snapshots Reporter: Gautam Gopalakrishnan When running {{hadoop fs -du}} on a snapshotted directory (or one of its children), the report includes space consumed by blocks that are only present in the snapshots. This is confusing for end users. {noformat} $ hadoop fs -du -h -s /tmp/parent /tmp/parent/* 799.7 M 2.3 G /tmp/parent 799.7 M 2.3 G /tmp/parent/sub1 $ hdfs dfs -createSnapshot /tmp/parent snap1 Created snapshot /tmp/parent/.snapshot/snap1 $ hadoop fs -rm -skipTrash /tmp/parent/sub1/* ... $ hadoop fs -du -h -s /tmp/parent /tmp/parent/* 799.7 M 2.3 G /tmp/parent 799.7 M 2.3 G /tmp/parent/sub1 $ hdfs dfs -deleteSnapshot /tmp/parent snap1 $ hadoop fs -du -h -s /tmp/parent /tmp/parent/* 0 0 /tmp/parent 0 0 /tmp/parent/sub1 {noformat} It would be helpful if we had a flag, say -X, to exclude any snapshot related disk usage in the output -- This message was sent by Atlassian JIRA (v6.3.4#6332)