Sean Mackrory created HDFS-10797:
------------------------------------
Summary: Disk usage summary of snapshots causes renamed blocks to
get counted twice
Key: HDFS-10797
URL: https://issues.apache.org/jira/browse/HDFS-10797
Project: Hadoop HDFS
Issue Type: Bug
Reporter: Sean Mackrory
DirectoryWithSnapshotFeature.computeContentSummary4Snapshot calculates how much
disk usage is used by a snapshot by tallying up the files in the snapshot that
have since been deleted (that way it won't overlap with regular files whose
disk usage is computed separately). However that is determined from a diff that
shows moved (to Trash or otherwise) or renamed files as a deletion and a
creation operation that may overlap with the list of blocks. Only the deletion
operation is taken into consideration, and this causes those blocks to get
represented twice in the disk usage tallying.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]