[ https://issues.apache.org/jira/browse/HADOOP-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12571638#action_12571638 ]
Tsz Wo (Nicholas), SZE commented on HADOOP-2219:
------------------------------------------------
I will add an option to du.
For the implementation, I am thinking about adding a method
getContentSummary(Path) to FileSystem. It returns a ContentSummary (a new
class) object that contains the length, the number of files and the number of
directories.
Similar to FileSystem.getContentLength(Path), a default implementation of
getContentSummary(Path) that uses only the FileSystem API will be provided in
FileSystem. DistributedFileSystem will then override getContentSummary(Path)
to provide a NameNode-side implementation.
Since content length can be obtained by getContentSummary(Path), I will
deprecate getContentLength(Path).
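
The generic, client-side variant described above can be sketched roughly as
follows. This is only an illustration under stated assumptions: it walks a
local directory tree with java.nio.file instead of the Hadoop FileSystem API,
the field names of ContentSummary are guesses, and counting the starting
directory itself in the directory total is an assumption, not something the
comment specifies. The class name ContentSummarySketch is hypothetical.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.LinkOption;
import java.nio.file.Path;
import java.nio.file.attribute.BasicFileAttributes;

/** Illustrative stand-in for the generic implementation; not Hadoop code. */
final class ContentSummarySketch {

    /** Hypothetical shape of the proposed ContentSummary class. */
    static final class ContentSummary {
        final long length;          // total bytes of all files under the path
        final long fileCount;       // number of non-directory entries
        final long directoryCount;  // number of directories, root included (assumption)

        ContentSummary(long length, long fileCount, long directoryCount) {
            this.length = length;
            this.fileCount = fileCount;
            this.directoryCount = directoryCount;
        }

        @Override
        public String toString() {
            return "length=" + length + ", files=" + fileCount
                    + ", directories=" + directoryCount;
        }
    }

    /** Recursively lists the tree and accumulates the three totals. */
    static ContentSummary getContentSummary(Path path) throws IOException {
        // Do not follow symbolic links, so a link cycle cannot cause endless recursion.
        BasicFileAttributes attrs = Files.readAttributes(
                path, BasicFileAttributes.class, LinkOption.NOFOLLOW_LINKS);
        if (!attrs.isDirectory()) {
            return new ContentSummary(attrs.size(), 1, 0);
        }
        long length = 0;
        long files = 0;
        long dirs = 1; // count this directory itself
        try (DirectoryStream<Path> children = Files.newDirectoryStream(path)) {
            for (Path child : children) {
                ContentSummary c = getContentSummary(child);
                length += c.length;
                files += c.fileCount;
                dirs += c.directoryCount;
            }
        }
        return new ContentSummary(length, files, dirs);
    }

    /** The older call can simply delegate, which is why it can be deprecated. */
    @Deprecated
    static long getContentLength(Path path) throws IOException {
        return getContentSummary(path).length;
    }

    public static void main(String[] args) throws IOException {
        // Build a small temporary tree and print its summary.
        Path root = Files.createTempDirectory("summary-demo");
        Files.createDirectory(root.resolve("sub"));
        Files.write(root.resolve("a.txt"), new byte[3]);
        Files.write(root.resolve("sub").resolve("b.txt"), new byte[2]);
        System.out.println(getContentSummary(root));
    }
}
```

A generic version like this costs one listing call per directory, which is
the reason a NameNode-side override is attractive for DistributedFileSystem:
the totals can be computed where the namespace already lives and returned in
a single call.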
> du like command to count number of files under a given directory
> ----------------------------------------------------------------
>
> Key: HADOOP-2219
> URL: https://issues.apache.org/jira/browse/HADOOP-2219
> Project: Hadoop Core
> Issue Type: New Feature
> Components: dfs
> Reporter: Koji Noguchi
> Assignee: Tsz Wo (Nicholas), SZE
>
> To keep the total number of files on dfs low, we would like users to be able to
> easily find out how many files each of their directories contains.
> Currently, we only have fsck or dfs -lsr, which take time.
> Can I ask for an option for du to show the total number of files (as well as
> the total size) of a given directory?
--
This message is automatically generated by JIRA.