[ 
https://issues.apache.org/jira/browse/HADOOP-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12571638#action_12571638
 ] 

Tsz Wo (Nicholas), SZE commented on HADOOP-2219:
------------------------------------------------

I will add a option to du.

For the implementation, I am thinking about adding a method 
getContentSummary(Path) in the FileSystem.  It returns a ContentSummary (a new 
class) object which contains length, number of files and number of directories.

Similar to FileSystem.getContentLength(Path), an implementation of 
getContentSummary(Path), which uses FileSystem API , will be provided in 
FileSystem.  Then, DistributedFileSystem will override getContentSummary(Path) 
to provide a NameNode side implementation.

Since content length can be obtained by getContentSummary(Path), I will 
deprecate getContentLength(Path).

> du like command to count number of files under a given directory
> ----------------------------------------------------------------
>
>                 Key: HADOOP-2219
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2219
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: dfs
>            Reporter: Koji Noguchi
>            Assignee: Tsz Wo (Nicholas), SZE
>
> To keep the total number of files on dfs low, we like the users to be able to 
> easily find out how many files each of their directory contain.   
> Currently, we only have fsck or dfs -lsr which takes time.
> Can I ask for an option for du to show the total number of files (as well as 
> the total size) of a given directory?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to