[
https://issues.apache.org/jira/browse/HADOOP-9209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554170#comment-13554170
]
Kihwal Lee commented on HADOOP-9209:
------------------------------------
Regarding the name of command, it seems we use the same name if there is
something equivalent in shell, otherwise the name is more descriptive. Commands
like sum and md5sum exist, so "checksum" may be okay in that sense. But more
descriptive name will be fine too.
HDFS checksum is a bit different from regular checksums obtained against a file
in conventional file systems. It has been of no concern until now as it's
mostly internal. But if it is exposed to users, we now have to tell users what
it is and what to expect. For example, users must be told that hdfs file
checksum can be different even if the contents of files are identical due to
use of different block sizes and checksum parameters. May be we should mention
it in the help.
> Add shell command to dump file checksums
> ----------------------------------------
>
> Key: HADOOP-9209
> URL: https://issues.apache.org/jira/browse/HADOOP-9209
> Project: Hadoop Common
> Issue Type: New Feature
> Components: fs, tools
> Affects Versions: 3.0.0, 2.0.3-alpha
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Attachments: hadoop-9209.txt, hadoop-9209.txt
>
>
> Occasionally while working with tools like distcp, or debugging certain
> issues, it's useful to be able to quickly see the checksum of a file. We
> currently have the APIs to efficiently calculate a checksum, but we don't
> expose it to users. This JIRA is to add a "fs -checksum" command which dumps
> the checksum information for the specified file(s).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira