[ 
https://issues.apache.org/jira/browse/HADOOP-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nigel Daley resolved HADOOP-877.
--------------------------------

    Resolution: Duplicate
      Assignee:     (was: Nigel Daley)

Duplicate of HADOOP-333

> we should automate checks of the output of the sort example program
> -------------------------------------------------------------------
>
>                 Key: HADOOP-877
>                 URL: https://issues.apache.org/jira/browse/HADOOP-877
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: test
>    Affects Versions: 0.10.0
>            Reporter: Owen O'Malley
>
> Since we use the sort example program to smoke-test new versions of
> Hadoop, it would be nice to automate some checks of its output. The checks
> that I've considered:
>   1. count the number of records in the input and the output
>   2. compute the md5 of each key/value pair and xor the digests across all of the rows
>   3. use a map/reduce job to merge the input and output directories and make 
> sure that each key/value pair appears in both input and output
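
For illustration only, the three checks listed above can be sketched locally in Python. This is not code from Hadoop or from HADOOP-333. The function names (checksum, multiset_diff) and the representation of records as (key, value) byte pairs are assumptions made for the example, and check 3 is shown as an in-memory comparison standing in for the map/reduce merge job.

```python
import hashlib
from collections import Counter


def checksum(records):
    """Checks 1 and 2: return (record_count, xor_of_md5_digests).

    `records` is an iterable of (key, value) pairs of bytes (an assumed,
    simplified stand-in for the sort example's input or output files).
    """
    count = 0
    acc = 0
    for key, value in records:
        h = hashlib.md5()
        # Length-prefix the key so (b"ab", b"c") and (b"a", b"bc") differ.
        h.update(len(key).to_bytes(4, "big"))
        h.update(key)
        h.update(value)
        # XOR is commutative, so the result does not depend on record order.
        acc ^= int.from_bytes(h.digest(), "big")
        count += 1
    return count, acc


def multiset_diff(input_records, output_records):
    """Check 3, in memory: records whose occurrence counts differ.

    Returns (missing_from_output, unexpected_in_output) as Counters.
    """
    before = Counter(input_records)
    after = Counter(output_records)
    return before - after, after - before


if __name__ == "__main__":
    unsorted_input = [(b"b", b"2"), (b"a", b"1"), (b"c", b"3")]
    sorted_output = sorted(unsorted_input)
    print(checksum(unsorted_input) == checksum(sorted_output))
    print(multiset_diff(unsorted_input, sorted_output))
```

Because xor does not depend on order, a correct sort leaves the result of check 2 unchanged. One limitation of xor is that a record occurring an even number of times cancels itself out, so checks 1 and 2 together can still miss some substitutions; check 3 compares the records directly and does not have that blind spot.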

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
