[ 
https://issues.apache.org/jira/browse/HADOOP-3402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12607668#action_12607668
 ] 

Enis Soztutar commented on HADOOP-3402:
---------------------------------------

>From the patch : 
{quote}
In May 2008, Owen O'Malley ran this code on a 910 node cluster and sorted the 
10 billion records (1 TB) in 209 seconds (3.48 minutes). 
{quote}

Wow ! 
Congratulations to the community. I believe it is very close to the world 
record : http://www.hpl.hp.com/hosted/sortbenchmark/ (197 seconds)
Owen, we should list this in forrest documentation. 

> Add example code to support run terasort on hadoop
> --------------------------------------------------
>
>                 Key: HADOOP-3402
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3402
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: examples
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: terasort-2.patch, terasort.patch, terasort.patch
>
>
> It would be good to be able to run the terasort 
> (http://www.hpl.hp.com/hosted/sortbenchmark/) on Hadoop. I need to write:
>  * distributed data generator
>  * sorter
>  * result checker

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to