[ 
https://issues.apache.org/jira/browse/HADOOP-6467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsz Wo (Nicholas), SZE updated HADOOP-6467:
-------------------------------------------

     Component/s: fs
    Hadoop Flags: [Reviewed]

+1 Got some better numbers.  The v3 patch is good to go.
{noformat}
-bash-3.1$ date; time $H ${WC_CMD} ${HAR_FULL}/${DIR} ${TT_WC}4
Tue Feb 23 02:17:18 UTC 2010
10/02/23 02:17:32 INFO input.FileInputFormat: Total input paths to process : 100000
10/02/23 02:27:39 INFO mapred.JobClient: Running job: job_201002042035_76681
10/02/23 02:27:40 INFO mapred.JobClient:  map 0% reduce 0%
...
10/02/23 02:32:35 INFO mapred.JobClient:  map 100% reduce 100%
10/02/23 02:32:41 INFO mapred.JobClient: Job complete: job_201002042035_76681
...
10/02/23 02:32:42 INFO mapred.JobClient:     Reduce input records=15660

real    15m23.717s
user    2m22.153s
sys     0m39.487s
{noformat}

> Performance improvement for liststatus on directories in hadoop archives.
> -------------------------------------------------------------------------
>
>                 Key: HADOOP-6467
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6467
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Mahadev konar
>            Assignee: Mahadev konar
>             Fix For: 0.22.0
>
>         Attachments: Archives_performance.docx, Archives_performance.docx, 
> HADOOP-6467-v2.patch, HADOOP-6467-y.0.20-branch-v2.patch, 
> HADOOP-6467-y.0.20-branch-v2.patch, HADOOP-6467-y0.20-branch.patch, 
> HADOOP-6467.patch, HADOOP-6467.patch, HADOOP-6467.patch, HADOOP-6467_v3.patch
>
>
> A liststatus call on a directory in a Hadoop archive leads to (2 * number of
> files in the directory) open calls to the namenode. This is very suboptimal and
> needs to be fixed to make archives performant enough to be used on a daily basis.
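
For a sense of scale, here is a minimal sketch of the arithmetic in the issue description. It only encodes the stated formula (2 * number of files in the directory); the function name and the sample sizes are illustrative assumptions, and the 100000 figure is borrowed from the input path count in the log above without knowing whether those paths all sit in a single directory.

```python
# Rough model of the cost stated in the issue description: a liststatus call
# on an archive directory triggers about 2 * (number of files) open calls to
# the namenode. The function name and sample sizes are hypothetical.

def open_calls_for_liststatus(num_files: int) -> int:
    """Return the open-call count given by the description's formula."""
    if num_files < 0:
        raise ValueError("num_files must be non-negative")
    return 2 * num_files


if __name__ == "__main__":
    # Print the estimated open calls for a few directory sizes.
    for n in (10, 1_000, 100_000):
        print(f"{n:>7} files -> {open_calls_for_liststatus(n):>7} open calls")
```

The count grows linearly with directory size, which is why large archive directories are the case the issue targets.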

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
