[ 
https://issues.apache.org/jira/browse/HADOOP-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12689759#action_12689759
 ] 

Hairong Kuang edited comment on HADOOP-5588 at 3/27/09 10:02 AM:
-----------------------------------------------------------------

A suspect is HADOOP-3497, which introduced a listing call on the parent 
directory in globStatus regardless of whether the path contains globs. One of 
our users calls "dfs -get" on many small files under one large directory. This 
has the same effect as calling dfs -ls many times on the large directory, which 
causes the NN to do a lot of GC and makes it less responsive.
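
To illustrate the suspected cost, here is a minimal sketch that compares 
listing the parent directory for every path against listing it only when the 
path contains a glob. It is not Hadoop's globStatus code or the actual fix: the 
class GlobListingSketch, its in-memory "dirs" map, and all method names are 
hypothetical, and only '*' and '?' are treated as glob characters.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

public class GlobListingSketch {
    // Toy stand-in for the namenode's namespace: directory -> file names.
    static final Map<String, List<String>> dirs = new HashMap<>();
    static int listCalls = 0;   // full directory listings (costly for large dirs)
    static int statCalls = 0;   // single-file lookups

    // Assumption: only '*' and '?' count as glob characters in this sketch.
    static boolean hasGlob(String name) {
        return name.indexOf('*') >= 0 || name.indexOf('?') >= 0;
    }

    static List<String> list(String dir) {
        listCalls++;
        return dirs.getOrDefault(dir, Collections.emptyList());
    }

    static boolean exists(String dir, String name) {
        statCalls++;
        return dirs.getOrDefault(dir, Collections.emptyList()).contains(name);
    }

    // Translate the simplified glob into a regular expression.
    static Pattern toRegex(String glob) {
        StringBuilder sb = new StringBuilder();
        for (char c : glob.toCharArray()) {
            if (c == '*') sb.append(".*");
            else if (c == '?') sb.append('.');
            else sb.append(Pattern.quote(String.valueOf(c)));
        }
        return Pattern.compile(sb.toString());
    }

    // Suspected behaviour: list the parent even when the name has no glob.
    static List<String> resolveAlwaysList(String dir, String name) {
        Pattern p = toRegex(name);
        List<String> out = new ArrayList<>();
        for (String f : list(dir)) {
            if (p.matcher(f).matches()) out.add(f);
        }
        return out;
    }

    // Alternative: a plain name needs only a single-file lookup.
    static List<String> resolveGlobAware(String dir, String name) {
        if (!hasGlob(name)) {
            return exists(dir, name)
                ? Collections.singletonList(name)
                : Collections.emptyList();
        }
        return resolveAlwaysList(dir, name);
    }

    public static void main(String[] args) {
        List<String> files = new ArrayList<>();
        for (int i = 0; i < 1000; i++) files.add("file" + i);
        dirs.put("mydir", files);

        for (int i = 0; i < 3; i++) resolveAlwaysList("mydir", "file" + i);
        System.out.println("always-list: listings=" + listCalls + " stats=" + statCalls);

        listCalls = 0;
        statCalls = 0;
        for (int i = 0; i < 3; i++) resolveGlobAware("mydir", "file" + i);
        System.out.println("glob-aware:  listings=" + listCalls + " stats=" + statCalls);
    }
}
```

In the always-list variant, each fetched file triggers one full listing of the 
directory, so the work grows with both the number of files requested and the 
size of the directory. The glob-aware variant falls back to a listing only 
when the name actually contains a glob.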

      was (Author: hairong):
    A suspect is HADOOP-3497, which introduced a listing call on the parent 
directory in globStatus regardless of whether the path contains globs. One of 
our users calls "dfs -get" on many small files under one dir. This has the same 
effect as calling dfs -ls many times on a large directory, which causes the NN 
to do a lot of GC and makes it less responsive.
  
> hadoop commands seem extremely slow in 0.20 branch
> --------------------------------------------------
>
>                 Key: HADOOP-5588
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5588
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs, fs
>    Affects Versions: 0.20.0
>         Environment: 0.20-branch and trunk
>            Reporter: Koji Noguchi
>            Assignee: Hairong Kuang
>            Priority: Blocker
>             Fix For: 0.20.0
>
>
> hadoop dfs get, rm, -mkdir-, cp, mv, ls, etc. mydir/fileA mydir/fileB 
> mydir/fileC ...
> seem to be very slow in the 0.20 branch.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
