[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13809537#comment-13809537
 ] 

Jason Lowe commented on MAPREDUCE-5603:
---------------------------------------

Recently we ran across a jobclient that failed with an OOM error once we 
updated the cluster to 0.23.10.  The OOM was triggered by the FileInputFormat 
listLocatedStatus optimization from MAPREDUCE-1981, as the client now caches 
the BlockLocations of all files along with the FileStatus objects it was 
caching before.  Normally the user can bump the heap size of the client to work 
around this issue.  However if a job has an input with a particularly large 
number of BlockLocations, as this job did, it would be nice if the user had the 
option to disable the optimization to reduce the required memory necessary for 
input split calculations.

> Ability to disable FileInputFormat listLocatedStatus optimization to save 
> client memory
> ---------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5603
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5603
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: client, mrv2
>    Affects Versions: 0.23.10, 2.2.0
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>            Priority: Minor
>
> It would be nice if users had the option to disable the listLocatedStatus 
> optimization in FileInputFormat to save client memory.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to