[ https://issues.apache.org/jira/browse/YARN-202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinod Kumar Vavilapalli updated YARN-202:
-----------------------------------------
Issue Type: Sub-task (was: Bug)
Parent: YARN-431
> Log Aggregation generates a storm of fsync() for namenode
> ---------------------------------------------------------
>
> Key: YARN-202
> URL: https://issues.apache.org/jira/browse/YARN-202
> Project: Hadoop YARN
> Issue Type: Sub-task
> Affects Versions: 2.0.2-alpha, 0.23.4
> Reporter: Kihwal Lee
> Assignee: Kihwal Lee
> Priority: Critical
> Fix For: 3.0.0, 2.0.3-alpha, 0.23.5
>
> Attachments: yarn-202.patch
>
>
> When log aggregation is on, each write to an aggregated container log causes
> hflush() to be called. On large clusters, this can create a lot of fsync()
> calls for the namenode.
> We have seen a 6-7x increase in the average number of fsync operations compared
> to 1.0.x on a large busy cluster. Over 99% of fsync ops were for log
> aggregation writing to tmp files.
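
To illustrate the scaling described above, here is a minimal plain-Java sketch
that counts flush calls when every log write is followed by a flush versus a
single flush at the end. It does not use the Hadoop API and is not the attached
patch: FlushStormSketch, CountingStream and writeLogs are hypothetical names,
and flush() merely stands in for hflush().

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;

public class FlushStormSketch {

    /** Hypothetical stand-in for an HDFS output stream; it only counts flush calls. */
    static class CountingStream extends OutputStream {
        final ByteArrayOutputStream sink = new ByteArrayOutputStream();
        int flushCalls = 0;

        @Override
        public void write(int b) {
            sink.write(b);
        }

        @Override
        public void write(byte[] b, int off, int len) {
            sink.write(b, off, len);
        }

        @Override
        public void flush() {
            // Assumption: each flush here models one hflush() on the real stream.
            flushCalls++;
        }
    }

    /** Writes one log record per container and returns how many flushes were issued. */
    static int writeLogs(int containers, boolean flushEachWrite) throws IOException {
        CountingStream out = new CountingStream();
        for (int i = 0; i < containers; i++) {
            byte[] record = ("container_" + i + " log\n").getBytes(StandardCharsets.UTF_8);
            out.write(record);
            if (flushEachWrite) {
                out.flush(); // the pattern described in the report: a flush per write
            }
        }
        out.flush(); // one final flush in either mode
        return out.flushCalls;
    }

    public static void main(String[] args) throws IOException {
        System.out.println("flush per write: " + writeLogs(1000, true));
        System.out.println("single flush:    " + writeLogs(1000, false));
    }
}
```

With a flush per write, the flush count grows linearly with the number of log
writes, which is the pattern the report points to on large clusters.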
--
This message was sent by Atlassian JIRA
(v6.1#6144)