[ https://issues.apache.org/jira/browse/HDFS-5092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Wang updated HDFS-5092:
------------------------------
Labels: caching (was: )
> Add support for incremental cache reports
> -----------------------------------------
>
> Key: HDFS-5092
> URL: https://issues.apache.org/jira/browse/HDFS-5092
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: datanode, namenode
> Reporter: Colin Patrick McCabe
> Assignee: Andrew Wang
> Priority: Minor
> Labels: caching
>
> The initial {{cacheReport}} patch at HDFS-5051 sends frequent full reports of
> DN cache state. A better scheme would mirror how block reports are currently
> done: send incremental cache reports on every heartbeat (seconds), and full
> reports on a longer time scale (minutes to hours). This should reduce network
> traffic and allow us to send incremental reports even more frequently.
> As per discussion on HDFS-5051, we should also roll up the following review
> comments:
> - Remove gen stamp and length from {{cacheReport}}; they are unnecessary
> until we do auto-caching of appended data
> - Only jitter full cache reports, similar to how full block reports are
> jittered
> - On DN startup, skip all cache reports until the cache is populated. The NN
> can just assume the DN cache is empty in the meantime.
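
The scheme described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patch itself: the class CacheReportSketch, its methods, and the Report type are hypothetical names and are not HDFS APIs. It assumes a report carries block IDs only, that jitter is applied just once to the first full report, and that the caller signals when the cache is populated.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical DataNode-side scheduler; names are illustrative, not HDFS APIs. */
final class CacheReportSketch {
    enum Kind { NONE, INCREMENTAL, FULL }

    /** A report carries block IDs only (no generation stamp or length). */
    static final class Report {
        final Kind kind;
        final Set<Long> added;    // for FULL: the complete cached set
        final Set<Long> removed;  // always empty for FULL and NONE

        Report(Kind kind, Set<Long> added, Set<Long> removed) {
            this.kind = kind;
            this.added = added;
            this.removed = removed;
        }
    }

    private final long fullIntervalMs;
    private long nextFullMs;
    private boolean cachePopulated = false;
    private Set<Long> lastReported = new HashSet<>();

    /** jitterMs offsets only the full-report schedule; heartbeats are not jittered. */
    CacheReportSketch(long startMs, long fullIntervalMs, long jitterMs) {
        this.fullIntervalMs = fullIntervalMs;
        this.nextFullMs = startMs + jitterMs;
    }

    /** Called once the cache has been populated after startup. */
    void markCachePopulated() {
        cachePopulated = true;
    }

    /** Decide what to send on this heartbeat, given the currently cached block IDs. */
    Report onHeartbeat(long nowMs, Set<Long> cached) {
        if (!cachePopulated) {
            // Skip all reports; the NN assumes an empty cache in the meantime.
            return new Report(Kind.NONE,
                Collections.<Long>emptySet(), Collections.<Long>emptySet());
        }
        Set<Long> snapshot = new HashSet<>(cached);
        if (nowMs >= nextFullMs) {
            // Full report on the long time scale.
            nextFullMs = nowMs + fullIntervalMs;
            lastReported = snapshot;
            return new Report(Kind.FULL,
                new HashSet<>(snapshot), Collections.<Long>emptySet());
        }
        // Incremental report: only the delta since the last report.
        Set<Long> added = new HashSet<>(snapshot);
        added.removeAll(lastReported);
        Set<Long> removed = new HashSet<>(lastReported);
        removed.removeAll(snapshot);
        lastReported = snapshot;
        return new Report(Kind.INCREMENTAL, added, removed);
    }
}
```

In this sketch the first incremental report after startup lists every cached block as added, which matches the NN assuming an empty cache until then. A real implementation would also need to handle lost reports and NN failover, which are left out here.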