[ 
https://issues.apache.org/jira/browse/HDFS-5092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Wang updated HDFS-5092:
------------------------------

    Description: 
The initial {{cacheReport}} patch at HDFS-5051 sends frequent full reports of 
DN cache state. A better scheme would mirror how block reports are currently 
done: send incremental cache reports on every heartbeat (seconds), and full 
reports on a longer time scale (minutes to hours). This should reduce network 
traffic and allow us to send incremental reports even faster.

As per the discussion on HDFS-5051, we should also roll up the following 
review comments:

- Remove the generation stamp and length from {{cacheReport}}; they are 
unnecessary until we do auto-caching of appended data.
- Only jitter full cache reports, similar to how full block reports are 
jittered.
- On DN startup, skip all cache reports until the cache is populated. The NN 
can just assume the DN cache is empty in the meantime.
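
As a rough illustration of the scheme and review comments above, here is a minimal Java sketch of the per-heartbeat decision on the DN side. Every name in it ({{CacheReportScheduler}}, {{Report}}, {{markPopulated}}, {{onHeartbeat}}, the interval parameter) is hypothetical and not taken from the HDFS code base. It also assumes that only newly cached blocks are tracked between reports; uncaching is left out for brevity.

```java
import java.util.Random;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch: none of these names come from the HDFS code base.
final class CacheReportScheduler {
    enum Type { NONE, INCREMENTAL, FULL }

    /** What the DN would send on one heartbeat. */
    static final class Report {
        final Type type;
        final Set<Long> blockIds; // block IDs only: no gen stamp or length

        Report(Type type, Set<Long> blockIds) {
            this.type = type;
            this.blockIds = blockIds;
        }
    }

    private final long fullIntervalMs;
    private final Set<Long> cached = new TreeSet<>();      // current cache state
    private final Set<Long> newlyCached = new TreeSet<>(); // delta since last report
    private boolean populated = false;
    private long nextFullMs;

    CacheReportScheduler(long fullIntervalMs, long nowMs, Random rng) {
        this.fullIntervalMs = fullIntervalMs;
        // Jitter applies only to full reports: random offset in [0, interval).
        this.nextFullMs = nowMs + (long) (rng.nextDouble() * fullIntervalMs);
    }

    /** Called once the cache has been populated after DN startup. */
    void markPopulated() {
        populated = true;
    }

    void cacheBlock(long blockId) {
        if (cached.add(blockId)) {
            newlyCached.add(blockId);
        }
    }

    /** Decide what to send on the heartbeat at time nowMs. */
    Report onHeartbeat(long nowMs) {
        if (!populated) {
            // Skip all reports until populated; the NN assumes an empty cache.
            return new Report(Type.NONE, new TreeSet<>());
        }
        if (nowMs >= nextFullMs) {
            // A full report supersedes any pending delta.
            nextFullMs = nowMs + fullIntervalMs;
            newlyCached.clear();
            return new Report(Type.FULL, new TreeSet<>(cached));
        }
        // Otherwise send the (possibly empty) delta, unjittered.
        Set<Long> delta = new TreeSet<>(newlyCached);
        newlyCached.clear();
        return new Report(Type.INCREMENTAL, delta);
    }
}
```

In this sketch a heartbeat loop would call {{onHeartbeat}} with the current time every few seconds. A full report resets the pending delta, so the NN never receives a block in both a full report and the incremental report that follows it.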

  was:We should send incremental cache reports as part of DN heartbeats, 
similar to how we do incremental block reports.  Then we would only need to 
send full cache reports rarely (again similar to full block reports).

       Assignee: Andrew Wang
        Summary: Add support for incremental cache reports  (was: piggyback 
incremental cache reports on DN heartbeats)
    
> Add support for incremental cache reports
> -----------------------------------------
>
>                 Key: HDFS-5092
>                 URL: https://issues.apache.org/jira/browse/HDFS-5092
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: datanode, namenode
>            Reporter: Colin Patrick McCabe
>            Assignee: Andrew Wang
>            Priority: Minor

