[ 
https://issues.apache.org/jira/browse/HDFS-14591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16869380#comment-16869380
 ] 

Jinglun commented on HDFS-14591:
--------------------------------

Hi [~jojochuang] and [~ayushtkn], thanks for the references. Both are very 
good work and really inspiring. Actually, my external service is very similar 
to the SPS. The external service automatically downloads the HDFS image file 
and analyzes it to find cold files, then sets xattrs on those INodes. Another 
service finds the INodes with the special xattrs and starts movers to move 
them. If I used the SPS, the process would be: find cold files, change their 
storage policies, and let the external SPS handle the files. I will wait for 
the SPS to become stable and then consider switching to it.
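The cold-file detection step described above can be sketched roughly as 
follows. This is only an illustration, not the actual service: the class 
ColdFileScan, the method findColdFiles and the path-to-access-time map are 
hypothetical stand-ins for whatever the image analysis produces, and no Hadoop 
API is used.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper; names and input shape are assumptions for illustration.
final class ColdFileScan {
    /**
     * Returns, in sorted path order, the paths whose last access time is more
     * than maxIdleMillis before nowMillis.
     *
     * accessTimes maps a file path to its last access time in epoch millis,
     * as it might be extracted from a parsed image file.
     */
    static List<String> findColdFiles(Map<String, Long> accessTimes,
                                      long nowMillis, long maxIdleMillis) {
        List<String> cold = new ArrayList<>();
        // TreeMap gives a deterministic (sorted) iteration order.
        for (Map.Entry<String, Long> e : new TreeMap<>(accessTimes).entrySet()) {
            if (nowMillis - e.getValue() > maxIdleMillis) {
                cold.add(e.getKey());
            }
        }
        return cold;
    }
}
```

The returned paths would then be the ones tagged with the special xattr (or, 
with the SPS, the ones whose storage policy is changed).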
SSM is very ambitious: it handles all the replica placement jobs, such as 
balancing, moving and fixing storage policy mismatches. It would simplify my 
maintenance work. I'm looking forward to playing with it.
I'm considering starting a new type of NameNode that only handles replica 
placement. It loads images and edits and receives block reports. It sends no 
RPCs to the active NameNode, so it won't be a burden on my heavily loaded 
NameNode. Its job is to scan all blocks and DataNodes to find mismatched 
blocks and over-used DataNodes, then move replicas to resolve the problems. 
Detecting cold files could also be added to it.
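As a rough sketch of the scan such a NameNode would run, the check for 
mismatched blocks could look like the code below. Everything here is an 
assumption made for illustration: MismatchScan, BlockInfo and findMismatched 
are hypothetical names, the StorageType enum is a local simplification, and 
each block is assumed to expect a single storage type for all of its replicas 
(as with HOT on DISK or COLD on ARCHIVE).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch; not Hadoop's BlockManager or its StorageType class.
final class MismatchScan {
    enum StorageType { DISK, ARCHIVE }

    /** Simplified block record: the type the policy expects and where replicas are. */
    static final class BlockInfo {
        final StorageType expected;
        final List<StorageType> replicas;

        BlockInfo(StorageType expected, List<StorageType> replicas) {
            this.expected = expected;
            this.replicas = replicas;
        }
    }

    /** Returns, in ascending order, the IDs of blocks with a replica on the wrong storage type. */
    static List<Long> findMismatched(Map<Long, BlockInfo> blocks) {
        List<Long> mismatched = new ArrayList<>();
        for (Map.Entry<Long, BlockInfo> e : new TreeMap<>(blocks).entrySet()) {
            BlockInfo b = e.getValue();
            for (StorageType actual : b.replicas) {
                if (actual != b.expected) {
                    mismatched.add(e.getKey());
                    break; // one misplaced replica is enough to flag the block
                }
            }
        }
        return mismatched;
    }
}
```

The flagged block IDs would be the input for the replica moves; finding 
over-used DataNodes would need a separate pass over per-node usage.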
Since there are already issues covering the storage mismatch problem, I'll 
close this issue. Thanks again, Wei-Chiu and Ayush, for your kind comments. :)

> NameNode should move the replicas to the correct storages after the storage 
> policy is changed.
> ----------------------------------------------------------------------------------------------
>
>                 Key: HDFS-14591
>                 URL: https://issues.apache.org/jira/browse/HDFS-14591
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Jinglun
>            Assignee: Jinglun
>            Priority: Major
>
> Our Xiaomi HDFS has a cluster storing both HOT and COLD data. We have a 
> background process that searches all the files to find those that have not 
> been accessed for a period of time. We then set them to COLD and start a 
> mover to move the replicas. After the move, all the replicas are consistent 
> with the storage policy.
> It's a natural idea to let the NameNode handle the move.


