[jira] [Work logged] (HDFS-16485) [SPS]: allow re-satisfy path after restarting sps process

ASF GitHub Bot (Jira) Fri, 25 Feb 2022 01:19:03 -0800


     [ 
https://issues.apache.org/jira/browse/HDFS-16485?focusedWorklogId=732959&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-732959
 ]


ASF GitHub Bot logged work on HDFS-16485:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 25/Feb/22 09:18
            Start Date: 25/Feb/22 09:18
    Worklog Time Spent: 10m 
      Work Description: liubingxing opened a new pull request #4033:
URL: https://github.com/apache/hadoop/pull/4033


   When the SPSPathIdProcessor thread calls getNextSPSPath(), it gets the pathId 
from the namenode, and the namenode also removes this pathId from the 
pathsToBeTraveresed queue.
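   ```java
   public Long getNextPathId() {
     synchronized (pathsToBeTraveresed) {
       // poll() removes the head of the queue, so the namenode no longer
       // tracks this pathId once it has been handed to the SPS.
       return pathsToBeTraveresed.poll();
     }
   }
   ```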
   If the SPS process restarts, the move operation for this path will not 
resume until the namenode restarts.
   
   So we want to provide a way for the SPS to continue performing the move 
operation after an SPS restart.
   
   First solution: 
   
   1) When the SPSPathIdProcessor thread calls getNextSPSPath(), the namenode 
returns the pathId and then moves it to a pathsBeingTraveresed queue;
   
   2) After the SPS finishes moving a path, it makes an RPC call to the 
namenode to remove this pathId from the pathsBeingTraveresed queue;
   
   3) If the SPS restarts, the SPSPathIdProcessor thread makes an RPC call to 
the namenode to get all pathIds from the pathsBeingTraveresed queue.
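
The three steps above could be sketched roughly as follows. This is only an 
illustration of the idea and not the namenode implementation: the class name 
SpsPathQueues and the methods addPathId(), markPathDone() and 
getInProgressPathIds() are hypothetical, and the RPC layer is left out.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.LinkedList;
import java.util.List;
import java.util.Queue;
import java.util.Set;

// Hypothetical sketch of the first solution; not actual Hadoop code.
class SpsPathQueues {
  // Paths waiting to be handed to the SPS.
  private final Queue<Long> pathsToBeTraveresed = new LinkedList<>();
  // Paths handed to the SPS whose movement has not been confirmed yet.
  private final Set<Long> pathsBeingTraveresed = new LinkedHashSet<>();

  synchronized void addPathId(long pathId) {
    pathsToBeTraveresed.add(pathId);
  }

  // Step 1: return the next pathId and remember it as in progress.
  synchronized Long getNextPathId() {
    Long pathId = pathsToBeTraveresed.poll();
    if (pathId != null) {
      pathsBeingTraveresed.add(pathId);
    }
    return pathId;
  }

  // Step 2: the SPS reports that the movement for this path has finished.
  synchronized boolean markPathDone(long pathId) {
    return pathsBeingTraveresed.remove(pathId);
  }

  // Step 3: a restarted SPS asks for every path that was still in progress.
  synchronized List<Long> getInProgressPathIds() {
    return new ArrayList<>(pathsBeingTraveresed);
  }
}
```

In this sketch, steps 2 and 3 each correspond to an additional call from the 
SPS to the namenode.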
   
   Second solution:
   
   We added timeout detection in the application layer: if a path does not 
complete its movement within the specified time, we can re-satisfy this path 
even though it already has the "hdfs.sps" xattr.
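
A minimal sketch of what such application-layer timeout detection might look 
like is shown below. It is an assumption-laden illustration, not the code in 
this pull request: the class SatisfyTimeoutTracker, its methods and the 
explicit timestamp parameters are all hypothetical, and how an overdue path is 
actually re-submitted is left to the caller.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical application-side helper; not part of Hadoop.
class SatisfyTimeoutTracker {
  private final long timeoutMillis;
  // Path -> time (in milliseconds) at which the satisfy request was submitted.
  private final Map<String, Long> submittedAt = new LinkedHashMap<>();

  SatisfyTimeoutTracker(long timeoutMillis) {
    this.timeoutMillis = timeoutMillis;
  }

  // Record that a satisfy request for this path was (re-)submitted.
  synchronized void recordSubmitted(String path, long nowMillis) {
    submittedAt.put(path, nowMillis);
  }

  // Record that the movement for this path has completed.
  synchronized void recordCompleted(String path) {
    submittedAt.remove(path);
  }

  // Paths whose movement has not completed within the timeout and
  // which the application may therefore choose to re-satisfy.
  synchronized List<String> pathsToResatisfy(long nowMillis) {
    List<String> overdue = new ArrayList<>();
    for (Map.Entry<String, Long> e : submittedAt.entrySet()) {
      if (nowMillis - e.getValue() >= timeoutMillis) {
        overdue.add(e.getKey());
      }
    }
    return overdue;
  }
}
```

Passing the current time in explicitly keeps the sketch easy to test; a real 
application would more likely read a monotonic clock itself.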
   
   We chose the second solution because the first solution adds more RPC 
operations and may affect namenode performance.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.



Issue Time Tracking
-------------------

            Worklog Id:     (was: 732959)
    Remaining Estimate: 0h
            Time Spent: 10m

> [SPS]: allow re-satisfy path after restarting sps process
> ---------------------------------------------------------
>
>                 Key: HDFS-16485
>                 URL: https://issues.apache.org/jira/browse/HDFS-16485
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: qinyuren
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When the SPSPathIdProcessor thread calls getNextSPSPath(), it gets the pathId 
> from the namenode, and the namenode also removes this pathId from the 
> pathsToBeTraveresed queue.
> {code:java}
> public Long getNextPathId() {
>   synchronized (pathsToBeTraveresed) {
>     return pathsToBeTraveresed.poll();
>   }
> } {code}
> If the SPS process restarts, the move operation for this path will not 
> resume until the namenode restarts.
> So we want to provide a way for the SPS to continue performing the move 
> operation after an SPS restart.
> First solution: 
> 1) When the SPSPathIdProcessor thread calls getNextSPSPath(), the namenode 
> returns the pathId and then moves it to a pathsBeingTraveresed queue;
> 2) After the SPS finishes moving a path, it makes an RPC call to the 
> namenode to remove this pathId from the pathsBeingTraveresed queue;
> 3) If the SPS restarts, the SPSPathIdProcessor thread makes an RPC call to 
> the namenode to get all pathIds from the pathsBeingTraveresed queue.
> Second solution:
> We added timeout detection in the application layer: if a path does not 
> complete its movement within the specified time, we can re-satisfy this path 
> even though it already has the "hdfs.sps" xattr.
> We chose the second solution because the first solution adds more RPC 
> operations and may affect namenode performance.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

