[jira] [Commented] (HDDS-1163) Basic framework for Ozone Data Scrubber

Arpit Agarwal (JIRA) Mon, 04 Mar 2019 07:36:10 -0800


    [ 
https://issues.apache.org/jira/browse/HDDS-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16783462#comment-16783462
 ]


Arpit Agarwal commented on HDDS-1163:
-------------------------------------

bq. The main point from me is that the rate for full-check and fast-check 
should be different. We can make fask-check for open state in a more higher 
frequency. Currently, we put all check in same loop, sometimes the fast-check 
will be blocked by a long time full-check. So I suggest we can have two 
executors: One for fast-check and the other one for full-check. By default, we 
can make these two executor both use 1 thread number size but different 
execution frequency. How do you think of this?

This is a good idea [~linyiqun]. The design doc discusses this (see the table 
1) where it mentions separate frequency for fast/slow checks. We should 
certainly revisit this in a later sub-task. For now we are adding the basic 
framework so we can leave things as they are or just remove the fast check for 
now. Either sounds fine to me.

> Basic framework for Ozone Data Scrubber
> ---------------------------------------
>
>                 Key: HDDS-1163
>                 URL: https://issues.apache.org/jira/browse/HDDS-1163
>             Project: Hadoop Distributed Data Store
>          Issue Type: Sub-task
>          Components: Ozone Datanode
>            Reporter: Supratim Deka
>            Assignee: Supratim Deka
>            Priority: Major
>         Attachments: HDDS-1163.000.patch
>
>
> Included in the scope:
> 1. Background scanner thread to iterate over container set and dispatch check 
> tasks for individual containers
> 2. Fixed rate scheduling - dispatch tasks at a pre-determined rate (for 
> example 1 container/s)
> 3. Check disk layout of Container - basic check for integrity of the 
> directory hierarchy inside the container, include chunk directory and 
> metadata directories
> 4. Check container file - basic sanity checks for the container metafile
> 5. Check Block Database - iterate over entries in the container block 
> database and check for the existence and accessibility of the chunks for each 
> block.
> Not in scope (will be done as separate subtasks):
> 1. Dynamic scheduling/pacing of background scan based on system load and 
> available resources.
> 2. Detection and handling of orphan chunks
> 3. Checksum verification for Chunks
> 4. Corruption handling - reporting (to SCM) and subsequent handling of any 
> corruption detected by the scanner. The current subtask will simply log any 
> corruption which is detected.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HDDS-1163) Basic framework for Ozone Data Scrubber

Reply via email to