Re: [PR] HDDS-14456. [Website v2] [Docs] [Core Concepts] Datanodes [ozone-site]

via GitHub Thu, 22 Jan 2026 17:54:17 -0800


jojochuang commented on code in PR #272:
URL: https://github.com/apache/ozone-site/pull/272#discussion_r2719287303



##########
docs/03-core-concepts/01-architecture/05-datanodes.md:
##########
@@ -0,0 +1,97 @@
+---
+sidebar_label: Datanodes
+---
+
+# Datanodes
+
+Datanodes are the worker bees of Ozone. All data is stored on Datanodes. Clients write data in terms of blocks.
+A Datanode aggregates these blocks into storage containers. A storage container consists of the data streams and the metadata about
+the blocks written by the clients.
+
+## Storage Containers
+
+![Container Metadata](ContainerMetadata.png)
+
+A storage container is a self-contained super block. It has a list of the Ozone blocks that reside inside it, as well as the on-disk
+files that contain the actual data streams. This is the default storage container format. From Ozone's perspective, a container
+is a protocol spec; the actual storage layout does not matter. In other words, it is trivial to extend the format or add new container layouts.
+Hence this format should be treated as a reference implementation of containers under Ozone.
+
+## Understanding Ozone Blocks and Containers
+
+When a client wants to read a key from Ozone, the client sends the name of the 
key to the Ozone Manager. Ozone Manager returns
+the list of Ozone blocks that make up that key.
+
+An Ozone block contains the container ID and a local ID. The figure below 
shows the logical layout of the Ozone block.
+
+![Ozone Block](OzoneBlock.png)
+
+The container ID lets the clients discover the location of the container.
+The authoritative information about where a container is located is with the 
Storage Container Manager (SCM). In most cases,
+the container location will be cached by Ozone Manager and will be returned 
along with the Ozone blocks.
+
+Once the client is able to locate the container, it connects to the Datanode and reads the data stream specified
+by *Container ID:Local ID*. In other words, the local ID serves as an index into the container, identifying which data stream to read.
+
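The read path described above can be sketched as a chain of lookups. This is a minimal illustration, not Ozone's client code: the three dictionaries are hypothetical in-memory stand-ins for Ozone Manager, SCM, and a Datanode, and the key name, Datanode names, and IDs are invented for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockID:
    """An Ozone block identifier: container ID plus local ID."""
    container_id: int
    local_id: int


# Hypothetical stand-in for Ozone Manager: key name -> ordered list of blocks.
KEY_TO_BLOCKS = {"vol1/bucket1/key1": [BlockID(1, 101), BlockID(1, 102)]}

# Hypothetical stand-in for SCM: container ID -> Datanodes holding a replica.
CONTAINER_LOCATIONS = {1: ["dn-a", "dn-b", "dn-c"]}

# Hypothetical stand-in for Datanode storage:
# (datanode, container ID, local ID) -> data stream.
DATANODE_STREAMS = {
    ("dn-a", 1, 101): b"hello ",
    ("dn-a", 1, 102): b"ozone",
}


def read_key(name: str) -> bytes:
    """Resolve a key to its blocks, locate each container, read each stream."""
    data = b""
    for block in KEY_TO_BLOCKS[name]:
        # The container ID tells the client where to go ...
        datanode = CONTAINER_LOCATIONS[block.container_id][0]
        # ... and the local ID selects the data stream inside that container.
        data += DATANODE_STREAMS[(datanode, block.container_id, block.local_id)]
    return data
```

Calling `read_key("vol1/bucket1/key1")` concatenates the two streams in block order. A real client would also handle replica selection and failures, which this sketch leaves out.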
+## Discovering Container Locations
+
+How does SCM know where the containers are located? This is very similar to what HDFS does: the Datanodes regularly send
+container reports, much as HDFS DataNodes send block reports. Container reports are far more concise than block reports. For example, an Ozone deployment
+with a 196 TB Datanode will have around 40 thousand containers, compared to an HDFS block count of about one and a half million, roughly a 40x
+reduction in the number of entries reported.
+
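The figures in this paragraph can be reproduced with back-of-the-envelope arithmetic. The sizes below are assumptions, since the text does not state them: a 5 GB container and a 128 MiB HDFS block, with the node assumed to be completely full.

```python
TB = 10**12
GB = 10**9
MIB = 2**20

node_capacity = 196 * TB       # from the text
container_size = 5 * GB        # assumption: container size
hdfs_block_size = 128 * MIB    # assumption: HDFS block size

containers = node_capacity // container_size    # 39200, about 40 thousand
hdfs_blocks = node_capacity // hdfs_block_size  # about 1.46 million
reduction = hdfs_blocks / containers            # about 37x, i.e. roughly 40x

print(containers, hdfs_blocks, round(reduction, 1))
```

Under these assumptions the ratio is simply the container size divided by the block size, so a larger container size would shrink the reports further.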
+This extra indirection helps tremendously with scaling Ozone. SCM has far less block data to process, and running the namespace
+service (Ozone Manager) as a separate service is also critical to scaling Ozone.
+
+## Data Volume Management
+
+### What is a Volume?
+
+In the context of an Ozone Datanode, a "volume" refers to a physical disk or 
storage device managed by the Datanode. Each
+volume can store many containers, which are the fundamental units of storage 
in Ozone. This is different from the "volume"
+concept in Ozone Manager, which refers to a namespace for organizing buckets 
and keys.
+
+The status of each volume, including used space, available space, and whether it is operational (healthy) or failed,
+can be looked up in the Datanode Web UI.
+
+### Defining Volumes with `hdds.datanode.dir`
+
+The property `hdds.datanode.dir` defines the set of volumes (disks) managed by a Datanode. You can specify one or more
+directories, separated by commas. Each directory represents a volume. For example, `/data1/disk1,/data2/disk2` configures
+the Datanode to manage two volumes.
+
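To make the comma-separated format concrete, here is a small sketch of how such a value maps to a list of volumes. The helper `parse_datanode_dirs` is hypothetical and only illustrates the format; it is not how the Datanode itself parses the property.

```python
def parse_datanode_dirs(value: str) -> list[str]:
    """Split a comma-separated hdds.datanode.dir value into volume paths."""
    # Whitespace around entries and empty entries are ignored (an assumption
    # made for this sketch, not documented behaviour).
    return [path.strip() for path in value.split(",") if path.strip()]


volumes = parse_datanode_dirs("/data1/disk1,/data2/disk2")
print(len(volumes), volumes)
```

With the example value from the text, this yields two volumes, one per directory.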
+### Volume Choosing Policy
+
+When a Datanode needs to select a volume to store new data, it uses a volume 
choosing policy. The policy is controlled by
+the property `hdds.datanode.volume.choosing.policy`. There are two main 
policies:
+
+- **CapacityVolumeChoosingPolicy (default)**: This policy randomly selects two 
volumes with enough available space and chooses the one with
+  lower utilization (i.e., more free space). This approach increases the 
likelihood that less-used disks are chosen, helping to balance disk usage over 
time.
+
+- **RoundRobinVolumeChoosingPolicy**: This policy selects volumes in a 
round-robin order, cycling through all available volumes.
+  It does not consider the current utilization of each disk, but ensures even 
distribution of new containers across all disks.
+
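The two policies can be sketched as follows. This is an illustration of the selection logic as described above, not the Ozone implementation: the `Volume` class, the function and class names, and the error handling are hypothetical. The space check in the round-robin chooser is also an assumption, since the text only says that it ignores utilization.

```python
import random
from dataclasses import dataclass


@dataclass
class Volume:
    path: str
    capacity: int
    used: int

    @property
    def available(self) -> int:
        return self.capacity - self.used


def choose_by_capacity(volumes, required, rng=random):
    """Pick two random volumes with enough space; keep the one with more free space."""
    candidates = [v for v in volumes if v.available >= required]
    if not candidates:
        raise RuntimeError("no volume has enough available space")
    if len(candidates) == 1:
        return candidates[0]
    first, second = rng.sample(candidates, 2)
    return first if first.available >= second.available else second


class RoundRobinChooser:
    """Cycle through volumes in order, ignoring how full each one is."""

    def __init__(self):
        self._next = 0

    def choose(self, volumes, required):
        for offset in range(len(volumes)):
            index = (self._next + offset) % len(volumes)
            # Assumption: a volume that cannot fit the request is skipped.
            if volumes[index].available >= required:
                self._next = (index + 1) % len(volumes)
                return volumes[index]
        raise RuntimeError("no volume has enough available space")
```

With only two eligible volumes, `choose_by_capacity` always returns the emptier one. With more volumes the choice is random but biased toward less-used disks, which is the balancing effect described in the first bullet.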

Review Comment:
   https://issues.apache.org/jira/browse/HDDS-14488


