--- Begin Message ---
In hyper-converged deployments, the node performing the backup sources
((nodes-1)/nodes)*bytes of backup data from its peers (i.e., ingress traffic)
and then sends 1*bytes to PBS (i.e., egress traffic). If PBS were to pull the
data from the nodes directly, the maximum load on any one host would be
(1/nodes)*bytes of egress traffic only, assuming the data is spread evenly
across the nodes. That's a considerable improvement!
Further, nodes that don't host OSDs would be completely quiet. So, in the case
of non-converged Ceph, the hypervisor nodes would not need to participate in
the backup flow at all.
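
To put rough numbers on the comparison, here is a minimal Python sketch of the
arithmetic. The function names are hypothetical and not part of PVE or PBS. It
assumes the image data is spread evenly across all nodes, and it ignores
replication, deduplication, and protocol overhead.

```python
def push_model_load(nodes: int, total_bytes: float) -> tuple[float, float]:
    """Traffic on the backup node when the client reads and pushes to PBS.

    Returns (ingress, egress) in the same unit as total_bytes.
    Assumes 1/nodes of the data is already local to the backup node.
    """
    ingress = (nodes - 1) / nodes * total_bytes  # fetched from the other nodes
    egress = 1 * total_bytes                     # everything is sent to PBS
    return ingress, egress


def pull_model_load(nodes: int, total_bytes: float) -> float:
    """Maximum egress on any one storage node if PBS pulled the data directly."""
    return 1 / nodes * total_bytes


if __name__ == "__main__":
    nodes, total = 4, 400.0  # e.g. 400 GiB backed up from a 4-node cluster
    ingress, egress = push_model_load(nodes, total)
    print(f"push: backup node ingress={ingress}, egress={egress}")
    print(f"pull: max per-node egress={pull_model_load(nodes, total)}")
```

With 4 nodes and 400 GiB, the backup node handles 300 GiB in plus 400 GiB out
in the push model, versus 100 GiB out per node in the pull model.
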
> On Jul 28, 2024, at 2:46 AM, Dietmar Maurer <diet...@proxmox.com> wrote:
>
>> Today, I believe the client is reading the data and pushing it to
>> PBS. In the case of CEPH, wouldn't this involve sourcing data from
>> multiple nodes and then sending it to PBS? Wouldn't it be more
>> efficient for PBS to read it directly from storage? In the case of
>> centralized storage, we'd like to eliminate the client load
>> completely, having PBS ingest increment differences directly from
>> storage without passing through the client.
>
> But Ceph is not a central storage. Instead, data is distributed among the
> nodes, so you always need to send some data over the network.
> There is no way to "read it directly from storage".
>
--- End Message ---
_______________________________________________
pve-devel mailing list
pve-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-devel