> The point is that hash functions are many to one and I think the point
> was about that verify wasn't really needed if the hash function is good
This is a circular argument really, isn't it? Hash algorithms are never
perfect, but we're trying to build a perfect one?
It seems to me the obvious fix is to use hash to identify candidates for dedup,
and then do the actual verify and dedup asynchronously. Perhaps a worker thread
doing this at low priority?
Did anyone consider this?
zfs-discuss mailing list