Did not know about @guzba's fast snappy. Nice!! He should make a library/tool that can do parallel compression/decompression. lz4 would smoke with parallelism. With parallel decompression in particular, you could probably get aggregate throughput competitive with DIMM bandwidth, so you pay "only" CPU cost with no throughput slowdown, though the compression ratio is not quite as good as Zstd's. Sometimes that compression ratio takes a huge factor off how fast the backing store needs to be, as with the data [of this thread](https://forum.nim-lang.org/t/5103).
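
To make the parallel idea concrete, here is a rough sketch of chunked compression/decompression. It is in Python only because its stdlib ships a codec. `zlib` is a stand-in for lz4/snappy, and the names `compress_parallel`, `decompress_parallel` and `CHUNK_SIZE` are made up for illustration, not anyone's actual API. The point is just that independently compressed chunks can be handled by separate workers.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor
from typing import List

# Hypothetical chunk size: each chunk is compressed independently,
# which costs a little ratio but lets workers run in parallel.
CHUNK_SIZE = 1 << 20  # 1 MiB


def compress_parallel(data: bytes, workers: int = 4) -> List[bytes]:
    """Split data into fixed-size chunks and compress each one in a worker."""
    chunks = [data[i:i + CHUNK_SIZE] for i in range(0, len(data), CHUNK_SIZE)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so frames line up with chunks.
        return list(pool.map(zlib.compress, chunks))


def decompress_parallel(frames: List[bytes], workers: int = 4) -> bytes:
    """Decompress independently compressed frames in parallel and rejoin them."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return b"".join(pool.map(zlib.decompress, frames))


if __name__ == "__main__":
    payload = b"nim forum " * 300_000
    frames = compress_parallel(payload)
    assert decompress_parallel(frames) == payload
    print(len(payload), "bytes ->", sum(len(f) for f in frames), "bytes")
```

Threads are used here on the assumption that the codec releases the interpreter lock while it works (CPython's `zlib` does). In Nim or C you would hand each chunk to a real thread instead, and a real tool would also need a framing format recording each chunk's compressed length.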
Anyway, as with so many things, a small toolchest of these is better than any single one. Sounds like we are near 2 out of the ultimate 3 that would satisfy the (most standard, fastest, most compressing without tons of speed compromise) triple. :-) I'd vote for all 3 in the stdlib since I kind of think of "compress/decompress" as "like basic IO": super fundamental.