I don't know how to solve your main problem, it feels like a c compiler bug?
But <https://github.com/guzba/zippy> implement gzip decompression in native nim and is very fast, beating or matching zlib. It also uses different SIMD for x86 and arm. Maybe no need to wrap libdeflate?
