This article (https://vorpus.org/blog/why-does-calloc-exist/) looks like it's very applicable to APR, where we do the exact malloc-memset trick.
Prelim testing on my macOS and Linux machines do show appreciable improvements. At the very least, maybe a compile-time flag??