You can use intel-intrinsics, which (1) guarantees that float8 is available and (2) provides an aligned malloc, _mm_malloc.
I have heard of intel-intrinsics, and using it in my code is probably a really good idea, but I wanted to try some SIMD operations with core.simd first. I didn't know about the aligned malloc, though. Thanks!