On 10/10/2023 02:39, Juzhe-Zhong wrote:
Here is the reference comparing dump IR between ARM SVE and RVV.
https://godbolt.org/z/zqess8Gss
We can see RVV has one more dump IR:
optimized: basic block part vectorized using 128 byte vectors
since RVV has 1024 bit vectors.
The codegen is reasonable good.
However, I saw GCN also has 1024 bit vector.
This patch may cause this case FAIL in GCN port ?
Hi, GCN folk, could you check this patch in GCN port for me ?
This patch *fixes* an existing test fail on GCN. :)
It's probably one of the many I've never had time to analyze (and
optimizing more than expected makes it low priority).
LGTM
Andrew