On 10/10/2023 02:39, Juzhe-Zhong wrote:
Here is the reference comparing dump IR between ARM SVE and RVV.

https://godbolt.org/z/zqess8Gss

We can see RVV has one more dump IR:
optimized: basic block part vectorized using 128 byte vectors
since RVV has 1024 bit vectors.

The codegen is reasonable good.

However, I saw GCN also has 1024 bit vector.
This patch may cause this case FAIL in GCN port ?

Hi, GCN folk, could you check this patch in GCN port for me ?

This patch *fixes* an existing test fail on GCN. :)

It's probably one of the many I've never had time to analyze (and optimizing more than expected makes it low priority).

LGTM

Andrew

Reply via email to