Hi all: Recently I found two relative old papers about non-blocking cache, etc. which are :
1) Reducing memory latency via non-blocking and prefetching caches. BY Tien-Fu Chen and Jean-Loup Baer. 2) Data Prefetching:A Cost/Performance Analysis BY Chris Metcalf It seems the hardware facility does have the potential to improve the performance with compiler's assistance(especially instruction scheduling). while on the other hand, lifting ahead load instructions may resulting in increasing register pressure. So I'm thinking : 1, Has anyone from gcc folks done any investigation on this topic yet, or any statistic data based on gcc available? 2, Does GCC(in any release version) supports it in any targets(such as mips 24ke) with this hardware feature? If not currently, does it possible to support it by using target definition macros and functions? Any tips will be highly appreciated, thanks. -- Best Regards.