I believe SimpleScalar accesses the icache for each instruction, while
M5's FullCPU only accesses the icache once per cycle for all the
instructions it is fetching that are in the same icache line.
(Depending on how aggressive your fetch model is, M5 will access the
icache multiple times in a cycle, but only if it's fetching a set of
instructions that span icache blocks.) I wouldn't be surprised if the
number of icache accesses in SimpleScalar is larger by a factor of about
the issue width of the machine.
Steve
Jos Delbar wrote:
Hey,
I am trying to tune the configurations of M5 and sim-outorder to achieve
similar IPC results for uniprocessor simulations. Using more or less
identical processor configurations (as good as possible considering the
differences between the two simulators), identical cache configurations,
etc., I am getting "reasonable" results. One statistic is bothering me
though, and that is the amount of L1 icache lookups. For an identical
benchmark, sim-outorder reports +/- twice as many icache lookups as M5. L1
data and L2 unified cache lookups only differ a few percent.
Does anyone have an idea where this huge difference is coming from?
Thanks,
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
m5sim-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/m5sim-users