Re: [gem5-users] simulation methodology

hanfeng QIN Thu, 13 Dec 2012 17:30:48 -0800

I know the options '-F' and '-W'. Actually, I use them together with the '-I' option to specify the number of instructions for detailed simulation (denoted N3 in my previous mail). It seems that the default implementation in configs/common/Simulation.py passes N3 to cpu[i].MAX_INSTS_ANY_THREAD. Thus, as soon as any program finishes N3 instructions, the whole simulation exits. So in this case I modified the default implementation to pass N3 to cpu[i].MAX_INSTS_ALL_THREADS, which forces each program to commit at least N3 instructions. The total number of instructions simulated will then be at least N3 * Nr_cores. But this approach has a pitfall compared with the methodology I referred to. For a multi-programmed workload, once some program finishes N3 instructions, the corresponding core has no task left to schedule (I assume the number of programs is no more than the number of simulated cores). Thus, it may not be reasonable to evaluate the impact on shared resource contention from the final statistics report.
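To make the pitfall concrete, here is a minimal toy model in Python. It is not gem5 code and uses no gem5 API; the function name and the fixed per-core commit rates are hypothetical. It only models the two exit conditions as described above (stop when any program reaches N3, or stop when all programs have reached N3) and counts how long each core would sit idle in the second case.

```python
# Toy model, not gem5 code: each core commits a fixed (hypothetical)
# number of instructions per cycle and stops once it has committed n3.
def exit_cycles(ipc_per_core, n3):
    """Return (any_exit, all_exit, idle) for the two exit conditions.

    any_exit: cycle at which the first program reaches n3
    all_exit: cycle at which the last program reaches n3
    idle:     cycles each core has nothing to run before all_exit
    """
    if any(ipc <= 0 for ipc in ipc_per_core):
        raise ValueError("every core needs a positive commit rate")
    # Cycles each core needs to commit n3 instructions (ceiling division).
    finish = [-(-n3 // ipc) for ipc in ipc_per_core]
    any_exit = min(finish)
    all_exit = max(finish)
    idle = [all_exit - f for f in finish]
    return any_exit, all_exit, idle


if __name__ == "__main__":
    print(exit_cycles([4, 2, 1], n3=1000))
```

With commit rates of 4, 2 and 1 instructions per cycle, the fastest core in this model is idle for 750 of the 1000 cycles, so the slowest program sees far less contention than it would in a fully loaded system.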

Based on this, I have an idea for reporting statistics more reasonably. Can we carry out detailed simulation of N3 * 2 instructions for each program (so the total number of instructions simulated will be (N3 * 2) * Nr_cores) but only dump the stats for the first N3 instructions? I am not clear on the internals of the stats dump, though.
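The idea can be sketched with the same kind of toy model. Again, this is plain Python and not gem5 code; the function name and the commit rates are hypothetical. Every program keeps running after it reaches N3, so the slower programs still see contention, and each program's measurement point is recorded at the moment it first crosses N3.

```python
# Toy model, not gem5 code: all cores advance in lock-step and keep
# running after their own snapshot so contention stays present.
def snapshot_at_n3(ipc_per_core, n3):
    """Return (snapshot, committed).

    snapshot[i]:  cycle at which core i first reached n3 instructions
    committed[i]: instructions core i had committed when the slowest
                  core reached n3 and the run stopped
    """
    if any(ipc <= 0 for ipc in ipc_per_core):
        raise ValueError("every core needs a positive commit rate")
    committed = [0] * len(ipc_per_core)
    snapshot = [None] * len(ipc_per_core)
    cycle = 0
    while any(s is None for s in snapshot):
        cycle += 1
        for i, ipc in enumerate(ipc_per_core):
            committed[i] += ipc
            if snapshot[i] is None and committed[i] >= n3:
                snapshot[i] = cycle  # per-program stats would be taken here
    return snapshot, committed


if __name__ == "__main__":
    print(snapshot_at_n3([2, 1], n3=100))
```

In this example the faster program has committed exactly 2 * N3 instructions when the slower one reaches N3. In this model, a budget of N3 * 2 keeps every core busy only if no program runs more than twice as fast as the slowest one.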



Hanfeng

On 12/13/2012 11:45 PM, Nilay Vaish wrote:

On Wed, 12 Dec 2012, hanfeng QIN wrote:
Hi all,
I have learned of a common multi-programmed simulation methodology adopted by many architecture researchers, but I am not clear on how it is implemented. I describe the idea briefly below.
For a multi-programmed workload consisting of M programs, this methodology first fast-forwards N1 instructions. Before detailed measurement, it warms up the caches with N2 instructions. Then detailed simulation is carried out until all programs have executed N3 instructions. Statistics are reported only for the first N3 instructions of each program in detailed simulation.
I want to know how to implement it with gem5 in practice. As far as I know, gem5 provides the '-s' option to support switching modes from TimingSimpleCPU to DetailedCPU (O3). However, I have no idea how to control each program so that it executes a fixed N3 instructions. Besides, if some programs finish retiring N3 instructions before others, how should the stats be dumped to ensure they are correct for all programs that have executed N3 instructions?
If your programs run long enough, then you will be able to take the required measurements before any one of them finishes. Instead of trying to force each program to execute a fixed number of instructions, you can force each program to execute a minimum number of instructions during the fast-forward, cache warmup and detailed simulation modes. So only when all the programs have executed at least N1 instructions should you switch the CPU. Similarly, when all the programs have executed at least N2 instructions, reset the statistics and start the detailed simulation.
If you look into how the options -F and -W are used in the file configs/common/Simulation.py, you should be able to make it work for a multi-CPU system as well.
--
Nilay
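
The phase switching suggested above (leave a phase only when every program has executed at least the minimum count for that phase) can be sketched as a toy model. This is plain Python, not gem5 code and not the logic in configs/common/Simulation.py; the function name and the fixed commit rates are hypothetical, and the model ignores the fact that commit rates differ between CPU models.

```python
# Toy model, not gem5 code: three phases, each of which ends only when
# every core has committed at least that phase's minimum instruction count.
def run_phases(ipc_per_core, n1, n2, n3):
    """Return a dict mapping each phase name to the cycle at which it ends."""
    if any(ipc <= 0 for ipc in ipc_per_core):
        raise ValueError("every core needs a positive commit rate")
    boundaries = {}
    cycle = 0
    phases = (("fast_forward", n1), ("warmup", n2), ("detailed", n3))
    for name, minimum in phases:
        committed = [0] * len(ipc_per_core)  # counters restart in each phase
        while min(committed) < minimum:      # wait for the slowest program
            cycle += 1
            committed = [c + ipc for c, ipc in zip(committed, ipc_per_core)]
        # End of fast_forward: switch CPUs. End of warmup: reset the stats.
        # End of detailed: dump the stats.
        boundaries[name] = cycle
    return boundaries


if __name__ == "__main__":
    print(run_phases([4, 2, 1], n1=100, n2=50, n3=200))
```

Because each phase waits for the slowest program, the faster programs execute more than the minimum in every phase, which is the trade-off of using minimum counts rather than fixed counts.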


