Yes, it is. The only way to see what is going on is to use tracediff and see 
where the execution diverges.

Ali

On Jun 2, 2011, at 6:48 AM, Gustavo Henrique Nihei wrote:

> First, sorry for bringing back an old thread.
> 
> But I'm still confused but this matter. I'm not running FS.
> So, by running an SE platform, isn't it weird that for a single application, 
> and different cache configurations, the number of simulated instructions 
> differ between simulations?
> I mean, if there's no underlying OS, just the application, the expected would 
> be the CPU to only execute the instructions provided by the app binary, or am 
> I missing some point here?
> 
> Thanks.
> 
> On Tue, Jan 25, 2011 at 12:46 PM, Steve Reinhardt <ste...@gmail.com> wrote:
> Yes, it's almost impossible to get completely identical behavior without 
> running a completely identical system.  Even making the cache larger will 
> make the program run faster in some phases, which will change where timer 
> interrupts happen with respect to program execution.
> 
> If you look at larger time windows and/or more samples, the mean behavior 
> should stabilize, but trying to correlate individual small samples like 
> you're doing is going to be extremely challenging.
> 
> This paper focuses on these issues in multiprocessor systems, but most of 
> what it talks about is relevant to uniprocessor systems running a full OS too:
> http://pages.cs.wisc.edu/~alaa/papers/ieeemicro03_variability.pdf
> 
> Steve
> 
> 
> On Mon, Jan 24, 2011 at 10:34 PM, Stevenson Jian <stevensonj...@gmail.com> 
> wrote:
> Yes, I am running in FS mode. Is it normal for the OS to make that much 
> difference? 
> These statistics are taken after the benchmarks have started. 
> Thanks!
> Steve
> 
> On Tue, Jan 25, 2011 at 12:00 AM, Steve Reinhardt <ste...@gmail.com> wrote:
> OK, sorry for the confusion; since you were running a Parsec benchmark I 
> assumed the numbers were processor IDs.  Are you running in FS mode?  Are 
> these statistics taken from the beginning when Linux is booting, or are they 
> after the benchmark has started running?
> 
> Steve
> 
> 
> On Mon, Jan 24, 2011 at 5:51 PM, Stevenson Jian <stevensonj...@gmail.com> 
> wrote:
> Thanks for replying Steve. I only used a single processor in both 
> simulations. What is shown is not the output from individual processors, but 
> that of the same processor at the end of every 100,000 instructions (see 
> sim_insts increment 100,000 each time)
> 
> 
> On Mon, Jan 24, 2011 at 7:14 PM, Steve Reinhardt <ste...@gmail.com> wrote:
> With a multiprocessor, seemingly small changes in configuration can have a 
> significant impact if it changes the order in which threads grab a lock, or 
> something like that. So in particular, for the stats you have below, it seems 
> likely that there's some serialized computation going on that happened on 
> processor 3 in the first case and on processor 5 in the second case.
> 
> Steve
> 
> On Mon, Jan 24, 2011 at 1:30 PM, Stevenson Jian <stevensonj...@gmail.com> 
> wrote:
> Hi,
> How does Timing CPU count number of instructions? If it stalls on a cache 
> miss, do the Nops count as instructions as well? The reason why I ask is that 
> by simply changing the size of the cache, the total number of instructions 
> when the benchmark completes varies by about 0.1 - 0.01%.
> 
> Another anomaly that I am observing is that again, by simply changing the 
> size of the L2, the number of overall L2 accesses per let's say 100,000 
> instructions can vary by over 100%. 
> 
> The following are 2 runs that i did on m5 with the Freqmine benchmark. The 
> first simulation uses a 1Mb 4 way L2 with a latency of 6ns while the second 
> simulation uses a 2MB 8 way L2 with a latency of 4.5ns. The overall access 
> per 100,000 instructions are show.
> ---------------------------------------------------------------------------------------------
> 1MB 4Way L2:
> 2:
> sim_insts                                   100200001                       # 
> Number of instructions simulated
> sim_ticks                                   196940000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       3231                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           2515                       # 
> number of overall hits
> 
> 3:
> sim_insts                                   100300001                       # 
> Number of instructions simulated
> sim_ticks                                   227453000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       4656                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           3434                       # 
> number of overall hits
> 
> 4:
> sim_insts                                   100400001                       # 
> Number of instructions simulated
> sim_ticks                                   154064000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       1078                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                            722                       # 
> number of overall hits
> 
> 5:
> sim_insts                                   100500001                       # 
> Number of instructions simulated
> sim_ticks                                   155779000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       1575                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           1154                       # 
> number of overall hits
> 
> ....
> 
> 2MB 8Way L2:
> 2:
> sim_insts                                   100200001                       # 
> Number of instructions simulated
> sim_ticks                                   234810000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       2936                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           1163                       # 
> number of overall hits
> 
> 3:
> sim_insts                                   100300000                       # 
> Number of instructions simulated
> sim_ticks                                   174173000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       1496                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                            803                       # 
> number of overall hits
> 
> 4:
> sim_insts                                   100400000                       # 
> Number of instructions simulated
> sim_ticks                                   190135000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       2290                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           1672                       # 
> number of overall hits
> 
> 5:
> sim_insts                                   100500000                       # 
> Number of instructions simulated
> sim_ticks                                   213086000                       # 
> Number of ticks simulated
> system.l2.overall_accesses                       4554                       # 
> number of overall (read+write) accesses
> system.l2.overall_hits                           3871                       # 
> number of overall hits
> .....
> ----------------------------------------------------------------------------
> Even if Nops are counted as instructions, I don't see how that would make 
> overall access/100,000 instructions vary by as much 200%. How does M5 count 
> the number of instructions?
> Thanks,
> Steve
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> _______________________________________________
> m5-users mailing list
> m5-us...@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
> 
> 
> 
> -- 
> Gustavo Henrique Nihei
> LAPS - Laboratório de Automação do Projeto de Sistemas
> NIME - Núcleo Interdepartamental de Microeletrônica
> Universidade Federal de Santa Catarina
> Florianópolis - Santa Catarina - Brasil
> 
> _______________________________________________
> gem5-users mailing list
> gem5-users@m5sim.org
> http://m5sim.org/cgi-bin/mailman/listinfo/gem5-users

_______________________________________________
gem5-users mailing list
gem5-users@m5sim.org
http://m5sim.org/cgi-bin/mailman/listinfo/gem5-users

Reply via email to