Thanks for the reply. I have a follow up question. So, lets say I am only
using one O3 core and If I want to run 2-threaded SMT that means I will have
to provide as input for instance 2 FFT  benchmarks. And for 4-threaded SMT
similarly I will have to provide as input 4 FFT benchmark. In this case how
can I do the comparison as the benchmarks have changed, in size. So my
question was can I just provide lets say 1 FFT benchmark as input and then
try to run it with 2 threaded-SMT and then with 4 threaded-SMT
configuration. And I think you are saying it is not possible.

Thanks,
--Abhishek

On Fri, Apr 16, 2010 at 9:56 AM, Korey Sewell <[email protected]> wrote:

>
> I have this doubt related to O3 SMT model in ALPHA_SE mode. So I understand
>> that for running multi-programmed benchmarks one can pass a vector of
>> benchmarks as the parameters and set the cpu.numThreads to the size of
>> vector. But is it possible for the O3 CPU to extract the TLP in a (single)
>> given multithreaded benchmark. So what I am trying to say is lets say I just
>> want to run a single multi-threaded benchmark and just by specifying the
>> number of SMT threads (say 4 SMT threads) can the O3 CPU extract the threads
>> from the benchmark and run in a SMT fashion? I would appreciate any
>> directions in this regard.
>>
> As far as I know, O3CPU will only run the threads that you give it, so your
> actual benchmark needs to specify how many threads are going to be run in
> parallel in order for you to see any benefit in SE_SMT.  The problem of
> automatically extracting TLP is in fact it's own subset of the research
> community, so if you wish, I'm sure you can find plenty of literature on
> that topic.
>
>
>>
>> Something tells me that it is not possible. And if that is the case then
>> how do I compare different SMT configurations for a given benchmark ?
>
> That's a bit of a abstract question and on the M5-side, I'm not sure there
> is a definite *right* answer here just user-dependent ones. For some people,
> comparing a 2-threaded SMT core v. a 4-threaded SMT core is fine. For
> others, comparing a 2T-SMT core vs. 2 CMP cores is their objective.
>
> Considering a configuration to be the benchmark running and the hardware
> that it is to be run on, It's up to the user to specify different M5
> configurations that are interesting to them, run M5 for different
> configurations, figure out what configurations are comparable to each other,
> and then decide what to do next with their results.
>
>
>
> --
> - Korey
>
> _______________________________________________
> m5-users mailing list
> [email protected]
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
>



-- 
-Abhi
_______________________________________________
m5-users mailing list
[email protected]
http://m5sim.org/cgi-bin/mailman/listinfo/m5-users

Reply via email to