Hi Tom,

To recap, I am able to run the Flexus test application using CMPFlex and 
it completes in 5-6 minutes. When I tried running the same application 
using CMPFlex.OoO (please refer to the mail below), the simulation kept 
running for several hours. I let it run for 4 days and then I got an 
error in the 'console window' (see below). This error (panic) seems to 
be from the OS running on Simics !
Also, note that the simulation was stuck at "thread 2 iteration 0" for 
3-4 days. Basically, I think that the nothing is really happening in the 
simulator, but the time does advance.

There are similar panic messages along with Flexus and Simics error 
messages. I have put these messages in the attached file 
'error-messages.txt' for convenience.

Has anyone tried running the test application using simulators other 
than CMPFlex? Is it likely that the checkpointing created is valid only 
with CMPFlex, and that we would need to create different checkpoints 
with the other simulators.

Can you let me know which is the test application that we are trying to 
run, and how to run it directly from simics?

Thanks
- Mrinal



################ start of error (panic) message ####################

sarek.network.sim console login: root
Jun  2 07:00:47 sarek.network.sim login: ROOT LOGIN /dev/console
Sun Microsystems Inc.   SunOS 5.8       Generic Patch   February 2004
# mount /dev/dsk/c0t1d0s0 /mnt; cd /mnt/src; ./flexus-test-app 8 10
Running 8 threads for 10 iterations each.
Thread 0: Initializing per-thread sturctures
... <removed other initialization messages>
Thread 7: Initializing per-thread sturctures
Thread 2 iteration 0

panic[cpu2]/thread=2a1001f7d20: pcisch2: streaming cache page    flush 
timeout!

000002a1001f7590 pcisch:sc_flush+2c0 (10526ae8, 104ef828, 0, 
3000094d6d8, 300008
1fd48, 67b)
  %l0-3: 000003000082d968 0000000000000000 0000000000000000 0000000000000000
  %l4-7: 0000000000000000 0000000000000001 0000000000000000 000002a100007910
000002a1001f7640 pcisch:pci_dma_unbindhdl+144 (3000092f050, 30000e91a60, 
30000e9
1b50, 30000bbdcc8, 16, 0)
  %l0-3: 0000004400001606 0000000000000016 0000000000000004 000000001000a408
  %l4-7: 00000000fe40fa78 000002a10099daf0 0000000000000000 000002a1000079c0
000002a1001f76f0 glm:glm_scsi_destroy_pkt+20 (30000bb4000, 30000ba0ac0, 
0, 30000
94ce18, 2, 30000e9ca98)
  %l0-3: 00000000104f4678 0000000000000001 0000000000000000 0000000000000000
  %l4-7: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
000002a1001f77a0 scsi:scsi_destroy_pkt+10 (30000ba0bf0, 2a1001f7d20, 20, 
4fb, 58
6e0e, 5e231d)
  %l0-3: 0000000010239520 0000000000000000 000000000000ff00 0000000010037954
  %l4-7: 00000000104b6b78 0000000000000000 0000030001775a20 000002a10099dba0
000002a1001f7850 sd:sddone_and_mutex_exit+1fc (400, 3000080bc28, 
30000ff8e10, 30
00080bb28, 30000ff8e10, 0)
  %l0-3: 0000000010239520 0000000000000000 000000000000ff00 0000000000000000
  %l4-7: 0000030001775760 0000000000000000 0000000000000000 0000000000000000
000002a1001f7900 glm:glm_doneq_empty+13c (30000bb4000, 105255b8, 1000, 
30000ba0a
c0, 30000bb4018, 3)
  %l0-3: 00000000101fff70 0000030000bb4000 0000030000ba0bf0 0000030000e9ce90
  %l4-7: 0000030000e9ca98 0000000000000016 0000030000e9d270 0000000000000000
000002a1001f79b0 glm:glm_intr+f4 (1070, 1068, 1058, 30000bb4018, 
30000bb4000, 10
)
  %l0-3: 0000000000000000 00000000000062f0 0000030000ba0bf0 000003000093b968
  %l4-7: 00000300008175c8 0000030000f23510 0000000000000000 0000030000f23510
000002a1001f7a60 pcisch:pci_intr_wrapper+80 (10526030, 10526068, 
3000092f050, 30
000935ce8, 3000080de08, 0)
  %l0-3: 0000000010239d04 0000000000000000 0000030000833058 0000000000000000
  %l4-7: 0000000000000000 0000000000000000 0000000000000000 0000000000000000

syncing file systems...

##########################  end of panic message ########################





Thomas Wenisch wrote:

>Hi Mrinal,
>
>Sorry for the delay in reply.
>
>On Mon, 15 Aug 2005, Mrinal Nath wrote:
>
>  
>
>>Hi Tom,
>>Now I have a question regarding CMPFlex.OoO. I successfully built
>>CMPFlex.OoO using 'make CMPFlex.OoO' command in FLEXUS_ROOT directory.
>>To test CMPFlex.Ooo, I followed the instructions in the getting started
>>manual. (According to the manual, the checkpointing procedure is the
>>same for a given application, so I can use the checkpoints that were
>>created when I tried to test CMPFlex. Am I correct?)
>>    
>>
>
>You are correct, the same checkpoints can be used for all simulation
>modes.
>
>Note that the procedure in the manual describes how to create Simics
>checkpoints.  When you start a Flexus simulation from Simics, the
>simulation state is cold (all caches empty, etc.)  Flexus also has the
>capability of writing out checkpoints with warm state (i.e., caches,
>branch predictor are saved, along with Simics state).  In general, we use
>the in-order simulator to create warm checkpoints for out-of-order
>simulation.  None of this is documented yet, but if you play around with
>the "flexus.*" commands in Simics or take a look in core/flexus.cpp, you
>can probably figure out whats going on.
>
>  
>
>>Then for simulation, I used the command "./interactive -ma CMPFlex.OoO".
>>(Is this correct?)
>>    
>>
>
>This is correct.  The -ma flag must be passed to Simics to run it in
>MAI mode.  Flexus will complain loudly if you try to run with the wrong
>mode.
>
>  
>
>>The in-order simulation using CMPFlex hardly takes 5-6 minutes on my
>>machine. But the OOO simulation using CMPFlex.OoO has been running for
>>the past 10 hours or so and yet, it has not progressed (In the "system
>>console" window, we can see the thread and iteration that is currently
>>going on.) My "system console" gave initialization messages for all 8
>>threads, and then it is stuck at "Thread 2 iteration 0" for the past 10
>>hours. However in the terminal where I started Simics using
>>"./interactive CMPFlex.OoO", Flexus is printing some messages (from
>>different components) which indicate that time is indeed progressing.
>>
>>Is it that the OoO simulation really runs so slowly? Or have I missed
>>out something due to which the simulation is not really running?
>>
>>    
>>
>
>Hmm, in general, for MP simulation, the in-order flexus simulator is
>already so slow that out-of-order simulation is not far behind (within an
>order of magnitude).  6 minutes to 10 hours strikes me as an abnormally
>large slowdown.  It is possible that output on the "system console" can
>take a long time to appear, because the OS code behind printf system
>calls is slow operation, and may have to wait around to grab locks.
>
>However, the number of instructions to complete the test app should be
>pretty much the same in in-order and out-of-order simulation.  You should
>see the "reached iteration whatever" messages at about the same
>instruction counts, and the whole app should take the same number of
>instructions to complete.  Try taking a look at instruction counts in the
>two modes (available in the stats database, via "flexus.print-measurement
>all" while Simics is running; or directly from Simics using its built in
>instruction counters, e.g. cpu0.print-statistics).
>
>Nikos,
>
>  Is everything I said about the test app correct?  Do you have any
>comments
>to add?
>
>As an aside, we have done work to improve simulation speed (by nearly 2
>orders of magnitude)  for in-order trace-based (no timing feedback into
>functional simulation)  simulation, which enables rapid creation of
>warm-state checkpoints.  We will present our checkpoint-based measurement
>methodology at our MICRO tutorial.
>
>Regards,
>-Tom Wenisch
>Computer Architecture Lab
>Carnegie Mellon University
>
>  
>
>>Thanks
>>- Mrinal
>>
-------------- next part --------------
CMPFlex (in-order) sim cycles: (from all.measurement.out file) sys-cycles  
2404285








sarek.network.sim console login: root
Jun  2 07:00:47 sarek.network.sim login: ROOT LOGIN /dev/console
Sun Microsystems Inc.   SunOS 5.8       Generic Patch   February 2004
# mount /dev/dsk/c0t1d0s0 /mnt; cd /mnt/src; ./flexus-test-app 8 10
Running 8 threads for 10 iterations each.
Thread 0: Initializing per-thread sturctures
Thread 1: Initializing per-thread sturctures
Thread 2: Initializing per-thread sturctures
Thread 3: Initializing per-thread sturctures
Thread 4: Initializing per-thread sturctures
Thread 5: Initializing per-thread sturctures
Thread 6: Initializing per-thread sturctures
Thread 7: Initializing per-thread sturctures
Thread 2 iteration 0

panic[cpu2]/thread=2a1001f7d20: pcisch2: streaming cache page    flush timeout!

000002a1001f7590 pcisch:sc_flush+2c0 (10526ae8, 104ef828, 0, 3000094d6d8, 300008
1fd48, 67b)
  %l0-3: 000003000082d968 0000000000000000 0000000000000000 0000000000000000
  %l4-7: 0000000000000000 0000000000000001 0000000000000000 000002a100007910
000002a1001f7640 pcisch:pci_dma_unbindhdl+144 (3000092f050, 30000e91a60, 30000e9
1b50, 30000bbdcc8, 16, 0)
  %l0-3: 0000004400001606 0000000000000016 0000000000000004 000000001000a408
  %l4-7: 00000000fe40fa78 000002a10099daf0 0000000000000000 000002a1000079c0
000002a1001f76f0 glm:glm_scsi_destroy_pkt+20 (30000bb4000, 30000ba0ac0, 0, 30000
94ce18, 2, 30000e9ca98)
  %l0-3: 00000000104f4678 0000000000000001 0000000000000000 0000000000000000
  %l4-7: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
000002a1001f77a0 scsi:scsi_destroy_pkt+10 (30000ba0bf0, 2a1001f7d20, 20, 4fb, 58
6e0e, 5e231d)
  %l0-3: 0000000010239520 0000000000000000 000000000000ff00 0000000010037954
  %l4-7: 00000000104b6b78 0000000000000000 0000030001775a20 000002a10099dba0
000002a1001f7850 sd:sddone_and_mutex_exit+1fc (400, 3000080bc28, 30000ff8e10, 30
00080bb28, 30000ff8e10, 0)
  %l0-3: 0000000010239520 0000000000000000 000000000000ff00 0000000000000000
  %l4-7: 0000030001775760 0000000000000000 0000000000000000 0000000000000000
000002a1001f7900 glm:glm_doneq_empty+13c (30000bb4000, 105255b8, 1000, 30000ba0a
c0, 30000bb4018, 3)
  %l0-3: 00000000101fff70 0000030000bb4000 0000030000ba0bf0 0000030000e9ce90
  %l4-7: 0000030000e9ca98 0000000000000016 0000030000e9d270 0000000000000000
000002a1001f79b0 glm:glm_intr+f4 (1070, 1068, 1058, 30000bb4018, 30000bb4000, 10
)
  %l0-3: 0000000000000000 00000000000062f0 0000030000ba0bf0 000003000093b968
  %l4-7: 00000300008175c8 0000030000f23510 0000000000000000 0000030000f23510
000002a1001f7a60 pcisch:pci_intr_wrapper+80 (10526030, 10526068, 3000092f050, 30
000935ce8, 3000080de08, 0)
  %l0-3: 0000000010239d04 0000000000000000 0000030000833058 0000000000000000
  %l4-7: 0000000000000000 0000000000000000 0000000000000000 0000000000000000

syncing file systems...

Reply via email to