I would recommend testing your runs in a more controlled environment. That will make it easier to determine whether there is truly a problem with gem5 or whether the problem is your setup, i.e. running on a cluster. Make sure that the checkpoint is being taken correctly, that all images, binaries, and checkpoints are in the proper locations, and that your path is set up correctly.
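A minimal sketch of that kind of pre-flight check, to run on the compute node before launching gem5. It only uses the Python standard library. The paths in the usage part are assumptions for illustration: it assumes the kernel sits under the `binaries` directory of `M5_PATH` and that the checkpoint directory is named `bbench` as in the command lines quoted in this thread. Substitute the locations from your own setup.

```python
import os


def missing_inputs(paths):
    """Return the paths that do not exist or are not readable by this process."""
    return [p for p in paths if not os.access(p, os.R_OK)]


if __name__ == "__main__":
    # Assumed layout: $M5_PATH/binaries/<kernel>. Adjust to your installation.
    base = os.environ.get("M5_PATH", ".")
    required = [
        os.path.join(base, "binaries", "vmlinux.smp.mouse.arm"),
        "bbench",  # checkpoint directory passed via --checkpoint-dir
    ]
    for path in missing_inputs(required):
        print("missing or unreadable:", path)
```

Running the same script on the head node and inside a queued job shows quickly whether the two environments see the same files.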
-Tony

On Tue, Oct 2, 2012 at 10:36 AM, Fangfei Liu <fangf...@princeton.edu> wrote:

> It turned out gem5.debug works fine so far (after running about 20
> minutes). I was wondering how much slower gem5.debug is than gem5.opt?
> ------------------------------
> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
> behalf of Ali Saidi [sa...@umich.edu]
> *Sent:* Monday, October 01, 2012 9:54 PM
> *To:* gem5 users mailing list
> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>
> That is very odd... Could you run the debug binary and see if you get a
> better trace. Also could you run it under valgrind?
>
> Thanks,
>
> Ali
>
> On 01.10.2012 19:33, Fangfei Liu wrote:
>
> Hi,
>
> This is the back trace of the stack for the segmentation fault:
>
> command line: /tigress-hsm/fangfeil/gem5/build/ARM/gem5.opt
> --outdir=bbench2 configs/example/fs.py -b bbench-gb
> --kernel=vmlinux.smp.mouse.arm --frame-capture --caches --cpu-type=detailed
> Global frequency set at 1000000000000 ticks per second
> info: kernel located at:
> /tigress-hsm/fangfeil/gem5/system/binaries/vmlinux.smp.mouse.arm
> Listening for system connection on port 5900
> Listening for system connection on port 3456
> 0: system.remote_gdb.listener: listening for remote gdb #0 on port 7000
> info: Using bootloader at address 0x80000000
> **** REAL SIMULATION ****
> info: Entering event queue @ 0. Starting simulation...
> warn: The clidr register always reports 0 caches.
> warn: clidr LoUIS field of 0b001 to match current ARM implementations.
> warn: The csselr register isn't implemented.
>
> Program received signal SIGSEGV, Segmentation fault.
> 0x00002aaaabafcbab in __dynamic_cast () from /usr/lib64/libstdc++.so.6 > (gdb) bt > #0 0x00002aaaabafcbab in __dynamic_cast () from /usr/lib64/libstdc++.so.6 > #1 0x00000000016a8b15 in MSHR::TargetList::clearDownstreamPending > (this=0x3c16f30) at build/ARM/mem/cache/mshr.cc:133 > #2 0x00000000016a8d90 in MSHR::markInService (this=, pkt=) at > build/ARM/mem/cache/mshr.cc:238 > #3 0x00000000016ab88f in MSHRQueue::markInService (this=0x3bfab90, > mshr=0x3bfd2e8, pkt=) at build/ARM/mem/cache/mshr_queue.cc:202 > #4 0x0000000001681871 in Cache::markInService(MSHR*, Packet*) () > #5 0x00000000016964a2 in Cache::MemSidePacketQueue::sendDeferredPacket() > () > #6 0x0000000001863d44 in EventQueue::serviceOne (this=) at > build/ARM/sim/eventq.cc:204 > #7 0x00000000018abf82 in simulate (num_cycles=) at > build/ARM/sim/simulate.cc:71 > #8 0x000000000182738b in _wrap_simulate__SWIG_0 (self=, args=) at > build/ARM/python/swig/event_wrap.cc:4371 > #9 _wrap_simulate (self=, args=) at > build/ARM/python/swig/event_wrap.cc:4421 > #10 0x00002aaaab3226c0 in PyObject_Call () from > /usr/lib64/libpython2.4.so.1.0 > #11 0x00002aaaab380b1c in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #12 0x00002aaaab3835d5 in PyEval_EvalCodeEx () from > /usr/lib64/libpython2.4.so.1.0 > #13 0x00002aaaab381e08 in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #14 0x00002aaaab381fbb in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #15 0x00002aaaab381fbb in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #16 0x00002aaaab3835d5 in PyEval_EvalCodeEx () from > /usr/lib64/libpython2.4.so.1.0 > #17 0x00002aaaab383622 in PyEval_EvalCode () from > /usr/lib64/libpython2.4.so.1.0 > #18 0x00002aaaab381f2d in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #19 0x00002aaaab3835d5 in PyEval_EvalCodeEx () from > /usr/lib64/libpython2.4.so.1.0 > #20 0x00002aaaab381e08 in PyEval_EvalFrame () from > /usr/lib64/libpython2.4.so.1.0 > #21 0x00002aaaab3835d5 
in PyEval_EvalCodeEx () from
> /usr/lib64/libpython2.4.so.1.0
> #22 0x00002aaaab383622 in PyEval_EvalCode () from
> /usr/lib64/libpython2.4.so.1.0
> #23 0x00002aaaab39fd19 in ?? () from /usr/lib64/libpython2.4.so.1.0
> #24 0x000000000186be5c in m5Main (argc=, argv=) at
> build/ARM/sim/init.cc:256
> #25 0x0000000000409c11 in main (argc=9, argv=0x7fffffffe4f8) at
> build/ARM/sim/main.cc:57
>
> Do you have any idea how to solve this problem? Thanks!
>
> Best regards
> Fangfei
> ------------------------------
> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
> behalf of Ali Saidi [sa...@umich.edu]
> *Sent:* Monday, October 01, 2012 5:46 PM
> *To:* gem5 users mailing list
> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>
> Without a back trace of where the segmentation fault is happening and
> perhaps the output of valgrind we're not going to be much help.
>
> Ali
>
> On 01.10.2012 15:51, Fangfei Liu wrote:
>
> I find that it may not be related to the checkpoint: the segmentation fault
> occurs for every configuration using the detailed/arm_detailed cpu type. Is it
> because of the version of the runtime environment?
> ------------------------------
> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
> behalf of Fangfei Liu [fangf...@princeton.edu]
> *Sent:* Monday, October 01, 2012 2:56 PM
> *To:* gem5 users mailing list
> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>
> Actually no. But I think the directory where the checkpoint is located is
> shared by all nodes via GPFS. There is no compiler on the nodes, only a
> runtime environment. Will this be a potential problem?
> Thanks!
> ------------------------------
> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
> behalf of Anthony Gutierrez [atgut...@umich.edu]
> *Sent:* Monday, October 01, 2012 12:44 PM
> *To:* gem5 users mailing list
> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>
> Are you copying the checkpoint to the node on which it's running?
>
> On Mon, Oct 1, 2012 at 12:11 AM, Fangfei Liu <fangf...@princeton.edu> wrote:
>
>> Sorry, I may have used the wrong command when I ran the executable with gdb
>> to get the previous information. The problem I met is a little bit strange.
>> I'm running gem5 on a cluster. It turns out that it works fine on the head node
>> of the cluster with gdb (at least no segmentation fault after running for
>> more than 10 minutes). But when I submitted the job to a job queue and
>> ran it on other nodes, a segmentation fault occurred after it ran for only
>> several seconds.
>> ------------------------------
>> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
>> behalf of Ali Saidi [sa...@umich.edu]
>> *Sent:* Sunday, September 30, 2012 11:32 PM
>> *To:* gem5 users mailing list
>> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>>
>> Could you get a back trace when it happens. It's not a segmentation
>> fault, it's a SIGABRT. You're running into an assertion and we need to know
>> who is creating this event that is causing you trouble.
>>
>> Ali
>>
>> On Sep 30, 2012, at 9:44 PM, Fangfei Liu wrote:
>>
>> I tried several different configurations. It works in atomic mode even
>> with caches. But a segmentation fault occurs for both the arm_detailed and
>> detailed cpu types. The checkpoint is taken in atomic mode. Should the
>> checkpoint be taken with the same cpu type that is used when restoring the
>> checkpoint?
>> >> >> >> >> ------------------------------ >> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on >> behalf of Fangfei Liu [fangf...@princeton.edu] >> *Sent:* Sunday, September 30, 2012 4:16 PM >> *To:* gem5 users mailing list >> *Subject:* Re: [gem5-users] questions on running benchmark on gem5 >> Hi, >> >> This is the information I got from running it in gdb: >> >> Switch at curTick count:10000 >> info: Entering event queue @ 19318968082000. Starting simulation... >> gem5.opt: build/ARM/sim/simulate.cc:66: SimLoopExitEvent* simulate(Tick): >> Assertion `curTick() Program received signal SIGABRT, Aborted. >> 0x0000003127430285 in raise () from /lib64/libc.so.6 >> >> It seems that it failed at the assertion. Do you have any idea what's >> wrong with it? Thanks! >> >> Best regards >> Fangfei >> >> >> >> ------------------------------ >> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on >> behalf of Anthony Gutierrez [atgut...@umich.edu] >> *Sent:* Sunday, September 30, 2012 3:32 PM >> *To:* gem5 users mailing list >> *Subject:* Re: [gem5-users] questions on running benchmark on gem5 >> You will need to use gdb or a similar tool to find out where that seg >> fault is coming from. >> -Tony >> >> On Sun, Sep 30, 2012 at 3:29 PM, Fangfei Liu <fangf...@princeton.edu> >> wrote: >> >>> I didn't use caches when taking a checkpoint. This is the command line >>> I used to take the checkpoint. >>> >>> build/ARM/gem5.opt --outdir=bbench configs/example/fs.py -b bbench-gb >>> --kernel=vmlinux.smp.mouse.arm --frame-capture --checkpoint-dir=bbench >>> >>> >>> >>> >>> ------------------------------ >>> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on >>> behalf of Anthony Gutierrez [atgut...@umich.edu] >>> *Sent:* Sunday, September 30, 2012 3:04 PM >>> >>> *To:* gem5 users mailing list >>> *Subject:* Re: [gem5-users] questions on running benchmark on gem5 >>> Did you take a checkpoint with caches? 
If so, you must take a
>>> checkpoint without caches.
>>> -Tony
>>>
>>> On Sun, Sep 30, 2012 at 2:58 PM, Fangfei Liu <fangf...@princeton.edu>
>>> wrote:
>>>
>>>> I see, I probably interacted with the vncviewer using the mouse
>>>> while the keyboard was open. I tried to run the benchmark again and now it
>>>> can run to completion. Thank you very much for your kind help!
>>>>
>>>> I have another problem. I took a checkpoint after booting the OS in
>>>> atomic mode with the default configuration. I did this by inserting an m5
>>>> instruction in the bbench-gb.rcS file: /sbin/m5 checkpoint (right after
>>>> sleep 10). The checkpoint can be taken correctly. Then I want to resume from
>>>> the checkpoint and simulate BBench in arm_detailed mode, but I got a
>>>> segmentation fault. The command line and output are as follows:
>>>>
>>>> command line: build/ARM/gem5.opt --outdir=bbench2 configs/example/fs.py
>>>> -b bbench-gb --kernel=vmlinux.smp.mouse.arm --frame-capture
>>>> --checkpoint-dir=bbench -r 1 --caches --cpu-type=arm_detailed --l2cache
>>>> --cacheline_size=64 --l1d_size=32kB --l1d_assoc=4 --l2_size=512kB
>>>> --l2_assoc=8
>>>> Global frequency set at 1000000000000 ticks per second
>>>> info: kernel located at:
>>>> /tigress-hsm/fangfeil/gem5/system/binaries/vmlinux.smp.mouse.arm
>>>> Switch at curTick count:10000
>>>> info: Entering event queue @ 19318968082000. Starting simulation...
>>>> Switched CPUS @ tick 19318968092000
>>>> Changing memory mode to timing
>>>> switching cpus
>>>> **** REAL SIMULATION ****
>>>> info: Entering event queue @ 19318968092000. Starting simulation...
>>>> Segmentation fault
>>>>
>>>> I still use the same rcS file, except that the checkpoint command is
>>>> commented out. I was wondering whether I did this correctly. Thanks in advance!
>>>>
>>>> Best regards
>>>> Fangfei
>>>>
>>>> ------------------------------
>>>> *From:* gem5-users-boun...@gem5.org [gem5-users-boun...@gem5.org] on
>>>> behalf of Anthony Gutierrez [atgut...@umich.edu]
>>>> *Sent:* Sunday, September 30, 2012 12:59 PM
>>>> *To:* gem5 users mailing list
>>>> *Subject:* Re: [gem5-users] questions on running benchmark on gem5
>>>>
>>>> BBench is a web-page rendering benchmark; however, it shouldn't
>>>> interact with the web pages in any way. It's doing a google search for
>>>> something called ewbay and the keyboard is open; that shouldn't happen.
>>>> Were you interacting with the vncviewer with the mouse? Is it possible that
>>>> this query could have been entered in error? There is nothing in the BBench
>>>> source called "ewbay", so it's hard to tell how that url is being produced
>>>> otherwise.
>>>> -Tony
>>>>
>>>> On Sat, Sep 29, 2012 at 2:15 PM, Fangfei Liu <fangf...@princeton.edu>
>>>> wrote:
>>>>
>>>>> Thanks for your reply.
>>>>> I didn't change anything and the command line I used was exactly
>>>>> the same as that in the BBench instruction page. The exact URL is
>>>>> http://www.google.com/m?hl=md&gl=us&source=android-browser-type&q=ewbay
>>>>> It is actually doing a google search for ebay. Please find attached a
>>>>> snapshot of this page.
>>>>>
>>>>> Best regards
>>>>> Fangfei
>>>>>
>>>>> Date: Fri, 28 Sep 2012 14:49:55 -0400
>>>>> From: Anthony Gutierrez <atgut...@umich.edu>
>>>>> To: gem5 users mailing list <gem5-users@gem5.org>
>>>>> Subject: Re: [gem5-users] questions on running benchmark on gem5
>>>>> Message-ID:
>>>>> <caeqebpsxtdzuouczrvt6q1-3ckj5mkwzgicnwdo9qbpwume...@mail.gmail.com>
>>>>> Content-Type: text/plain; charset="iso-8859-1"
>>>>>
>>>>> I started a BBench run this morning with a fresh check out of the latest
>>>>> dev repo, the Android kernel and an unmodified Gingerbread disk image with
>>>>> BBench, both from gem5.org/bbench-gem5. I do not see this problem, eBay
>>>>> renders fine and the simulation continues. I will need more info on your
>>>>> setup/how you're running the simulation if you'd like help with this
>>>>> problem.
>>>>>
>>>>> -Tony
>>>>>
>>>>> On Fri, Sep 28, 2012 at 11:31 AM, Anthony Gutierrez <atgut...@umich.edu> wrote:
>>>>>
>>>>> > I've never seen this problem before, and the BBench source for eBay does
>>>>> > not show anything called ewbay in the source, so I don't know how it's
>>>>> > getting redirected to that URL. Did you modify the BBench source in any
>>>>> > way? Can you send me your exact command line? And, can you tell me the
>>>>> > exact URL that the browser shows when it hangs?
>>>>> >
>>>>> > -Tony
>>>>> >
>>>>> > On Thu, Sep 27, 2012 at 10:25 AM, Fangfei Liu <fangf...@princeton.edu> wrote:
>>>>> >
>>>>> >> Hi,
>>>>> >>
>>>>> >> I tried to run Bbench on gem5 with Gingerbread disk image (atomic mode
>>>>> >> and default configuration). The simulator took about 140 minutes to boot
>>>>> >> Android and start running the benchmark. But it seems that the benchmark
>>>>> >> stalled at certain web page after running for a while.
I find that it is
>>>>> >> supposed to render a web page on ebay but used the wrong url with
>>>>> >> ewbay. I terminated the simulation manually after 97 hours but found that
>>>>> >> the statistics show only 87 billion instructions were simulated. I was
>>>>> >> wondering what caused the problem and what is supposed to mark the end of
>>>>> >> a BBench run? Will the simulation exit automatically? How many
>>>>> >> instructions are there if I want to run BBench to completion? I also
>>>>> >> downloaded an EEMBC benchmark: AndEbench, which is an Android app (.apk
>>>>> >> file). Can I run this app directly on gem5? If so, does anyone know how to
>>>>> >> install and run it on gem5? Thanks!
>>>>> >>
>>>>> >> Best regards
>>>>> >>
>>>>> >> Fangfei
>>>>> >>
>>>>> >> _______________________________________________
>>>>> >> gem5-users mailing list
>>>>> >> gem5-users@gem5.org
>>>>> >> http://m5sim.org/cgi-bin/mailman/listinfo/gem5-users
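Since the crash in this thread only shows up in queued jobs on the compute nodes, where an interactive gdb session is awkward, one option is to let gdb run non-interactively and print the back trace on its own. The sketch below only builds the command line; it assumes gdb is installed on the compute nodes and that the debug binary lives at build/ARM/gem5.debug (the usual build target, but not stated in the thread). The simulator arguments are copied from the command line quoted earlier. `-batch`, `-ex`, and `--args` are standard gdb options.

```python
def gdb_backtrace_argv(sim_argv):
    """Wrap a simulator command so that gdb runs it in batch mode and
    prints a back trace ("bt") once the program stops, e.g. on SIGSEGV."""
    return ["gdb", "-batch", "-ex", "run", "-ex", "bt", "--args"] + list(sim_argv)


if __name__ == "__main__":
    # Assumed binary path; the remaining arguments come from the thread.
    sim = [
        "build/ARM/gem5.debug", "--outdir=bbench2", "configs/example/fs.py",
        "-b", "bbench-gb", "--kernel=vmlinux.smp.mouse.arm",
        "--frame-capture", "--caches", "--cpu-type=detailed",
    ]
    # Print the command so it can be pasted into a job script.
    print(" ".join(gdb_backtrace_argv(sim)))
```

The printed command can go straight into the job script; the back trace then ends up in the job's output file. If the run finishes without crashing, gdb simply reports that there is no stack.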