Rick,

1) Just to follow up, did you figure out a good speed for the checkpoint
producing run?

2) Did you ever find why everything was 404'ed?

Lisa

On Wed, Jan 28, 2009 at 9:44 PM, Lisa Hsu <h...@eecs.umich.edu> wrote:

> I couldn't say exactly, so much in M5 has changed since that paper to give
> an exact number, but I'd imagine whatever instructions/second you're getting
> in the detailed, if you make the checkpoint run the appropriate speed
> considering its 1 IPC, you'd probably be in the right range.
>
> Lisa
>
>
> On Wed, Jan 28, 2009 at 8:26 PM, Rick Strong <rstr...@cs.ucsd.edu> wrote:
>
>> The clock rate on the server is only set to 1GHz on the checkpoint run
>> (as opposed to 3GHz for the detailed simulation). How slow should it be
>> set? Are we talking nearer to 250MHz?
>>
>> Thanks,
>> -Rick
>>
>> Ali Saidi wrote:
>> > That's almost certainly what is happening. Different packets are
>> > trying to be sent, both originating from the kernel. This isn't a
>> > device bug. It's exactly what that paper described. The delay observed
>> > by the server has changed dramatically, that a retransmit is occurring
>> > because since the ack didn't arrive in twice the round trip latency.
>> > You should add some latency to the ethernet link, and drive the server
>> > with a slower CPU during the checkpoint run. That will normally fix
>> > the problem.
>> >
>> > Ali
>> >
>> >
>> >
>> >
>> > On Jan 28, 2009, at 5:59 PM, Rick Strong wrote:
>> >
>> >
>> >> This is an interesting. Thanks for the link.
>> >>
>> >> -Rick
>> >>
>> >> Lisa Hsu wrote:
>> >>
>> >>> Your description that it only occurs when you switch to a timing sim
>> >>> makes me think of this (not to toot my own horn or anything):
>> >>>
>> >>> http://www.eecs.umich.edu/~hsul/pubs/mobs05.pdf<http://www.eecs.umich.edu/%7Ehsul/pubs/mobs05.pdf>
>> >>> <http://www.eecs.umich.edu/%7Ehsul/pubs/mobs05.pdf>
>> >>>
>> >>> Just throwing that out as a possibility.  You might want to "slow
>> >>> down" your checkpoint dropping run so that it's not so disruptive
>> >>> when
>> >>> you switch over to timing.
>> >>>
>> >>> Lisa
>> >>>
>> >>> On Wed, Jan 28, 2009 at 5:32 PM, Rick Strong <rstr...@cs.ucsd.edu
>> >>> <mailto:rstr...@cs.ucsd.edu>> wrote:
>> >>>
>> >>>    I have posted tar.gz files that include EthernetAll output (in
>> >>>    ethernet_all.trace) @ http://rickshin.ucsd.edu. Once you have a
>> >>>    chance,
>> >>>    if you could take a look at the trace and figure what is wrong
>> >>> that is
>> >>>    great.
>> >>>
>> >>>    I ended restoring from the checkpoint for two runs. One run
>> >>> stays in
>> >>>    atomic mode while the other switches to timing and detailed. The
>> >>> run
>> >>>    that stays in atomic mode works fine. This leads me to believe
>> >>>    that the
>> >>>    checkpoint restore mechanism is fine. The fault likely lies in the
>> >>>    switching to timing or detailed mdoe.
>> >>>
>> >>>    The differences between the runs:
>> >>>
>> >>>    (1) m5out-atomic-run-aftercheckpoint.tar.gz is a run that stays in
>> >>>    atomic mode (no switching to timing mode)
>> >>>
>> >>>    (2) m5out-timing-run-aftercheckpoint.tar.gz is a run that
>> >>> switches to
>> >>>    timing and then to detailed mode.
>> >>>
>> >>>
>> >>>    Thanks and good luck,
>> >>>
>> >>>    -Rick
>> >>>
>> >>>    Ali Saidi wrote:
>> >>>
>> >>>> Looking at the trace, it appears as though you just restored from a
>> >>>> checkpoint. Is this the case? If so, what does the checkpoint
>> >>>>
>> >>>    dropping
>> >>>
>> >>>> run do after that checkpoint is created? It's so early in the trace
>> >>>> that  I would guess it's a serialization bug, particularly in
>> >>>>
>> >>>    the TSO
>> >>>
>> >>>> code. However, I looked quickly at the code and and nothing
>> >>>>
>> >>>    seemed to
>> >>>
>> >>>> jump out at me. If you can provide me with an EthernetAll trace from
>> >>>> the checkpoint run and from the restored run I can work on figuring
>> >>>> out what the problem is.
>> >>>>
>> >>>> Ali
>> >>>>
>> >>>>
>> >>>>
>> >>>> On Jan 28, 2009, at 2:53 AM, Rick Strong wrote:
>> >>>>
>> >>>>
>> >>>>
>> >>>>>> There are three possibilities here:
>> >>>>>> a) A kernel bug
>> >>>>>> b) a device model/driver bug
>> >>>>>> c) a checkpointing bug (as it relates to (b))
>> >>>>>>
>> >>>>>> What kernel version are you using? Could you put the ethernet
>> >>>>>>
>> >>>    trace
>> >>>
>> >>>>>> somewhere so I could look at it?
>> >>>>>>
>> >>>>>> Ali
>> >>>>>>
>> >>>>>>
>> >>>>>>
>> >>>>> I am using the  kernel 2.6.18 with M5 patches.
>> >>>>>
>> >>>>> I have put the ethernet traces up at http://rickshin.ucsd.edu.
>> >>>>>
>> >>>    It is
>> >>>
>> >>>>> the
>> >>>>> only link. If you take a look, let me know what you think.
>> >>>>>
>> >>>>> Best,
>> >>>>> -Rick
>> >>>>>
>> >>>>>
>> >>>>> _______________________________________________
>> >>>>> m5-dev mailing list
>> >>>>> m5-dev@m5sim.org <mailto:m5-dev@m5sim.org>
>> >>>>> http://m5sim.org/mailman/listinfo/m5-dev
>> >>>>>
>> >>>>>
>> >>>>>
>> >>>> _______________________________________________
>> >>>> m5-dev mailing list
>> >>>> m5-dev@m5sim.org <mailto:m5-dev@m5sim.org>
>> >>>> http://m5sim.org/mailman/listinfo/m5-dev
>> >>>>
>> >>>>
>> >>>>
>> >>>    _______________________________________________
>> >>>    m5-dev mailing list
>> >>>    m5-dev@m5sim.org <mailto:m5-dev@m5sim.org>
>> >>>    http://m5sim.org/mailman/listinfo/m5-dev
>> >>>
>> >>>
>> >>>
>> ------------------------------------------------------------------------
>> >>>
>> >>> _______________________________________________
>> >>> m5-dev mailing list
>> >>> m5-dev@m5sim.org
>> >>> http://m5sim.org/mailman/listinfo/m5-dev
>> >>>
>> >>>
>> >> _______________________________________________
>> >> m5-dev mailing list
>> >> m5-dev@m5sim.org
>> >> http://m5sim.org/mailman/listinfo/m5-dev
>> >>
>> >>
>> >
>> > _______________________________________________
>> > m5-dev mailing list
>> > m5-dev@m5sim.org
>> > http://m5sim.org/mailman/listinfo/m5-dev
>> >
>> >
>>
>> _______________________________________________
>> m5-dev mailing list
>> m5-dev@m5sim.org
>> http://m5sim.org/mailman/listinfo/m5-dev
>>
>>
>
_______________________________________________
m5-dev mailing list
m5-dev@m5sim.org
http://m5sim.org/mailman/listinfo/m5-dev

Reply via email to