Hi Tsung,

Can you tell us a bit more about what exactly you changed?  For example,

- what exactly was causing the segfault such that changing all of these
fields solves the problem?
- what did you change these fields to?

Depending on the above answers, it may be worth submitting a patch.

Thanks,
Matt

On Mon, Aug 6, 2018 at 8:19 AM, Tsungtai Yeh <[email protected]> wrote:

> I found I have to change the following options to increase the number of
> CUs in GCN_X86.
>
> --num-compute-units
>
> --cu-per-sqc
>
> --cu-per-scalar-cache
>
> --simds-per-cu
>
> --cu-per-sa
>
>
>
> ________________________________
> From: Tsungtai Yeh
> Sent: Saturday, July 28, 2018 8:40:30 PM
> To: [email protected]
> Subject: Re: [gem5-dev] How to increase the default CU counts in GCN_X86
> gem5-apu?
>
> I changed the default value of “—num-compute-units” to 8 in apu_se.py. I
> still got the segmentation fault. The following is the trace from gem5
> simulation. The 8 CUs are initialized successfully, but get the failure
> during the instruction fetching stage. — Tsung Tai
> parser.add_option("-u", "--num-compute-units", type="int", default=8,
>                   help="number of GPU compute units")
>
> 51620963000: system.cpu2.CUs0: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs0: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs0: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs0: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs0: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs1: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs1: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs1: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs1: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs1: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs2: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs2: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs2: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs2: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs2: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs3: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs3: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs3: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs3: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs3: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs4: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs4: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs4: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs4: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs4: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs5: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs5: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs5: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs5: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs5: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs6: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs6: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs6: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs6: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs6: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu2.CUs7: trueWgSize[0] =  256
> 51620963000: system.cpu2.CUs7: trueWgSize[1] =  1
> 51620963000: system.cpu2.CUs7: trueWgSize[2] =  1
> 51620963000: system.cpu2.CUs7: trueWgSizeTotal =  256
> 51620963000: system.cpu2.CUs7: Free WF slots =  36, Mapped WFs = 0,
>      VGPR Availability = 0, SGPR Availability = 0
> 51620963000: system.cpu0.workload.drivers.device.dispatcher: kernel 0
> failed to launch
> 51620963000: system.cpu0.workload.drivers.device.dispatcher: Returning 0
> Kernels
> 51620964000: global: WF[0][0]: Id28 reserved fetch buffer entry for PC =
> 0x7ffeebde9100
> 51620964000: global: CU7: WF[0][0]: Id28: Initiate fetch from pc:
> 140732855652608 0x7ffeebde9100
> 51620964000: global: CU7: WF[0][0]: Initiating fetch translation:
> 0x7ffeebde9100
> gem5 has encountered a segmentation fault!
> ________________________________
> From: Tsungtai Yeh
> Sent: Friday, July 27, 2018 7:20:55 AM
> To: [email protected]
> Cc: [email protected]
> Subject: [gem5-dev] How to increase the default CU counts in GCN_X86
> gem5-apu?
>
>
> I tried to increase the CU counts to 32 in the gem5-apu. However, I found
> I could not only increase "--num-compute-units" default count (4). Do you
> know any other parameters I also need to change when increasing the CU
> counts? Thank you.
> _______________________________________________
> gem5-dev mailing list
> [email protected]
> http://m5sim.org/mailman/listinfo/gem5-dev
>
_______________________________________________
gem5-dev mailing list
[email protected]
http://m5sim.org/mailman/listinfo/gem5-dev

Reply via email to