[gem5-users] Re: Stat dump after each N instructions - CPU and/or AMD GPU

2020-05-27 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - Internal Distribution Only] You could probably use the simpoint_start_insts vector param in CPU to have simulation exit to your python script and dump/reset stats there. For example: cpu.simpoint_start_insts = [x*N for x in range(1000)] where N is the instruction
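
A minimal sketch of the list comprehension suggested in the message above, written as plain Python so it can be checked outside gem5. The helper name `stat_dump_points` and the interval value are illustrative assumptions, not part of gem5; the gem5 calls in the comments (`m5.simulate`, `m5.stats.dump`, `m5.stats.reset`) only outline where such a list might be used in a config script.

```python
def stat_dump_points(interval, count=1000):
    """Return the instruction counts at which simulation should exit.

    Mirrors `[x*N for x in range(1000)]` from the message above, where
    `interval` plays the role of N (instructions between stat dumps).
    """
    if interval <= 0:
        raise ValueError("interval must be a positive instruction count")
    return [x * interval for x in range(count)]


# Hypothetical use inside a gem5 config script (not runnable standalone):
#   cpu.simpoint_start_insts = stat_dump_points(1_000_000)
#   exit_event = m5.simulate()   # returns each time a listed count is hit
#   m5.stats.dump()
#   m5.stats.reset()
#   ... repeat until the simulation exits for some other reason ...

# Small standalone demonstration with an assumed interval of 1M instructions.
points = stat_dump_points(1_000_000, count=4)
print(points)
```

Note that, as in the original expression, the first entry is 0, so the first exit would be requested at the very start of execution.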

[gem5-users] Re: gem5 GCN3 GPU model running issues

2020-11-06 Thread Poremba, Matthew via gem5-users
-style SLICC_HTML=True CCFLAGS_EXTRA="-DSYS_getdents -DSYS_getdents64" Cheers, Dan On Fri, Nov 6, 2020 at 5:25 PM Poremba, Matthew via gem5-users <gem5-users@gem5.org> wrote: [AMD Public Use] Looking into that syscall error now. I'm not quite sure yet how to fix the docker

[gem5-users] Re: gem5 GCN3 GPU model running issues

2020-11-06 Thread Poremba, Matthew via gem5-users
[AMD Public Use] Looking into that syscall error now. I'm not quite sure yet how to fix the docker image since python 3.5 is the latest version available for the distro needed. For now I disabled the check for 3.6 since it seems unnecessarily strict and doesn't break anything related to this

[gem5-users] Re: gem5 GCN3 GPU model running issues

2020-11-06 Thread Poremba, Matthew via gem5-users
[AMD Public Use] Hi Yichen, Based on the changes I see you've made, it seems like you are using an older version of gem5. These should all be fixed, including the error you are seeing, on the tip of develop. Keep in mind GCN3 was not officially part of the gem5 20.1 release, so the most up

[gem5-users] Re: gem5 GCN3 GPU model docker build issue

2021-03-09 Thread Poremba, Matthew via gem5-users
[AMD Public Use] Hi, Develop branch has the latest Dockerfile. Note that GCN3 won't be "officially" part of gem5 until 21.0 release (in a few weeks). -Matt -Original Message- From: xpf via gem5-users Sent: Monday, March 8, 2021 11:21 PM To: gem5-users@gem5.org Cc:

[gem5-users] Re: gem5 GCN GPU docker error

2021-09-22 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi Imad & Matt, I am seeing the same error as of this morning. Not sure quite what the issue is, but I suspect not everything is choosing a specific package version and something was updated in an apt repo or one of the repos rocBLAS's install.sh script pulls from.

[gem5-users] Re: gem5 GCN GPU docker error

2021-09-22 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi Imad, Yes, the docker seems to have broken in the past few days. Regarding the benchmark not completing, please change your command to use 3 CPUs: docker run --rm -v $PWD/gem5:/gem5 -v $PWD/gem5-resources:/gem5-resources \ -w /gem5

[gem5-users] Re: Full-system discrete GPU simulation

2021-10-01 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi Imad, It is still not supported on stable nor develop. We are submitting patches for this over time as a way to not overwhelm the (volunteer) reviewers. It will most likely be supported in the next gem5 release. -Matt From: Imad Al Assir via gem5-users Sent:

[gem5-users] Re: Full-system discrete GPU simulation

2021-10-04 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi Imad, Yes, you should be able to run DGPU in SE mode with gfx803 on the stable branch. On develop, gfx900 is also a dgpu option if you build VEGA_X86. -Matt From: Imad Al Assir Sent: Friday, October 1, 2021 4:47 PM To: Poremba, Matthew ; gem5 users mailing list

[gem5-users] Re: gem5 + DGPU (GCN3) build error

2022-03-04 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi David, I generally look at the shader_active_ticks stat for very high level performance comparisons. -Matt From: David Fong Sent: Friday, March 4, 2022 10:27 AM To: Poremba, Matthew ; gem5 users mailing list ; Bobby Bruce ; Matt Sinclair ; Kyle Roarty Subject:

[gem5-users] Re: gem5 + DGPU (GCN3) build error

2022-03-04 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi David, gfx801 is APU (e.g., "Carrizo") and gfx803 is dGPU (e.g., RX 4xx/5xx series). From a gem5 perspective, basically they are set up differently in how memory is laid out. Vega was recently added in 21.2 but from my notes not all of the DNNMark kernels are

[gem5-users] Re: gem5 + DGPU (GCN3) build error

2022-03-04 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi, I don't know if this is what is causing this specific forking problem, but gfx900 is VEGA not GCN3. There is a separate build for VEGA. If you want GCN3 dGPU you want gfx803. -Matt From: David Fong via gem5-users Sent: Friday, March 4, 2022 9:34 AM To: Bobby

[gem5-users] Re: gem5 : X86 + GCN3 (gfx801) + test_fwd_lrn

2022-03-17 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] Hi David, I don't think these are the parameters you want to be changing if you are trying to change the VRAM memory latency, which it seems like you are, based on the GDDR5 comment. Those parameters are for the latency between CUs seeing a memory request and the

[gem5-users] Re: gem5 : X86 + APU (gfx801) with CUs128 error with DNNMark test_fwd_softmax

2022-03-08 Thread Poremba, Matthew via gem5-users
[Public] Hi David, You are hitting the limit on the number of same MachineTypes in a Ruby network. You can change this by modifying the `build_opts/GCN_X86` file and adding a new line with `NUMBER_BITS_PER_SET = '128'`, or higher, and then recompile gem5. As far as I know there is not a

[gem5-users] Re: gem5 : X86 + GCN3 (gfx801) + test_fwd_lrn

2022-03-17 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only] These would be valid for both as they both use the same cache protocol files. I'm not very familiar with how dGPU is hacked up in SE mode to look like a dGPU... -Matt From: David Fong Sent: Thursday, March 17, 2022 9:57 AM To: Poremba, Matthew ; Matt Sinclair ;

[gem5-users] Re: Error in an application running on gem5 GCN3 (with apu_se.py)

2023-09-11 Thread Poremba, Matthew via gem5-users
[Public] Hi Anoop, That instruction was recently added to gem5, but for Vega ISA only: https://gem5-review.googlesource.com/c/public/gem5/+/67072 . It could be ported to GCN3 probably by copying the code exactly into the corresponding GCN3 files. You’ll notice however in that relation

[gem5-users] Re: Error in an application running on gem5 GCN3 (with apu_se.py)

2023-09-08 Thread Poremba, Matthew via gem5-users
[Public] Hi Anoop, Based on that register count, I am going to guess you built the application with -O0 or some other debugging flags? If you do this, the compiler makes some super large number of registers. I assume that is so a real GPU will not run any other applications simultaneously.

[gem5-users] Re: Not being able to execute GPU FS example 'hip_samples.py'

2023-09-08 Thread Poremba, Matthew via gem5-users
[Public] This is not the first time I am hearing about this issue. It seems stable needs to be hotfixed for GPU. For now, you can try the develop branch instead. It is tested quite well so it is relatively stable anyway despite the name. -Matt From: Pau Galindo Figuerola via gem5-users

[gem5-users] Re: GCN3 Full System

2023-09-19 Thread Poremba, Matthew via gem5-users
[Public] Hi Pau, It’s probably not possible without a lot of set up, but I have found that GPUs that are “not supported” sometimes still work anyway with certain compute stacks. You would probably need to: * Find a BIOS for a GCN3 GPU or rip it from a real GPU – This is so the driver

[gem5-users] Re: GPU-FS simulation progress

2022-12-07 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] I see, thanks again. The verification should have passed in that case... The docker image is *only* for building applications, so that the user does not need to install ROCm locally to build applications. You do *not* run gem5 simulations in a docker for

[gem5-users] Re: GPU-FS simulation progress

2022-12-07 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Thanks Rajesh, That is good to know. I don't think there is a list anywhere of which CPUs work with KVM. Which older ROCm do you mean here? Was square compiled with an older version? Ideally the verification should be passing as well. At least, it does on

[gem5-users] Re: KVM error while trying to build disk-image for GPU-FS

2022-12-06 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Rajesh, Thanks for the update. I'm glad you were able to get it worked out. Ideally we wouldn't *require* sudo access but for KVM in general I think it is going to highly depend on how the system was set up and there are some things that packer won't be

[gem5-users] Re: GPU-FS simulation progress

2022-12-06 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] At this point I would check if the other KVM scripts are working for you (there are some simple tests somewhere like boot Ubuntu and exit). KVM works on some CPUs better than others, I believe, or at least this was true in the past. I have a few other ideas

[gem5-users] Re: GPU-FS simulation progress

2022-12-06 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Rajesh, It looks like no progress has been made since a very early tick number (the timestamp printed by Linux is equal to the current simulation tick / 1 trillion). For reference it should take no more than 1-3 wall clock minutes to fully boot Linux and
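
The tick-to-timestamp relation mentioned in the message above can be sketched in a few lines of Python. This assumes the usual gem5 tick resolution of 1 ps (so 10^12 ticks per simulated second); the constant and function names are illustrative, not gem5 APIs.

```python
TICKS_PER_SECOND = 10**12  # assumption: one gem5 tick is 1 picosecond


def tick_to_linux_seconds(tick):
    """Convert a simulation tick to the seconds value Linux would print."""
    return tick / TICKS_PER_SECOND


# Example: a made-up tick value taken from simulator output.
example_tick = 2_500_000_000_000
example_seconds = tick_to_linux_seconds(example_tick)
print(example_seconds)
```

Comparing this value against the last timestamp on the guest terminal is one way to judge whether the guest has advanced since a given tick.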

[gem5-users] Re: Reply: Re: Reply: Re: Gem5 GCN3 (GPUCoalescer detected deadlock when running pagerank.)

2022-11-07 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, The rocclr, panic, and unimplemented instructions errors/warnings seem to be caused by this patch: https://gem5-review.googlesource.com/c/public/gem5/+/64831. It is likely the ROCm stack is taking a different code path with the different processor

[gem5-users] Re: Can't understand GPU-FS output

2023-04-11 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, 1. Full system mode in gem5 has "two" outputs. There is the simulator output (what you are showing in the email) and the terminal output. In SE mode these are combined into one output. What is being shown is the simulator output. This is really

[gem5-users] Re: Unavailability of GPU_RfO and GPU_VIPER_Region protocol in gem5 v21

2023-02-08 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, GPU_RfO and GPU_VIPER_Region were deprecated, mostly because there is no one to help maintain all of the GPU protocols, so we opted to focus on just one. I don't think there have been any Ruby/SLICC changes that would have broken the ability to build

[gem5-users] Re: Can I change GPU frequency on the fly while simulating gpu-fs?

2023-04-13 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hello, I don't know how to change the frequency or if it will work, but in your gem5 python script you can call "m5.simulate(10 * 1e12)" to simulate 10 seconds worth of ticks at a time and change the frequency after each call to that. -Matt From: Mejbaul
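
A rough sketch of the chunked-simulation loop described above. To keep it runnable without gem5, the simulate call and the per-chunk hook are passed in as functions; in a real config script the first would presumably be `m5.simulate` and the second would hold whatever frequency change is being attempted. `run_in_chunks`, `between_chunks`, and the 1 ps tick assumption are illustrative and not part of gem5.

```python
TICKS_PER_SECOND = 10**12  # assumption: one gem5 tick is 1 picosecond


def run_in_chunks(simulate, between_chunks, chunk_seconds, num_chunks):
    """Call `simulate(ticks)` repeatedly, invoking a hook after each call."""
    chunk_ticks = int(chunk_seconds * TICKS_PER_SECOND)
    for index in range(num_chunks):
        exit_event = simulate(chunk_ticks)
        # Hypothetical hook: this is where a frequency change would go.
        between_chunks(index, exit_event)
    return chunk_ticks


# Stand-ins so the sketch runs without gem5.
calls = []
run_in_chunks(
    simulate=lambda ticks: calls.append(ticks) or "fake-exit-event",
    between_chunks=lambda index, event: None,
    chunk_seconds=10,
    num_chunks=3,
)
print(calls)
```

A real script would likely also inspect the returned exit event and stop early when the workload finishes before the tick limit is reached.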

[gem5-users] Re: Persistent memory with gem5

2023-05-22 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, By compiled as a module, the Linux `.config` file should have CONFIG_PMEM_xxx=y (i.e., not …=m). I did a quick test with the ubuntu20 disk image and kernel downloaded from the gem5 website. I just added the `memmap=4G!12G` kernel parameter to
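
For reference, the Linux `memmap=nn[KMG]!ss[KMG]` form marks `nn` bytes of memory starting at address `ss` as persistent memory, so `memmap=4G!12G` describes a 4 GiB region starting at 12 GiB. The sketch below only builds that string; the helper name and the placeholder base argument are assumptions for illustration.

```python
def pmem_memmap_arg(size_gib, start_gib):
    """Build a `memmap=<size>G!<start>G` kernel parameter string."""
    if size_gib <= 0 or start_gib < 0:
        raise ValueError("size must be positive and start non-negative")
    return f"memmap={size_gib}G!{start_gib}G"


# Placeholder base argument; the real kernel command line depends on the
# config script being used.
kernel_args = ["console=ttyS0", pmem_memmap_arg(4, 12)]
kernel_cmdline = " ".join(kernel_args)
print(kernel_cmdline)
```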

[gem5-users] Re: gem5 VEGA_X86 simulation with GPU support

2023-08-02 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, Can you show the output you removed? What is being printed right before the crash? Thanks, Matt From: Matt Sinclair Sent: Sunday, July 23, 2023 10:37 AM To: The gem5 Users mailing list Cc: l...@163.com <17861509...@163.com>; Poremba, Matthew

[gem5-users] Re: Replacing CPU model in GPU-FS

2023-06-30 Thread Poremba, Matthew via gem5-users
[Public] Hi, No worries about the questions! I will try to answer them all, so this will be a long email. The disconnected (or disjoint) Ruby network is essentially the same as the APU Ruby network used in SE mode - That is, it combines two Ruby protocols in one protocol (MOESI_AMD_base

[gem5-users] Re: GPU-FS simulation progress

2023-06-23 Thread Poremba, Matthew via gem5-users
[Public] For some reason I cannot see the original email Matt is replying to, but m5_exit is the normal exit status. Just as a reminder, the gem5 output does not give any indication about whether or not your application running *in* gem5 completed successfully. You will need to check the

[gem5-users] Re: bad ioctl error in gpu_comput_driver.cc

2023-06-20 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, SE mode supports ROCm 4.0 only. What version of ROCm is installed on the host machine? If they are different the IOCTLs might be as well. You can alternately try full system with GPU if you want to avoid docker. -Matt From: Anoop Mysore via

[gem5-users] Re: Persistent memory with gem5

2023-05-20 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi, Are you building pmem as a module as described in the blog? (“PMEM: Persistent memory block device support”) If so, I would try building it into the kernel directly. It is possibly looking for the module for your compiled kernel and does not find it

[gem5-users] Re: Fail to run gpu-fs

2023-12-19 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Sandy, Could you share the file “m5out/system.pc.com_1.device” as well? You could also try using vega10_atomic.py instead of vega10_kvm.py. Initially it looks to me like a KVM issue. -Matt From: Matt Sinclair Sent: Tuesday, December 19, 2023 9:28 AM

[gem5-users] Re: Reply: Re: Fail to run gpu-fs

2024-01-02 Thread Poremba, Matthew via gem5-users
Hope it works! Regards, Pau On Tue, 19 Dec 2023 18:57, Poremba, Matthew via gem5-users <gem5-users@gem5.org> wrote: [AMD Official Use Only - General] Hi Sandy, Could you share the file “m5out/system.pc.com_1.device” as well? You could also try using vega10_atomic.py instead o

[gem5-users] Re: RENAME: HELP Needed for Running Benchmarks in GPU Full System Simulation

2024-01-02 Thread Poremba, Matthew via gem5-users
[Public] Hi Sandy, Depending on the benchmark, OpenCL might do an online compile (i.e., compile the kernels right before running them). If you are using KVM it should just work. Otherwise, the online compilation will take a significant amount of simulation time and offline compiling would

[gem5-users] Re: Issues modifying parameters in GPU FS

2024-01-29 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Pau, The dgpu_mem_size parameter will only change the memory size for gem5 while the GPU driver uses an MMIO register value to determine the memory size. The issue you are seeing is the driver thinks there is still 16GB of memory and it attempts to write

[gem5-users] Re: GCN3 Full System

2024-04-26 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Pau, You will likely need root access to rip the BIOS from a real GPU. If you have that, I posted my notes below. Keep in mind the PCI ids might be different: “ You can find devices with ROM using `find /sys/devices -name rom` For example, my MI210 is
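
A Python approximation of the `find /sys/devices -name rom` step quoted above, for anyone scripting the search. It is a sketch only: `find_rom_files` is a made-up helper, it reports only non-directory entries named `rom`, and the demonstration runs against a temporary directory rather than a real sysfs tree.

```python
import os
import tempfile


def find_rom_files(root="/sys/devices"):
    """Return sorted paths of entries named `rom` under `root`."""
    matches = []
    for dirpath, _dirnames, filenames in os.walk(root):
        if "rom" in filenames:
            matches.append(os.path.join(dirpath, "rom"))
    return sorted(matches)


# Demonstration on a throwaway directory with a fake device entry.
with tempfile.TemporaryDirectory() as tmp:
    device_dir = os.path.join(tmp, "fake_pci_device")
    os.makedirs(device_dir)
    open(os.path.join(device_dir, "rom"), "w").close()
    found = find_rom_files(tmp)
    demo_ok = found == [os.path.join(device_dir, "rom")]
    print(found)
```

On a real machine, calling `find_rom_files()` with the default root would list candidate ROM files; reading them typically requires root access, as the message notes.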

[gem5-users] Re: GCN3 Full System

2024-04-29 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Pau, For an APU, I think you can get the bios from the vga rom region in memory. There is a util file for this in gem5: util/amdgpu/fs_files/dump_gpu_rom.sh . However, if you have another GPU such as integrated graphics on the motherboard, it might get

[gem5-users] Re: Disk Image Management

2024-05-04 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hi Pau, Typically, we use docker to compile the binaries. This way you don’t need to have the libraries installed on the host machine. You can find the official docker image to build GPU applications here:

[gem5-users] Re: gem5 cpu-gpu heterogeneous NoC

2024-05-07 Thread Poremba, Matthew via gem5-users
[AMD Official Use Only - General] Hello, If you want to run CPU+GPU right now you will probably need to use SE mode. Timing CPUs are not supported in GPUFS configs so the CPU network is completely ignored, as KVM/atomic CPUs bypass the network. If that doesn’t matter, then you will need to create a

[gem5-users] Re: GPU FS Multiple CPU

2024-03-22 Thread Poremba, Matthew via gem5-users
[Public] Hi Pau, From: Pau Galindo Figuerola via gem5-users Sent: Friday, March 22, 2024 10:47 AM To: The gem5 Users mailing list Cc: Pau Galindo Figuerola Subject: [gem5-users] GPU FS Multiple CPU

[gem5-users] Re: GPU FS Multiple CPU

2024-03-22 Thread Poremba, Matthew via gem5-users
[Public] Hi Pau, Does the host system have 4 CPUs available for KVM to use? I have seen similar errors occasionally with even 2 CPUs and simply rerunning the simulation seemed to fix it. Unfortunately, I am not a KVM expert so I am not entirely sure how to make it more robust. -Matt