[AMD Official Use Only - Internal Distribution Only]
You could probably use the simpoint_start_insts vector param in CPU to have
simulation exit to your python script and dump/reset stats there. For example:
cpu.simpoint_start_insts = [x*N for x in range(1000)]
where N is the instruction
-style SLICC_HTML=True
CCFLAGS_EXTRA="-DSYS_getdents -DSYS_getdents64"
Cheers,
Dan
On Fri, Nov 6, 2020 at 5:25 PM Poremba, Matthew via gem5-users
mailto:gem5-users@gem5.org>> wrote:
[AMD Public Use]
Looking into that syscall error now.
I'm not quite sure yet how to fix the docker
[AMD Public Use]
Looking into that syscall error now.
I'm not quite sure yet how to fix the docker image since python 3.5 is the
latest version available for the distro needed. For now I disabled the check
for 3.6 since it seems unnecessarily strict and doesn't break anything related
to this
[AMD Public Use]
Hi Yichen,
Based on the changes I see you've made, it seems like you are using an older
version of gem5. These should all be fixed, including the error you are
seeing, on the tip of develop.
Keep in mind GCN3 was not officially part of the gem5 20.1 release, so the most
up
[AMD Public Use]
Hi,
Develop branch has the latest Dockerfile. Note that GCN3 won't be "officially"
part of gem5 until 21.0 release (in a few weeks).
-Matt
-Original Message-
From: xpf via gem5-users
Sent: Monday, March 8, 2021 11:21 PM
To: gem5-users@gem5.org
Cc:
[AMD Official Use Only]
Hi Imad & Matt,
I am seeing the same error as of this morning. Not sure quite what the issue
is, but I suspect not everything is choosing a specific package version and
something was updated in an apt repo or one of the repos rocBLAS's install.sh
script pulls from.
[AMD Official Use Only]
Hi Imad,
Yes, the docker seems to have broken in the past few days.
Regarding the benchmark not completing, please change your command to use 3
CPUs:
docker run --rm -v $PWD/gem5:/gem5 -v $PWD/gem5-resources:/gem5-resources \
-w /gem5
[AMD Official Use Only]
Hi Imad,
It is still not supported on stable nor develop. We are submitting patches for
this over time as a way to not overwhelm the (volunteer) reviewers. It will
most likely be supported in the next gem5 release.
-Matt
From: Imad Al Assir via gem5-users
Sent:
[AMD Official Use Only]
Hi Imad,
Yes, you should be able to run DGPU in SE mode with gfx803 on the stable
branch. On develop, gfx900 is also a dgpu option if you build VEGA_X86.
-Matt
From: Imad Al Assir
Sent: Friday, October 1, 2021 4:47 PM
To: Poremba, Matthew ; gem5 users mailing list
[AMD Official Use Only]
Hi David,
I generally look at the shader_active_ticks stat for very high level
performance comparisons.
-Matt
From: David Fong
Sent: Friday, March 4, 2022 10:27 AM
To: Poremba, Matthew ; gem5 users mailing list
; Bobby Bruce ; Matt Sinclair
; Kyle Roarty
Subject:
[AMD Official Use Only]
Hi David,
gfx801 is APU (e.g., "Carrizo") and gfx803 is dGPU (e.g., RX 4xx/5xx series).
From a gem5 perspective, basically they are setup differently in how memory is
laid out.
Vega was recently added in 21.2 but from my notes not all of the DNNMark
kernels are
[AMD Official Use Only]
Hi,
I don't know if this is what is causing this specific forking problem, but
gfx900 is VEGA not GCN3. There is a separate build for VEGA. If you want GCN3
dGPU you want gfx803.
-Matt
From: David Fong via gem5-users
Sent: Friday, March 4, 2022 9:34 AM
To: Bobby
[AMD Official Use Only]
Hi David,
I don't think these are the parameters you want to be changing if you are
trying to change the VRAM memory latency which it seems like you are based on
the GDDR5 comment. Those parameters are for the latency between CUs seeing a
memory request and the
[Public]
Hi David,
You are hitting the limit on the number of same MachineTypes in a Ruby network.
You can change this by modifying the `build_opts/GCN_X86` file and adding a
new line with `NUMBER_BITS_PER_SET = '128'`, or higher, and then recompile
gem5. As far as I know there is not a
[AMD Official Use Only]
These would be valid for both as they both use the same cache protocol files.
I'm not very familiar with how dGPU is hacked up in SE mode to look like a
dGPU...
-Matt
From: David Fong
Sent: Thursday, March 17, 2022 9:57 AM
To: Poremba, Matthew ; Matt Sinclair
;
[Public]
Hi Anoop,
That instruction was recently added to gem5, but for Vega ISA only:
https://gem5-review.googlesource.com/c/public/gem5/+/67072 . It could be
ported to GCN3 probably by copying the code exactly into the corresponding GCN3
files. You’ll notice however in that relation
[Public]
Hi Anoop,
Based on that register count, I am going to guess you built the application
with -O0 or some other debugging flags? If you do this, the compiler makes
some super large number of registers. I assume that is so a real GPU will not
run any other applications simultaneously.
[Public]
This is not the first time I am hearing about this issue. It seems stable
needs to be hotfixed for GPU.
For now, you can try the develop branch instead. It is tested quite well so it
is relatively stable anyway despite the name.
-Matt
From: Pau Galindo Figuerola via gem5-users
[Public]
Hi Pau,
It’s probably not possible without a lot of set up, but I have found that GPUs
that are “not supported” sometimes still work anyway with certain compute
stacks. You would probably need to:
* Find a BIOS for a GCN3 GPU or rip it from a real GPU – This is so the
driver
[AMD Official Use Only - General]
I see, thanks again. The verification should have passed it that case...
The docker image is *only* for building applications, so that the user does not
need to install ROCm locally to build applications. You do *not* run gem5
simulations in a docker for
[AMD Official Use Only - General]
Thanks Rajesh,
That is good to know. I don't think there is an list anywhere of which CPUs
work with KVM.
Which older ROCm do you mean here? Was square compiled with an older version?
Ideally the verification should be passing as well. At least, it does on
[AMD Official Use Only - General]
Hi Rajesh,
Thanks for the update. I'm glad you were able to get it worked out. Ideally
we wouldn't *require* sudo access but for KVM in general I think it is going to
highly depend on how the system was setup and there are some things that packer
won't be
[AMD Official Use Only - General]
At this point I would check if the other KVM scripts are working for you (there
are some simple tests somewhere like boot Ubuntu and exit). KVM works on some
CPUs better than others, I believe, or at least this was true in the past. I
have a few other ideas
[AMD Official Use Only - General]
Hi Rajesh,
I looks like no progress has been made since a very early tick number (the
timestamp print by Linux is equal to the current simulation tick / 1 trillion).
For reference it should take no more than 1-3 wall clock minutes to full boot
Linux and
[AMD Official Use Only - General]
Hi,
The rocclr, panic, and unimplemented instructions errors/warnings seem to be
caused by this patch:
https://gem5-review.googlesource.com/c/public/gem5/+/64831. It is likely the
ROCm stack is taking a different code path with the different processor
[AMD Official Use Only - General]
Hi,
1. Full system mode in gem5 has "two" outputs. There is the simulator output
(what you are showing in the email) and the terminal output. In SE mode these
are combined into one output. What is being shown in the simulator output.
This is really
[AMD Official Use Only - General]
Hi,
GPU_RfO and GPU_VIPER_Region were deprecated, mostly because there is no one to
help maintain all of the GPU protocols, so we opted to focus on just one. I
don't think there have been any Ruby/SLICC changes that would have broken the
ability to build
[AMD Official Use Only - General]
Hello,
I don't know how to change the frequency or if it will work, but in your gem5
python script you can call "m5.simulate(10 * 1e12)" to simulate 10 seconds
worth of ticks at a time and change the frequency after each call to that.
-Matt
From: Mejbaul
[AMD Official Use Only - General]
Hi,
By compiled as a module, the Linux `.config` file should have CONFIG_PMEM_xxx=y
(i.e., not …=m).
I did a quick test with the ubuntu20 disk image and kernel downloaded from the
gem5 website. I just added the `memmap=4G!12G` kernel parameter to
[AMD Official Use Only - General]
Hi,
Can you show the output you removed? What is being printed right before the
crash?
Thanks,
Matt
From: Matt Sinclair
Sent: Sunday, July 23, 2023 10:37 AM
To: The gem5 Users mailing list
Cc: l...@163.com <17861509...@163.com>; Poremba, Matthew
[Public]
Hi,
No worries about the questions! I will try to answer them all, so this will be
a long email :
The disconnected (or disjoint) Ruby network is essentially the same as the APU
Ruby network used in SE mode - That is, it combines two Ruby protocols in one
protocol (MOESI_AMD_base
[Public]
For some reason I cannot see the original email Matt is replying to, but
m5_exit is the normal exit status.
Just as a reminder, the gem5 output does not give any indication about whether
or not your application running *in* gem5 completed successfully. You will
need to check the
[AMD Official Use Only - General]
Hi,
SE mode supports ROCm 4.0 only. What version of ROCm is installed on the host
machine? If they are different the IOCTLs might be as well.
You can alternately try full system with GPU if you want to avoid docker.
-Matt
From: Anoop Mysore via
[AMD Official Use Only - General]
Hi,
Are you building pmem as a module as described in the blog? (“ PMEM:
Persistent memory block device support”) If so, I would try building it into
the kernel directly. It is possibly looking for the module for your compiled
kernel and does not find it
[AMD Official Use Only - General]
Hi Sandy,
Could you share the file “m5out/system.pc.com_1.device” as well?
You could also try using vega10_atomic.py instead of vega10_kvm.py. Initially
it looks to me like a KVM issue.
-Matt
From: Matt Sinclair
Sent: Tuesday, December 19, 2023 9:28 AM
ope it works!
Regards,
Pau
El mar, 19 dic 2023 18:57, Poremba, Matthew via gem5-users
mailto:gem5-users@gem5.org>> escribió:
[AMD Official Use Only - General]
Hi Sandy,
Could you share the file “m5out/system.pc.com_1.device” as well?
You could also try using vega10_atomic.py instead o
[Public]
Hi Sandy,
Depending on the benchmark, OpenCL might do an online compile (i.e., compile
the kernels right before running them). If you are using KVM it should just
work. Otherwise, the online compilation will take a significant amount of
simulation time and offline compiling would
[AMD Official Use Only - General]
Hi Pau,
The dgpu_mem_size parameter will only change the memory size for gem5 while the
GPU driver uses an MMIO register value to determine the memory size. The issue
you are seeing is the driver thinks there is still 16GB of memory and it
attempts to write
[AMD Official Use Only - General]
Hi Pau,
You will likely need root access to rip the BIOS from a real GPU. If you have
that, I posted my notes below. Keep in mind the PCI ids might be different:
“
You can find devices with ROM using `find /sys/devices -name rom`
For example, my MI210 is
[AMD Official Use Only - General]
Hi Pau,
For an APU, I think you can get the bios from the vga rom region in memory.
There is a util file for this in gem5: util/amdgpu/fs_files/dump_gpu_rom.sh .
However, if you have another GPU such as integrated graphics on the
motherboard, it might get
[AMD Official Use Only - General]
Hi Pau,
Typically, we use docker to compile the binaries. This way you don’t need to
have the libraries installed on the host machine. You can find the official
docker image to build GPU applications here:
[AMD Official Use Only - General]
Hello,
If you want to run CPU+GPU right now you will probably need to use SE mode.
Timing CPUs are not supported in GPUFS configs so the CPU network is completely
ignored as KVM/atomic bypass network. If that doesn’t matter, then you will
need to create a
[Public]
Hi Pau,
From: Pau Galindo Figuerola via gem5-users
Sent: Friday, March 22, 2024 10:47 AM
To: The gem5 Users mailing list
Cc: Pau Galindo Figuerola
Subject: [gem5-users] GPU FS Multiple CPU
Caution: This message originated from an External Source. Use proper caution
when opening
[Public]
Hi Pau,
Does the host system have 4 CPUs available for KVM to use? I have seen similar
errors occasionally with even 2 CPUs and simply rerunning the simulation seemed
to fix it. Unfortunately, I am not a KVM expert so I am not entirely sure how
to make it more robust.
-Matt
44 matches
Mail list logo