store routines

BALATON Zoltan Mon, 24 Jun 2024 02:30:19 -0700

On Mon, 24 Jun 2024, Mark Cave-Ayland wrote:

On 23/06/2024 16:23, BALATON Zoltan wrote:
On Sun, 23 Jun 2024, Mark Cave-Ayland wrote:
Now that do_unaligned_access has been implemented for 68k CPUs, pass the required alignment into the TCG memory load/store routines. This allows the TCG memory core to generate an Address Error exception for unaligned memory accesses if required.
Suggested-by: Laurent Vivier <laur...@vivier.eu>
Signed-off-by: Mark Cave-Ayland <mark.cave-ayl...@ilande.co.uk>
Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2165
---
target/m68k/translate.c | 18 +++++++++++++++---
1 file changed, 15 insertions(+), 3 deletions(-)

diff --git a/target/m68k/translate.c b/target/m68k/translate.c
index 445966fb6a..661a7b4def 100644
--- a/target/m68k/translate.c
+++ b/target/m68k/translate.c
@@ -303,13 +303,18 @@ static inline TCGv gen_load(DisasContext *s, int opsize, TCGv addr,
                            int sign, int index)
{
    TCGv tmp = tcg_temp_new_i32();
+    MemOp memop = opsize | (sign ? MO_SIGN : 0) | MO_TE;

    switch (opsize) {
    case OS_BYTE:
+        tcg_gen_qemu_ld_tl(tmp, addr, index, memop);
+        break;
    case OS_WORD:
    case OS_LONG:
-        tcg_gen_qemu_ld_tl(tmp, addr, index,
-                           opsize | (sign ? MO_SIGN : 0) | MO_TE);
+        if (!m68k_feature(s->env, M68K_FEATURE_UNALIGNED_DATA)) {
+            memop |= MO_ALIGN_2;
+        }
+        tcg_gen_qemu_ld_tl(tmp, addr, index, memop);
You could swap the order of these so byte comes last and fall through to it from word/long to avoid the duplicated line.
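For illustration, the reordering suggested here could look roughly like the sketch below. It is not part of the patch: the enum values are hypothetical stand-ins rather than QEMU's real MemOp encoding, sketch_load_memop() and its unaligned_ok parameter are made-up names standing in for gen_load() and the m68k_feature() check, and the default case is an assumption. The real code would issue the single tcg_gen_qemu_ld_tl() call in the shared OS_BYTE case.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for QEMU's definitions. The values are
 * illustrative only and are NOT the real MemOp encoding. */
enum { OS_BYTE = 0, OS_WORD = 1, OS_LONG = 2 };
enum { MO_SIGN = 1 << 3, MO_TE = 1 << 4, MO_ALIGN_2 = 1 << 5 };

/* Hypothetical helper: computes the memop flags the way gen_load() would,
 * with word/long falling through to the byte case. */
static int sketch_load_memop(int opsize, bool sign, bool unaligned_ok)
{
    int memop = opsize | (sign ? MO_SIGN : 0) | MO_TE;

    switch (opsize) {
    case OS_WORD:
    case OS_LONG:
        /* Only word and long accesses can be misaligned. */
        if (!unaligned_ok) {
            memop |= MO_ALIGN_2;
        }
        /* fall through */
    case OS_BYTE:
        /* The one tcg_gen_qemu_ld_tl(tmp, addr, index, memop) call
         * would go here, shared by all three sizes. */
        break;
    default:
        /* Assumed: any other operand size is a caller bug. */
        assert(0 && "unexpected operand size");
    }
    return memop;
}
```

With this ordering the load call appears once, and byte accesses never pick up the alignment flag.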
Maybe this answers my question about where it's restricted by CPU type. I wonder if this check for M68K_FEATURE_UNALIGNED_DATA could be avoided here and done in init instead, setting the unaligned method only for CPUs that need it, so as not to add overhead for most CPUs that don't need it.
I don't think that it matters too much if the method isn't implemented, as the logic surrounding when to call do_unaligned_access is contained within the TCG core.

I saw this after I sent a patch for PPC, where removing a conditional from a helper that is often called for memory access turned out to have an effect on performance. So I thought adding a conditional here might cause some slowdown for CPUs that don't need this. But this is at compile time, so maybe it's not as big a problem as in a helper. Yet adding the unaligned method only for CPUs that need it and then always setting MO_ALIGN here would avoid the if and only call the method when needed, if that works at all. I don't know if that's worth it; you could test with some memory-intensive benchmark such as the STREAM benchmark to check if this has any effect.
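As a rough idea of what such a test could measure, below is a minimal sketch of a STREAM-style "triad" kernel. It is not the official STREAM benchmark: the function names, the array contents and the use of clock() are all assumptions made for illustration. The idea would be to build it for the guest and compare timings under QEMU with and without the patch.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <time.h>

/* Triad kernel in the style of STREAM: a[i] = b[i] + scalar * c[i]. */
static void triad(double *a, const double *b, const double *c,
                  double scalar, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        a[i] = b[i] + scalar * c[i];
    }
}

/* Hypothetical driver: returns the best CPU time in seconds over `reps`
 * runs of the kernel, or -1.0 if n is 0, reps < 1 or allocation fails. */
static double time_triad(size_t n, int reps)
{
    if (n == 0 || reps < 1) {
        return -1.0;
    }

    double *a = malloc(n * sizeof(*a));
    double *b = malloc(n * sizeof(*b));
    double *c = malloc(n * sizeof(*c));
    double best = -1.0;

    if (a && b && c) {
        for (size_t i = 0; i < n; i++) {
            b[i] = 1.0;
            c[i] = 2.0;
        }
        for (int r = 0; r < reps; r++) {
            clock_t t0 = clock();
            triad(a, b, c, 3.0, n);
            clock_t t1 = clock();
            double secs = (double)(t1 - t0) / CLOCKS_PER_SEC;
            if (best < 0.0 || secs < best) {
                best = secs;
            }
        }
        /* Check one element so the result is actually used: 1 + 3 * 2 = 7. */
        assert(a[n / 2] == 7.0);
    }

    free(a);
    free(b);
    free(c);
    return best;
}
```

The arrays should be large enough that the loop is dominated by loads and stores; something like time_triad(1000000, 10) in both builds would be one way to compare, though the sizes are only a guess.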


Regards,
BALATON Zoltan

I'll have a go at updating the ordering and send a v2 if it looks good.


ATB,

Mark.

Re: [PATCH 2/2] target/m68k: pass alignment into TCG memory load/store routines
