On Wed, Aug 26, 2020 at 10:39 AM Frank Chang <frank.ch...@sifive.com> wrote: > > On Thu, Aug 27, 2020 at 12:56 AM Alistair Francis <alistai...@gmail.com> > wrote: >> >> On Tue, Aug 25, 2020 at 1:29 AM Frank Chang <frank.ch...@sifive.com> wrote: >> > >> > On Mon, Aug 17, 2020 at 4:50 PM <frank.ch...@sifive.com> wrote: >> >> >> >> From: Frank Chang <frank.ch...@sifive.com> >> >> >> >> This patchset implements the vector extension v1.0 for RISC-V on QEMU. >> >> >> >> This patchset is sent as RFC because RVV v1.0 is still in draft state. >> >> v2 patchset was sent for RVV v0.9 and bumped to RVV v1.0 since v3 >> >> patchset. >> >> >> >> The port is available here: >> >> https://github.com/sifive/qemu/tree/rvv-1.0-upstream-v4 >> >> >> >> You can change the cpu argument: vext_spec to v1.0 (i.e. vext_spec=v1.0) >> >> to run with RVV v1.0 instructions. >> >> >> >> Note: This patchset depends on two other patchsets listed in Based-on >> >> section below so it might not able to be built unless those two >> >> patchsets are applied. >> >> >> >> Changelog: >> >> >> >> v4 >> >> * remove explicit float flmul variable in DisasContext. >> >> * replace floating-point calculations with shift operations to >> >> improve performance. >> >> * relax RV_VLEN_MAX to 512-bits. >> >> >> >> v3 >> >> * apply nan-box helpers from Richard Henderson. >> >> * remove fp16 api changes as they are sent independently in another >> >> pathcset by Chih-Min Chao. >> >> * remove all tail elements clear functions as tail elements can >> >> retain unchanged for either VTA set to undisturbed or agnostic. >> >> * add fp16 nan-box check generator function. >> >> * add floating-point rounding mode enum. >> >> * replace flmul arithmetic with shifts to avoid floating-point >> >> conversions. >> >> * add Zvqmac extension. >> >> * replace gdbstub vector register xml files with dynamic generator. >> >> * bumped to RVV v1.0. >> >> * RVV v1.0 related changes: >> >> * add vl<nf>re<eew>.v and vs<nf>r.v vector whole register >> >> load/store instructions >> >> * add vrgatherei16 instruction. >> >> * rearranged bits in vtype to make vlmul bits into a contiguous >> >> field. >> >> >> >> v2 >> >> * drop v0.7.1 support. >> >> * replace invisible return check macros with functions. >> >> * move mark_vs_dirty() to translators. >> >> * add SSTATUS_VS flag for s-mode. >> >> * nan-box scalar fp register for floating-point operations. >> >> * add gdbstub files for vector registers to allow system-mode >> >> debugging with GDB. >> >> >> >> Based-on: <20200724002807.441147-1-richard.hender...@linaro.org/> >> >> Based-on: <1596102747-20226-1-git-send-email-chihmin.c...@sifive.com/> >> >> >> >> Frank Chang (62): >> >> target/riscv: drop vector 0.7.1 and add 1.0 support >> >> target/riscv: Use FIELD_EX32() to extract wd field >> >> target/riscv: rvv-1.0: introduce writable misa.v field >> >> target/riscv: rvv-1.0: remove rvv related codes from fcsr registers >> >> target/riscv: rvv-1.0: check MSTATUS_VS when accessing vector csr >> >> registers >> >> target/riscv: rvv-1.0: remove MLEN calculations >> >> target/riscv: rvv-1.0: add fractional LMUL >> >> target/riscv: rvv-1.0: add VMA and VTA >> >> target/riscv: rvv-1.0: update check functions >> >> target/riscv: introduce more imm value modes in translator functions >> >> target/riscv: rvv:1.0: add translation-time nan-box helper function >> >> target/riscv: rvv-1.0: configure instructions >> >> target/riscv: rvv-1.0: stride load and store instructions >> >> target/riscv: rvv-1.0: index load and store instructions >> >> target/riscv: rvv-1.0: fix address index overflow bug of indexed >> >> load/store insns >> >> target/riscv: rvv-1.0: fault-only-first unit stride load >> >> target/riscv: rvv-1.0: amo operations >> >> target/riscv: rvv-1.0: load/store whole register instructions >> >> target/riscv: rvv-1.0: update vext_max_elems() for load/store insns >> >> target/riscv: rvv-1.0: take fractional LMUL into vector max elements >> >> calculation >> >> target/riscv: rvv-1.0: floating-point square-root instruction >> >> target/riscv: rvv-1.0: floating-point classify instructions >> >> target/riscv: rvv-1.0: mask population count instruction >> >> target/riscv: rvv-1.0: find-first-set mask bit instruction >> >> target/riscv: rvv-1.0: set-X-first mask bit instructions >> >> target/riscv: rvv-1.0: iota instruction >> >> target/riscv: rvv-1.0: element index instruction >> >> target/riscv: rvv-1.0: allow load element with sign-extended >> >> target/riscv: rvv-1.0: register gather instructions >> >> target/riscv: rvv-1.0: integer scalar move instructions >> >> target/riscv: rvv-1.0: floating-point move instruction >> >> target/riscv: rvv-1.0: floating-point scalar move instructions >> >> target/riscv: rvv-1.0: whole register move instructions >> >> target/riscv: rvv-1.0: integer extension instructions >> >> target/riscv: rvv-1.0: single-width averaging add and subtract >> >> instructions >> >> target/riscv: rvv-1.0: single-width bit shift instructions >> >> target/riscv: rvv-1.0: integer add-with-carry/subtract-with-borrow >> >> target/riscv: rvv-1.0: narrowing integer right shift instructions >> >> target/riscv: rvv-1.0: widening integer multiply-add instructions >> >> target/riscv: rvv-1.0: add Zvqmac extension >> >> target/riscv: rvv-1.0: quad-widening integer multiply-add instructions >> >> target/riscv: rvv-1.0: single-width saturating add and subtract >> >> instructions >> >> target/riscv: rvv-1.0: integer comparison instructions >> >> target/riscv: use softfloat lib float16 comparison functions >> >> target/riscv: rvv-1.0: floating-point compare instructions >> >> target/riscv: rvv-1.0: mask-register logical instructions >> >> target/riscv: rvv-1.0: slide instructions >> >> target/riscv: rvv-1.0: floating-point slide instructions >> >> target/riscv: rvv-1.0: narrowing fixed-point clip instructions >> >> target/riscv: rvv-1.0: single-width floating-point reduction >> >> target/riscv: rvv-1.0: widening floating-point reduction instructions >> >> target/riscv: rvv-1.0: single-width scaling shift instructions >> >> target/riscv: rvv-1.0: remove widening saturating scaled multiply-add >> >> target/riscv: rvv-1.0: remove vmford.vv and vmford.vf >> >> target/riscv: rvv-1.0: remove integer extract instruction >> >> target/riscv: rvv-1.0: floating-point min/max instructions >> >> target/riscv: introduce floating-point rounding mode enum >> >> target/riscv: rvv-1.0: floating-point/integer type-convert >> >> instructions >> >> target/riscv: rvv-1.0: widening floating-point/integer type-convert >> >> target/riscv: add "set round to odd" rounding mode helper function >> >> target/riscv: rvv-1.0: narrowing floating-point/integer type-convert >> >> target/riscv: rvv-1.0: relax RV_VLEN_MAX to 512-bits >> >> >> >> Greentime Hu (2): >> >> target/riscv: rvv-1.0: add vlenb register >> >> target/riscv: gdb: support vector registers for rv32 >> >> >> >> Hsiangkai Wang (2): >> >> target/riscv: gdb: modify gdb csr xml file to align with csr register >> >> map >> >> target/riscv: gdb: support vector registers for rv64 >> >> >> >> LIU Zhiwei (4): >> >> target/riscv: rvv-1.0: add mstatus VS field >> >> target/riscv: rvv-1.0: add sstatus VS field >> >> target/riscv: rvv-1.0: add translation-time vector context status >> >> target/riscv: rvv-1.0: add vcsr register >> >> >> >> gdb-xml/riscv-32bit-csr.xml | 18 +- >> >> gdb-xml/riscv-64bit-csr.xml | 18 +- >> >> target/riscv/cpu.c | 12 +- >> >> target/riscv/cpu.h | 97 +- >> >> target/riscv/cpu_bits.h | 10 + >> >> target/riscv/cpu_helper.c | 16 +- >> >> target/riscv/csr.c | 73 +- >> >> target/riscv/fpu_helper.c | 17 +- >> >> target/riscv/gdbstub.c | 126 +- >> >> target/riscv/helper.h | 523 ++-- >> >> target/riscv/insn32-64.decode | 18 +- >> >> target/riscv/insn32.decode | 295 +- >> >> target/riscv/insn_trans/trans_rvv.inc.c | 2366 ++++++++++------ >> >> target/riscv/internals.h | 19 +- >> >> target/riscv/translate.c | 68 +- >> >> target/riscv/vector_helper.c | 3269 +++++++++++------------ >> >> 16 files changed, 4051 insertions(+), 2894 deletions(-) >> >> >> >> -- >> >> 2.17.1 >> >> >> > >> > ping~ >> >> I wasn't really following too closely, but didn't Richard give comments? >> >> Alistair > > > Yeah, they were given in v3 patchset and I've made the changes > based on Richard's comments in this v4 patchset.
Ah ok. I missed that while I was on holidays. Did you want to wait until the v1.0 spec is released or have the draft extensions merged? Alistair > > Frank Chang