unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-23 11:16:28 +00:00

Author	SHA1	Message	Date
Peter Maydell	b08ea84374	target/arm: Implement fp16 for Neon VRSQRTS Convert the Neon VRSQRTS insn to using a gvec helper, and use this to implement the fp16 case. As with VRECPS, we adjust the phrasing of the new implementation slightly so that the fp32 version parallels the fp16 one. Backports 40fde72dda2da8d55b820fa6c5efd85814be2023	2021-03-01 17:20:22 -05:00
Peter Maydell	f4ebbba9fd	target/arm: Implement fp16 for Neon VRECPS Convert the Neon VRECPS insn to using a gvec helper, and use this to implement the fp16 case. The phrasing of the new float32_recps_nf() is slightly different from the old recps_f32() so that it parallels the f16 version; for f16 we can't assume that flush-to-zero is always enabled. Backports ac8c62c4e5a3f24e6d47f52ec1bfb20994caefa5	2021-03-01 17:09:16 -05:00
Peter Maydell	5776c594e4	target/arm: Implement fp16 for Neon fp compare-vs-0 Convert the neon floating-point vector compare-vs-0 insns VCEQ0, VCGT0, VCLE0, VCGE0 and VCLT0 to use a gvec helper, and use this to implement the fp16 case. Backport 635187aaa92f21ab001e2868e803b3c5460261ca	2021-03-01 17:05:03 -05:00
Peter Maydell	8de258c3cb	target/arm: Implement fp16 for Neon VFMA, VMFS Convert the neon floating-point vector operations VFMA and VFMS to use a gvec helper, and use this to implement the fp16 case. This is the last use of do_3same_fp() so we can now delete that function. Backports commit cf722d75b329ef3f86b869e7e68cbfb1607b3bde	2021-03-01 17:00:49 -05:00
Peter Maydell	587c3549b7	target/arm: Implement fp16 for Neon VMLA, VMLS operations Convert the Neon floating-point VMLA and VMLS insns over to using a gvec helper, and use this to implement the fp16 case. Backports e5adc70665ecaf4009c2fb8d66775ea718a85abd	2021-03-01 16:57:20 -05:00
Peter Maydell	0068d12355	target/arm: Implement fp16 for Neon VMAXNM, VMINNM Convert the Neon floating point VMAXNM and VMINNM insns to using a gvec helper and use this to implement the fp16 case. Backports e22705bb941d82d6c2a09e8b2031084326902be3	2021-03-01 16:53:57 -05:00
Peter Maydell	465cfb54c4	target/arm: Implement fp16 for Neon VMAX, VMIN Convert the Neon float-point VMAX and VMIN insns over to using a gvec helper, and use this to implement the fp16 case. Backport e43268c54b6cbcb197d179409df7126e81f8cd52	2021-03-01 16:50:23 -05:00
Peter Maydell	6dd4a8e93f	target/arm: Implement fp16 for VACGE, VACGT Convert the neon floating-point vector absolute comparison ops VACGE and VACGT over to using a gvec hepler and use this to implement the fp16 case. Backports bb2741da186ebaebc7d5189372be4401e1ff9972	2021-03-01 16:47:44 -05:00
Peter Maydell	4eb39f1b2f	target/arm: Implement fp16 for VCEQ, VCGE, VCGT comparisons Convert the Neon floating-point vector comparison ops VCEQ, VCGE and VCGT over to using a gvec helper and use this to implement the fp16 case. (We put the float16_ceq() etc functions above the DO_2OP() macro definition because later when we convert the compare-against-zero instructions we'll want their definitions to be visible at that point in the source file.) Backports ad505db233b89b7fd4b5a98b6f0e8ac8d05b11db	2021-03-01 16:44:34 -05:00
Peter Maydell	4850377f01	target/arm: Implement FP16 for Neon VADD, VSUB, VABD, VMUL Implement FP16 support for the Neon insns which use the DO_3S_FP_GVEC macro: VADD, VSUB, VABD, VMUL. For VABD this requires us to implement a new gvec_fabd_h helper using the machinery we have already for the other helpers. Backport e4a6d4a69e239becfd83bdcd996476e7b8e1138d	2021-03-01 16:31:54 -05:00
Peter Maydell	90aa9647e0	target/arm: Implement VFP fp16 VRINT* Implement the fp16 version of the VFP VRINT* insns. Backports 0a6f4b4cb338665b81ad824d9a6868932461b7f7	2021-03-01 16:15:21 -05:00
Peter Maydell	9c5b6f06a2	target/arm: Use macros instead of open-coding fp16 conversion helpers Now the VFP_CONV_FIX macros can handle fp16's distinction between the width of the operation and the width of the type used to pass operands, use the macros rather than the open-coded functions. This creates an extra six helper functions, all of which we are going to need for the AArch32 VFP fp16 instructions. Backports commit 414ba270c4fb758d987adf37ae9bfe531715c604	2021-02-28 05:08:44 -05:00
Peter Maydell	5d98e14545	target/arm: Implement VFP fp16 VCMP Implement fp16 version of VCMP. Backports 1b88b054c5b201e8581114d29527c6a5a7e088c9	2021-02-28 04:56:24 -05:00
Peter Maydell	2d9abf7c0b	target/arm: Implement VFP fp16 for VABS, VNEG, VSQRT Implement VFP fp16 for VABS, VNEG and VSQRT. This is all the fp16 insns that use the DO_VFP_2OP macro, because there is no fp16 version of VMOV_reg. Notes: * the gen_helper_vfp_negh already exists as we needed to create it for the fp16 multiply-add insns * as usual we need to use the f16 version of the fp_status; this is only relevant for VSQRT Backports ce2d65a5d191380756cdac7a1fd1ba76bd1621cf	2021-02-28 04:48:28 -05:00
Peter Maydell	6ac2c597ab	target/arm: Implement VFP fp16 for fused-multiply-add Implement VFP fp16 support for fused multiply-add insns VFNMA, VFNMS, VFMA, VFMS. Backports 9886fe2834b064a3cf0675a4659942ed547aed42	2021-02-28 04:39:21 -05:00
Peter Maydell	a42ecfe203	target/arm: Implement VFP fp16 VMLA, VMLS, VNMLS, VNMLA, VNMUL Implement fp16 versions of the VFP VMLA, VMLS, VNMLS, VNMLA, VNMUL instructions. (These are all the remaining ones which we implement via do_vfp_3op_[hsd]p().) Backports commit e7cb0ded52c6d7b86585b09935fe7caeb9e38b69	2021-02-28 04:29:37 -05:00
Peter Maydell	eae621098d	target/arm: Implement VFP fp16 for VFP_BINOP operations Implmeent VFP fp16 support for simple binary-operator VFP insns VADD, VSUB, VMUL, VDIV, VMINNM and VMAXNM: * make the VFP_BINOP() macro generate float16 helpers as well as float32 and float64 * implement a do_vfp_3op_hp() function similar to the existing do_vfp_3op_sp() * add decode for the half-precision insn patterns Note that the VFP_BINOP macro use creates a couple of unused helper functions vfp_maxh and vfp_minh, but they're small so it's not worth splitting the BINOP operations into "needs halfprec" and "no halfprec" groups. Backports commit 120a0eb3ea23a5b06fae2f3daebd46a4035864cf	2021-02-28 04:24:39 -05:00
LIU Zhiwei	d26cd63ad6	softfloat: Define misc operations for bfloat16 Backports 5ebf5f4be66c378fd5f3dee85f54dd4942171d57	2021-02-27 16:41:46 -05:00
LIU Zhiwei	d8168a8142	softfloat: Define convert operations for bfloat16 Backports 34f0c0a98a5f3bb6706088c0384f937f7a294d3e	2021-02-27 16:37:11 -05:00
LIU Zhiwei	b0be0d28cc	softfloat: Define operations for bfloat16 Backports 8282310d8535cc2a8431c516e907da79f92df6eb	2021-02-26 15:20:30 -05:00
Frank Chang	d97454eb63	softfloat: Add fp16 and uint8/int8 conversion functions Backports 0d93d8ec632154dea2627a9e989972ee09721187	2021-02-26 15:11:57 -05:00
Lioncash	f5a21abc0b	target/arm: Convert sq{, r}dmulh to gvec for aa64 advsimd	2021-02-26 15:01:44 -05:00
Richard Henderson	aa97b6b755	target/arm: Convert integer multiply-add (indexed) to gvec for aa64 advsimd Backports 3607440c4df6498585a570cfc1041e4972b41b56	2021-02-26 14:51:17 -05:00
Richard Henderson	732674b868	target/arm: Convert integer multiply (indexed) to gvec for aa64 advsimd Backports 2e5a265e6a9e7169c4a3e87db261b2fa92582590	2021-02-26 14:46:29 -05:00
Lioncash	2da89a626c	target/arm: Merge helper_sve_clr_* and helper_sve_movz_*	2021-02-26 14:23:06 -05:00
Richard Henderson	57c66389c2	target/arm: Fix temp double-free in sve ldr/str The temp that gets assigned to clean_addr has been allocated with new_tmp_a64, which means that it will be freed at the end of the instruction. Freeing it earlier leads to assertion failure. The loop creates a complication, in which we allocate a new local temp, which does need freeing, and the final code path is shared between the loop and non-loop. Fix this complication by adding new_tmp_a64_local so that the new local temp is freed at the end, and can be treated exactly like the non-loop path. Fixes: bba87d0a0f4 Backports commit 4b4dc9750a0aa0b9766bd755bf6512a84744ce8a from qemu	2021-02-25 23:10:37 -05:00
Richard Henderson	732efce958	target/arm: Add mte helpers for sve scatter/gather memory ops Because the elements are non-sequential, we cannot eliminate many tests straight away like we can for sequential operations. But we often have the PTE details handy, so we can test for Tagged. Backports commit d28d12f008ee44dc2cc2ee5d8f673be9febc951e from qemu	2021-02-25 22:34:24 -05:00
Richard Henderson	5698b7badb	target/arm: Handle TBI for sve scalar + int memory ops We still need to handle tbi for user-only when mte is inactive. Backports commit 9473d0ecafcffc8b258892b1f9f18e037bdba958 from qemu	2021-02-25 22:17:46 -05:00
Richard Henderson	586235d02d	target/arm: Add mte helpers for sve scalar + int ff/nf loads Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit aa13f7c3c378fa41366b9fcd6c29af1c3d81126a from qemu	2021-02-25 22:09:17 -05:00
Richard Henderson	cb31d54b18	target/arm: Add mte helpers for sve scalar + int stores Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit 71b9f3948c75bb97641a3c8c7de96d1cb47cdc07 from qemu	2021-02-25 21:53:55 -05:00
Richard Henderson	670b25c5fa	target/arm: Add mte helpers for sve scalar + int loads Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit 206adacfb8d35e671e3619591608c475aa046b63 from qemu	2021-02-25 21:45:32 -05:00
Richard Henderson	94b0876f15	target/arm: Add sve infrastructure for page lookup For contiguous predicated memory operations, we want to minimize the number of tlb lookups performed. We have open-coded this for sve_ld1_r, but for correctness with MTE we will need this for all of the memory operations. Create a structure that holds the bounds of active elements, and metadata for two pages. Add routines to find those active elements, lookup the pages, and run watchpoints for those pages. Temporarily mark the functions unused to avoid Werror. Backports commit b4cd95d2f4c7197b844f51b29871d888063ea3e7 from qemu	2021-02-25 20:28:23 -05:00
Richard Henderson	2e03f74a53	target/arm: Use cpu_*_data_ra for sve_ldst_tlb_fn Use the "normal" memory access functions, rather than the softmmu internal helper functions directly. Since fb901c9, cpu_mem_index is now a simple extract from env->hflags and not a large computation. Which means that it's now more work to pass around this value than it is to recompute it. This only adjusts the primitives, and does not clean up all of the uses within sve_helper.c.	2021-02-25 20:16:38 -05:00
Richard Henderson	49bd9a5c68	target/arm: Use mte_checkN for sve unpredicated stores Backports commit bba87d0a0f480805223a6428a7942a51733c488a from qemu	2021-02-25 17:40:43 -05:00
Richard Henderson	4fdd05e1aa	target/arm: Add helper_mte_check_zva Use a special helper for DC_ZVA, rather than the more general mte_checkN. Backports commit 46dc1bc0601554823a42ad27f236da2ad8f3bdc6 from qemu	2021-02-25 17:17:54 -05:00
Richard Henderson	9a05ca01e7	target/arm: Implement helper_mte_checkN Fill out the stub that was added earlier. Backports commit 5add8248556a3c1006018d7d8e601c9572b280a9 from qemu	2021-02-25 17:10:56 -05:00
Richard Henderson	91e2f55b69	target/arm: Implement helper_mte_check1 Fill out the stub that was added earlier. Backports commit 2e34ff45f32cb032883616a1cc5ea8ac96f546d5 from qemu	2021-02-25 17:02:34 -05:00
Richard Henderson	3e786526cf	target/arm: Add gen_mte_checkN Replace existing uses of check_data_tbi in translate-a64.c that perform multiple logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 73ceeb0011b23bac8bd2c09ebe3c18d034aa69ce from qemu	2021-02-25 16:40:16 -05:00
Richard Henderson	582e64f348	target/arm: Add gen_mte_check1 Replace existing uses of check_data_tbi in translate-a64.c that perform a single logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 0a405be2b8fd9506a009b10d7d2d98c394b36db6 from qemu	2021-02-25 16:13:31 -05:00
Richard Henderson	4bb37fc3c1	target/arm: Implement the LDGM, STGM, STZGM instructions Backports commit 5f716a82388eb09754dd900e7dbb8ffa15897a28 from qemu	2021-02-25 16:00:50 -05:00
Richard Henderson	5b3ddcf2e2	target/arm: Simplify DC_ZVA Now that we know that the operation is on a single page, we need not loop over pages while probing. Backports commit e26d0d226892f67435cadcce86df0ddfb9943174 from qemu	2021-02-25 15:55:46 -05:00
Richard Henderson	e8b9cb8b4a	target/arm: Implement LDG, STG, ST2G instructions Backports commit c15294c1e36a7dd9b25bd54d98178e80f4b64bc1 from qemu	2021-02-25 15:08:44 -05:00
Richard Henderson	911a6b57ed	target/arm: Implement the ADDG, SUBG instructions Backports commit efbc78ad978763aedd11cb718eb1ff8db3fc9152 from qemu	2021-02-25 14:42:33 -05:00
Richard Henderson	58f3dd2cc7	target/arm: Implement the IRG instruction Backports commit da54941f45b820cbaca72aa6efd5669b3dc86e2f from qemu	2021-02-25 14:36:11 -05:00
Joseph Myers	b08d204a37	softfloat: merge floatx80_mod and floatx80_rem The m68k-specific softfloat code includes a function floatx80_mod that is extremely similar to floatx80_rem, but computing the remainder based on truncating the quotient toward zero rather than rounding it to nearest integer. This is also useful for emulating the x87 fprem and fprem1 instructions. Change the floatx80_rem implementation into floatx80_modrem that can perform either operation, with both floatx80_rem and floatx80_mod as thin wrappers available for all targets. There does not appear to be any use for the _mod operation for other floating-point formats in QEMU (the only other architectures using _rem at all are linux-user/arm/nwfpe, for FPA emulation, and openrisc, for instructions that have been removed in the latest version of the architecture), so no change is made to the code for other formats. Backports commit 6b8b0136ab3018e4b552b485f808bf66bcf19ead from qemu	2021-02-25 13:34:05 -05:00
Richard Henderson	1d95dd1c89	target/arm: Split helper_crypto_sm3tt Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit 43fa36c96c24349145497adc1b451f9caf74e344 from qemu	2020-06-14 23:24:21 -04:00
Richard Henderson	5ca8caf656	target/arm: Split helper_crypto_sha1_3reg Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit afc8b7d32668547308bdd654a63cf5228936e0ba from qemu	2020-06-14 23:18:45 -04:00
Richard Henderson	894f2168da	target/arm: Convert rax1 to gvec helpers With this conversion, we will be able to use the same helpers with sve. This also fixes a bug in which we failed to clear the high bits of the SVE register after an AdvSIMD operation. Backports commit 1738860d7e60dec5dbeba17f8b44d31aae3accac from qemu	2020-06-14 22:49:36 -04:00
Richard Henderson	cc3187b1e4	tcg: Implement gvec support for rotate by scalar No host backend support yet, but the interfaces for rotls are in place. Only implement left-rotate for now, as the only known use of vector rotate by scalar is s390x, so any right-rotate would be unused and untestable. Backports commit 23850a74afb641102325b4b7f74071d929fc4594 from qemu	2020-06-14 22:00:50 -04:00
Richard Henderson	be78062fd8	tcg: Implement gvec support for rotate by vector No host backend support yet, but the interfaces for rotlv and rotrv are in place. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> --- v3: Drop the generic expansion from rot to shift; we can do better for each backend, and then this code becomes unused. Backports commit 5d0ceda902915e3f0e21c39d142c92c4e97c3ebb from qemu	2020-06-14 21:43:46 -04:00

1 2 3 4 5 ...

431 commits