The temp that gets assigned to clean_addr has been allocated with
new_tmp_a64, which means that it will be freed at the end of the
instruction. Freeing it earlier leads to an assertion failure.
The loop introduces a complication: there we allocate a new local
temp, which does need freeing, and the final code path is shared
between the loop and non-loop cases.
Resolve this by adding new_tmp_a64_local so that the new local
temp is likewise freed at the end of the instruction and can be
treated exactly like the non-loop path.
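A minimal sketch of the new allocator, assuming it mirrors
new_tmp_a64 but hands out a TCG local temp (the field names below
are my recollection of translate-a64.c, not a verbatim copy):

    /* Like new_tmp_a64, but allocate a local temp so the value
     * survives the loop's branches and is still released with the
     * instruction's other temps. */
    static TCGv_i64 new_tmp_a64_local(DisasContext *s)
    {
        assert(s->tmp_a64_count < TMP_A64_MAX);
        return s->tmp_a64[s->tmp_a64_count++] = tcg_temp_local_new_i64();
    }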
Fixes: bba87d0a0f4
Backports commit 4b4dc9750a0aa0b9766bd755bf6512a84744ce8a from qemu
Because the elements are non-sequential, we cannot eliminate many
tests straight away like we can for sequential operations. But
we often have the PTE details handy, so we can test for Tagged.
Backports commit d28d12f008ee44dc2cc2ee5d8f673be9febc951e from qemu
Because the elements are sequential, we can eliminate many tests all
at once when the tag hits TCMA, or if the page(s) are not Tagged.
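Roughly, the up-front decision looks like this (illustrative C only,
not the QEMU helper; TCMA0/TCMA1 are folded into one flag for
brevity):

    #include <stdbool.h>
    #include <stdint.h>

    /* For a contiguous access, decide once whether any per-element
     * tag comparison is needed at all. */
    static bool mte_checks_needed(uint64_t ptr, bool tcma, bool page_tagged)
    {
        unsigned ptr_tag = (ptr >> 56) & 0xf;   /* logical address tag */

        if (tcma && (ptr_tag == 0 || ptr_tag == 0xf)) {
            return false;       /* tag is unchecked under TCMA */
        }
        return page_tagged;     /* untagged pages never fail the check */
    }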
Backports commit aa13f7c3c378fa41366b9fcd6c29af1c3d81126a from qemu
Because the elements are sequential, we can eliminate many tests all
at once when the tag hits TCMA, or if the page(s) are not Tagged.
Backports commit 71b9f3948c75bb97641a3c8c7de96d1cb47cdc07 from qemu
Because the elements are sequential, we can eliminate many tests all
at once when the tag hits TCMA, or if the page(s) are not Tagged.
Backports commit 206adacfb8d35e671e3619591608c475aa046b63 from qemu
For contiguous predicated memory operations, we want to
minimize the number of TLB lookups performed. We have
open-coded this for sve_ld1_r, but for correctness with
MTE we will need this for all of the memory operations.
Create a structure that holds the bounds of the active elements,
plus metadata for two pages. Add routines to find those
active elements, look up the pages, and run watchpoints
for those pages.
Temporarily mark the new functions as unused to avoid -Werror.
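A sketch of the general shape of that structure (field names and
types are my approximation, not a verbatim copy of the patch):

    /* Per-page metadata for a contiguous predicated access. */
    typedef struct {
        void *host;        /* direct host pointer, or NULL               */
        int flags;         /* result of probing the page                 */
        MemTxAttrs attrs;  /* memory attributes, e.g. the MTE Tagged bit */
    } SVEHostPage;

    /* Bounds of the active elements plus the (at most) two pages touched. */
    typedef struct {
        intptr_t mem_off_first[2], reg_off_first[2], reg_off_last[2];
        intptr_t mem_off_split, reg_off_split;   /* element straddling pages */
        intptr_t page_split;                     /* start of the second page */
        SVEHostPage page[2];
    } SVEContLdSt;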
Backports commit b4cd95d2f4c7197b844f51b29871d888063ea3e7 from qemu
Use the "normal" memory access functions, rather than the
softmmu internal helper functions directly.
Since fb901c9, cpu_mmu_index is now a simple extract
from env->hflags rather than a large computation, which means
that it is now more work to pass this value around than it
is to recompute it.
This only adjusts the primitives, and does not clean up
all of the uses within sve_helper.c.
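For reference, the cheapness claim: after that commit the function
is (roughly) a single bitfield extract. The exact field names here
are an assumption:

    int cpu_mmu_index(CPUARMState *env, bool ifetch)
    {
        /* The MMU index is cached in hflags when they are rebuilt. */
        return FIELD_EX32(env->hflags, TBFLAG_ANY, MMUIDX);
    }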
Replace existing uses of check_data_tbi in translate-a64.c that
perform multiple logical memory accesses. Leave the helper blank
for now to reduce the patch size.
Backports commit 73ceeb0011b23bac8bd2c09ebe3c18d034aa69ce from qemu
Replace existing uses of check_data_tbi in translate-a64.c that
perform a single logical memory access. Leave the helper blank
for now to reduce the patch size.
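A sketch of what "blank" means here; the signature is an
approximation and check_data_tbi is assumed to be the existing
TBI-stripping helper named in this message:

    TCGv_i64 gen_mte_check1(DisasContext *s, TCGv_i64 addr, bool is_write,
                            bool tag_checked, int log2_size)
    {
        /* Placeholder: strip the TBI byte and return the cleaned
         * address; the real MTE check is filled in by a later patch. */
        return check_data_tbi(s, addr);
    }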
Backports commit 0a405be2b8fd9506a009b10d7d2d98c394b36db6 from qemu
Now that we know that the operation is on a single page,
we need not loop over pages while probing.
Backports commit e26d0d226892f67435cadcce86df0ddfb9943174 from qemu
The m68k-specific softfloat code includes a function floatx80_mod that
is extremely similar to floatx80_rem, but computing the remainder
based on truncating the quotient toward zero rather than rounding it
to nearest integer. This is also useful for emulating the x87 fprem
and fprem1 instructions. Change the floatx80_rem implementation into
floatx80_modrem that can perform either operation, with both
floatx80_rem and floatx80_mod as thin wrappers available for all
targets.
There does not appear to be any use for the _mod operation for other
floating-point formats in QEMU (the only other architectures using
_rem at all are linux-user/arm/nwfpe, for FPA emulation, and openrisc,
for instructions that have been removed in the latest version of the
architecture), so no change is made to the code for other formats.
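The wrappers amount to selecting how the quotient is rounded; a
sketch (the exact name and position of the mode flag is an
assumption):

    /* Remainder with the quotient rounded to nearest (IEEE remainder). */
    floatx80 floatx80_rem(floatx80 a, floatx80 b, float_status *status)
    {
        return floatx80_modrem(a, b, false, status);
    }

    /* Remainder with the quotient truncated toward zero (x87 fprem, m68k fmod). */
    floatx80 floatx80_mod(floatx80 a, floatx80 b, float_status *status)
    {
        return floatx80_modrem(a, b, true, status);
    }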
Backports commit 6b8b0136ab3018e4b552b485f808bf66bcf19ead from qemu
Rather than passing an opcode to a helper, fully decode the
operation at translate time. Use clear_tail_16 to zap the
balance of the SVE register with the AdvSIMD write.
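A sketch of what clear_tail_16 does, assuming it wraps the existing
clear_tail helper: the operation writes a 16-byte AdvSIMD result, and
the rest of the (possibly longer) SVE register must be zeroed.

    static void clear_tail_16(void *vd, uint32_t desc)
    {
        int opr_sz = simd_oprsz(desc);
        int max_sz = simd_maxsz(desc);

        /* Zero bytes [16, max_sz) after the 16-byte write. */
        assert(opr_sz == 16);
        clear_tail(vd, opr_sz, max_sz);
    }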
Backports commit 43fa36c96c24349145497adc1b451f9caf74e344 from qemu
Rather than passing an opcode to a helper, fully decode the
operation at translate time. Use clear_tail_16 to zap the
balance of the SVE register with the AdvSIMD write.
Backports commit afc8b7d32668547308bdd654a63cf5228936e0ba from qemu
With this conversion, we will be able to use the same helpers
with sve. This also fixes a bug in which we failed to clear
the high bits of the SVE register after an AdvSIMD operation.
Backports commit 1738860d7e60dec5dbeba17f8b44d31aae3accac from qemu
No host backend support yet, but the interfaces for rotls
are in place. Only implement left-rotate for now, as the
only known use of vector rotate by scalar is s390x, so any
right-rotate would be unused and untestable.
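The interface in question looks roughly like the existing
shift-by-scalar entry points (signature is my approximation):

    /* Rotate each vece-sized element of the vector at aofs left by the
     * scalar count in shift, writing oprsz bytes at dofs. */
    void tcg_gen_gvec_rotls(unsigned vece, uint32_t dofs, uint32_t aofs,
                            TCGv_i32 shift, uint32_t oprsz, uint32_t maxsz);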
Backports commit 23850a74afb641102325b4b7f74071d929fc4594 from qemu
No host backend support yet, but the interfaces for rotlv
and rotrv are in place.
No generic expansion from rot to shift is provided; each backend
can do better, and such fallback code would otherwise go unused.
Backports commit 5d0ceda902915e3f0e21c39d142c92c4e97c3ebb from qemu
No host backend support yet, but the interfaces for rotli
are in place. Canonicalize immediate rotate to the left,
based on a survey of architectures, but provide both left
and right rotate interfaces to the translators.
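Canonicalizing on left rotation means the right-rotate entry point
can be a one-line wrapper; a sketch with approximate signatures:

    /* Immediate left rotate: the canonical form. */
    void tcg_gen_gvec_rotli(unsigned vece, uint32_t dofs, uint32_t aofs,
                            int64_t shift, uint32_t oprsz, uint32_t maxsz);

    /* Immediate right rotate, re-expressed as a left rotate. */
    void tcg_gen_gvec_rotri(unsigned vece, uint32_t dofs, uint32_t aofs,
                            int64_t shift, uint32_t oprsz, uint32_t maxsz)
    {
        tcg_debug_assert(shift >= 0 && shift < (8 << vece));
        tcg_gen_gvec_rotli(vece, dofs, aofs, -shift & ((8 << vece) - 1),
                           oprsz, maxsz);
    }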
Backports commit b0f7e7444c03da17e41bf327c8aea590104a28ab from qemu
Convert the Neon VADD, VSUB, VABD 3-reg-same insns to decodetree.
We already have gvec helpers for addition and subtraction, but must
add one for fabd.
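The per-element operation of the new helper is just the absolute
difference under the current FP status; a sketch of its core (the
helper naming is an assumption):

    /* fabd: |a - b|, honoring float_status (NaNs, flags, flush-to-zero). */
    static float32 float32_abd(float32 op1, float32 op2, float_status *stat)
    {
        return float32_abs(float32_sub(op1, op2, stat));
    }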
Backports commit a26a352bb498662cd0c205cb433a352f86fac7d2 from qemu
Provide a functional interface for the vector expansion.
This fits better with the existing set of helpers that
we provide for other operations.
Backports commit 146aa66ce58b686b8037d0eb3921c1125942dbde from qemu
Provide a functional interface for the vector expansion.
This fits better with the existing set of helpers that
we provide for other operations.
Backports commit c7715b6b51a6f7a5412c5fcb40a4c8586105e597 from qemu
Provide a functional interface for the vector expansion.
This fits better with the existing set of helpers that
we provide for other operations.
Backports commit 8161b75357095fef54c76b1a6ed1e54d0e8655e0 from qemu
Provide a functional interface for the vector expansion.
This fits better with the existing set of helpers that
we provide for other operations.
Backports commit 271063206a46062a45fc6bab8dabe45f0b88159d from qemu
Provide a functional interface for the vector expansion.
This fits better with the existing set of helpers that
we provide for other operations.
Macro-ize the 5 nearly identical comparisons.
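In other words, instead of exporting a GVecGen* descriptor for
callers to feed to tcg_gen_gvec_2(), export a function with the
usual gen_gvec_* shape. A hypothetical example of that shape:

    /* Expand "compare equal to zero" across a vector; same parameter
     * order as the other gen_gvec_* expanders. Name is illustrative. */
    void gen_gvec_ceq0(unsigned vece, uint32_t rd_ofs, uint32_t rn_ofs,
                       uint32_t opr_sz, uint32_t max_sz);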
Backports commit 69d5e2bf8c3cefedbfa1c1670137e636dbd7faa5 from qemu
The functions eliminate duplication of the special cases for
this operation. They match up with the GVecGen2iFn typedef.
Add out-of-line helpers. We got away with only having inline
expanders because the neon vector size is only 16 bytes, and
we know that the inline expansion will always succeed.
When we reuse this for SVE, the tcg-op-gvec expander may decide to use an
out-of-line helper due to longer vector lengths.
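For reference, the typedef the message mentions has (to the best of
my knowledge) this shape:

    /* Two-operand expander with an immediate operand. */
    typedef void GVecGen2iFn(unsigned vece, uint32_t dofs, uint32_t aofs,
                             int64_t c, uint32_t oprsz, uint32_t maxsz);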
Backports commit 893ab0542aa385a287cbe46d5535c8b9e95ce699 from qemu
Create vectorized versions of handle_shri_with_rndacc
for shift+round and shift+round+accumulate. Add out-of-line
helpers in preparation for longer vector lengths from SVE.
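Per element, the scalar operation being vectorized is "shift right
with rounding, then accumulate"; an illustrative reference, assuming
1 <= shift < 64 for brevity:

    /* Unsigned rounding shift right and accumulate, one 64-bit element. */
    static uint64_t ursra_d(uint64_t acc, uint64_t src, unsigned shift)
    {
        uint64_t round_bit = (src >> (shift - 1)) & 1;
        return acc + (src >> shift) + round_bit;
    }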
Backports commit 6ccd48d4ea244c1c46a24dfa50bfb547f11422dd from qemu
The functions eliminate duplication of the special cases for
this operation. They match up with the GVecGen2iFn typedef.
Add out-of-line helpers. We got away with only having inline
expanders because the neon vector size is only 16 bytes, and
we know that the inline expansion will always succeed.
When we reuse this for SVE, the tcg-op-gvec expander may decide to use an
out-of-line helper due to longer vector lengths.
Backports commit 631e565450c483e0622eec3d8b61d7fa41d16bca from qemu
Add a version of tcg_gen_dup_* that takes both an immediate and
a vector element size operand. This will replace the set of
tcg_gen_gvec_dup{8,16,32,64}i functions that encode the element
size within the function name.
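The consolidated entry point takes the element size as a parameter
(signature is my recollection of the commit, not verified):

    /* Duplicate the low 8 << vece bits of imm across oprsz bytes at dofs. */
    void tcg_gen_gvec_dup_imm(unsigned vece, uint32_t dofs, uint32_t oprsz,
                              uint32_t maxsz, uint64_t imm);

    /* e.g. zeroing a 16-byte vector register:
     *   tcg_gen_gvec_dup_imm(MO_64, dofs, 16, 16, 0);
     * rather than tcg_gen_gvec_dup64i(dofs, 16, 16, 0). */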
Backports commit 44c94677febd15488f9190b11eaa4a08e8ac696b from qemu
Make cpu_register() (renamed to arm_cpu_register()) available
from internals.h so that, in the future, we can register CPUs
from other files as well.
Backports commit 37bcf244454f4efb82e2c0c64bbd7eabcc165a0c from qemu
These instructions are often used in glibc's string routines.
They were the final uses of the 32-bit-at-a-time Neon helpers.
Backports commit 6b375d3546b009d1e63e07397ec9c6af256e15e9 from qemu
We still need two different helpers, since NEON and SVE2 get the
inputs from different locations within the source vector. However,
we can convert both to the same internal form for computation.
The sve2 helper is not used yet, but adding it with this patch
helps illustrate why the neon changes are helpful.
Backports commit e7e96fc5ec8c79dc77fef522d5226ac09f684ba5 from qemu
The gvec form will be needed for implementing SVE2.
Extend the implementation to operate on uint64_t instead of uint32_t.
Use a counted inner loop instead of terminating when op1 goes to zero,
looking toward the required implementation for ARMv8.4-DIT.
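An illustrative 8-bit carry-less multiply (not QEMU's helper) showing
the counted-loop style: the iteration count is fixed, so timing does
not depend on the operand values, in the spirit of ARMv8.4-DIT:

    static uint16_t pmul8(uint8_t op1, uint8_t op2)
    {
        uint16_t result = 0;
        for (int i = 0; i < 8; i++) {      /* fixed count, no early exit */
            if (op1 & (1u << i)) {
                result ^= (uint16_t)op2 << i;
            }
        }
        return result;
    }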
Backports commit a21bb78e5817be3f494922e1dadd6455fe5d6318 from qemu
These instructions shift left or right depending on the sign of
the shift count taken from the second input, and 7 bits of that
count are significant to the shift. This
requires several masks and selects in addition to the actual
shifts to form the complete answer.
That said, the operation is still a small improvement even for
two 64-bit elements -- 13 vector operations instead of 2 * 7
integer operations.
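For orientation, the scalar semantics being vectorized, unsigned case
(illustrative only): the count is a signed byte from the second
operand, positive counts shift left, negative counts shift right, and
a magnitude of 64 or more yields zero:

    static uint64_t ushl_d(uint64_t val, int8_t shift)
    {
        if (shift >= 64 || shift <= -64) {
            return 0;
        }
        return shift >= 0 ? val << shift : val >> -shift;
    }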
Backports commit 87b74e8b6edd287ea2160caa0ebea725fa8f1ca1 from qemu