unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-26 12:45:39 +00:00

Author	SHA1	Message	Date
Richard Henderson	6bec295bf8	target/arm: Add MTE bits to tb_flags Cache the composite ATA setting. Cache when MTE is fully enabled, i.e. access to tags is enabled and tag checks affect the PE. Do this for both the normal context and the UNPRIV context. Backports commit 81ae05fa2d21ac1a0054935b74342aa38a5ecef7 from qemu	2021-02-25 14:31:41 -05:00
Richard Henderson	f6be2a1a42	target/arm: Add MTE system registers This is TFSRE0_EL1, TFSR_EL1, TFSR_EL2, TFSR_EL3, RGSR_EL1, GCR_EL1, GMID_EL1, and PSTATE.TCO. Backports commit 4b779cebb3e5ab30b945181f1ba3932f5f8a1cb5 from qemu	2021-02-25 14:12:24 -05:00
Richard Henderson	179a3aacdf	target/arm: Add DISAS_UPDATE_NOCHAIN Add an option that writes back the PC, like DISAS_UPDATE_EXIT, but does not exit back to the main loop. Backports commit 329833286d7a1b0ef8c7daafe13c6ae32429694e from qemu	2021-02-25 14:08:08 -05:00
Richard Henderson	eaa6291aa7	target/arm: Rename DISAS_UPDATE to DISAS_UPDATE_EXIT Emphasize that the is_jmp option exits to the main loop. Backports commit 14407ec2007e18536ed34772eef46f6e0a0e3d0e from qemu	2021-02-25 14:02:46 -05:00
Richard Henderson	2540911bdd	target/arm: Add support for MTE to SCTLR_ELx target/arm: Add support for MTE to HCR_EL2 and SCR_EL3 This does not attempt to rectify all of the res0 bits, but does clear the mte bits when not enabled. Since there is no high-part mapping of SCTLR, aa32 mode cannot write to these bits. Backports commits f00faf130d5dcf64b04f71a95f14745845ca1014, and 8ddb300bf60a5f3d358dd6fbf81174f6c03c1d9f from qemu.	2021-02-25 13:59:11 -05:00
Richard Henderson	d81feac642	target/arm: Improve masking of SCR RES0 bits Protect reads of aa64 id registers with ARM_CP_STATE_AA64. Use this as a simpler test than arm_el_is_aa64, since EL3 cannot change mode. Backports commit 252e8c69669599b4bcff802df300726300292f47 from qemu	2021-02-25 13:56:35 -05:00
Richard Henderson	1a35600453	target/arm: Add isar tests for mte Backports commit c7fd0baac0c24defec66263799faa8618327b352 from qemu	2021-02-25 13:55:52 -05:00
Peter Maydell	4a1996502f	target/arm: Remove dead code relating to SABA and UABA In commit cfdb2c0c95ae9205b0 ("target/arm: Vectorize SABA/UABA") we replaced the old handling of SABA/UABA with a vectorized implementation which returns early rather than falling into the loop-over-elements code. We forgot to delete the part of the old looping code that did the accumulate step, and Coverity correctly warns (CID 1428955) that this code is now dead. Delete it. Fixes: cfdb2c0c95ae9205b0 Backports commit ced7e8edb282765685d2ba0206a11f8692d8ec1c from qemu	2021-02-25 13:18:51 -05:00
Peter Maydell	167ed57625	target/arm: Remove unnecessary gen_io_end() calls Since commit ba3e7926691ed3 it has been unnecessary for target code to call gen_io_end() after an IO instruction in icount mode; it is sufficient to call gen_io_start() before it and to force the end of the TB. Many now-unnecessary calls to gen_io_end() were removed in commit 9e9b10c6491153b, but some were missed or accidentally added later. Remove unneeded calls from the arm target: * the call in the handling of exception-return-via-LDM is unnecessary, and the code is already forcing end-of-TB * the call in the VFP access check code is more complicated: we weren't ending the TB, so we need to add the code to force that by setting DISAS_UPDATE * the doc comment for ARM_CP_IO doesn't need to mention gen_io_end() any more Backports commit 55c812b74289863c348449135812027d188f040a from qemu	2021-02-25 13:17:32 -05:00
Peter Maydell	083d207fb0	target/arm: Move some functions used only in translate-neon.inc.c to that file The functions neon_element_offset(), neon_load_element(), neon_load_element64(), neon_store_element() and neon_store_element64() are used only in the translate-neon.inc.c file, so move their definitions there. Since the .inc.c file is #included in translate.c this doesn't make much difference currently, but it's a more logical place to put the functions and it might be helpful if we ever decide to try to make the .inc.c files genuinely separate compilation units. Backports commit 6fb5787898aab6aa04887fed9cf3220dd4c3f36a from qemu	2021-02-25 13:15:23 -05:00
Peter Maydell	0b06317dc4	target/arm: Convert Neon VTRN to decodetree Convert the Neon VTRN insn to decodetree. This is the last insn in the Neon data-processing group, so we can remove all the now-unused old decoder framework. It's possible that there's a more efficient implementation of VTRN, but for this conversion we just copy the existing approach. Backports commit d4366190f84fe89cc5d46da995dac1e7d541b98e from qemu	2021-02-25 13:12:28 -05:00
Peter Maydell	b7584069dd	target/arm: Convert Neon VSWP to decodetree Convert the Neon VSWP insn to decodetree. Since the new implementation doesn't have to share a pass-loop with the other 2-reg-misc operations we can implement the swap with 64-bit accesses rather than 32-bits (which brings us into line with the pseudocode and is more efficient). Backports commit 8ab3a227a0f13f0ff85846f36f7c466769aef4fc from qemu	2021-02-25 13:07:56 -05:00
Peter Maydell	73abdfea53	target/arm: Convert Neon 2-reg-misc VCVT insns to decodetree Convert the VCVT instructions in the 2-reg-misc grouping to decodetree. Backports commit a183d5fb38b07bab2a840196186c4806f3c67c0d from qemu	2021-02-25 13:07:15 -05:00
Peter Maydell	7e705fdc8c	target/arm: Convert Neon 2-reg-misc VRINT insns to decodetree Convert the Neon 2-reg-misc VRINT insns to decodetree. Giving these insns their own do_vrint() function allows us to change the rounding mode just once at the start and end rather than doing it for every element in the vector. Backports commit 128123ea34e9e6afe4842aefcb9cf84b9642ac22 from qemu	2021-02-25 13:02:24 -05:00
Peter Maydell	3eddb77327	target/arm: Convert Neon 2-reg-misc fp-compare-with-zero insns to decodetree Convert the fp-compare-with-zero insns in the Neon 2-reg-misc group to decodetree. Backports commit baa59323e841f76523f6ad4d746cdeb47ea574cd from qemu	2021-02-25 12:59:22 -05:00
Peter Maydell	6eb852ec1c	target/arm: Convert simple fp Neon 2-reg-misc insns Convert the Neon 2-reg-misc insns which are implemented with simple calls to functions that take the input, output and fpstatus pointer. Backports commit 3e96b205286dfb8bbf363229709e4f8648fce379 from qemu	2021-02-25 12:56:28 -05:00
Peter Maydell	3dcee11013	target/arm: Convert Neon VQABS, VQNEG to decodetree Convert the Neon VQABS and VQNEG insns to decodetree. Since these are the only ones which need cpu_env passing to the helper, we wrap the helper rather than creating a whole new do_2misc_env() function. Backports commit 4936f38abe6db0a9d23fd04e4cb0cf4d51cff174 from qemu	2021-02-25 12:53:18 -05:00
Peter Maydell	4033a3ca5c	target/arm: Convert remaining simple 2-reg-misc Neon ops Convert the remaining ops in the Neon 2-reg-misc group which can be implemented simply with our do_2misc() helper. Backports commit 84eae770af69c37a92496a4c4248875c070d5ee3 from qemu	2021-02-25 12:50:55 -05:00
Peter Maydell	88f8111500	target/arm: Convert Neon 2-reg-misc VREV32 and VREV16 to decodetree Convert the VREV32 and VREV16 insns in the Neon 2-reg-misc group to decodetree. Backports commit 8966808205b59d6c196b380b638475bcd1657ef4 from qemu	2021-02-25 12:49:16 -05:00
Peter Maydell	db1e503708	target/arm: Make gen_swap_half() take separate src and dest Make gen_swap_half() take a source and destination TCGv_i32 rather than modifying the input TCGv_i32; we're going to want to be able to use it with the more flexible function signature, and this also brings it into line with other functions like gen_rev16() and gen_revsh(). Backports commit 8ec3de7018a8198624aae49eef5568256114a829 from qemu	2021-02-25 12:40:23 -05:00
Peter Maydell	3c1289c594	target/arm: Fix capitalization in NeonGenTwo{Single, Double}OPFn typedefs All the other typedefs like these spell "Op" with a lowercase 'p'; rename the NeonGenTwoSingleOPFn and NeonGenTwoDoubleOPFn typedefs to match. Backports commit 5de3fd045be11b74cd0fbf36c6d4fb8387d5463b from qemu	2021-02-25 12:38:30 -05:00
Peter Maydell	fa6727ebba	target/arm: Rename NeonGenOneOpFn to NeonGenOne64OpFn The NeonGenOneOpFn typedef breaks with the pattern of the other NeonGen*Fn typedefs, because it is a TCGv_i64 -> TCGv_i64 operation but it does not have '64' in its name. Rename it to NeonGenOne64OpFn, so that the old name is available for a TCGv_i32 -> TCGv_i32 operation (which we will need in a subsequent commit). Backports commit 039f4e809ad2772fb33de4511ff68a485d875618 from qemu	2021-02-25 12:34:51 -05:00
Peter Maydell	27e74962e5	target/arm: Convert Neon 2-reg-misc crypto operations to decodetree Convert the Neon-2-reg misc crypto ops (AESE, AESMC, SHA1H, SHA1SU1) to decodetree. Backports commit 0b30dd5b85e20aba259768cb7aaa952b3e319468 from qemu	2021-02-25 12:32:39 -05:00
Peter Maydell	4354448f57	target/arm: Convert vectorised 2-reg-misc Neon ops to decodetree Convert to decodetree the insns in the Neon 2-reg-misc grouping which we implement using gvec. Backports commit 75153179e9928775d5333243ea4b278f438d75ae from qemu	2021-02-25 12:28:31 -05:00
Peter Maydell	6301f9acaa	target/arm: Convert Neon VCVT f16/f32 insns to decodetree Convert the Neon insns in the 2-reg-misc group which are VCVT between f32 and f16 to decodetree. Backports commit 654a517355e249435505ae5ff14a7520410cf7a4 from qemu	2021-02-25 12:25:32 -05:00
Peter Maydell	4ca33c54a2	target/arm: Convert Neon 2-reg-misc VSHLL to decodetree Convert the VSHLL insn in the 2-reg-misc Neon group to decodetree. Backports commit 749e2be36d75f11d5fa8f8277e2a0569bd2a1c97 from qemu	2021-02-25 12:20:57 -05:00
Peter Maydell	48d57d0dc7	target/arm: Convert Neon narrowing moves to decodetree Convert the Neon narrowing moves VMOVN, VQMOVN, VQMOVUN in the 2-reg-misc group to decodetree. Backports commit 3882bdacb0ad548864b9f2582a32bb5c785e3165 from qemu	2021-02-25 12:18:01 -05:00
Peter Maydell	35d8a3e83f	target/arm: Convert VZIP, VUZP to decodetree Convert the Neon VZIP and VUZP insns in the 2-reg-misc group to decodetree. Backports commit 567663a2af2457da8aa74f221b1f3f8a6d2eddf6 from qemu	2021-02-25 12:14:29 -05:00
Peter Maydell	d21fae82ba	target/arm: Convert Neon 2-reg-misc pairwise ops to decodetree Convert the pairwise ops VPADDL and VPADAL in the 2-reg-misc grouping to decodetree. At this point we can get rid of the weird CPU_V001 #define that was used to avoid having to explicitly list all the arguments being passed to some TCG gen/helper functions. Backports commit 6106af3aa2304fccee91a3a90138352b0c2af998 from qemu	2021-02-25 12:12:11 -05:00
Peter Maydell	505923e676	target/arm: Convert Neon 2-reg-misc VREV64 to decodetree Convert the Neon VREV64 insn from the 2-reg-misc grouping to decodetree. Backports commit 353d2b85058711a5e44c2dc63eb5b620db50a602 from qemu	2021-02-25 12:07:06 -05:00
MerryMage	92243aefd4	arm/translate: Do not tracecode when in an IT block	2021-02-07 19:14:32 +00:00
MerryMage	9ac17104b8	arm: Add missing file vec_internal.h Missing from commit `1df7314dc3`. Ported from qemu a04b68e1d4c4f0cd5cd7542697b1b230b84532f5.	2020-06-20 00:12:09 +01:00
Peter Maydell	709610e606	target/arm: Convert Neon VDUP (scalar) to decodetree Convert the Neon VDUP (scalar) insn to decodetree. (Note that we can't call this just "VDUP" as we used that already in vfp.decode for the "VDUP (general purpose register)" insn.) Backports commit 9aaa23c2ae18e6fb9a291b81baf91341db76dfa0 from qemu	2020-06-17 00:43:19 -04:00
Peter Maydell	8de8a4500a	target/arm: Convert Neon VTBL, VTBX to decodetree Convert the Neon VTBL, VTBX instructions to decodetree. The actual implementation of the insn is copied across to the new trans function unchanged except for renaming 'tmp5' to 'tmp4'. Backports commit 54e96c744b70a5d19f14b212a579dd3be8fcaad9 from qemu	2020-06-17 00:39:27 -04:00
Peter Maydell	4731a69d66	target/arm: Convert Neon VEXT to decodetree Convert the Neon VEXT insn to decodetree. Rather than keeping the old implementation which used fixed temporaries cpu_V0 and cpu_V1 and did the extraction with by-hand shift and logic ops, we use the TCG extract2 insn. We don't need to special case 0 or 8 immediates any more as the optimizer is smart enough to throw away the dead code. Backports commit 0aad761fb0aed40c99039eacac470cbd03d07019 from qemu	2020-06-17 00:29:04 -04:00
Peter Maydell	1aa9046120	target/arm: Convert Neon 2-reg-scalar long multiplies to decodetree Convert the Neon 2-reg-scalar long multiplies to decodetree. These are the last instructions in the group. Backports commit 77e576a9281825fc170f3b3af83f47e110549b5c from qemu	2020-06-17 00:24:12 -04:00
Peter Maydell	088a1e8ba9	target/arm: Convert Neon 2-reg-scalar VQRDMLAH, VQRDMLSH to decodetree Convert the VQRDMLAH and VQRDMLSH insns in the 2-reg-scalar group to decodetree. Backports commit aa318f5b9b4ab3b6744b5305dd8ae9b96676f20e from qemu	2020-06-17 00:15:18 -04:00
Peter Maydell	c0551804d4	target/arm: Convert Neon 2-reg-scalar VQDMULH, VQRDMULH to decodetree Convert the VQDMULH and VQRDMULH insns in the 2-reg-scalar group to decodetree. Backports commit b2fc7be972b94872f6a6dd32d9bda1b88ddbcaad from qemu	2020-06-17 00:11:56 -04:00
Peter Maydell	2e8ae1130e	target/arm: Convert Neon 2-reg-scalar float multiplies to decodetree Convert the float versions of VMLA, VMLS and VMUL in the Neon 2-reg-scalar group to decodetree. Backports commit 85ac9aef9a5418de3168df569e21258e853840a2 from qemu	2020-06-17 00:09:32 -04:00
Peter Maydell	bf1b0374b9	target/arm: Convert Neon 2-reg-scalar integer multiplies to decodetree Convert the VMLA, VMLS and VMUL insns in the Neon "2 registers and a scalar" group to decodetree. These are 32x32->32 operations where one of the inputs is the scalar, followed by a possible accumulate operation of the 32-bit result. The refactoring removes some of the oddities of the old decoder: * operands to the operation and accumulation were often reversed (taking advantage of the fact that most of these ops are commutative); the new code follows the pseudocode order * the Q bit in the insn was in a local variable 'u'; in the new code it is decoded into a->q Backports commit 96fc80f5f186decd1a649f6c04252faceb057ad2 from qemu	2020-06-17 00:04:29 -04:00
Peter Maydell	1817f28afd	target/arm: Add missing TCG temp free in do_2shift_env_64() In commit 37bfce81b10450071 we accidentally introduced a leak of a TCG temporary in do_2shift_env_64(); free it. Backports commit a4f67e180def790ff0bbb33fc93bb6e80382f041 from qemu	2020-06-16 23:57:17 -04:00
Peter Maydell	06dfc2ada6	target/arm: Add 'static' and 'const' annotations to VSHLL function arrays Mark the arrays of function pointers in trans_VSHLL_S_2sh() and trans_VSHLL_U_2sh() as both 'static' and 'const'. Backports commit 448f0e5f3ecfbd089b934e5e3aa0ccd1f51a6174 from qemu	2020-06-16 23:56:30 -04:00
Peter Maydell	6383a2bd15	target/arm: Convert Neon 3-reg-diff polynomial VMULL Convert the Neon 3-reg-diff insn polynomial VMULL. This is the last insn in this group to be converted. Backports commit 18fb58d588898550919392277787979ee7d0d84e from qemu	2020-06-16 23:54:51 -04:00
Peter Maydell	090426b120	target/arm: Convert Neon 3-reg-diff saturating doubling multiplies Convert the Neon 3-reg-diff insns VQDMULL, VQDMLAL and VQDMLSL: these are all saturating doubling long multiplies with a possible accumulate step. These are the last insns in the group which use the pass-over-each elements loop, so we can delete that code. Backports commit 9546ca5998d3cbd98a81b2d46a2e92a11b0f78a4 from qemu	2020-06-16 23:51:56 -04:00
Peter Maydell	5464405d5c	target/arm: Convert Neon 3-reg-diff long multiplies Convert the Neon 3-reg-diff insns VMULL, VMLAL and VMLSL; these perform a 32x32->64 multiply with possible accumulate. Note that for VMLSL we do the accumulate directly with a subtraction rather than doing a negate-then-add as the old code did. Backports commit 3a1d9eb07b767a7592abca642af80906f9eab0ed from qemu	2020-06-16 23:47:28 -04:00
Peter Maydell	21044a1d11	target/arm: Convert Neon 3-reg-diff VABAL, VABDL to decodetree Convert the Neon 3-reg-diff insns VABAL and VABDL to decodetree. Like almost all the remaining insns in this group, these are a combination of a two-input operation which returns a double width result and then a possible accumulation of that double width result into the destination. Backports commit f5b28401200ec95ba89552df3ecdcdc342f6b90b from qemu	2020-06-16 23:41:20 -04:00
Peter Maydell	34418f1998	target/arm: Convert Neon 3-reg-diff narrowing ops to decodetree Convert the narrow-to-high-half insns VADDHN, VSUBHN, VRADDHN, VRSUBHN in the Neon 3-registers-different-lengths group to decodetree. Backports commit 0fa1ab0302badabc3581aefcbb2f189ef52c4985 from qemu	2020-06-16 23:36:18 -04:00
Peter Maydell	d25998ba7d	target/arm: Convert Neon 3-reg-diff prewidening ops to decodetree Convert the "pre-widening" insns VADDL, VSUBL, VADDW and VSUBW in the Neon 3-registers-different-lengths group to decodetree. These insns work by widening one or both inputs to double their size, performing an add or subtract at the doubled size and then storing the double-size result. As usual, rather than copying the loop of the original decoder (which needs awkward code to avoid problems when source and destination registers overlap) we just unroll the two passes. Backports commit b28be09570d0827969b62b8f82b0f720a9915427 from qemu	2020-06-16 23:29:53 -04:00
Peter Maydell	a9d0e36bcf	target/arm: Fix missing temp frees in do_vshll_2sh The widenfn() in do_vshll_2sh() does not free the input 32-bit TCGv, so we need to do this in the calling code. Backports commit 9593a3988c3e788790aa107d778386b09f456a6d from qemu	2020-06-16 23:26:04 -04:00
Richard Henderson	a93d01c61d	target/arm: Use a non-overlapping group for misc control The miscellaneous control instructions are mutually exclusive within the t32 decode sub-group. Backports commit d6084fba47bb9aef79775c1102d4b647eb58c365 from qemu	2020-06-15 12:52:48 -04:00
Peter Maydell	7427cca6cc	target/arm: Convert Neon one-register-and-immediate insns to decodetree Convert the insns in the one-register-and-immediate group to decodetree. In the new decode, our asimd_imm_const() function returns a 64-bit value rather than a 32-bit one, which means we don't need to treat cmode=14 op=1 as a special case in the decoder (it is the only encoding where the two halves of the 64-bit value are different). Backports commit 2c35a39eda0b16c2ed85c94cec204bf5efb97812 from qemu	2020-06-15 12:44:54 -04:00
Peter Maydell	93e6d464c8	target/arm: Convert VCVT fixed-point ops to decodetree Convert the VCVT fixed-point conversion operations in the Neon 2-regs-and-shift group to decodetree. Backports commit 3da26f11711caeaa18318b6afa14dfb81d7650ab from qemu	2020-06-15 12:40:59 -04:00
Peter Maydell	a5f903b2a5	target/arm: Convert Neon VSHLL, VMOVL to decodetree Convert the VSHLL and VMOVL insns from the 2-reg-shift group to decodetree. Since the loop always has two passes, we unroll it to avoid the awkward reassignment of one TCGv to another. Backports commit 968bf842742a5ffbb0041cb31089e61a9f7a833d from qemu	2020-06-15 12:35:32 -04:00
Peter Maydell	6fc8fdaa2b	target/arm: Convert Neon narrowing shifts with op==9 to decodetree Convert the remaining Neon narrowing shifts to decodetree: * VQSHRN * VQRSHRN Backports commit b4a3a77bb7a0dff1cc5673fe3be467d9e3635d44 from qemu	2020-06-15 12:31:35 -04:00
Peter Maydell	ef29b91a43	target/arm: Convert Neon narrowing shifts with op==8 to decodetree Convert the Neon narrowing shifts where op==8 to decodetree: * VSHRN * VRSHRN * VQSHRUN * VQRSHRUN Backports commit 712182d340e33c2ce86143f25fb2f04ae23d90de from qemu	2020-06-15 12:29:09 -04:00
Peter Maydell	69a3312e3a	target/arm: Convert VQSHLU, VQSHL 2-reg-shift insns to decodetree Convert the VQSHLU and VQSHL 2-reg-shift insns to decodetree. These are the last of the simple shift-by-immediate insns. Backports commit 37bfce81b10450071193c8495a07f182ec652e2a from qemu	2020-06-15 12:21:10 -04:00
Peter Maydell	055c96f985	target/arm: Convert Neon VSHR 2-reg-shift insns to decodetree Convert the VSHR 2-reg-shift insns to decodetree. Note that unlike the legacy decoder, we present the right shift amount to the trans_ function as a positive integer. Backports commit 66432d6b8294e3508218b360acfdf7c244eea993 from qemu	2020-06-15 12:15:29 -04:00
Peter Maydell	bf18bf983d	target/arm: Convert Neon VSHL and VSLI 2-reg-shift insn to decodetree Convert the VSHL and VSLI insns from the Neon 2-registers-and-a-shift group to decodetree. Backports commit d3c8c736f8b4bdd02831076286b1788232f46ced from qemu	2020-06-15 12:07:02 -04:00
Richard Henderson	1d95dd1c89	target/arm: Split helper_crypto_sm3tt Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit 43fa36c96c24349145497adc1b451f9caf74e344 from qemu	2020-06-14 23:24:21 -04:00
Richard Henderson	5ca8caf656	target/arm: Split helper_crypto_sha1_3reg Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit afc8b7d32668547308bdd654a63cf5228936e0ba from qemu	2020-06-14 23:18:45 -04:00
Richard Henderson	41c4efdb22	target/arm: Convert sha1 and sha256 to gvec helpers Do not yet convert the helpers to loop over opr_sz, but the descriptor allows the vector tail to be cleared. Which fixes an existing bug vs SVE. Backports commit effa992f153f5e7ab97ab843b565690748c5b402 from qemu	2020-06-14 23:11:28 -04:00
Richard Henderson	2c6c4da80c	target/arm: Convert sha512 and sm3 to gvec helpers Do not yet convert the helpers to loop over opr_sz, but the descriptor allows the vector tail to be cleared. Which fixes an existing bug vs SVE. Backports commit aaffebd6d3135b8aed7e61932af53b004d261579 from qemu	2020-06-14 23:01:49 -04:00
Richard Henderson	894f2168da	target/arm: Convert rax1 to gvec helpers With this conversion, we will be able to use the same helpers with sve. This also fixes a bug in which we failed to clear the high bits of the SVE register after an AdvSIMD operation. Backports commit 1738860d7e60dec5dbeba17f8b44d31aae3accac from qemu	2020-06-14 22:49:36 -04:00
Richard Henderson	1df7314dc3	target/arm: Convert aes and sm4 to gvec helpers With this conversion, we will be able to use the same helpers with sve. In particular, pass 3 vector parameters for the 3-operand operations; for advsimd the destination register is also an input. This also fixes a bug in which we failed to clear the high bits of the SVE register after an AdvSIMD operation. Backports commit a04b68e1d4c4f0cd5cd7542697b1b230b84532f5 from qemu	2020-06-14 22:41:33 -04:00
Peter Maydell	1c6b0339e6	target/arm: Allow user-mode code to write CPSR.E via MSR Using the MSR instruction to write to CPSR.E is deprecated, but it is required to work from any mode including unprivileged code. We were incorrectly forbidding usermode code from writing it because CPSR_USER did not include the CPSR_E bit. We use CPSR_USER in only three places: * as the mask of what to allow userspace MSR to write to CPSR * when deciding what bits a linux-user signal-return should be able to write from the sigcontext structure * in target_user_copy_regs() when we set up the initial registers for the linux-user process In the first two cases not being able to update CPSR.E is a bug, and in the third case it doesn't matter because CPSR.E is always 0 there. So we can fix both bugs by adding CPSR_E to CPSR_USER. Because the cpsr_write() in restore_sigcontext() is now changing a CPSR bit which is cached in hflags, we need to add an arm_rebuild_hflags() call there; the callsite in target_user_copy_regs() was already rebuilding hflags for other reasons. (The recommended way to change CPSR.E is to use the 'SETEND' instruction, which we do correctly allow from usermode code.) Backports commit 268b1b3dfbb92a9348406f728a33f39e3d8dcd8a from qemu	2020-06-14 21:08:03 -04:00
Richard Henderson	acdd5c6065	target/arm: Use clear_vec_high more effectively Do not explicitly store zero to the NEON high part when we can pass !is_q to clear_vec_high. Backports commit e1f778596ebfa8782276f4dd4651f2b285d734ff from qemu	2020-06-14 21:06:40 -04:00
Richard Henderson	3ac9b9b206	target/arm: Use tcg_gen_gvec_mov for clear_vec_high The 8-byte store for the end of a !is_q operation can be merged with the other stores. Use a no-op vector move to trigger the expand_clr portion of tcg_gen_gvec_mov. Backports commit 5c27392dd08bd8534893abf25ef501f1bd8680fe from qemu	2020-06-14 21:00:57 -04:00
Richard Henderson	d960523cbd	softfloat: Name compare relation enum Give the previously unnamed enum a typedef name. Use it in the prototypes of compare functions. Use it to hold the results of the compare functions. Backports commit 71bfd65c5fcd72f8af2735905415c7ce4220f6dc from qemu	2020-05-21 18:08:52 -04:00
Richard Henderson	8adc704058	softfloat: Name rounding mode enum Give the previously unnamed enum a typedef name. Use the packed attribute so that we do not affect the layout of the float_status struct. Use it in the prototypes of relevant functions. Adjust switch statements as necessary to avoid compiler warnings. Backports commit 3dede407cc61b64997f0c30f6dbf4df09949abc9 from qemu	2020-05-21 18:02:05 -04:00
Richard Henderson	a417227674	softfloat: Replace flag with bool We have had this on the to-do list for quite some time. Backports commit c120391c0090d9c40425c92cdb00f38ea8588ff6 from qemu	2020-05-21 17:48:12 -04:00
Richard Henderson	6530d6342f	softfloat: Use post test for floatN_mul The existing f{32,64}_addsub_post test, which checks for zero inputs, is identical to f{32,64}_mul_fast_test. Which means we can eliminate the fast_test/fast_op hooks in favor of reusing the same post hook. This means we have one fewer test along the fast path for multiply. Backports commit b240c9c497b9880ac0ba29465907d5ebecd48083 from qemu	2020-05-21 17:24:00 -04:00
Peter Maydell	7b2fb5bc63	target/arm: Convert NEON VFMA, VFMS 3-reg-same insns to decodetree Convert the Neon floating point VFMA and VFMS insn to decodetree. These are the last insns in the 3-reg-same group so we can remove all the support/loop code from the old decoder. Backports commit e95485f85657be21135c17a9226e297c21e73360 from qemu	2020-05-15 23:49:20 -04:00
Peter Maydell	82484db863	target/arm: Convert Neon fp VMAX/VMIN/VMAXNM/VMINNM/VRECPS/VRSQRTS to decodetree Convert the Neon fp VMAX/VMIN/VMAXNM/VMINNM/VRECPS/VRSQRTS 3-reg-same insns to decodetree. (These are all the remaining non-accumulation instructions in this group.) Backports commit d5fdf9e9e1c6f2bbb0a4bcaafd85d344cce9c298 from qemu	2020-05-15 23:44:52 -04:00
Peter Maydell	a593866af6	target/arm: Move 'env' argument of recps_f32 and rsqrts_f32 helpers to usual place The usual location for the env argument in the argument list of a TCG helper is immediately after the return-value argument. recps_f32 and rsqrts_f32 differ in that they put it at the end. Move the env argument to its usual place; this will allow us to more easily use these helper functions with the gvec APIs. Backports commit 26c6f695cfd2a3ccddb4d015a25b56f56aa62928 from qemu	2020-05-15 23:41:37 -04:00
Peter Maydell	05e72483f4	target/arm: Convert Neon 3-reg-same compare insns to decodetree Convert the Neon integer 3-reg-same compare insns VCGE, VCGT, VCEQ, VACGE and VACGT to decodetree. Backports commit 727ff1d63213e6666e511956903b9e97a339ec7e from qemu	2020-05-15 23:37:53 -04:00
Peter Maydell	042df686ca	target/arm: Convert Neon fp VMUL, VMLA, VMLS 3-reg-same insns to decodetree Convert the Neon fp VMUL, VMLA, and VMLS 3-reg-same insns to decodetree. We don't have a gvec helper for multiply-accumulate, so VMLA and VMLS need a loop function do_3same_fp(). This takes a reads_vd parameter to do_3same_fp() which tells it to load the old value into vd before calling the callback function, in the same way that the do_vfp_3op_sp() and do_vfp_3op_dp() functions in translate-vfp.inc.c work. (The only uses in this patch pass reads_vd == true, but later commits will use reads_vd == false.) This conversion fixes in passing an underdecoding for VMUL Backports commit 8aa71ead912ca0a9c0d29b74e0976f91952f950a from qemu	2020-05-15 23:35:21 -04:00
Peter Maydell	2527e76926	target/arm: Convert Neon VPMIN/VPMAX/VPADD float 3-reg-same insns to decodetree Convert the Neon float VPMIN, VPMAX and VPADD 3-reg-same insns to decodetree. These are the only remaining 'pairwise' operations, so we can delete the pairwise-specific bits of the old decoder's for-each-element loop now. Backports commit ab978335a56e3618212868fdce3a54217c6e71e6 from qemu	2020-05-15 23:31:15 -04:00
Peter Maydell	bb0aa79847	target/arm: Convert Neon VADD, VSUB, VABD 3-reg-same insns to decodetree Convert the Neon VADD, VSUB, VABD 3-reg-same insns to decodetree. We already have gvec helpers for addition and subtraction, but must add one for fabd. Backports commit a26a352bb498662cd0c205cb433a352f86fac7d2 from qemu	2020-05-15 23:26:51 -04:00
Peter Maydell	1df5d57e8a	target/arm: Convert Neon VQDMULH/VQRDMULH 3-reg-same to decodetree Convert the Neon VQDMULH and VQRDMULH 3-reg-same insns to decodetree. These are the last integer operations in the 3-reg-same group. Backports commit 7ecc28bc72b8033cf4e0c6332135ec20d4125dfb from qemu	2020-05-15 23:06:44 -04:00
Peter Maydell	59818edb3c	target/arm: Convert Neon VPADD 3-reg-same insns to decodetree Convert the Neon integer VPADD 3-reg-same insns to decodetree. These are 'pairwise' operations. (Note that VQRDMLAH, which shares the same primary opcode but has U=1, has already been converted.) Backports commit fa22827d4eb078b6c58cd3d19af0b50ed951e832 from qemu	2020-05-15 23:01:25 -04:00
Peter Maydell	1cc6451cb6	target/arm: Convert Neon VPMAX/VPMIN 3-reg-same insns to decodetree Convert the Neon integer VPMAX and VPMIN 3-reg-same insns to decodetree. These are 'pairwise' operations. Backports commit 059c2398a2b1ae86c6722c45e79fb0d0f4d95b1d from qemu	2020-05-15 22:59:10 -04:00
Peter Maydell	f35ae14ab4	target/arm: Convert Neon VQSHL, VRSHL, VQRSHL 3-reg-same insns to decodetree Convert the VQSHL, VRSHL and VQRSHL insns in the 3-reg-same group to decodetree. We have already implemented the size==0b11 case of these insns; this commit handles the remaining sizes Backports commit 6812dfdc6b0286730d6f903ebfbdc4f81b80c29b from qemu	2020-05-15 22:53:27 -04:00
Peter Maydell	5308fb324e	target/arm: Convert Neon VRHADD, VHSUB 3-reg-same insns to decodetree Convert the Neon VRHADD and VHSUB 3-reg-same insns to decodetree. (These are all the other insns in 3-reg-same which were using GEN_NEON_INTEGER_OP() and which are not pairwise or reversed-operands.) Backports commit 8e44d03f4b5590e19a4f7910ca1c327609933dd7 from qemu	2020-05-15 22:50:02 -04:00
Peter Maydell	ec327c7fc8	target/arm: Convert Neon VABA/VABD 3-reg-same to decodetree Convert the Neon VABA and VABD insns in the 3-reg-same group to decodetree. Backports commit 7715098f93ff5205334edf161e5fe156346122b0 from qemu	2020-05-15 22:46:02 -04:00
Peter Maydell	f1028fe4a7	target/arm: Convert Neon VHADD 3-reg-same insns Convert the Neon VHADD insns in the 3-reg-same group to decodetree. Backports commit cb294bca866f1cd776e44e03e5e432942bc676e8 from qemu	2020-05-15 22:43:01 -04:00
Peter Maydell	4098e0b80a	target/arm: Convert Neon 64-bit element 3-reg-same insns Convert the 64-bit element insns in the 3-reg-same group to decodetree. This covers VQSHL, VRSHL and VQRSHL where size==0b11. Backports commit 35d4352fa9e94b35bf17f58181cb16c184b98d56 from qemu	2020-05-15 22:40:48 -04:00
Peter Maydell	e2b703a82c	target/arm: Convert Neon 3-reg-same SHA to decodetree Convert the Neon SHA instructions in the 3-reg-same group to decodetree Backports commit 21290edfc29d8929741c0ed043733c23c69bc3b9 from qemu	2020-05-15 22:34:40 -04:00
Richard Henderson	1740e018f4	target/arm: Convert Neon 3-reg-same VQRDMLAH/VQRDMLSH to decodetree Convert the Neon VQRDMLAH and VQRDMLSH insns in the 3-reg-same group to decodetree. These don't use do_3same() because they want to operate on VFP double registers, whose offsets are different from the neon_reg_offset() calculations do_3same does. Backports commit a063569508af8295cf6271e06700e5b956bb402d from qemu	2020-05-15 22:20:23 -04:00
Richard Henderson	451683ee79	target/arm: Vectorize SABA/UABA Include 64-bit element size in preparation for SVE2. Backports commit cfdb2c0c95ae9205b0dd7f0f5e970cdec50fef20 from qemu	2020-05-15 22:15:14 -04:00
Richard Henderson	98c79f9afc	target/arm: Vectorize SABD/UABD Include 64-bit element size in preparation for SVE2. Backports commit 50c160d44eb059c7fc7f348ae2c3b0cb41437044 from qemu	2020-05-15 22:01:29 -04:00
Richard Henderson	765dbb57f0	target/arm: Clear tail in gvec_fmul_idx_*, gvec_fmla_idx_* Must clear the tail for AdvSIMD when SVE is enabled. Fixes: ca40a6e6e39 Backports commit 525d9b6d42844e187211d25b69be8b378785bc24 from qemu	2020-05-15 21:50:30 -04:00
Richard Henderson	73d08253a2	target/arm: Pass pointer to qc to qrdmla/qrdmls Pass a pointer directly to env->vfp.qc[0], rather than env. This will allow SVE2, which does not modify QC, to pass a pointer to dummy storage. Change the return type of inl_qrdml.h_s16 to match the sense of the operation: signed. Backports commit e286bf4a72fe3a60490b8d6e3f28d6335677e08c from qemu	2020-05-15 21:48:35 -04:00
Richard Henderson	3c4f226e00	target/arm: Create gen_gvec_{qrdmla,qrdmls} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 146aa66ce58b686b8037d0eb3921c1125942dbde from qemu	2020-05-15 21:43:22 -04:00
Richard Henderson	efdcad70b1	target/arm: Remove fp_status from helper_{recpe, rsqrte}_u32 These operations do not touch fp_status. Backports commit fe6fb4beb2f9bb0afc813e565504b66a92bbf04b from qemu	2020-05-15 21:32:03 -04:00
Richard Henderson	9dfc0479ff	target/arm: Create gen_gvec_{uqadd, sqadd, uqsub, sqsub} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit c7715b6b51a6f7a5412c5fcb40a4c8586105e597 from qemu	2020-05-15 21:25:06 -04:00
Richard Henderson	4abfe5156d	target/arm: Create gen_gvec_{cmtst,ushl,sshl} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 8161b75357095fef54c76b1a6ed1e54d0e8655e0 from qemu	2020-05-15 21:15:49 -04:00
Richard Henderson	15b2850f4d	target/arm: Swap argument order for VSHL during decode Rather than perform the argument swap during code generation, perform it during decode. This means it doesn't have to be special cased later, and we can share code with aarch64 code generation. Hopefully the decode comment addresses any confusion that might arise in between. Backports commit e9eee5316ffec5f37643de806b2e5577c5c189cf from qemu	2020-05-15 21:07:59 -04:00
Richard Henderson	546db9089c	target/arm: Create gen_gvec_{mla,mls} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 271063206a46062a45fc6bab8dabe45f0b88159d from qemu	2020-05-15 21:06:06 -04:00
Richard Henderson	340f97bf4c	target/arm: Create gen_gvec_{ceq,clt,cle,cgt,cge}0 Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Macro-ize the 5 nearly identical comparisons. Backports commit 69d5e2bf8c3cefedbfa1c1670137e636dbd7faa5 from qemu	2020-05-15 20:57:33 -04:00
Richard Henderson	e08c2b8ece	target/arm: Tidy handle_vec_simd_shri Now that we've converted all cases to gvec, there is quite a bit of dead code at the end of the function. Remove it. Sink the call to gen_gvec_fn2i to the end, loading a function pointer within the switch statement. Backports commit 3f08f0bce841e7857ec98ce7909629d0c335005e from qemu	2020-05-15 20:47:47 -04:00

1418 commits