unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-25 13:15:39 +00:00

Author	SHA1	Message	Date
Richard Henderson	6190be3191	target/arm: Create gen_gvec_{sri,sli} The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 893ab0542aa385a287cbe46d5535c8b9e95ce699 from qemu	2020-05-15 20:39:28 -04:00
Richard Henderson	2609e6f319	target/arm: Create gen_gvec_{u,s}{rshr,rsra} Create vectorized versions of handle_shri_with_rndacc for shift+round and shift+round+accumulate. Add out-of-line helpers in preparation for longer vector lengths from SVE. Backports commit 6ccd48d4ea244c1c46a24dfa50bfb547f11422dd from qemu	2020-05-15 20:28:44 -04:00
Richard Henderson	5d7c46204d	target/arm: Create gen_gvec_[us]sra The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 631e565450c483e0622eec3d8b61d7fa41d16bca from qemu	2020-05-15 20:10:32 -04:00
Richard Henderson	b0f6374149	target/arm: Use tcg_gen_gvec_dup_imm In a few cases, we're able to remove some manual replication. Backports commit 8711e71f9cbb692d614e6ecf5d51222372f7b77e from qemu	2020-05-07 10:05:49 -04:00
Peter Maydell	652165d671	target/arm: Convert Neon 3-reg-same VMUL, VMLA, VMLS, VSHL to decodetree Convert the Neon VMUL, VMLA, VMLS and VSHL insns in the 3-reg-same grouping to decodetree. Backports commit 0de34fd48ad4e44bf5caa2330657ebefa93cea7d from qemu	2020-05-07 09:50:44 -04:00
Peter Maydell	17bd8930fc	target/arm: Convert Neon 3-reg-same VQADD/VQSUB to decodetree Convert the Neon VQADD/VQSUB insns in the 3-reg-same grouping to decodetree. Backports commit 7a9497f1cf73667a4744d09673b808c20e067915 from qemu	2020-05-07 09:47:18 -04:00
Peter Maydell	d52b830ce3	target/arm: Convert Neon 3-reg-same comparisons to decodetree Convert the Neon comparison ops in the 3-reg-same grouping to decodetree. Backports commit 02bd0cdb64b3e79419ba3a8746cb86430883b3ae from qemu	2020-05-07 09:45:03 -04:00
Peter Maydell	c6f9fb54fd	target/arm: Convert Neon 3-reg-same VMAX/VMIN to decodetree Convert the Neon 3-reg-same VMAX and VMIN insns to decodetree. Backports commit 36b59310c38d45213bf860affa90618aa5eeca93 from qemu	2020-05-07 09:42:04 -04:00
Peter Maydell	d30f99ca79	target/arm: Convert Neon 3-reg-same logic ops to decodetree Convert the Neon logic ops in the 3-reg-same grouping to decodetree. Note that for the logic ops the 'size' field forms part of their decode and the actual operations are always bitwise. Backports commit 35a548edb6f5043386183b9f6b4139d99d1f130a from qemu	2020-05-07 09:40:10 -04:00
Peter Maydell	eae3ce9899	target/arm: Convert Neon 3-reg-same VADD/VSUB to decodetree Convert the Neon 3-reg-same VADD and VSUB insns to decodetree. Note that we don't need the neon_3r_sizes[op] check here because all size values are OK for VADD and VSUB; we'll add this when we convert the first insn that has size restrictions. For this we need one of the GVecGen*Fn typedefs currently in translate-a64.h; move them all to translate.h as a block so they are visible to the 32-bit decoder. Backports commit a4e143ac5b9185f670d2f17ee9cc1a430047cb65 from qemu	2020-05-07 09:36:28 -04:00
Peter Maydell	c7a31355fc	target/arm: Convert Neon 'load/store single structure' to decodetree Convert the Neon "load/store single structure to one lane" insns to decodetree. As this is the last set of insns in the neon load/store group, we can remove the whole disas_neon_ls_insn() function. Backports commit 123ce4e3daba26b760b472687e1fb1ad82cf1993 from qemu	2020-05-07 09:32:17 -04:00
Peter Maydell	302506f2f6	target/arm: Convert Neon 'load single structure to all lanes' to decodetree Convert the Neon "load single structure to all lanes" insns to decodetree. Backports commit 3698747c48db871d876a398592c5a23d7580ed4a from qemu	2020-05-07 09:29:03 -04:00
Peter Maydell	7aad825fa6	target/arm: Convert Neon load/store multiple structures to decodetree Convert the Neon "load/store multiple structures" insns to decodetree. Backports commit a27b46304352a0eced45e560e96515dbe3cc174f from qemu	2020-05-07 09:25:51 -04:00
Peter Maydell	9814c1722f	target/arm: Convert VFM[AS]L (scalar) to decodetree Convert the VFM[AS]L (scalar) insns in the 2reg-scalar-ext group to decodetree. These are the last ones in the group so we can remove all the legacy decode for the group. Note that in disas_thumb2_insn() the parts of this encoding space where the decodetree decoder returns false will correctly be directed to illegal_op by the "(insn & (1 << 28))" check so they won't fall into disas_coproc_insn() by mistake. Backports commit d27e82f7d02f35e5919bd9cbbcb157f3537069a0 from qemu	2020-05-07 09:20:35 -04:00
Peter Maydell	49cdb7e2db	target/arm: Convert V[US]DOT (scalar) to decodetree Convert the V[US]DOT (scalar) insns in the 2reg-scalar-ext group to decodetree. Backports commit 35f5d4d1747558c6af2d914bcd848dcc30c3b531 from qemu	2020-05-07 09:17:32 -04:00
Peter Maydell	73dbfbe4d7	target/arm: Convert VCMLA (scalar) to decodetree Convert VCMLA (scalar) in the 2reg-scalar-ext group to decodetree. Backports commit 7e1b5d615361bb0038cda0e08af41e350e42d081 from qemu	2020-05-07 09:15:30 -04:00
Peter Maydell	1ab06d3eb5	target/arm: Convert VFM[AS]L (vector) to decodetree Convert the VFM[AS]L (vector) insns to decodetree. This is the last insn in the legacy decoder for the 3same_ext group, so we can delete the legacy decoder function for the group entirely. Note that in disas_thumb2_insn() the parts of this encoding space where the decodetree decoder returns false will correctly be directed to illegal_op by the "(insn & (1 << 28))" check so they won't fall into disas_coproc_insn() by mistake. Backports commit 9a107e7b8a3c87ab63ec830d3d60f319fc577ff7 from qemu	2020-05-07 09:13:36 -04:00
Peter Maydell	c06bdf4cc2	target/arm: Convert V[US]DOT (vector) to decodetree Convert the V[US]DOT (vector) insns to decodetree. Backports commit 32da0e330d3e5218b669079826496751fb52c1ca from qemu	2020-05-07 09:09:24 -04:00
Peter Maydell	1d4dba1e5a	target/arm: Convert VCADD (vector) to decodetree Convert the VCADD (vector) insns to decodetree. Backports commit 94d5eb7b3f72fbbdee55d7908e9cb6de95949f4b from qemu	2020-05-07 09:05:55 -04:00
Peter Maydell	d8287755b2	target/arm: Convert VCMLA (vector) to decodetree Convert the VCMLA (vector) insns in the 3same extension group to decodetree. Backports commit afff8de0d4d55b4ce7c36eb9cdfafe477a35dd75 from qemu	2020-05-07 09:02:52 -04:00
Peter Maydell	c2c628eb71	target/arm: Add stubs for AArch32 Neon decodetree Add the infrastructure for building and invoking a decodetree decoder for the AArch32 Neon encodings. At the moment the new decoder covers nothing, so we always fall back to the existing hand-written decode. We follow the same pattern we did for the VFP decodetree conversion (commit 78e138bc1f672c145ef6ace74617d and following): code that deals with Neon will be moving gradually out to translate-neon.vfp.inc, which we #include into translate.c. In order to share the decode files between A32 and T32, we split Neon into 3 parts: * data-processing * load-store * 'shared' encodings The first two groups of instructions have similar but not identical A32 and T32 encodings, so we need to manually transform the T32 encoding into the A32 one before calling the decoder; the third group covers the Neon instructions which are identical in A32 and T32. Backports commit 625e3dd44a15dfbe9532daa6454df3f86cf04d3e from qemu	2020-05-07 08:59:42 -04:00
Peter Maydell	518d18062f	target/arm: Don't allow Thumb Neon insns without FEATURE_NEON We were accidentally permitting decode of Thumb Neon insns even if the CPU didn't have the FEATURE_NEON bit set, because the feature check was being done before the call to disas_neon_data_insn() and disas_neon_ls_insn() in the Arm decoder but was omitted from the Thumb decoder. Push the feature bit check down into the called functions so it is done for both Arm and Thumb encodings. Backports commit d1a6d3b594157425232a1ae5ea7f51b7a1c1aa2e from qemu	2020-05-07 08:55:02 -04:00
Fredrik Strupe	65200d8aad	target/arm: Make VQDMULL undefined when U=1 According to Arm ARM, VQDMULL is only valid when U=0, while having U=1 is unallocated. Backports commit ab553ef74ee52c0889679d0bd0da084aaf938f5c from qemu	2020-05-07 08:34:56 -04:00
Richard Henderson	b26b4c06cd	target/arm: Vectorize integer comparison vs zero These instructions are often used in glibc's string routines. They were the final uses of the 32-bit at a time neon helpers. Backports commit 6b375d3546b009d1e63e07397ec9c6af256e15e9 from qemu	2020-04-30 21:29:17 -04:00
Richard Henderson	4ce91875e4	target/arm: Move the vfp decodetree calls next to the base isa Have the calls adjacent as an intermediate step toward actually merging the decodes. Backports commit f0f6d5c81be47d593e5ece7f06df6fba4c15738b from qemu	2020-03-21 23:54:56 -04:00
Richard Henderson	f1ce64857c	target/arm: Move VLLDM and VLSTM to vfp.decode Now that we no longer have an early check for ARM_FEATURE_VFP, we can use the proper ISA check in trans_VLLDM_VLSTM. Backports commit dc778a6873f534817a13257be2acba3ca87ec015 from qemu	2020-03-21 23:51:59 -04:00
Richard Henderson	7592564248	target/arm: Remove ARM_FEATURE_VFP check from disas_vfp_insn We now have proper ISA checks within each trans_* function. Backports commit 46c98019255b056f5dbc9676a6490951469ca661 from qemu	2020-03-21 23:49:14 -04:00
Richard Henderson	3f0ae7ccee	target/arm: Replace ARM_FEATURE_VFP4 with isar_feature_aa32_simdfmac All remaining tests for VFP4 are for fused multiply-add insns. Since the MVFR1 field is used for both VFP and NEON, move its adjustment from the !has_neon block to the (!has_vfp && !has_neon) block. Test for vfp of the appropraite width alongside the test for simdfmac within translate-vfp.inc.c. Within disas_neon_data_insn, we have already tested for ARM_FEATURE_NEON. Backports commit c52881bbc22b50db99a6c37171ad3eea7d959ae6 from qemu	2020-03-21 23:48:13 -04:00
Richard Henderson	833de589ed	target/arm: Use isar_feature_aa32_simd_r32 more places Many uses of ARM_FEATURE_VFP3 are testing for the number of simd registers implemented. Use the proper test vs MVFR0.SIMDReg. Backports commit a6627f5fc607939f7c8b9c3157fdcb2d368ba0ed from qemu	2020-03-21 19:39:35 -04:00
Richard Henderson	fcce8d4aa1	target/arm: Convert PMULL.8 to gvec We still need two different helpers, since NEON and SVE2 get the inputs from different locations within the source vector. However, we can convert both to the same internal form for computation. The sve2 helper is not used yet, but adding it with this patch helps illustrate why the neon changes are helpful. Backports commit e7e96fc5ec8c79dc77fef522d5226ac09f684ba5 from qemu	2020-03-21 19:35:46 -04:00
Richard Henderson	c00f72f74f	target/arm: Convert PMULL.64 to gvec The gvec form will be needed for implementing SVE2. Backports commit b9ed510e46f2f9e31e5e8adb4661d5d1cbe9a459 from qemu	2020-03-21 19:27:38 -04:00
Richard Henderson	db8a935b44	target/arm: Convert PMUL.8 to gvec The gvec form will be needed for implementing SVE2. Extend the implementation to operate on uint64_t instead of uint32_t. Use a counted inner loop instead of terminating when op1 goes to zero, looking toward the required implementation for ARMv8.4-DIT. Backports commit a21bb78e5817be3f494922e1dadd6455fe5d6318 from qemu	2020-03-21 19:22:18 -04:00
Richard Henderson	d3139f2f0a	target/arm: Vectorize USHL and SSHL These instructions shift left or right depending on the sign of the input, and 7 bits are significant to the shift. This requires several masks and selects in addition to the actual shifts to form the complete answer. That said, the operation is still a small improvement even for two 64-bit elements -- 13 vector operations instead of 2 * 7 integer operations. Backports commit 87b74e8b6edd287ea2160caa0ebea725fa8f1ca1 from qemu	2020-03-21 19:14:17 -04:00
Peter Maydell	e63f70f980	target/arm: Add _aa32_ to isar_feature functions testing 32-bit ID registers Enforce a convention that an isar_feature function that tests a 32-bit ID register always has _aa32_ in its name, and one that tests a 64-bit ID register always has _aa64_ in its name. We already follow this except for three cases: thumb_div, arm_div and jazelle, which all need _aa32_ adding. (As noted in the comment, isar_feature_aa32_fp16_arith() is an exception in that it currently tests ID_AA64PFR0_EL1, but will switch to MVFR1 once we've properly implemented FP16 for AArch32.) Backports commit 873b73c0c891ec20adacc7bd1ae789294334d675 from qemu	2020-03-21 18:08:23 -04:00
Richard Henderson	ca2bb77ab3	target/arm: Split out aarch32_cpsr_valid_mask Split this helper out of msr_mask in translate.c. At the same time, transform the negative reductive logic to positive accumulative logic. It will be usable along the exception paths. While touching msr_mask, fix up formatting. Backports commit 4f9584ed4bba8a57a3cb2fa48a682725005d530a from qemu	2020-03-21 17:16:20 -04:00
Richard Henderson	7aaf0d442b	target/arm: Add mmu_idx for EL1 and EL2 w/ PAN enabled To implement PAN, we will want to swap, for short periods of time, to a different privileged mmu_idx. In addition, we cannot do this with flushing alone, because the AT* instructions have both PAN and PAN-less versions. Add the ARMMMUIdx_PAN constants where necessary next to the corresponding ARMMMUIdx constant. Backports commit 452ef8cb8c7b06f44a30a3c3a54d3be82c4aef59 from qemu	2020-03-21 17:12:16 -04:00
Richard Henderson	0318d7af99	target/arm: Reorganize ARMMMUIdx Prepare for, but do not yet implement, the EL2&0 regime. This involves adding the new MMUIdx enumerators and adjusting some of the MMUIdx related predicates to match. Backports commit b9f6033c1a5fb7da55ed353794db8ec064f78bb2 from qemu.	2020-03-21 15:10:05 -04:00
Richard Henderson	be3c71fb8b	target/arm: Recover 4 bits from TBFLAGs We had completely run out of TBFLAG bits. Split A- and M-profile bits into two overlapping buckets. This results in 4 free bits. We used to initialize all of the a32 and m32 fields in DisasContext by assignment, in arm_tr_init_disas_context. Now we only initialize either the a32 or m32 by assignment, because the bits overlap in tbflags. So zero the entire structure in gen_intermediate_code. Backports commit 79cabf1f473ca6e9fa0727f64ed9c2a84a36f0aa from qemu	2020-03-21 14:51:46 -04:00
Richard Henderson	153d7aadd5	target/arm: Rename ARMMMUIdx_S1E2 to ARMMMUIdx_E2 This is part of a reorganization to the set of mmu_idx. The non-secure EL2 regime only has a single stage translation; there is no point in pointing out that the idx is for stage1. Backports commit e013b7411339342aac8d986c5d5e329e1baee8e1 from qemu	2020-03-21 14:42:23 -04:00
Richard Henderson	f45ab0614e	target/arm: Rename ARMMMUIdx_S1E3 to ARMMMUIdx_SE3 This is part of a reorganization to the set of mmu_idx. The EL3 regime only has a single stage translation, and is always secure. Backports commit 127b2b086303296289099a6fb10bbc51077f1d53 from qemu	2020-03-21 14:38:44 -04:00
Richard Henderson	1a672fc3b1	target/arm: Rename ARMMMUIdx_S1SE[01] to ARMMMUIdx_SE10_[01] This is part of a reorganization to the set of mmu_idx. This emphasizes that they apply to the Secure EL1&0 regime. Backports commit fba37aedecb82506c62a1f9e81d066b4fd04e443 from qemu	2020-03-21 14:35:28 -04:00
Richard Henderson	b62b4c4f35	target/arm: Rename ARMMMUIdx_S2NS to ARMMMUIdx_Stage2 The EL1&0 regime is the only one that uses 2-stage translation. Backports commit 97fa9350017e647151dd1dc212f1bbca0294dba7 from qemu	2020-03-21 14:15:35 -04:00
Richard Henderson	ec05f22e82	target/arm: Rename ARMMMUIdx_S12NSE to ARMMMUIdx_E10_ This is part of a reorganization to the set of mmu_idx. This emphasizes that they apply to the EL1&0 regime. The ultimate goal is -- Non-secure regimes: ARMMMUIdx_E10_0, ARMMMUIdx_E20_0, ARMMMUIdx_E10_1, ARMMMUIdx_E2, ARMMMUIdx_E20_2, -- Secure regimes: ARMMMUIdx_SE10_0, ARMMMUIdx_SE10_1, ARMMMUIdx_SE3, -- Helper mmu_idx for non-secure EL1&0 stage1 and stage2 ARMMMUIdx_Stage2, ARMMMUIdx_Stage1_E0, ARMMMUIdx_Stage1_E1, The 'S' prefix is reserved for "Secure". Unless otherwise specified, each mmu_idx represents all stages of translation. Backports commit 01b98b686460b3a0fb47125882e4f8d4268ac1b6 from qemu	2020-03-21 14:09:15 -04:00
Richard Henderson	dc9733e555	target/arm: Set ISSIs16Bit in make_issinfo During the conversion to decodetree, the setting of ISSIs16Bit got lost. This causes the guest os to incorrectly adjust trapping memory operations. Backports commit 1a1fbc6cbb34c26d43d8360c66c1d21681af14a9 from qemu	2020-03-21 12:09:05 -04:00
Richard Henderson	fb1988190e	target/arm: Fix sign-extension for SMLAL* The 32-bit product should be sign-extended, not zero-extended. Fixes: ea96b37 Backports commit 1ab170865202aab8301131f31bffd87ea0f60d16 from qemu	2020-03-21 11:34:43 -04:00
Alex Bennée	8f275077b0	target/arm: only update pc after semihosting completes Before we introduce blocking semihosting calls we need to ensure we can restart the system on semi hosting exception. To be able to do this the EXCP_SEMIHOST operation should be idempotent until it finally completes. Practically this means ensureing we only update the pc after the semihosting call has completed. Backports commit 4ff5ef9e911c670ca10cdd36dd27c5395ec2c753 from qemu	2020-01-14 08:28:25 -05:00
Marc Zyngier	457934855b	target/arm: Handle AArch32 CP15 trapping via HSTR_EL2 HSTR_EL2 offers a way to trap ranges of CP15 system register accesses to EL2, and it looks like this register is completely ignored by QEMU. To avoid adding extra .accessfn filters all over the place (which would have a direct performance impact), let's add a new TB flag that gets set whenever HSTR_EL2 is non-zero and that QEMU translates a context where this trap has a chance to apply, and only generate the extra access check if the hypervisor is actively using this feature. Tested with a hand-crafted KVM guest accessing CBAR. Backports commit 5bb0a20b74ad17dee5dae38e3b8b70b383ee7c2d from qemu	2020-01-07 18:07:21 -05:00
Lioncash	eadeae183d	target/arm: Amend bad merge	2019-11-28 03:29:56 -05:00
Richard Henderson	df5929cb69	target/arm: Relax r13 restriction for ldrex/strex for v8.0 Armv8-A removes UNPREDICTABLE for R13 for these cases. Backports commit d46ad79efac7aaf9f0eb9f5a96a576e9f39200e0 from qemu	2019-11-28 03:29:31 -05:00
Richard Henderson	fa7a6a5d91	target/arm: Do not reject rt == rt2 for strexd There was too much cut and paste between ldrexd and strexd, as ldrexd does prohibit two output registers the same. Fixes: af288228995 Backports commit 655b02646dc175dc10666459b0a1e4346fc8d46a from qemu	2019-11-28 03:29:18 -05:00

1 2 3 4 5 ...

372 commits