unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-10-11 13:37:17 +00:00

Author	SHA1	Message	Date
Peter Maydell	43d8441881	target/arm: Implement VSCCLRM insn Implement the v8.1M VSCCLRM insn, which zeros floating point registers if there is an active floating point context. This requires support in write_neon_element32() for the MO_32 element size, so add it. Because we want to use arm_gen_condlabel(), we need to move the definition of that function up in translate.c so it is before the #include of translate-vfp.c.inc. Backports 83ff3d6add965c9752324de11eac5687121ea826	2021-03-03 17:57:30 -05:00
Peter Maydell	952ebdc207	target/arm: Don't clobber ID_PFR1.Security on M-profile cores In arm_cpu_realizefn() we check whether the board code disabled EL3 via the has_el3 CPU object property, which we create if the CPU starts with the ARM_FEATURE_EL3 feature bit. If it is disabled, then we turn off ARM_FEATURE_EL3 and also zero out the relevant fields in the ID_PFR1 and ID_AA64PFR0 registers. This codepath was incorrectly being taken for M-profile CPUs, which do not have an EL3 and don't set ARM_FEATURE_EL3, but which may have the M-profile Security extension and so should have non-zero values in the ID_PFR1.Security field. Restrict the handling of the feature flag to A/R-profile cores. Backports 4018818840f499d0a478508aedbb6802c8eae928	2021-03-03 17:52:30 -05:00
Peter Maydell	cfefada296	target/arm: Implement v8.1M PXN extension In v8.1M the PXN architecture extension adds a new PXN bit to the MPU_RLAR registers, which forbids execution of code in the region from a privileged mode. This is another feature which is just in the generic "in v8.1M" set and has no ID register field indicating its presence. Backports cad8e2e3160dd10371552fce6cd8c6e171503e13	2021-03-03 17:50:26 -05:00
Peter Maydell	b9c51dc19a	Open 6.0 development tree Backports c923a30481baf87f631659085f94cd6000116192	2021-03-02 13:39:05 -05:00
Peter Maydell	e6ae2e0245	Update version for v5.2.0 release Backports 553032db17440f8de011390e5a1cfddd13751b0b	2021-03-02 13:38:38 -05:00
Peter Maydell	530491aef0	Update version for v5.2.0-rc4 release Backports d73c46e4a84e47ffc61b8bf7c378b1383e7316b5	2021-03-02 13:38:19 -05:00
Peter Maydell	d823f26c5e	Update version for v5.2.0-rc3 release Backports dd3d2340c4076d1735cd0f7cb61f4d8622b9562d	2021-03-02 13:37:49 -05:00
Rémi Denis-Courmont	d9592046ef	target/arm: fix stage 2 page-walks in 32-bit emulation Using a target unsigned long would limit the Input Address to a LPAE page-walk to 32 bits on AArch32 and 64 bits on AArch64. This is okay for stage 1 or on AArch64, but it is insufficient for stage 2 on AArch32. In that later case, the Input Address can have up to 40 bits. Backports commit 98e8779770c40901ed585745aacc9a8e2b934a28	2021-03-02 13:37:02 -05:00
Peter Maydell	5eb86e4d3c	Update version for v5.2.0-rc2 release 66a300a107ec286725bdc943601cbd4247b82158	2021-03-02 13:35:58 -05:00
Philippe Mathieu-Daudé	7bb2c171ac	qemu/bswap: Remove unused qemu_bswap_len() Last use of qemu_bswap_len() has been removed in commit e5fd1eb05ec ("apb: add busA qdev property to PBM PCI bridge"). Backport 949eaaad5341db318fc8bae79489a1f7624f3b9e	2021-03-02 13:35:17 -05:00
Chetan Pant	3e25486110	x86 tcg cpus: Fix Lesser GPL version number There is no "version 2" of the "Lesser" General Public License. It is either "GPL version 2.0" or "Lesser GPL version 2.1". This patch replaces all occurrences of "Lesser GPL version 2" with "Lesser GPL version 2.1" in comment section. Backport d9ff33ada7f32ca59f99b270a2d0eb223b3c9c8f	2021-03-02 13:33:10 -05:00
Chetan Pant	c7f6786089	arm tcg cpus: Fix Lesser GPL version number There is no "version 2" of the "Lesser" General Public License. It is either "GPL version 2.0" or "Lesser GPL version 2.1". This patch replaces all occurrences of "Lesser GPL version 2" with "Lesser GPL version 2.1" in comment section. Backports 50f57e09fda4b7ffbc5ba62aad6cebf660824023	2021-03-02 13:30:35 -05:00
Peter Maydell	e19550db6d	Update version for v5.2.0-rc1 release Backports c6f28ed5075df79fef39c500362a3f4089256c9c	2021-03-02 13:25:21 -05:00
Peter Maydell	f991d945d3	target/arm/translate-neon.c: Handle VTBL UNDEF case before VFP access check Checks for UNDEF cases should go before the "is VFP enabled?" access check, except in special cases. Move a stray UNDEF check in the VTBL trans function up above the access check. Backports b6c56c8a9a4064ea783f352f43c5df6231a110fa	2021-03-02 13:24:51 -05:00
Richard Henderson	9623047097	target/arm: Fix neon VTBL/VTBX for len > 1 The helper function did not get updated when we reorganized the vector register file for SVE. Since then, the neon dregs are non-sequential and cannot be simply indexed. At the same time, make the helper function operate on 64-bit quantities so that we do not have to call it twice. Backports 604cef3e57eaeeef77074d78f6cf2eca1be11c62	2021-03-02 13:23:13 -05:00
Xinhao Zhang	b3f63b72a2	target/arm: add space before the open parenthesis '(' Fix code style. Space required before the open parenthesis '('. Backports 7f350a87e3a85e8a260ce4b133d549a7b2789213	2021-03-02 13:17:48 -05:00
Xinhao Zhang	71d4aced5d	target/arm: Don't use '#' flag of printf format Fix code style. Don't use '#' flag of printf format ('%#') in format strings, use '0x' prefix instead Backports 6eb55edbabb9eed1e4c7dfb233e7d738e8b5fa89	2021-03-02 13:16:09 -05:00
Xinhao Zhang	492fbc4d2c	target/arm: add spaces around operator Fix code style. Operator needs spaces both sides. Backports bdc3b6f570e8bd219aa6a24a149b35a691e6986c	2021-03-02 13:15:12 -05:00
Peter Maydell	348504c386	Update version for v5.2.0-rc0 release Backports 3d6e32347a3b57dac7f469a07c5f520e69bd070a	2021-03-02 13:10:16 -05:00
Peter Maydell	e528c8229e	target/arm: Get correct MMU index for other-security-state In arm_v7m_mmu_idx_for_secstate() we get the 'priv' level to pass to armv7m_mmu_idx_for_secstate_and_priv() by calling arm_current_el(). This is incorrect when the security state being queried is not the current one, because arm_current_el() uses the current security state to determine which of the banked CONTROL.nPRIV bits to look at. The effect was that if (for instance) Secure state was in privileged mode but Non-Secure was not then we would return the wrong MMU index. The only places where we are using this function in a way that could trigger this bug are for the stack loads during a v8M function-return and for the instruction fetch of a v8M SG insn. Fix the bug by expanding out the M-profile version of the arm_current_el() logic inline so it can use the passed in secstate rather than env->v7m.secure. Backports 7142eb9e24b4aa5118cd67038057f15694d782aa	2021-03-02 13:08:44 -05:00
Rémi Denis-Courmont	a4053565d6	target/arm: fix LORID_EL1 access check Secure mode is not exempted from checking SCR_EL3.TLOR, and in the future HCR_EL2.TLOR when S-EL2 is enabled. Backports 9bd268bae5c4760870522292fb1d46e7da7e372a	2021-03-02 13:06:50 -05:00
Rémi Denis-Courmont	df4413edc7	target/arm: fix handling of HCR.FB HCR should be applied when NS is set, not when it is cleared. Backports 373e7ffde9bae90a20fb5db21b053f23091689f4	2021-03-02 13:05:01 -05:00
Peter Maydell	6b8096d9fc	target/arm: Fix VUDOT/VSDOT (scalar) on big-endian hosts The helper functions for performing the udot/sdot operations against a scalar were not using an address-swizzling macro when converting the index of the scalar element into a pointer into the vm array. This had no effect on little-endian hosts but meant we generated incorrect results on big-endian hosts. For these insns, the index is indexing over group of 4 8-bit values, so 32 bits per indexed entity, and H4() is therefore what we want. (For Neon the only possible input indexes are 0 and 1.) Backports d1a9254be5cc93afb15be19f7543da6ff4806256	2021-03-02 13:03:51 -05:00
Peter Maydell	5c6730a432	target/arm: Fix float16 pairwise Neon ops on big-endian hosts In the neon_padd/pmax/pmin helpers for float16, a cut-and-paste error meant we were using the H4() address swizzler macro rather than the H2() which is required for 2-byte data. This had no effect on little-endian hosts but meant we put the result data into the destination Dreg in the wrong order on big-endian hosts. Backports 552714c0812a10e5cff239bd29928e5fcb8d8b3b	2021-03-02 13:02:31 -05:00
Richard Henderson	d473f66177	target/arm: Improve do_prewiden_3d We can use proper widening loads to extend 32-bit inputs, and skip the "widenfn" step. Backports 8aab18a2c5209e4e48998a61fbc2d89f374331ed	2021-03-02 13:00:25 -05:00
Richard Henderson	9263117d47	target/arm: Simplify do_long_3d and do_2scalar_long In both cases, we can sink the write-back and perform the accumulate into the normal destination temps Backports 9f1a5f93c2dd345dc6c8fe86ed14bf1485056f6e	2021-03-02 12:46:53 -05:00
Richard Henderson	07c2b70234	target/arm: Rename neon_load_reg64 to vfp_load_reg64 The only uses of this function are for loading VFP double-precision values, and nothing to do with NEON. Backports b38b96ca90827012ab8eb045c1337cea83a54c4b	2021-03-02 12:43:25 -05:00
Richard Henderson	9d87b62578	target/arm: Add read/write_neon_element64 Replace all uses of neon_load/store_reg64 within translate-neon.c.inc. Backports 0aa8e700a53b0aa7275ed747b8fa3acb61d35f2d	2021-03-02 12:40:33 -05:00
Richard Henderson	89b1f62878	target/arm: Rename neon_load_reg32 to vfp_load_reg32 The only uses of this function are for loading VFP single-precision values, and nothing to do with NEON. Backports 21c1c0e50b73c580c6bfc8f2314d1b6a14793561	2021-03-02 12:30:20 -05:00
Richard Henderson	011d9ab061	target/arm: Expand read/write_neon_element32 to all MemOp We can then use this to improve VMOV (scalar to gp) and VMOV (gp to scalar) so that we simply perform the memory operation that we wanted, rather than inserting or extracting from a 32-bit quantity. These were the last uses of neon_load/store_reg, so remove them. Backports 4d5fa5a80ac28f34b8497be1e85371272413a12e	2021-03-02 12:26:41 -05:00
Richard Henderson	d21316d639	target/arm: Add read/write_neon_element32 Model these off the aa64 read/write_vec_element functions. Use it within translate-neon.c.inc. The new functions do not allocate or free temps, so this rearranges the calling code a bit. Backports a712266f5d5a36d04b22fe69fa15592d62bed019	2021-03-02 12:18:31 -05:00
Richard Henderson	e390c1ec7f	target/arm: Use neon_element_offset in vfp_reg_offset This seems a bit more readable than using offsetof CPU_DoubleU. Backports d8719785fde2f5041986853a314c05c6f567d3cb	2021-03-02 11:55:49 -05:00
Richard Henderson	c1ca9e53da	target/arm: Use neon_element_offset in neon_load/store_reg These are the only users of neon_reg_offset, so remove that. Backports 0f2cdc82276a723ee58562b56b9d537a4bd7bfef	2021-03-02 11:54:56 -05:00
Richard Henderson	1b09d0d96f	target/arm: Move neon_element_offset to translate.c This will shortly have users outside of translate-neon.c.inc. Backports 7ec85c02833f4264840c6ed78b749443a7b4ffe0	2021-03-02 11:52:59 -05:00
Richard Henderson	8a20537e7f	target/arm: Introduce neon_full_reg_offset This function makes it clear that we're talking about the whole register, and not the 32-bit piece at index 0. This fixes a bug when running on a big-endian host. Backports 015ee81a4c06b644969f621fd9965cc6372b879e	2021-03-02 11:50:36 -05:00
Peter Maydell	2f0940677e	target/arm: Implement FPSCR.LTPSIZE for M-profile LOB extension If the M-profile low-overhead-branch extension is implemented, FPSCR bits [18:16] are a new field LTPSIZE. If MVE is not implemented (currently always true for us) then this field always reads as 4 and ignores writes. These bits used to be the vector-length field for the old short-vector extension, so we need to take care that they are not misinterpreted as setting vec_len. We do this with a rearrangement of the vfp_set_fpscr() code that deals with vec_len, vec_stride and also the QC bit; this obviates the need for the M-profile only masking step that we used to have at the start of the function. We provide a new field in CPUState for LTPSIZE, even though this will always be 4, in preparation for MVE, so we don't have to come back later and split it out of the vfp.xregs[FPSCR] value. (This state struct field will be saved and restored as part of the FPSCR value via the vmstate_fpscr in machine.c.) Backports 8128c8e8cc9489a8387c74075974f86dc0222e7f	2021-03-01 20:36:02 -05:00
Peter Maydell	8a6e118a17	target/arm: Allow M-profile CPUs with FP16 to set FPSCR.FP16 M-profile CPUs with half-precision floating point support should be able to write to FPSCR.FZ16, but an M-profile specific masking of the value at the top of vfp_set_fpscr() currently prevents that. This is not yet an active bug because we have no M-profile FP16 CPUs, but needs to be fixed before we can add any. The bits that the masking is effectively preventing from being set are the A-profile only short-vector Len and Stride fields, plus the Neon QC bit. Rearrange the order of the function so that those fields are handled earlier and only under a suitable guard; this allows us to drop the M-profile specific masking, making FZ16 writeable. This change also makes the QC bit correctly RAZ/WI for older no-Neon A-profile cores. This refactoring also paves the way for the low-overhead-branch LTPSIZE field, which uses some of the bits that are used for A-profile Stride and Len. Backports commit d31e2ce68d56f5bcc83831497e5fe4b8a7e18e85	2021-03-01 20:33:22 -05:00
Peter Maydell	3ae5543825	target/arm: Implement v8.1M low-overhead-loop instructions v8.1M's "low-overhead-loop" extension has three instructions for looping: * DLS (start of a do-loop) * WLS (start of a while-loop) * LE (end of a loop) The loop-start instructions are both simple operations to start a loop whose iteration count (if any) is in LR. The loop-end instruction handles "decrement iteration count and jump back to loop start"; it also caches the information about the branch back to the start of the loop to improve performance of the branch on subsequent iterations. As with the branch-future instructions, the architecture permits an implementation to discard the LO_BRANCH_INFO cache at any time, and QEMU takes the IMPDEF option to never set it in the first place (equivalent to discarding it immediately), because for us a "real" implementation would be unnecessary complexity. (This implementation only provides the simple looping constructs; the vector extension MVE (Helium) adds some extra variants to handle looping across vectors. We'll add those later when we implement MVE.) Backports commit b7226369721896ab9ef71544e4fe95b40710e05a	2021-03-01 20:29:04 -05:00
Peter Maydell	be197f9857	target/arm: Implement v8.1M branch-future insns (as NOPs) v8.1M implements a new 'branch future' feature, which is a set of instructions that request the CPU to perform a branch "in the future", when it reaches a particular execution address. In hardware, the expected implementation is that the information about the branch location and destination is cached and then acted upon when execution reaches the specified address. However the architecture permits an implementation to discard this cached information at any point, and so guest code must always include a normal branch insn at the branch point as a fallback. In particular, an implementation is specifically permitted to treat all BF insns as NOPs (which is equivalent to discarding the cached information immediately). For QEMU, implementing this caching of branch information would be complicated and would not improve the speed of execution at all, so we make the IMPDEF choice to implement all BF insns as NOPs. Backports commit 05903f036edba8e3ed940cc215b8e27fb49265b9	2021-03-01 20:25:15 -05:00
Peter Maydell	966246d991	target/arm: Don't allow BLX imm for M-profile The BLX immediate insn in the Thumb encoding always performs a switch from Thumb to Arm state. This would be totally useless in M-profile which has no Arm decoder, and so the instruction does not exist at all there. Make the encoding UNDEF for M-profile. (This part of the encoding space is used for the branch-future and low-overhead-loop insns in v8.1M.) Backports 920f04fa3ea789f8f85a52cee5395b8887b56cf7	2021-03-01 20:23:59 -05:00
Peter Maydell	5680bc701b	target/arm: Make the t32 insn[25:23]=111 group non-overlapping The t32 decode has a group which represents a set of insns which overlap with B_cond_thumb because they have [25:23]=111 (which is an invalid condition code field for the branch insn). This group is currently defined using the {} overlap-OK syntax, but it is almost entirely non-overlapping patterns. Switch it over to use a non-overlapping group. For this to be valid syntactically, CPS must move into the same overlapping-group as the hint insns (CPS vs hints was the only actual use of the overlap facility for the group). The non-overlapping subgroup for CLREX/DSB/DMB/ISB/SB is no longer necessary and so we can remove it (promoting those insns to be members of the parent group). Backports 45f11876ae86128bdee27e0b089045de43cc88e4	2021-03-01 20:22:11 -05:00
Peter Maydell	666fe17025	target/arm: Implement v8.1M conditional-select insns v8.1M brings four new insns to M-profile: * CSEL : Rd = cond ? Rn : Rm * CSINC : Rd = cond ? Rn : Rm+1 * CSINV : Rd = cond ? Rn : ~Rm * CSNEG : Rd = cond ? Rn : -Rm Implement these. Backports cc73bbded0dfb5612b0e416f7eda13a66950542a	2021-03-01 20:19:33 -05:00
Peter Maydell	2dae268fcb	target/arm: Implement v8.1M NOCP handling From v8.1M, disabled-coprocessor handling changes slightly: * coprocessors 8, 9, 14 and 15 are also governed by the cp10 enable bit, like cp11 * an extra range of instruction patterns is considered to be inside the coprocessor space We previously marked these up with TODO comments; implement the correct behaviour. Unfortunately there is no ID register field which indicates this behaviour. We could in theory test an unrelated ID register which indicates guaranteed-to-be-in-v8.1M behaviour like ID_ISAR0.CmpBranch >= 3 (low-overhead-loops), but it seems better to simply define a new ARM_FEATURE_V8_1M feature flag and use it for this and other new-in-v8.1M behaviour that isn't identifiable from the ID registers. Backports commit 5d2555a1fe7370feeb1efbbf276a653040910017	2021-03-01 20:16:09 -05:00
Peter Maydell	51093daf5f	decodetree: Fix codegen for non-overlapping group inside overlapping group For nested groups like: { [ pattern 1 pattern 2 ] pattern 3 } the intended behaviour is that patterns 1 and 2 must not overlap with each other; if the insn matches neither then we fall through to pattern 3 as the next thing in the outer overlapping group. Currently we generate incorrect code for this situation, because in the code path for a failed match inside the inner non-overlapping group we generate a "return" statement, which causes decode to stop entirely rather than continuing to the next thing in the outer group. Generate a "break" instead, so that decode flow behaves as required for this nested group case. Backports 514101c0b931f0a11a40d29d26af1cc40482f951	2021-03-01 20:14:19 -05:00
Richard Henderson	f7e831a7e4	target/arm: Ignore HCR_EL2.ATA when {E2H,TGE} != 11 Unlike many other bits in HCR_EL2, the description for this bit does not contain the phrase "if ... this field behaves as 0 for all purposes other than", so do not squash the bit in arm_hcr_el2_eff. Instead, replicate the E2H+TGE test in the two places that require it. Backports 4301acd7d7d455792ea873ced75c0b5d653618b1	2021-03-01 20:12:36 -05:00
Richard Henderson	4f00eacb11	target/arm: Fix reported EL for mte_check_fail The reporting in AArch64.TagCheckFail only depends on PSTATE.EL, and not the AccType of the operation. There are two guest visible problems that affect LDTR and STTR because of this: (1) Selecting TCF0 vs TCF1 to decide on reporting, (2) Report "data abort same el" not "data abort lower el". Backports 50244cc76abcac3296cff3d84826f5ff71808c80	2021-03-01 20:10:44 -05:00
Richard Henderson	511636a3f4	target/arm: Remove redundant mmu_idx lookup We already have the full ARMMMUIdx as computed from the function parameter. For the purpose of regime_has_2_ranges, we can ignore any difference between AccType_Normal and AccType_Unpriv, which would be the only difference between the passed mmu_idx and arm_mmu_idx_el. Backports 4aedfc0f633fd06dd2a5dc8ffa93f4c56117e37f	2021-03-01 20:09:51 -05:00
Peter Maydell	d350644817	target/arm: AArch32 VCVT fixed-point to float is always round-to-nearest For AArch32, unlike the VCVT of integer to float, which honours the rounding mode specified by the FPSCR, VCVT of fixed-point to float is always round-to-nearest. (AArch64 fixed-point-to-float conversions always honour the FPCR rounding mode.) Implement this by providing _round_to_nearest versions of the relevant helpers which set the rounding mode temporarily when making the call to the underlying softfloat function. We only need to change the VFP VCVT instructions, because the standard- FPSCR value used by the Neon VCVT is always set to round-to-nearest, so we don't need to do the extra work of saving and restoring the rounding mode. Backports commit 61db12d9f9eb36761edba4d9a414cd8dd34c512b	2021-03-01 20:04:31 -05:00
Peter Maydell	31013d5a8f	target/arm: Fix SMLAD incorrect setting of Q bit The SMLAD instruction is supposed to: * signed multiply Rn[15:0] * Rm[15:0] * signed multiply Rn[31:16] * Rm[31:16] * perform a signed addition of the products and Ra * set Rd to the low 32 bits of the theoretical infinite-precision result * set the Q flag if the sign-extension of Rd would differ from the infinite-precision result (ie on overflow) Our current implementation doesn't quite do this, though: it performs an addition of the products setting Q on overflow, and then it adds Ra, again possibly setting Q. This sometimes incorrectly sets Q when the architecturally mandated only-check-for-overflow-once algorithm does not. For instance: r1 = 0x80008000; r2 = 0x80008000; r3 = 0xffffffff smlad r0, r1, r2, r3 This is (-32768 * -32768) + (-32768 * -32768) - 1 The products are both 0x4000_0000, so when added together as 32-bit signed numbers they overflow (and QEMU sets Q), but because the addition of Ra == -1 brings the total back down to 0x7fff_ffff there is no overflow for the complete operation and setting Q is incorrect. Fix this edge case by resorting to 64-bit arithmetic for the case where we need to add three values together. Backports commit 5288145d716338ace0f83e3ff05c4d07715bb4f4	2021-03-01 19:58:39 -05:00
Peter Maydell	6cd06169ee	target/arm: Make '-cpu max' have a 48-bit PA QEMU supports a 48-bit physical address range, but we don't currently expose it in the '-cpu max' ID registers (you get the same range as Cortex-A57, which is 44 bits). Set the ID_AA64MMFR0.PARange field to indicate 48 bits. Backports d1b6b7017572e8d82f26eb827a1dba0e8cf3cae6	2021-03-01 19:50:28 -05:00
Richard Henderson	c648361597	tcg: Remove TCG_TARGET_HAS_cmp_vec The cmp_vec opcode is mandatory; this symbol is unused. Backports cae5d53b9e72d7a1e43cebeb36471d77a16c6e43	2021-03-01 19:49:02 -05:00
Richard Henderson	45af31fcb4	tcg/optimize: Fold dup2_vec When the two arguments are identical, this can be reduced to dup_vec or to mov_vec from a tcg_constant_vec. Backports commit 1dc4fe70128db05237a00eda6eb15e2b44deb31f	2021-03-01 19:46:14 -05:00
Richard Henderson	456fb66617	tcg: Fix generation of dupi_vec for 32-bit host The definition of INDEX_op_dupi_vec is that it operates on units of tcg_target_ulong -- in this case 32 bits. It does not work to use this for a uint64_t value that happens to be small enough to fit in tcg_target_ulong. Backports a5b30d950c42b14bc9da24d1e68add6538d23336	2021-03-01 19:45:30 -05:00
Richard Henderson	578673be68	tcg/i386: Fix dupi for avx2 32-bit hosts The previous change wrongly stated that 32-bit avx2 should have used VPBROADCASTW. But that's a 16-bit broadcast and we want a 32-bit broadcast. Backports f80d09b599a5e0fd7f44653f23b04104cb703f7a	2021-03-01 19:44:09 -05:00
Richard Henderson	50b3632ab4	tcg: Remove TCGOpDef.used The last user of this field disappeared in f69d277ece4.	2021-03-01 19:43:37 -05:00
Richard Henderson	7813c57f9e	tcg: Move some TCG_CT_* bits to TCGArgConstraint bitfields These are easier to set and test when they have their own fields. Reduce the size of alias_index and sort_index to 4 bits, which is sufficient for TCG_MAX_OP_ARGS. This leaves only the bits indicating constants within the ct field. Move all initialization to allocation time, rather than init individual fields in process_op_defs. Backports bc2b17e6ea582ef3ade2bdca750de269c674c915	2021-03-01 19:41:34 -05:00
Richard Henderson	71a34d84e5	tcg: Remove TCG_CT_REG This wasn't actually used for anything, really. All variable operands must accept registers, and which are indicated by the set in TCGArgConstraint.regs. Backports commit 74a117906b87ff9220e4baae5a7431d6f4eadd45	2021-03-01 19:38:00 -05:00
Richard Henderson	ae075d324d	tcg: Move sorted_args into TCGArgConstraint.sort_index This uses an existing hole in the TCGArgConstraint structure and will be convenient for keeping the data in one place. Backports 66792f90f14fef18b25a168922877a367ecdca05	2021-03-01 19:33:45 -05:00
Richard Henderson	e3356f9bad	tcg: Drop union from TCGArgConstraint The union is unused; let "regs" appear in the main structure without the "u.regs" wrapping. Backports 9be0d08019465b38e2f1a605960961a491430c21	2021-03-01 19:29:19 -05:00
Richard Henderson	1551f6be9d	tcg: Adjust simd_desc size encoding With larger vector sizes, it turns out oprsz == maxsz, and we only need to represent mismatch for oprsz <= 32. We do, however, need to represent larger oprsz and do so without reducing SIMD_DATA_BITS. Reduce the size of the oprsz field and increase the maxsz field. Steal the oprsz value of 24 to indicate equality with maxsz. Backports e2e7168a214b0ed98dc357bba96816486a289762	2021-03-01 19:23:37 -05:00
Richard Henderson	567fa21c65	target/arm: Fix SVE splice While converting to gen_gvec_ool_zzzp, we lost passing a->esz as the data argument to the function. Backports commit dd701fafe55a78e655d4823d29226d92250a6b56	2021-03-01 19:20:44 -05:00
Richard Henderson	ccb293911f	target/arm: Fix sve ldr/str The mte update missed a bit when producing clean addresses. Fixes: b2aa8879b88 Backports d8227b098301935ea8e0e032e7d41e5dc3e97590	2021-03-01 19:20:04 -05:00
Peter Maydell	79feec40df	target/arm: Make isar_feature_aa32_fp16_arith() handle M-profile The M-profile definition of the MVFR1 ID register differs slightly from the A-profile one, and in particular the check for "does the CPU support fp16 arithmetic" is not the same. We don't currently implement any M-profile CPUs with fp16 arithmetic, so this is not yet a visible bug, but correcting the logic now disarms this beartrap for when we eventually do. Backports commit dfc523a84b06b6a4b583ed4c29d24fd980dd37a0	2021-03-01 19:17:23 -05:00
Peter Maydell	09a7d6381e	target/arm: Move id_pfr0, id_pfr1 into ARMISARegisters Move the id_pfr0 and id_pfr1 fields into the ARMISARegisters sub-struct. We're going to want id_pfr1 for an isar_features check, and moving both at the same time avoids an odd inconsistency. Changes other than the ones to cpu.h and kvm64.c made automatically with: perl -p -i -e 's/cpu->id_pfr/cpu->isar.id_pfr/' target/arm/*.c hw/intc/armv7m_nvic.c Backports commit 8a130a7be6e222965641e1fd9469fd3ee752c7d4	2021-03-01 19:15:10 -05:00
Peter Maydell	ed92f3c42b	target/arm: Replace ARM_FEATURE_PXN with ID_MMFR0.VMSA check The ARM_FEATURE_PXN bit indicates whether the CPU supports the PXN bit in short-descriptor translation table format descriptors. This is indicated by ID_MMFR0.VMSA being at least 0b0100. Replace the feature bit with an ID register check, in line with our preference for ID register checks over feature bits. Backports commit 0ae0326b984e77a55c224b7863071bd3d8951231	2021-03-01 19:06:15 -05:00
Xiaoyao Li	d9d68cc128	i386/cpu: Clear FEAT_XSAVE_COMP_{LO,HI} when XSAVE is not available Per Intel SDM vol 1, 13.2, if CPUID.1:ECX.XSAVE[bit 26] is 0, the processor provides no further enumeration through CPUID function 0DH. QEMU does not do this for "-cpu host,-xsave". Backports 19ca8285fcd61a8f60f2f44f789a561e0958e8e6	2021-03-01 19:04:03 -05:00
Richard Henderson	5e6196ea6b	target/riscv: Set instance_align on RISCVCPU TypeInfo Fix alignment of CPURISCVState.vreg. Backports 5de5b99b3101a1648ed583193db8d92eea0c4545	2021-03-01 19:00:27 -05:00
Richard Henderson	cdf40f7ff6	target/arm: Set instance_align on CPUARM TypeInfo Fix alignment of CPUARMState.vfp.zregs. Backports d03087bda4ba17076b430fd2af083020d7c5112a	2021-03-01 18:58:44 -05:00
Richard Henderson	86dd30850d	qom: Allow objects to be allocated with increased alignment It turns out that some hosts have a default malloc alignment less than that required for vectors. We assume that, with compiler annotation on CPUArchState, that we can properly align the vector portion of the guest state. Fix the alignment of the allocation by using qemu_memalloc when required.	2021-03-01 18:32:51 -05:00
Eduardo Habkost	6baafeafd4	qom: Correct object_class_dynamic_cast_assert() documentation object_class_dynamic_cast_assert() is not used by INTERFACE_CHECK, remove misleading mention of that function in the documentation.	2021-03-01 18:29:34 -05:00
Aaron Lindsay	97702da7ad	target/arm: Count PMU events when MDCR.SPME is set This check was backwards when introduced in commit 033614c47de78409ad3fb39bb7bd1483b71c6789: target/arm: Filter cycle counter based on PMCCFILTR_EL0 Backports commit db1f3afb17269cf2bd86c222e1bced748487ef71	2021-03-01 18:25:25 -05:00
Peter Maydell	16ad0d93d9	target/arm: Convert VCMLA, VCADD size field to MO_* in decode The VCMLA and VCADD insns have a size field which is 0 for fp16 and 1 for fp32 (note that this is the reverse of the Neon 3-same encoding!). Convert it to MO_* values in decode for consistency. Backports d186a4854c04e9832907b0b4240a47731da20993	2021-03-01 18:23:34 -05:00
Peter Maydell	61abec1908	target/arm: Convert Neon VCVT fp size field to MO_* in decode Convert the insns using the 2reg_vcvt and 2reg_vcvt_f16 formats to pass the size through to the trans function as a MO_* value rather than the '0==f32, 1==f16' used in the fp 3-same encodings. Backports commit 0ae715c658a02af1834b63563c56112a6d8842cb	2021-03-01 18:20:11 -05:00
Peter Maydell	524b54bc7b	target/arm: Convert Neon 3-same-fp size field to MO_* in decode In the Neon instructions, some instruction formats have a 2-bit size field which corresponds exactly to QEMU's MO_8/16/32/64. However the floating-point insns in the 3-same group have a 1-bit size field which is "0 for 32-bit float and 1 for 16-bit float". Currently we pass these values directly through to trans_ functions, which means that when reading a particular trans_ function you need to know if that insn uses a 2-bit size or a 1-bit size. Move the handling of the 1-bit size to the decodetree file, so that all these insns consistently pass a size to the trans_ function which is an MO_8/16/32/64 value. In this commit we switch over the insns using the 3same_fp and 3same_fp_q0 formats. Backports commit 6cf0f240e0b980a877abed12d2995f740eae6515	2021-03-01 18:15:18 -05:00
Richard Henderson	cd79d2a915	tcg: Implement 256-bit dup for tcg_gen_gvec_dup_mem We already support duplication of 128-bit blocks. This extends that support to 256-bit blocks. This will be needed by SVE2. Backports commit fe4b0b5bfa96c38ad1cad0689a86cca9f307e353	2021-03-01 18:10:07 -05:00
Richard Henderson	b478ce5052	tcg: Eliminate one store for in-place 128-bit dup_mem Do not store back to the exact memory from which we just loaded. Backports 6a17646176e011ddc463a2870a64c7aaccfe9c50	2021-03-01 18:06:17 -05:00
Stephen Long	c9dc750058	tcg: Fix tcg gen for vectorized absolute value The fallback inline expansion for vectorized absolute value, when the host doesn't support such an insn was flawed. E.g. when a vector of bytes has all elements negative, mask will be 0xffff_ffff_ffff_ffff. Subtracting mask only adds 1 to the low element instead of all elements becase -mask is 1 and not 0x0101_0101_0101_0101. Backports commit e7e8f33fb603c3bfa0479d7d924f2ad676a84317	2021-03-01 18:04:46 -05:00
Eduardo Habkost	cefb1666c0	arm: Fix typo in AARCH64_CPU_GET_CLASS definition There's a typo in the type name of AARCH64_CPU_GET_CLASS. This was never detected because the macro is not used by any code. Backports 37e3d65043229bb20bd07af74dc0866e12071415	2021-03-01 18:03:29 -05:00
Peter Maydell	ff74ede2fd	target/arm: Enable FP16 in '-cpu max' Set the MVFR1 ID register FPHP and SIMDHP fields to indicate that our "-cpu max" has v8.2-FP16. Backports commit 5f07817eb94542e39a419baafa3026b15e8d33f7	2021-03-01 18:00:13 -05:00
Peter Maydell	b948636c4a	target/arm: Implement fp16 for Neon VMUL, VMLA, VMLS Convert the Neon floating-point VMUL, VMLA and VMLS to use gvec, and use this to implement fp16 support. Backports fc8ae790311882afa3c7816df004daf978c40e9a	2021-03-01 17:57:36 -05:00
Peter Maydell	8c6affbca4	target/arm/vec_helper: Add gvec fp indexed multiply-and-add operations Add gvec helpers for doing Neon-style indexed non-fused fp multiply-and-accumulate operations. Backports commit c50d8d144098a8261233ca31b47e3bc487e112fe	2021-03-01 17:52:31 -05:00
Peter Maydell	3cc3099e36	target/arm/vec_helper: Handle oprsz less than 16 bytes in indexed operations In the gvec helper functions for indexed operations, for AArch32 Neon the oprsz (total size of the vector) can be less than 16 bytes if the operation is on a D reg. Since the inner loop in these helpers always goes from 0 to segment, we must clamp it based on oprsz to avoid processing a full 16 byte segment when asked to handle an 8 byte wide vector. Backports commit d7ce81e553e6789bf27657105b32575668d60b1c	2021-03-01 17:48:42 -05:00
Peter Maydell	681218b4ab	target/arm: Implement fp16 for Neon VRINTX Convert the Neon VRINTX insn to use gvec, and use this to implement fp16 support for it. Backports 23afcdd2511f2a3dc05bed650d27bd25cf9b2a3c	2021-03-01 17:47:25 -05:00
Peter Maydell	53aba9d900	target/arm: Implement fp16 for Neon VRINT-with-specified-rounding-mode Convert the Neon VRINT-with-specified-rounding-mode insns to gvec, and use this to implement the fp16 versions. Backports 18725916b1438b54d6d6533980833d2251a20b7c	2021-03-01 17:44:49 -05:00
Peter Maydell	eb4054d04f	target/arm: Implement fp16 for Neon VCVT with rounding modes Convert the Neon VCVT with-specified-rounding-mode instructions to gvec, and use this to implement fp16 support for them. Backports ca88a6efdf4ce96b646a896059f9bd324c2cebc4	2021-03-01 17:40:36 -05:00
Peter Maydell	56fe927d40	target/arm: Implement fp16 for Neon VCVT fixed-point Implement fp16 for the Neon VCVT insns which convert between float and fixed-point. Backports 24018cf3990b692b51e50183c5fbd98d17b3fa40	2021-03-01 17:36:43 -05:00
Peter Maydell	948b01ad01	target/arm: Convert Neon VCVT fixed-point to gvec Convert the Neon VCVT float<->fixed-point insns to a gvec style, in preparation for adding fp16 support. Backports 7b959c5890deb9a6d71bc6800006a0eae0a84c60	2021-03-01 17:33:20 -05:00
Peter Maydell	c324c6817e	target/arm: Implement fp16 for Neon float-integer VCVT Convert the Neon float-integer VCVT insns to gvec, and use this to implement fp16 support for them. Note that unlike the VFP int<->fp16 VCVT insns we converted earlier and which convert to/from a 32-bit integer, these Neon insns convert to/from 16-bit integers. So we can use the existing vfp conversion helpers for the f32<->u32/i32 case but need to provide our own for f16<->u16/i16. Backports 7782a9afec81d1efe23572135c1ed777691ccde5	2021-03-01 17:29:02 -05:00
Peter Maydell	82f4a7e135	target/arm: Implement fp16 for Neon pairwise fp ops Convert the Neon pairwise fp ops to use a single gvic-style helper to do the full operation instead of one helper call for each 32-bit part. This allows us to use the same framework to implement the fp16. Backports 1dc587ee9bfe804406eb3e0bacf47a80644d8abc	2021-03-01 17:25:19 -05:00
Peter Maydell	b08ea84374	target/arm: Implement fp16 for Neon VRSQRTS Convert the Neon VRSQRTS insn to using a gvec helper, and use this to implement the fp16 case. As with VRECPS, we adjust the phrasing of the new implementation slightly so that the fp32 version parallels the fp16 one. Backports 40fde72dda2da8d55b820fa6c5efd85814be2023	2021-03-01 17:20:22 -05:00
Peter Maydell	f4ebbba9fd	target/arm: Implement fp16 for Neon VRECPS Convert the Neon VRECPS insn to using a gvec helper, and use this to implement the fp16 case. The phrasing of the new float32_recps_nf() is slightly different from the old recps_f32() so that it parallels the f16 version; for f16 we can't assume that flush-to-zero is always enabled. Backports ac8c62c4e5a3f24e6d47f52ec1bfb20994caefa5	2021-03-01 17:09:16 -05:00
Peter Maydell	5776c594e4	target/arm: Implement fp16 for Neon fp compare-vs-0 Convert the neon floating-point vector compare-vs-0 insns VCEQ0, VCGT0, VCLE0, VCGE0 and VCLT0 to use a gvec helper, and use this to implement the fp16 case. Backport 635187aaa92f21ab001e2868e803b3c5460261ca	2021-03-01 17:05:03 -05:00
Peter Maydell	8de258c3cb	target/arm: Implement fp16 for Neon VFMA, VMFS Convert the neon floating-point vector operations VFMA and VFMS to use a gvec helper, and use this to implement the fp16 case. This is the last use of do_3same_fp() so we can now delete that function. Backports commit cf722d75b329ef3f86b869e7e68cbfb1607b3bde	2021-03-01 17:00:49 -05:00
Peter Maydell	587c3549b7	target/arm: Implement fp16 for Neon VMLA, VMLS operations Convert the Neon floating-point VMLA and VMLS insns over to using a gvec helper, and use this to implement the fp16 case. Backports e5adc70665ecaf4009c2fb8d66775ea718a85abd	2021-03-01 16:57:20 -05:00
Peter Maydell	0068d12355	target/arm: Implement fp16 for Neon VMAXNM, VMINNM Convert the Neon floating point VMAXNM and VMINNM insns to using a gvec helper and use this to implement the fp16 case. Backports e22705bb941d82d6c2a09e8b2031084326902be3	2021-03-01 16:53:57 -05:00
Peter Maydell	465cfb54c4	target/arm: Implement fp16 for Neon VMAX, VMIN Convert the Neon float-point VMAX and VMIN insns over to using a gvec helper, and use this to implement the fp16 case. Backport e43268c54b6cbcb197d179409df7126e81f8cd52	2021-03-01 16:50:23 -05:00
Peter Maydell	6dd4a8e93f	target/arm: Implement fp16 for VACGE, VACGT Convert the neon floating-point vector absolute comparison ops VACGE and VACGT over to using a gvec hepler and use this to implement the fp16 case. Backports bb2741da186ebaebc7d5189372be4401e1ff9972	2021-03-01 16:47:44 -05:00
Peter Maydell	4eb39f1b2f	target/arm: Implement fp16 for VCEQ, VCGE, VCGT comparisons Convert the Neon floating-point vector comparison ops VCEQ, VCGE and VCGT over to using a gvec helper and use this to implement the fp16 case. (We put the float16_ceq() etc functions above the DO_2OP() macro definition because later when we convert the compare-against-zero instructions we'll want their definitions to be visible at that point in the source file.) Backports ad505db233b89b7fd4b5a98b6f0e8ac8d05b11db	2021-03-01 16:44:34 -05:00
Peter Maydell	0e8fd4cd0c	target/arm: Implement fp16 for Neon VABS, VNEG of floats Rewrite Neon VABS/VNEG of floats to use gvec logical AND and XOR, so that we can implement the fp16 version of the insns. Backport 2b70d8cd09f5450c15788acd24f6f8bc4116c395	2021-03-01 16:40:33 -05:00
Peter Maydell	6c71951d54	target/arm: Implement fp16 for Neon VRECPE, VRSQRTE using gvec We already have gvec helpers for floating point VRECPE and VRQSRTE, so convert the Neon decoder to use them and add the fp16 support. Backports 4a15d9a3b39d4d161d7e03dfcf52e9f214eef0b8	2021-03-01 16:35:04 -05:00
Peter Maydell	4850377f01	target/arm: Implement FP16 for Neon VADD, VSUB, VABD, VMUL Implement FP16 support for the Neon insns which use the DO_3S_FP_GVEC macro: VADD, VSUB, VABD, VMUL. For VABD this requires us to implement a new gvec_fabd_h helper using the machinery we have already for the other helpers. Backport e4a6d4a69e239becfd83bdcd996476e7b8e1138d	2021-03-01 16:31:54 -05:00
Peter Maydell	08b70267d0	target/arm: Implement VFP fp16 VMOV between gp and halfprec registers Implement the VFP fp16 variant of VMOV that transfers a 16-bit value between a general purpose register and a VFP register. Note that Rt == 15 is UNPREDICTABLE; since this insn is v8 and later only we have no need to replicate the old "updates CPSR.NZCV" behaviour that the singleprec version of this insn does Backports commit 46a4b854525cb9f34a611f6ada6cdff1eab0ac2d	2021-03-01 16:26:34 -05:00
Peter Maydell	58485bca97	target/arm: Implement new VFP fp16 insn VMOVX The fp16 extension includes a new instruction VMOVX, which copies the upper 16 bits of a 32-bit source VFP register into the lower 16 bits of the destination and zeroes the high half of the destination. Implement it. Backports f61e5c43b86907dea17f431b528d806659d62bcb	2021-03-01 16:24:50 -05:00
Peter Maydell	3dd587e3df	target/arm: Implement new VFP fp16 insn VINS The fp16 extension includes a new instruction VINS, which copies the lower 16 bits of a 32-bit source VFP register into the upper 16 bits of the destination. Implement it. Backports commit e4875e3bcc3a9c54d7e074c8f51e04c2e6364e2e	2021-03-01 16:22:27 -05:00
Peter Maydell	90aa9647e0	target/arm: Implement VFP fp16 VRINT* Implement the fp16 version of the VFP VRINT* insns. Backports 0a6f4b4cb338665b81ad824d9a6868932461b7f7	2021-03-01 16:15:21 -05:00
Peter Maydell	1c8088b48a	target/arm: Implement VFP fp16 VSEL Implement the fp16 versions of the VFP VSEL instruction. Backports commit 11e78fecdf2d605cfed33aa09bbcf0cc4fb95886	2021-03-01 16:08:51 -05:00
Peter Maydell	beee4ad7f3	target/arm: Implement VFP vp16 VCVT-with-specified-rounding-mode Implement the fp16 versions of the VFP VCVT instruction forms which convert between floating point and integer with a specified rounding mode. Backports c505bc6a9d50a48f9d89d6cf930e863838a5b367	2021-02-28 05:18:07 -05:00
Peter Maydell	74a6af4e23	target/arm: Implement VFP fp16 VCVT between float and fixed-point Implement the fp16 versions of the VFP VCVT instruction forms which convert between floating point and fixed-point. Backports a149e2de0b63e3906729ed1d3df7d9ecdb6de5e6	2021-02-28 05:15:40 -05:00
Peter Maydell	9c5b6f06a2	target/arm: Use macros instead of open-coding fp16 conversion helpers Now the VFP_CONV_FIX macros can handle fp16's distinction between the width of the operation and the width of the type used to pass operands, use the macros rather than the open-coded functions. This creates an extra six helper functions, all of which we are going to need for the AArch32 VFP fp16 instructions. Backports commit 414ba270c4fb758d987adf37ae9bfe531715c604	2021-02-28 05:08:44 -05:00
Peter Maydell	dd6e11eaa7	target/arm: Make VFP_CONV_FIX macros take separate float type and float size Currently the VFP_CONV_FIX macros take a single fsz argument for the size of the float type, which is used both to select the name of the functions to call (eg float32_is_any_nan()) and also for the type to use for the float inputs and outputs (eg float32). Separate these into fsz and ftype arguments, so that we can use them for fp16, which uses 'float16' in the function names but is still passing inputs and outputs in a 32-bit sized type. Backports 5366f6ad7da4f6def2733ec7ee24495430256839	2021-02-28 05:05:53 -05:00
Peter Maydell	f8241ae22f	target/arm: Implement VFP fp16 VCVT between float and integer Backports 0094e9f475a5a742d10d2f1e1beceea82b69f982	2021-02-28 05:02:25 -05:00
Peter Maydell	ac9ae5cbe7	target/arm: Implement VFP fp16 VLDR and VSTR Implement the fp16 versions of the VFP VLDR/VSTR (immediate). Backports commit 274afbb121107b8aaeaa11b3e7904d5f8ae38a94	2021-02-28 04:58:32 -05:00
Peter Maydell	5d98e14545	target/arm: Implement VFP fp16 VCMP Implement fp16 version of VCMP. Backports 1b88b054c5b201e8581114d29527c6a5a7e088c9	2021-02-28 04:56:24 -05:00
Peter Maydell	25d95570f3	target/arm: Implement VFP fp16 for VMOV immediate Implement VFP fp16 support for the VMOV immediate insn. Backports commit 28c28728e53c9f4c13a5cd50f313788c7ec2f9ad	2021-02-28 04:51:11 -05:00
Peter Maydell	2d9abf7c0b	target/arm: Implement VFP fp16 for VABS, VNEG, VSQRT Implement VFP fp16 for VABS, VNEG and VSQRT. This is all the fp16 insns that use the DO_VFP_2OP macro, because there is no fp16 version of VMOV_reg. Notes: * the gen_helper_vfp_negh already exists as we needed to create it for the fp16 multiply-add insns * as usual we need to use the f16 version of the fp_status; this is only relevant for VSQRT Backports ce2d65a5d191380756cdac7a1fd1ba76bd1621cf	2021-02-28 04:48:28 -05:00
Peter Maydell	f3af6b8c25	target/arm: Macroify uses of do_vfp_2op_sp() and do_vfp_2op_dp() Macroify the uses of do_vfp_2op_sp() and do_vfp_2op_dp(); this will make it easier to add the halfprec support. Backports 009a07335b8ff492d940e1eb229a1b0d302c2512	2021-02-28 04:43:01 -05:00
Peter Maydell	6ac2c597ab	target/arm: Implement VFP fp16 for fused-multiply-add Implement VFP fp16 support for fused multiply-add insns VFNMA, VFNMS, VFMA, VFMS. Backports 9886fe2834b064a3cf0675a4659942ed547aed42	2021-02-28 04:39:21 -05:00
Peter Maydell	f86c84425b	target/arm: Macroify trans functions for VFMA, VFMS, VFNMA, VFNMS Macroify creation of the trans functions for single and double precision VFMA, VFMS, VFNMA, VFNMS. The repetition was OK for two sizes, but we're about to add halfprec and it will get a bit more than seems reasonable. Backports 2aa8dcfa14558fe2a63ed0496d60b02565c9a225	2021-02-28 04:36:07 -05:00
Peter Maydell	a42ecfe203	target/arm: Implement VFP fp16 VMLA, VMLS, VNMLS, VNMLA, VNMUL Implement fp16 versions of the VFP VMLA, VMLS, VNMLS, VNMLA, VNMUL instructions. (These are all the remaining ones which we implement via do_vfp_3op_[hsd]p().) Backports commit e7cb0ded52c6d7b86585b09935fe7caeb9e38b69	2021-02-28 04:29:37 -05:00
Peter Maydell	eae621098d	target/arm: Implement VFP fp16 for VFP_BINOP operations Implmeent VFP fp16 support for simple binary-operator VFP insns VADD, VSUB, VMUL, VDIV, VMINNM and VMAXNM: * make the VFP_BINOP() macro generate float16 helpers as well as float32 and float64 * implement a do_vfp_3op_hp() function similar to the existing do_vfp_3op_sp() * add decode for the half-precision insn patterns Note that the VFP_BINOP macro use creates a couple of unused helper functions vfp_maxh and vfp_minh, but they're small so it's not worth splitting the BINOP operations into "needs halfprec" and "no halfprec" groups. Backports commit 120a0eb3ea23a5b06fae2f3daebd46a4035864cf	2021-02-28 04:24:39 -05:00
Peter Maydell	1afb240134	target/arm: Use correct ID register check for aa32_fp16_arith The aa32_fp16_arith feature check function currently looks at the AArch64 ID_AA64PFR0 register. This is (as the comment notes) not correct. The bogus check was put in mostly to allow testing of the fp16 variants of the VCMLA instructions and it was something of a mistake that we allowed them to exist in master. Switch the feature check function to testing VMFR1.FPHP, which is what it ought to be. This will remove emulation of the VCMLA and VCADD insns from AArch32 code running on an AArch64 '-cpu max' using system emulation. (They were never enabled for aarch32 linux-user and system-emulation.) Since we weren't advertising their existence via the AArch32 ID register, well-behaved guests wouldn't have been using them anyway. Once we have implemented all the AArch32 support for the FP16 extension we will advertise it in the MVFR1 ID register field, which will reenable these insns along with all the others. Backports 02bc236d0131a666d4ac2bb7197bbad2897c336a	2021-02-27 16:47:48 -05:00
Peter Maydell	b93ca1fca6	target/arm: Remove local definitions of float constants In several places the target/arm code defines local float constants for 2, 3 and 1.5, which are also provided by include/fpu/softfloat.h. Remove the unnecessary local duplicate versions. Backports b684e49a17da39539b0ac6e4c4c98b28b38feb76	2021-02-27 16:47:10 -05:00
Chen Qun	46af765bbb	target/arm/translate-a64:Remove redundant statement in disas_simd_two_reg_misc_fp16() Clang static code analyzer show warning: target/arm/translate-a64.c:13007:5: warning: Value stored to 'rd' is never read rd = extract32(insn, 0, 5); ^ ~~~~~~~~~~~~~~~~~~~~~ target/arm/translate-a64.c:13008:5: warning: Value stored to 'rn' is never read rn = extract32(insn, 5, 5); ^ ~~~~~~~~~~~~~~~~~~~~~ Backports fa71dd531c12ad9a05cdd78392e9fc2a30ea921d	2021-02-27 16:45:25 -05:00
Chen Qun	9bac2113cd	target/arm/translate-a64:Remove dead assignment in handle_scalar_simd_shli() Clang static code analyzer show warning: target/arm/translate-a64.c:8635:14: warning: Value stored to 'tcg_rn' during its initialization is never read TCGv_i64 tcg_rn = new_tmp_a64(s); ^~~~~~ ~~~~~~~~~~~~~~ target/arm/translate-a64.c:8636:14: warning: Value stored to 'tcg_rd' during its initialization is never read TCGv_i64 tcg_rd = new_tmp_a64(s); ^~~~~~ ~~~~~~~~~~~~~~ Backports 07174c86b41e91d98ed2ee0ee12e516694853c6b	2021-02-27 16:44:29 -05:00
LIU Zhiwei	ad78fc2df5	softfloat: Define comparison operations for bfloat16 Backports c53b1079334c41b342a8ad3b7ccfd51bf5427f5	2021-02-27 16:43:10 -05:00
LIU Zhiwei	d26cd63ad6	softfloat: Define misc operations for bfloat16 Backports 5ebf5f4be66c378fd5f3dee85f54dd4942171d57	2021-02-27 16:41:46 -05:00
LIU Zhiwei	d8168a8142	softfloat: Define convert operations for bfloat16 Backports 34f0c0a98a5f3bb6706088c0384f937f7a294d3e	2021-02-27 16:37:11 -05:00
LIU Zhiwei	b0be0d28cc	softfloat: Define operations for bfloat16 Backports 8282310d8535cc2a8431c516e907da79f92df6eb	2021-02-26 15:20:30 -05:00
Stephen Long	95a0837f2d	softfloat: Add float16_is_normal This float16 predicate was missing from the normal set. Backports a03e924cf8a22888060fc0de4d91de053cd5cde4	2021-02-26 15:12:37 -05:00
Frank Chang	d97454eb63	softfloat: Add fp16 and uint8/int8 conversion functions Backports 0d93d8ec632154dea2627a9e989972ee09721187	2021-02-26 15:11:57 -05:00
Kito Cheng	76d123efee	softfloat: Implement the full set of comparisons for float16 Backports dd205025a048ef6f53ff51eb86ddc58e7a82a771	2021-02-26 15:04:12 -05:00
Lioncash	f5a21abc0b	target/arm: Convert sq{, r}dmulh to gvec for aa64 advsimd	2021-02-26 15:01:44 -05:00
Richard Henderson	aa97b6b755	target/arm: Convert integer multiply-add (indexed) to gvec for aa64 advsimd Backports 3607440c4df6498585a570cfc1041e4972b41b56	2021-02-26 14:51:17 -05:00
Richard Henderson	732674b868	target/arm: Convert integer multiply (indexed) to gvec for aa64 advsimd Backports 2e5a265e6a9e7169c4a3e87db261b2fa92582590	2021-02-26 14:46:29 -05:00
Richard Henderson	80325ac866	target/arm: Generalize inl_qrdmlah_* helper functions Unify add/sub helpers and add a parameter for rounding. This will allow saturating non-rounding to reuse this code. Backports d21798856b227a20a0a41640236af445f4f4aeb0	2021-02-26 14:41:32 -05:00
Richard Henderson	1bedcfbda3	target/arm: Tidy SVE tszimm shift formats Rather than require the user to fill in the immediate (shl or shr), create full formats that include the immediate.	2021-02-26 14:35:53 -05:00
Richard Henderson	da41a23a1b	target/arm: Split out gen_gvec_ool_zz Backports 40e32e5a8a379baf6e0d49d83cf19950cfbaf96b	2021-02-26 14:32:36 -05:00
Richard Henderson	5bd98feed9	target/arm: Split out gen_gvec_ool_zzz Backports e645d1a17a359156c6047006d760ca176d493edb	2021-02-26 14:29:48 -05:00
Richard Henderson	aa3819c396	target/arm: Split out gen_gvec_ool_zzp Model after gen_gvec_fn_zzz et al. Backports 96a461f7c12587d3a64a71e4d90cda5c09ca3eb4	2021-02-26 14:26:33 -05:00
Lioncash	2da89a626c	target/arm: Merge helper_sve_clr_* and helper_sve_movz_*	2021-02-26 14:23:06 -05:00
Richard Henderson	8eb3642d96	target/arm: Split out gen_gvec_ool_zzzp Model after gen_gvec_fn_zzz et al. Backports 36cbb7a8e7100864c488a1153cecba90b1c33a4c	2021-02-26 14:14:13 -05:00
Richard Henderson	9b3671e9ad	target/arm: Use tcg_gen_gvec_bitsel for trans_SEL_pppp The gvec operation was added after the initial implementation of the SEL instruction and was missed in the conversion. Backports d4bc623254b55e2f9613c9450216fa7e50c03929	2021-02-26 14:12:25 -05:00
Richard Henderson	c8c247410f	target/arm: Clean up 4-operand predicate expansion Move the check for !S into do_pppp_flags, which allows to merge in do_vecop4_p. Split out gen_gvec_fn_ppp without sve_access_check, to mirror gen_gvec_fn_zzz. Backport dd81a8d7cf5c90963603806e58a217bbe759f75e	2021-02-26 14:07:14 -05:00
Richard Henderson	7bef6489a8	target/arm: Merge do_vector2_p into do_mov_p This is the only user of the function Backports d0b2df5a01eeccbac71d4d883158b91e7f9a6a29	2021-02-26 13:59:00 -05:00
Richard Henderson	f329d428f3	target/arm: Rearrange {sve,fp}_check_access assert We want to ensure that access is checked by the time we ask for a specific fp/vector register. We want to ensure that we do not emit two lots of code to raise an exception. But sometimes it's difficult to cleanly organize the code such that we never pass through sve_check_access exactly once. Allow multiple calls so long as the result is true, that is, no exception to be raised. Backports 8a40fe5f1bf3837ae3f9961efe1d51e7214f2664	2021-02-26 13:56:27 -05:00
Richard Henderson	64822511dd	target/arm: Split out gen_gvec_fn_zzz, do_zzz_fn Model gen_gvec_fn_zzz on gen_gvec_fn3 in translate-a64.c, but indicating which kind of register and in which order. Model do_zzz_fn on the other do_foo functions that take an argument set and verify sve enabled. Backports 28c4da31be6a5e501b60b77bac17652dd3211378	2021-02-26 13:53:10 -05:00
Richard Henderson	3146cbb64e	target/arm: Split out gen_gvec_fn_zz Model the new function on gen_gvec_fn2 in translate-a64.c, but indicating which kind of register and in which order. Since there is only one user of do_vector2_z, fold it into do_mov_z Backports f7d79c41fa4bd0f0d27dcd14babab8575fbed39f	2021-02-26 13:50:05 -05:00
Richard Henderson	234a22803d	qemu/int128: Add int128_lshift Add left-shift to match the existing right-shift. Backports 5be4dd043f5beb5e7587d1ef8dd4e3716ec05639	2021-02-26 13:45:44 -05:00
Richard Henderson	6f341e0199	target/arm: Fill in the WnR syndrome bit in mte_check_fail According to AArch64.TagCheckFault, none of the other ISS values are provided, so we do not need to go so far as merge_syn_data_abort. But we were missing the WnR bit. Backports commit 9a4670be7f0734d27bf4058db3becf83cd0cc9d5 from qemu	2021-02-26 12:26:15 -05:00
Richard Henderson	6969435fb8	target/arm: Pass the entire mte descriptor to mte_check_fail We need more information than just the mmu_idx in order to create the proper exception syndrome. Only change the function signature so far. Backports dbf8c32178291169e111a6a9fd7ae17af4a3039d	2021-02-26 12:19:51 -05:00
Philippe Mathieu-Daudé	d4c59cce4e	target/arm: Clarify HCR_EL2 ARMCPRegInfo type In commit ce4afed839 ("target/arm: Implement AArch32 HCR and HCR2") the HCR_EL2 register has been changed from type NO_RAW (no underlying state and does not support raw access for state saving/loading) to type CONST (TCG can assume the value to be constant), removing the read/write accessors. We forgot to remove the previous type ARM_CP_NO_RAW. This is not really a problem since the field is overwritten. However it makes code review confuse, so remove it. Backports 0e5aac18bc31dbdfab51f9784240d0c31a4c5579	2021-02-26 12:18:15 -05:00
Max Filippov	d9e561ab2a	softfloat: add xtensa specialization for pickNaNMulAdd pickNaNMulAdd logic on Xtensa is to apply pickNaN to the inputs of the expression (a * b) + c. However if default NaN is produces as a result of (a * b) calculation it is not considered when c is NaN. So with two pickNaN variants there must be two pickNaNMulAdd variants. In addition the invalid flag is always set when (a * b) produces NaN. Backports commit fbcc38e4cb1b539b8615ec9b0adc285351d77628 from qemu	2021-02-26 12:16:51 -05:00
Max Filippov	fee4c62fe4	softfloat: pass float_status pointer to pickNaN Pass float_status structure pointer to the pickNaN so that machine-specific settings are available to NaN selection code. Add use_first_nan property to float_status and use it in Xtensa-specific pickNaN. Backports commit 913602e3ffe6bf50b869a14028a55cb267645ba3	2021-02-26 12:16:05 -05:00
Max Filippov	db780eff66	softfloat: make NO_SIGNALING_NANS runtime property target/xtensa, the only user of NO_SIGNALING_NANS macro has FPU implementations with and without the corresponding property. With NO_SIGNALING_NANS being a macro they cannot be a part of the same QEMU executable. Replace macro with new property in float_status to allow cores with different FPU implementations coexist. Backports cc43c6925113c5bc8f1a0205375931d2e4807c99	2021-02-26 12:11:40 -05:00
Peter Maydell	3e5aa58139	target/arm: Use correct FPST for VCMLA, VCADD on fp16 When we implemented the VCMLA and VCADD insns we put in the code to handle fp16, but left it using the standard fp status flags. Correct them to use FPST_STD_F16 for fp16 operations. Bacports commit b34aa5129e9c3aff890b4f4bcc84962e94185629	2021-02-26 12:02:23 -05:00
Peter Maydell	61377ce01c	target/arm: Implement FPST_STD_F16 fpstatus Architecturally, Neon FP16 operations use the "standard FPSCR" like all other Neon operations. However, this is defined in the Arm ARM pseudocode as "a fixed value, except that FZ16 (and AHP) follow the FPSCR bits". In QEMU, the softfloat float_status doesn't include separate flush-to-zero for FP16 operations, so we must keep separate fp_status for "Neon non-FP16" and "Neon fp16" operations, in the same way we do already for the non-Neon "fp_status" vs "fp_status_f16". Add the extra float_status field to the CPU state structure, ensure it is correctly initialized and updated on FPSCR writes, and make fpstatus_ptr(FPST_STD_F16) return a pointer to it. Backports commit aaae563bc73de0598bbc09a102e68f27fafe704a	2021-02-26 12:00:25 -05:00
Peter Maydell	b1b0a41507	target/arm: Make A32/T32 use new fpstatus_ptr() API Make A32/T32 code use the new fpstatus_ptr() API: get_fpstatus_ptr(0) -> fpstatus_ptr(FPST_FPCR) get_fpstatus_ptr(1) -> fpstatus_ptr(FPST_STD) Backports a84d1d1316726704edd2617b2c30c921d98a8137	2021-02-26 11:55:55 -05:00
Peter Maydell	79359e3a69	target/arm: Replace A64 get_fpstatus_ptr() with generic fpstatus_ptr() We currently have two versions of get_fpstatus_ptr(), which both take an effectively boolean argument: * the one for A64 takes "bool is_f16" to distinguish fp16 from other ops * the one for A32/T32 takes "int neon" to distinguish Neon from other ops This is confusing, and to implement ARMv8.2-FP16 the A32/T32 one will need to make a four-way distinction between "non-Neon, FP16", "non-Neon, single/double", "Neon, FP16" and "Neon, single/double". The A64 version will then be a strict subset of the A32/T32 version. To clean this all up, we want to go to a single implementation which takes an enum argument with values FPST_FPCR, FPST_STD, FPST_FPCR_F16, and FPST_STD_F16. We rename the function to fpstatus_ptr() so that unconverted code gets a compilation error rather than silently passing the wrong thing to the new function. This commit implements that new API, and converts A64 to use it: get_fpstatus_ptr(false) -> fpstatus_ptr(FPST_FPCR) get_fpstatus_ptr(true) -> fpstatus_ptr(FPST_FPCR_F16) Backports commit cdfb22bb7326fee607d9553358856cca341dbc9a	2021-02-26 11:46:51 -05:00
Peter Maydell	e9240f0f54	target/arm: Delete unused ARM_FEATURE_CRC In commit 962fcbf2efe57231a9f5df we converted the uses of the ARM_FEATURE_CRC bit to use the aa32_crc32 isar_feature test instead. However we forgot to remove the now-unused definition of the feature name in the enum. Delete it now. Backports commit cf6303d262e31f4812dfeb654c6c6803e52000af	2021-02-26 11:24:40 -05:00
Peter Maydell	e0000d1700	target/arm/translate.c: Delete/amend incorrect comments In arm_tr_init_disas_context() we have a FIXME comment that suggests "cpu_M0 can probably be the same as cpu_V0". This isn't in fact possible: cpu_V0 is used as a temporary inside gen_iwmmxt_shift(), and that function is called in various places where cpu_M0 contains a live value (i.e. between gen_op_iwmmxt_movq_M0_wRn() and gen_op_iwmmxt_movq_wRn_M0() calls). Remove the comment. We also have a comment on the declarations of cpu_V0/V1/M0 which claims they're "for efficiency". This isn't true with modern TCG, so replace this comment with one which notes that they're only used with the iwmmxt decode Backports 8b4c9a50dc9531a729ae4b5941d287ad0422db48	2021-02-26 11:23:52 -05:00
Peter Maydell	0759bb8eaf	target/arm: Delete unused VFP_DREG macros As part of the Neon decodetree conversion we removed all the uses of the VFP_DREG macros, but forgot to remove the macro definitions. Do so now. Backports e60527c5d501e5015a119a0388a27abeae4dac09	2021-02-26 11:22:01 -05:00
Peter Maydell	368323b03f	target/arm: Remove ARCH macro The ARCH() macro was used a lot in the legacy decoder, but there are now just two uses of it left. Since a macro which expands out to a goto is liable to be confusing when reading code, replace the last two uses with a simple open-coded qeuivalent. Backports ce51c7f522ca488c795c3510413e338021141c96	2021-02-26 11:21:20 -05:00
Peter Maydell	5d9c0addcf	target/arm: Convert T32 coprocessor insns to decodetree Convert the T32 coprocessor instructions to decodetree. As with the A32 conversion, this corrects an underdecoding where we did not check that MRRC/MCRR [24:21] were 0b0010 and so treated some kinds of LDC/STC and MRRC/MCRR rather than UNDEFing them. Backports commit 4c498dcfd84281f20bd55072630027d1b3c115fd	2021-02-26 11:19:35 -05:00
Peter Maydell	bdaaac68f5	target/arm: Do M-profile NOCP checks early and via decodetree For M-profile CPUs, the architecture specifies that the NOCP exception when a coprocessor is not present or disabled should cover the entire wide range of coprocessor-space encodings, and should take precedence over UNDEF exceptions. (This is the opposite of A-profile, where checking for a disabled FPU has to happen last.) Implement this with decodetree patterns that cover the specified ranges of the encoding space. There are a few instructions (VLLDM, VLSTM, and in v8.1 also VSCCLRM) which are in copro-space but must not be NOCP'd: these must be handled also in the new m-nocp.decode so they take precedence. This is a minor behaviour change: for unallocated insn patterns in the VFP area (cp=10,11) we will now NOCP rather than UNDEF when the FPU is disabled. As well as giving us the correct architectural behaviour for v8.1M and the recommended behaviour for v8.0M, this refactoring also removes the old NOCP handling from the remains of the 'legacy decoder' in disas_thumb2_insn(), paving the way for cleaning that up. Since we don't currently have a v8.1M feature bit or any v8.1M CPUs, the minor changes to this logic that we'll need for v8.1M are marked up with TODO comments. Backports commit a3494d4671797c291c88bd414acb0aead15f7239 from qemu	2021-02-26 11:17:23 -05:00
Peter Maydell	c675b73b1f	target/arm: Tidy up disas_arm_insn() The only thing left in the "legacy decoder" is the handling of disas_xscale_insn(), and we can simplify the code. Backports commit 8198c071bc55bee55ef4f104a5b125f541b51096	2021-02-26 10:59:09 -05:00
Peter Maydell	fc4cc9d95f	target/arm: Convert A32 coprocessor insns to decodetree Convert the A32 coprocessor instructions to decodetree. Note that this corrects an underdecoding: for the 64-bit access case (MRRC/MCRR) we did not check that bits [24:21] were 0b0010, so we would incorrectly treat LDC/STC as MRRC/MCRR rather than UNDEFing them. The decodetree versions of these insns assume the coprocessor is in the range 0..7 or 14..15. This is architecturally sensible (as per the comments) and OK in practice for QEMU because the only uses of the ARMCPRegInfo infrastructure we have that aren't for coprocessors 14 or 15 are the pxa2xx use of coprocessor 6. We add an assertion to the define_one_arm_cp_reg_with_opaque() function to catch any accidental future attempts to use it to define coprocessor registers for invalid coprocessors. Backports commit cd8be50e58f63413c033531d3273c0e44851684f from qemu	2021-02-26 10:57:00 -05:00
Peter Maydell	ef0e23f1f9	target/arm: Separate decode from handling of coproc insns As a prelude to making coproc insns use decodetree, split out the part of disas_coproc_insn() which does instruction decoding from the part which does the actual work, and make do_coproc_insn() handle the UNDEF-on-bad-permissions and similar cases itself rather than returning 1 to eventually percolate up to a callsite that calls unallocated_encoding() for it. Backports 19c23a9baafc91dd3881a7a4e9bf454e42d24e4e	2021-02-26 10:53:52 -05:00
Peter Maydell	2944a75b98	target/arm: Pull handling of XScale insns out of disas_coproc_insn() At the moment we check for XScale/iwMMXt insns inside disas_coproc_insn(): for CPUs with ARM_FEATURE_XSCALE all copro insns with cp 0 or 1 are handled specially. This works, but is an odd place for this check, because disas_coproc_insn() is called from both the Arm and Thumb decoders but the XScale case never applies for Thumb (all the XScale CPUs were ARMv5, which has only Thumb1, not Thumb2 with the 32-bit coprocessor insn encodings). It also makes it awkward to convert the real copro access insns to decodetree. Move the identification of XScale out to its own function which is only called from disas_arm_insn(). Backports commit 7b4f933db865391a90a3b4518bb2050a83f2a873 from qemu	2021-02-26 10:50:32 -05:00
LIU Zhiwei	9b7f4b72fc	target/riscv: vector single-width integer multiply instructions	2021-02-26 10:46:26 -05:00
LIU Zhiwei	ab81642440	target/riscv: vector integer min/max instructions 558fa7797c919c4f21ac10980f3ed28160d6d3cb	2021-02-26 10:43:13 -05:00
LIU Zhiwei	965af9986a	target/riscv: vector integer comparison instructions 1366fc79be04fa56a0e3f078ba4f26c27ac67e89	2021-02-26 10:40:33 -05:00
LIU Zhiwei	244793c4e8	target/riscv: vector single-width bit shift instructions Backports 3277d955d21d8943d80062b4cfd8547f831dbd51	2021-02-26 10:37:09 -05:00
LIU Zhiwei	56c0e253c2	target/riscv: vector bitwise logical instructions Backports d3842924cf93d104f691c5ea9090d6700ccef281	2021-02-26 10:30:33 -05:00
LIU Zhiwei	05153c6d7c	target/riscv: vector integer add-with-carry / subtract-with-borrow instructions 3a6f8f68ad2f4a22d9ae8287f336b5dcc80b6448	2021-02-26 10:19:48 -05:00
LIU Zhiwei	b9814de4c3	target/riscv: vector widening integer add and subtract Backports 8fcdf77630290591a6068c2d82ca2935338c3b0c	2021-02-26 10:05:43 -05:00
LIU Zhiwei	f564388e89	target/riscv: vector single-width integer add and subtract Backports 43740e3a3b3bb66456103684e622ba4e9baae297	2021-02-26 09:58:31 -05:00
LIU Zhiwei	7d0d7338c2	target/riscv: add vector amo operations Vector AMOs operate as if aq and rl bits were zero on each element with regard to ordering relative to other instructions in the same hart. Vector AMOs provide no ordering guarantee between element operations in the same vector AMO instruction Backports 268fcca66bde62257960ec8d859de374315a5e3d	2021-02-26 09:47:32 -05:00
LIU Zhiwei	152934bade	target/riscv: add fault-only-first unit stride load The unit-stride fault-only-fault load instructions are used to vectorize loops with data-dependent exit conditions(while loops). These instructions execute as a regular load except that they will only take a trap on element 0. Backports commit 022b4ecf775ffeff522eaea4f0d94edcfe00a0a9 from qemu	2021-02-26 09:28:19 -05:00
LIU Zhiwei	887c29bc79	target/riscv: add vector index load and store instructions Vector indexed operations add the contents of each element of the vector offset operand specified by vs2 to the base effective address to give the effective address of each element. Backports f732560e3551c0823cee52efba993fbb8f689a36	2021-02-26 03:00:45 -05:00
LIU Zhiwei	c7a17d04a2	target/riscv: add vector stride load and store instructions Vector strided operations access the first memory element at the base address, and then access subsequent elements at address increments given by the byte offset contained in the x register specified by rs2. Vector unit-stride operations access elements stored contiguously in memory starting from the base effective address. It can been seen as a special case of strided operations. Backports 751538d5da557e5c10e5045c2d27639580ea54a7	2021-02-26 02:55:14 -05:00
LIU Zhiwei	e4bc5056cd	target/riscv: add an internals.h header The internals.h keeps things that are not relevant to the actual architecture, only to the implementation, separate. Backports f476f17740ad42288d42dd8fedcdae8ca7007a16	2021-02-26 02:39:29 -05:00
LIU Zhiwei	9db3b70869	target/riscv: add vector configure instruction vsetvl and vsetvli are two configure instructions for vl, vtype. TB flags should update after configure instructions. The (ill, lmul, sew ) of vtype and the bit of (VSTART == 0 && VL == VLMAX) will be placed within tb_flags. Backports 2b7168fc43fb270fb89e1dddc17ef54714712f3a from qemu	2021-02-26 02:37:59 -05:00
LIU Zhiwei	0554e79ad1	target/riscv: support vector extension csr The v0.7.1 specification does not define vector status within mstatus. A future revision will define the privileged portion of the vector status. Backports 8e3a1f18871e0ea251b95561fe1ec5a9bc896c4a from qemu	2021-02-26 02:25:58 -05:00
LIU Zhiwei	bff31d8822	target/riscv: implementation-defined constant parameters vlen is the vector register length in bits. elen is the max element size in bits. vext_spec is the vector specification version, default value is v0.7.1. Backports 32931383270e2ca8209267ca99f23f3c5f780982 from qemu	2021-02-26 02:23:28 -05:00
LIU Zhiwei	0968caa249	target/riscv: add vector extension field in CPURISCVState The 32 vector registers will be viewed as a continuous memory block. It avoids the convension between element index and (regno, offset). Thus elements can be directly accessed by offset from the first vector base address. Backports ad9e5aa2ae8032f19a8293b6b8f4661c06167bf0 from qemu	2021-02-26 02:17:49 -05:00
Peter Maydell	fceb5e309a	Open 5.2 development tree Backports commit 672b2f2695891b6d818bddc3ce0df964c7627969 from qemu	2021-02-25 23:52:17 -05:00
Peter Maydell	1f497fc74a	Update version for v5.1.0 release Backports commit d0ed6a69d399ae193959225cdeaa9382746c91cc from qemu	2021-02-25 23:51:51 -05:00
Peter Maydell	3c229a2b9e	Update version for v5.1.0-rc3 release	2021-02-25 23:51:33 -05:00
Peter Maydell	0718459fb3	target/arm: Fix Rt/Rt2 in ESR_ELx for copro traps from AArch32 to 64 When a coprocessor instruction in an AArch32 guest traps to AArch32 Hyp mode, the syndrome register (HSR) includes Rt and Rt2 fields which are simply copies of the Rt and Rt2 fields from the trapped instruction. However, if the instruction is trapped from AArch32 to an AArch64 higher exception level, the Rt and Rt2 fields in the syndrome register (ESR_ELx) must be the AArch64 view of the register. This makes a difference if the AArch32 guest was in a mode other than User or System and it was using r13 or r14, or if it was in FIQ mode and using r8-r14. We don't know at translate time which AArch32 CPU mode we are in, so we leave the values we generate in our prototype syndrome register value at translate time as the raw Rt/Rt2 from the instruction, and instead correct them to the AArch64 view when we find we need to take an exception from AArch32 to AArch64 with one of these syndrome values. Fixes: https://bugs.launchpad.net/qemu/+bug/1879587 Backports commit a65dabf71a9f9b949d556b1b57fd72595df92398 from qemu	2021-02-25 23:50:18 -05:00
Peter Collingbourne	7de60dfa51	target/arm: Fix decode of LDRA[AB] instructions These instructions use zero as the discriminator, not SP. Backports commit d250bb19ced3b702c7c37731855f6876d0cc7995 from qemu	2021-02-25 23:47:25 -05:00
Kaige Li	3004cc1f97	target/arm: Avoid maybe-uninitialized warning with gcc 4.9 GCC version 4.9.4 isn't clever enough to figure out that all execution paths in disas_ldst() that use 'fn' will have initialized it first, and so it warns: /home/LiKaige/qemu/target/arm/translate-a64.c: In function ‘disas_ldst’: /home/LiKaige/qemu/target/arm/translate-a64.c:3392:5: error: ‘fn’ may be used uninitialized in this function [-Werror=maybe-uninitialized] fn(cpu_reg(s, rt), clean_addr, tcg_rs, get_mem_index(s), ^ /home/LiKaige/qemu/target/arm/translate-a64.c:3318:22: note: ‘fn’ was declared here AtomicThreeOpFn *fn; ^ Make it happy by initializing the variable to NULL. Backports commit 88a90e3de6ae99cbcfcc04c862c51f241fdf685f from qemu	2021-02-25 23:45:13 -05:00
Richard Henderson	ce8282d9cd	target/arm: Fix AddPAC error indication The definition of top_bit used in this function is one higher than that used in the Arm ARM psuedo-code, which put the error indication at top_bit - 1 at the wrong place, which meant that it wasn't visible to Auth. Fixing the definition of top_bit requires more changes, because its most common use is for the count of bits in top_bit:bot_bit, which would then need to be computed as top_bit - bot_bit + 1. For now, prefer the minimal fix to the error indication alone. Fixes: 63ff0ca94cb Backports commit 8796fe40dd30cd9ffd3c958906471715c923b341 from qemu	2021-02-25 23:44:28 -05:00
Peter Maydell	4952920d4d	Update version for v5.1.0-rc2 release Backports commit 5772f2b1fc5d00e7e04e01fa28e9081d6550440a from qemu	2021-02-25 23:43:39 -05:00
Lioncash	a1e8e0adff	target/arm: Fix bad rebase within do_mem_zpz	2021-02-25 23:43:16 -05:00
Richard Henderson	5e1316a92e	target/arm: Always pass cacheattr in S1_ptw_translate When we changed the interface of get_phys_addr_lpae to require the cacheattr parameter, this spot was missed. The compiler is unable to detect the use of NULL vs the nonnull attribute here. Fixes: 7e98e21c098 Backports commit a6d6f37aed4b171d121cd4a9363fbb41e90dcb53 from qemu	2021-02-25 23:40:32 -05:00
Laszlo Ersek	40c04c73b0	target/i386: floatx80: avoid compound literals in static initializers Quoting ISO C99 6.7.8p4, "All the expressions in an initializer for an object that has static storage duration shall be constant expressions or string literals". The compound literal produced by the make_floatx80() macro is not such a constant expression, per 6.6p7-9. (An implementation may accept it, according to 6.6p10, but is not required to.) Therefore using "floatx80_zero" and make_floatx80() for initializing "f2xm1_table" and "fpatan_table" is not portable. And gcc-4.8 in RHEL-7.6 actually chokes on them: > target/i386/fpu_helper.c:871:5: error: initializer element is not constant > { make_floatx80(0xbfff, 0x8000000000000000ULL), > ^ We've had the make_floatx80_init() macro for this purpose since commit 3bf7e40ab914 ("softfloat: fix for C99", 2012-03-17), so let's use that macro again. Fixes: eca30647fc0 ("target/i386: reimplement f2xm1 using floatx80 operations") Fixes: ff57bb7b632 ("target/i386: reimplement fpatan using floatx80 operations") Backports commit 163b3d1af2552845a60967979aca8d78a6b1b088 from qemu	2021-02-25 23:38:54 -05:00
Richard Henderson	6390789a09	target/i386: Save cc_op before loop insns We forgot to update cc_op before these branch insns, which lead to losing track of the current eflags. Buglink: https://bugs.launchpad.net/qemu/+bug/1888165 Backports commit 3cb3a7720b01830abd5fbb81819dbb9271bf7821 from qemu	2021-02-25 23:36:43 -05:00
Zong Li	001d2e6a29	target/riscv: Fix the range of pmpcfg of CSR funcion table Backports commit 8ba26b0b2b00dd5849a6c0981e358dc7a7cc315d from qemu	2021-02-25 23:35:21 -05:00
Peter Maydell	08ce565d7c	Update version for v5.1.0-rc1 release Backports commit c8004fe6bbfc0d9c2e7b942c418a85efb3ac4b00 from qemu	2021-02-25 23:34:20 -05:00
Richard Henderson	55369d710c	tcg: Save/restore vecop_list around minmax fallback Forgetting this asserts when tcg_gen_cmp_vec is called from within tcg_gen_cmpsel_vec. Fixes: 72b4c792c7a Backports commit 69c918d2ef319ac63cd759c527debc2a2bdf3a0c from qemu	2021-02-25 23:33:24 -05:00
Chenyi Qiang	e5d9e0ed53	target/i386: add fast short REP MOV support For CPUs support fast short REP MOV[CPUID.(EAX=7,ECX=0):EDX(bit4)], e.g Icelake and Tigerlake, expose it to the guest VM. Backports commit 5cb287d2bd578dfe4897458793b4fce35bc4f744 from qemu	2021-02-25 23:31:42 -05:00
Peter Maydell	113dc25fbf	Update version for v5.1.0-rc0 release Backports commit 8746309137ba470d1b2e8f5ce86ac228625db940 from qemu	2021-02-25 23:30:37 -05:00
Aaron Lindsay	e532ce610e	target/arm: Don't do raw writes for PMINTENCLR Raw writes to this register when in KVM mode can cause interrupts to be raised (even when the PMU is disabled). Because the underlying state is already aliased to PMINTENSET (which already provides raw write functions), we can safely disable raw accesses to PMINTENCLR entirely. Backports commit 887c0f1544991f567543b7c214aa11ab0cea0a29 from qemu	2021-02-25 23:27:47 -05:00
Richard Henderson	f403c1f54f	target/arm: Fix mtedesc for do_mem_zpz The mtedesc that was constructed was not actually passed in. Found by Coverity (CID 1429996). Backports commit cdecb3fc1eb182d90666348a47afe63c493686e7 from qemu	2021-02-25 23:25:54 -05:00
Paolo Bonzini	5b794349d3	target/i386: implement undocumented 'smsw r32' behavior In 32-bit mode, the higher 16 bits of the destination register are undefined. In practice CR0[31:0] is stored, just like in 64-bit mode, so just remove the "if" that currently differentiates the behavior. Backports commit c0c8445255b2b5b440c355431c8b01b7b7b7c8cf from qemu	2021-02-25 23:23:51 -05:00
Joseph Myers	cf54c51869	target/i386: fix IEEE SSE floating-point exception raising The SSE instruction implementations all fail to raise the expected IEEE floating-point exceptions because they do nothing to convert the exception state from the softfloat machinery into the exception flags in MXCSR. Fix this by adding such conversions. Unlike for x87, emulated SSE floating-point operations might be optimized using hardware floating point on the host, and so a different approach is taken that is compatible with such optimizations. The required invariant is that all exceptions set in env->sse_status (other than "denormal operand", for which the SSE semantics are different from those in the softfloat code) are ones that are set in the MXCSR; the emulated MXCSR is updated lazily when code reads MXCSR, while when code sets MXCSR, the exceptions in env->sse_status are set accordingly. A few instructions do not raise all the exceptions that would be raised by the softfloat code, and those instructions are made to save and restore the softfloat exception state accordingly. Nothing is done about "denormal operand"; setting that (only for the case when input denormals are not flushed to zero, the opposite of the logic in the softfloat code for such an exception) will require custom code for relevant instructions, or else architecture-specific conditionals in the softfloat code for when to set such an exception together with custom code for various SSE conversion and rounding instructions that do not set that exception. Nothing is done about trapping exceptions (for which there is minimal and largely broken support in QEMU's emulation in the x87 case and no support at all in the SSE case). Backports commit 418b0f93d12a1589d5031405de857844f32e9ccc from qemu	2021-02-25 23:21:32 -05:00
Joseph Myers	fd5b0dd456	target/i386: set SSE FTZ in correct floating-point state The code to set floating-point state when MXCSR changes calls set_flush_to_zero on &env->fp_status, so affecting the x87 floating-point state rather than the SSE state. Fix to call it for &env->sse_status instead. Backports commit 3ddc0eca2229846bfecc3485648a6cb85a466dc7 from qemu	2021-02-25 23:15:53 -05:00
Laurent Vivier	c15ddf11dd	softfloat,m68k: disable floatx80_invalid_encoding() for m68k According to the comment, this definition of invalid encoding is given by intel developer's manual, and doesn't comply with 680x0 FPU. With m68k, the explicit integer bit can be zero in the case of: - zeros (exp == 0, mantissa == 0) - denormalized numbers (exp == 0, mantissa != 0) - unnormalized numbers (exp != 0, exp < 0x7FFF) - infinities (exp == 0x7FFF, mantissa == 0) - not-a-numbers (exp == 0x7FFF, mantissa != 0) For infinities and NaNs, the explicit integer bit can be either one or zero. The IEEE 754 standard does not define a zero integer bit. Such a number is an unnormalized number. Hardware does not directly support denormalized and unnormalized numbers, but implicitly supports them by trapping them as unimplemented data types, allowing efficient conversion in software. See "M68000 FAMILY PROGRAMMER’S REFERENCE MANUAL", "1.6 FLOATING-POINT DATA TYPES" We will implement in the m68k TCG emulator the FP_UNIMP exception to trap into the kernel to normalize the number. In case of linux-user, the number will be normalized by QEMU. Backports commit d159dd058c7dc48a9291fde92eaae52a9f26a4d1 from qemu	2021-02-25 23:14:47 -05:00
Mark Cave-Ayland	db742bec00	target/m68k: consolidate physical translation offset into get_physical_address() Since all callers to get_physical_address() now apply the same page offset to the translation result, move the logic into get_physical_address() itself to avoid duplication. Backports commit 852002b5664bf079da05c5201dbf2345b870e5ed from qemu	2021-02-25 23:13:48 -05:00
Mark Cave-Ayland	3b2bc4b0c8	target/m68k: fix physical address translation in m68k_cpu_get_phys_page_debug() The result of the get_physical_address() function should be combined with the offset of the original page access before being returned. Otherwise the m68k_cpu_get_phys_page_debug() function can round to the wrong page causing incorrect lookups in gdbstub and various "Disassembler disagrees with translator over instruction decoding" warnings to appear at translation time. Fixes: 88b2fef6c3 ("target/m68k: add MC68040 MMU")	2021-02-25 23:12:12 -05:00
Richard Henderson	65d5288563	tcg: Fix do_nonatomic_op_* vs signed operations The smin/smax/umin/umax operations require the operands to be properly sign extended. Do not drop the MO_SIGN bit from the load, and additionally extend the val input. Backports commit 852f933e482518797f7785a2e017a215b88df815 from qemu	2021-02-25 23:10:40 -05:00
Richard Henderson	57c66389c2	target/arm: Fix temp double-free in sve ldr/str The temp that gets assigned to clean_addr has been allocated with new_tmp_a64, which means that it will be freed at the end of the instruction. Freeing it earlier leads to assertion failure. The loop creates a complication, in which we allocate a new local temp, which does need freeing, and the final code path is shared between the loop and non-loop. Fix this complication by adding new_tmp_a64_local so that the new local temp is freed at the end, and can be treated exactly like the non-loop path. Fixes: bba87d0a0f4 Backports commit 4b4dc9750a0aa0b9766bd755bf6512a84744ce8a from qemu	2021-02-25 23:10:37 -05:00
Richard Henderson	54e2107bdf	target/arm: Enable MTE We now implement all of the components of MTE, without actually supporting any tagged memory. All MTE instructions will work, trivially, so we can enable support. Backports commit c7459633baa71d1781fde4a245d6ec9ce2f008cf from qemu	2021-02-25 23:00:27 -05:00
Richard Henderson	a34fda25b0	target/arm: Add allocation tag storage for system mode Look up the physical address for the given virtual address, convert that to a tag physical address, and finally return the host address that backs it. Backports commit e4d5bf4fbd5abfc3727e711eda64a583cab4d637 from qemu	2021-02-25 22:58:56 -05:00
Richard Henderson	9b6c64f8f8	target/arm: Create tagged ram when MTE is enabled Backports commit 8bce44a2f6beb388a3f157652b46e99929839a96 from qemu	2021-02-25 22:51:23 -05:00
Richard Henderson	2ea0b53c1a	target/arm: Cache the Tagged bit for a page in MemTxAttrs This "bit" is a particular value of the page's MemAttr. Backports commit 337a03f07ff0f9e6295662f4094e03a045b60bdc from qemu	2021-02-25 22:48:04 -05:00
Richard Henderson	28cd096d67	target/arm: Always pass cacheattr to get_phys_addr We need to check the memattr of a page in order to determine whether it is Tagged for MTE. Between Stage1 and Stage2, this becomes simpler if we always collect this data, instead of occasionally being presented with NULL. Use the nonnull attribute to allow the compiler to check that all pointer arguments are non-null. Backports commit 7e98e21c09871cddc20946c8f3f3595e93154ecb from qemu	2021-02-25 22:46:00 -05:00
Richard Henderson	e2456a83a4	target/arm: Set PSTATE.TCO on exception entry D1.10 specifies that exception handlers begin with tag checks overridden. Backports commit 34669338bd9d66255fceaa84c314251ca49ca8d5 from qemu	2021-02-25 22:41:26 -05:00
Richard Henderson	35d0443056	target/arm: Implement data cache set allocation tags This is DC GVA and DC GZVA, and the tag check for DC ZVA. Backports commit eb821168db798302bd124a3b000cebc23bd0a395 from qemu	2021-02-25 22:40:08 -05:00
Richard Henderson	33f5bdabb1	target/arm: Complete TBI clearing for user-only for SVE There are a number of paths by which the TBI is still intact for user-only in the SVE helpers. Because we currently always set TBI for user-only, we do not need to pass down the actual TBI setting from above, and we can remove the top byte in the inner-most primitives, so that none are forgotten. Moreover, this keeps the "dirty" pointer around at the higher levels, where we need it for any MTE checking. Since the normal case, especially for user-only, goes through RAM, this clearing merely adds two insns per page lookup, which will be completely in the noise. Backports commit c4af8ba19b9d22aac79cab679a20b159af9d6809 from qemu	2021-02-25 22:37:12 -05:00
Richard Henderson	732efce958	target/arm: Add mte helpers for sve scatter/gather memory ops Because the elements are non-sequential, we cannot eliminate many tests straight away like we can for sequential operations. But we often have the PTE details handy, so we can test for Tagged. Backports commit d28d12f008ee44dc2cc2ee5d8f673be9febc951e from qemu	2021-02-25 22:34:24 -05:00
Richard Henderson	5698b7badb	target/arm: Handle TBI for sve scalar + int memory ops We still need to handle tbi for user-only when mte is inactive. Backports commit 9473d0ecafcffc8b258892b1f9f18e037bdba958 from qemu	2021-02-25 22:17:46 -05:00
Richard Henderson	586235d02d	target/arm: Add mte helpers for sve scalar + int ff/nf loads Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit aa13f7c3c378fa41366b9fcd6c29af1c3d81126a from qemu	2021-02-25 22:09:17 -05:00
Richard Henderson	cb31d54b18	target/arm: Add mte helpers for sve scalar + int stores Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit 71b9f3948c75bb97641a3c8c7de96d1cb47cdc07 from qemu	2021-02-25 21:53:55 -05:00
Richard Henderson	670b25c5fa	target/arm: Add mte helpers for sve scalar + int loads Because the elements are sequential, we can eliminate many tests all at once when the tag hits TCMA, or if the page(s) are not Tagged. Backports commit 206adacfb8d35e671e3619591608c475aa046b63 from qemu	2021-02-25 21:45:32 -05:00
Richard Henderson	6a78133659	target/arm: Remove sve_memopidx None of the sve helpers use TCGMemOpIdx any longer, so we can stop passing it. Backports commit ba080b8682fc6bde7f2d9dedddb519d63cbe138f from qemu	2021-02-25 21:33:44 -05:00
Richard Henderson	1f306230d4	target/arm: Reuse sve_probe_page for gather loads Backports commit 10a85e2c8ab6e004e7f3f1dcfea8cb0bf58fb9fb from qemu	2021-02-25 21:30:13 -05:00
Richard Henderson	585da952ec	target/arm: Reuse sve_probe_page for scatter stores Backports commit 88a660a48ef513ce9875b595e19b2a820b3f3fca from qemu	2021-02-25 21:27:14 -05:00
Richard Henderson	3eee880c2a	target/arm: Reuse sve_probe_page for gather first-fault loads This avoids the need for a separate set of helpers to implement no-fault semantics, and will enable MTE in the future. Backports commit 50de9b78cec06e6d16e92a114a505779359ca532 from qemu	2021-02-25 21:22:16 -05:00
Richard Henderson	b1e31f3bf3	target/arm: Use SVEContLdSt for contiguous stores Follow the model set up for contiguous loads. This handles watchpoints correctly for contiguous stores, recognizing the exception before any changes to memory. Backports commit 0fa476c1bb37a70df7eeff1e5bfb4791feb37e0e from qemu	2021-02-25 21:15:14 -05:00
Richard Henderson	3591c2f548	target/arm: Update contiguous first-fault and no-fault loads With sve_cont_ldst_pages, the differences between first-fault and no-fault are minimal, so unify the routines. With cpu_probe_watchpoint, we are able to make progress through pages with TLB_WATCHPOINT set when the watchpoint does not actually fire. Backports commit c647673ce4d72a8789703c62a7f3cbc732cb1ea8 from qemu	2021-02-25 21:06:14 -05:00
Richard Henderson	6c9304448e	target/arm: Use SVEContLdSt for multi-register contiguous loads Backports commit 5c9b8458a0b3008d24d84b67e1c9b6d5f39f4d66 from qemu	2021-02-25 20:50:22 -05:00
Richard Henderson	3979c8f73e	target/arm: Handle watchpoints in sve_ld1_r Handle all of the watchpoints for active elements all at once, before we've modified the vector register. This removes the TLB_WATCHPOINT bit from page[].flags, which means that we can use the normal fast path via RAM. Backports commit 4bcc3f0ff8e5ae2b17b5aab9aa613ff1b8025896 from qemu	2021-02-25 20:44:13 -05:00
Richard Henderson	0e5aa37c9a	target/arm: Use SVEContLdSt in sve_ld1_r First use of the new helper functions, so we can remove the unused markup. No longer need a scratch for user-only, as we completely probe the page set before reading; system mode still requires a scratch for MMIO. Backports commit b854fd06a868e0308bcfe05ad0a71210705814c7 from qemu	2021-02-25 20:41:53 -05:00
Richard Henderson	d363c3d0ba	target/arm: Adjust interface of sve_ld1_host_fn The current interface includes a loop; change it to load a single element. We will then be able to use the function for ld{2,3,4} where individual vector elements are not adjacent. Replace each call with the simplest possible loop over active elements. Backports commit cf4a49b71b1712142d7122025a8ca7ea5b59d73f from qemu	2021-02-25 20:34:18 -05:00
Richard Henderson	94b0876f15	target/arm: Add sve infrastructure for page lookup For contiguous predicated memory operations, we want to minimize the number of tlb lookups performed. We have open-coded this for sve_ld1_r, but for correctness with MTE we will need this for all of the memory operations. Create a structure that holds the bounds of active elements, and metadata for two pages. Add routines to find those active elements, lookup the pages, and run watchpoints for those pages. Temporarily mark the functions unused to avoid Werror. Backports commit b4cd95d2f4c7197b844f51b29871d888063ea3e7 from qemu	2021-02-25 20:28:23 -05:00
Richard Henderson	f430a399d4	target/arm: Drop manual handling of set/clear_helper_retaddr Since we converted back to cpu_*_data_ra, we do not need to do this ourselves. Backports commit f32e2ab65f3a0fc03d58936709e5a565c4b0db50 from qemu	2021-02-25 20:20:29 -05:00
Richard Henderson	2e03f74a53	target/arm: Use cpu_*_data_ra for sve_ldst_tlb_fn Use the "normal" memory access functions, rather than the softmmu internal helper functions directly. Since fb901c9, cpu_mem_index is now a simple extract from env->hflags and not a large computation. Which means that it's now more work to pass around this value than it is to recompute it. This only adjusts the primitives, and does not clean up all of the uses within sve_helper.c.	2021-02-25 20:16:38 -05:00
Richard Henderson	84012be55c	target/arm: Add arm_tlb_bti_gp Introduce an lvalue macro to wrap target_tlb_bit0. Backports commit 149d3b31f3f0f7f9e1c3a77043450a95c7a7e93d from qemu	2021-02-25 17:45:50 -05:00
Richard Henderson	a9eb62d211	target/arm: Tidy trans_LD1R_zpri Move the variable declarations to the top of the function, but do not create a new label before sve_access_check. Backports commit c0ed9166b1aea86a2fbaada1195aacd1049f9e85 from qemu	2021-02-25 17:42:40 -05:00
Richard Henderson	49bd9a5c68	target/arm: Use mte_checkN for sve unpredicated stores Backports commit bba87d0a0f480805223a6428a7942a51733c488a from qemu	2021-02-25 17:40:43 -05:00
Richard Henderson	3ce14ebc78	target/arm: Use mte_checkN for sve unpredicated loads Backports commit b2aa8879b884cd66acde4123899dd92a38fe6527 from qemu	2021-02-25 17:26:37 -05:00
Richard Henderson	4fdd05e1aa	target/arm: Add helper_mte_check_zva Use a special helper for DC_ZVA, rather than the more general mte_checkN. Backports commit 46dc1bc0601554823a42ad27f236da2ad8f3bdc6 from qemu	2021-02-25 17:17:54 -05:00
Richard Henderson	9a05ca01e7	target/arm: Implement helper_mte_checkN Fill out the stub that was added earlier. Backports commit 5add8248556a3c1006018d7d8e601c9572b280a9 from qemu	2021-02-25 17:10:56 -05:00
Richard Henderson	91e2f55b69	target/arm: Implement helper_mte_check1 Fill out the stub that was added earlier. Backports commit 2e34ff45f32cb032883616a1cc5ea8ac96f546d5 from qemu	2021-02-25 17:02:34 -05:00
Richard Henderson	3e786526cf	target/arm: Add gen_mte_checkN Replace existing uses of check_data_tbi in translate-a64.c that perform multiple logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 73ceeb0011b23bac8bd2c09ebe3c18d034aa69ce from qemu	2021-02-25 16:40:16 -05:00
Richard Henderson	582e64f348	target/arm: Add gen_mte_check1 Replace existing uses of check_data_tbi in translate-a64.c that perform a single logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 0a405be2b8fd9506a009b10d7d2d98c394b36db6 from qemu	2021-02-25 16:13:31 -05:00
Richard Henderson	4488858072	target/arm: Move regime_tcr to internals.h We will shortly need this in mte_helper.c as well. Backports commit 38659d311d05e6c5feff6bddcc1c33b60d3b86a1 from qemu	2021-02-25 16:04:54 -05:00
Richard Henderson	e46cec8543	target/arm: Move regime_el to internals.h We will shortly need this in mte_helper.c as well. Backports commit 9c7ab8fc8cb6d6e2fb7a82c1088691c7c23fa1b9 from qemu	2021-02-25 16:04:12 -05:00
Richard Henderson	7147f5f28a	target/arm: Implement the access tag cache flushes Like the regular data cache flushes, these are nops within qemu. Backports commit 5463df160ecee510e78493993eb1bd38b4838a10 from qemu	2021-02-25 16:02:30 -05:00

... 3 4 5 6 7 ...

5864 commits