unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-01-08 18:35:44 +00:00

Author	SHA1	Message	Date
Richard Henderson	3591c2f548	target/arm: Update contiguous first-fault and no-fault loads With sve_cont_ldst_pages, the differences between first-fault and no-fault are minimal, so unify the routines. With cpu_probe_watchpoint, we are able to make progress through pages with TLB_WATCHPOINT set when the watchpoint does not actually fire. Backports commit c647673ce4d72a8789703c62a7f3cbc732cb1ea8 from qemu	2021-02-25 21:06:14 -05:00
Richard Henderson	6c9304448e	target/arm: Use SVEContLdSt for multi-register contiguous loads Backports commit 5c9b8458a0b3008d24d84b67e1c9b6d5f39f4d66 from qemu	2021-02-25 20:50:22 -05:00
Richard Henderson	3979c8f73e	target/arm: Handle watchpoints in sve_ld1_r Handle all of the watchpoints for active elements all at once, before we've modified the vector register. This removes the TLB_WATCHPOINT bit from page[].flags, which means that we can use the normal fast path via RAM. Backports commit 4bcc3f0ff8e5ae2b17b5aab9aa613ff1b8025896 from qemu	2021-02-25 20:44:13 -05:00
Richard Henderson	0e5aa37c9a	target/arm: Use SVEContLdSt in sve_ld1_r First use of the new helper functions, so we can remove the unused markup. No longer need a scratch for user-only, as we completely probe the page set before reading; system mode still requires a scratch for MMIO. Backports commit b854fd06a868e0308bcfe05ad0a71210705814c7 from qemu	2021-02-25 20:41:53 -05:00
Richard Henderson	d363c3d0ba	target/arm: Adjust interface of sve_ld1_host_fn The current interface includes a loop; change it to load a single element. We will then be able to use the function for ld{2,3,4} where individual vector elements are not adjacent. Replace each call with the simplest possible loop over active elements. Backports commit cf4a49b71b1712142d7122025a8ca7ea5b59d73f from qemu	2021-02-25 20:34:18 -05:00
Richard Henderson	94b0876f15	target/arm: Add sve infrastructure for page lookup For contiguous predicated memory operations, we want to minimize the number of tlb lookups performed. We have open-coded this for sve_ld1_r, but for correctness with MTE we will need this for all of the memory operations. Create a structure that holds the bounds of active elements, and metadata for two pages. Add routines to find those active elements, lookup the pages, and run watchpoints for those pages. Temporarily mark the functions unused to avoid Werror. Backports commit b4cd95d2f4c7197b844f51b29871d888063ea3e7 from qemu	2021-02-25 20:28:23 -05:00
Richard Henderson	f430a399d4	target/arm: Drop manual handling of set/clear_helper_retaddr Since we converted back to cpu_*_data_ra, we do not need to do this ourselves. Backports commit f32e2ab65f3a0fc03d58936709e5a565c4b0db50 from qemu	2021-02-25 20:20:29 -05:00
Richard Henderson	2e03f74a53	target/arm: Use cpu_*_data_ra for sve_ldst_tlb_fn Use the "normal" memory access functions, rather than the softmmu internal helper functions directly. Since fb901c9, cpu_mem_index is now a simple extract from env->hflags and not a large computation. Which means that it's now more work to pass around this value than it is to recompute it. This only adjusts the primitives, and does not clean up all of the uses within sve_helper.c.	2021-02-25 20:16:38 -05:00
Richard Henderson	84012be55c	target/arm: Add arm_tlb_bti_gp Introduce an lvalue macro to wrap target_tlb_bit0. Backports commit 149d3b31f3f0f7f9e1c3a77043450a95c7a7e93d from qemu	2021-02-25 17:45:50 -05:00
Richard Henderson	a9eb62d211	target/arm: Tidy trans_LD1R_zpri Move the variable declarations to the top of the function, but do not create a new label before sve_access_check. Backports commit c0ed9166b1aea86a2fbaada1195aacd1049f9e85 from qemu	2021-02-25 17:42:40 -05:00
Richard Henderson	49bd9a5c68	target/arm: Use mte_checkN for sve unpredicated stores Backports commit bba87d0a0f480805223a6428a7942a51733c488a from qemu	2021-02-25 17:40:43 -05:00
Richard Henderson	3ce14ebc78	target/arm: Use mte_checkN for sve unpredicated loads Backports commit b2aa8879b884cd66acde4123899dd92a38fe6527 from qemu	2021-02-25 17:26:37 -05:00
Richard Henderson	4fdd05e1aa	target/arm: Add helper_mte_check_zva Use a special helper for DC_ZVA, rather than the more general mte_checkN. Backports commit 46dc1bc0601554823a42ad27f236da2ad8f3bdc6 from qemu	2021-02-25 17:17:54 -05:00
Richard Henderson	9a05ca01e7	target/arm: Implement helper_mte_checkN Fill out the stub that was added earlier. Backports commit 5add8248556a3c1006018d7d8e601c9572b280a9 from qemu	2021-02-25 17:10:56 -05:00
Richard Henderson	91e2f55b69	target/arm: Implement helper_mte_check1 Fill out the stub that was added earlier. Backports commit 2e34ff45f32cb032883616a1cc5ea8ac96f546d5 from qemu	2021-02-25 17:02:34 -05:00
Richard Henderson	3e786526cf	target/arm: Add gen_mte_checkN Replace existing uses of check_data_tbi in translate-a64.c that perform multiple logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 73ceeb0011b23bac8bd2c09ebe3c18d034aa69ce from qemu	2021-02-25 16:40:16 -05:00
Richard Henderson	582e64f348	target/arm: Add gen_mte_check1 Replace existing uses of check_data_tbi in translate-a64.c that perform a single logical memory access. Leave the helper blank for now to reduce the patch size. Backports commit 0a405be2b8fd9506a009b10d7d2d98c394b36db6 from qemu	2021-02-25 16:13:31 -05:00
Richard Henderson	4488858072	target/arm: Move regime_tcr to internals.h We will shortly need this in mte_helper.c as well. Backports commit 38659d311d05e6c5feff6bddcc1c33b60d3b86a1 from qemu	2021-02-25 16:04:54 -05:00
Richard Henderson	e46cec8543	target/arm: Move regime_el to internals.h We will shortly need this in mte_helper.c as well. Backports commit 9c7ab8fc8cb6d6e2fb7a82c1088691c7c23fa1b9 from qemu	2021-02-25 16:04:12 -05:00
Richard Henderson	7147f5f28a	target/arm: Implement the access tag cache flushes Like the regular data cache flushes, these are nops within qemu. Backports commit 5463df160ecee510e78493993eb1bd38b4838a10 from qemu	2021-02-25 16:02:30 -05:00
Richard Henderson	4bb37fc3c1	target/arm: Implement the LDGM, STGM, STZGM instructions Backports commit 5f716a82388eb09754dd900e7dbb8ffa15897a28 from qemu	2021-02-25 16:00:50 -05:00
Richard Henderson	5b3ddcf2e2	target/arm: Simplify DC_ZVA Now that we know that the operation is on a single page, we need not loop over pages while probing. Backports commit e26d0d226892f67435cadcce86df0ddfb9943174 from qemu	2021-02-25 15:55:46 -05:00
Richard Henderson	7de60598d5	target/arm: Restrict the values of DCZID.BS under TCG We can simplify our DC_ZVA if we recognize that the largest BS that we actually use in system mode is 64. Let us just assert that it fits within TARGET_PAGE_SIZE. For DC_GVA and STZGM, we want to be able to write whole bytes of tag memory, so assert that BS is >= 2 * TAG_GRANULE, or 32. Backports commit a4157b80242bf1c8aa0ee77aae7458ba79012d5d from qemu	2021-02-25 15:12:20 -05:00
Richard Henderson	e15aa7c5a7	target/arm: Implement the STGP instruction Backports commit 6439d67fc944cf29de94a160e9450a2063c7b515 from qemu	2021-02-25 15:10:40 -05:00
Richard Henderson	e8b9cb8b4a	target/arm: Implement LDG, STG, ST2G instructions Backports commit c15294c1e36a7dd9b25bd54d98178e80f4b64bc1 from qemu	2021-02-25 15:08:44 -05:00
Richard Henderson	448fc3ae4a	target/arm: Define arm_cpu_do_unaligned_access for user-only Use the same code as system mode, so that we generate the same exception + syndrome for the unaligned access. For the moment, if MTE is enabled so that this path is reachable, this would generate a SIGSEGV in the user-only cpu_loop. Decoding the syndrome to produce the proper SIGBUS will be done later. Backports commit 0d1762e931f8a694f261c604daba605bcda70928 from qemu	2021-02-25 14:51:19 -05:00
Richard Henderson	43ea7aa828	target/arm: Implement the SUBP instruction Backports commit dad3015f55f8d48f84f0eae36021a9c6f9587e57 from qemu	2021-02-25 14:46:00 -05:00
Richard Henderson	13fd83fcc9	target/arm: Implement the GMI instruction Backports commit 438efea0bb639c9c2dfb42c8d9459e21aa183c8a from qemu	2021-02-25 14:44:29 -05:00
Richard Henderson	911a6b57ed	target/arm: Implement the ADDG, SUBG instructions Backports commit efbc78ad978763aedd11cb718eb1ff8db3fc9152 from qemu	2021-02-25 14:42:33 -05:00
Richard Henderson	acd7e4cb18	target/arm: Revise decoding for disas_add_sub_imm The current Arm ARM has adjusted the official decode of "Add/subtract (immediate)" so that the shift field is only bit 22, and bit 23 is part of the op1 field of the parent category "Data processing - immediate". Backports commit 21a8b343eaae63f6984f9a200092b0ea167647f1 from qemu	2021-02-25 14:38:46 -05:00
Richard Henderson	58f3dd2cc7	target/arm: Implement the IRG instruction Backports commit da54941f45b820cbaca72aa6efd5669b3dc86e2f from qemu	2021-02-25 14:36:11 -05:00
Richard Henderson	6bec295bf8	target/arm: Add MTE bits to tb_flags Cache the composite ATA setting. Cache when MTE is fully enabled, i.e. access to tags are enabled and tag checks affect the PE. Do this for both the normal context and the UNPRIV context. Backports commit 81ae05fa2d21ac1a0054935b74342aa38a5ecef7 from qemu	2021-02-25 14:31:41 -05:00
Richard Henderson	f6be2a1a42	target/arm: Add MTE system registers This is TFSRE0_EL1, TFSR_EL1, TFSR_EL2, TFSR_EL3, RGSR_EL1, GCR_EL1, GMID_EL1, and PSTATE.TCO. Backports commit 4b779cebb3e5ab30b945181f1ba3932f5f8a1cb5 from qemu	2021-02-25 14:12:24 -05:00
Richard Henderson	179a3aacdf	target/arm: Add DISAS_UPDATE_NOCHAIN Add an option that writes back the PC, like DISAS_UPDATE_EXIT, but does not exit back to the main loop. Backports commit 329833286d7a1b0ef8c7daafe13c6ae32429694e from qemu	2021-02-25 14:08:08 -05:00
Richard Henderson	eaa6291aa7	target/arm: Rename DISAS_UPDATE to DISAS_UPDATE_EXIT Emphasize that the is_jmp option exits to the main loop. Backports commit 14407ec2007e18536ed34772eef46f6e0a0e3d0e from qemu	2021-02-25 14:02:46 -05:00
Richard Henderson	2540911bdd	target/arm: Add support for MTE to SCTLR_ELx target/arm: Add support for MTE to HCR_EL2 and SCR_EL3 This does not attempt to rectify all of the res0 bits, but does clear the mte bits when not enabled. Since there is no high-part mapping of SCTLR, aa32 mode cannot write to these bits. Backports commits f00faf130d5dcf64b04f71a95f14745845ca1014, and 8ddb300bf60a5f3d358dd6fbf81174f6c03c1d9f from qemu.	2021-02-25 13:59:11 -05:00
Richard Henderson	d81feac642	target/arm: Improve masking of SCR RES0 bits Protect reads of aa64 id registers with ARM_CP_STATE_AA64. Use this as a simpler test than arm_el_is_aa64, since EL3 cannot change mode. Backports commit 252e8c69669599b4bcff802df300726300292f47 from qemu	2021-02-25 13:56:35 -05:00
Richard Henderson	1a35600453	target/arm: Add isar tests for mte Backports commit c7fd0baac0c24defec66263799faa8618327b352 from qemu	2021-02-25 13:55:52 -05:00
Joseph Myers	c01b7432a1	target/i386: reimplement fpatan using floatx80 operations The x87 fpatan emulation is currently based around conversion to double. This is inherently unsuitable for a good emulation of any floatx80 operation. Reimplement using the soft-float operations, as for other such instructions. Backports commit ff57bb7b63267dabd60f88354c8c29ea5e1eb3ec from qemu	2021-02-25 13:48:32 -05:00
Joseph Myers	ddb2f1d4dd	target/i386: reimplement fyl2x using floatx80 operations The x87 fyl2x emulation is currently based around conversion to double. This is inherently unsuitable for a good emulation of any floatx80 operation. Reimplement using the soft-float operations, building on top of the reimplementation of fyl2xp1 and factoring out code to be shared between the two instructions. The included test assumes that the result in round-to-nearest mode should always be one of the two closest floating-point numbers to the mathematically exact result (including that it should be exact, in the exact cases which cover more cases than for fyl2xp1). Backports commit 1f18a1e6ab8368a4eab2d22894d3b2ae75250cd3 from qemu	2021-02-25 13:46:29 -05:00
Joseph Myers	ac2f3fa0f2	target/i386: reimplement fyl2xp1 using floatx80 operations The x87 fyl2xp1 emulation is currently based around conversion to double. This is inherently unsuitable for a good emulation of any floatx80 operation, even before considering that it is a particularly naive implementation using double (adding 1 then using log rather than attempting a better emulation using log1p). Reimplement using the soft-float operations, as was done for f2xm1; as in that case, m68k has related operations but not exactly this one and it seemed safest to implement directly rather than reusing the m68k code to avoid accumulation of errors. A test is included with many randomly generated inputs. The assumption of the test is that the result in round-to-nearest mode should always be one of the two closest floating-point numbers to the mathematical value of y * log2(x + 1); the implementation aims to do somewhat better than that (about 70 correct bits before rounding). I haven't investigated how accurate hardware is. Intel manuals describe a narrower range of valid arguments to this instruction than AMD manuals. The implementation accepts the wider range (it's needed anyway for the core code to be reusable in a subsequent patch reimplementing fyl2x), but the test only has inputs in the narrower range so that it's valid on hardware that may reject or produce poor results for inputs outside that range. Code in the previous implementation that sets C2 for some out-of-range arguments is not carried forward to the new implementation; C2 is undefined for this instruction and I suspect that code was just cut-and-pasted from the trigonometric instructions (fcos, fptan, fsin, fsincos) where C2 is defined to be set for out-of-range arguments. Backports commit 5eebc49d2d0aa5fc7e90eeac97533051bb7b72fa from qemu	2021-02-25 13:43:46 -05:00
Joseph Myers	0a790f9937	target/i386: reimplement fprem, fprem1 using floatx80 operations The x87 fprem and fprem1 emulation is currently based around conversion to double, which is inherently unsuitable for a good emulation of any floatx80 operation. Reimplement using the soft-float floatx80 remainder operations. Backports commit 5ef396e2ba865f34a4766dbd60c739fb4bcb4fcc from qemu	2021-02-25 13:41:54 -05:00
Joseph Myers	8d0bf2d6e1	softfloat: return low bits of quotient from floatx80_modrem Both x87 and m68k need the low parts of the quotient for their remainder operations. Arrange for floatx80_modrem to track those bits and return them via a pointer. The architectures using float32_rem and float64_rem do not appear to need this information, so the *_rem interface is left unchanged and the information returned only from floatx80_modrem. The logic used to determine the low 7 bits of the quotient for m68k (target/m68k/fpu_helper.c:make_quotient) appears completely bogus (it looks at the result of converting the remainder to integer, the quotient having been discarded by that point); this patch does not change that, but the m68k maintainers may wish to do so. Backports commit 445810ec915687d37b8ae0ef8d7340ab4a153efa from qemu	2021-02-25 13:39:10 -05:00
Joseph Myers	e4cfbc1f06	softfloat: do not set denominator high bit for floatx80 remainder The floatx80 remainder implementation unnecessarily sets the high bit of bSig explicitly. By that point in the function, arguments that are invalid, zero, infinity or NaN have already been handled and subnormals have been through normalizeFloatx80Subnormal, so the high bit will already be set. Remove the unnecessary code. Backports commit 566601f1f9d972e44214696d3cb320e6c18880aa from qemu	2021-02-25 13:37:13 -05:00
Joseph Myers	2d50384633	softfloat: do not return pseudo-denormal from floatx80 remainder The floatx80 remainder implementation sometimes returns the numerator unchanged when the denominator is sufficiently larger than the numerator. But if the value to be returned unchanged is a pseudo-denormal, that is incorrect. Fix it to normalize the numerator in that case. Backports commit b662495dca0a2a36008cf8def91e2566519ed3f2 from qemu	2021-02-25 13:36:42 -05:00
Joseph Myers	6b63555a00	softfloat: fix floatx80 remainder pseudo-denormal check for zero The floatx80 remainder implementation ignores the high bit of the significand when checking whether an operand (numerator) with zero exponent is zero. This means it mishandles a pseudo-denormal representation of 0x1p-16382L by treating it as zero. Fix this by checking the whole significand instead. Backports commit 499a2f7b554a295cfc10f8cd026d9b20a38fe664 from qemu	2021-02-25 13:35:17 -05:00
Joseph Myers	b08d204a37	softfloat: merge floatx80_mod and floatx80_rem The m68k-specific softfloat code includes a function floatx80_mod that is extremely similar to floatx80_rem, but computing the remainder based on truncating the quotient toward zero rather than rounding it to nearest integer. This is also useful for emulating the x87 fprem and fprem1 instructions. Change the floatx80_rem implementation into floatx80_modrem that can perform either operation, with both floatx80_rem and floatx80_mod as thin wrappers available for all targets. There does not appear to be any use for the _mod operation for other floating-point formats in QEMU (the only other architectures using _rem at all are linux-user/arm/nwfpe, for FPA emulation, and openrisc, for instructions that have been removed in the latest version of the architecture), so no change is made to the code for other formats. Backports commit 6b8b0136ab3018e4b552b485f808bf66bcf19ead from qemu	2021-02-25 13:34:05 -05:00
Joseph Myers	2aee4714ab	target/i386: reimplement f2xm1 using floatx80 operations The x87 f2xm1 emulation is currently based around conversion to double. This is inherently unsuitable for a good emulation of any floatx80 operation, even before considering that it is a particularly naive implementation using double (computing with pow and then subtracting 1 rather than attempting a better emulation using expm1). Reimplement using the soft-float operations, including additions and multiplications with higher precision where appropriate to limit accumulation of errors. I considered reusing some of the m68k code for transcendental operations, but the instructions don't generally correspond exactly to x87 operations (for example, m68k has 2^x and e^x - 1, but not 2^x - 1); to avoid possible accumulation of errors from applying multiple such operations each rounding to floatx80 precision, I wrote a direct implementation of 2^x - 1 instead. It would be possible in principle to make the implementation more efficient by doing the intermediate operations directly with significands, signs and exponents and not packing / unpacking floatx80 format for each operation, but that would make it significantly more complicated and it's not clear that's worthwhile; the m68k emulation doesn't try to do that. A test is included with many randomly generated inputs. The assumption of the test is that the result in round-to-nearest mode should always be one of the two closest floating-point numbers to the mathematical value of 2^x - 1; the implementation aims to do somewhat better than that (about 70 correct bits before rounding). I haven't investigated how accurate hardware is. Backports commit eca30647fc078f4d9ed1b455bd67960f99dbeb7a from qemu	2021-02-25 13:31:13 -05:00
Peter Maydell	4a1996502f	target/arm: Remove dead code relating to SABA and UABA In commit cfdb2c0c95ae9205b0 ("target/arm: Vectorize SABA/UABA") we replaced the old handling of SABA/UABA with a vectorized implementation which returns early rather than falling into the loop-ever-elements code. We forgot to delete the part of the old looping code that did the accumulate step, and Coverity correctly warns (CID 1428955) that this code is now dead. Delete it. Fixes: cfdb2c0c95ae9205b0 Backports commit ced7e8edb282765685d2ba0206a11f8692d8ec1c from qemu	2021-02-25 13:18:51 -05:00
Peter Maydell	167ed57625	target/arm: Remove unnecessary gen_io_end() calls Since commit ba3e7926691ed3 it has been unnecessary for target code to call gen_io_end() after an IO instruction in icount mode; it is sufficient to call gen_io_start() before it and to force the end of the TB. Many now-unnecessary calls to gen_io_end() were removed in commit 9e9b10c6491153b, but some were missed or accidentally added later. Remove unneeded calls from the arm target: * the call in the handling of exception-return-via-LDM is unnecessary, and the code is already forcing end-of-TB * the call in the VFP access check code is more complicated: we weren't ending the TB, so we need to add the code to force that by setting DISAS_UPDATE * the doc comment for ARM_CP_IO doesn't need to mention gen_io_end() any more Backports commit 55c812b74289863c348449135812027d188f040a from qemu	2021-02-25 13:17:32 -05:00
Peter Maydell	083d207fb0	target/arm: Move some functions used only in translate-neon.inc.c to that file The functions neon_element_offset(), neon_load_element(), neon_load_element64(), neon_store_element() and neon_store_element64() are used only in the translate-neon.inc.c file, so move their definitions there. Since the .inc.c file is #included in translate.c this doesn't make much difference currently, but it's a more logical place to put the functions and it might be helpful if we ever decide to try to make the .inc.c files genuinely separate compilation units. Backports commit 6fb5787898aab6aa04887fed9cf3220dd4c3f36a from qemu	2021-02-25 13:15:23 -05:00
Peter Maydell	0b06317dc4	target/arm: Convert Neon VTRN to decodetree Convert the Neon VTRN insn to decodetree. This is the last insn in the Neon data-processing group, so we can remove all the now-unused old decoder framework. It's possible that there's a more efficient implementation of VTRN, but for this conversion we just copy the existing approach. Backports commit d4366190f84fe89cc5d46da995dac1e7d541b98e from qemu	2021-02-25 13:12:28 -05:00
Peter Maydell	b7584069dd	target/arm: Convert Neon VSWP to decodetree Convert the Neon VSWP insn to decodetree. Since the new implementation doesn't have to share a pass-loop with the other 2-reg-misc operations we can implement the swap with 64-bit accesses rather than 32-bits (which brings us into line with the pseudocode and is more efficient). Backports commit 8ab3a227a0f13f0ff85846f36f7c466769aef4fc from qemu	2021-02-25 13:07:56 -05:00
Peter Maydell	73abdfea53	target/arm: Convert Neon 2-reg-misc VCVT insns to decodetree Convert the VCVT instructions in the 2-reg-misc grouping to decodetree. Backports commit a183d5fb38b07bab2a840196186c4806f3c67c0d from qemu	2021-02-25 13:07:15 -05:00
Peter Maydell	7e705fdc8c	target/arm: Convert Neon 2-reg-misc VRINT insns to decodetree Convert the Neon 2-reg-misc VRINT insns to decodetree. Giving these insns their own do_vrint() function allows us to change the rounding mode just once at the start and end rather than doing it for every element in the vector. Backports commit 128123ea34e9e6afe4842aefcb9cf84b9642ac22 from qemu	2021-02-25 13:02:24 -05:00
Peter Maydell	3eddb77327	target/arm: Convert Neon 2-reg-misc fp-compare-with-zero insns to decodetree Convert the fp-compare-with-zero insns in the Neon 2-reg-misc group to decodetree. Backports commit baa59323e841f76523f6ad4d746cdeb47ea574cd from qemu	2021-02-25 12:59:22 -05:00
Peter Maydell	6eb852ec1c	target/arm: Convert simple fp Neon 2-reg-misc insns Convert the Neon 2-reg-misc insns which are implemented with simple calls to functions that take the input, output and fpstatus pointer. Backports commit 3e96b205286dfb8bbf363229709e4f8648fce379 from qemu	2021-02-25 12:56:28 -05:00
Peter Maydell	3dcee11013	target/arm: Convert Neon VQABS, VQNEG to decodetree Convert the Neon VQABS and VQNEG insns to decodetree. Since these are the only ones which need cpu_env passing to the helper, we wrap the helper rather than creating a whole new do_2misc_env() function. Backports commit 4936f38abe6db0a9d23fd04e4cb0cf4d51cff174 from qemu	2021-02-25 12:53:18 -05:00
Peter Maydell	4033a3ca5c	target/arm: Convert remaining simple 2-reg-misc Neon ops Convert the remaining ops in the Neon 2-reg-misc group which can be implemented simply with our do_2misc() helper. Backports commit 84eae770af69c37a92496a4c4248875c070d5ee3 from qemu	2021-02-25 12:50:55 -05:00
Peter Maydell	88f8111500	target/arm: Convert Neon 2-reg-misc VREV32 and VREV16 to decodetree Convert the VREV32 and VREV16 insns in the Neon 2-reg-misc group to decodetree. Backports commit 8966808205b59d6c196b380b638475bcd1657ef4 from qemu	2021-02-25 12:49:16 -05:00
Peter Maydell	db1e503708	target/arm: Make gen_swap_half() take separate src and dest Make gen_swap_half() take a source and destination TCGv_i32 rather than modifying the input TCGv_i32; we're going to want to be able to use it with the more flexible function signature, and this also brings it into line with other functions like gen_rev16() and gen_revsh(). Backports commit 8ec3de7018a8198624aae49eef5568256114a829 from qemu	2021-02-25 12:40:23 -05:00
Peter Maydell	3c1289c594	target/arm: Fix capitalization in NeonGenTwo{Single, Double}OPFn typedefs All the other typedefs like these spell "Op" with a lowercase 'p'; remane the NeonGenTwoSingleOPFn and NeonGenTwoDoubleOPFn typedefs to match. Backports commit 5de3fd045be11b74cd0fbf36c6d4fb8387d5463b from qemu	2021-02-25 12:38:30 -05:00
Peter Maydell	fa6727ebba	target/arm: Rename NeonGenOneOpFn to NeonGenOne64OpFn The NeonGenOneOpFn typedef breaks with the pattern of the other NeonGen*Fn typedefs, because it is a TCGv_i64 -> TCGv_i64 operation but it does not have '64' in its name. Rename it to NeonGenOne64OpFn, so that the old name is available for a TCGv_i32 -> TCGv_i32 operation (which we will need in a subsequent commit). Backports commit 039f4e809ad2772fb33de4511ff68a485d875618 from qemu	2021-02-25 12:34:51 -05:00
Peter Maydell	27e74962e5	target/arm: Convert Neon 2-reg-misc crypto operations to decodetree Convert the Neon-2-reg misc crypto ops (AESE, AESMC, SHA1H, SHA1SU1) to decodetree. Backports commit 0b30dd5b85e20aba259768cb7aaa952b3e319468 from qemu	2021-02-25 12:32:39 -05:00
Peter Maydell	4354448f57	target/arm: Convert vectorised 2-reg-misc Neon ops to decodetree Convert to decodetree the insns in the Neon 2-reg-misc grouping which we implement using gvec. Backports commit 75153179e9928775d5333243ea4b278f438d75ae from qemu	2021-02-25 12:28:31 -05:00
Peter Maydell	6301f9acaa	target/arm: Convert Neon VCVT f16/f32 insns to decodetree Convert the Neon insns in the 2-reg-misc group which are VCVT between f32 and f16 to decodetree. Backports commit 654a517355e249435505ae5ff14a7520410cf7a4 from qemu	2021-02-25 12:25:32 -05:00
Peter Maydell	4ca33c54a2	target/arm: Convert Neon 2-reg-misc VSHLL to decodetree Convert the VSHLL insn in the 2-reg-misc Neon group to decodetree. Backports commit 749e2be36d75f11d5fa8f8277e2a0569bd2a1c97 from qemu	2021-02-25 12:20:57 -05:00
Peter Maydell	48d57d0dc7	target/arm: Convert Neon narrowing moves to decodetree Convert the Neon narrowing moves VMQNV, VQMOVN, VQMOVUN in the 2-reg-misc group to decodetree. Backports commit 3882bdacb0ad548864b9f2582a32bb5c785e3165 from qemu	2021-02-25 12:18:01 -05:00
Peter Maydell	35d8a3e83f	target/arm: Convert VZIP, VUZP to decodetree Convert the Neon VZIP and VUZP insns in the 2-reg-misc group to decodetree. Backports commit 567663a2af2457da8aa74f221b1f3f8a6d2eddf6 from qemu	2021-02-25 12:14:29 -05:00
Peter Maydell	d21fae82ba	target/arm: Convert Neon 2-reg-misc pairwise ops to decodetree Convert the pairwise ops VPADDL and VPADAL in the 2-reg-misc grouping to decodetree. At this point we can get rid of the weird CPU_V001 #define that was used to avoid having to explicitly list all the arguments being passed to some TCG gen/helper functions. Backports commit 6106af3aa2304fccee91a3a90138352b0c2af998 from qemu	2021-02-25 12:12:11 -05:00
Peter Maydell	505923e676	target/arm: Convert Neon 2-reg-misc VREV64 to decodetree Convert the Neon VREV64 insn from the 2-reg-misc grouping to decodetree. Backports commit 353d2b85058711a5e44c2dc63eb5b620db50a602 from qemu	2021-02-25 12:07:06 -05:00
Alistair Francis	e1f49dc888	target/riscv: Implement checks for hfence Call the helper_hyp_tlb_flush() function on hfence instructions which will generate an illegal insruction execption if we don't have permission to flush the Hypervisor level TLBs. Backports commit 2761db5fc20943bbd606b6fd49640ac000398de6 from qemu	2021-02-25 12:03:57 -05:00
Alistair Francis	8eb8bc290f	target/riscv: Move the hfence instructions to the rvh decode Also correct the name of the VVMA instruction. Backports commit b8429ded723ec52568e05f6a24ed78c93224687c from qemu	2021-02-25 11:59:49 -05:00
Alistair Francis	39ff690eff	target/riscv: Report errors validating 2nd-stage PTEs Backports commit 88914473e748db20d8e18b9735f647a683319fa6 from qemu	2021-02-25 11:55:53 -05:00
Alistair Francis	a6c323c912	target/riscv: Set access as data_load when validating stage-2 PTEs Backports commit efe9f9c820d1322729957a60ff785c9527a79ddf from qemu	2021-02-25 11:54:31 -05:00
Ian Jiang	5c3a2f391c	riscv: Add helper to make NaN-boxing for FP register The function that makes NaN-boxing when a 32-bit value is assigned to a 64-bit FP register is split out to a helper gen_nanbox_fpr(). Then it is applied in translating of the FLW instruction. Backports commit 354908cee1f7ff761b5fedbdb6376c378c10f941 from qemu	2021-02-25 11:53:27 -05:00
MerryMage	92243aefd4	arm/translate: Do not tracecode when in an IT block	2021-02-07 19:14:32 +00:00
Sunho Kim	d56c79776e	unicorn: fix uc_emu_start until if end instruction is in another tlb	2020-08-05 03:18:51 +09:00
MerryMage	9ac17104b8	arm: Add missing file vec_internal.h Missing from commit `1df7314dc3`. Ported from qemu a04b68e1d4c4f0cd5cd7542697b1b230b84532f5.	2020-06-20 00:12:09 +01:00
Philippe Mathieu-Daudé	4465ff9c93	fpu/softfloat: Silence 'bitwise negation of boolean expression' warning When building with clang version 10.0.0-4ubuntu1, we get: CC lm32-softmmu/fpu/softfloat.o fpu/softfloat.c:3365:13: error: bitwise negation of a boolean expression; did you mean logical negation? [-Werror,-Wbool-operation] absZ &= ~ ( ( ( roundBits ^ 0x40 ) == 0 ) & roundNearestEven ); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ fpu/softfloat.c:3423:18: error: bitwise negation of a boolean expression; did you mean logical negation? [-Werror,-Wbool-operation] absZ0 &= ~ ( ( (uint64_t) ( absZ1<<1 ) == 0 ) & roundNearestEven ); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ... fpu/softfloat.c:4273:18: error: bitwise negation of a boolean expression; did you mean logical negation? [-Werror,-Wbool-operation] zSig1 &= ~ ( ( zSig2 + zSig2 == 0 ) & roundNearestEven ); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Fix by rewriting the fishy bitwise AND of two bools as an int. Backports commit 4066288694c3bdd175df813cad675a3b5191956b from qemu	2020-06-18 23:56:27 -04:00
Peter Maydell	709610e606	target/arm: Convert Neon VDUP (scalar) to decodetree Convert the Neon VDUP (scalar) insn to decodetree. (Note that we can't call this just "VDUP" as we used that already in vfp.decode for the "VDUP (general purpose register" insn.) Backports commit 9aaa23c2ae18e6fb9a291b81baf91341db76dfa0 from qemu	2020-06-17 00:43:19 -04:00
Peter Maydell	8de8a4500a	target/arm: Convert Neon VTBL, VTBX to decodetree Convert the Neon VTBL, VTBX instructions to decodetree. The actual implementation of the insn is copied across to the new trans function unchanged except for renaming 'tmp5' to 'tmp4'. Backports commit 54e96c744b70a5d19f14b212a579dd3be8fcaad9 from qemu	2020-06-17 00:39:27 -04:00
Peter Maydell	4731a69d66	target/arm: Convert Neon VEXT to decodetree Convert the Neon VEXT insn to decodetree. Rather than keeping the old implementation which used fixed temporaries cpu_V0 and cpu_V1 and did the extraction with by-hand shift and logic ops, we use the TCG extract2 insn. We don't need to special case 0 or 8 immediates any more as the optimizer is smart enough to throw away the dead code. Backports commit 0aad761fb0aed40c99039eacac470cbd03d07019 from qemu	2020-06-17 00:29:04 -04:00
Peter Maydell	1aa9046120	target/arm: Convert Neon 2-reg-scalar long multiplies to decodetree Convert the Neon 2-reg-scalar long multiplies to decodetree. These are the last instructions in the group. Backports commit 77e576a9281825fc170f3b3af83f47e110549b5c from qemu	2020-06-17 00:24:12 -04:00
Peter Maydell	088a1e8ba9	target/arm: Convert Neon 2-reg-scalar VQRDMLAH, VQRDMLSH to decodetree Convert the VQRDMLAH and VQRDMLSH insns in the 2-reg-scalar group to decodetree. Backports commit aa318f5b9b4ab3b6744b5305dd8ae9b96676f20e from qemu	2020-06-17 00:15:18 -04:00
Peter Maydell	c0551804d4	target/arm: Convert Neon 2-reg-scalar VQDMULH, VQRDMULH to decodetree Convert the VQDMULH and VQRDMULH insns in the 2-reg-scalar group to decodetree. Backports commit b2fc7be972b94872f6a6dd32d9bda1b88ddbcaad from qemu	2020-06-17 00:11:56 -04:00
Peter Maydell	2e8ae1130e	target/arm: Convert Neon 2-reg-scalar float multiplies to decodetree Convert the float versions of VMLA, VMLS and VMUL in the Neon 2-reg-scalar group to decodetree. Backports commit 85ac9aef9a5418de3168df569e21258e853840a2 from qemu	2020-06-17 00:09:32 -04:00
Peter Maydell	bf1b0374b9	target/arm: Convert Neon 2-reg-scalar integer multiplies to decodetree Convert the VMLA, VMLS and VMUL insns in the Neon "2 registers and a scalar" group to decodetree. These are 32x32->32 operations where one of the inputs is the scalar, followed by a possible accumulate operation of the 32-bit result. The refactoring removes some of the oddities of the old decoder: * operands to the operation and accumulation were often reversed (taking advantage of the fact that most of these ops are commutative); the new code follows the pseudocode order * the Q bit in the insn was in a local variable 'u'; in the new code it is decoded into a->q Backports commit 96fc80f5f186decd1a649f6c04252faceb057ad2 from qemu	2020-06-17 00:04:29 -04:00
Peter Maydell	1817f28afd	target/arm: Add missing TCG temp free in do_2shift_env_64() In commit 37bfce81b10450071 we accidentally introduced a leak of a TCG temporary in do_2shift_env_64(); free it. Backports commit a4f67e180def790ff0bbb33fc93bb6e80382f041 from qemu	2020-06-16 23:57:17 -04:00
Peter Maydell	06dfc2ada6	target/arm: Add 'static' and 'const' annotations to VSHLL function arrays Mark the arrays of function pointers in trans_VSHLL_S_2sh() and trans_VSHLL_U_2sh() as both 'static' and 'const'. Backports commit 448f0e5f3ecfbd089b934e5e3aa0ccd1f51a6174 from qemu	2020-06-16 23:56:30 -04:00
Peter Maydell	6383a2bd15	target/arm: Convert Neon 3-reg-diff polynomial VMULL Convert the Neon 3-reg-diff insn polynomial VMULL. This is the last insn in this group to be converted. Backports commit 18fb58d588898550919392277787979ee7d0d84e from qemu	2020-06-16 23:54:51 -04:00
Peter Maydell	090426b120	target/arm: Convert Neon 3-reg-diff saturating doubling multiplies Convert the Neon 3-reg-diff insns VQDMULL, VQDMLAL and VQDMLSL: these are all saturating doubling long multiplies with a possible accumulate step. These are the last insns in the group which use the pass-over-each elements loop, so we can delete that code. Backports commit 9546ca5998d3cbd98a81b2d46a2e92a11b0f78a4 from qemu	2020-06-16 23:51:56 -04:00
Peter Maydell	5464405d5c	target/arm: Convert Neon 3-reg-diff long multiplies Convert the Neon 3-reg-diff insns VMULL, VMLAL and VMLSL; these perform a 32x32->64 multiply with possible accumulate. Note that for VMLSL we do the accumulate directly with a subtraction rather than doing a negate-then-add as the old code did. Backports commit 3a1d9eb07b767a7592abca642af80906f9eab0ed from qemu	2020-06-16 23:47:28 -04:00
Peter Maydell	21044a1d11	target/arm: Convert Neon 3-reg-diff VABAL, VABDL to decodetree Convert the Neon 3-reg-diff insns VABAL and VABDL to decodetree. Like almost all the remaining insns in this group, these are a combination of a two-input operation which returns a double width result and then a possible accumulation of that double width result into the destination. Backports commit f5b28401200ec95ba89552df3ecdcdc342f6b90b from qemu	2020-06-16 23:41:20 -04:00
Peter Maydell	34418f1998	target/arm: Convert Neon 3-reg-diff narrowing ops to decodetree Convert the narrow-to-high-half insns VADDHN, VSUBHN, VRADDHN, VRSUBHN in the Neon 3-registers-different-lengths group to decodetree. Backports commit 0fa1ab0302badabc3581aefcbb2f189ef52c4985 from qemu	2020-06-16 23:36:18 -04:00
Peter Maydell	d25998ba7d	target/arm: Convert Neon 3-reg-diff prewidening ops to decodetree Convert the "pre-widening" insns VADDL, VSUBL, VADDW and VSUBW in the Neon 3-registers-different-lengths group to decodetree. These insns work by widening one or both inputs to double their size, performing an add or subtract at the doubled size and then storing the double-size result. As usual, rather than copying the loop of the original decoder (which needs awkward code to avoid problems when source and destination registers overlap) we just unroll the two passes. Backports commit b28be09570d0827969b62b8f82b0f720a9915427 from qemu	2020-06-16 23:29:53 -04:00
Peter Maydell	a9d0e36bcf	target/arm: Fix missing temp frees in do_vshll_2sh The widenfn() in do_vshll_2sh() does not free the input 32-bit TCGv, so we need to do this in the calling code. Backports commit 9593a3988c3e788790aa107d778386b09f456a6d from qemu	2020-06-16 23:26:04 -04:00
Thomas Huth	6053203c1c	target/i386: Remove obsolete TODO file The last real change to this file is from 2012, so it is very likely that this file is completely out-of-date and ignored today. Let's simply remove it to avoid confusion if someone finds it by accident. Backports commit 3575b0aea983ad57804c9af739ed8ff7bc168393 from qemu	2020-06-15 13:22:56 -04:00
Joseph Myers	18b0ae9ebd	target/i386: correct fix for pcmpxstrx substring search This corrects a bug introduced in my previous fix for SSE4.2 pcmpestri / pcmpestrm / pcmpistri / pcmpistrm substring search, commit ae35eea7e4a9f21dd147406dfbcd0c4c6aaf2a60. That commit fixed a bug that showed up in four GCC tests with one libc implementation. The tests in question generate random inputs to the intrinsics and compare results to a C implementation, but they only test 1024 possible random inputs, and when the tests use the cases of those instructions that work with word rather than byte inputs, it's easy to have problematic cases that show up much less frequently than that. Thus, testing with a different libc implementation, and so a different random number generator, showed up a problem with the previous patch. When investigating the previous test failures, I found the description of these instructions in the Intel manuals (starting from computing a 16x16 or 8x8 set of comparison results) confusing and hard to match up with the more optimized implementation in QEMU, and referred to AMD manuals which described the instructions in a different way. Those AMD descriptions are very explicit that the whole of the string being searched for must be found in the other operand, not running off the end of that operand; they say "If the prototype and the SUT are equal in length, the two strings must be identical for the comparison to be TRUE.". However, that statement is incorrect. In my previous commit message, I noted: The operation in this case is a search for a string (argument d to the helper) in another string (argument s to the helper); if a copy of d at a particular position would run off the end of s, the resulting output bit should be 0 whether or not the strings match in the region where they overlap, but the QEMU implementation was wrongly comparing only up to the point where s ends and counting it as a match if an initial segment of d matched a terminal segment of s. Here, "run off the end of s" means that some byte of d would overlap some byte outside of s; thus, if d has zero length, it is considered to match everywhere, including after the end of s. The description "some byte of d would overlap some byte outside of s" is accurate only when understood to refer to overlapping some byte within the 16-byte operand but at or after the zero terminator; it is valid to run over the end of s if the end of s is the end of the 16-byte operand. So the fix in the previous patch for the case of d being empty was correct, but the other part of that patch was not correct (as it never allowed partial matches even at the end of the 16-byte operand). Nor was the code before the previous patch correct for the case of d nonempty, as it would always have allowed partial matches at the end of s. Fix with a partial revert of my previous change, combined with inserting a check for the special case of s having maximum length to determine where it is necessary to check for matches. In the added test, test 1 is for the case of empty strings, which failed before my 2017 patch, test 2 is for the bug introduced by my 2017 patch and test 3 deals with the case where a match of an initial segment at the end of the string is not valid when the string ends before the end of the 16-byte operand (that is, the case that would be broken by a simple revert of the non-empty-string part of my 2017 patch). Backports commit bc921b2711c4e2e8ab99a3045f6c0f134a93b535 from qemu	2020-06-15 13:20:48 -04:00
Joseph Myers	e79024e0cf	target/i386: fix IEEE x87 floating-point exception raising Most x87 instruction implementations fail to raise the expected IEEE floating-point exceptions because they do nothing to convert the exception state from the softfloat machinery into the exception flags in the x87 status word. There is special-case handling of division to raise the divide-by-zero exception, but that handling is itself buggy: it raises the exception in inappropriate cases (inf / 0 and nan / 0, which should not raise any exceptions, and 0 / 0, which should raise "invalid" instead). Fix this by converting the floating-point exceptions raised during an operation by the softfloat machinery into exceptions in the x87 status word (passing through the existing fpu_set_exception function for handling related to trapping exceptions). There are special cases where some functions convert to integer internally but exceptions from that conversion are not always correct exceptions for the instruction to raise. There might be scope for some simplification if the softfloat exception state either could always be assumed to be in sync with the state in the status word, or could always be ignored at the start of each instruction and just set to 0 then; I haven't looked into that in detail, and it might run into interactions with the various ways the emulation does not yet handle trapping exceptions properly. I think the approach taken here, of saving the softfloat state, setting exceptions there to 0 and then merging the old exceptions back in after carrying out the operation, is conservatively safe Backports commit 975af797f1e04e4d1b1a12f1731141d3770fdbce from qemu	2020-06-15 13:19:27 -04:00
Joseph Myers	cb50df6aae	target/i386: fix fisttpl, fisttpll handling of out-of-range values The fist / fistt family of instructions should all store the most negative integer in the destination format when the rounded / truncated integer result is out of range or the input is an invalid encoding, infinity or NaN. The fisttpl and fisttpll implementations (32-bit and 64-bit results, truncate towards zero) failed to do this, producing the most positive integer in some cases instead. Fix this by copying the code used to handle this issue for fistpl and fistpll, adjusted to use the _round_to_zero functions for the actual conversion (but without any other changes to that code). Backports commit c8af85b10c818709755f5dc8061c69920611fd4c from qemu	2020-06-15 13:10:23 -04:00
Joseph Myers	ceaa77e576	target/i386: fix fbstp handling of out-of-range values The fbstp implementation fails to check for out-of-range and invalid values, instead just taking the result of conversion to int64_t and storing its sign and low 18 decimal digits. Fix this by checking for an out-of-range result (invalid conversions always result in INT64_MAX or INT64_MIN from the softfloat code, which are large enough to be considered as out-of-range by this code) and storing the packed BCD indefinite encoding in that case. Backports commit 374ff4d0a3c2cce2bc6e4ba8a77eaba55c165252 from qemu	2020-06-15 13:09:23 -04:00
Joseph Myers	477a0af161	target/i386: fix fbstp handling of negative zero The fbstp implementation stores +0 when the rounded result should be -0 because it compares an integer value with 0 to determine the sign. Fix this by checking the sign bit of the operand instead. Backports commit 18c53e1e73197a24f9f4b66b1276eb9868db5bf0 from qemu	2020-06-15 13:08:38 -04:00
Joseph Myers	c796ee5e13	target/i386: fix fxam handling of invalid encodings The fxam implementation does not check for invalid encodings, instead treating them like NaN or normal numbers depending on the exponent. Fix it to check that the high bit of the significand is set before treating an encoding as NaN or normal, thus resulting in correct handling (all of C0, C2 and C3 cleared) for invalid encodings. Backports commit 34b9cc076ff423023a779a04a9f7cd7c17372cbf from qemu	2020-06-15 13:07:54 -04:00
Joseph Myers	5a01ea31eb	target/i386: fix floating-point load-constant rounding The implementations of the fldl2t, fldl2e, fldpi, fldlg2 and fldln2 instructions load fixed constants independent of the rounding mode. Fix them to load a value correctly rounded for the current rounding mode (but always rounded to 64-bit precision independent of the precision control, and without setting "inexact") as specified. Backports commit 80b4008c805ebcfd4c0d302ac31c1689e34571e0 from qemu	2020-06-15 13:07:06 -04:00
Joseph Myers	95368d250b	target/i386: fix fscale handling of rounding precision The fscale implementation uses floatx80_scalbn for the final scaling operation. floatx80_scalbn ends up rounding the result using the dynamic rounding precision configured for the FPU. But only a limited set of x87 floating-point instructions are supposed to respect the dynamic rounding precision, and fscale is not in that set. Fix the implementation to save and restore the rounding precision around the call to floatx80_scalbn. Backports commit c535d68755576bfa33be7aef7bd294a601f776e0 from qemu	2020-06-15 13:05:31 -04:00
Joseph Myers	ad83656acc	target/i386: fix fscale handling of infinite exponents The fscale implementation passes infinite exponents through to generic code that rounds the exponent to a 32-bit integer before using floatx80_scalbn. In round-to-nearest mode, and ignoring exceptions, this works in many cases. But it fails to handle the special cases of scaling 0 by a +Inf exponent or an infinity by a -Inf exponent, which should produce a NaN, and because it produces an inexact result for finite nonzero numbers being scaled, the result is sometimes incorrect in other rounding modes. Add appropriate handling of infinite exponents to produce a NaN or an appropriately signed exact zero or infinity as a result Backports commit c1c5fb8f9067c830e36830c2b82c0ec146c03d7b from qemu	2020-06-15 13:04:46 -04:00
Joseph Myers	bbbf25fdd9	target/i386: fix fscale handling of invalid exponent encodings The fscale implementation does not check for invalid encodings in the exponent operand, thus treating them like INT_MIN (the value returned for invalid encodings by floatx80_to_int32_round_to_zero). Fix it to treat them similarly to signaling NaN exponents, thus generating a quiet NaN result. Backports commit b40eec96b26028b68c3594fbf34b6d6f029df26a from qemu	2020-06-15 13:03:54 -04:00
Joseph Myers	d96c218664	target/i386: fix fscale handling of signaling NaN The implementation of the fscale instruction returns a NaN exponent unchanged. Fix it to return a quiet NaN when the provided exponent is a signaling NaN. Backports commit 0d48b436327955c69e2eb53f88aba9aa1e0dbaa0 from qemu	2020-06-15 13:03:16 -04:00
Joseph Myers	18fc17ca25	target/i386: implement special cases for fxtract The implementation of the fxtract instruction treats all nonzero operands as normal numbers, so yielding incorrect results for invalid formats, infinities, NaNs and subnormal and pseudo-denormal operands. Implement appropriate handling of all those cases. Backports commit c415f2c58296d86e9abb7e4a133111acf7031da3 from qemu	2020-06-15 13:02:33 -04:00
Liran Alon	7373942623	i386/cpu: Store LAPIC bus frequency in CPU structure No functional change. This information will be used by following patches. Backports commit 73b994f6d74ec00a1d78daf4145096ff9f0e2982 from qemu	2020-06-15 13:00:58 -04:00
Janne Grunau	6f41687234	target/i386: fix phadd* with identical destination and source register Detected by asm test suite failures in dav1d (https://code.videolan.org/videolan/dav1d). Can be reproduced by `qemu-x86_64 -cpu core2duo ./tests/checkasm --test=mc_8bpc 1659890620`. Backports commit 2dfbea1a872727fb747ca6adf2390e09956cdc6e from qemu	2020-06-15 12:59:49 -04:00
Philippe Mathieu-Daudé	34930da196	target/i386: Fix OUTL debug output Fix OUTL instructions incorrectly displayed as OUTW. Backports commit ce8540fde2cb535923a52a012f57b418eea85e1b from qemu	2020-06-15 12:56:33 -04:00
Richard Henderson	a93d01c61d	target/arm: Use a non-overlapping group for misc control The miscellaneous control instructions are mutually exclusive within the t32 decode sub-group. Backports commit d6084fba47bb9aef79775c1102d4b647eb58c365 from qemu	2020-06-15 12:52:48 -04:00
Richard Henderson	b45a02e2f7	decodetree: Multi-cleanup Includes multiple changes by Richard Henderson as follows: - Use proper varargs to print the arguments. (2fd51b19c9) - Rename MultiPattern to IncMultiPattern (040145c4f8) - Split out MultiPattern from IncMultiPattern (df63044d02) - Allow group covering the entire insn space (b44b3449a0) - Move semantic propagation into classes (08561fc128) - Implement non-overlapping groups (067e8b0f45) - Drop check for less than 2 patterns in a group (fe079aa13d)	2020-06-15 12:49:02 -04:00
Peter Maydell	7427cca6cc	target/arm: Convert Neon one-register-and-immediate insns to decodetree Convert the insns in the one-register-and-immediate group to decodetree. In the new decode, our asimd_imm_const() function returns a 64-bit value rather than a 32-bit one, which means we don't need to treat cmode=14 op=1 as a special case in the decoder (it is the only encoding where the two halves of the 64-bit value are different). Backports commit 2c35a39eda0b16c2ed85c94cec204bf5efb97812 from qemu	2020-06-15 12:44:54 -04:00
Peter Maydell	93e6d464c8	target/arm: Convert VCVT fixed-point ops to decodetree Convert the VCVT fixed-point conversion operations in the Neon 2-regs-and-shift group to decodetree. Backports commit 3da26f11711caeaa18318b6afa14dfb81d7650ab from qemu	2020-06-15 12:40:59 -04:00
Peter Maydell	a5f903b2a5	target/arm: Convert Neon VSHLL, VMOVL to decodetree Convert the VSHLL and VMOVL insns from the 2-reg-shift group to decodetree. Since the loop always has two passes, we unroll it to avoid the awkward reassignment of one TCGv to another. Backports commit 968bf842742a5ffbb0041cb31089e61a9f7a833d from qemu	2020-06-15 12:35:32 -04:00
Peter Maydell	6fc8fdaa2b	target/arm: Convert Neon narrowing shifts with op==9 to decodetree Convert the remaining Neon narrowing shifts to decodetree: * VQSHRN * VQRSHRN Backports commit b4a3a77bb7a0dff1cc5673fe3be467d9e3635d44 from qemu	2020-06-15 12:31:35 -04:00
Peter Maydell	ef29b91a43	target/arm: Convert Neon narrowing shifts with op==8 to decodetree Convert the Neon narrowing shifts where op==8 to decodetree: * VSHRN * VRSHRN * VQSHRUN * VQRSHRUN backports commit 712182d340e33c2ce86143f25fb2f04ae23d90de from qemu	2020-06-15 12:29:09 -04:00
Peter Maydell	69a3312e3a	target/arm: Convert VQSHLU, VQSHL 2-reg-shift insns to decodetree Convert the VQSHLU and QVSHL 2-reg-shift insns to decodetree. These are the last of the simple shift-by-immediate insns. Backports commit 37bfce81b10450071193c8495a07f182ec652e2a from qemu	2020-06-15 12:21:10 -04:00
Peter Maydell	055c96f985	target/arm: Convert Neon VSHR 2-reg-shift insns to decodetree Convert the VSHR 2-reg-shift insns to decodetree. Note that unlike the legacy decoder, we present the right shift amount to the trans_ function as a positive integer. Backports commit 66432d6b8294e3508218b360acfdf7c244eea993 from qemu	2020-06-15 12:15:29 -04:00
Peter Maydell	bf18bf983d	target/arm: Convert Neon VSHL and VSLI 2-reg-shift insn to decodetree Convert the VSHL and VSLI insns from the Neon 2-registers-and-a-shift group to decodetree. Backports commit d3c8c736f8b4bdd02831076286b1788232f46ced from qemu	2020-06-15 12:07:02 -04:00
Richard Henderson	1d95dd1c89	target/arm: Split helper_crypto_sm3tt Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit 43fa36c96c24349145497adc1b451f9caf74e344 from qemu	2020-06-14 23:24:21 -04:00
Richard Henderson	5ca8caf656	target/arm: Split helper_crypto_sha1_3reg Rather than passing an opcode to a helper, fully decode the operation at translate time. Use clear_tail_16 to zap the balance of the SVE register with the AdvSIMD write. Backports commit afc8b7d32668547308bdd654a63cf5228936e0ba from qemu	2020-06-14 23:18:45 -04:00
Richard Henderson	41c4efdb22	target/arm: Convert sha1 and sha256 to gvec helpers Do not yet convert the helpers to loop over opr_sz, but the descriptor allows the vector tail to be cleared. Which fixes an existing bug vs SVE. Backports commit effa992f153f5e7ab97ab843b565690748c5b402 from qemu	2020-06-14 23:11:28 -04:00
Richard Henderson	2c6c4da80c	target/arm: Convert sha512 and sm3 to gvec helpers Do not yet convert the helpers to loop over opr_sz, but the descriptor allows the vector tail to be cleared. Which fixes an existing bug vs SVE. Backports commit aaffebd6d3135b8aed7e61932af53b004d261579 from qemu	2020-06-14 23:01:49 -04:00
Richard Henderson	894f2168da	target/arm: Convert rax1 to gvec helpers With this conversion, we will be able to use the same helpers with sve. This also fixes a bug in which we failed to clear the high bits of the SVE register after an AdvSIMD operation. Backports commit 1738860d7e60dec5dbeba17f8b44d31aae3accac from qemu	2020-06-14 22:49:36 -04:00
Richard Henderson	1df7314dc3	target/arm: Convert aes and sm4 to gvec helpers With this conversion, we will be able to use the same helpers with sve. In particular, pass 3 vector parameters for the 3-operand operations; for advsimd the destination register is also an input. This also fixes a bug in which we failed to clear the high bits of the SVE register after an AdvSIMD operation. Backports commit a04b68e1d4c4f0cd5cd7542697b1b230b84532f5 from qemu	2020-06-14 22:41:33 -04:00
Alistair Francis	2b2f91f82c	target/riscv: Add the lowRISC Ibex CPU The reset vector is set in the init function don't set it again in realize. Backports commit 36b80ad99f7ea4979a4c5fc6e4072619b405e3b0 from qemu	2020-06-14 22:28:55 -04:00
Alistair Francis	2584ab8ee5	target/riscv: Drop support for ISA spec version 1.09.1 The RISC-V ISA spec version 1.09.1 has been deprecated in QEMU since 4.1. It's not commonly used so let's remove support for it. Backports commit 1a9540d1f1a9c5022d9273d0244e5809679dd33b from qemu	2020-06-14 22:23:26 -04:00
Alistair Francis	e35d56a146	target/riscv: Remove the deprecated CPUs	2020-06-14 22:15:16 -04:00
Richard Henderson	0e68fa345e	tcg: Improve move ops in liveness_pass_2 If the output of the move is dead, then the last use is in the store. If we propagate the input to the store, then we can remove the move opcode entirely. Backports commit 61f15c487fc2aea14f6b0e52c459ae8b7d252a65 from qemu	2020-06-14 22:13:04 -04:00
Richard Henderson	6b91e9bae1	tcg/i386: Implement INDEX_op_rotl{i,s,v}_vec For immediates, we must continue the special casing of 8-bit elements. The other element sizes and shift types are trivially implemented with shifts. Backports commit 885b1706df6f0211a22e120fac910fb3abf3e733 from qemu	2020-06-14 22:09:24 -04:00
Richard Henderson	cc3187b1e4	tcg: Implement gvec support for rotate by scalar No host backend support yet, but the interfaces for rotls are in place. Only implement left-rotate for now, as the only known use of vector rotate by scalar is s390x, so any right-rotate would be unused and untestable. Backports commit 23850a74afb641102325b4b7f74071d929fc4594 from qemu	2020-06-14 22:00:50 -04:00
Richard Henderson	2aa9d13120	tcg: Remove expansion to shift by vector from do_shifts We do not reflect this expansion in tcg_can_emit_vecop_list, so it is unused and unusable. However, we actually perform the same expansion in do_gvec_shifts, so it is also unneeded. Backports commit 3d5bb2ea5cc9ed54f65a6929a6e6baa01cabd98b from qemu	2020-06-14 21:53:36 -04:00
Richard Henderson	be78062fd8	tcg: Implement gvec support for rotate by vector No host backend support yet, but the interfaces for rotlv and rotrv are in place. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> --- v3: Drop the generic expansion from rot to shift; we can do better for each backend, and then this code becomes unused. Backports commit 5d0ceda902915e3f0e21c39d142c92c4e97c3ebb from qemu	2020-06-14 21:43:46 -04:00
Richard Henderson	5cce52a04b	tcg: Implement gvec support for rotate by immediate No host backend support yet, but the interfaces for rotli are in place. Canonicalize immediate rotate to the left, based on a survey of architectures, but provide both left and right shift interfaces to the translators. Backports commit b0f7e7444c03da17e41bf327c8aea590104a28ab from qemu	2020-06-14 21:26:58 -04:00
Laurent Vivier	50aa85e560	target/m68k: implement opcode fetoxm1 Example provided in the launchpad bug fails with: qemu: uncaught target signal 4 (Illegal instruction) - core dumped Illegal instruction (core dumped) It appears fetoxm1 is not implemented: IN: expm1f 0x800005cc: fetoxm1x %fp2,%fp0 Disassembler disagrees with translator over instruction decoding Please report this to qemu-devel@nongnu.org (gdb) x/2hx 0x800005cc 0x800005cc: 0xf200 0x0808 This patch adds the instruction. Backports commit 250b1da35d579f42319af234f36207902ca4baa4 from qemu	2020-06-14 21:13:29 -04:00
Laurent Vivier	aa69ab54ad	target/m68k: implement fmove.l #<data>,FPCR The immediate value mode was ignored and instruction execution ends to an invalid access mode. This was found running 'R' that set FPSR to 0 at startup with a 'fmove.l #0,FPSR' in qemu-system-m68k emulation and triggers a kernel crash: [ 56.640000] * ADDRESS ERROR * FORMAT=2 [ 56.640000] Current process id is 728 [ 56.640000] BAD KERNEL TRAP: 00000000 [ 56.640000] Modules linked in: sg evdev mac_hid ip_tables x_tables sha1_generic hmac ipv6 nf_defrag_ipv6 autofs4 ext4 crc16 mbcache jbd2 crc32c_generic sd_mod t10_pi crc_t10dif crct10dif_generic crct10dif_common sr_mod cdrom mac_esp macsonic esp_scsi [ 56.640000] PC: [<00016a2c>] X_UNSUPP+0x2c/0x3c [ 56.640000] SR: 2004 SP: 3eb5e68c a2: c02e239a [ 56.640000] d0: 00000040 d1: 00000002 d2: 8002adec d3: 8002ad50 [ 56.640000] d4: 8002c768 d5: 0000000d a0: ffffffc2 a1: ffffffc1 [ 56.640000] Process R (pid: 728, task=a3dfda5d) [ 56.640000] Frame format=2 instr addr=00000000 [ 56.650000] Stack from 3a4d9f30: [ 56.650000] 41000000 00000002 00000002 ffffffc2 ffffffc1 1fff0000 80000000 00000000 [ 56.650000] 3fbf0000 80000000 00000000 00000000 20000000 00000000 7fff0000 ffffffff [ 56.650000] ffffffff 00000000 00050008 00000000 8000067c c02c2000 efffee20 000002d8 [ 56.650000] 00002a28 3a4d9f98 00000002 00000014 fffffffe 8002c768 00000002 00000041 [ 56.650000] 00000002 c041fc58 c0743758 ffffffff 00000000 0008c075 00002b24 00000012 [ 56.650000] 000007d0 00000024 00000002 c05bef04 c05bef04 0000005e 00000077 c28aca70 [ 56.650000] Call Trace: [<00050008>] copy_overflow+0x10/0x28 [ 56.650000] [<00002a28>] buserr+0x20/0x28 [ 56.650000] [<0008c075>] bpf_check+0x57f/0x1cfa [ 56.650000] [<00002b24>] syscall+0x8/0xc [ 56.650000] [<0000c019>] dn_sched_init+0x75/0x88 [ 56.650000] Code: 1017 0200 00f0 0c00 0040 66ff 0000 05ac <f23c> 8800 0000 0000 f23c 9000 0000 0000 222e ff84 082e 0005 ff1c 6600 000a 0281 [ 56.650000] Disabling lock debugging due to kernel taint ... Backports commit 6a0e8bb4956c34328f4624e20bd3a6c2b1d90adc from qemu	2020-06-14 21:11:54 -04:00
Huacai Chen	504946fb79	target/mips: Support variable page size Traditionally, MIPS use 4KB page size, but Loongson prefer 16KB page size in system emulator. So, let's define TARGET_PAGE_BITS_VARY and TARGET_PAGE_BITS_MIN to support variable page size. Backports commit ee3863b9d414f0b4a59a88f2a79b496a99d4f6dd from qemu	2020-06-14 21:09:51 -04:00
Peter Maydell	1c6b0339e6	target/arm: Allow user-mode code to write CPSR.E via MSR Using the MSR instruction to write to CPSR.E is deprecated, but it is required to work from any mode including unprivileged code. We were incorrectly forbidding usermode code from writing it because CPSR_USER did not include the CPSR_E bit. We use CPSR_USER in only three places: * as the mask of what to allow userspace MSR to write to CPSR * when deciding what bits a linux-user signal-return should be able to write from the sigcontext structure * in target_user_copy_regs() when we set up the initial registers for the linux-user process In the first two cases not being able to update CPSR.E is a bug, and in the third case it doesn't matter because CPSR.E is always 0 there. So we can fix both bugs by adding CPSR_E to CPSR_USER. Because the cpsr_write() in restore_sigcontext() is now changing a CPSR bit which is cached in hflags, we need to add an arm_rebuild_hflags() call there; the callsite in target_user_copy_regs() was already rebuilding hflags for other reasons. (The recommended way to change CPSR.E is to use the 'SETEND' instruction, which we do correctly allow from usermode code.) Backports commit 268b1b3dfbb92a9348406f728a33f39e3d8dcd8a from qemu	2020-06-14 21:08:03 -04:00
Richard Henderson	acdd5c6065	target/arm: Use clear_vec_high more effectively Do not explicitly store zero to the NEON high part when we can pass !is_q to clear_vec_high. Backports commit e1f778596ebfa8782276f4dd4651f2b285d734ff from qemu	2020-06-14 21:06:40 -04:00
Richard Henderson	3ac9b9b206	target/arm: Use tcg_gen_gvec_mov for clear_vec_high The 8-byte store for the end a !is_q operation can be merged with the other stores. Use a no-op vector move to trigger the expand_clr portion of tcg_gen_gvec_mov. Backports commit 5c27392dd08bd8534893abf25ef501f1bd8680fe from qemu	2020-06-14 21:00:57 -04:00
Richard Henderson	22004b8106	softfloat: Return bool from all classification predicates This includes _is_any_nan, _is_neg, *_is_inf, etc. Backports commit 150c7a91ce7862bcaf7422f6038dcf0ba4a7eee3 from qemu	2020-05-21 18:23:11 -04:00
Richard Henderson	afd8d05aa2	softfloat: Inline floatx80 compare specializations Replace the floatx80 compare specializations with inline functions that call the standard floatx80_compare{,_quiet} functions. Use bool as the return type. Backports commit c6baf65000f826a713e8d9b5b35e617b0ca9ab5d from qemu	2020-05-21 18:17:53 -04:00
Richard Henderson	57d2419cd3	softfloat: Inline float128 compare specializations Replace the float128 compare specializations with inline functions that call the standard float128_compare{,_quiet} functions. Use bool as the return type. Backports commit b7b1ac684fea49c6bfe1ad8b706aed7b09116d15 from qemu	2020-05-21 18:15:55 -04:00
Richard Henderson	18a46c4d79	softfloat: Inline float64 compare specializations Replace the float64 compare specializations with inline functions that call the standard float64_compare{,_quiet} functions. Use bool as the return type. Backports commit 0673ecdf6cb2b1445a85283db8cbacb251c46516 from qemu	2020-05-21 18:13:44 -04:00
Richard Henderson	a35333741a	softfloat: Inline float32 compare specializations Replace the float32 compare specializations with inline functions that call the standard float32_compare{,_quiet} functions. Use bool as the return type. Backports commit 5da2d2d8e53d80e92a61720ea995c86b33cbf25d from qemu	2020-05-21 18:11:25 -04:00
Richard Henderson	d960523cbd	softfloat: Name compare relation enum Give the previously unnamed enum a typedef name. Use it in the prototypes of compare functions. Use it to hold the results of the compare functions. Backports commit 71bfd65c5fcd72f8af2735905415c7ce4220f6dc from qemu	2020-05-21 18:08:52 -04:00
Richard Henderson	8adc704058	softfloat: Name rounding mode enum Give the previously unnamed enum a typedef name. Use the packed attribute so that we do not affect the layout of the float_status struct. Use it in the prototypes of relevant functions. Adjust switch statements as necessary to avoid compiler warnings. Backports commit 3dede407cc61b64997f0c30f6dbf4df09949abc9 from qemu	2020-05-21 18:02:05 -04:00
Richard Henderson	a5c8178e35	softfloat: Change tininess_before_rounding to bool Slightly tidies the usage within softfloat.c and the representation in float_status. Backports commit a828b373bdabc7e53d1e218e3fc76f85b6674688 from qemu	2020-05-21 17:52:50 -04:00
Richard Henderson	a417227674	softfloat: Replace flag with bool We have had this on the to-do list for quite some time. Backports commit c120391c0090d9c40425c92cdb00f38ea8588ff6 from qemu	2020-05-21 17:48:12 -04:00
Richard Henderson	6530d6342f	softfloat: Use post test for floatN_mul The existing f{32,64}_addsub_post test, which checks for zero inputs, is identical to f{32,64}_mul_fast_test. Which means we can eliminate the fast_test/fast_op hooks in favor of reusing the same post hook. This means we have one fewer test along the fast path for multiply. Backports commit b240c9c497b9880ac0ba29465907d5ebecd48083 from qemu	2020-05-21 17:24:00 -04:00
Joseph Myers	c675454b27	softfloat: fix floatx80 pseudo-denormal round to integer The softfloat function floatx80_round_to_int incorrectly handles the case of a pseudo-denormal where only the high bit of the significand is set, ignoring that bit (treating the number as an exact zero) rather than treating the number as an alternative representation of +/- 2^-16382 (which may round to +/- 1 depending on the rounding mode) as hardware does. Fix this check (simplifying the code in the process). Backports commit 9ecaf5ccec13ff2e8fe1e72f6e0f3367d2169c1c from qemu	2020-05-15 23:59:23 -04:00
Joseph Myers	3d4a7e34e1	softfloat: fix floatx80 pseudo-denormal comparisons The softfloat floatx80 comparisons fail to allow for pseudo-denormals, which should compare equal to corresponding values with biased exponent 1 rather than 0. Add an adjustment for that case when comparing numbers with the same sign. Backports commit be53fa785ab766d2722628403edee75b3e6ab599 from qemu	2020-05-15 23:58:49 -04:00
Joseph Myers	85964d48d2	softfloat: fix floatx80 pseudo-denormal addition / subtraction The softfloat function addFloatx80Sigs, used for addition of values with the same sign and subtraction of values with opposite sign, fails to handle the case where the two values both have biased exponent zero and there is a carry resulting from adding the significands, which can occur if one or both values are pseudo-denormals (biased exponent zero, explicit integer bit 1). Add a check for that case, so making the results match those seen on x86 hardware for pseudo-denormals. Backports commit 41602807766e253ccb6fb761f3ff12767f786e2c from qemu	2020-05-15 23:56:24 -04:00
Joseph Myers	2ea23a5bbd	softfloat: silence sNaN for conversions to/from floatx80 Conversions between IEEE floating-point formats should convert signaling NaNs to quiet NaNs. Most of those in QEMU's softfloat code do so, but those for floatx80 fail to. Fix those conversions to silence signaling NaNs as well. Backports commit 7537c2b4a363237534c96d089a02b0712b49d890 from qemu	2020-05-15 23:54:32 -04:00
Peter Maydell	7b2fb5bc63	target/arm: Convert NEON VFMA, VFMS 3-reg-same insns to decodetree Convert the Neon floating point VFMA and VFMS insn to decodetree. These are the last insns in the 3-reg-same group so we can remove all the support/loop code from the old decoder. Backports commit e95485f85657be21135c17a9226e297c21e73360 from qemu	2020-05-15 23:49:20 -04:00
Peter Maydell	82484db863	target/arm: Convert Neon fp VMAX/VMIN/VMAXNM/VMINNM/VRECPS/VRSQRTS to decodetree Convert the Neon fp VMAX/VMIN/VMAXNM/VMINNM/VRECPS/VRSQRTS 3-reg-same insns to decodetree. (These are all the remaining non-accumulation instructions in this group.) Backports commit d5fdf9e9e1c6f2bbb0a4bcaafd85d344cce9c298 from qemu	2020-05-15 23:44:52 -04:00
Peter Maydell	a593866af6	target/arm: Move 'env' argument of recps_f32 and rsqrts_f32 helpers to usual place The usual location for the env argument in the argument list of a TCG helper is immediately after the return-value argument. recps_f32 and rsqrts_f32 differ in that they put it at the end. Move the env argument to its usual place; this will allow us to more easily use these helper functions with the gvec APIs. Backports commit 26c6f695cfd2a3ccddb4d015a25b56f56aa62928 from qemu	2020-05-15 23:41:37 -04:00
Peter Maydell	05e72483f4	target/arm: Convert Neon 3-reg-same compare insns to decodetree Convert the Neon integer 3-reg-same compare insns VCGE, VCGT, VCEQ, VACGE and VACGT to decodetree. Backports commit 727ff1d63213e6666e511956903b9e97a339ec7e from qemu	2020-05-15 23:37:53 -04:00
Peter Maydell	042df686ca	target/arm: Convert Neon fp VMUL, VMLA, VMLS 3-reg-same insns to decodetree Convert the Neon integer VMUL, VMLA, and VMLS 3-reg-same inssn to decodetree. We don't have a gvec helper for multiply-accumulate, so VMLA and VMLS need a loop function do_3same_fp(). This takes a reads_vd parameter to do_3same_fp() which tells it to load the old value into vd before calling the callback function, in the same way that the do_vfp_3op_sp() and do_vfp_3op_dp() functions in translate-vfp.inc.c work. (The only uses in this patch pass reads_vd == true, but later commits will use reads_vd == false.) This conversion fixes in passing an underdecoding for VMUL Backports commit 8aa71ead912ca0a9c0d29b74e0976f91952f950a from qemu	2020-05-15 23:35:21 -04:00
Peter Maydell	2527e76926	target/arm: Convert Neon VPMIN/VPMAX/VPADD float 3-reg-same insns to decodetree Convert the Neon float VPMIN, VPMAX and VPADD 3-reg-same insns to decodetree. These are the only remaining 'pairwise' operations, so we can delete the pairwise-specific bits of the old decoder's for-each-element loop now. Backports commit ab978335a56e3618212868fdce3a54217c6e71e6 from qemu	2020-05-15 23:31:15 -04:00
Peter Maydell	bb0aa79847	target/arm: Convert Neon VADD, VSUB, VABD 3-reg-same insns to decodetree Convert the Neon VADD, VSUB, VABD 3-reg-same insns to decodetree. We already have gvec helpers for addition and subtraction, but must add one for fabd. Backports commit a26a352bb498662cd0c205cb433a352f86fac7d2 from qemu	2020-05-15 23:26:51 -04:00
Peter Maydell	1df5d57e8a	target/arm: Convert Neon VQDMULH/VQRDMULH 3-reg-same to decodetree Convert the Neon VQDMULH and VQRDMULH 3-reg-same insns to decodetree. These are the last integer operations in the 3-reg-same group. Backports commit 7ecc28bc72b8033cf4e0c6332135ec20d4125dfb from qemu	2020-05-15 23:06:44 -04:00
Peter Maydell	59818edb3c	target/arm: Convert Neon VPADD 3-reg-same insns to decodetree Convert the Neon integer VPADD 3-reg-same insns to decodetree. These are 'pairwise' operations. (Note that VQRDMLAH, which shares the same primary opcode but has U=1, has already been converted.) Backports commit fa22827d4eb078b6c58cd3d19af0b50ed951e832 from qemu	2020-05-15 23:01:25 -04:00
Peter Maydell	1cc6451cb6	target/arm: Convert Neon VPMAX/VPMIN 3-reg-same insns to decodetree Convert the Neon integer VPMAX and VPMIN 3-reg-same insns to decodetree. These are 'pairwise' operations. Backports commit 059c2398a2b1ae86c6722c45e79fb0d0f4d95b1d from qemu	2020-05-15 22:59:10 -04:00
Peter Maydell	f35ae14ab4	target/arm: Convert Neon VQSHL, VRSHL, VQRSHL 3-reg-same insns to decodetree Convert the VQSHL, VRSHL and VQRSHL insns in the 3-reg-same group to decodetree. We have already implemented the size==0b11 case of these insns; this commit handles the remaining sizes Backports commit 6812dfdc6b0286730d6f903ebfbdc4f81b80c29b from qemu	2020-05-15 22:53:27 -04:00
Peter Maydell	5308fb324e	target/arm: Convert Neon VRHADD, VHSUB 3-reg-same insns to decodetree Convert the Neon VRHADD and VHSUB 3-reg-same insns to decodetree. (These are all the other insns in 3-reg-same which were using GEN_NEON_INTEGER_OP() and which are not pairwise or reversed-operands.) Backports commit 8e44d03f4b5590e19a4f7910ca1c327609933dd7 from qemu	2020-05-15 22:50:02 -04:00
Peter Maydell	ec327c7fc8	target/arm: Convert Neon VABA/VABD 3-reg-same to decodetree Convert the Neon VABA and VABD insns in the 3-reg-same group to decodetree. Backports commit 7715098f93ff5205334edf161e5fe156346122b0 from qemu	2020-05-15 22:46:02 -04:00
Peter Maydell	f1028fe4a7	target/arm: Convert Neon VHADD 3-reg-same insns Convert the Neon VHADD insns in the 3-reg-same group to decodetree. Backports commit cb294bca866f1cd776e44e03e5e432942bc676e8 from qemu	2020-05-15 22:43:01 -04:00
Peter Maydell	4098e0b80a	target/arm: Convert Neon 64-bit element 3-reg-same insns Convert the 64-bit element insns in the 3-reg-same group to decodetree. This covers VQSHL, VRSHL and VQRSHL where size==0b11. Backports commit 35d4352fa9e94b35bf17f58181cb16c184b98d56 from qemu	2020-05-15 22:40:48 -04:00
Peter Maydell	e2b703a82c	target/arm: Convert Neon 3-reg-same SHA to decodetree Convert the Neon SHA instructions in the 3-reg-same group to decodetree Backports commit 21290edfc29d8929741c0ed043733c23c69bc3b9 from qemu	2020-05-15 22:34:40 -04:00
Richard Henderson	1740e018f4	target/arm: Convert Neon 3-reg-same VQRDMLAH/VQRDMLSH to decodetree Convert the Neon VQRDMLAH and VQRDMLSH insns in the 3-reg-same group to decodetree. These don't use do_3same() because they want to operate on VFP double registers, whose offsets are different from the neon_reg_offset() calculations do_3same does. Backports commit a063569508af8295cf6271e06700e5b956bb402d from qemu	2020-05-15 22:20:23 -04:00
Richard Henderson	451683ee79	target/arm: Vectorize SABA/UABA Include 64-bit element size in preparation for SVE2. Backports commit cfdb2c0c95ae9205b0dd7f0f5e970cdec50fef20 from qemu	2020-05-15 22:15:14 -04:00
Richard Henderson	98c79f9afc	target/arm: Vectorize SABD/UABD Include 64-bit element size in preparation for SVE2. Backports commit 50c160d44eb059c7fc7f348ae2c3b0cb41437044 from qemu	2020-05-15 22:01:29 -04:00
Richard Henderson	765dbb57f0	target/arm: Clear tail in gvec_fmul_idx_, gvec_fmla_idx_ Must clear the tail for AdvSIMD when SVE is enabled. Fixes: ca40a6e6e39 Backports commit 525d9b6d42844e187211d25b69be8b378785bc24 from qemu	2020-05-15 21:50:30 -04:00
Richard Henderson	73d08253a2	target/arm: Pass pointer to qc to qrdmla/qrdmls Pass a pointer directly to env->vfp.qc[0], rather than env. This will allow SVE2, which does not modify QC, to pass a pointer to dummy storage. Change the return type of inl_qrdml.h_s16 to match the sense of the operation: signed. Backports commit e286bf4a72fe3a60490b8d6e3f28d6335677e08c from qemu	2020-05-15 21:48:35 -04:00
Richard Henderson	3c4f226e00	target/arm: Create gen_gvec_{qrdmla,qrdmls} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 146aa66ce58b686b8037d0eb3921c1125942dbde from qemu	2020-05-15 21:43:22 -04:00
Richard Henderson	efdcad70b1	target/arm: Remove fp_status from helper_{recpe, rsqrte}_u32 These operations do not touch fp_status. Backports commit fe6fb4beb2f9bb0afc813e565504b66a92bbf04b from qemu	2020-05-15 21:32:03 -04:00
Richard Henderson	9dfc0479ff	target/arm: Create gen_gvec_{uqadd, sqadd, uqsub, sqsub} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit c7715b6b51a6f7a5412c5fcb40a4c8586105e597 from qemu	2020-05-15 21:25:06 -04:00
Richard Henderson	4abfe5156d	target/arm: Create gen_gvec_{cmtst,ushl,sshl} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 8161b75357095fef54c76b1a6ed1e54d0e8655e0 from qemu	2020-05-15 21:15:49 -04:00
Richard Henderson	15b2850f4d	target/arm: Swap argument order for VSHL during decode Rather than perform the argument swap during code generation, perform it during decode. This means it doesn't have to be special cased later, and we can share code with aarch64 code generation. Hopefully the decode comment addresses any confusion that might arise in between. Backports commit e9eee5316ffec5f37643de806b2e5577c5c189cf from qemu	2020-05-15 21:07:59 -04:00
Richard Henderson	546db9089c	target/arm: Create gen_gvec_{mla,mls} Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Backports commit 271063206a46062a45fc6bab8dabe45f0b88159d from qemu	2020-05-15 21:06:06 -04:00
Richard Henderson	340f97bf4c	target/arm: Create gen_gvec_{ceq,clt,cle,cgt,cge}0 Provide a functional interface for the vector expansion. This fits better with the existing set of helpers that we provide for other operations. Macro-ize the 5 nearly identical comparisons. Backports commit 69d5e2bf8c3cefedbfa1c1670137e636dbd7faa5 from qemu	2020-05-15 20:57:33 -04:00
Richard Henderson	e08c2b8ece	target/arm: Tidy handle_vec_simd_shri Now that we've converted all cases to gvec, there is quite a bit of dead code at the end of the function. Remove it. Sink the call to gen_gvec_fn2i to the end, loading a function pointer within the switch statement. Backports commit 3f08f0bce841e7857ec98ce7909629d0c335005e from qemu	2020-05-15 20:47:47 -04:00
Richard Henderson	7a1750d691	target/arm: Remove unnecessary range check for VSHL In 1dc8425e551, while converting to gvec, I added an extra range check against the shift count. This was unnecessary because the encoding of the shift count produces 0 to the element size - 1. Backports commit 2f27c5244db300387f15d9ffa5067a204ffd625d from qemu	2020-05-15 20:42:12 -04:00
Richard Henderson	6190be3191	target/arm: Create gen_gvec_{sri,sli} The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 893ab0542aa385a287cbe46d5535c8b9e95ce699 from qemu	2020-05-15 20:39:28 -04:00
Richard Henderson	2609e6f319	target/arm: Create gen_gvec_{u,s}{rshr,rsra} Create vectorized versions of handle_shri_with_rndacc for shift+round and shift+round+accumulate. Add out-of-line helpers in preparation for longer vector lengths from SVE. Backports commit 6ccd48d4ea244c1c46a24dfa50bfb547f11422dd from qemu	2020-05-15 20:28:44 -04:00
Richard Henderson	5d7c46204d	target/arm: Create gen_gvec_[us]sra The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 631e565450c483e0622eec3d8b61d7fa41d16bca from qemu	2020-05-15 20:10:32 -04:00
Richard Henderson	4be4ca57b1	target/arm: Fix tcg_gen_gvec_dup_imm vs DUP (indexed) DUP (indexed) can duplicate 128-bit elements, so using esz unconditionally can assert in tcg_gen_gvec_dup_imm. Fixes: 8711e71f9cbb Backports commit 7e17d50ebd359ee5fa3d65d7fdc0fe0336d60694 from qemu	2020-05-11 17:22:52 -04:00
Lioncash	5c03efd5d6	arm/helper: Amend sign conversion warning	2020-05-11 17:21:25 -04:00
Lioncash	08cc2c6dcc	arm/cpu64: Remove unused variable	2020-05-11 17:18:13 -04:00
Richard Henderson	f93deb0786	target/arm: Use tcg_gen_gvec_5_ptr for sve FMLA/FCMLA Now that we can pass 7 parameters, do not encode register operands within simd_data. Backports commit 08975da9f0bfcfa654628cae71201a351ba5449a from qemu	2020-05-11 17:17:17 -04:00
Thomas Huth	dfe548117e	target/arm: Make set_feature() available for other files Move the common set_feature() and unset_feature() functions from cpu.c and cpu64.c to cpu.h. Backports commit 5fda95041d7237ab35733ceb66e0cb89f6107169 from qemu	2020-05-11 17:02:21 -04:00
Philippe Mathieu-Daudé	cfe94f63f3	target/arm/cpu: Use ARRAY_SIZE() to iterate over ARMCPUInfo[] Since on the aarch64-linux-user build, arm_cpus[] is empty, add the cpu_count variable and only iterate when it is non-zero. Backports commit 92b6a659388ab3735e5fbb17ac486923b681f57f from qemu	2020-05-11 16:59:54 -04:00
Richard Henderson	4016b667f3	accel/tcg: Add block comment for probe_access Backports commit 857129b34190a4c2e782006dc255352a6cd3934b from qemu	2020-05-11 16:42:10 -04:00
Edgar E. Iglesias	91dbd53f77	target/arm: Drop access_el3_aa32ns_aa64any() Calling access_el3_aa32ns() works for AArch32 only cores but it does not handle 32-bit EL2 on top of 64-bit EL3 for mixed 32/64-bit cores. Merge access_el3_aa32ns_aa64any() into access_el3_aa32ns() and only use the latter. Fixes: 68e9c2fe65 ("target-arm: Add VTCR_EL2") Backports commit 93dd1e6140e2652347cfe7208591d4cd32762d08 from qemu	2020-05-11 16:39:40 -04:00
MerryMage	9255fbce96	target/arm: Introduce add_reg_for_lit (fixup) Backports commit 16e0d8234ef9291747332d2c431e46808a060472 from qemu Missed from original backporting commit `a2e60445de`	2020-05-10 12:30:52 +01:00
Richard Henderson	742301a7c1	tcg: Fix integral argument type to tcg_gen_rot[rl]i_i{32,64} For the benefit of compatibility of function pointer types, we have standardized on int32_t and int64_t as the integral argument to tcg expanders. We converted most of them in 474b2e8f0f7, but missed the rotates. Backports commit 07dada0336a83002dfa8673a9220a88e13d9a45c from qemu	2020-05-07 10:41:01 -04:00
Richard Henderson	0bcd0ca93d	tcg: Add load_dest parameter to GVecGen2 We have this same parameter for GVecGen2i, GVecGen3, and GVecGen3i. This will make some SVE2 insns easier to parameterize. Backports commit ac09ae627e9a2c65c8a452b69c3dac33c29d0719 from qemu	2020-05-07 10:35:47 -04:00
Richard Henderson	f02f71f38f	tcg: Improve vector tail clearing Better handling of non-power-of-2 tails as seen with Arm 8-byte vector operations. Backports commit f47db80cc073c0a7a22136c8296b5eca20c0e199 from qemu	2020-05-07 10:24:00 -04:00
Richard Henderson	549b0ec3c5	tcg: Add tcg_gen_gvec_dup_tl For use when a target needs to pass a configure-specific target_ulong value to duplicate. Backports commit 0f039e3ad9131966d9fe509c231b756868b015e2 from qemu	2020-05-07 10:12:09 -04:00
Richard Henderson	e65806c356	tcg: Remove tcg_gen_gvec_dup{8,16,32,64}i These interfaces are now unused. Backports commit 398f21412aeec158338963e3f71c9313bc126a71 form qemu	2020-05-07 10:11:00 -04:00
Richard Henderson	43a72b0540	tcg: Use tcg_gen_gvec_dup_imm in logical simplifications Replace the outgoing interface. Backports commit 03ddb6f315ca6d02dfdba0aecc43aa97c728c428 from qemu	2020-05-07 10:09:53 -04:00
Richard Henderson	b0f6374149	target/arm: Use tcg_gen_gvec_dup_imm In a few cases, we're able to remove some manual replication. Backports commit 8711e71f9cbb692d614e6ecf5d51222372f7b77e from qemu	2020-05-07 10:05:49 -04:00
Richard Henderson	07f622e57d	tcg: Add tcg_gen_gvec_dup_imm Add a version of tcg_gen_dup_* that takes both immediate and a vector element size operand. This will replace the set of tcg_gen_gvec_dup{8,16,32,64}i functions that encode the element size within the function name. Backports commit 44c94677febd15488f9190b11eaa4a08e8ac696b from qemu	2020-05-07 09:55:25 -04:00
Peter Maydell	d350125eab	target/arm: Move gen_ function typedefs to translate.h We're going to want at least some of the NeonGen* typedefs for the refactored 32-bit Neon decoder, so move them all to translate.h since it makes more sense to keep them in one group. Backports commit 9aefc6cf9b73f66062d2f914a0136756e7a28211 from qemu	2020-05-07 09:51:52 -04:00
Peter Maydell	652165d671	target/arm: Convert Neon 3-reg-same VMUL, VMLA, VMLS, VSHL to decodetree Convert the Neon VMUL, VMLA, VMLS and VSHL insns in the 3-reg-same grouping to decodetree. Backports commit 0de34fd48ad4e44bf5caa2330657ebefa93cea7d from qemu	2020-05-07 09:50:44 -04:00
Peter Maydell	17bd8930fc	target/arm: Convert Neon 3-reg-same VQADD/VQSUB to decodetree Convert the Neon VQADD/VQSUB insns in the 3-reg-same grouping to decodetree. Backports commit 7a9497f1cf73667a4744d09673b808c20e067915 from qemu	2020-05-07 09:47:18 -04:00
Peter Maydell	d52b830ce3	target/arm: Convert Neon 3-reg-same comparisons to decodetree Convert the Neon comparison ops in the 3-reg-same grouping to decodetree. Backports commit 02bd0cdb64b3e79419ba3a8746cb86430883b3ae from qemu	2020-05-07 09:45:03 -04:00
Peter Maydell	c6f9fb54fd	target/arm: Convert Neon 3-reg-same VMAX/VMIN to decodetree Convert the Neon 3-reg-same VMAX and VMIN insns to decodetree. Backports commit 36b59310c38d45213bf860affa90618aa5eeca93 from qemu	2020-05-07 09:42:04 -04:00
Peter Maydell	d30f99ca79	target/arm: Convert Neon 3-reg-same logic ops to decodetree Convert the Neon logic ops in the 3-reg-same grouping to decodetree. Note that for the logic ops the 'size' field forms part of their decode and the actual operations are always bitwise. Backports commit 35a548edb6f5043386183b9f6b4139d99d1f130a from qemu	2020-05-07 09:40:10 -04:00
Peter Maydell	eae3ce9899	target/arm: Convert Neon 3-reg-same VADD/VSUB to decodetree Convert the Neon 3-reg-same VADD and VSUB insns to decodetree. Note that we don't need the neon_3r_sizes[op] check here because all size values are OK for VADD and VSUB; we'll add this when we convert the first insn that has size restrictions. For this we need one of the GVecGen*Fn typedefs currently in translate-a64.h; move them all to translate.h as a block so they are visible to the 32-bit decoder. Backports commit a4e143ac5b9185f670d2f17ee9cc1a430047cb65 from qemu	2020-05-07 09:36:28 -04:00
Peter Maydell	c7a31355fc	target/arm: Convert Neon 'load/store single structure' to decodetree Convert the Neon "load/store single structure to one lane" insns to decodetree. As this is the last set of insns in the neon load/store group, we can remove the whole disas_neon_ls_insn() function. Backports commit 123ce4e3daba26b760b472687e1fb1ad82cf1993 from qemu	2020-05-07 09:32:17 -04:00
Peter Maydell	302506f2f6	target/arm: Convert Neon 'load single structure to all lanes' to decodetree Convert the Neon "load single structure to all lanes" insns to decodetree. Backports commit 3698747c48db871d876a398592c5a23d7580ed4a from qemu	2020-05-07 09:29:03 -04:00
Peter Maydell	7aad825fa6	target/arm: Convert Neon load/store multiple structures to decodetree Convert the Neon "load/store multiple structures" insns to decodetree. Backports commit a27b46304352a0eced45e560e96515dbe3cc174f from qemu	2020-05-07 09:25:51 -04:00
Peter Maydell	9814c1722f	target/arm: Convert VFM[AS]L (scalar) to decodetree Convert the VFM[AS]L (scalar) insns in the 2reg-scalar-ext group to decodetree. These are the last ones in the group so we can remove all the legacy decode for the group. Note that in disas_thumb2_insn() the parts of this encoding space where the decodetree decoder returns false will correctly be directed to illegal_op by the "(insn & (1 << 28))" check so they won't fall into disas_coproc_insn() by mistake. Backports commit d27e82f7d02f35e5919bd9cbbcb157f3537069a0 from qemu	2020-05-07 09:20:35 -04:00
Peter Maydell	49cdb7e2db	target/arm: Convert V[US]DOT (scalar) to decodetree Convert the V[US]DOT (scalar) insns in the 2reg-scalar-ext group to decodetree. Backports commit 35f5d4d1747558c6af2d914bcd848dcc30c3b531 from qemu	2020-05-07 09:17:32 -04:00
Peter Maydell	73dbfbe4d7	target/arm: Convert VCMLA (scalar) to decodetree Convert VCMLA (scalar) in the 2reg-scalar-ext group to decodetree. Backports commit 7e1b5d615361bb0038cda0e08af41e350e42d081 from qemu	2020-05-07 09:15:30 -04:00
Peter Maydell	1ab06d3eb5	target/arm: Convert VFM[AS]L (vector) to decodetree Convert the VFM[AS]L (vector) insns to decodetree. This is the last insn in the legacy decoder for the 3same_ext group, so we can delete the legacy decoder function for the group entirely. Note that in disas_thumb2_insn() the parts of this encoding space where the decodetree decoder returns false will correctly be directed to illegal_op by the "(insn & (1 << 28))" check so they won't fall into disas_coproc_insn() by mistake. Backports commit 9a107e7b8a3c87ab63ec830d3d60f319fc577ff7 from qemu	2020-05-07 09:13:36 -04:00
Peter Maydell	c06bdf4cc2	target/arm: Convert V[US]DOT (vector) to decodetree Convert the V[US]DOT (vector) insns to decodetree. Backports commit 32da0e330d3e5218b669079826496751fb52c1ca from qemu	2020-05-07 09:09:24 -04:00
Peter Maydell	1d4dba1e5a	target/arm: Convert VCADD (vector) to decodetree Convert the VCADD (vector) insns to decodetree. Backports commit 94d5eb7b3f72fbbdee55d7908e9cb6de95949f4b from qemu	2020-05-07 09:05:55 -04:00
Peter Maydell	d8287755b2	target/arm: Convert VCMLA (vector) to decodetree Convert the VCMLA (vector) insns in the 3same extension group to decodetree. Backports commit afff8de0d4d55b4ce7c36eb9cdfafe477a35dd75 from qemu	2020-05-07 09:02:52 -04:00
Peter Maydell	c2c628eb71	target/arm: Add stubs for AArch32 Neon decodetree Add the infrastructure for building and invoking a decodetree decoder for the AArch32 Neon encodings. At the moment the new decoder covers nothing, so we always fall back to the existing hand-written decode. We follow the same pattern we did for the VFP decodetree conversion (commit 78e138bc1f672c145ef6ace74617d and following): code that deals with Neon will be moving gradually out to translate-neon.vfp.inc, which we #include into translate.c. In order to share the decode files between A32 and T32, we split Neon into 3 parts: * data-processing * load-store * 'shared' encodings The first two groups of instructions have similar but not identical A32 and T32 encodings, so we need to manually transform the T32 encoding into the A32 one before calling the decoder; the third group covers the Neon instructions which are identical in A32 and T32. Backports commit 625e3dd44a15dfbe9532daa6454df3f86cf04d3e from qemu	2020-05-07 08:59:42 -04:00
Peter Maydell	518d18062f	target/arm: Don't allow Thumb Neon insns without FEATURE_NEON We were accidentally permitting decode of Thumb Neon insns even if the CPU didn't have the FEATURE_NEON bit set, because the feature check was being done before the call to disas_neon_data_insn() and disas_neon_ls_insn() in the Arm decoder but was omitted from the Thumb decoder. Push the feature bit check down into the called functions so it is done for both Arm and Thumb encodings. Backports commit d1a6d3b594157425232a1ae5ea7f51b7a1c1aa2e from qemu	2020-05-07 08:55:02 -04:00
Peter Maydell	1964e4b9c9	target/arm/translate-vfp.inc.c: Remove duplicate simd_r32 check Somewhere along theline we accidentally added a duplicate "using D16-D31 when they don't exist" check to do_vfm_dp() (probably an artifact of a patchseries rebase). Remove it. Backports commit 0d787cf1f3c88fa29477e054f8523f6d82d91c98 from qemu	2020-05-07 08:52:42 -04:00
Philippe Mathieu-Daudé	8f90b77a6d	target/arm: Use uint64_t for midr field in CPU state struct MIDR_EL1 is a 64-bit system register with the top 32-bit being RES0. Represent it in QEMU's ARMCPU struct with a uint64_t, not a uint32_t. This fixes an error when compiling with -Werror=conversion because we were manipulating the register value using a local uint64_t variable: target/arm/cpu64.c: In function ‘aarch64_max_initfn’: target/arm/cpu64.c:628:21: error: conversion from ‘uint64_t’ {aka ‘long unsigned int’} to ‘uint32_t’ {aka ‘unsigned int’} may change value [-Werror=conversion] 628 \| cpu->midr = t; \| ^ and future-proofs us against a possible future architecture change using some of the top 32 bits. Backports commit e544f80030121040c8932ff1bd4006f390266c0f from qemu	2020-05-07 08:51:28 -04:00
Peter Maydell	d28059f4ea	target/arm: Use correct variable for setting 'max' cpu's ID_AA64DFR0 In aarch64_max_initfn() we update both 32-bit and 64-bit ID registers. The intended pattern is that for 64-bit ID registers we use FIELD_DP64 and the uint64_t 't' register, while 32-bit ID registers use FIELD_DP32 and the uint32_t 'u' register. For ID_AA64DFR0 we accidentally used 'u', meaning that the top 32 bits of this 64-bit ID register would end up always zero. Luckily at the moment that's what they should be anyway, so this bug has no visible effects. Use the right-sized variable. Backports commit 5a89dd2385a193aa954a7c9bf4e381f2ba6ae359 from qemu	2020-05-07 08:50:40 -04:00
Peter Maydell	b427549ce4	target/arm: Implement ARMv8.2-TTS2UXN The ARMv8.2-TTS2UXN feature extends the XN field in stage 2 translation table descriptors from just bit [54] to bits [54:53], allowing stage 2 to control execution permissions separately for EL0 and EL1. Implement the new semantics of the XN field and enable the feature for our 'max' CPU. Backports commit ce3125bed935a12e619a8253c19340ecaa899347 from qemu	2020-05-07 08:49:18 -04:00
Peter Maydell	1e75276a89	target/arm: Add new 's1_is_el0' argument to get_phys_addr_lpae() For ARMv8.2-TTS2UXN, the stage 2 page table walk wants to know whether the stage 1 access is for EL0 or not, because whether exec permission is given can depend on whether this is an EL0 or EL1 access. Add a new argument to get_phys_addr_lpae() so the call sites can pass this information in. Since get_phys_addr_lpae() doesn't already have a doc comment, add one so we have a place to put the documentation of the semantics of the new s1_is_el0 argument. Backports commit ff7de2fc2c994030bfb83af9ddc9a3cd70ce3e88 from qemu	2020-05-07 08:45:23 -04:00
Peter Maydell	bec9ee21b6	target/arm: Use enum constant in get_phys_addr_lpae() call The access_type argument to get_phys_addr_lpae() is an MMUAccessType; use the enum constant MMU_DATA_LOAD rather than a literal 0 when we call it in S1_ptw_translate(). Backports commit 59dff859cd850876df2cfa561c7bcfc4bdda4599 from qemu	2020-05-07 08:42:41 -04:00
Peter Maydell	3df93e463d	target/arm: Don't use a TLB for ARMMMUIdx_Stage2 We define ARMMMUIdx_Stage2 as being an MMU index which uses a QEMU TLB. However we never actually use the TLB -- all stage 2 lookups are done by direct calls to get_phys_addr_lpae() followed by a physical address load via address_space_ld(). Remove Stage2 from the list of ARM MMU indexes which correspond to real core MMU indexes, and instead put it in the set of "NOTLB" ARM MMU indexes. This allows us to drop NB_MMU_MODES to 11. It also means we can safely add support for the ARMv8.3-TTS2UXN extension, which adds permission bits to the stage 2 descriptors which define execute permission separatel for EL0 and EL1; supporting that while keeping Stage2 in a QEMU TLB would require us to use separate TLBs for "Stage2 for an EL0 access" and "Stage2 for an EL1 access", which is a lot of extra complication given we aren't even using the QEMU TLB. In the process of updating the comment on our MMU index use, fix a couple of other minor errors: NS EL2 EL2&0 was missing from the list in the comment * some text hadn't been updated from when we bumped NB_MMU_MODES above 8 Backports commit bf05340cb655637451162c02dadcd6581a05c02c from qemu	2020-05-07 08:40:06 -04:00
Fredrik Strupe	65200d8aad	target/arm: Make VQDMULL undefined when U=1 According to Arm ARM, VQDMULL is only valid when U=0, while having U=1 is unallocated. Backports commit ab553ef74ee52c0889679d0bd0da084aaf938f5c from qemu	2020-05-07 08:34:56 -04:00
Philippe Mathieu-Daudé	12cad29510	target/arm/cpu: Update coding style to make checkpatch.pl happy We will move this code in the next commit. Clean it up first to avoid checkpatch.pl errors. Backports commit 51c510aa5876a681cd0059ed3bacaa17590dc2d5 from qemu	2020-04-30 21:40:07 -04:00
Thomas Huth	84f2729a29	target/arm: Make cpu_register() available for other files Make cpu_register() (renamed to arm_cpu_register()) available from internals.h so we can register CPUs also from other files in the future. Backports commit 37bcf244454f4efb82e2c0c64bbd7eabcc165a0c from qemu	2020-04-30 21:38:42 -04:00
Philippe Mathieu-Daudé	afeb8ff2dc	target/arm: Restrict the Address Translate write operation to TCG accel Under KVM these registers are written by the hardware. Restrict the writefn handlers to TCG to avoid when building without TCG: LINK aarch64-softmmu/qemu-system-aarch64 target/arm/helper.o: In function `do_ats_write': target/arm/helper.c:3524: undefined reference to `raise_exception' Backports commit 9fb005b02dbda7f47b789b7f19bf5f73622a4756 from qemu	2020-04-30 21:31:22 -04:00
Richard Henderson	b26b4c06cd	target/arm: Vectorize integer comparison vs zero These instructions are often used in glibc's string routines. They were the final uses of the 32-bit at a time neon helpers. Backports commit 6b375d3546b009d1e63e07397ec9c6af256e15e9 from qemu	2020-04-30 21:29:17 -04:00
Corey Wharton	bc097bd2ae	target/riscv: Add a sifive-e34 cpu type The sifive-e34 cpu type is the same as the sifive-e31 with the single precision floating-point extension enabled. Backports commit d784733bf1875c1ba355c69739518f24d56f1260 from qemu	2020-04-30 21:08:10 -04:00
Alistair Francis	dc7cbbe2c1	target/riscv: Remove user version information Remove the user version information. This was never used and never publically exposed in a release of QEMU, so let's just remove it. In future to manage versions we can extend the extension properties to specify version. Backports commit c9a73910c34a2147bcf6a3b5194d27abb19c2e54 from qemu	2020-04-30 21:06:51 -04:00
Anup Patel	15e558e9cc	riscv: Fix Stage2 SV32 page table walk As-per RISC-V H-Extension v0.5 draft, the Stage2 SV32 page table has 12bits of VPN[1] and 10bits of VPN[0]. The additional 2bits in VPN[1] is required to handle the 34bit intermediate physical address coming from Stage1 SV32 page table. The 12bits of VPN[1] implies that Stage2 SV32 level-0 page table will be 16KB in size with total 4096 enteries where each entry maps 4MB of memory (same as Stage1 SV32 page table). The get_physical_address() function is broken for Stage2 SV32 level-0 page table because it incorrectly computes output physical address for Stage2 SV32 level-0 page table entry. The root cause of the issue is that get_physical_address() uses the "widened" variable to compute level-0 physical address mapping which changes level-0 mapping size (instead of 4MB). We should use the "widened" variable only for computing index of Stage2 SV32 level-0 page table. Backports commit ee79e7cd47ef47074d7c20c221321c5d31d3683d from qemu	2020-04-30 20:54:08 -04:00
Alistair Francis	434f9b057f	riscv: AND stage-1 and stage-2 protection flags Take the result of stage-1 and stage-2 page table walks and AND the two protection flags together. This way we require both to set permissions instead of just stage-2. Backports commit 8f67cd6db7375f9133d900b13b300931fbc2e1d8 from qemu	2020-04-30 20:53:11 -04:00
Alistair Francis	b6e2cf33df	riscv: Don't use stage-2 PTE lookup protection flags When doing the fist of a two stage lookup (Hypervisor extensions) don't set the current protection flags from the second stage lookup of the base address PTE. Backports commit 384728905441279e54fa3d714b11bf1b1bcbfd27 from qemu	2020-04-30 20:52:06 -04:00
Philippe Mathieu-Daudé	6d9233a542	various: Remove suspicious '\' character outside of #define in C code Fixes the following coccinelle warnings: $ spatch --sp-file --verbose-parsing ... \ scripts/coccinelle/remove_local_err.cocci ... SUSPICIOUS: a \ character appears outside of a #define at ./target/ppc/translate_init.inc.c:5213 SUSPICIOUS: a \ character appears outside of a #define at ./target/ppc/translate_init.inc.c:5261 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:166 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:167 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:169 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:170 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:171 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:172 SUSPICIOUS: a \ character appears outside of a #define at ./target/microblaze/cpu.c:173 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5787 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5789 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5800 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5801 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5802 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5804 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5805 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:5806 SUSPICIOUS: a \ character appears outside of a #define at ./target/i386/cpu.c:6329 SUSPICIOUS: a \ character appears outside of a #define at ./hw/sd/sdhci.c:1133 SUSPICIOUS: a \ character appears outside of a #define at ./hw/scsi/scsi-disk.c:3081 SUSPICIOUS: a \ character appears outside of a #define at ./hw/net/virtio-net.c:1529 SUSPICIOUS: a \ character appears outside of a #define at ./hw/riscv/sifive_u.c:468 SUSPICIOUS: a \ character appears outside of a #define at ./dump/dump.c:1895 SUSPICIOUS: a \ character appears outside of a #define at ./block/vhdx.c:2209 SUSPICIOUS: a \ character appears outside of a #define at ./block/vhdx.c:2215 SUSPICIOUS: a \ character appears outside of a #define at ./block/vhdx.c:2221 SUSPICIOUS: a \ character appears outside of a #define at ./block/vhdx.c:2222 SUSPICIOUS: a \ character appears outside of a #define at ./block/replication.c:172 SUSPICIOUS: a \ character appears outside of a #define at ./block/replication.c:173 Backports commit 78ee6bd04821847036a805cb4bdd46464e1d3098 from qemu	2020-04-30 07:31:45 -04:00
Peter Maydell	2a2bb20399	Open 5.1 development tree Backports commit a7922a3c81f34f45b1ebc9670a7769edc4c42a43 from qemu	2020-04-30 07:30:38 -04:00
Peter Maydell	0bc85f116b	Update version for v5.0.0 release Backports commit fdd76fecdde1ad444ff4deb7f1c4f7e4a1ef97d6 from qemu	2020-04-30 07:30:12 -04:00
Peter Maydell	4dd2736c2b	Update version for v5.0.0-rc4 release Backports commit ee573f5326046223b6eef4ae7fbfec31d274e093 from qemu	2020-04-30 07:29:47 -04:00
Peter Maydell	a4a171a9c9	target/arm: Fix ID_MMFR4 value on AArch64 'max' CPU In commit 41a4bf1feab098da4cd the added code to set the CNP field in ID_MMFR4 for the AArch64 'max' CPU had a typo where it used the wrong variable name, resulting in ID_MMFR4 fields AC2, XNX and LSM being wrong. Fix the typo. Fixes: 41a4bf1feab098da4cd Backports commit e73c4443473107ddf11ad3a7fea5bef2001ee802 from qemu	2020-04-30 07:29:06 -04:00
Peter Maydell	cecdfd5fca	Update version for v5.0.0-rc3 release Backports commit 20038cd7a8412feeb49c01f6ede89e36c8995472 from qemu	2020-04-30 07:28:15 -04:00
Peter Maydell	c6509498da	osdep.h: Drop no-longer-needed Coverity workarounds In commit a1a98357e3fd in 2018 we added some workarounds for Coverity not being able to handle the _Float* types introduced by recent glibc. Newer versions of the Coverity scan tools have support for these types, and will fail with errors about duplicate typedefs if we have our workaround. Remove our copy of the typedefs. Backports commit c160f17cd6f5fc3f8698b408a451149b34b1a647 from qemu	2020-04-30 07:27:24 -04:00
Alexander Duyck	05cd02d6c6	memory: Do not allow direct write access to rom_device regions According to the documentation in memory.h a ROM memory region will be backed by RAM for reads, but is supposed to go through a callback for writes. Currently we were not checking for the existence of the rom_device flag when determining if we could perform a direct write or not. To correct that add a check to memory_region_is_direct so that if the memory region has the rom_device flag set we will return false for all checks where is_write is set. Backports commit d489ae4ac57ebe14bde8384556cbac237ead988d from qemu	2020-04-30 07:26:06 -04:00
lixinyu	7c32c5b0a4	tcg/mips: mips sync* encode error OPC_SYNC_WMB, OPC_SYNC_MB, OPC_SYNC_ACQUIRE, OPC_SYNC_RELEASE and OPC_SYNC_RMB have wrong encode. According to the mips manual, their encode should be 'OPC_SYNC \| 0x?? << 6' rather than 'OPC_SYNC \| 0x?? << 5'. Wrong encode can lead illegal instruction errors. These instructions often appear with multi-threaded simulation. Fixes: 6f0b99104a3 ("tcg/mips: Add support for fence") Backports commit a4e57084c16d5b0eff3651693fba04f26b30b551 from qemu	2020-04-30 07:24:57 -04:00
Peter Maydell	eeeb5cde38	Update version for v5.0.0-rc2 release Backports commit f3bac27cc1e303e1860cc55b9b6889ba39dee587 from qemu	2020-04-30 07:24:07 -04:00
Richard Henderson	3e934b99c8	softfloat: Fix BAD_SHIFT from normalizeFloatx80Subnormal All other calls to normalize*Subnormal detect zero input before the call -- this is the only outlier. This case can happen with +0.0 + +0.0 = +0.0 or -0.0 + -0.0 = -0.0, so return a zero of the correct sign. Reported-by: Coverity (CID 1421991) Backports commit 2f311075b7a74124098effc72290767b02869561 from qemu	2020-04-30 07:22:57 -04:00
Peter Maydell	6a015761ac	target/arm: Remove obsolete TODO note from get_phys_addr_lpae() An old comment in get_phys_addr_lpae() claims that the code does not support the different format TCR for VTCR_EL2. This used to be true but it is not true now (in particular the aa64_va_parameters() and aa32_va_parameters() functions correctly handle the different register format by checking whether the mmu_idx is Stage2). Remove the out of date parts of the comment. Backports commit 07d1be3b3aac20c21ac4a95c7f3f01a3622a31a3 from qemu	2020-04-30 07:21:17 -04:00
Peter Maydell	4228e7f155	target/arm: PSTATE.PAN should not clear exec bits Our implementation of the PSTATE.PAN bit incorrectly cleared all access permission bits for privileged access to memory which is user-accessible. It should only affect the privileged read and write permissions; execute permission is dealt with via XN/PXN instead. Fixes: 81636b70c226dc27d7ebc8d Backports commit f4e1dbc578a051db08a40c05276ebf525b98f949 from qemu	2020-04-30 07:20:20 -04:00
Peter Maydell	f0a864a8d9	Update version for v5.0.0-rc1 release Backports commit 2833ad487cfff7dc33703e4731b75facde1c561e from qemu	2020-04-30 07:17:31 -04:00
Philippe Mathieu-Daudé	c68800aa29	decodetree: Use Python3 floor division operator This script started using Python2, where the 'classic' division operator returns the floor result. In commit 3d004a371 we started to use Python3, where the division operator returns the float result ('true division'). To keep the same behavior, use the 'floor division' operator "//" which returns the floor result. Fixes: 3d004a371 Backports commit b412378785c1bd95e3461c1373dd8938bc54fb4e from qemu	2020-04-30 07:16:30 -04:00
Richard Henderson	299ba4e867	tcg/i386: Fix INDEX_op_dup2_vec We were only constructing the 64-bit element, and not replicating the 64-bit element across the rest of the vector. Backports commit e20cb81d9c5a3d0f9c08f3642728a210a1c162c9 from qemu	2020-04-30 07:15:08 -04:00
Jiaxun Yang	5ca33a4aaa	target/mips: Fix loongson multimedia condition instructions Loongson multimedia condition instructions were previously implemented as write 0 to rd due to lack of documentation. So I just confirmed with Loongson about their encoding and implemented them correctly. Backports commit 84878f4c00a7beca1d1460e2f77a6c833b8d0393 from qemu	2020-04-30 07:14:10 -04:00
Richard Henderson	224f35b48e	configure: Support -static-pie if requested Recent toolchains support static and pie at the same time. As with normal dynamic builds, allow --static to default to PIE if supported by the toolchain. Allow --enable/--disable-pie to override the default. Backports commit 127814629b32d5e0de2873d742e08cb9bd412af7 from qemu	2020-04-30 07:09:34 -04:00
Richard Henderson	f08ba73e78	configure: Override the os default with --disable-pie Some distributions, e.g. Ubuntu 19.10, enable PIE by default. If for some reason one wishes to build a non-pie binary, we must provide additional options to override. At the same time, reorg the code to an elif chain. Backports commit 2c674109c2da3f2e17b69f39e0b93b3f3d3dfa59 from qemu	2020-04-30 07:07:39 -04:00
Richard Henderson	598083421b	configure: Unnest detection of -z,relro and -z,now There is nothing about these options that is related to PIE. Use them unconditionally. Backports commit e6cbd75158ea274ab98c13c9b73d2bc1d90aa50d from qemu	2020-04-30 07:06:34 -04:00
Richard Henderson	e546e24b83	configure: Always detect -no-pie toolchain support The CFLAGS_NOPIE and LDFLAGS_NOPIE variables are used in pc-bios/optionrom/Makefile, which has nothing to do with the PIE setting of the main qemu executables. This overrides any operating system default to build all executables as PIE, which is important for ROMs. Backports commit b26341241bbfe9cc126479a0dbed5d40d547f242 from qemu	2020-04-30 07:05:45 -04:00
Richard Henderson	13e94b11ea	configure: Do not force pie=no for non-x86 PIE is supported on many other hosts besides x86. The default for non-x86 is now the same as x86: pie is used if supported, and may be forced via --enable/--disable-pie. The original commit (40d6444e91c) said: "Non-x86 are not changed, as they require TCG changes" but I think that's wrong -- there's nothing about PIE that affects TCG one way or another. Tested on aarch64 (bionic) and ppc64le (centos 7) hosts. Backports commit d2cd29e30736afd4a1e8cac3cf4da360bbc65978 from qemu	2020-04-30 07:04:13 -04:00
Richard Henderson	c6f644ea0c	tcg: Remove softmmu code_gen_buffer fixed address The commentary talks about "in concert with the addresses assigned in the relevant linker script", except there is no linker script for softmmu, nor has there been for some time. (Do not confuse the user-only linker script editing that was removed in the previous patch, because user-only does not use this code_gen_buffer allocation method.) Backports commit 64547a3bb6c92781372994e4e9b25a89f6c88074 from qemu	2020-04-30 07:03:06 -04:00
Changbin Du	1e274425bd	target/arm: fix incorrect current EL bug in aarch32 exception emulation The arm_current_el() should be invoked after mode switching. Otherwise, we get a wrong current EL value, since current EL is also determined by current mode. Fixes: 4a2696c0d4 ("target/arm: Set PAN bit as required on exception entry") Backports commit 88828bf133b64b7a860c166af3423ef1a47c5d3b from qemu	2020-04-30 06:57:36 -04:00
Richard Henderson	5ac51ff092	target/i386: Renumber EXCP_SYSCALL We are not short of numbers for EXCP_*. There is no need to confuse things by having EXCP_VMEXIT and EXCP_SYSCALL overlap, even though the former is only used for system mode and the latter is only used for user mode. Backports commit 628460891dd46c25e33eec01757ac655679ea198 from qemu	2020-04-30 06:56:25 -04:00
Peter Maydell	f8891928eb	Update version for v5.0.0-rc0 release Backports commit 736cf607e40674776d752acc201f565723e86045 from qemu	2020-04-30 06:55:45 -04:00
Richard Henderson	c9ee9a2729	target/arm: Move computation of index in handle_simd_dupe Coverity reports a BAD_SHIFT with ctz32(imm5), with imm5 == 0. This is an invalid encoding, but we diagnose that just below by rejecting size > 3. Avoid the warning by sinking the computation of index below the check. Backports commit 550a04893c2bd4442211b353680b9a6408d94dba from qemu	2020-04-30 06:54:39 -04:00
Richard Henderson	fd4ce2cba0	target/arm: Assert immh != 0 in disas_simd_shift_imm Coverity raised a shed-load of errors cascading from inferring that clz32(immh) might yield 32, from immh might be 0. While immh cannot be 0 from encoding, it is not obvious even to a human how we've checked that: via the filtering provided by data_proc_simd[]. Backports commit 3944d58db3fc5bf131345a21a44013bc13849a12 from qemu	2020-04-30 06:53:54 -04:00
Richard Henderson	d5234c8b3d	target/arm: Rearrange disabled check for watchpoints Coverity rightly notes that ctz32(bas) on 0 will return 32, which makes the len calculation a BAD_SHIFT. A value of 0 in DBGWCR<n>_EL1.BAS is reserved. Simply move the existing check we have for this case Backports commit ae1111d4def40c6f592c3a307c599272b778eb65 from qemu	2020-04-30 06:52:38 -04:00
Moger, Babu	8c5f623ac3	i386: Add 2nd Generation AMD EPYC processors Adds the support for 2nd Gen AMD EPYC Processors. The model display name will be EPYC-Rome. Adds the following new feature bits on top of the feature bits from the first generation EPYC models. perfctr-core : core performance counter extensions support. Enables the VM to use extended performance counter support. It enables six programmable counters instead of four counters. clzero : instruction zeroes out the 64 byte cache line specified in RAX. xsaveerptr : XSAVE, XSAVE, FXSAVEOPT, XSAVEC, XSAVES always save error pointers and FXRSTOR, XRSTOR, XRSTORS always restore error pointers. wbnoinvd : Write back and do not invalidate cache ibpb : Indirect Branch Prediction Barrier amd-stibp : Single Thread Indirect Branch Predictor clwb : Cache Line Write Back and Retain xsaves : XSAVES, XRSTORS and IA32_XSS support rdpid : Read Processor ID instruction support umip : User-Mode Instruction Prevention support The Reference documents are available at https://developer.amd.com/wp-content/resources/55803_0.54-PUB.pdf https://www.amd.com/system/files/TechDocs/24594.pdf Depends on following kernel commits: 40bc47b08b6e ("kvm: x86: Enumerate support for CLZERO instruction") 504ce1954fba ("KVM: x86: Expose XSAVEERPTR to the guest") 6d61e3c32248 ("kvm: x86: Expose RDPID in KVM_GET_SUPPORTED_CPUID") 52297436199d ("kvm: svm: Update svm_xsaves_supported") Backports commit 143c30d4d346831a09e59e9af45afdca0331e819 from qem	2020-04-30 06:50:02 -04:00
Moger, Babu	ced1be70e2	i386: Add missing cpu feature bits in EPYC model Adds the following missing CPUID bits: perfctr-core : core performance counter extensions support. Enables the VM to use extended performance counter support. It enables six programmable counters instead of 4 counters. clzero : instruction zeroes out the 64 byte cache line specified in RAX. xsaveerptr : XSAVE, XSAVE, FXSAVEOPT, XSAVEC, XSAVES always save error pointers and FXRSTOR, XRSTOR, XRSTORS always restore error pointers. ibpb : Indirect Branch Prediction Barrie. xsaves : XSAVES, XRSTORS and IA32_XSS supported. Depends on following kernel commits: 40bc47b08b6e ("kvm: x86: Enumerate support for CLZERO instruction") 504ce1954fba ("KVM: x86: Expose XSAVEERPTR to the guest") 52297436199d ("kvm: svm: Update svm_xsaves_supported") These new features will be added in EPYC-v3. The -cpu help output after the change. x86 EPYC-v1 AMD EPYC Processor x86 EPYC-v2 AMD EPYC Processor (with IBPB) x86 EPYC-v3 AMD EPYC Processor Backports commit a16e8dbc043720abcb37fc7dca313e720b4e0f0c from qemu	2020-04-30 06:47:26 -04:00
Tao Xu	7b2296a512	target/i386: Add Denverton-v2 (no MPX) CPU model Because MPX is being removed from the linux kernel, remove MPX feature from Denverton. Backports commit ab0c942c868210e78ff88aef83efb4b4018068e1 from qemu	2020-04-30 06:45:13 -04:00
Richard Henderson	12d7126443	tcg: Remove tcg-runtime-gvec.c DO_CMP0 Partial cleanup from the CONFIG_VECTOR16 removal. Replace DO_CMP0 with its scalar expansion, a simple negation. Backports commit 0270bd503e3699b7202200a2d693ad1feb57473f from qemu	2020-04-30 06:43:07 -04:00
Richard Henderson	6ab9bca262	tcg: Tidy tcg-runtime-gvec.c DUP* Partial cleanup from the CONFIG_VECTOR16 removal. Replace the DUP* expansions with the scalar argument. Backports commit 0a83e43a9ee624b44da61514db9b77d86e74e8c2 from qemu	2020-04-30 06:41:48 -04:00
Richard Henderson	5b14fc103a	tcg: Tidy tcg-runtime-gvec.c types Partial cleanup from the CONFIG_VECTOR16 removal. Replace the vec* types with their scalar expansions. Backports commit 6c7ab3015ac498181444deff55dcc8fd43ad468c from qemu	2020-04-30 06:34:37 -04:00
Richard Henderson	72d8af5282	tcg: Remove CONFIG_VECTOR16 The comment in tcg-runtime-gvec.c about CONFIG_VECTOR16 says that tcg-op-gvec.c has eliminated size 8 vectors, and only passes on multiples of 16. This may have been true of the first few operations, but is not true of all operations. In particular, multiply, shift by scalar, and compare of 8- and 16-bit elements are not expanded inline if host vector operations are not supported. For an x86_64 host that does not support AVX, this means that we will fall back to the helper, which will attempt to use SSE instructions, which will SEGV on an invalid 8-byte aligned memory operation. This patch simply removes the CONFIG_VECTOR16 code and configuration without further simplification. Buglink: https://bugs.launchpad.net/bugs/1863508 Backports commit 43d1ccd2a02fadf3c36b46a8c31125a28864d141 from qemu	2020-04-30 06:28:46 -04:00
Richard Henderson	b358f771f6	tcg/i386: Bound shift count expanding sari_vec A given RISU testcase for SVE can produce tcg-op-vec.c:511: do_shifti: Assertion `i >= 0 && i < (8 << vece)' failed. because expand_vec_sari gave a shift count of 32 to a MO_32 vector shift. In 44f1441dbe1, we changed from direct expansion of vector opcodes to re-use of the tcg expanders. So while the comment correctly notes that the hw will handle such a shift count, we now have to take our own sanity checks into account. Which is easy in this particular case. Fixes: 44f1441dbe1 Backports commit 312b426fea4d6dd322d7472c80010a8ba7a166d2 from qemu	2020-04-30 06:26:42 -04:00
Alex Bennée	46e1dab19e	target/arm: don't bother with id_aa64pfr0_read for USER_ONLY For system emulation we need to check the state of the GIC before we report the value. However this isn't relevant to exporting of the value to linux-user and indeed breaks the exported value as set by modify_arm_cp_regs. Backports commit 976b99b6ec2e15cd7c36d72fdb9b60c37c5494f8 from qemu	2020-04-30 06:24:10 -04:00
Rajnesh Kanwal	cf08d74c26	target/riscv: Fix VS mode interrupts forwarding. Currently riscv_cpu_local_irq_pending is used to find out pending interrupt and VS mode interrupts are being shifted to represent S mode interrupts in this function. So when the cause returned by this function is passed to riscv_cpu_do_interrupt to actually forward the interrupt, the VS mode forwarding check does not work as intended and interrupt is actually forwarded to hypervisor. This patch fixes this issue. Backports commit c5969a3a3c2cb9ea02ffb7e86acb059d3cf8c264 from qemu	2020-04-30 06:21:03 -04:00
Alistair Francis	1001f0ba1f	target/riscv: Correctly implement TSR trap As reported in: https://bugs.launchpad.net/qemu/+bug/1851939 we weren't correctly handling illegal instructions based on the value of MSTATUS_TSR and the current privledge level. This patch fixes the issue raised in the bug by raising an illegal instruction if TSR is set and we are in S-Mode. Backports commit ed5abf46b3c414ef58e647145f19b3966700b206 from qemu	2020-04-30 06:19:49 -04:00
Richard Henderson	6c8172fd08	target/arm: Disable clean_data_tbi for system mode We must include the tag in the FAR_ELx register when raising an addressing exception. Which means that we should not clear out the tag during translation. We cannot at present comply with this for user mode, so we retain the clean_data_tbi function for the moment, though it no longer does what it says on the tin for system mode. This function is to be replaced with MTE, so don't worry about the slight misnaming. Buglink: https://bugs.launchpad.net/qemu/+bug/1867072 Backports commit 38d931687fa196a7ef860f8583815abc7fd5521a from qemu	2020-04-30 06:18:31 -04:00
Richard Henderson	e040675fbf	target/arm: Clean address for DC ZVA This data access was forgotten when we added support for cleaning addresses of TBI information. Fixes: 3a471103ac1823ba Backports commit 597d61a3b1f94c53a3aaa77671697c0c5f797dbf from qemu.	2020-04-30 06:16:03 -04:00
Richard Henderson	a37d9b2be5	target/arm: Use DEF_HELPER_FLAGS for helper_dc_zva The function does not write registers, and only reads them by implication via the exception path. Backports commit 1371b02c5a060e423e70560dbca769b54e471ba9 from qemu	2020-04-30 06:14:45 -04:00
Richard Henderson	3cb68bc44e	target/arm: Move helper_dc_zva to helper-a64.c This is an aarch64-only function. Move it out of the shared file. This patch is code movement only. Backports commit 7b182eb2467af6c47c9c77c64bbbeed8ed53c330 from qemu	2020-04-30 06:12:26 -04:00
Richard Henderson	a22a2a8b71	target/arm: Introduce core_to_aa64_mmu_idx If by context we know that we're in AArch64 mode, we need not test for M-profile when reconstructing the full ARMMMUIdx. Backports commit 20dc67c947a691fa9df05e76aec6df50204b4b94 from qemu	2020-04-30 05:58:59 -04:00
Richard Henderson	d3a5843aeb	target/arm: Replicate TBI/TBID bits for single range regimes Replicate the single TBI bit from TCR_EL2 and TCR_EL3 so that we can unconditionally use pointer bit 55 to index into our composite TBI1:TBI0 field. Backports commit 3e270f67f0f05277021763af119a6ce195f8ed51 from qemu	2020-04-30 05:58:59 -04:00
Richard Henderson	cc32a96183	target/arm: Honor the HCR_EL2.TTLB bit This bit traps EL1 access to tlb maintenance insns. Backports commit 30881b7353b5bb41210c32cd8e00421da757808c from qemu	2020-04-30 05:58:59 -04:00
Richard Henderson	74d6aa6012	target/arm: Honor the HCR_EL2.TPU bit This bit traps EL1 access to cache maintenance insns that operate to the point of unification. There are no longer any references to plain aa64_cacheop_access, so remove it. Backports commit 38262d8a732f8bd0e9ca3dc064f6e73d00c08b9a from qemu	2020-04-30 05:58:59 -04:00
MerryMage	f9b909f529	header_gen: Add gen_{u,s}shl_i{32,64} to arm	2020-04-13 19:38:59 +01:00
Richard Henderson	f35a83d5ff	target/arm: Honor the HCR_EL2.TPCP bit This bit traps EL1 access to cache maintenance insns that operate to the point of coherency or persistence. Backports commit 1bed4d2e55459129c19f5952bcfc65bd0c70db5b from qemu	2020-03-22 02:44:41 -04:00
Richard Henderson	8ff8ff0c4a	target/arm: Honor the HCR_EL2.TACR bit This bit traps EL1 access to the auxiliary control registers. Backports commit 9960237769ada2faaaf1898b96da7a55e1691cf4 from qemu	2020-03-22 02:42:05 -04:00
Richard Henderson	d252af2069	target/arm: Honor the HCR_EL2.TSW bit These bits trap EL1 access to set/way cache maintenance insns. Backports commit 1803d2713b29d85031cc964d545036bda9880f26 from qemu	2020-03-22 02:40:10 -04:00
Richard Henderson	7ee27e5d93	target/arm: Honor the HCR_EL2.{TVM,TRVM} bits These bits trap EL1 access to various virtual memory controls. Backports commit 84929218512c19ec9a296fbfd7b39219e0c592ae from qemu	2020-03-22 02:38:03 -04:00
Richard Henderson	7cd75be7c8	target/arm: Improve masking in arm_hcr_el2_eff Update the {TGE,E2H} == '11' masking to ARMv8.6. If EL2 is configured for aarch32, disable all of the bits that are RES0 in aarch32 mode. Backports commit 4990e1d3c128580dd2fa0bbb1a42b6d63ba1ac28 from qemu	2020-03-22 02:32:35 -04:00
Richard Henderson	fa599a9538	target/arm: Add HCR_EL2 bit definitions from ARMv8.6 Backports commit e0a38bb35aa930c2d3b9982914297f0c0e8fd5c8 from qemu	2020-03-22 02:31:19 -04:00
Richard Henderson	c4b2493c2e	target/arm: Improve masking of HCR/HCR2 RES0 bits Don't merely start with v8.0, handle v7VE as well. Ensure that writes from aarch32 mode do not change bits in the other half of the register. Protect reads of aa64 id registers with ARM_FEATURE_AARCH64. Backports commit d1fb4da208411ce7b3dafb9f9e7726ebcec14edb from qemu	2020-03-22 02:28:41 -04:00
Peter Maydell	32b0e506e6	target/arm: Implement (trivially) ARMv8.2-TTCNP The ARMv8.2-TTCNP extension allows an implementation to optimize by sharing TLB entries between multiple cores, provided that software declares that it's ready to deal with this by setting a CnP bit in the TTBRn_ELx. It is mandatory from ARMv8.2 onward. For QEMU's TLB implementation, sharing TLB entries between different cores would not really benefit us and would be a lot of work to implement. So we implement this extension in the "trivial" manner: we allow the guest to set and read back the CnP bit, but don't change our behaviour (this is an architecturally valid implementation choice). The only code path which looks at the TTBRn_ELx values for the long-descriptor format where the CnP bit is defined is already doing enough masking to not get confused when the CnP bit at the bottom of the register is set, so we can simply add a comment noting why we're relying on that mask. Backports commit 41a4bf1feab098da4cd5495cd56a99b0339e2275 from qemu	2020-03-22 02:24:48 -04:00
Anup Patel	d47390ade4	target/riscv: Emulate TIME CSRs for privileged mode Currently, TIME CSRs are emulated only for user-only mode. This patch add TIME CSRs emulation for privileged mode. For privileged mode, the TIME CSRs will return value provided by rdtime callback which is registered by QEMU machine/platform emulation (i.e. CLINT emulation). If rdtime callback is not available then the monitor (i.e. OpenSBI) will trap-n-emulate TIME CSRs in software. We see 25+% performance improvement in hackbench numbers when TIME CSRs are not trap-n-emulated. Backports commit c695724868ce4049fd79c5a509880dbdf171e744 from qemu	2020-03-22 02:22:17 -04:00
Alistair Francis	6c3338430a	target/riscv: Add the MSTATUS_MPV_ISSET helper macro Add a helper macro MSTATUS_MPV_ISSET() which will determine if the MSTATUS_MPV bit is set for both 32-bit and 64-bit RISC-V. Backports commit e44b50b5b2e508fdd24915ab0e44ac49685e1de3 from qemu	2020-03-22 02:18:02 -04:00
Alistair Francis	835b025692	target/riscv: Add support for the 32-bit MSTATUSH CSR Backports commit 551fa7e8a695ea5fd1cca8ffd318556855bbf54f from qemu	2020-03-22 02:16:13 -04:00
Alistair Francis	63d5b37ebc	target/riscv: Set htval and mtval2 on execptions Backports commit 3067553993ae986b76a92df8a978778134ecdc84 from qemu	2020-03-22 02:13:07 -04:00
Alistair Francis	141c47a286	target/riscv: Raise the new execptions when 2nd stage translation fails Backports commit b2ef6ab9fee6948cf016f9e741feecdfb333fcbe from qemu	2020-03-22 02:08:53 -04:00
Alistair Francis	773757a919	target/riscv: Implement second stage MMU Backports commit 36a18664bafcfafa5e997b47458387f6fe53d537 from qemu	2020-03-22 02:07:49 -04:00
Alistair Francis	e350bd6f85	target/riscv: Allow specifying MMU stage Backports commit 1448689c7b23690f49a4cce248c6e4ac973d37b8 from qemu	2020-03-22 01:56:41 -04:00
Alistair Francis	74b8fd51f4	target/riscv: Respect MPRV and SPRV for floating point ops mark_fs_dirty() is the only place in translate.c that uses the virt_enabled bool. Let's respect the contents of MSTATUS.MPRV and HSTATUS.SPRV when setting the bool as this is used for performing floating point operations when V=0. Backports commit ae84dd0ab7eaf7e98cd6ee05b2063cce8ff9bc02 from qemu	2020-03-22 01:54:06 -04:00
Alistair Francis	270808e22e	target/riscv: Mark both sstatus and msstatus_hs as dirty Mark both sstatus and vsstatus as dirty (3). Backports commit 45b4dc8b403aa5473ec015336adf7d14d88e85c5 from qemu	2020-03-22 01:51:41 -04:00
Alistair Francis	347ad87e98	target/riscv: Disable guest FP support based on virtual status When the Hypervisor extension is in use we only enable floating point support when both status and vsstatus have enabled floating point support. Backports commit 29409c1d921d607873268671bf11a088efb5558e from qemu	2020-03-22 01:49:05 -04:00
Alistair Francis	0d488f3a76	target/riscv: Only set TB flags with FP status if enabled Backports commit e28eaed87902c5e852c3ed043b27204e879aa4e2 from qemu	2020-03-22 01:45:32 -04:00
Alistair Francis	1e17cbc52c	target/riscv: Remove the hret instruction The hret instruction does not exist in the new spec versions, so remove it from QEMU. Backports commit 0736febb2d0e1bb503ca07091c16a16e78480366 from qemu	2020-03-22 01:44:55 -04:00
Alistair Francis	13a7b74827	target/riscv: Add hfence instructions Backports commit 895c412cb6e79b7b08bd3c2d2fcb70a3cab6ff8a from qemu	2020-03-22 01:43:53 -04:00
Alistair Francis	fcc30eac4d	target/riscv: Add Hypervisor trap return support Backports commit e3fba4bab668a41f7cec6405c5aeb21497bd7c83 from qemu	2020-03-22 01:41:58 -04:00
Alistair Francis	3e3bdf836b	target/riscv: Add hypvervisor trap support Backports commit 5eb9e782f523d2898e2dacd86c6e41365dae74b3 from qemu	2020-03-22 01:40:06 -04:00
Alistair Francis	f6b14e146e	target/riscv: Generate illegal instruction on WFI when V=1 Backports commit 9d0d11269671646be7475cc01142e9d3ed8ae59c from qemu	2020-03-22 01:38:14 -04:00
Alistair Francis	4af0dc9a7e	target/ricsv: Flush the TLB on virtulisation mode changes To ensure our TLB isn't out-of-date we flush it on all virt mode changes. Unlike priv mode this isn't saved in the mmu_idx as all guests share V=1. The easiest option is just to flush on all changes. Backports commit eccc5a12c2fd1c646c69a1e7de29183b7a559973 from qemu	2020-03-22 01:37:19 -04:00
Alistair Francis	5adba06c29	target/riscv: Add support for virtual interrupt setting Backports commit 3ef10a098b0d3ebb02bf8e1325adc3b77af92f0b from qemu	2020-03-22 01:36:28 -04:00
Alistair Francis	4e7aaee31a	target/riscv: Extend the SIP CSR to support virtulisation Backports commit a2e9f57d06279220b1834eca2494e52adae121b8 from qemu	2020-03-22 01:35:11 -04:00
Alistair Francis	1fd49ac6f0	target/riscv: Extend the MIE CSR to support virtulisation Backports commit d0e53ce33ec8f66ffa597c634d50be73264aeadb from qemu	2020-03-22 01:33:17 -04:00
Alistair Francis	6aabd67ef8	target/riscv: Set VS bits in mideleg for Hyp extension Backports commit 713d8363deb3774db14fb88a9fcd99687dcef114 from qemu	2020-03-22 01:32:12 -04:00
Alistair Francis	ebc7b9371f	target/riscv: Add virtual register swapping function Backports commit 66e594f2800ddc55f908830bf9e8dc4cda1304fe from qemu	2020-03-22 01:30:22 -04:00
Alistair Francis	042e3df075	target/riscv: Add Hypervisor machine CSRs accesses Backports commit 34cfb5f61842d495c6f6fc3eeb4197b5b44fd570 from qemu	2020-03-22 01:26:47 -04:00
Alistair Francis	dd356201a6	target/riscv: Add Hypervisor virtual CSRs accesses Backports commit 8747c9eeb2aaec8441d0900b198725ab33af4951 from qemu	2020-03-22 01:24:42 -04:00
Alistair Francis	221c427fc3	target/riscv: Add Hypervisor CSR access functions Backports commit ff2cc1294cd8179d87de299b8e7a16bdb1e69523 from qemu	2020-03-22 01:22:58 -04:00
Alistair Francis	e4e41d3d87	target/riscv: Fix CSR perm checking for HS mode Update the CSR permission checking to work correctly when we are in HS-mode. Backports commit 0a42f4c4408824dc7cb9ff60c9bdce6dcc0d24a5 from qemu	2020-03-22 01:19:04 -04:00
Alistair Francis	ab7c1570d8	target/riscv: Add the force HS exception mode Add a FORCE_HS_EXCEP mode to the RISC-V virtulisation status. This bit specifies if an exeption should be taken to HS mode no matter the current delegation status. This is used when an exeption must be taken to HS mode, such as when handling interrupts. Backports commit c7b1bbc80fc2af17395d3986c346fd2307e57829 from qemu	2020-03-22 01:16:55 -04:00
Alistair Francis	95537388c5	target/riscv: Add the virtulisation mode Backports commit ef6bb7b62682badefdcb744831510aaa5971684f from qemu	2020-03-22 01:15:06 -04:00
Alistair Francis	fa04e7ea45	target/riscv: Rename the H irqs to VS irqs Backports commit 205377f8940898e4c53d1b44350a3d4934a2da72 from qemu	2020-03-22 01:09:04 -04:00
Alistair Francis	0198a09698	target/riscv: Add support for the new execption numbers The v0.5 Hypervisor spec add new execption numbers, let's add support for those. Backports commit ab67a1d07a4f6f1b4d577c5c47013273b9804551 from qemu	2020-03-22 01:07:23 -04:00
Alistair Francis	870603305d	target/riscv: Add the Hypervisor CSRs to CPUState Add the Hypervisor CSRs to CPUState and at the same time (to avoid bisect issues) update the CSR macros for the v0.5 Hyp spec. Backports commit bd023ce33b85d73791b7bc78fd04a8115c60995e from qemu	2020-03-22 01:05:23 -04:00
Alistair Francis	40afe12008	target/riscv: Add the Hypervisor extension Backports commit af1fa0039c799a350bcde07b3d8a71dfde07d11b from qemu	2020-03-22 01:01:07 -04:00
Alistair Francis	83ac8747a5	target/riscv: Convert MIP CSR to target_ulong The MIP CSR is a xlen CSR, it was only 32-bits to allow atomic access. Now that we don't use atomics for MIP we can change this back to a xlen CSR. Backports commit 028616130d5f0abc8a3b96f28963da51a875024b from qemu	2020-03-22 00:59:47 -04:00
Peter Maydell	7271ebf96d	target/arm: Implement ARMv8.3-CCIDX The ARMv8.3-CCIDX extension makes the CCSIDR_EL1 system ID registers have a format that uses the full 64 bit width of the register, and adds a new CCSIDR2 register so AArch32 can get at the high 32 bits. QEMU doesn't implement caches, so we just treat these ID registers as opaque values that are set to the correct constant values for each CPU. The only thing we need to do is allow 64-bit values in our cssidr[] array and provide the CCSIDR2 accessors. We don't set the CCIDX field in our 'max' CPU because the CCSIDR constant values we use are the same as the ones used by the Cortex-A57 and they are in the old 32-bit format. This means that the extra regdef added here is unused currently, but it means that whenever in the future we add a CPU that does need the new 64-bit format it will just work when we set the cssidr values and the ID registers for it. Backports commit 957e615503bd0de22393fd8dbcb22a5064fd2b5c from qemu	2020-03-22 00:17:37 -04:00
Peter Maydell	5416c5a672	target/arm: Implement v8.4-RCPC The v8.4-RCPC extension implements some new instructions: * LDAPUR, LDAPURB, LDAPURH, LDAPRSB, LDAPRSH, LDAPRSW * STLUR, STLURB, STLURH These are all in a new subgroup of encodings that sits below the top-level "Loads and Stores" group in the Arm ARM. The STLUR* instructions have standard store-release semantics; the LDAPUR* have Load-AcquirePC semantics, but (as with LDAPR*) we choose to implement them as the slightly stronger Load-Acquire. Backports commit a1229109dec4375259d3fff99f362405aab7917a from qemu	2020-03-22 00:15:46 -04:00
Peter Maydell	f72582bb7a	target/arm: Implement v8.3-RCPC The v8.3-RCPC extension implements three new load instructions which provide slightly weaker consistency guarantees than the existing load-acquire operations. For QEMU we choose to simply implement them with a full LDAQ barrier. Backports commit 2677cf9f92a5319bb995927f9225940414ce879d from qemu	2020-03-22 00:13:08 -04:00
Peter Maydell	8d12309fd8	target/arm: Fix wrong use of FIELD_EX32 on ID_AA64DFR0 We missed an instance of using FIELD_EX32 on a 64-bit ID register, in isar_feature_aa64_pmu_8_4(). Fix it. Backports commit 54117b90ffd8a3977917971c3bd99bb5242710d9 from qemu.	2020-03-22 00:10:52 -04:00
Richard Henderson	c3eaaf7c33	target/arm: Split VMINMAXNM decode Passing the raw op field from the manual is less instructive than it might be. Do the full decode and use the existing helpers to perform the expansion. Since these are v8 insns, VECLEN+VECSTRIDE are already RES0. Backports commit f2eafb75511e5d2ee601b43dc6ee0bcc6e453acd from qemu	2020-03-22 00:09:53 -04:00
Richard Henderson	303d922e5d	target/arm: Split VFM decode Passing the raw o1 and o2 fields from the manual is less instructive than it might be. Do the full decode and let the trans_* functions pass in booleans to a helper. Backports commit d486f8308a13543bbcc4887f246e856df991a4bc from qemu	2020-03-22 00:07:53 -04:00
Richard Henderson	a445b1dab9	target/arm: Add formats for some vfp 2 and 3-register insns Those vfp instructions without extra opcode fields can share a common @format for brevity. Backports commit 906b60facc3d3dd3af56cb1a7860175d805e10a3 from qemu	2020-03-22 00:05:27 -04:00
Richard Henderson	3d2a091389	target/arm: Remove ARM_FEATURE_VFP* We have converted all tests against these features to ISAR tests. Backports commit f9506e162c33e87b609549157dd8431fcc732085 from qemu	2020-03-22 00:02:13 -04:00
Richard Henderson	4ce91875e4	target/arm: Move the vfp decodetree calls next to the base isa Have the calls adjacent as an intermediate step toward actually merging the decodes. Backports commit f0f6d5c81be47d593e5ece7f06df6fba4c15738b from qemu	2020-03-21 23:54:56 -04:00
Richard Henderson	f1ce64857c	target/arm: Move VLLDM and VLSTM to vfp.decode Now that we no longer have an early check for ARM_FEATURE_VFP, we can use the proper ISA check in trans_VLLDM_VLSTM. Backports commit dc778a6873f534817a13257be2acba3ca87ec015 from qemu	2020-03-21 23:51:59 -04:00
Richard Henderson	7592564248	target/arm: Remove ARM_FEATURE_VFP check from disas_vfp_insn We now have proper ISA checks within each trans_* function. Backports commit 46c98019255b056f5dbc9676a6490951469ca661 from qemu	2020-03-21 23:49:14 -04:00
Richard Henderson	3f0ae7ccee	target/arm: Replace ARM_FEATURE_VFP4 with isar_feature_aa32_simdfmac All remaining tests for VFP4 are for fused multiply-add insns. Since the MVFR1 field is used for both VFP and NEON, move its adjustment from the !has_neon block to the (!has_vfp && !has_neon) block. Test for vfp of the appropraite width alongside the test for simdfmac within translate-vfp.inc.c. Within disas_neon_data_insn, we have already tested for ARM_FEATURE_NEON. Backports commit c52881bbc22b50db99a6c37171ad3eea7d959ae6 from qemu	2020-03-21 23:48:13 -04:00
Richard Henderson	f6b5a9ef81	target/arm: Add missing checks for fpsp_v2 We will eventually remove the early ARM_FEATURE_VFP test, so add a proper test for each trans_* that does not already have another ISA test. Backports commit 82f6abe16b9b951180657c5fe15942d5214aa12e from qemu	2020-03-21 23:42:27 -04:00
Richard Henderson	ed1ce1437a	target/arm: Replace ARM_FEATURE_VFP3 checks with fp{sp, dp}_v3 Sort this check to the start of a trans_* function. Merge this with any existing test for fpdp_v2. Backports commit 84774cc37f2c17e48a4867a8e8e055deb23bea69 from qemu	2020-03-21 23:33:13 -04:00
Richard Henderson	54e9ce5174	target/arm: Perform fpdp_v2 check first Shuffle the order of the checks so that we test the ISA before we test anything else, such as the register arguments. Backports commit 799449abda137153a0e68b8788d8e1486f389490 from qemu	2020-03-21 23:29:08 -04:00
Richard Henderson	7e99995b7b	target/arm: Add isar_feature_aa64_fp_simd, isar_feature_aa32_vfp We cannot easily create "any" functions for these, because the ID_AA64PFR0 fields for FP and SIMD signal "enabled" with zero. Which means that an aarch32-only cpu will return incorrect results when testing the aarch64 registers. To use these, we must either have context or additionally test vs ARM_FEATURE_AARCH64. Backports commit 7d63183ff1a61b3f7934dc9b40b10e4fd5e100cd from qemu	2020-03-21 23:19:14 -04:00
Richard Henderson	39c49f488b	target/arm: Add isar_feature_aa32_{fpsp_v2, fpsp_v3, fpdp_v3} We will shortly use these to test for VFPv2 and VFPv3 in different situations. Backports commit f67957e17cbf8fc3cc5d1146a2db2023404578b0 from qemu	2020-03-21 23:17:19 -04:00
Richard Henderson	f73b360f8e	target/arm: Rename isar_feature_aa32_fpdp_v2 The old name, isar_feature_aa32_fpdp, does not reflect that the test includes VFPv2. We will introduce another feature tests for VFPv3. Backports commit c4ff873583834c8275586914fff714e3ae65dee4 from qemu	2020-03-21 23:16:00 -04:00
Richard Henderson	06b52d6660	target/arm: Add isar_feature_aa32_vfp_simd Use this in the places that were checking ARM_FEATURE_VFP, and are obviously testing for the existance of the register set as opposed to testing for some particular instruction extension. Backports commit 7fbc6a403a0aab834e764fa61d81ed8586cfe352 from qemu	2020-03-21 23:11:36 -04:00
Richard Henderson	764207865a	target/arm: Set ID_MMFR4.HPDS for aarch64_max_initfn We had set this for aarch32-only in arm_max_initfn, but failed to set the same bit for aarch64. Backports commit dac65ba1d7945c5d58ab63d8769103634adb2b01 from qemu	2020-03-21 19:45:09 -04:00
Paolo Bonzini	583cc7b8b1	target/i386: check for empty register in FXAM The fxam instruction returns the wrong result after fdecstp or after an underflow. Check fptags to handle this. Backports commit 93c3593ad04f2610fd0a176dfa89a7e40b6afe1f from qemu	2020-03-21 19:43:24 -04:00
Richard Henderson	7c799d29db	target/arm: Set MVFR0.FPSP for ARMv5 cpus We are going to convert FEATURE tests to ISAR tests, so FPSP needs to be set for these cpus, like we have already for FPDP. Backports commit 9eb4f58918a851fb46895fd9b7ce579afeac9d02 from qemu	2020-03-21 19:41:18 -04:00
Richard Henderson	833de589ed	target/arm: Use isar_feature_aa32_simd_r32 more places Many uses of ARM_FEATURE_VFP3 are testing for the number of simd registers implemented. Use the proper test vs MVFR0.SIMDReg. Backports commit a6627f5fc607939f7c8b9c3157fdcb2d368ba0ed from qemu	2020-03-21 19:39:35 -04:00
Richard Henderson	c06fd38b57	target/arm: Rename isar_feature_aa32_simd_r32 The old name, isar_feature_aa32_fp_d32, does not reflect the MVFR0 field name, SIMDReg. Backports commit 0e13ba7889432c5e2f1bdb1b25e7076ca1b1dcba from qemu	2020-03-21 19:37:33 -04:00
Richard Henderson	fcce8d4aa1	target/arm: Convert PMULL.8 to gvec We still need two different helpers, since NEON and SVE2 get the inputs from different locations within the source vector. However, we can convert both to the same internal form for computation. The sve2 helper is not used yet, but adding it with this patch helps illustrate why the neon changes are helpful. Backports commit e7e96fc5ec8c79dc77fef522d5226ac09f684ba5 from qemu	2020-03-21 19:35:46 -04:00
Richard Henderson	c00f72f74f	target/arm: Convert PMULL.64 to gvec The gvec form will be needed for implementing SVE2. Backports commit b9ed510e46f2f9e31e5e8adb4661d5d1cbe9a459 from qemu	2020-03-21 19:27:38 -04:00
Richard Henderson	db8a935b44	target/arm: Convert PMUL.8 to gvec The gvec form will be needed for implementing SVE2. Extend the implementation to operate on uint64_t instead of uint32_t. Use a counted inner loop instead of terminating when op1 goes to zero, looking toward the required implementation for ARMv8.4-DIT. Backports commit a21bb78e5817be3f494922e1dadd6455fe5d6318 from qemu	2020-03-21 19:22:18 -04:00
Richard Henderson	d3139f2f0a	target/arm: Vectorize USHL and SSHL These instructions shift left or right depending on the sign of the input, and 7 bits are significant to the shift. This requires several masks and selects in addition to the actual shifts to form the complete answer. That said, the operation is still a small improvement even for two 64-bit elements -- 13 vector operations instead of 2 * 7 integer operations. Backports commit 87b74e8b6edd287ea2160caa0ebea725fa8f1ca1 from qemu	2020-03-21 19:14:17 -04:00
Peter Maydell	61cf5abc9e	target/arm: Correctly implement ACTLR2, HACTLR2 The ACTLR2 and HACTLR2 AArch32 system registers didn't exist in ARMv7 or the original ARMv8. They were later added as optional registers, whose presence is signaled by the ID_MMFR4.AC2 field. From ARMv8.2 they are mandatory (ie ID_MMFR4.AC2 must be non-zero). We implemented HACTLR2 in commit 0e0456ab8895a5e85, but we incorrectly made it exist for all v8 CPUs, and we didn't implement ACTLR2 at all. Sort this out by implementing both registers only when they are supposed to exist, and setting the ID_MMFR4 bit for -cpu max. Note that this removes HACTLR2 from our Cortex-A53, -A47 and -A72 CPU models; this is correct, because those CPUs do not implement this register. Fixes: 0e0456ab8895a5e85 Backports commit f6287c24c66d6b9187c1c2887e1c7cfa4d304b0c from qemu	2020-03-21 18:52:30 -04:00
Peter Maydell	1876feeede	target/arm: Use FIELD_EX32 for testing 32-bit fields Cut-and-paste errors mean we're using FIELD_EX64() to extract fields from some 32-bit ID register fields. Use FIELD_EX32() instead. (This makes no difference in behaviour, it's just more consistent.) Backports commit b3a816f6ce1ec184ab6072f50bbe4479fc5116c3 from qemu	2020-03-21 18:50:14 -04:00
Peter Maydell	2ce106df33	target/arm: Use isar_feature function for testing AA32HPD feature Now we have moved ID_MMFR4 into the ARMISARegisters struct, we can define and use an isar_feature for the presence of the ARMv8.2-AA32HPD feature, rather than open-coding the test. While we're here, correct a comment typo which missed an 'A' from the feature name. Backports commit 4036b7d1cd9fb1097a5f4bc24d7d31744256260f from qemu	2020-03-21 18:48:57 -04:00
Peter Maydell	4693b2c011	target/arm: Test correct register in aa32_pan and aa32_ats1e1 checks The isar_feature_aa32_pan and isar_feature_aa32_ats1e1 functions are supposed to be testing fields in ID_MMFR3; but a cut-and-paste error meant we were looking at MVFR0 instead. Fix the functions to look at the right register; this requires us to move at least id_mmfr3 to the ARMISARegisters struct; we choose to move all the ID_MMFRn registers for consistency. Backports commit 10054016eda1b13bdd8340d100fd029cc8b58f36 from qemu	2020-03-21 18:47:12 -04:00
Peter Maydell	e72fa1cb33	target/arm: Correct handling of PMCR_EL0.LC bit The LC bit in the PMCR_EL0 register is supposed to be: * read/write * RES1 on an AArch64-only implementation * an architecturally UNKNOWN value on reset (and use of LC==0 by software is deprecated). We were implementing it incorrectly as read-only always zero, though we do have all the code needed to test it and behave accordingly. Instead make it a read-write bit which resets to 1 always, which satisfies all the architectural requirements above. Backports commit 62d96ff48510f4bf648ad12f5d3a5507227b026f from qemu	2020-03-21 18:40:26 -04:00
Peter Maydell	de428e4b45	target/arm: Correct definition of PMCRDP The PMCR_EL0.DP bit is bit 5, which is 0x20, not 0x10. 0x10 is 'X'. Correct our #define of PMCRDP and add the missing PMCRX. We do have the correct behaviour for handling the DP bit being set, so this fixes a guest-visible bug. Fixes: 033614c47de Backports commit a1ed04dd79aabb9dbeeb5fa7d49f1a3de0357553 from qemu	2020-03-21 18:39:37 -04:00
Peter Maydell	28b239adb9	target/arm: Provide ARMv8.4-PMU in '-cpu max' Set the ID register bits to provide ARMv8.4-PMU (and implicitly also ARMv8.1-PMU) in the 'max' CPU. Backports commit 3bec78447a958d4819911252e056f29740ac25e4 from qemu	2020-03-21 18:38:53 -04:00
Peter Maydell	4dd57f7acc	target/arm: Implement ARMv8.4-PMU extension The ARMv8.4-PMU extension adds: * one new required event, STALL * one new system register PMMIR_EL1 (There are also some more L1-cache related events, but since we don't implement any cache we don't provide these, in the same way we don't provide the base-PMUv3 cache events.) The STALL event "counts every attributable cycle on which no attributable instruction or operation was sent for execution on this PE". QEMU doesn't stall in this sense, so this is another always-reads-zero event. The PMMIR_EL1 register is a read-only register providing implementation-specific information about the PMU; currently it has only one field, SLOTS, which defines behaviour of the STALL_SLOT PMU event. Since QEMU doesn't implement the STALL_SLOT event, we can validly make the register read zero. Backports commit 15dd1ebda4a6ef928d484c5a4f48b8ccb7438bb2 from qemu	2020-03-21 18:37:50 -04:00
Peter Maydell	5c93f43eb9	target/arm: Implement ARMv8.1-PMU extension The ARMv8.1-PMU extension requires: * the evtCount field in PMETYPER<n>_EL0 is 16 bits, not 10 * MDCR_EL2.HPMD allows event counting to be disabled at EL2 * two new required events, STALL_FRONTEND and STALL_BACKEND * ID register bits in ID_AA64DFR0_EL1 and ID_DFR0 We already implement the 16-bit evtCount field and the HPMD bit, so all that is missing is the two new events: STALL_FRONTEND "counts every cycle counted by the CPU_CYCLES event on which no operation was issued because there are no operations available to issue to this PE from the frontend" STALL_BACKEND "counts every cycle counted by the CPU_CYCLES event on which no operation was issued because the backend is unable to accept any available operations from the frontend" QEMU never stalls in this sense, so our implementation is trivial: always return a zero count. Backports commit 0727f63b1ecf765ebc48266f616f8fc362dc7fbc from qemu	2020-03-21 18:34:33 -04:00
Peter Maydell	7dfc30b754	target/arm: Read debug-related ID registers from KVM Backports 1548a7b2ad621a31b4216ed703b6d658a2ecf0d0 from qemu	2020-03-21 18:30:20 -04:00
Peter Maydell	cef6f3e72c	target/arm: Move DBGDIDR into ARMISARegisters We're going to want to read the DBGDIDR register from KVM in a subsequent commit, which means it needs to be in the ARMISARegisters sub-struct. Move it. Backports commit 4426d3617d64922d97b74ed22e67e33b6fb7de0a from qemu	2020-03-21 18:29:01 -04:00
Peter Maydell	a6c9c87a5d	target/arm: Stop assuming DBGDIDR always exists The AArch32 DBGDIDR defines properties like the number of breakpoints, watchpoints and context-matching comparators. On an AArch64 CPU, the register may not even exist if AArch32 is not supported at EL1. Currently we hard-code use of DBGDIDR to identify the number of breakpoints etc; this works for all our TCG CPUs, but will break if we ever add an AArch64-only CPU. We also have an assert() that the AArch32 and AArch64 registers match, which currently works only by luck for KVM because we don't populate either of these ID registers from the KVM vCPU and so they are both zero. Clean this up so we have functions for finding the number of breakpoints, watchpoints and context comparators which look in the appropriate ID register. This allows us to drop the "check that AArch64 and AArch32 agree on the number of breakpoints etc" asserts: * we no longer look at the AArch32 versions unless that's the right place to be looking * it's valid to have a CPU (eg AArch64-only) where they don't match * we shouldn't have been asserting the validity of ID registers in a codepath used with KVM anyway Backports commit 88ce6c6ee85d902f59dc65afc3ca86b34f02b9ed from qemu	2020-03-21 18:26:24 -04:00
Peter Maydell	afc28d9b2c	target/arm: Add _aa64_ and _any_ versions of pmu_8_1 isar checks Add the 64-bit version of the "is this a v8.1 PMUv3?" ID register check function, and the _any_ version that checks for either AArch32 or AArch64 support. We'll use this in a later commit. We don't (yet) do any isar_feature checks on ID_AA64DFR1_EL1, but we move id_aa64dfr1 into the ARMISARegisters struct with id_aa64dfr0, for consistency. Backports commit 2a609df87d9b886fd38a190a754dbc241ff707e8 from qemu	2020-03-21 18:24:00 -04:00
Peter Maydell	e64143966a	target/arm: Define an aa32_pmu_8_1 isar feature test function Instead of open-coding a check on the ID_DFR0 PerfMon ID register field, create a standardly-named isar_feature for "does AArch32 have a v8.1 PMUv3" and use it. This entails moving the id_dfr0 field into the ARMISARegisters struct. Backports commit a617953855b65a602d36364b9643f7e5bc31288e from qemu	2020-03-21 18:21:26 -04:00
Peter Maydell	fd537585d7	target/arm: Use FIELD macros for clearing ID_DFR0 PERFMON field We already define FIELD macros for ID_DFR0, so use them in the one place where we're doing direct bit value manipulation. Backports commit d52c061e541982a3663ad5c65bd3b518dbe85b87 from qemu	2020-03-21 18:17:55 -04:00
Peter Maydell	fd6c635e03	target/arm: Add and use FIELD definitions for ID_AA64DFR0_EL1 Add FIELD() definitions for the ID_AA64DFR0_EL1 and use them where we currently have hard-coded bit values. Backports commit ceb2744b47a1ef4184dca56a158eb3156b6eba36 from qemu	2020-03-21 18:16:55 -04:00
Peter Maydell	ebd7131c16	target/arm: Factor out PMU register definitions Pull the code that defines the various PMU registers out into its own function, matching the pattern we have already for the debug registers. Apart from one style fix to a multi-line comment, this is purely movement of code with no changes to it. Backports commit 24183fb6f00ecca8b508e245c95ff50ddde3f18b from qemu	2020-03-21 18:15:09 -04:00
Peter Maydell	b1c088e2f2	target/arm: Define and use any_predinv isar_feature test Instead of open-coding "ARM_FEATURE_AARCH64 ? aa64_predinv: aa32_predinv", define and use an any_predinv isar_feature test function. Backports commit 22e570730d15374453baa73ff2a699e01ef4e950 from qemu	2020-03-21 18:13:25 -04:00
Peter Maydell	62178626e4	target/arm: Add isar_feature_any_fp16 and document naming/usage conventions Our current usage of the isar_feature feature tests almost always uses an _aa32_ test when the code path is known to be AArch32 specific and an _aa64_ test when the code path is known to be AArch64 specific. There is just one exception: in the vfp_set_fpscr helper we check aa64_fp16 to determine whether the FZ16 bit in the FP(S)CR exists, but this code is also used for AArch32. There are other places in future where we're likely to want a general "does this feature exist for either AArch32 or AArch64" check (typically where architecturally the feature exists for both CPU states if it exists at all, but the CPU might be AArch32-only or AArch64-only, and so only have one set of ID registers). Introduce a new category of isar_feature_* functions: isar_feature_any_foo() should be tested when what we want to know is "does this feature exist for either AArch32 or AArch64", and always returns the logical OR of isar_feature_aa32_foo() and isar_feature_aa64_foo(). Backports commit 6e61f8391cc6cb0846d4bf078dbd935c2aeebff5 from qemu	2020-03-21 18:12:02 -04:00
Peter Maydell	778fcd9562	target/arm: Check aa32_pan in take_aarch32_exception(), not aa64_pan In take_aarch32_exception(), we know we are dealing with a CPU that has AArch32, so the right isar_feature test is aa32_pan, not aa64_pan. Backports commit f8af1143ef93954e77cf59e09b5e004dafbd64fd from qemu	2020-03-21 18:09:27 -04:00
Peter Maydell	e63f70f980	target/arm: Add _aa32_ to isar_feature functions testing 32-bit ID registers Enforce a convention that an isar_feature function that tests a 32-bit ID register always has _aa32_ in its name, and one that tests a 64-bit ID register always has _aa64_ in its name. We already follow this except for three cases: thumb_div, arm_div and jazelle, which all need _aa32_ adding. (As noted in the comment, isar_feature_aa32_fp16_arith() is an exception in that it currently tests ID_AA64PFR0_EL1, but will switch to MVFR1 once we've properly implemented FP16 for AArch32.) Backports commit 873b73c0c891ec20adacc7bd1ae789294334d675 from qemu	2020-03-21 18:08:23 -04:00
Richard Henderson	0131e804fb	target/arm: Split out aa64_va_parameter_tbi, aa64_va_parameter_tbid For the purpose of rebuild_hflags_a64, we do not need to compute all of the va parameters, only tbi. Moreover, we can compute them in a form that is more useful to storing in hflags. This eliminates the need for aa64_va_parameter_both, so fold that in to aa64_va_parameter. The remaining calls to aa64_va_parameter are in get_phys_addr_lpae and in pauth_helper.c. This reduces the total cpu consumption of aa64_va_parameter in a kernel boot plus a kvm guest kernel boot from 3% to 0.5%. Backports commit b830a5ee82e66f54697dcc6450fe9239b7412d13 from qemu	2020-03-21 18:04:39 -04:00
Richard Henderson	2cce7e0dd0	target/arm: Remove ttbr1_valid check from get_phys_addr_lpae Now that aa64_va_parameters_both sets select based on the number of ranges in the regime, the ttbr1_valid check is redundant. Backports commit 03f27724dff15633911e68a3906c30f57938ea45 from qemu	2020-03-21 18:01:24 -04:00
Richard Henderson	f3fa39829d	target/arm: Fix select for aa64_va_parameters_both Select should always be 0 for a regime with one range. Backports commit 71d181640a1a9470f074fa28600ca85587e2ca6b from qemu	2020-03-21 18:00:15 -04:00
Richard Henderson	3183349f1c	target/arm: Use bit 55 explicitly for pauth The psuedocode in aarch64/functions/pac/auth/Auth and aarch64/functions/pac/strip/Strip always uses bit 55 for extfield and do not consider if the current regime has 2 ranges. Backports commit 7eeb4c2ce8dc0a5655526f3f39bd5d6cc02efb39 from qemu	2020-03-21 17:59:06 -04:00
Richard Henderson	51b6064ba4	target/arm: Flush high bits of sve register after AdvSIMD INS Writes to AdvSIMD registers flush the bits above 128. Backports commit 528dc354b6f3aa82d65141cc60bc0e725e6cae98 from qemu	2020-03-21 17:58:09 -04:00
Richard Henderson	74cbfceb56	target/arm: Flush high bits of sve register after AdvSIMD ZIP/UZP/TRN Writes to AdvSIMD registers flush the bits above 128. Backports commit 33649de62e40df0060a1c514574e4ef25c4e52e1 from qemu	2020-03-21 17:56:40 -04:00
Richard Henderson	6eb8472344	target/arm: Flush high bits of sve register after AdvSIMD TBL/TBX Writes to AdvSIMD registers flush the bits above 128. Backports commit 263273bc988e677ebadeaf7d0e49f6792a112db5 from qemu	2020-03-21 17:56:08 -04:00
Richard Henderson	18e9c4805f	target/arm: Flush high bits of sve register after AdvSIMD EXT Writes to AdvSIMD registers flush the bits above 128. Backports commit 78cedfabd53b6f64e7e64fc84878d848e5df1d08 from qemu	2020-03-21 17:55:12 -04:00
Peter Maydell	96a96565db	target/arm: Implement ARMv8.1-VMID16 extension The ARMv8.1-VMID16 extension extends the VMID from 8 bits to 16 bits: * the ID_AA64MMFR1_EL1.VMIDBits field specifies whether the VMID is 8 or 16 bits * the VMID field in VTTBR_EL2 is extended to 16 bits * VTCR_EL2.VS lets the guest specify whether to use the full 16 bits, or use the backwards-compatible 8 bits For QEMU implementing this is trivial: * we do not track VMIDs in TLB entries, so we never use the VMID field * we treat any write to VTTBR_EL2, not just a change to the VMID field bits, as a "possible VMID change" that causes us to throw away TLB entries, so that code doesn't need changing * we allow the guest to read/write the VTCR_EL2.VS bit already So all that's missing is the ID register part: report that we support VMID16 in our 'max' CPU. Backports commit dc7a88d0810ad272bdcd2e0869359af78fdd9114 from qemu	2020-03-21 17:52:43 -04:00
Richard Henderson	57f0aa3044	target/arm: Enable ARMv8.2-UAO in -cpu max Backports commit e11f0eb6724571adb812a3ce5269c41586e0262b from qemu	2020-03-21 17:51:44 -04:00
Richard Henderson	18a86780ee	target/arm: Implement UAO semantics We need only override the current condition under which TBFLAG_A64.UNPRIV is set. Backports commit 7a8014ab871d5320effd737dfe88b2e80f16a509 from qemu	2020-03-21 17:50:29 -04:00
Richard Henderson	5b5050c6ca	target/arm: Update MSR access to UAO Backports commit 9eeb7a1c9531cb3574bfe2c36eb7624802c3ec00 from qemu	2020-03-21 17:48:01 -04:00
Richard Henderson	0630e66b5a	target/arm: Add ID_AA64MMFR2_EL1 Add definitions for all of the fields, up to ARMv8.5. Convert the existing RESERVED register to a full register. Query KVM for the value of the register for the host. Backports commit 64761e10af2742a916c08271828890274137b9e8 from qemu	2020-03-21 17:45:27 -04:00
Richard Henderson	7287bf16b8	target/arm: Enable ARMv8.2-ATS1E1 in -cpu max This includes enablement of ARMv8.1-PAN. Backports commit e0fe7309a7c21ef2386de50d37c86aea0d671c08 from qemu	2020-03-21 17:43:54 -04:00
Richard Henderson	d196288b4f	target/arm: Implement ATS1E1 system registers This is a minor enhancement over ARMv8.1-PAN. The *_PAN mmu_idx are used with the existing do_ats_write. Backports commit 04b07d29722192926f467ea5fedf2c3b0996a2a5 from qemu	2020-03-21 17:42:01 -04:00
Richard Henderson	6576864930	target/arm: Set PAN bit as required on exception entry The PAN bit is preserved, or set as per SCTLR_ELx.SPAN, plus several other conditions listed in the ARM ARM. Backports commit 4a2696c0d4d80e14a192b28148c6167bc5056f94 from qemu	2020-03-21 17:40:11 -04:00
Richard Henderson	aad0621f96	target/arm: Enforce PAN semantics in get_S1prot If we have a PAN-enforcing mmu_idx, set prot == 0 if user_rw != 0. Backports commit 81636b70c226dc27d7ebc8dedbcec26166d23085 from qemu	2020-03-21 17:35:55 -04:00
Richard Henderson	41d03da852	target/arm: Update arm_mmu_idx_el for PAN Examine the PAN bit for EL1, EL2, and Secure EL1 to determine if it applies. Backports commit 66412260cc1bee60a22d96e4ad8569b85745fea4 from qemu	2020-03-21 17:34:12 -04:00
Richard Henderson	35fab80c57	target/arm: Update MSR access for PAN For aarch64, there's a dedicated msr (imm, reg) insn. For aarch32, this is done via msr to cpsr. Writes from el0 are ignored, which is already handled by the CPSR_USER mask. Backports commit 220f508f49c5f49fb771d5105f991c19ffede3f7 from qemu	2020-03-21 17:33:16 -04:00
Richard Henderson	50bb867a6f	target/arm: Introduce aarch64_pstate_valid_mask Use this along the exception return path, where we previously accepted any values Backports commit 140845111809cd6fd57ccde93867b48cc56ffc74 from qemu	2020-03-21 17:26:00 -04:00
Richard Henderson	b6b69d7ac5	target/arm: Remove CPSR_RESERVED The only remaining use was in op_helper.c. Use PSTATE_SS directly, and move the commentary so that it is more obvious what is going on. Backports commit 70dae0d069c45250bbefd9424089383a8ac239de from qemu	2020-03-21 17:24:21 -04:00
Richard Henderson	2d3239d0a1	target/arm: Use aarch32_cpsr_valid_mask in helper_exception_return Using ~0 as the mask on the aarch64->aarch32 exception return was not even as correct as the CPSR_ERET_MASK that we had used on the aarch32->aarch32 exception return. Backports commit d203cabd1bd12f31c9df0b5737421ba67b96857b from qemu	2020-03-21 17:20:53 -04:00
Richard Henderson	c450694f1a	target/arm: Replace CPSR_ERET_MASK with aarch32_cpsr_valid_mask CPSR_ERET_MASK was a useless renaming of CPSR_RESERVED. The function also takes into account bits that the cpu does not support. Backports commit 437864216d63f052f3cd06ec8861d0e432496424 from qemu	2020-03-21 17:19:17 -04:00
Richard Henderson	e4a7a089f0	target/arm: Mask CPSR_J when Jazelle is not enabled The J bit signals Jazelle mode, and so of course is RES0 when the feature is not enabled. Backports commit f062d1447f2a80e7a5f593b8cb5ac7cab5e16eb0 from qemu	2020-03-21 17:17:50 -04:00
Richard Henderson	ca2bb77ab3	target/arm: Split out aarch32_cpsr_valid_mask Split this helper out of msr_mask in translate.c. At the same time, transform the negative reductive logic to positive accumulative logic. It will be usable along the exception paths. While touching msr_mask, fix up formatting. Backports commit 4f9584ed4bba8a57a3cb2fa48a682725005d530a from qemu	2020-03-21 17:16:20 -04:00
Richard Henderson	e9850834d5	target/arm: Move LOR regdefs to file scope For static const regdefs, file scope is preferred. Backports commit d8564ee4e5bce87ec1fdf23656df9367eb1bc571 from qemu	2020-03-21 17:13:58 -04:00
Richard Henderson	5c8c2ca505	target/arm: Add isar_feature tests for PAN + ATS1E1 Include definitions for all of the bits in ID_MMFR3. We already have a definition for ID_AA64MMFR1.PAN. Backports commit 3d6ad6bb466f487bcc861f99e2c9054230df1076 from qemu	2020-03-21 17:13:07 -04:00
Richard Henderson	7aaf0d442b	target/arm: Add mmu_idx for EL1 and EL2 w/ PAN enabled To implement PAN, we will want to swap, for short periods of time, to a different privileged mmu_idx. In addition, we cannot do this with flushing alone, because the AT* instructions have both PAN and PAN-less versions. Add the ARMMMUIdx_PAN constants where necessary next to the corresponding ARMMMUIdx constant. Backports commit 452ef8cb8c7b06f44a30a3c3a54d3be82c4aef59 from qemu	2020-03-21 17:12:16 -04:00
Richard Henderson	ed5a4950fd	target/arm: Add arm_mmu_idx_is_stage1_of_2 Use a common predicate for querying stage1-ness. Backports commit fee7aa46edd76f06c3dc176abb8fd05b365efce6 from qemu	2020-03-21 16:56:03 -04:00
Richard Henderson	12b4e01d9c	tcg: Add tcg_gen_gvec_5_ptr Extend the vector generator infrastructure to handle 5 vector arguments. Backports commit 2445971604c1cfd3ec484457159f4ac300fb04d2 from qemu	2020-03-21 16:54:01 -04:00
Taylor Simpson	6507fdb3b1	tcg: Add support for a helper with 7 arguments Currently, helpers can only take up to 6 arguments. This patch adds the capability for up to 7 arguments. I have tested it with the Hexagon port that I am preparing for submission. Backports commit e6cadf49c3d191f6984e56ec3bbeb0b103ca5bc2 from qemu	2020-03-21 16:53:56 -04:00
Richard Henderson	eb0586f9cd	target/arm: Raise only one interrupt in arm_cpu_exec_interrupt The fall through organization of this function meant that we would raise an interrupt, then might overwrite that with another. Since interrupt prioritization is IMPLEMENTATION DEFINED, we can recognize these in any order we choose. Unify the code to raise the interrupt in a block at the end. Backports commit d63d0ec59d87a698de5ed843288f90a23470cf2e from qemu	2020-03-21 16:42:52 -04:00
Richard Henderson	d00e5ec47d	target/arm: Use bool for unmasked in arm_excp_unmasked The value computed is fully boolean; using int8_t is odd. Backports commit 16e07f78df002067bc4bfb115ba1ee0ce278e9e5 from qemu	2020-03-21 16:40:36 -04:00
Richard Henderson	975f0a9bc5	target/arm: Pass more cpu state to arm_excp_unmasked Avoid redundant computation of cpu state by passing it in from the caller, which has already computed it for itself. Backports commit be87955687446be152f366af543c9234eab78a7c from qemu	2020-03-21 16:39:16 -04:00
Richard Henderson	6023db20bc	target/arm: Move arm_excp_unmasked to cpu.c This inline function has one user in cpu.c, and need not be exposed otherwise. Code movement only, with fixups for checkpatch. Backports commit 310cedf39dea240a89f90729fd99481ff6158e90 from qemu	2020-03-21 16:37:12 -04:00
Richard Henderson	ad5a3b2532	target/arm: Enable ARMv8.1-VHE in -cpu max Backports commit cd3f80aba0c559a6291f7c3e686422b15381f6b7 from qemu	2020-03-21 16:36:04 -04:00
Richard Henderson	36407da586	target/arm: Update arm_cpu_do_interrupt_aarch64 for VHE When VHE is enabled, the exception level below EL2 is not EL1, but EL0, and so to identify the entry vector offset for exceptions targeting EL2 we need to look at the width of EL0, not of EL1. Backports commit cb092fbbaeb7b4e91b3f9c53150c8160f91577c7 from qemu	2020-03-21 16:35:07 -04:00
Richard Henderson	8f1201e392	target/arm: Update get_a64_user_mem_index for VHE The EL2&0 translation regime is affected by Load Register (unpriv). The code structure used here will facilitate later changes in this area for implementing UAO and NV. Backports commit cc28fc30e333dc2f20ebfde54444697e26cd8f6d from qemu	2020-03-21 16:33:52 -04:00
Alex Bennée	76ca1cd732	target/arm: check TGE and E2H flags for EL0 pauth traps According to ARM ARM we should only trap from the EL1&0 regime. Backports commit a7469a3c1edc7687d7d25967bc2c0280de202bca from qemu	2020-03-21 16:27:40 -04:00
Richard Henderson	01e1e7a3a0	target/arm: Update {fp,sve}_exception_el for VHE When TGE+E2H are both set, CPACR_EL1 is ignored. Backports commit c2ddb7cf963b3bea838266bfca62514dc9750a10 from qemu	2020-03-21 16:26:01 -04:00
Richard Henderson	86d0163465	target/arm: Update arm_phys_excp_target_el for TGE The TGE bit routes all asynchronous exceptions to EL2. Backports commit d1b31428fd522b725bc053c84b5fbc8764061363 from qemu	2020-03-21 16:23:52 -04:00
Richard Henderson	0c03fa2dac	target/arm: Flush tlbs for E2&0 translation regime Backports commit 85d0dc9fa205027554372367f6925749a2d2b4c4 from qemu	2020-03-21 16:22:46 -04:00
Richard Henderson	50ac89852a	target/arm: Flush tlb for ASID changes in EL2&0 translation regime Since we only support a single ASID, flush the tlb when it changes. Note that TCR_EL2, like TCR_EL1, has the A1 bit that chooses between the two TTBR* registers for the location of the ASID. Backports commit d06dc93340825030b6297c61199a17c0067b0377 from qemu	2020-03-21 16:13:55 -04:00
Richard Henderson	a2b8ebabfa	target/arm: Add VHE timer register redirection and aliasing Apart from the wholesale redirection that HCR_EL2.E2H performs for EL2, there's a separate redirection specific to the timers that happens for EL0 when running in the EL2&0 regime. Backports commit bb5972e439dc0ac4d21329a9d97bad6760ec702d from qemu	2020-03-21 16:09:54 -04:00
Richard Henderson	e41c51f6da	target/arm: Add VHE system register redirection and aliasing Several of the EL1/0 registers are redirected to the EL2 version when in EL2 and HCR_EL2.E2H is set. Many of these registers have side effects. Link together the two ARMCPRegInfo structures after they have been properly instantiated. Install common dispatch routines to all of the relevant registers. The same set of registers that are redirected also have additional EL12/EL02 aliases created to access the original register that was redirected. Omit the generic timer registers from redirection here, because we'll need multiple kinds of redirection from both EL0 and EL2. Backports commit e2cce18f5c1d0d55328c585c8372cdb096bbf528 from qemu	2020-03-21 15:57:03 -04:00
Richard Henderson	ff720b7fd3	target/arm: Update define_one_arm_cp_reg_with_opaque for VHE For ARMv8.1, op1 == 5 is reserved for EL2 aliases of EL1 and EL0 registers. Backports commit b4ecf60f7eee88cbfe5700044790cb7494c5dd37 from qemu	2020-03-21 15:39:54 -04:00
Richard Henderson	8c7795dc04	target/arm: Update timer access for VHE Backports commit 5bc8437136fb1e7bc8b566f4f2f7269b0f990fad from qemu	2020-03-21 15:38:47 -04:00
Richard Henderson	d6150127b4	target/arm: Add the hypervisor virtual counter Backports commit 8c94b071a09c2183f032febff3112f2b7662156c from qemu	2020-03-21 15:35:36 -04:00
Richard Henderson	8e2ac48ad0	target/arm: Update ctr_el0_access for EL2 Update to include checks against HCR_EL2.TID2. Backports commit 97475a89375d62a7722e04ced9fbdf0b992f4b83 from qemu	2020-03-21 15:31:48 -04:00
Richard Henderson	6886ba66d0	target/arm: Update aa64_zva_access for EL2 The comment that we don't support EL2 is somewhat out of date. Update to include checks against HCR_EL2.TDZ. Backports commit 4351cb72fb65926136ab618c9e40c1f5a8813251 from qemu	2020-03-21 15:30:37 -04:00
Richard Henderson	3a5135473f	target/arm: Update arm_sctlr for VHE Use the correct sctlr for EL2&0 regime. Due to header ordering, and where arm_mmu_idx_el is declared, we need to move the function out of line. Use the function in many more places in order to select the correct control. Backports commit aaec143212bb70ac9549cf73203d13100bd5c7c2 from qemu	2020-03-21 15:29:21 -04:00
Richard Henderson	6073542afc	target/arm: Update arm_mmu_idx for VHE Return the indexes for the EL2&0 regime when the appropriate bits are set within HCR_EL2. Backports commit 6003d9800ee38aa11eefb5cd64ae55abb64bef16 from qemu	2020-03-21 15:23:38 -04:00
Richard Henderson	3d583cd45f	target/arm: Split out arm_mmu_idx_el Backports commit 164690b29f9eaf69fe641859bc9f8954f12e691d from qemu	2020-03-21 15:22:01 -04:00
Richard Henderson	f4397b0212	target/arm: Add regime_has_2_ranges Create a predicate to indicate whether the regime has both positive and negative addresses. Backports commit 339370b90d067345b69585ddf4b668fa01f41d67 from qemu	2020-03-21 15:14:11 -04:00
Richard Henderson	0318d7af99	target/arm: Reorganize ARMMMUIdx Prepare for, but do not yet implement, the EL2&0 regime. This involves adding the new MMUIdx enumerators and adjusting some of the MMUIdx related predicates to match. Backports commit b9f6033c1a5fb7da55ed353794db8ec064f78bb2 from qemu.	2020-03-21 15:10:05 -04:00
Richard Henderson	85a7cfbdc6	target/arm: Tidy ARMMMUIdx m-profile definitions Replace the magic numbers with the relevant ARM_MMU_IDX_M_* constants. Keep the definitions short by referencing previous symbols. Backports commit 25568316b2a7e73d68701042ba6ebdb217205e20 from qemu	2020-03-21 14:58:17 -04:00
Richard Henderson	c223708063	target/arm: Rearrange ARMMMUIdxBit Define via macro expansion, so that renumbering of the base ARMMMUIdx symbols is automatically reflected in the bit definitions. Backports commit 5f09a6dfbfbff4662f52cc3130a2e07044816497 from qemu	2020-03-21 14:56:46 -04:00
Richard Henderson	56504d255b	target/arm: Expand TBFLAG_ANY.MMUIDX to 4 bits We are about to expand the number of mmuidx to 10, and so need 4 bits. For the benefit of reading the number out of -d exec, align it to the penultimate nibble. Backports commit 506f149815c2168f16ade17893e117419d93f248 from qemu	2020-03-21 14:54:55 -04:00
Richard Henderson	be3c71fb8b	target/arm: Recover 4 bits from TBFLAGs We had completely run out of TBFLAG bits. Split A- and M-profile bits into two overlapping buckets. This results in 4 free bits. We used to initialize all of the a32 and m32 fields in DisasContext by assignment, in arm_tr_init_disas_context. Now we only initialize either the a32 or m32 by assignment, because the bits overlap in tbflags. So zero the entire structure in gen_intermediate_code. Backports commit 79cabf1f473ca6e9fa0727f64ed9c2a84a36f0aa from qemu	2020-03-21 14:51:46 -04:00
Richard Henderson	153d7aadd5	target/arm: Rename ARMMMUIdx_S1E2 to ARMMMUIdx_E2 This is part of a reorganization to the set of mmu_idx. The non-secure EL2 regime only has a single stage translation; there is no point in pointing out that the idx is for stage1. Backports commit e013b7411339342aac8d986c5d5e329e1baee8e1 from qemu	2020-03-21 14:42:23 -04:00
Richard Henderson	f45ab0614e	target/arm: Rename ARMMMUIdx_S1E3 to ARMMMUIdx_SE3 This is part of a reorganization to the set of mmu_idx. The EL3 regime only has a single stage translation, and is always secure. Backports commit 127b2b086303296289099a6fb10bbc51077f1d53 from qemu	2020-03-21 14:38:44 -04:00
Richard Henderson	1a672fc3b1	target/arm: Rename ARMMMUIdx_S1SE[01] to ARMMMUIdx_SE10_[01] This is part of a reorganization to the set of mmu_idx. This emphasizes that they apply to the Secure EL1&0 regime. Backports commit fba37aedecb82506c62a1f9e81d066b4fd04e443 from qemu	2020-03-21 14:35:28 -04:00
Richard Henderson	31837384b3	target/arm: Rename ARMMMUIdx_S1NSE* to ARMMMUIdx_Stage1_E* This is part of a reorganization to the set of mmu_idx. The EL1&0 regime is the only one that uses 2-stage translation. Spelling out Stage avoids confusion with Secure. Backports commit 2859d7b590760283a7b5aef40b723e9dfd7c98ba from qemu	2020-03-21 14:20:31 -04:00
Richard Henderson	b62b4c4f35	target/arm: Rename ARMMMUIdx_S2NS to ARMMMUIdx_Stage2 The EL1&0 regime is the only one that uses 2-stage translation. Backports commit 97fa9350017e647151dd1dc212f1bbca0294dba7 from qemu	2020-03-21 14:15:35 -04:00
Richard Henderson	ec05f22e82	target/arm: Rename ARMMMUIdx_S12NSE to ARMMMUIdx_E10_ This is part of a reorganization to the set of mmu_idx. This emphasizes that they apply to the EL1&0 regime. The ultimate goal is -- Non-secure regimes: ARMMMUIdx_E10_0, ARMMMUIdx_E20_0, ARMMMUIdx_E10_1, ARMMMUIdx_E2, ARMMMUIdx_E20_2, -- Secure regimes: ARMMMUIdx_SE10_0, ARMMMUIdx_SE10_1, ARMMMUIdx_SE3, -- Helper mmu_idx for non-secure EL1&0 stage1 and stage2 ARMMMUIdx_Stage2, ARMMMUIdx_Stage1_E0, ARMMMUIdx_Stage1_E1, The 'S' prefix is reserved for "Secure". Unless otherwise specified, each mmu_idx represents all stages of translation. Backports commit 01b98b686460b3a0fb47125882e4f8d4268ac1b6 from qemu	2020-03-21 14:09:15 -04:00
Richard Henderson	270d557a99	target/arm: Split out alle1_tlbmask No functional change, but unify code sequences. Backports commit 90c19cdf1de440d7d9745cf255168999071b3a31 from qemu	2020-03-21 13:57:04 -04:00
Richard Henderson	6d4a7b84b5	target/arm: Split out vae1_tlbmask No functional change, but unify code sequences. Backports commit b7e0730de32d7079a1447ecbb5616d89de77b823 from qemu	2020-03-21 13:53:39 -04:00

... 7 8 9 10 11 ...

5834 commits