unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-01-07 06:35:29 +00:00

Author	SHA1	Message	Date
LIU Zhiwei	8b06759ba4	target/riscv: vector floating-point/integer type-convert instructions Backports 921009732614fd620c75f05496597796719544cf	2021-03-07 12:00:36 -05:00
LIU Zhiwei	fabc8bab77	target/riscv: vector floating-point merge instructions Backports 64ab5846974140118c64e4d94ff2696932a0a58b	2021-03-07 11:58:41 -05:00
LIU Zhiwei	f9c9716534	target/riscv: vector floating-point classify instructions Backports 121ddbb36f17d24a7f39d6024d9b3145d154a98c	2021-03-07 11:55:45 -05:00
LIU Zhiwei	b859be12b9	target/riscv: vector floating-point compare instructions Backports 2a68e9e568faddf4d689a37fa6895bcb8404a677	2021-03-07 11:47:51 -05:00
LIU Zhiwei	31978f270b	target/riscv: vector floating-point sign-injection instructions Backports 1d426b81f71eeeb1cbfec76c2f27ed0495719fb0	2021-03-07 11:43:47 -05:00
LIU Zhiwei	f7f0425a4d	target/riscv: vector floating-point min/max instructions Backports 230b53ddd706c8b18a6d9beed1a0153b276d7037	2021-03-07 11:42:05 -05:00
LIU Zhiwei	69c73cfc4e	target/riscv: vector floating-point square-root instruction Backports d9e4ce72a5a0f7c404156d40d3252d4d6a9d6a36	2021-03-07 11:40:04 -05:00
LIU Zhiwei	95a6d78121	target/riscv: vector widening floating-point fused multiply-add instructions Backports 0dd509594fbd53fc9c3edc79bd7a575f079c3c87	2021-03-07 11:37:23 -05:00
LIU Zhiwei	42116609f0	target/riscv: vector single-width floating-point fused multiply-add instructions Backports 4aa5a8fed4a21fe2e132a9a21b251aa95e19de80	2021-03-07 11:34:56 -05:00
LIU Zhiwei	14cbabde4f	target/riscv: vector widening floating-point multiply Backports f7c7b7cd293ca6f14f23cc2c14d6d23fc47a604d	2021-03-07 11:32:19 -05:00
LIU Zhiwei	5e4b142c31	target/riscv: vector single-width floating-point multiply/divide instructions Backports 0e0057cbe2169195a08ae8247504e69f9b80542b	2021-03-07 11:30:14 -05:00
LIU Zhiwei	0de56731ae	target/riscv: vector widening floating-point add/subtract instructions eeffab2ec1b332a5eb2d2dcd2732cdb57179c6eb	2021-03-07 11:27:33 -05:00
LIU Zhiwei	06092b88b9	target/riscv: vector single-width floating-point add/subtract instructions Backports ce2a0343f441f0ee949690eabae5ab600397e2eb	2021-03-05 09:50:56 -05:00
LIU Zhiwei	5fb589cdd7	target/riscv: vector narrowing fixed-point clip instructions Backports 9ff3d28739b760970f5e542c74a033470dca3f9b	2021-03-05 09:34:11 -05:00
LIU Zhiwei	241deddb50	target/riscv: vector single-width scaling shift instructions Backports 04a614062dd5fb43f00bd955f44f7a2c3def016d	2021-03-05 09:32:15 -05:00
LIU Zhiwei	e7582a5d74	target/riscv: vector widening saturating scaled multiply-add Backports 0a1eaf0036442b2bfa69df7fad9a5f1d6a4984f2	2021-03-05 09:29:42 -05:00
LIU Zhiwei	e27aadfa4f	target/riscv: vector single-width fractional multiply with rounding and saturation Backports 9f0ff9e51480f8f1d2d7a62b11aa156fcdb4ef95	2021-03-05 09:26:56 -05:00
LIU Zhiwei	2343892c2e	target/riscv: vector single-width averaging add and subtract Backports b7aee4819206cbb7adfdb624d4f2fa9918c25d43	2021-03-05 09:25:09 -05:00
LIU Zhiwei	87db3eb130	target/riscv: vector single-width saturating add and subtract Backports eb2650e35ec1ed60ff302ce3330bd6c770640833	2021-03-05 09:23:17 -05:00
LIU Zhiwei	025aa6fd39	target/riscv: vector integer merge and move instructions Backports f020a7a14505d6996497693e63331ab609847d93	2021-03-05 09:20:34 -05:00
LIU Zhiwei	9d14cc8d35	target/riscv: vector widening integer multiply-add instructions Backports 2b587b335050dbc0cb3823758341f145c0375312	2021-03-05 09:13:03 -05:00
LIU Zhiwei	58891e213d	target/riscv: vector single-width integer multiply-add instructions Backports 54df813a331d3badfb83604c36bef7cb1de4315a	2021-03-05 09:11:33 -05:00
LIU Zhiwei	436e092e36	target/riscv: vector widening integer multiply instructions Backports 97b1cba39967251ab78b9d52fd9a4c62bb42d428	2021-03-05 09:09:08 -05:00
LIU Zhiwei	d144afdc45	target/riscv: vector integer divide instructions Backports 85e6658cfe9d71cc207a710ffdf0e6546f8612aa	2021-03-05 09:05:00 -05:00
Lioncash	14d06ee38c	sparc: Fix build	2021-03-05 08:54:43 -05:00
Lioncash	704353c758	mips: Fix build	2021-03-05 08:51:51 -05:00
Lioncash	dec4c70142	i386: Fix build	2021-03-05 08:35:14 -05:00
Lioncash	5436b713ce	m68k: Fix build A bunch of changes to the memory functions recently broke the build. This fixes it.	2021-03-05 08:29:53 -05:00
Zheng Zhan Liang	dfd53d7573	tcg/i386: rdpmc: fix the the condtions Backports c45b426acd1ad8e30fbe1b9af8c07b2889c28c6b	2021-03-04 18:50:48 -05:00
Chenyi Qiang	d7adcf1d7f	target/i386: Add bus lock debug exception support Bus lock debug exception is a feature that can notify the kernel by generate an #DB trap after the instruction acquires a bus lock when CPL>0. This allows the kernel to enforce user application throttling or mitigations. This feature is enumerated via CPUID.(EAX=7,ECX=0).ECX[bit 24]. Backports 06e878b413766778a53be3d25c0373a23679d039	2021-03-04 18:50:00 -05:00
Richard Henderson	d044062b26	target/arm: Enable MTE for user-only Backports e32328645ed6fc4f20f0164dfc9ce1bf7e667cc4	2021-03-04 18:46:47 -05:00
Richard Henderson	c588c150e4	target/arm: Add allocation tag storage for user mode Use the now-saved PAGE_ANON and PAGE_MTE bits, and the per-page saved data. Backports a11d3830d96ad8077440ce4e0aa60608f1f12dde	2021-03-04 18:46:13 -05:00
Richard Henderson	f03656b5c3	target/arm: Split out syndrome.h from internals.h Move everything related to syndromes to a new file, which can be shared with linux-user. Backports 1fe27859427bd377a45708310947de54c687d9ff	2021-03-04 18:44:07 -05:00
Richard Henderson	84368d2d6d	target/arm: Use the proper TBI settings for linux-user We were fudging TBI1 enabled to speed up the generated code. Now that we've improved the code generation, remove this. Also, tidy the comment to reflect the current code. The pauth test was testing a kernel address (-1) and making incorrect assumptions about TBI1; stick to userland addresses. Backports 16c849784873d10d0da257d698e391fddea1f0e4	2021-03-04 18:41:49 -05:00
Richard Henderson	de982a8346	target/arm: Improve gen_top_byte_ignore Use simple arithmetic instead of a conditional move when tbi0 != tbi1. Backports 2169b5c6f7a791ef9c43c72412efaafae3245114	2021-03-04 18:39:43 -05:00
Daniel Müller	642a683d7a	target/arm: Correctly initialize MDCR_EL2.HPMN When working with performance monitoring counters, we look at MDCR_EL2.HPMN as part of the check whether a counter is enabled. This check fails, because MDCR_EL2.HPMN is reset to 0, meaning that no counters are "enabled" for < EL2. That's in violation of the Arm specification, which states that > On a Warm reset, this field [MDCR_EL2.HPMN] resets to the value in > PMCR_EL0.N That's also what a comment in the code acknowledges, but the necessary adjustment seems to have been forgotten when support for more counters was added. This change fixes the issue by setting the reset value to PMCR.N, which is four. Backports d3c1183ffeb71ca3a783eae3d7e1c51e71e8a621	2021-03-04 18:34:06 -05:00
Rebecca Cran	93b0428f48	target/arm: Set ID_PFR0.DIT to 1 for max 32-bit CPU Enable FEAT_DIT for the "max" 32-bit CPU. Backports 5385320c2b3183f2e18dbc55c23ecba9272500c2	2021-03-04 18:31:36 -05:00
Rebecca Cran	66d96057a4	target/arm: Set ID_AA64PFR0.DIT and ID_PFR0.DIT to 1 for max AA64 CPU Enable FEAT_DIT for the "max" AARCH64 CPU. Backports 2bf1eff9e9125a3d73901991dcfb9cb2ace03be1	2021-03-04 18:30:59 -05:00
Rebecca Cran	f7424d89e2	target/arm: Support AA32 DIT by moving PSTATE_SS from cpsr into env->pstate cpsr has been treated as being the same as spsr, but it isn't. Since PSTATE_SS isn't in cpsr, remove it and move it into env->pstate. This allows us to add support for CPSR_DIT, adding helper functions to merge SPSR_ELx to and from CPSR. Backports f944a854ce4007000accf7c191b5b52916947198	2021-03-04 18:24:57 -05:00
Rebecca Cran	d8458f14af	target/arm: Add support for FEAT_DIT, Data Independent Timing Add support for FEAT_DIT. DIT (Data Independent Timing) is a required feature for ARMv8.4. Since virtual machine execution is largely nondeterministic and TCG is outside of the security domain, it's implemented as a NOP. Backports dc8b18534ea1dcc90d80ad9a61a3b0aa7eb312fb	2021-03-04 18:19:32 -05:00
Mike Nawrocki	4e482764e2	target/arm: Fix SCR RES1 handling The FW and AW bits of SCR_EL3 are RES1 only in some contexts. Force them to 1 only when there is no support for AArch32 at EL1 or above. The reset value will be 0x30 only if the CPU is AArch64-only; if there is support for AArch32 at EL1 or above, it will be reset to 0. Also adds helper function isar_feature_aa64_aa32_el1 to check if AArch32 is supported at EL1 or above. Backports 10d0ef3e6cfe228df4b2d3e27325f1b0e2b71fd5	2021-03-04 18:15:39 -05:00
Chenyi Qiang	807d541e19	target/i386: Expose VMX entry/exit load pkrs control bits Expose the VMX exit/entry load pkrs control bits in VMX_TRUE_EXIT_CTLS/VMX_TRUE_ENTRY_CTLS MSRs to guest, which supports the PKS in nested VM. Backports 52a44ad2b92ba4cd81c2b271cd5e4a2d820e91fc	2021-03-04 18:13:36 -05:00
Paolo Bonzini	834e2b2643	target/i86: implement PKS Protection Keys for Supervisor-mode pages is a simple extension of the PKU feature that QEMU already implements. For supervisor-mode pages, protection key restrictions come from a new MSR. The MSR has no XSAVE state associated to it. PKS is only respected in long mode. However, in principle it is possible to set the MSR even outside long mode, and in fact even the XSAVE state for PKRU could be set outside long mode using XRSTOR. So do not limit the migration subsections for PKRU and PKRS to long mode. Backports e7e7bdababeefff10736c6adf410c66d2f0d46fe	2021-03-04 18:12:44 -05:00
David Greenaway	0c1c359b5c	target/i386: Fix decoding of certain BMI instructions This patch fixes a translation bug for a subset of x86 BMI instructions such as the following: c4 e2 f9 f7 c0 shlxq %rax, %rax, %rax Currently, these incorrectly generate an undefined instruction exception when SSE is disabled via CR4, while instructions like "shrxq" work fine. The problem appears to be related to BMI instructions encoded using VEX and with a mandatory prefix of "0x66" (data). Instructions with this data prefix (such as shlxq) are currently rejected. Instructions with other mandatory prefixes (such as shrxq) translate as expected. This patch removes the incorrect check in "gen_sse" that causes the exception to be generated. For the non-BMI cases, the check is redundant: prefixes are already checked at line 3696. Buglink: https://bugs.launchpad.net/qemu/+bug/1748296 Backports 51909241d26fe6fe18a08def93ccc8273f61a8b3	2021-03-04 18:08:47 -05:00
Paolo Bonzini	56afe9f919	target/i386: do not set LM for 32-bit emulation '-cpu host/max' 32-bit targets by definition do not support long mode; therefore, the bit must be masked in the features supported by the accelerator. As a side effect, this avoids setting up the 0x80000008 CPUID leaf for qemu-system-i386 -cpu host which since commit 5a140b255d ("x86/cpu: Use max host physical address if -cpu max option is applied") would have printed this error: qemu-system-i386: phys-bits should be between 32 and 36 (but is 48) Backports 5ea9e9e239db83391a39c09f1de63c4099c20df5	2021-03-04 18:07:38 -05:00
Claudio Fontana	18100d1a3b	cpu: move debug_check_watchpoint to tcg_ops commit 568496c0c0f1 ("cpu: Add callback to check architectural") and commit 3826121d9298 ("target-arm: Implement checking of fired") introduced an ARM-specific hack for cpu_check_watchpoint. Make debug_check_watchpoint optional, and move it to tcg_ops. Backports c73bdb35a91fb6b17c2c93b1ba381fc88a406f8d	2021-03-04 17:30:20 -05:00
Claudio Fontana	7b0c98c236	cpu: move adjust_watchpoint_address to tcg_ops commit 40612000599e ("arm: Correctly handle watchpoints for BE32 CPUs") introduced this ARM-specific, TCG-specific hack to adjust the address, before checking it with cpu_check_watchpoint. Make adjust_watchpoint_address optional and move it to tcg_ops. Backports 9ea9087bb4a86893e4ac6ff643837937dc9e5849	2021-03-04 17:24:32 -05:00
Claudio Fontana	ddfed5f3a6	cpu: move do_unaligned_access to tcg_ops make it consistently SOFTMMU-only. Backports 8535dd702dd054a37a85e0c7971cfb43cc7b50e3	2021-03-04 17:20:02 -05:00
Claudio Fontana	ec08ac4995	cpu: move cc->transaction_failed to tcg_ops Backports cbc183d2d9f5b8a33c2a6cf9cb242b04db1e8d5c	2021-03-04 17:16:41 -05:00
Claudio Fontana	ee73443c7d	cpu: move cc->do_interrupt to tcg_ops Backports 0545608056a6161e7020cd7b9368d9636fa80051	2021-03-04 17:10:14 -05:00
Eduardo Habkost	bc86f4377c	cpu: Move debug_excp_handler to tcg_ops Backports e9ce43e97a19090ae8975ef168b95ba3d29be991	2021-03-04 17:05:57 -05:00
Eduardo Habkost	76a10fa8e0	cpu: Move tlb_fill to tcg_ops Backports e124536f37377cff5d68925d4976ad604d0ebf3a	2021-03-04 17:01:55 -05:00
Eduardo Habkost	03cc62e39c	cpu: Move cpu_exec_* to tcg_ops Backports 48c1a3e303b5a2cca48679645ad3fbb914db741a	2021-03-04 16:56:55 -05:00
Eduardo Habkost	eb38ac1809	cpu: Move synchronize_from_tb() to tcg_ops Backports ec62595bab1873c48a34849de70011093177e769	2021-03-04 16:48:27 -05:00
Claudio Fontana	21375463ea	target/riscv: remove CONFIG_TCG, as it is always TCG for now only TCG is allowed as an accelerator for riscv, so remove the CONFIG_TCG use. Backports 6a3d2e7c0654c3fb2d3368d05363d0635e8bb8ff	2021-03-04 16:40:33 -05:00
Eduardo Habkost	b9b711afe3	cpu: Introduce TCGCpuOperations struct The TCG-specific CPU methods will be moved to a separate struct, to make it easier to move accel-specific code outside generic CPU code in the future. Start by moving tcg_initialize(). The new CPUClass.tcg_opts field may eventually become a pointer, but keep it an embedded struct for now, to make code conversion easier. Backports e9e51b7154404efc9af8735ab87c658a9c434cfd	2021-03-04 16:38:25 -05:00
Claudio Fontana	11ae599cb8	target/arm: do not use cc->do_interrupt for KVM directly cc->do_interrupt is in theory a TCG callback used in accel/tcg only, to prepare the emulated architecture to take an interrupt as defined in the hardware specifications, but in reality the _do_interrupt style of functions in targets are also occasionally reused by KVM to prepare the architecture state in a similar way where userspace code has identified that it needs to deliver an exception to the guest. In the case of ARM, that includes: 1) the vcpu thread got a SIGBUS indicating a memory error, and we need to deliver a Synchronous External Abort to the guest to let it know about the error. 2) the kernel told us about a debug exception (breakpoint, watchpoint) but it is not for one of QEMU's own gdbstub breakpoints/watchpoints so it must be a breakpoint the guest itself has set up, therefore we need to deliver it to the guest. So in order to reuse code, the same arm_do_interrupt function is used. This is all fine, but we need to avoid calling it using the callback registered in CPUClass, since that one is now TCG-only. Fortunately this is easily solved by replacing calls to CPUClass::do_interrupt() with explicit calls to arm_do_interrupt(). Backports 853bfef4e6d60244fd131ec55bbf1e7caa52599b. We don't support KVM, so we just bring the comment addition over.	2021-03-04 16:33:23 -05:00
Philippe Mathieu-Daudé	daafb0ba17	target/arm: Replace magic value by MMU_DATA_LOAD definition cpu_get_phys_page_debug() uses 'DATA LOAD' MMU access type. Backports a9dd161ff2f54446f0b0547447d8196699aca3e1	2021-03-04 15:43:47 -05:00
Richard Henderson	2c8f7b1fbc	target/arm: Conditionalize DBGDIDR Only define the register if it exists for the cpu. Backports 54a78718be6dd5fc6b6201f84bef8de5ac3b3802	2021-03-04 15:42:03 -05:00
Richard Henderson	073923709f	target/arm: Implement ID_PFR2 This was defined at some point before ARMv8.4, and will shortly be used by new processor descriptions. Backports 1d51bc96cc4a9b2d31a3f4cb8442ce47753088e2	2021-03-04 15:40:49 -05:00
Philippe Mathieu-Daudé	d36a968f8e	target/arm/m_helper: Silence GCC 10 maybe-uninitialized error When building with GCC 10.2 configured with --extra-cflags=-Os, we get: target/arm/m_helper.c: In function ‘arm_v7m_cpu_do_interrupt’: target/arm/m_helper.c:1811:16: error: ‘restore_s16_s31’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 1811 \| if (restore_s16_s31) { \| ^ target/arm/m_helper.c:1350:10: note: ‘restore_s16_s31’ was declared here 1350 \| bool restore_s16_s31; \| ^~~~~~~~~~~~~~~ cc1: all warnings being treated as errors Initialize the 'restore_s16_s31' variable to silence the warning. Backports 0ae4f11ee57350dac0e705ba79516310400ff43c	2021-03-04 15:16:55 -05:00
Richard Henderson	0636518de4	target/arm: Update REV, PUNPK for pred_desc Update all users of do_perm_pred2 for the new predicate descriptor field definitions. Backports 70acaafef2e053a312d54c09b6721c730690e72c	2021-03-04 15:15:47 -05:00
Richard Henderson	eb315be37e	target/arm: Update ZIP, UZP, TRN for pred_desc Update all users of do_perm_pred3 for the new predicate descriptor field definitions. Backports f9b0fcceccfc05cde62ff7577fbf2bc13b842414	2021-03-04 15:15:10 -05:00
Richard Henderson	fac4e416c9	target/arm: Update PFIRST, PNEXT for pred_desc These two were odd, in that do_pfirst_pnext passed the count of 64-bit words rather than bytes. Change to pass the standard pred_full_reg_size to avoid confusion. Backports 86300b5d044064046395ae8ed605cc19e63f2a7c	2021-03-04 15:09:47 -05:00
Richard Henderson	4ef4735cd3	target/arm: Introduce PREDDESC field definitions SVE predicate operations cannot use the "usual" simd_desc encoding, because the lengths are not a multiple of 8. But we were abusing the SIMD_* fields to store values anyway. This abuse broke when SIMD_OPRSZ_BITS was modified in e2e7168a214. Introduce a new set of field definitions for exclusive use of predicates, so that it is obvious what kind of predicate we are manipulating. To be used in future patches Backports b64ee454a4a086ed459bcda4c0bbb54e197841e4	2021-03-04 15:08:32 -05:00
Rémi Denis-Courmont	9dfa469976	target/arm: refactor vae1_tlbmask() Backports bc944d3a8b305029196a5e1406702a92fa0b94cf	2021-03-04 15:05:54 -05:00
Rémi Denis-Courmont	8aeaff9385	target/arm: enable Secure EL2 in max CPU Backports 24179fea7e34c4952d4878ae1b26108ba65e5933	2021-03-04 15:04:43 -05:00
Rémi Denis-Courmont	e6d32dc2e0	target/arm: Implement SCR_EL2.EEL2 This adds handling for the SCR_EL3.EEL2 bit. Backports 926c1b97895879b78ca14bca2831c08740ed1c38	2021-03-04 15:03:08 -05:00
Rémi Denis-Courmont	9690ed8236	target/arm: revector to run-time pick target EL On ARMv8-A, accesses by 32-bit secure EL1 to monitor registers trap to the upper (64-bit) EL. With Secure EL2 support, we can no longer assume that that is always EL3, so make room for the value to be computed at run-time. Backports 6b340aeb48e4f7f983e1c38790de65ae93079840	2021-03-04 14:59:14 -05:00
Rémi Denis-Courmont	ce8872709f	target/arm: set HPFAR_EL2.NS on secure stage 2 faults Backport 9861248f637ecf11113b04b0b5c7b13c9aa06f09	2021-03-04 14:54:33 -05:00
Rémi Denis-Courmont	b49531cfef	target/arm: secure stage 2 translation regime b1a10c868f9b2b09e64009b43450e9a86697d9f3	2021-03-04 14:49:33 -05:00
Rémi Denis-Courmont	eeefc3c4a2	target/arm: generalize 2-stage page-walk condition The stage_1_mmu_idx() already effectively keeps track of which translation regimes have two stages. Don't hard-code another test. Backports 7879460a6149ed5e80c29cac85449191d9c5754a	2021-03-04 14:26:22 -05:00
Rémi Denis-Courmont	07ebb7f7ba	target/arm: translate NS bit in page-walks 588c6dd113b27b8db393c7264297b9d33261692e	2021-03-04 14:25:13 -05:00
Rémi Denis-Courmont	6f57520b1d	target/arm: do S1_ptw_translate() before address space lookup In the secure stage 2 translation regime, the VSTCR.SW and VTCR.NSW bits can invert the secure flag for pagetable walks. This patchset allows S1_ptw_translate() to change the non-secure bit. Backports 3d4bd397433b12b148d150c8bc5655a696389bd1	2021-03-04 14:23:43 -05:00
Rémi Denis-Courmont	ce50ba6d07	target/arm: handle VMID change in secure state The VTTBR write callback so far assumes that the underlying VM lies in non-secure state. This handles the secure state scenario. backports c4f060e89effd70ebdb23d3315495d33af377a09	2021-03-04 14:20:47 -05:00
Rémi Denis-Courmont	a78c31e36a	target/arm: add ARMv8.4-SEL2 system registers Backports e9152ee91cc39ed8a53d03607e6e980a7e9444e6	2021-03-04 14:20:10 -05:00
Rémi Denis-Courmont	edd5f021e6	target/arm: add MMU stage 1 for Secure EL2 This adds the MMU indices for EL2 stage 1 in secure state. To keep code contained, which is largelly identical between secure and non-secure modes, the MMU indices are reassigned. The new assignments provide a systematic pattern with a non-secure bit. Backports b6ad6062f1e55bd5b9407ce89e55e3a08b83827c	2021-03-04 14:16:31 -05:00
Rémi Denis-Courmont	fbdcef3ca5	target/arm: add 64-bit S-EL2 to EL exception table With the ARMv8.4-SEL2 extension, EL2 is a legal exception level in secure mode, though it can only be AArch64. This patch adds the target EL for exceptions from 64-bit S-EL2. It also fixes the target EL to EL2 when HCR.{A,F,I}MO are set in secure mode. Those values were never used in practice as the effective value of HCR was always 0 in secure mode. Backports 6c85f906261226e87211506bd9f787fd48a09f17	2021-03-04 14:00:23 -05:00
Rémi Denis-Courmont	159043008f	target/arm: Define isar_feature function to test for presence of SEL2 Backports 5ca192dfc551c8a40871c4e30a8b8ceb879adc31	2021-03-04 13:58:57 -05:00
Rémi Denis-Courmont	b42e6d6036	target/arm: factor MDCR_EL2 common handling This adds a common helper to compute the effective value of MDCR_EL2. That is the actual value if EL2 is enabled in the current security context, or 0 elsewise. Backports 59dd089cf9e4a9cddee596c8a1378620df51b9bb	2021-03-04 13:57:34 -05:00
Rémi Denis-Courmont	b657bfc59b	target/arm: use arm_hcr_el2_eff() where applicable This will simplify accessing HCR conditionally in secure state. Backports e04a5752cb03e066d7b1e583e340c7982fcd5e4e	2021-03-04 13:53:30 -05:00
Rémi Denis-Courmont	58af3e76e6	target/arm: use arm_is_el2_enabled() where applicable Do not assume that EL2 is available in and only in non-secure context. That equivalence is broken by ARMv8.4-SEL2. Backports e6ef0169264b00cce552404f689ce137018ff290	2021-03-04 13:49:19 -05:00
Rémi Denis-Courmont	7a694223ca	target/arm: add arm_is_el2_enabled() helper This checks if EL2 is enabled (meaning EL2 registers take effects) in the current security context. Backports f3ee5160ce3c03795a28e16d1a0b4916a6c959f4	2021-03-04 13:44:04 -05:00
Rémi Denis-Courmont	7402645436	target/arm: remove redundant tests In this context, the HCR value is the effective value, and thus is zero in secure mode. The tests for HCR.{F,I}MO are sufficient. Backports cc974d5cd84ea60a3dad59752aea712f3d47f8ce	2021-03-04 13:42:12 -05:00
Richard Henderson	f6973abb3e	target/arm: Add cpu properties to control pauth The crypto overhead of emulating pauth can be significant for some workloads. Add two boolean properties that allows the feature to be turned off, on with the architected algorithm, or on with an implementation defined algorithm. We need two intermediate booleans to control the state while parsing properties lest we clobber ID_AA64ISAR1 into an invalid intermediate state. Backports relevent members from eb94284d0812b4e7c11c5d075b584100ac1c1b9a	2021-03-04 13:40:27 -05:00
Richard Henderson	0332498752	target/arm: Implement an IMPDEF pauth algorithm Without hardware acceleration, a cryptographically strong algorithm is too expensive for pauth_computepac. Even with hardware accel, we are not currently expecting to link the linux-user binaries to any crypto libraries, and doing so would generally make the --static build fail. So choose XXH64 as a reasonably quick and decent hash. Backports 283fc52ade85eb50141f3b8b85f82b07d016cb17	2021-03-04 13:38:22 -05:00
Peter Maydell	68f645dd4f	target/arm: Don't decode insns in the XScale/iWMMXt space as cp insns In commit cd8be50e58f63413c0 we converted the A32 coprocessor insns to decodetree. This accidentally broke XScale/iWMMXt insns, because it moved the handling of "cp insns which are handled by looking up the cp register in the hashtable" from after the call to the legacy disas_xscale_insn() decode to before it, with the result that all XScale/iWMMXt insns now UNDEF. Update valid_cp() so that it knows that on XScale cp 0 and 1 are not standard coprocessor instructions; this will cause the decodetree trans_ functions to ignore them, so that execution will correctly get through to the legacy decode again. Backports e4d51ac6921dc861bfb3d20e4c7dcf345840a9da	2021-03-03 20:17:20 -05:00
Leif Lindholm	09fd12e5f2	target/arm: add aarch32 ID register fields to cpu.h Add entries present in ARM DDI 0487F.c (August 2020). Backports bd78b6be24f3ceb71f1a7ec2c98c7a5e49cb4a86	2021-03-03 20:16:26 -05:00
Leif Lindholm	a2faae9e30	target/arm: add aarch64 ID register fields to cpu.h Add entries present in ARM DDI 0487F.c (August 2020). Backports 00a92832f453275ca023962c00a60dde3a4f2fed	2021-03-03 20:15:16 -05:00
Leif Lindholm	ba891afd32	target/arm: add descriptions of CLIDR_EL1, CCSIDR_EL1, CTR_EL0 to cpu.h Backports 2a14526a6f56973348d622abc572db377f5a23ef	2021-03-03 20:14:05 -05:00
Leif Lindholm	fc8e5fe38d	target/arm: make ARMCPU.ctr 64-bit When FEAT_MTE is implemented, the AArch64 view of CTR_EL0 adds the TminLine field in bits [37:32]. Extend the ctr field to be able to hold this context. Backports a5fd319ae7f6d496ff5448ec1dedcae8e2f59e9f	2021-03-03 20:13:20 -05:00
Leif Lindholm	e6eb25f75a	target/arm: make ARMCPU.clidr 64-bit The AArch64 view of CLIDR_EL1 extends the ICB field to include also bit 32, as well as adding a Ttype<n> field when FEAT_MTE is implemented. Extend the clidr field to be able to hold this context. Backports f6450bcb6b2d3e4beae77141edce9e99cb8c277e	2021-03-03 20:12:48 -05:00
Leif Lindholm	3fff83e48f	target/arm: fix typo in cpu.h ID_AA64PFR1 field name SBSS -> SSBS Backports 9a286bcdfd2b04afca9a668a6d6e0feb809d2d63	2021-03-03 20:12:08 -05:00
Rémi Denis-Courmont	6f06f383ea	target/arm: enable Small Translation tables in max CPU Backports 078e9fe3cbd6894fb6e420d8b53f304a3d5c0464	2021-03-03 20:11:10 -05:00
Rémi Denis-Courmont	c7415c92d5	target/arm: ARMv8.4-TTST extension This adds for the Small Translation tables extension in AArch64 state. Backports c36c65ea3c35b309d524c05a1c05fdeabf83ddd5	2021-03-03 20:09:01 -05:00
Peter Maydell	f7939926dc	target/arm: Implement Cortex-M55 model Now that we have implemented all the features needed by the v8.1M architecture, we can add the model of the Cortex-M55. This is the configuration without MVE support; we'll add MVE later Backports 590e05d6b48937f6d3c631354fd706f8e005b8f6	2021-03-03 20:06:06 -05:00
Peter Maydell	e586a27a7b	target/arm: Implement FPCXT_NS fp system register Implement the v8.1M FPCXT_NS floating-point system register. This is a little more complicated than FPCXT_S, because it has specific handling for "current FP state is inactive", and it only wants to do PreserveFPState(), not the full set of actions done by ExecuteFPCheck() which vfp_access_check() implements. Backports eb20dafdbff92063a88624176fdc396e01961bf3	2021-03-03 20:02:36 -05:00
Peter Maydell	311b6fd74c	target/arm: Correct store of FPSCR value via FPCXT_S In commit 64f863baeedc8659 we implemented the v8.1M FPCXT_S register, but we got the write behaviour wrong. On read, this register reads bits [27:0] of FPSCR plus the CONTROL.SFPA bit. On write, it doesn't just write back those bits -- it writes a value to the whole FPSCR, whose upper 4 bits are zeroes. We also incorrectly implemented the write-to-FPSCR as a simple store to vfp.xregs; this skips the "update the softfloat flags" part of the vfp_set_fpscr helper so the value would read back correctly but not actually take effect. Fix both of these things by doing a complete write to the FPSCR using the helper function. Backports 7fbf95a037d79c5e923ffb51ac902dbe9599c87f	2021-03-03 19:57:56 -05:00
Richard Henderson	85b417d438	target/arm: Fix MTE0_ACTIVE In 50244cc76abc we updated mte_check_fail to match the ARM pseudocode, using the correct EL to select the TCF field. But we failed to update MTE0_ACTIVE the same way, which led to g_assert_not_reached(). Backports cc97b0019bb590b9b3c2a623e9ebee48831e0ce3	2021-03-03 19:56:23 -05:00
Peter Maydell	1a3abaa81a	target/i386: Check privilege level for protected mode 'int N' task gate When the 'int N' instruction is executed in protected mode, the pseudocode in the architecture manual specifies that we need to check: * vector number within IDT limits * selected IDT descriptor is a valid type (interrupt, trap or task gate) * if this was a software interrupt then gate DPL < CPL The way we had structured the code meant that the privilege check for software interrupts ended up not in the code path taken for task gate handling, because all of the task gate handling code was in the 'case 5' of the switch which was checking "is this descriptor a valid type". Move the task gate handling code out of that switch (so that it is now purely doing the "valid type?" check) and below the software interrupt privilege check. The effect of this missing check was that in a guest userspace binary executing 'int 8' would cause a guest kernel panic rather than the userspace binary being handed a SEGV. This is essentially the same bug fixed in VirtualBox in 2012: https://www.halfdog.net/Security/2012/VirtualBoxSoftwareInterrupt0x8GuestCrash/ Note that for QEMU this is not a security issue because it is only present when using TCG. Backports 3df1a3d070575419859cbbab1083fafa7ec2669a	2021-03-03 19:32:10 -05:00
zhaolichang	f526d4455c	m68k: fix some comment spelling errors I found that there are many spelling errors in the comments of qemu/target/m68k. I used spellcheck to check the spelling errors and found some errors in the folder. Backports ce00ff729ee8461dc94a1593d25ceda65d973d3c	2021-03-03 19:13:26 -05:00
Laurent Vivier	bf2c52bc83	target/m68k: remove useless qregs array They are unused since the target has been converted to TCG. Backports 4160d5e6bd347e5d27804912b61d02df0a90ba8e	2021-03-03 19:11:44 -05:00
Bin Meng	c59e391194	target/i386: seg_helper: Correct segment selector nullification in the RET/IRET helper Per the SDM, when returning to outer privilege level, for segment registers (ES, FS, GS, and DS) if the check fails, the segment selector becomes null, but QEMU clears the base/limit/flags as well as nullifying the segment selector, which should be a spec violation. Real hardware seems to be compliant with the spec, at least on one Coffee Lake board I tested. Backports c2ba0515f2df58a661fcb5d6485139877d92ab1b	2021-03-03 19:10:24 -05:00
Paolo Bonzini	1da5d669a7	target/i386: fix operand order for PDEP and PEXT For PDEP and PEXT, the mask is provided in the memory (mod+r/m) operand, and therefore is loaded in s->T0 by gen_ldst_modrm. The source is provided in the second source operand (VEX.vvvv) and therefore is loaded in s->T1. Fix the order in which they are passed to the helpers. Backports 75b208c28316095c4685e8596ceb9e3f656592e2	2021-03-03 19:09:21 -05:00
Peter Maydell	a9abb7c647	target/arm: Implement M-profile "minimal RAS implementation" For v8.1M the architecture mandates that CPUs must provide at least the "minimal RAS implementation" from the Reliability, Availability and Serviceability extension. This consists of: * an ESB instruction which is a NOP -- since it is in the HINT space we need only add a comment * an RFSR register which will RAZ/WI * a RAZ/WI AIRCR.IESB bit -- the code which handles writes to AIRCR does not allow setting of RES0 bits, so we already treat this as RAZ/WI; add a comment noting that this is deliberate * minimal implementation of the RAS register block at 0xe0005000 -- this will be in a subsequent commit * setting the ID_PFR0.RAS field to 0b0010 -- we will do this when we add the Cortex-M55 CPU model Backports 46f4976f22a4549322307b34272e053d38653243	2021-03-03 19:07:27 -05:00
Peter Maydell	543483444d	target/arm: Implement CCR_S.TRD behaviour for SG insns v8.1M introduces a new TRD flag in the CCR register, which enables checking for stack frame integrity signatures on SG instructions. Add the code in the SG insn implementation for the new behaviour. Backports 7f484147369080d36c411c4ba969f90d025aed55	2021-03-03 19:05:25 -05:00
Peter Maydell	7aa516aff2	target/arm: Implement new v8.1M VLLDM and VLSTM encodings v8.1M adds new encodings of VLLDM and VLSTM (where bit 7 is set). The only difference is that: * the old T1 encodings UNDEF if the implementation implements 32 Dregs (this is currently architecturally impossible for M-profile) * the new T2 encodings have the implementation-defined option to read from memory (discarding the data) or write UNKNOWN values to memory for the stack slots that would be D16-D31 We choose not to make those accesses, so for us the two instructions behave identically assuming they don't UNDEF. Backports fe6fa228a71f0eb8b8ee315452e6a7736c537b1f	2021-03-03 19:01:33 -05:00
Peter Maydell	f02045f5f5	target/arm: Implement new v8.1M NOCP check for exception return In v8.1M a new exception return check is added which may cause a NOCP UsageFault (see rule R_XLTP): before we clear s0..s15 and the FPSCR we must check whether access to CP10 from the Security state of the returning exception is disabled; if it is then we must take a fault. (Note that for our implementation CPPWR is always RAZ/WI and so can never cause CP10 accesses to fail.) The other v8.1M change to this register-clearing code is that if MVE is implemented VPR must also be cleared, so add a TODO comment to that effect. Backports 3423fbf10427db7680d3237d4f62d8370052fca0	2021-03-03 18:59:37 -05:00
Peter Maydell	05d479a8c0	target/arm: For v8.1M, always clear R0-R3, R12, APSR, EPSR on exception entry In v8.0M, on exception entry the registers R0-R3, R12, APSR and EPSR are zeroed for an exception taken to Non-secure state; for an exception taken to Secure state they become UNKNOWN, and we chose to leave them at their previous values. In v8.1M the behaviour is specified more tightly and these registers are always zeroed regardless of the security state that the exception targets (see rule R_KPZV). Implement this. Backports a59b1ed618415212c5f0f05abc1192e14ad5fdbb	2021-03-03 18:55:56 -05:00
Peter Maydell	94b36be626	target/arm: Implement FPCXT_S fp system register Implement the new-in-v8.1M FPCXT_S floating point system register. This is for saving and restoring the secure floating point context, and it reads and writes bits [27:0] from the FPSCR and the CONTROL.SFPA bit in bit [31]. Backports 64f863baeedc86590a608e2f1722dd8640aa9431	2021-03-03 18:53:23 -05:00
Peter Maydell	362379a9e1	target/arm: Factor out preserve-fp-state from full_vfp_access_check() Factor out the code which handles M-profile lazy FP state preservation from full_vfp_access_check(); accesses to the FPCXT_NS register are a special case which need to do just this part (corresponding in the pseudocode to the PreserveFPState() function), and not the full set of actions matching the pseudocode ExecuteFPCheck() which normal FP instructions need to do. Backports 96dfae686628fc14ba4f993824322b93395e221b	2021-03-03 18:48:47 -05:00
Peter Maydell	2de945ba4d	target/arm: Use new FPCR_NZCV_MASK constant We defined a constant name for the mask of NZCV bits in the FPCR/FPSCR in the previous commit; use it in a couple of places in existing code, where we're masking out everything except NZCV for the "load to Rt=15 sets CPSR.NZCV" special case. Backports 6a017acdf83e3bb6bd5e85289ca90b2ea3282b7e	2021-03-03 18:47:30 -05:00
Peter Maydell	2c6e54d1cd	target/arm: Implement M-profile FPSCR_nzcvqc v8.1M defines a new FP system register FPSCR_nzcvqc; this behaves like the existing FPSCR, except that it reads and writes only bits [31:27] of the FPSCR (the N, Z, C, V and QC flag bits). (Unlike the FPSCR, the special case for Rt=15 of writing the CPSR.NZCV is not permitted.) Implement the register. Since we don't yet implement MVE, we handle the QC bit as RES0, with todo comments for where we will need to add support later. Backports 9542c30bcf13c495400d63616dd8dfa825b04685	2021-03-03 18:45:38 -05:00
Peter Maydell	56532aa94c	target/arm: Implement VLDR/VSTR system register Implement the new-in-v8.1M VLDR/VSTR variants which directly read or write FP system registers to memory. Backports 0bf0dd4dcbd9fab324700ac6e0cd061cd043de0d	2021-03-03 18:42:05 -05:00
Peter Maydell	edae732810	target/arm: Move general-use constant expanders up in translate.c The constant-expander functions like negate, plus_2, etc, are generally useful; move them up in translate.c so we can use them in the VFP/Neon decoders as well as in the A32/T32/T16 decoders. Backports f7ed0c9433e7c5c157d2e6235eb5c8b93234a71a	2021-03-03 18:29:32 -05:00
Peter Maydell	a72c744370	target/arm: Refactor M-profile VMSR/VMRS handling Currently M-profile borrows the A-profile code for VMSR and VMRS (access to the FP system registers), because all it needs to support is the FPSCR. In v8.1M things become significantly more complicated in two ways: * there are several new FP system registers; some have side effects on read, and one (FPCXT_NS) needs to avoid the usual vfp_access_check() and the "only if FPU implemented" check * all sysregs are now accessible both by VMRS/VMSR (which reads/writes a general purpose register) and also by VLDR/VSTR (which reads/writes them directly to memory) Refactor the structure of how we handle VMSR/VMRS to cope with this: * keep the M-profile code entirely separate from the A-profile code * abstract out the "read or write the general purpose register" part of the code into a loadfn or storefn function pointer, so we can reuse it for VLDR/VSTR. Backports 32a290b8c3c2dc85cd88bd8983baf900d575cab	2021-03-03 18:13:17 -05:00
Peter Maydell	4eafe42d67	target/arm: Enforce M-profile VMRS/VMSR register restrictions For M-profile before v8.1M, the only valid register for VMSR/VMRS is the FPSCR. We have a comment that states this, but the actual logic to forbid accesses for any other register value is missing, so we would end up with A-profile style behaviour. Add the missing check. Backports ede97c9d71110821738a48f88ff9f10d6bec017f	2021-03-03 18:06:23 -05:00
Peter Maydell	2e3bd010a8	target/arm: Implement CLRM instruction In v8.1M the new CLRM instruction allows zeroing an arbitrary set of the general-purpose registers and APSR. Implement this. The encoding is a subset of the LDMIA T2 encoding, using what would be Rn=0b1111 (which UNDEFs for LDMIA). Backports 6e21a013fbdf54960a079dccc90772bb622e28e8	2021-03-03 18:00:28 -05:00
Peter Maydell	43d8441881	target/arm: Implement VSCCLRM insn Implement the v8.1M VSCCLRM insn, which zeros floating point registers if there is an active floating point context. This requires support in write_neon_element32() for the MO_32 element size, so add it. Because we want to use arm_gen_condlabel(), we need to move the definition of that function up in translate.c so it is before the #include of translate-vfp.c.inc. Backports 83ff3d6add965c9752324de11eac5687121ea826	2021-03-03 17:57:30 -05:00
Peter Maydell	952ebdc207	target/arm: Don't clobber ID_PFR1.Security on M-profile cores In arm_cpu_realizefn() we check whether the board code disabled EL3 via the has_el3 CPU object property, which we create if the CPU starts with the ARM_FEATURE_EL3 feature bit. If it is disabled, then we turn off ARM_FEATURE_EL3 and also zero out the relevant fields in the ID_PFR1 and ID_AA64PFR0 registers. This codepath was incorrectly being taken for M-profile CPUs, which do not have an EL3 and don't set ARM_FEATURE_EL3, but which may have the M-profile Security extension and so should have non-zero values in the ID_PFR1.Security field. Restrict the handling of the feature flag to A/R-profile cores. Backports 4018818840f499d0a478508aedbb6802c8eae928	2021-03-03 17:52:30 -05:00
Peter Maydell	cfefada296	target/arm: Implement v8.1M PXN extension In v8.1M the PXN architecture extension adds a new PXN bit to the MPU_RLAR registers, which forbids execution of code in the region from a privileged mode. This is another feature which is just in the generic "in v8.1M" set and has no ID register field indicating its presence. Backports cad8e2e3160dd10371552fce6cd8c6e171503e13	2021-03-03 17:50:26 -05:00
Rémi Denis-Courmont	d9592046ef	target/arm: fix stage 2 page-walks in 32-bit emulation Using a target unsigned long would limit the Input Address to a LPAE page-walk to 32 bits on AArch32 and 64 bits on AArch64. This is okay for stage 1 or on AArch64, but it is insufficient for stage 2 on AArch32. In that later case, the Input Address can have up to 40 bits. Backports commit 98e8779770c40901ed585745aacc9a8e2b934a28	2021-03-02 13:37:02 -05:00
Chetan Pant	3e25486110	x86 tcg cpus: Fix Lesser GPL version number There is no "version 2" of the "Lesser" General Public License. It is either "GPL version 2.0" or "Lesser GPL version 2.1". This patch replaces all occurrences of "Lesser GPL version 2" with "Lesser GPL version 2.1" in comment section. Backport d9ff33ada7f32ca59f99b270a2d0eb223b3c9c8f	2021-03-02 13:33:10 -05:00
Chetan Pant	c7f6786089	arm tcg cpus: Fix Lesser GPL version number There is no "version 2" of the "Lesser" General Public License. It is either "GPL version 2.0" or "Lesser GPL version 2.1". This patch replaces all occurrences of "Lesser GPL version 2" with "Lesser GPL version 2.1" in comment section. Backports 50f57e09fda4b7ffbc5ba62aad6cebf660824023	2021-03-02 13:30:35 -05:00
Peter Maydell	f991d945d3	target/arm/translate-neon.c: Handle VTBL UNDEF case before VFP access check Checks for UNDEF cases should go before the "is VFP enabled?" access check, except in special cases. Move a stray UNDEF check in the VTBL trans function up above the access check. Backports b6c56c8a9a4064ea783f352f43c5df6231a110fa	2021-03-02 13:24:51 -05:00
Richard Henderson	9623047097	target/arm: Fix neon VTBL/VTBX for len > 1 The helper function did not get updated when we reorganized the vector register file for SVE. Since then, the neon dregs are non-sequential and cannot be simply indexed. At the same time, make the helper function operate on 64-bit quantities so that we do not have to call it twice. Backports 604cef3e57eaeeef77074d78f6cf2eca1be11c62	2021-03-02 13:23:13 -05:00
Xinhao Zhang	b3f63b72a2	target/arm: add space before the open parenthesis '(' Fix code style. Space required before the open parenthesis '('. Backports 7f350a87e3a85e8a260ce4b133d549a7b2789213	2021-03-02 13:17:48 -05:00
Xinhao Zhang	71d4aced5d	target/arm: Don't use '#' flag of printf format Fix code style. Don't use '#' flag of printf format ('%#') in format strings, use '0x' prefix instead Backports 6eb55edbabb9eed1e4c7dfb233e7d738e8b5fa89	2021-03-02 13:16:09 -05:00
Xinhao Zhang	492fbc4d2c	target/arm: add spaces around operator Fix code style. Operator needs spaces both sides. Backports bdc3b6f570e8bd219aa6a24a149b35a691e6986c	2021-03-02 13:15:12 -05:00
Peter Maydell	e528c8229e	target/arm: Get correct MMU index for other-security-state In arm_v7m_mmu_idx_for_secstate() we get the 'priv' level to pass to armv7m_mmu_idx_for_secstate_and_priv() by calling arm_current_el(). This is incorrect when the security state being queried is not the current one, because arm_current_el() uses the current security state to determine which of the banked CONTROL.nPRIV bits to look at. The effect was that if (for instance) Secure state was in privileged mode but Non-Secure was not then we would return the wrong MMU index. The only places where we are using this function in a way that could trigger this bug are for the stack loads during a v8M function-return and for the instruction fetch of a v8M SG insn. Fix the bug by expanding out the M-profile version of the arm_current_el() logic inline so it can use the passed in secstate rather than env->v7m.secure. Backports 7142eb9e24b4aa5118cd67038057f15694d782aa	2021-03-02 13:08:44 -05:00
Rémi Denis-Courmont	a4053565d6	target/arm: fix LORID_EL1 access check Secure mode is not exempted from checking SCR_EL3.TLOR, and in the future HCR_EL2.TLOR when S-EL2 is enabled. Backports 9bd268bae5c4760870522292fb1d46e7da7e372a	2021-03-02 13:06:50 -05:00
Rémi Denis-Courmont	df4413edc7	target/arm: fix handling of HCR.FB HCR should be applied when NS is set, not when it is cleared. Backports 373e7ffde9bae90a20fb5db21b053f23091689f4	2021-03-02 13:05:01 -05:00
Peter Maydell	6b8096d9fc	target/arm: Fix VUDOT/VSDOT (scalar) on big-endian hosts The helper functions for performing the udot/sdot operations against a scalar were not using an address-swizzling macro when converting the index of the scalar element into a pointer into the vm array. This had no effect on little-endian hosts but meant we generated incorrect results on big-endian hosts. For these insns, the index is indexing over group of 4 8-bit values, so 32 bits per indexed entity, and H4() is therefore what we want. (For Neon the only possible input indexes are 0 and 1.) Backports d1a9254be5cc93afb15be19f7543da6ff4806256	2021-03-02 13:03:51 -05:00
Peter Maydell	5c6730a432	target/arm: Fix float16 pairwise Neon ops on big-endian hosts In the neon_padd/pmax/pmin helpers for float16, a cut-and-paste error meant we were using the H4() address swizzler macro rather than the H2() which is required for 2-byte data. This had no effect on little-endian hosts but meant we put the result data into the destination Dreg in the wrong order on big-endian hosts. Backports 552714c0812a10e5cff239bd29928e5fcb8d8b3b	2021-03-02 13:02:31 -05:00
Richard Henderson	d473f66177	target/arm: Improve do_prewiden_3d We can use proper widening loads to extend 32-bit inputs, and skip the "widenfn" step. Backports 8aab18a2c5209e4e48998a61fbc2d89f374331ed	2021-03-02 13:00:25 -05:00
Richard Henderson	9263117d47	target/arm: Simplify do_long_3d and do_2scalar_long In both cases, we can sink the write-back and perform the accumulate into the normal destination temps Backports 9f1a5f93c2dd345dc6c8fe86ed14bf1485056f6e	2021-03-02 12:46:53 -05:00
Richard Henderson	07c2b70234	target/arm: Rename neon_load_reg64 to vfp_load_reg64 The only uses of this function are for loading VFP double-precision values, and nothing to do with NEON. Backports b38b96ca90827012ab8eb045c1337cea83a54c4b	2021-03-02 12:43:25 -05:00
Richard Henderson	9d87b62578	target/arm: Add read/write_neon_element64 Replace all uses of neon_load/store_reg64 within translate-neon.c.inc. Backports 0aa8e700a53b0aa7275ed747b8fa3acb61d35f2d	2021-03-02 12:40:33 -05:00
Richard Henderson	89b1f62878	target/arm: Rename neon_load_reg32 to vfp_load_reg32 The only uses of this function are for loading VFP single-precision values, and nothing to do with NEON. Backports 21c1c0e50b73c580c6bfc8f2314d1b6a14793561	2021-03-02 12:30:20 -05:00
Richard Henderson	011d9ab061	target/arm: Expand read/write_neon_element32 to all MemOp We can then use this to improve VMOV (scalar to gp) and VMOV (gp to scalar) so that we simply perform the memory operation that we wanted, rather than inserting or extracting from a 32-bit quantity. These were the last uses of neon_load/store_reg, so remove them. Backports 4d5fa5a80ac28f34b8497be1e85371272413a12e	2021-03-02 12:26:41 -05:00
Richard Henderson	d21316d639	target/arm: Add read/write_neon_element32 Model these off the aa64 read/write_vec_element functions. Use it within translate-neon.c.inc. The new functions do not allocate or free temps, so this rearranges the calling code a bit. Backports a712266f5d5a36d04b22fe69fa15592d62bed019	2021-03-02 12:18:31 -05:00
Richard Henderson	e390c1ec7f	target/arm: Use neon_element_offset in vfp_reg_offset This seems a bit more readable than using offsetof CPU_DoubleU. Backports d8719785fde2f5041986853a314c05c6f567d3cb	2021-03-02 11:55:49 -05:00
Richard Henderson	c1ca9e53da	target/arm: Use neon_element_offset in neon_load/store_reg These are the only users of neon_reg_offset, so remove that. Backports 0f2cdc82276a723ee58562b56b9d537a4bd7bfef	2021-03-02 11:54:56 -05:00
Richard Henderson	1b09d0d96f	target/arm: Move neon_element_offset to translate.c This will shortly have users outside of translate-neon.c.inc. Backports 7ec85c02833f4264840c6ed78b749443a7b4ffe0	2021-03-02 11:52:59 -05:00
Richard Henderson	8a20537e7f	target/arm: Introduce neon_full_reg_offset This function makes it clear that we're talking about the whole register, and not the 32-bit piece at index 0. This fixes a bug when running on a big-endian host. Backports 015ee81a4c06b644969f621fd9965cc6372b879e	2021-03-02 11:50:36 -05:00
Peter Maydell	2f0940677e	target/arm: Implement FPSCR.LTPSIZE for M-profile LOB extension If the M-profile low-overhead-branch extension is implemented, FPSCR bits [18:16] are a new field LTPSIZE. If MVE is not implemented (currently always true for us) then this field always reads as 4 and ignores writes. These bits used to be the vector-length field for the old short-vector extension, so we need to take care that they are not misinterpreted as setting vec_len. We do this with a rearrangement of the vfp_set_fpscr() code that deals with vec_len, vec_stride and also the QC bit; this obviates the need for the M-profile only masking step that we used to have at the start of the function. We provide a new field in CPUState for LTPSIZE, even though this will always be 4, in preparation for MVE, so we don't have to come back later and split it out of the vfp.xregs[FPSCR] value. (This state struct field will be saved and restored as part of the FPSCR value via the vmstate_fpscr in machine.c.) Backports 8128c8e8cc9489a8387c74075974f86dc0222e7f	2021-03-01 20:36:02 -05:00
Peter Maydell	8a6e118a17	target/arm: Allow M-profile CPUs with FP16 to set FPSCR.FP16 M-profile CPUs with half-precision floating point support should be able to write to FPSCR.FZ16, but an M-profile specific masking of the value at the top of vfp_set_fpscr() currently prevents that. This is not yet an active bug because we have no M-profile FP16 CPUs, but needs to be fixed before we can add any. The bits that the masking is effectively preventing from being set are the A-profile only short-vector Len and Stride fields, plus the Neon QC bit. Rearrange the order of the function so that those fields are handled earlier and only under a suitable guard; this allows us to drop the M-profile specific masking, making FZ16 writeable. This change also makes the QC bit correctly RAZ/WI for older no-Neon A-profile cores. This refactoring also paves the way for the low-overhead-branch LTPSIZE field, which uses some of the bits that are used for A-profile Stride and Len. Backports commit d31e2ce68d56f5bcc83831497e5fe4b8a7e18e85	2021-03-01 20:33:22 -05:00
Peter Maydell	3ae5543825	target/arm: Implement v8.1M low-overhead-loop instructions v8.1M's "low-overhead-loop" extension has three instructions for looping: * DLS (start of a do-loop) * WLS (start of a while-loop) * LE (end of a loop) The loop-start instructions are both simple operations to start a loop whose iteration count (if any) is in LR. The loop-end instruction handles "decrement iteration count and jump back to loop start"; it also caches the information about the branch back to the start of the loop to improve performance of the branch on subsequent iterations. As with the branch-future instructions, the architecture permits an implementation to discard the LO_BRANCH_INFO cache at any time, and QEMU takes the IMPDEF option to never set it in the first place (equivalent to discarding it immediately), because for us a "real" implementation would be unnecessary complexity. (This implementation only provides the simple looping constructs; the vector extension MVE (Helium) adds some extra variants to handle looping across vectors. We'll add those later when we implement MVE.) Backports commit b7226369721896ab9ef71544e4fe95b40710e05a	2021-03-01 20:29:04 -05:00
Peter Maydell	be197f9857	target/arm: Implement v8.1M branch-future insns (as NOPs) v8.1M implements a new 'branch future' feature, which is a set of instructions that request the CPU to perform a branch "in the future", when it reaches a particular execution address. In hardware, the expected implementation is that the information about the branch location and destination is cached and then acted upon when execution reaches the specified address. However the architecture permits an implementation to discard this cached information at any point, and so guest code must always include a normal branch insn at the branch point as a fallback. In particular, an implementation is specifically permitted to treat all BF insns as NOPs (which is equivalent to discarding the cached information immediately). For QEMU, implementing this caching of branch information would be complicated and would not improve the speed of execution at all, so we make the IMPDEF choice to implement all BF insns as NOPs. Backports commit 05903f036edba8e3ed940cc215b8e27fb49265b9	2021-03-01 20:25:15 -05:00
Peter Maydell	966246d991	target/arm: Don't allow BLX imm for M-profile The BLX immediate insn in the Thumb encoding always performs a switch from Thumb to Arm state. This would be totally useless in M-profile which has no Arm decoder, and so the instruction does not exist at all there. Make the encoding UNDEF for M-profile. (This part of the encoding space is used for the branch-future and low-overhead-loop insns in v8.1M.) Backports 920f04fa3ea789f8f85a52cee5395b8887b56cf7	2021-03-01 20:23:59 -05:00

1 2 3 4 5 ...

2624 commits