unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-24 02:05:40 +00:00

Author	SHA1	Message	Date
Richard Henderson	451683ee79	target/arm: Vectorize SABA/UABA Include 64-bit element size in preparation for SVE2. Backports commit cfdb2c0c95ae9205b0dd7f0f5e970cdec50fef20 from qemu	2020-05-15 22:15:14 -04:00
Richard Henderson	98c79f9afc	target/arm: Vectorize SABD/UABD Include 64-bit element size in preparation for SVE2. Backports commit 50c160d44eb059c7fc7f348ae2c3b0cb41437044 from qemu	2020-05-15 22:01:29 -04:00
Richard Henderson	efdcad70b1	target/arm: Remove fp_status from helper_{recpe, rsqrte}_u32 These operations do not touch fp_status. Backports commit fe6fb4beb2f9bb0afc813e565504b66a92bbf04b from qemu	2020-05-15 21:32:03 -04:00
Richard Henderson	6190be3191	target/arm: Create gen_gvec_{sri,sli} The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 893ab0542aa385a287cbe46d5535c8b9e95ce699 from qemu	2020-05-15 20:39:28 -04:00
Richard Henderson	2609e6f319	target/arm: Create gen_gvec_{u,s}{rshr,rsra} Create vectorized versions of handle_shri_with_rndacc for shift+round and shift+round+accumulate. Add out-of-line helpers in preparation for longer vector lengths from SVE. Backports commit 6ccd48d4ea244c1c46a24dfa50bfb547f11422dd from qemu	2020-05-15 20:28:44 -04:00
Richard Henderson	5d7c46204d	target/arm: Create gen_gvec_[us]sra The functions eliminate duplication of the special cases for this operation. They match up with the GVecGen2iFn typedef. Add out-of-line helpers. We got away with only having inline expanders because the neon vector size is only 16 bytes, and we know that the inline expansion will always succeed. When we reuse this for SVE, tcg-gvec-op may decide to use an out-of-line helper due to longer vector lengths. Backports commit 631e565450c483e0622eec3d8b61d7fa41d16bca from qemu	2020-05-15 20:10:32 -04:00
Richard Henderson	b26b4c06cd	target/arm: Vectorize integer comparison vs zero These instructions are often used in glibc's string routines. They were the final uses of the 32-bit at a time neon helpers. Backports commit 6b375d3546b009d1e63e07397ec9c6af256e15e9 from qemu	2020-04-30 21:29:17 -04:00
Richard Henderson	3cb68bc44e	target/arm: Move helper_dc_zva to helper-a64.c This is an aarch64-only function. Move it out of the shared file. This patch is code movement only. Backports commit 7b182eb2467af6c47c9c77c64bbbeed8ed53c330 from qemu	2020-04-30 06:12:26 -04:00
Richard Henderson	fcce8d4aa1	target/arm: Convert PMULL.8 to gvec We still need two different helpers, since NEON and SVE2 get the inputs from different locations within the source vector. However, we can convert both to the same internal form for computation. The sve2 helper is not used yet, but adding it with this patch helps illustrate why the neon changes are helpful. Backports commit e7e96fc5ec8c79dc77fef522d5226ac09f684ba5 from qemu	2020-03-21 19:35:46 -04:00
Richard Henderson	c00f72f74f	target/arm: Convert PMULL.64 to gvec The gvec form will be needed for implementing SVE2. Backports commit b9ed510e46f2f9e31e5e8adb4661d5d1cbe9a459 from qemu	2020-03-21 19:27:38 -04:00
Richard Henderson	db8a935b44	target/arm: Convert PMUL.8 to gvec The gvec form will be needed for implementing SVE2. Extend the implementation to operate on uint64_t instead of uint32_t. Use a counted inner loop instead of terminating when op1 goes to zero, looking toward the required implementation for ARMv8.4-DIT. Backports commit a21bb78e5817be3f494922e1dadd6455fe5d6318 from qemu	2020-03-21 19:22:18 -04:00
Richard Henderson	d3139f2f0a	target/arm: Vectorize USHL and SSHL These instructions shift left or right depending on the sign of the input, and 7 bits are significant to the shift. This requires several masks and selects in addition to the actual shifts to form the complete answer. That said, the operation is still a small improvement even for two 64-bit elements -- 13 vector operations instead of 2 * 7 integer operations. Backports commit 87b74e8b6edd287ea2160caa0ebea725fa8f1ca1 from qemu	2020-03-21 19:14:17 -04:00
Marc Zyngier	868de52f69	target/arm: Handle trapping to EL2 of AArch32 VMRS instructions HCR_EL2.TID3 requires that AArch32 reads of MVFR[012] are trapped to EL2, and HCR_EL2.TID0 does the same for reads of FPSID. In order to handle this, introduce a new TCG helper function that checks for these control bits before executing the VMRS instruction. Tested with a hacked-up version of KVM/arm64 that sets the control bits for 32bit guests. Backports commit 9ca1d776cb49c09b09579d9edd0447542970c834 from qemu	2020-01-07 18:04:16 -05:00
Richard Henderson	3d3d56056b	target/arm: Remove helper_double_saturate Replace x = double_saturate(y) with x = add_saturate(y, y). There is no need for a separate more specialized helper. Backports commit 640581a06d14e2d0d3c3ba79b916de6bc43578b0 from qemu	2019-11-18 20:13:21 -05:00
Richard Henderson	552e48f14e	target/arm: Use tcg_gen_abs_i64 and tcg_gen_gvec_abs Backports commit 4e027a710673f5d4dc6cff88728bcfd32e4c47b0 from qemu	2019-05-16 16:43:02 -04:00
Peter Maydell	77ae3982b4	target/arm: Implement VLLDM for v7M CPUs with an FPU Implement the VLLDM instruction for v7M for the FPU present case. Backports commit 956fe143b4f254356496a0a1c479fa632376dfec from qemu	2019-04-30 11:27:54 -04:00
Peter Maydell	b483951046	target/arm: Implement VLSTM for v7M CPUs with an FPU Implement the VLSTM instruction for v7M for the FPU present case. Backports commit 019076b036da4444494de38388218040d9d3a26c from qemu	2019-04-30 11:25:44 -04:00
Peter Maydell	a976d7642a	target/arm: Implement M-profile lazy FP state preservation The M-profile architecture floating point system supports lazy FP state preservation, where FP registers are not pushed to the stack when an exception occurs but are instead only saved if and when the first FP instruction in the exception handler is executed. Implement this in QEMU, corresponding to the check of LSPACT in the pseudocode ExecuteFPCheck(). Backports commit e33cf0f8d8c9998a7616684f9d6aa0d181b88803 from qemu	2019-04-30 11:21:50 -04:00
Richard Henderson	f116560d2c	target/arm: Implement ARMv8.5-FRINT Backports 6bea25631af92531027d3bf3ef972a4d51d62e7c from qemu.	2019-03-05 23:17:33 -05:00
Richard Henderson	45c297c99b	target/arm: Add set/clear_pstate_bits, share gen_ss_advance We do not need an out-of-line helper for manipulating bits in pstate. While changing things, share the implementation of gen_ss_advance. Backports commit 22ac3c49641f6eed93dca5b852030b4d3eacf6c4 from qemu	2019-03-05 22:55:22 -05:00
Richard Henderson	60742608f5	target/arm: Split helper_msr_i_pstate into 3 The EL0+UMA check is unique to DAIF. While SPSel had avoided the check by nature of already checking EL >= 1, the other post v8.0 extensions to MSR (imm) allow EL0 and do not require UMA. Avoid the unconditional write to pc and use raise_exception_ra to unwind. Backports commit ff730e9666a716b669ac4a8ca7c521177d1d2b15 from qemu	2019-03-05 22:45:11 -05:00
Richard Henderson	5473c3603f	target/arm: Add helpers for FMLAL Note that float16_to_float32 rightly squashes SNaN to QNaN. But of course pickNaNMulAdd, for ARM, selects SNaNs first. So we have to preserve SNaN long enough for the correct NaN to be selected. Thus float16_to_float32_by_bits. Backports commit a4e943a716d5fac923d82df3eabc65d1e3624019 from qemu	2019-02-28 15:31:48 -05:00
Richard Henderson	c9ad233678	target/arm: Implement ARMv8.3-JSConv Backports commit 6c1f6f2733a7692793135ea5ce72b829add99a50 from qemu	2019-02-22 19:08:57 -05:00
Richard Henderson	f3cb92c86c	target/arm: Use vector operations for saturation For same-sign saturation, we have tcg vector operations. We can compute the QC bit by comparing the saturated value against the unsaturated value. Backports commit 89e68b575e138d0af1435f11a8ffcd8779c237bd from qemu	2019-02-15 18:14:09 -05:00
Richard Henderson	ed7c9d0710	target/arm: Remove neon min/max helpers These are now unused. Backports commit a5c5dc53c4688efc149b235361d2d49869e77139 from qemu	2019-02-15 17:57:18 -05:00
Richard Henderson	0c6f58ebc6	target/arm: Move helper_exception_return to helper-a64.c This function is only used by AArch64. Code movement only. Backports commit ce02fd99e6d53df6f3cf5eca85bcac403b402510 from qemu	2019-01-22 15:44:53 -05:00
Peter Maydell	ca5d7b8fd2	target/arm: Add v8M stack checks on ADD/SUB/MOV of SP Add code to insert calls to a helper function to do the stack limit checking when we handle these forms of instruction that write to SP: * ADD (SP plus immediate) * ADD (SP plus register) * SUB (SP minus immediate) * SUB (SP minus register) * MOV (register) Backports commit 5520318939fea5d659bf808157cd726cb967b761 from qemu	2018-10-08 14:15:15 -04:00
Richard Henderson	d343f8ac0f	target/arm: Implement SVE dot product (indexed) Backports commit 16fcfdc7325649b187ac489f3ae0b0d2a20b6230 from qemu	2018-07-03 04:42:41 -04:00
Richard Henderson	2f6d555473	target/arm: Implement SVE dot product (vectors) Backports commit d730ecaae77ac696515207a5ef99509240fc792b from qemu	2018-07-03 04:35:25 -04:00
Richard Henderson	a9160a0e08	target/arm: Implement SVE floating-point convert to integer Backports commit df4de1affc440d6f2cdaeea329b90c0b88ece5a1 from qemu	2018-07-03 04:02:58 -04:00
Richard Henderson	942f3c835e	target/arm: Implement SVE Floating Point Unary Operations - Unpredicated Group Backports commit 3887c0388d39930ab419d4ae6e8ca5ea67a74ad5 from qemu	2018-07-03 03:44:40 -04:00
Richard Henderson	cf3c7824ff	target/arm: Implement SVE Floating Point Multiply Indexed Group Backports commit ca40a6e6e390eb1cad7ade881dc7c622793f9324 from qemu	2018-07-03 03:35:49 -04:00
Richard Henderson	d81cc5f5cd	target/arm: Implement SVE Floating Point Arithmetic - Unpredicated Group Backports commit 29b80469dc51ae4064e9ef9223967882d2610523 from qemu	2018-06-15 14:10:16 -04:00
Lioncash	1eaa2e4571	target/arm: Implement SVE predicate test	2018-05-20 01:16:16 -04:00
Alex Bennée	40d57900bf	target/arm: convert conversion helpers to fpst/ahp_flag Instead of passing env and leaving it up to the helper to get the right fpstatus we pass it explicitly. There was already a get_fpstatus helper for neon for the 32 bit code. We also add an get_ahp_flag() for passing the state of the alternative FP16 format flag. This leaves scope for later tracking the AHP state in translation flags. Backports commit 486624fcd3eaca6165ab8401d73bbae6c0fb81c1 from qemu	2018-05-19 22:58:25 -04:00
Richard Henderson	8436080518	target/arm: Implement FCVT (scalar, integer) for fp16 Backports commit 564a0632504fad840491aa9a59453f4e64a316c4 from qemu	2018-05-15 22:06:49 -04:00
Richard Henderson	67740bbc7f	target/arm: Fix float16 to/from int16 The instruction "ucvtf v0.4h, v0.4h, #2", with input 0x8000u, overflows the intermediate float16 to infinity before we have a chance to scale the output. Use float64 as the intermediate type so that no input argument (uint32_t in this case) can overflow or round before scaling. Given the declared argument, the signed int32_t function has the same problem. When converting from float16 to integer, using u/int32_t instead of u/int16_t means that the bounding is incorrect. Backports commit 88808a022c06f98d81cd3f2d105a5734c5614839 from qemu	2018-05-14 08:41:20 -04:00
Peter Maydell	7a3ee5fd95	target/arm: Honour MDCR_EL2.TDE when routing exceptions due to BKPT/BRK The MDCR_EL2.TDE bit allows the exception level targeted by debug exceptions to be set to EL2 for code executing at EL0. We handle this in the arm_debug_target_el() function, but this is only used for hardware breakpoint and watchpoint exceptions, not for the exception generated when the guest executes an AArch32 BKPT or AArch64 BRK instruction. We don't have enough information for a translate-time equivalent of arm_debug_target_el(), so instead make BKPT and BRK call a special purpose helper which can do the routing, rather than the generic exception_with_syndrome helper. Backports commit c900a2e62dd6dde11c8f5249b638caad05bb15be from qemu	2018-03-25 16:33:04 -04:00
Richard Henderson	abd86b2287	target/arm: Decode aa64 armv8.3 fcmla Backports commit d17b7cdcf4ea3e858ceee8b86fc8544bb71561e6 from qemu Also remember to commit vec_helper.	2018-03-09 01:05:02 -05:00
Richard Henderson	4b39a36416	target/arm: Decode aa64 armv8.3 fcadd Backports commit 1695cd61b08d4376c11e0658836c4f08b4fc3aa1 from qemu	2018-03-09 00:58:37 -05:00
Lioncash	12fd2cc113	target/arm: Decode aa64 armv8.1 three same extra	2018-03-09 00:10:09 -05:00
Richard Henderson	4f585f71fb	target/arm: Decode aa64 armv8.1 scalar three same extra Backports commit d9061ec3d27eb940402a7eafee3fb77ce1146ad4 from qemu	2018-03-09 00:02:23 -05:00
Alex Bennée	068143595e	arm/helper.c: re-factor rsqrte and add rsqrte_f16 Much like recpe, the ARM ARM has simplified the pseudo code for the calculation, which is done with fixed point 9 bit integer maths. So while adding f16 we can also clean this up to be a little less heavy on the floating point and just return the fractional part and leave the callers to do the final packing of the result. Backports commit d719cbc7641991d16b891ffbbfc3a16a04e37b9a from qemu Also removes a load of symbols that seem unnecessary from the header_gen script	2018-03-08 22:42:04 -05:00
Alex Bennée	5f3864c2c2	arm/helper.c: re-factor recpe and add recpe_f16 It looks like the ARM ARM has simplified the pseudo code for the calculation, which is done with fixed point 9 bit integer maths. So while adding f16 we can also clean this up to be a little less heavy on the floating point and just return the fractional part and leave the callers to do the final packing of the result. Backports commit 5eb70735af1c0b607bf2671a53aff3710cc1672f from qemu	2018-03-08 19:05:48 -05:00
Alex Bennée	7161c1ed52	arm/translate-a64: add FP16 SCVTF/UCVTF to simd_two_reg_misc_fp16	2018-03-08 18:48:25 -05:00
Alex Bennée	27d8d01566	target/arm/helper: pass explicit fpst to set_rmode As the rounding mode is now split between FP16 and the rest of floating point we need to be explicit when tweaking it. Instead of passing the CPU env we now pass the appropriate fpst pointer directly. Backports commit 9b04991686785e18b18a36d193b68f08f7c91648 from qemu	2018-03-08 12:41:54 -05:00
Ard Biesheuvel	85e6d710e4	target/arm: implement SM4 instructions This implements emulation of the new SM4 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit b6577bcd251ca0d57ae1de149e3c706b38f21587 from qemu	2018-03-07 08:57:53 -05:00
Ard Biesheuvel	78d15a9cd0	target/arm: implement SM3 instructions This implements emulation of the new SM3 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 80d6f4c6bbb718f343a832df8dee15329cc7686c from qemu	2018-03-07 08:53:47 -05:00
Ard Biesheuvel	0ef74f6d6d	target/arm: implement SHA-512 instructions This implements emulation of the new SHA-512 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 90b827d131812d7f0a8abb13dba1942a2bcee821 from qemu	2018-03-07 08:39:49 -05:00
Richard Henderson	404fa33c4b	target/arm: Use pointers in neon tbl helper Rather than passing a regno to the helper, pass pointers to the vector register directly. This eliminates the need to pass in the environment pointer and reduces the number of places that directly access env->vfp.regs[]. Backports commit e7c06c4e4c98c47899417f154df1f2ef4e8d09a0 from qemu	2018-03-06 10:20:21 -05:00

57 commits
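
Commit 3d3d56056b above replaces x = double_saturate(y) with x = add_saturate(y, y). The following is a minimal sketch of why the two are interchangeable, not the code from the commit: the function names reuse the ones in the commit message, but the signatures (a plain bool out-parameter standing in for the saturation flag) are assumptions made for illustration and do not match the real helpers.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for a saturating signed 32-bit add.
 * Sets *sat to true when the result had to be clamped.
 * (Assumed signature; the real helper records saturation in CPU state.)
 */
static int32_t add_saturate(int32_t a, int32_t b, bool *sat)
{
    /* Widen to 64 bits so the exact sum is always representable. */
    int64_t wide = (int64_t)a + (int64_t)b;

    if (wide > INT32_MAX) {
        *sat = true;
        return INT32_MAX;
    }
    if (wide < INT32_MIN) {
        *sat = true;
        return INT32_MIN;
    }
    return (int32_t)wide;
}

/*
 * Hypothetical stand-in for the specialized doubling helper.
 * Doubling is the same as adding a value to itself, so the clamping
 * behaviour is identical to add_saturate(y, y, sat).
 */
static int32_t double_saturate(int32_t y, bool *sat)
{
    int64_t wide = (int64_t)y * 2;

    if (wide > INT32_MAX) {
        *sat = true;
        return INT32_MAX;
    }
    if (wide < INT32_MIN) {
        *sat = true;
        return INT32_MIN;
    }
    return (int32_t)wide;
}
```

Under these assumptions, both functions return the same value and set the flag identically for every input, which is why a separate doubling helper adds nothing over the general saturating add.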