unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-23 05:25:31 +00:00

Author	SHA1	Message	Date
Richard Henderson	456fb66617	tcg: Fix generation of dupi_vec for 32-bit host The definition of INDEX_op_dupi_vec is that it operates on units of tcg_target_ulong -- in this case 32 bits. It does not work to use this for a uint64_t value that happens to be small enough to fit in tcg_target_ulong. Backports a5b30d950c42b14bc9da24d1e68add6538d23336	2021-03-01 19:45:30 -05:00
Richard Henderson	578673be68	tcg/i386: Fix dupi for avx2 32-bit hosts The previous change wrongly stated that 32-bit avx2 should have used VPBROADCASTW. But that's a 16-bit broadcast and we want a 32-bit broadcast. Backports f80d09b599a5e0fd7f44653f23b04104cb703f7a	2021-03-01 19:44:09 -05:00
Richard Henderson	50b3632ab4	tcg: Remove TCGOpDef.used The last user of this field disappeared in f69d277ece4.	2021-03-01 19:43:37 -05:00
Richard Henderson	7813c57f9e	tcg: Move some TCG_CT_* bits to TCGArgConstraint bitfields These are easier to set and test when they have their own fields. Reduce the size of alias_index and sort_index to 4 bits, which is sufficient for TCG_MAX_OP_ARGS. This leaves only the bits indicating constants within the ct field. Move all initialization to allocation time, rather than init individual fields in process_op_defs. Backports bc2b17e6ea582ef3ade2bdca750de269c674c915	2021-03-01 19:41:34 -05:00
Richard Henderson	71a34d84e5	tcg: Remove TCG_CT_REG This wasn't actually used for anything, really. All variable operands must accept registers, and which are indicated by the set in TCGArgConstraint.regs. Backports commit 74a117906b87ff9220e4baae5a7431d6f4eadd45	2021-03-01 19:38:00 -05:00
Richard Henderson	ae075d324d	tcg: Move sorted_args into TCGArgConstraint.sort_index This uses an existing hole in the TCGArgConstraint structure and will be convenient for keeping the data in one place. Backports 66792f90f14fef18b25a168922877a367ecdca05	2021-03-01 19:33:45 -05:00
Richard Henderson	e3356f9bad	tcg: Drop union from TCGArgConstraint The union is unused; let "regs" appear in the main structure without the "u.regs" wrapping. Backports 9be0d08019465b38e2f1a605960961a491430c21	2021-03-01 19:29:19 -05:00
Richard Henderson	1551f6be9d	tcg: Adjust simd_desc size encoding With larger vector sizes, it turns out oprsz == maxsz, and we only need to represent mismatch for oprsz <= 32. We do, however, need to represent larger oprsz and do so without reducing SIMD_DATA_BITS. Reduce the size of the oprsz field and increase the maxsz field. Steal the oprsz value of 24 to indicate equality with maxsz. Backports e2e7168a214b0ed98dc357bba96816486a289762	2021-03-01 19:23:37 -05:00
Richard Henderson	567fa21c65	target/arm: Fix SVE splice While converting to gen_gvec_ool_zzzp, we lost passing a->esz as the data argument to the function. Backports commit dd701fafe55a78e655d4823d29226d92250a6b56	2021-03-01 19:20:44 -05:00
Richard Henderson	ccb293911f	target/arm: Fix sve ldr/str The mte update missed a bit when producing clean addresses. Fixes: b2aa8879b88 Backports d8227b098301935ea8e0e032e7d41e5dc3e97590	2021-03-01 19:20:04 -05:00
Peter Maydell	79feec40df	target/arm: Make isar_feature_aa32_fp16_arith() handle M-profile The M-profile definition of the MVFR1 ID register differs slightly from the A-profile one, and in particular the check for "does the CPU support fp16 arithmetic" is not the same. We don't currently implement any M-profile CPUs with fp16 arithmetic, so this is not yet a visible bug, but correcting the logic now disarms this beartrap for when we eventually do. Backports commit dfc523a84b06b6a4b583ed4c29d24fd980dd37a0	2021-03-01 19:17:23 -05:00
Peter Maydell	09a7d6381e	target/arm: Move id_pfr0, id_pfr1 into ARMISARegisters Move the id_pfr0 and id_pfr1 fields into the ARMISARegisters sub-struct. We're going to want id_pfr1 for an isar_features check, and moving both at the same time avoids an odd inconsistency. Changes other than the ones to cpu.h and kvm64.c made automatically with: perl -p -i -e 's/cpu->id_pfr/cpu->isar.id_pfr/' target/arm/*.c hw/intc/armv7m_nvic.c Backports commit 8a130a7be6e222965641e1fd9469fd3ee752c7d4	2021-03-01 19:15:10 -05:00
Peter Maydell	ed92f3c42b	target/arm: Replace ARM_FEATURE_PXN with ID_MMFR0.VMSA check The ARM_FEATURE_PXN bit indicates whether the CPU supports the PXN bit in short-descriptor translation table format descriptors. This is indicated by ID_MMFR0.VMSA being at least 0b0100. Replace the feature bit with an ID register check, in line with our preference for ID register checks over feature bits. Backports commit 0ae0326b984e77a55c224b7863071bd3d8951231	2021-03-01 19:06:15 -05:00
Xiaoyao Li	d9d68cc128	i386/cpu: Clear FEAT_XSAVE_COMP_{LO,HI} when XSAVE is not available Per Intel SDM vol 1, 13.2, if CPUID.1:ECX.XSAVE[bit 26] is 0, the processor provides no further enumeration through CPUID function 0DH. QEMU does not do this for "-cpu host,-xsave". Backports 19ca8285fcd61a8f60f2f44f789a561e0958e8e6	2021-03-01 19:04:03 -05:00
Richard Henderson	5e6196ea6b	target/riscv: Set instance_align on RISCVCPU TypeInfo Fix alignment of CPURISCVState.vreg. Backports 5de5b99b3101a1648ed583193db8d92eea0c4545	2021-03-01 19:00:27 -05:00
Richard Henderson	cdf40f7ff6	target/arm: Set instance_align on CPUARM TypeInfo Fix alignment of CPUARMState.vfp.zregs. Backports d03087bda4ba17076b430fd2af083020d7c5112a	2021-03-01 18:58:44 -05:00
Richard Henderson	86dd30850d	qom: Allow objects to be allocated with increased alignment It turns out that some hosts have a default malloc alignment less than that required for vectors. We assume that, with compiler annotation on CPUArchState, that we can properly align the vector portion of the guest state. Fix the alignment of the allocation by using qemu_memalloc when required.	2021-03-01 18:32:51 -05:00
Eduardo Habkost	6baafeafd4	qom: Correct object_class_dynamic_cast_assert() documentation object_class_dynamic_cast_assert() is not used by INTERFACE_CHECK, remove misleading mention of that function in the documentation.	2021-03-01 18:29:34 -05:00
Aaron Lindsay	97702da7ad	target/arm: Count PMU events when MDCR.SPME is set This check was backwards when introduced in commit 033614c47de78409ad3fb39bb7bd1483b71c6789: target/arm: Filter cycle counter based on PMCCFILTR_EL0 Backports commit db1f3afb17269cf2bd86c222e1bced748487ef71	2021-03-01 18:25:25 -05:00
Peter Maydell	16ad0d93d9	target/arm: Convert VCMLA, VCADD size field to MO_* in decode The VCMLA and VCADD insns have a size field which is 0 for fp16 and 1 for fp32 (note that this is the reverse of the Neon 3-same encoding!). Convert it to MO_* values in decode for consistency. Backports d186a4854c04e9832907b0b4240a47731da20993	2021-03-01 18:23:34 -05:00
Peter Maydell	61abec1908	target/arm: Convert Neon VCVT fp size field to MO_* in decode Convert the insns using the 2reg_vcvt and 2reg_vcvt_f16 formats to pass the size through to the trans function as a MO_* value rather than the '0==f32, 1==f16' used in the fp 3-same encodings. Backports commit 0ae715c658a02af1834b63563c56112a6d8842cb	2021-03-01 18:20:11 -05:00
Peter Maydell	524b54bc7b	target/arm: Convert Neon 3-same-fp size field to MO_* in decode In the Neon instructions, some instruction formats have a 2-bit size field which corresponds exactly to QEMU's MO_8/16/32/64. However the floating-point insns in the 3-same group have a 1-bit size field which is "0 for 32-bit float and 1 for 16-bit float". Currently we pass these values directly through to trans_ functions, which means that when reading a particular trans_ function you need to know if that insn uses a 2-bit size or a 1-bit size. Move the handling of the 1-bit size to the decodetree file, so that all these insns consistently pass a size to the trans_ function which is an MO_8/16/32/64 value. In this commit we switch over the insns using the 3same_fp and 3same_fp_q0 formats. Backports commit 6cf0f240e0b980a877abed12d2995f740eae6515	2021-03-01 18:15:18 -05:00
Richard Henderson	cd79d2a915	tcg: Implement 256-bit dup for tcg_gen_gvec_dup_mem We already support duplication of 128-bit blocks. This extends that support to 256-bit blocks. This will be needed by SVE2. Backports commit fe4b0b5bfa96c38ad1cad0689a86cca9f307e353	2021-03-01 18:10:07 -05:00
Richard Henderson	b478ce5052	tcg: Eliminate one store for in-place 128-bit dup_mem Do not store back to the exact memory from which we just loaded. Backports 6a17646176e011ddc463a2870a64c7aaccfe9c50	2021-03-01 18:06:17 -05:00
Stephen Long	c9dc750058	tcg: Fix tcg gen for vectorized absolute value The fallback inline expansion for vectorized absolute value, when the host doesn't support such an insn was flawed. E.g. when a vector of bytes has all elements negative, mask will be 0xffff_ffff_ffff_ffff. Subtracting mask only adds 1 to the low element instead of all elements becase -mask is 1 and not 0x0101_0101_0101_0101. Backports commit e7e8f33fb603c3bfa0479d7d924f2ad676a84317	2021-03-01 18:04:46 -05:00
Eduardo Habkost	cefb1666c0	arm: Fix typo in AARCH64_CPU_GET_CLASS definition There's a typo in the type name of AARCH64_CPU_GET_CLASS. This was never detected because the macro is not used by any code. Backports 37e3d65043229bb20bd07af74dc0866e12071415	2021-03-01 18:03:29 -05:00
Peter Maydell	ff74ede2fd	target/arm: Enable FP16 in '-cpu max' Set the MVFR1 ID register FPHP and SIMDHP fields to indicate that our "-cpu max" has v8.2-FP16. Backports commit 5f07817eb94542e39a419baafa3026b15e8d33f7	2021-03-01 18:00:13 -05:00
Peter Maydell	b948636c4a	target/arm: Implement fp16 for Neon VMUL, VMLA, VMLS Convert the Neon floating-point VMUL, VMLA and VMLS to use gvec, and use this to implement fp16 support. Backports fc8ae790311882afa3c7816df004daf978c40e9a	2021-03-01 17:57:36 -05:00
Peter Maydell	8c6affbca4	target/arm/vec_helper: Add gvec fp indexed multiply-and-add operations Add gvec helpers for doing Neon-style indexed non-fused fp multiply-and-accumulate operations. Backports commit c50d8d144098a8261233ca31b47e3bc487e112fe	2021-03-01 17:52:31 -05:00
Peter Maydell	3cc3099e36	target/arm/vec_helper: Handle oprsz less than 16 bytes in indexed operations In the gvec helper functions for indexed operations, for AArch32 Neon the oprsz (total size of the vector) can be less than 16 bytes if the operation is on a D reg. Since the inner loop in these helpers always goes from 0 to segment, we must clamp it based on oprsz to avoid processing a full 16 byte segment when asked to handle an 8 byte wide vector. Backports commit d7ce81e553e6789bf27657105b32575668d60b1c	2021-03-01 17:48:42 -05:00
Peter Maydell	681218b4ab	target/arm: Implement fp16 for Neon VRINTX Convert the Neon VRINTX insn to use gvec, and use this to implement fp16 support for it. Backports 23afcdd2511f2a3dc05bed650d27bd25cf9b2a3c	2021-03-01 17:47:25 -05:00
Peter Maydell	53aba9d900	target/arm: Implement fp16 for Neon VRINT-with-specified-rounding-mode Convert the Neon VRINT-with-specified-rounding-mode insns to gvec, and use this to implement the fp16 versions. Backports 18725916b1438b54d6d6533980833d2251a20b7c	2021-03-01 17:44:49 -05:00
Peter Maydell	eb4054d04f	target/arm: Implement fp16 for Neon VCVT with rounding modes Convert the Neon VCVT with-specified-rounding-mode instructions to gvec, and use this to implement fp16 support for them. Backports ca88a6efdf4ce96b646a896059f9bd324c2cebc4	2021-03-01 17:40:36 -05:00
Peter Maydell	56fe927d40	target/arm: Implement fp16 for Neon VCVT fixed-point Implement fp16 for the Neon VCVT insns which convert between float and fixed-point. Backports 24018cf3990b692b51e50183c5fbd98d17b3fa40	2021-03-01 17:36:43 -05:00
Peter Maydell	948b01ad01	target/arm: Convert Neon VCVT fixed-point to gvec Convert the Neon VCVT float<->fixed-point insns to a gvec style, in preparation for adding fp16 support. Backports 7b959c5890deb9a6d71bc6800006a0eae0a84c60	2021-03-01 17:33:20 -05:00
Peter Maydell	c324c6817e	target/arm: Implement fp16 for Neon float-integer VCVT Convert the Neon float-integer VCVT insns to gvec, and use this to implement fp16 support for them. Note that unlike the VFP int<->fp16 VCVT insns we converted earlier and which convert to/from a 32-bit integer, these Neon insns convert to/from 16-bit integers. So we can use the existing vfp conversion helpers for the f32<->u32/i32 case but need to provide our own for f16<->u16/i16. Backports 7782a9afec81d1efe23572135c1ed777691ccde5	2021-03-01 17:29:02 -05:00
Peter Maydell	82f4a7e135	target/arm: Implement fp16 for Neon pairwise fp ops Convert the Neon pairwise fp ops to use a single gvic-style helper to do the full operation instead of one helper call for each 32-bit part. This allows us to use the same framework to implement the fp16. Backports 1dc587ee9bfe804406eb3e0bacf47a80644d8abc	2021-03-01 17:25:19 -05:00
Peter Maydell	b08ea84374	target/arm: Implement fp16 for Neon VRSQRTS Convert the Neon VRSQRTS insn to using a gvec helper, and use this to implement the fp16 case. As with VRECPS, we adjust the phrasing of the new implementation slightly so that the fp32 version parallels the fp16 one. Backports 40fde72dda2da8d55b820fa6c5efd85814be2023	2021-03-01 17:20:22 -05:00
Peter Maydell	f4ebbba9fd	target/arm: Implement fp16 for Neon VRECPS Convert the Neon VRECPS insn to using a gvec helper, and use this to implement the fp16 case. The phrasing of the new float32_recps_nf() is slightly different from the old recps_f32() so that it parallels the f16 version; for f16 we can't assume that flush-to-zero is always enabled. Backports ac8c62c4e5a3f24e6d47f52ec1bfb20994caefa5	2021-03-01 17:09:16 -05:00
Peter Maydell	5776c594e4	target/arm: Implement fp16 for Neon fp compare-vs-0 Convert the neon floating-point vector compare-vs-0 insns VCEQ0, VCGT0, VCLE0, VCGE0 and VCLT0 to use a gvec helper, and use this to implement the fp16 case. Backport 635187aaa92f21ab001e2868e803b3c5460261ca	2021-03-01 17:05:03 -05:00
Peter Maydell	8de258c3cb	target/arm: Implement fp16 for Neon VFMA, VMFS Convert the neon floating-point vector operations VFMA and VFMS to use a gvec helper, and use this to implement the fp16 case. This is the last use of do_3same_fp() so we can now delete that function. Backports commit cf722d75b329ef3f86b869e7e68cbfb1607b3bde	2021-03-01 17:00:49 -05:00
Peter Maydell	587c3549b7	target/arm: Implement fp16 for Neon VMLA, VMLS operations Convert the Neon floating-point VMLA and VMLS insns over to using a gvec helper, and use this to implement the fp16 case. Backports e5adc70665ecaf4009c2fb8d66775ea718a85abd	2021-03-01 16:57:20 -05:00
Peter Maydell	0068d12355	target/arm: Implement fp16 for Neon VMAXNM, VMINNM Convert the Neon floating point VMAXNM and VMINNM insns to using a gvec helper and use this to implement the fp16 case. Backports e22705bb941d82d6c2a09e8b2031084326902be3	2021-03-01 16:53:57 -05:00
Peter Maydell	465cfb54c4	target/arm: Implement fp16 for Neon VMAX, VMIN Convert the Neon float-point VMAX and VMIN insns over to using a gvec helper, and use this to implement the fp16 case. Backport e43268c54b6cbcb197d179409df7126e81f8cd52	2021-03-01 16:50:23 -05:00
Peter Maydell	6dd4a8e93f	target/arm: Implement fp16 for VACGE, VACGT Convert the neon floating-point vector absolute comparison ops VACGE and VACGT over to using a gvec hepler and use this to implement the fp16 case. Backports bb2741da186ebaebc7d5189372be4401e1ff9972	2021-03-01 16:47:44 -05:00
Peter Maydell	4eb39f1b2f	target/arm: Implement fp16 for VCEQ, VCGE, VCGT comparisons Convert the Neon floating-point vector comparison ops VCEQ, VCGE and VCGT over to using a gvec helper and use this to implement the fp16 case. (We put the float16_ceq() etc functions above the DO_2OP() macro definition because later when we convert the compare-against-zero instructions we'll want their definitions to be visible at that point in the source file.) Backports ad505db233b89b7fd4b5a98b6f0e8ac8d05b11db	2021-03-01 16:44:34 -05:00
Peter Maydell	0e8fd4cd0c	target/arm: Implement fp16 for Neon VABS, VNEG of floats Rewrite Neon VABS/VNEG of floats to use gvec logical AND and XOR, so that we can implement the fp16 version of the insns. Backport 2b70d8cd09f5450c15788acd24f6f8bc4116c395	2021-03-01 16:40:33 -05:00
Peter Maydell	6c71951d54	target/arm: Implement fp16 for Neon VRECPE, VRSQRTE using gvec We already have gvec helpers for floating point VRECPE and VRQSRTE, so convert the Neon decoder to use them and add the fp16 support. Backports 4a15d9a3b39d4d161d7e03dfcf52e9f214eef0b8	2021-03-01 16:35:04 -05:00
Peter Maydell	4850377f01	target/arm: Implement FP16 for Neon VADD, VSUB, VABD, VMUL Implement FP16 support for the Neon insns which use the DO_3S_FP_GVEC macro: VADD, VSUB, VABD, VMUL. For VABD this requires us to implement a new gvec_fabd_h helper using the machinery we have already for the other helpers. Backport e4a6d4a69e239becfd83bdcd996476e7b8e1138d	2021-03-01 16:31:54 -05:00
Peter Maydell	08b70267d0	target/arm: Implement VFP fp16 VMOV between gp and halfprec registers Implement the VFP fp16 variant of VMOV that transfers a 16-bit value between a general purpose register and a VFP register. Note that Rt == 15 is UNPREDICTABLE; since this insn is v8 and later only we have no need to replicate the old "updates CPSR.NZCV" behaviour that the singleprec version of this insn does Backports commit 46a4b854525cb9f34a611f6ada6cdff1eab0ac2d	2021-03-01 16:26:34 -05:00

... 5 6 7 8 9 ...

7307 commits