unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-02-02 15:51:00 +00:00

Author	SHA1	Message	Date
Richard Henderson	742301a7c1	tcg: Fix integral argument type to tcg_gen_rot[rl]i_i{32,64} For the benefit of compatibility of function pointer types, we have standardized on int32_t and int64_t as the integral argument to tcg expanders. We converted most of them in 474b2e8f0f7, but missed the rotates. Backports commit 07dada0336a83002dfa8673a9220a88e13d9a45c from qemu	2020-05-07 10:41:01 -04:00
Richard Henderson	0bcd0ca93d	tcg: Add load_dest parameter to GVecGen2 We have this same parameter for GVecGen2i, GVecGen3, and GVecGen3i. This will make some SVE2 insns easier to parameterize. Backports commit ac09ae627e9a2c65c8a452b69c3dac33c29d0719 from qemu	2020-05-07 10:35:47 -04:00
Richard Henderson	f02f71f38f	tcg: Improve vector tail clearing Better handling of non-power-of-2 tails as seen with Arm 8-byte vector operations. Backports commit f47db80cc073c0a7a22136c8296b5eca20c0e199 from qemu	2020-05-07 10:24:00 -04:00
Richard Henderson	549b0ec3c5	tcg: Add tcg_gen_gvec_dup_tl For use when a target needs to pass a configure-specific target_ulong value to duplicate. Backports commit 0f039e3ad9131966d9fe509c231b756868b015e2 from qemu	2020-05-07 10:12:09 -04:00
Richard Henderson	e65806c356	tcg: Remove tcg_gen_gvec_dup{8,16,32,64}i These interfaces are now unused. Backports commit 398f21412aeec158338963e3f71c9313bc126a71 form qemu	2020-05-07 10:11:00 -04:00
Richard Henderson	43a72b0540	tcg: Use tcg_gen_gvec_dup_imm in logical simplifications Replace the outgoing interface. Backports commit 03ddb6f315ca6d02dfdba0aecc43aa97c728c428 from qemu	2020-05-07 10:09:53 -04:00
Richard Henderson	07f622e57d	tcg: Add tcg_gen_gvec_dup_imm Add a version of tcg_gen_dup_* that takes both immediate and a vector element size operand. This will replace the set of tcg_gen_gvec_dup{8,16,32,64}i functions that encode the element size within the function name. Backports commit 44c94677febd15488f9190b11eaa4a08e8ac696b from qemu	2020-05-07 09:55:25 -04:00
lixinyu	7c32c5b0a4	tcg/mips: mips sync* encode error OPC_SYNC_WMB, OPC_SYNC_MB, OPC_SYNC_ACQUIRE, OPC_SYNC_RELEASE and OPC_SYNC_RMB have wrong encode. According to the mips manual, their encode should be 'OPC_SYNC \| 0x?? << 6' rather than 'OPC_SYNC \| 0x?? << 5'. Wrong encode can lead illegal instruction errors. These instructions often appear with multi-threaded simulation. Fixes: 6f0b99104a3 ("tcg/mips: Add support for fence") Backports commit a4e57084c16d5b0eff3651693fba04f26b30b551 from qemu	2020-04-30 07:24:57 -04:00
Richard Henderson	299ba4e867	tcg/i386: Fix INDEX_op_dup2_vec We were only constructing the 64-bit element, and not replicating the 64-bit element across the rest of the vector. Backports commit e20cb81d9c5a3d0f9c08f3642728a210a1c162c9 from qemu	2020-04-30 07:15:08 -04:00
Richard Henderson	b358f771f6	tcg/i386: Bound shift count expanding sari_vec A given RISU testcase for SVE can produce tcg-op-vec.c:511: do_shifti: Assertion `i >= 0 && i < (8 << vece)' failed. because expand_vec_sari gave a shift count of 32 to a MO_32 vector shift. In 44f1441dbe1, we changed from direct expansion of vector opcodes to re-use of the tcg expanders. So while the comment correctly notes that the hw will handle such a shift count, we now have to take our own sanity checks into account. Which is easy in this particular case. Fixes: 44f1441dbe1 Backports commit 312b426fea4d6dd322d7472c80010a8ba7a166d2 from qemu	2020-04-30 06:26:42 -04:00
Richard Henderson	12b4e01d9c	tcg: Add tcg_gen_gvec_5_ptr Extend the vector generator infrastructure to handle 5 vector arguments. Backports commit 2445971604c1cfd3ec484457159f4ac300fb04d2 from qemu	2020-03-21 16:54:01 -04:00
Tony Nguyen	f75368cd0f	tcg: TCGMemOp is now accelerator independent MemOp Preparation for collapsing the two byte swaps, adjust_endianness and handle_bswap, along the I/O path. Target dependant attributes are conditionalized upon NEED_CPU_H. Backports commit 14776ab5a12972ea439c7fb2203a4c15a09094b4 from qemu	2019-11-28 03:01:12 -05:00
Emilio G. Cota	ca15f620b0	tcg/README: fix typo s/afterwise/afterwards/ Afterwise is "wise after the fact", as in "hindsight". Here we meant "afterwards" (as in "subsequently"). Fix it.	2019-11-28 02:37:51 -05:00
tony.nguyen@bt.com	b4c2c94602	configure: Define target access alignment in configure This patch moves the define of target access alignment earlier from target/foo/cpu.h to configure. Suggested in Richard Henderson's reply to "[PATCH 1/4] tcg: TCGMemOp is now accelerator independent MemOp" Backports commit 52bf9771fdfce98e98cea36a17a18915be6f6b7f from qemu	2019-11-18 21:41:35 -05:00
Richard Henderson	d291a311ee	tcg/aarch64: Fix output of extract2 opcodes This patch fixes two problems: (1) The inputs to the EXTR insn were reversed, (2) The input constraints use rZ, which means that we need to use the REG0 macro in order to supply XZR for a constant 0 input. Fixes: 464c2969d5d Backports commit 1789d4274b851fb8fdf4a947ce5474c63e813d0d from qemu	2019-08-08 19:25:37 -04:00
Richard Henderson	b2d75f4955	tcg: Fix constant folding of INDEX_op_extract2_i32 On a 64-bit host, discard any replications of the 32-bit sign bit when performing the shift and merge. Backports commit 80f4d7c3ae216c191fb403e149bcba88d6aa40bb from qemu	2019-08-08 19:25:01 -04:00
Richard Henderson	955661ad7b	tcg: Fix expansion of INDEX_op_not_vec This operation can always be emitted, even if we need to fall back to xor. Adjust the assertions to match. Backports commit 11978f6f58f1d3d66429f7ff897524f693d823ce from qemu	2019-08-08 19:23:03 -04:00
Richard Henderson	d7ea41c3a3	cpu: Move icount_decr to CPUNegativeOffsetState Amusingly, we had already ignored the comment to keep this value at the end of CPUState. This restores the minimum negative offset from TCG_AREG0 for code generation. For the couple of uses within qom/cpu.c, without NEED_CPU_H, add a pointer from the CPUState object to the IcountDecr object within CPUNegativeOffsetState. Backports commit 5e1401969b25f676fee6b1c564441759cf967a43 from qemu	2019-06-13 15:34:28 -04:00
Richard Henderson	ac176ccb38	cpu: Move ENV_OFFSET to exec/gen-icount.h Now that we have ArchCPU, we can define this generically, in the one place that needs it. Backports commit 677c4d69ac21961e76a386f9bfc892a44923acc0 from qemu	2019-06-12 12:20:21 -04:00
Markus Armbruster	5e5197b136	Supply missing header guards Backports applicable parts of commit f91005e195e7e1485e60cb121731589960f1a3c9 from qemu	2019-06-12 10:59:10 -04:00
Richard Henderson	67f0af4282	tcg/aarch64: Allow immediates for vector ORR and BIC The allows immediates to be used for ORR and BIC, as well as the trivial inversions, ORC and AND. Backports commit 9e27f58b9902834dffc0d66d9eb62f78d9c2a632 from qemu	2019-05-24 18:47:07 -04:00
Richard Henderson	5ecfba4fe6	tcg/aarch64: Build vector immediates with two insns Use MOVI+ORR or MVNI+BIC in order to build some vector constants, as opposed to dropping them to the constant pool. This includes all 16-bit constants and a similar set of 32-bit constants. Backports commit 02f3a5b4744885258758d07ebe09cf965de78bcf from qemu	2019-05-24 18:43:54 -04:00
Richard Henderson	06058ef648	tcg/aarch64: Use MVNI in tcg_out_dupi_vec The compliment of a subset of immediates can be computed with a single instruction. Backports commit 7e308e003e5b6ddd3130e09711e1d33693230696 from qemu	2019-05-24 18:42:40 -04:00
Richard Henderson	c18ec586dc	tcg/aarch64: Split up is_fimm There are several sub-classes of vector immediate, and only MOVI can use them all. This will enable usage of MVNI and ORRI, which use progressively fewer sub-classes. This patch adds no new functionality, merely splits the function and moves part of the logic into tcg_out_dupi_vec. Backports commit 984fdcee342473dfe797897758929dad654693c8 from qemu	2019-05-24 18:41:37 -04:00
Richard Henderson	0ea4c05dc3	tcg/aarch64: Support vector bitwise select value The instruction set has 3 insns that perform the same operation, only varying in which operand must overlap the destination. We can represent the operation without overlap and choose based on the operands seen. Backports commit a9e434a5dc16f71ee156428619fc3c3765b68f26 from qemu	2019-05-24 18:38:37 -04:00
Richard Henderson	c79510378f	tcg/i386: Use umin/umax in expanding unsigned compare Using umin(a, b) == a as an expansion for TCG_COND_LEU is a better alternative to (a - INT_MIN) <= (b - INT_MIN). Backports commit ebcfb91abed8c0fb180a968b9004419c208dcc02 from qemu	2019-05-24 18:36:32 -04:00
Richard Henderson	ffdbc1a233	tcg/i386: Remove expansion for missing minmax This is now handled by code within tcg-op-vec.c. Backports commit 3ec3538a45f2fead475b0cca6945092c87927b4f from qemu	2019-05-24 18:34:44 -04:00
Richard Henderson	68cb096196	tcg/i386: Support vector comparison select value We already had backend support for this feature. Expand the new cmpsel opcode using vpblendb. The combination allows us to avoid an extra NOT for some comparison codes. Backports commit 904c5e19672778cc3349f4975437cfdf3371abb6 from qemu	2019-05-24 18:33:16 -04:00
Richard Henderson	a868533297	tcg: Add TCG_OPF_NOT_PRESENT if TCG_TARGET_HAS_foo is negative If INDEX_op_foo is always expanded by tcg_expand_vec_op, then there may be no reasonable set of constraints to return from tcg_target_op_def for that opcode. Let TCG_TARGET_HAS_foo be specified as -1 in that case. Thus a boolean test for TCG_TARGET_HAS_foo is true, but we will not assert within process_op_defs when no constraints are specified. Compare this with tcg_can_emit_vec_op, which already uses this tri-state indication. Backports commit 25c012b4009256505be3430480954a0233de343e from qemu	2019-05-24 18:28:11 -04:00
Richard Henderson	568da655c6	tcg: Expand vector minmax using cmp+cmpsel Provide a generic fallback for the min/max operations. Backports commit 72b4c792c7a576d9246207a8e9a940ed9e191722 from qemu	2019-05-24 18:26:53 -04:00
Richard Henderson	56d35e80aa	tcg: Introduce do_op3_nofail for vector expansion This makes do_op3 match do_op2 in allowing for failure, and thus fall back expansions. Backports commit 17f79944ebeace8bf43047a33b7775ba5ed9070e from qemu	2019-05-24 18:24:44 -04:00
Richard Henderson	2ea6dfbd63	tcg: Add support for vector compare select Perform a per-element conditional move. This combination operation is easier to implement on some host vector units than plain cmp+bitsel. Omit the usual gvec interface, as this is intended to be used by target-specific gvec expansion call-backs. Backports commit f75da2988eb2457fa23d006d573220c5c680ec4e from qemu	2019-05-24 18:21:13 -04:00
Richard Henderson	ca58be9cb4	tcg: Add support for vector bitwise select This operation performs d = (b & a) \| (c & ~a), and is present on a majority of host vector units. Include gvec expanders. Backports commit 38dc12947ec9106237f9cdbd428792c985cd86ae from qemu	2019-05-24 18:15:10 -04:00
Richard Henderson	fa363c3d6d	tcg: Fix missing checks and clears in tcg_gen_gvec_dup_mem The paths through tcg_gen_dup_mem_vec and through MO_128 were missing the check_size_align. The path through MO_128 was also missing the expand_clr. This last was not visible because the only user is ARM SVE, which would set oprsz == maxsz, and not require the clear. Fix by adding the check_size_align and using do_dup directly instead of duplicating the check in tcg_gen_gvec_dup_{i32,i64}. Backports commit 532ba368a13712724137228b5e7e9435994d25e1 from qemu	2019-05-24 18:07:28 -04:00
Richard Henderson	60cfe541b2	tcg/i386: Fix dupi/dupm for avx1 and 32-bit hosts The VBROADCASTSD instruction only allows %ymm registers as destination. Rather than forcing VEX.L and writing to the entire 256-bit register, revert to using MOVDDUP with an %xmm register. This is sufficient for an avx1 host since we do not support TCG_TYPE_V256 for that case. Also fix the 32-bit avx2, which should have used VPBROADCASTW. Fixes: 1e262b49b533 Backports commit 7b60ef3264e9627ac6efb34e9a6130647e9b55c0 from qemu	2019-05-24 18:04:08 -04:00
Lioncash	fcaa52c1fe	tcg: Synchronize with qemu Resolves any formatting discrepancies and bad merges that slipped through.	2019-05-16 18:11:08 -04:00
Richard Henderson	de260cfbd6	tcg/aarch64: Do not advertise minmax for MO_64 The min/max instructions are not available for 64-bit elements. Backports commit a7b6d286cfb5205b9f5330aefc5727269b3d810f from qemu	2019-05-16 16:44:34 -04:00
Richard Henderson	7c9b3a9021	tcg/aarch64: Support vector absolute value Backports commit a456394ae540f852cd0d10fd693fe9f33598dc01 from qemu	2019-05-16 16:39:14 -04:00
Richard Henderson	fd35490991	tcg/i386: Support vector absolute value Backports commit 18f9b65f1a4225dd314cb9b0a8dea968c5bc2ef3 from qemu	2019-05-16 16:37:33 -04:00
Richard Henderson	6d5e7856ff	tcg: Add support for vector absolute value Backports commit bcefc90208f8a1d6f619d61c2647281d92277015 from qemu	2019-05-16 16:33:43 -04:00
Richard Henderson	6d1730048d	tcg: Add support for integer absolute value Remove a function of the same name from target/arm/. Use a branchless implementation of abs gleaned from gcc. Backports commit ff1f11f7f8710a768f9313f24bd7f509d3db27e5 from qemu	2019-05-16 16:25:15 -04:00
Richard Henderson	18b3df6e4e	tcg/i386: Support vector scalar shift opcodes Backports commit 0a8d7a3bf5a149a82450eef555fd61728703dd84 from qemu	2019-05-16 16:19:44 -04:00
Richard Henderson	79b9dc559e	tcg: Add gvec expanders for vector shift by scalar Allow expansion either via shift by scalar or by replicating the scalar for shift by vector. Backports commit b4578cd91cda4cef1c413304353ca6dc5b957b60 from qemu	2019-05-16 16:17:58 -04:00
Richard Henderson	0217ee7b24	tcg/aarch64: Support vector variable shift opcodes Backports commit 79525dfd08262d8de10d271f17e5a4096ef96d16 from qemu	2019-05-16 15:58:54 -04:00
Richard Henderson	f793ec847d	tcg/i386: Support vector variable shift opcodes Backports commit a2ce146a06807fe1d1a81e878b8f249ff1e14038 from qemu	2019-05-16 15:53:33 -04:00
Richard Henderson	8c17687934	tcg: Add gvec expanders for variable shift The gvec expanders perform a modulo on the shift count. If the target requires alternate behaviour, then it cannot use the generic gvec expanders anyway, and will have to have its own custom code. Backports commit 5ee5c14cacda27e904cd6b0d9e7ffe1acff42838 from qemu	2019-05-16 15:51:09 -04:00
Richard Henderson	66e6bea084	tcg: Add INDEX_op_dupm_vec Allow the backend to expand dup from memory directly, instead of forcing the value into a temp first. This is especially important if integer/vector register moves do not exist. Note that officially tcg_out_dupm_vec is allowed to fail. If it did, we could fix this up relatively easily: VECE == 32/64: Load the value into a vector register, then dup. Both of these must work. VECE == 8/16: If the value happens to be at an offset such that an aligned load would place the desired value in the least significant end of the register, go ahead and load w/garbage in high bits. Load the value w/INDEX_op_ld{8,16}_i32. Attempt a move directly to vector reg, which may fail. Store the value into the backing store for OTS. Load the value into the vector reg w/TCG_TYPE_I32, which must work. Duplicate from the vector reg into itself, which must work. All of which is well and good, except that all supported hosts can support dupm for all vece, so all of the failure paths would be dead code and untestable. Backports commit 37ee55a081b7863ffab2151068dd1b2f11376914 from qemu	2019-05-16 15:38:02 -04:00
Richard Henderson	fd7a67e4a7	tcg/aarch64: Implement tcg_out_dupm_vec The LD1R instruction does all the work. Note that the only useful addressing mode is a base register with no offset. Backports commit f23e5e15edfd49d5dd72cab2ed2d85ac354b2eeb from qemu	2019-05-16 15:29:04 -04:00
Richard Henderson	a6fd4e2345	tcg/i386: Implement tcg_out_dupm_vec At the same time, improve tcg_out_dupi_vec wrt broadcast from the constant pool. Backports commit 1e262b49b5331441f697461e4305fe06719758a7 from qemu	2019-05-16 15:27:15 -04:00
Richard Henderson	d4e7c6a8c5	tcg: Add tcg_out_dupm_vec to the backend interface Currently stubbed out in all backends that support vectors. Backports commit d6ecb4a978b718dbe108a9fa9ecccc8b7f7cb579 from qemu	2019-05-16 15:24:48 -04:00

1 2 3 4 5 ...

588 commits