unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-09-10 06:07:04 +00:00

Author	SHA1	Message	Date
Richard Henderson	ffdbc1a233	tcg/i386: Remove expansion for missing minmax This is now handled by code within tcg-op-vec.c. Backports commit 3ec3538a45f2fead475b0cca6945092c87927b4f from qemu	2019-05-24 18:34:44 -04:00
Richard Henderson	68cb096196	tcg/i386: Support vector comparison select value We already had backend support for this feature. Expand the new cmpsel opcode using vpblendb. The combination allows us to avoid an extra NOT for some comparison codes. Backports commit 904c5e19672778cc3349f4975437cfdf3371abb6 from qemu	2019-05-24 18:33:16 -04:00
Richard Henderson	a868533297	tcg: Add TCG_OPF_NOT_PRESENT if TCG_TARGET_HAS_foo is negative If INDEX_op_foo is always expanded by tcg_expand_vec_op, then there may be no reasonable set of constraints to return from tcg_target_op_def for that opcode. Let TCG_TARGET_HAS_foo be specified as -1 in that case. Thus a boolean test for TCG_TARGET_HAS_foo is true, but we will not assert within process_op_defs when no constraints are specified. Compare this with tcg_can_emit_vec_op, which already uses this tri-state indication. Backports commit 25c012b4009256505be3430480954a0233de343e from qemu	2019-05-24 18:28:11 -04:00
Richard Henderson	568da655c6	tcg: Expand vector minmax using cmp+cmpsel Provide a generic fallback for the min/max operations. Backports commit 72b4c792c7a576d9246207a8e9a940ed9e191722 from qemu	2019-05-24 18:26:53 -04:00
Richard Henderson	56d35e80aa	tcg: Introduce do_op3_nofail for vector expansion This makes do_op3 match do_op2 in allowing for failure, and thus fall back expansions. Backports commit 17f79944ebeace8bf43047a33b7775ba5ed9070e from qemu	2019-05-24 18:24:44 -04:00
Richard Henderson	2ea6dfbd63	tcg: Add support for vector compare select Perform a per-element conditional move. This combination operation is easier to implement on some host vector units than plain cmp+bitsel. Omit the usual gvec interface, as this is intended to be used by target-specific gvec expansion call-backs. Backports commit f75da2988eb2457fa23d006d573220c5c680ec4e from qemu	2019-05-24 18:21:13 -04:00
Richard Henderson	ca58be9cb4	tcg: Add support for vector bitwise select This operation performs d = (b & a) | (c & ~a), and is present on a majority of host vector units. Include gvec expanders. Backports commit 38dc12947ec9106237f9cdbd428792c985cd86ae from qemu	2019-05-24 18:15:10 -04:00
Richard Henderson	fa363c3d6d	tcg: Fix missing checks and clears in tcg_gen_gvec_dup_mem The paths through tcg_gen_dup_mem_vec and through MO_128 were missing the check_size_align. The path through MO_128 was also missing the expand_clr. This last was not visible because the only user is ARM SVE, which would set oprsz == maxsz, and not require the clear. Fix by adding the check_size_align and using do_dup directly instead of duplicating the check in tcg_gen_gvec_dup_{i32,i64}. Backports commit 532ba368a13712724137228b5e7e9435994d25e1 from qemu	2019-05-24 18:07:28 -04:00
Richard Henderson	60cfe541b2	tcg/i386: Fix dupi/dupm for avx1 and 32-bit hosts The VBROADCASTSD instruction only allows %ymm registers as destination. Rather than forcing VEX.L and writing to the entire 256-bit register, revert to using MOVDDUP with an %xmm register. This is sufficient for an avx1 host since we do not support TCG_TYPE_V256 for that case. Also fix the 32-bit avx2, which should have used VPBROADCASTW. Fixes: 1e262b49b533 Backports commit 7b60ef3264e9627ac6efb34e9a6130647e9b55c0 from qemu	2019-05-24 18:04:08 -04:00
Alistair Francis	f8f3e50372	target/arm: Fix vector operation segfault Commit 89e68b575 "target/arm: Use vector operations for saturation" causes this abort() when booting QEMU ARM with a Cortex-A15: 0 0x00007ffff4c2382f in raise () at /usr/lib/libc.so.6 1 0x00007ffff4c0e672 in abort () at /usr/lib/libc.so.6 2 0x00005555559c1839 in disas_neon_data_insn (insn=<optimized out>, s=<optimized out>) at ./target/arm/translate.c:6673 3 0x00005555559c1839 in disas_neon_data_insn (s=<optimized out>, insn=<optimized out>) at ./target/arm/translate.c:6386 4 0x00005555559cd8a4 in disas_arm_insn (insn=4081107068, s=0x7fffe59a9510) at ./target/arm/translate.c:9289 5 0x00005555559cd8a4 in arm_tr_translate_insn (dcbase=0x7fffe59a9510, cpu=<optimized out>) at ./target/arm/translate.c:13612 6 0x00005555558d1d39 in translator_loop (ops=0x5555561cc580 <arm_translator_ops>, db=0x7fffe59a9510, cpu=0x55555686a2f0, tb=<optimized out>, max_insns=<optimized out>) at ./accel/tcg/translator.c:96 7 0x00005555559d10d4 in gen_intermediate_code (cpu=cpu@entry=0x55555686a2f0, tb=tb@entry=0x7fffd7840080 <code_gen_buffer+126091347>, max_insns=max_insns@entry=512) at ./target/arm/translate.c:13901 8 0x00005555558d06b9 in tb_gen_code (cpu=cpu@entry=0x55555686a2f0, pc=3067096216, cs_base=0, flags=192, cflags=-16252928, cflags@entry=524288) at ./accel/tcg/translate-all.c:1736 9 0x00005555558ce467 in tb_find (cf_mask=524288, tb_exit=1, last_tb=0x7fffd783e640 <code_gen_buffer+126084627>, cpu=0x1) at ./accel/tcg/cpu-exec.c:407 10 0x00005555558ce467 in cpu_exec (cpu=cpu@entry=0x55555686a2f0) at ./accel/tcg/cpu-exec.c:728 11 0x000055555588b0cf in tcg_cpu_exec (cpu=0x55555686a2f0) at ./cpus.c:1431 12 0x000055555588d223 in qemu_tcg_cpu_thread_fn (arg=0x55555686a2f0) at ./cpus.c:1735 13 0x000055555588d223 in qemu_tcg_cpu_thread_fn (arg=arg@entry=0x55555686a2f0) at ./cpus.c:1709 14 0x0000555555d2629a in qemu_thread_start (args=<optimized out>) at ./util/qemu-thread-posix.c:502 15 0x00007ffff4db8a92 in start_thread () at /usr/lib/libpthread. This patch ensures that we don't hit the abort() in the second switch case in disas_neon_data_insn() as we will return from the first case. Backports commit 2f143d3ad1c05e91cf2cdf5de06d59a80a95e6c8 from qemu	2019-05-24 18:02:32 -04:00
Richard Henderson	9287750362	target/arm: Simplify BFXIL expansion The mask implied by the extract is redundant with the one implied by the deposit. Also, fix spelling of BFXIL. Backports commit 87eb65a3c45c788a309986d48170a54a0d1c0705 from qemu	2019-05-24 18:01:26 -04:00
Richard Henderson	1778828644	target/arm: Use extract2 for EXTR This is, after all, how we implement extract2 in tcg/aarch64. Backports commit 80ac954c369e7e61bd1ed00cef07b63e11f9c734 from qemu	2019-05-24 17:58:58 -04:00
Richard Henderson	0412b3be8a	target/i386: Implement CPUID_EXT_RDRAND We now have an interface for guest visible random numbers. Backports commit 369fd5ca66810b2ddb16e23a497eabe59385eceb from qemu with the actual RNG portion disabled for the time being.	2019-05-23 15:12:50 -04:00
Richard Henderson	3dd7358a53	target/arm: Implement ARMv8.5-RNG Use the newly introduced infrastructure for guest random numbers. Backports commit de390645675966cce113bf5394445bc1f8d07c85 from qemu (with the actual RNG portion disabled to preserve determinism for the time being).	2019-05-23 15:03:23 -04:00
Richard Henderson	a8df33c37c	target/arm: Put all PAC keys into a structure This allows us to use a single syscall to initialize them all. Backports commit 108b3ba891408c4dce93df78261ec4aca38c0e2e from qemu	2019-05-23 14:54:06 -04:00
Paolo Bonzini	1341e371a8	target/i386: add MDS-NO feature Microarchitectural Data Sampling is a hardware vulnerability which allows unprivileged speculative access to data which is available in various CPU internal buffers. Some Intel processors use the ARCH_CAP_MDS_NO bit in the IA32_ARCH_CAPABILITIES MSR to report that they are not vulnerable, make it available to guests. Backports commit 20140a82c67467f53814ca197403d5e1b561a5e5 from qemu	2019-05-23 14:49:20 -04:00
Paolo Bonzini	a4f2517f46	target/i386: define md-clear bit md-clear is a new CPUID bit which is set when microcode provides the mechanism to invoke a flush of various exploitable CPU buffers by invoking the VERW instruction. Backports commit b2ae52101fca7f9547ac2f388085dbc58f8fe1c0 from qemu	2019-05-23 14:48:18 -04:00
Philippe Mathieu-Daudé	4a1b8d64bd	target/m68k: Optimize rotate_x() using extract_i32() Optimize rotate_x() using tcg_gen_extract_i32(). We can now free the 'sz' tcg_temp earlier. Since it is allocated with tcg_const_i32(), free it with tcg_temp_free_i32(). Backports commit 60d3d0cfeb1658d2827d6a4f0df27252bb36baba from qemu	2019-05-17 12:07:07 -04:00
Philippe Mathieu-Daudé	3c6cb445a0	target/m68k: Fix a tcg_temp leak The function gen_get_ccr() returns a tcg_temp created with tcg_temp_new(). Free it with tcg_temp_free(). Backports commit 44c64e90950adf9efe7f4235a32eb868d1290ebb from qemu	2019-05-17 12:05:11 -04:00
Philippe Mathieu-Daudé	a72a53c2d9	target/m68k: Reduce the l1 TCGLabel scope Backports commit 89fa312be0dfd8b4c539c8763796e785c6b00b46 from qemu	2019-05-17 12:03:37 -04:00
Peter Maydell	3fb64fd5a2	target/m68k: Switch to transaction_failed hook Switch the m68k target from the old unassigned_access hook to the transaction_failed hook. The notable difference is that rather than it being called for all physical memory accesses which fail (including those made by DMA devices or by the gdbstub), it is only called for those made by the CPU via its MMU. (In previous commits we put in explicit checks for the direct physical loads made by the target/m68k code which will no longer be handled by calling the unassigned_access hook.) Backports commit e1aaf3a88e95ab007445281e2b2f6e3c8da47f22 from qemu	2019-05-17 12:01:40 -04:00
Peter Maydell	ab63f1a102	target/m68k: In get_physical_address() check for memory access failures In get_physical_address(), use address_space_ldl() and address_space_stl() instead of ldl_phys() and stl_phys(). This allows us to check whether the memory access failed. For the moment, we simply return -1 in this case; add a TODO comment that we should ideally generate the appropriate kind of fault. Backports commit adcf0bf017351776510121e47b9226095836023c from qemu	2019-05-17 11:59:01 -04:00
Richard Henderson	2a4a7b9391	tcg: Use tlb_fill probe from tlb_vaddr_to_host Most of the existing users would continue around a loop which would fault the tlb entry in via a normal load/store. But for AArch64 SVE we have an existing emulation bug wherein we would mark the first element of a no-fault vector load as faulted (within the FFR, not via exception) just because we did not have its address in the TLB. Now we can properly only mark it as faulted if there really is no valid, readable translation, while still not raising an exception. (Note that beyond the first element of the vector, the hardware may report a fault for any reason whatsoever; with at least one element loaded, forward progress is guaranteed.) Backports commit 4811e9095c0491bc6f5450e5012c9c4796b9e59d from qemu	2019-05-16 18:27:03 -04:00
Laurent Vivier	8cdfed1032	linux-user: fix 32bit g2h()/h2g() sparc32plus has 64bit long type but only 32bit virtual address space. For instance, "apt-get upgrade" failed because of a mmap()/msync() sequence. mmap() returned 0xff252000 but msync() used g2h(0xffffffffff252000) to find the host address. The "(target_ulong)" in g2h() doesn't fix the address because it is 64bit long. This patch introduces an "abi_ptr" that is set to uint32_t if the virtual address space is addressed using 32bit in the linux-user case. It stays set to target_ulong with softmmu case. Backports commit 3e23de15237c81fe7af7c3ffa299a6ae5fec7d43 from qemu	2019-05-16 18:20:55 -04:00
Richard Henderson	e736ef3238	tcg: Remove CPUClass::handle_mmu_fault This hook is now completely replaced by tlb_fill. Backports commit 69963f5709a0645934c169784820d0bee22208ba from qemu	2019-05-16 18:12:17 -04:00
Lioncash	fcaa52c1fe	tcg: Synchronize with qemu Resolves any formatting discrepancies and bad merges that slipped through.	2019-05-16 18:11:08 -04:00
Richard Henderson	dab0061a0d	tcg: Use CPUClass::tlb_fill in cputlb.c We can now use the CPUClass hook instead of a named function. Create a static tlb_fill function to avoid other changes within cputlb.c. This also isolates the asserts within. Remove the named tlb_fill function from all of the targets. Backports commit c319dc13579a92937bffe02ad2c9f1a550e73973 from qemu	2019-05-16 17:35:37 -04:00
Richard Henderson	5d83199931	target/sparc: Convert to CPUClass::tlb_fill Backports commit e84942f2ceaa79430414f2cb68d77c044dadca96 from qemu	2019-05-16 17:29:35 -04:00
Richard Henderson	e98c731550	target/riscv: Convert to CPUClass::tlb_fill Note that env->pc is removed from the qemu_log as that value is garbage. The PC isn't recovered until cpu_restore_state, called from cpu_loop_exit_restore, called from riscv_raise_exception. Backports commit 8a4ca3c10a96be6ed7f023b685b688c4d409bbcb from qemu	2019-05-16 17:24:01 -04:00
Richard Henderson	14d48974a4	target/mips: Convert to CPUClass::tlb_fill Note that env->active_tc.PC is removed from the qemu_log as that value is garbage. The PC isn't recovered until cpu_restore_state, called from cpu_loop_exit_restore, called from do_raise_exception_err. Backports commit 931d019f5b2e7bbacb162869497123be402ddd86 from qemu	2019-05-16 17:19:47 -04:00
Richard Henderson	49cb8cfe5b	target/mips: Tidy control flow in mips_cpu_handle_mmu_fault Since the only non-negative TLBRET_* value is TLBRET_MATCH, the subsequent test for ret < 0 is useless. Use early return to allow subsequent blocks to be unindented. Backports commit e38f4eb63020075432cb77bf48398187809cf4a3 from qemu	2019-05-16 17:15:33 -04:00
Richard Henderson	f175e89ca2	target/mips: Pass a valid error to raise_mmu_exception for user-only At present we give ret = 0, or TLBRET_MATCH. This gets matched by the default case, which falls through to TLBRET_BADADDR. However, it makes more sense to use a proper value. All of the tlb-related exceptions are handled identically in cpu_loop.c, so TLBRET_BADADDR is as good as any other. Retain it. Backports commit 995ffde9622c01f5b307cab47f9bd7962ac09db2 from qemu	2019-05-16 17:14:02 -04:00
Richard Henderson	52998fe46d	target/m68k: Convert to CPUClass::tlb_fill Backports commit fe5f7b1b3a2317f598687218c348b54e02a75e1f from qemu	2019-05-16 17:12:41 -04:00
Richard Henderson	fe9ac6e1c4	target/i386: Convert to CPUClass::tlb_fill We do not support probing, but we do not need it yet either. Backports commit 5d0044212c375c0696baef7bba13699277dac5b5 from qemu	2019-05-16 17:08:14 -04:00
Richard Henderson	31ecdb5341	target/arm: Convert to CPUClass::tlb_fill Backports commit 7350d553b5066abdc662045d7db5cdb73d0f9d53 from qemu	2019-05-16 16:55:12 -04:00
Richard Henderson	1f30062c41	tcg: Add CPUClass::tlb_fill This hook will replace the (user-only mode specific) handle_mmu_fault hook, and the (system mode specific) tlb_fill function. The handle_mmu_fault hook was written as if there was a valid way to recover from an mmu fault, and had 3 possible return states. In reality, the only valid action is to raise an exception, return to the main loop, and deliver the SIGSEGV to the guest. Note that all of the current implementations of handle_mmu_fault for guests which support linux-user do in fact only ever return 1, which is the signal to return to the main loop. Using the hook for system mode requires that all targets be converted, so for now the hook is (optionally) used only from user-only mode. Backports commit da6bbf8513e621a8fc2fd315d77318f36547474d from qemu	2019-05-16 16:46:19 -04:00
Richard Henderson	de260cfbd6	tcg/aarch64: Do not advertise minmax for MO_64 The min/max instructions are not available for 64-bit elements. Backports commit a7b6d286cfb5205b9f5330aefc5727269b3d810f from qemu	2019-05-16 16:44:34 -04:00
Richard Henderson	552e48f14e	target/arm: Use tcg_gen_abs_i64 and tcg_gen_gvec_abs Backports commit 4e027a710673f5d4dc6cff88728bcfd32e4c47b0 from qemu	2019-05-16 16:43:02 -04:00
Richard Henderson	7c9b3a9021	tcg/aarch64: Support vector absolute value Backports commit a456394ae540f852cd0d10fd693fe9f33598dc01 from qemu	2019-05-16 16:39:14 -04:00
Richard Henderson	fd35490991	tcg/i386: Support vector absolute value Backports commit 18f9b65f1a4225dd314cb9b0a8dea968c5bc2ef3 from qemu	2019-05-16 16:37:33 -04:00
Richard Henderson	6d5e7856ff	tcg: Add support for vector absolute value Backports commit bcefc90208f8a1d6f619d61c2647281d92277015 from qemu	2019-05-16 16:33:43 -04:00
Richard Henderson	6d1730048d	tcg: Add support for integer absolute value Remove a function of the same name from target/arm/. Use a branchless implementation of abs gleaned from gcc. Backports commit ff1f11f7f8710a768f9313f24bd7f509d3db27e5 from qemu	2019-05-16 16:25:15 -04:00
Richard Henderson	18b3df6e4e	tcg/i386: Support vector scalar shift opcodes Backports commit 0a8d7a3bf5a149a82450eef555fd61728703dd84 from qemu	2019-05-16 16:19:44 -04:00
Richard Henderson	79b9dc559e	tcg: Add gvec expanders for vector shift by scalar Allow expansion either via shift by scalar or by replicating the scalar for shift by vector. Backports commit b4578cd91cda4cef1c413304353ca6dc5b957b60 from qemu	2019-05-16 16:17:58 -04:00
Richard Henderson	0217ee7b24	tcg/aarch64: Support vector variable shift opcodes Backports commit 79525dfd08262d8de10d271f17e5a4096ef96d16 from qemu	2019-05-16 15:58:54 -04:00
Richard Henderson	f793ec847d	tcg/i386: Support vector variable shift opcodes Backports commit a2ce146a06807fe1d1a81e878b8f249ff1e14038 from qemu	2019-05-16 15:53:33 -04:00
Richard Henderson	8c17687934	tcg: Add gvec expanders for variable shift The gvec expanders perform a modulo on the shift count. If the target requires alternate behaviour, then it cannot use the generic gvec expanders anyway, and will have to have its own custom code. Backports commit 5ee5c14cacda27e904cd6b0d9e7ffe1acff42838 from qemu	2019-05-16 15:51:09 -04:00
Richard Henderson	66e6bea084	tcg: Add INDEX_op_dupm_vec Allow the backend to expand dup from memory directly, instead of forcing the value into a temp first. This is especially important if integer/vector register moves do not exist. Note that officially tcg_out_dupm_vec is allowed to fail. If it did, we could fix this up relatively easily: VECE == 32/64: Load the value into a vector register, then dup. Both of these must work. VECE == 8/16: If the value happens to be at an offset such that an aligned load would place the desired value in the least significant end of the register, go ahead and load w/garbage in high bits. Load the value w/INDEX_op_ld{8,16}_i32. Attempt a move directly to vector reg, which may fail. Store the value into the backing store for OTS. Load the value into the vector reg w/TCG_TYPE_I32, which must work. Duplicate from the vector reg into itself, which must work. All of which is well and good, except that all supported hosts can support dupm for all vece, so all of the failure paths would be dead code and untestable. Backports commit 37ee55a081b7863ffab2151068dd1b2f11376914 from qemu	2019-05-16 15:38:02 -04:00
Richard Henderson	fd7a67e4a7	tcg/aarch64: Implement tcg_out_dupm_vec The LD1R instruction does all the work. Note that the only useful addressing mode is a base register with no offset. Backports commit f23e5e15edfd49d5dd72cab2ed2d85ac354b2eeb from qemu	2019-05-16 15:29:04 -04:00
Richard Henderson	a6fd4e2345	tcg/i386: Implement tcg_out_dupm_vec At the same time, improve tcg_out_dupi_vec wrt broadcast from the constant pool. Backports commit 1e262b49b5331441f697461e4305fe06719758a7 from qemu	2019-05-16 15:27:15 -04:00
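
Several of the commits listed above (ca58be9cb4, 2ea6dfbd63, 568da655c6) describe vector operations in terms of simple per-bit or per-element formulas. The following is a minimal scalar sketch of those formulas in plain C, intended only to illustrate the semantics stated in the commit messages. It is not the TCG implementation: the function names `bitsel64`, `cmpsel_lt_s32`, and `smin_s32` are hypothetical, the arrays stand in for vector lanes, and only the signed less-than comparison is modelled.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Bitwise select, as described in the "vector bitwise select" commit:
 * d = (b & a) | (c & ~a). For each bit, take b where a is 1, else c.
 * (Hypothetical helper name; operates on one 64-bit word.)
 */
static inline uint64_t bitsel64(uint64_t a, uint64_t b, uint64_t c)
{
    return (b & a) | (c & ~a);
}

/*
 * Per-element conditional move ("compare select"), modelled here for
 * signed 32-bit lanes and the less-than condition only:
 * d[i] = (x[i] < y[i]) ? t[i] : f[i].
 * Each lane is read before it is written, so d may alias an input.
 */
static void cmpsel_lt_s32(int32_t *d, const int32_t *x, const int32_t *y,
                          const int32_t *t, const int32_t *f, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        d[i] = (x[i] < y[i]) ? t[i] : f[i];
    }
}

/*
 * Signed minimum expressed through compare select, mirroring the idea
 * of a generic min/max fallback: min(x, y) = (x < y) ? x : y.
 */
static void smin_s32(int32_t *d, const int32_t *x, const int32_t *y, size_t n)
{
    cmpsel_lt_s32(d, x, y, x, y, n);
}
```

Calling `smin_s32(d, x, y, 4)` on two four-element arrays fills `d` with the lane-wise minimum. A maximum would follow the same pattern with the two selected operands swapped.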

6121 commits