unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2026-05-08 22:13:32 +00:00

Author	SHA1	Message	Date
Richard Henderson	18b3df6e4e	tcg/i386: Support vector scalar shift opcodes Backports commit 0a8d7a3bf5a149a82450eef555fd61728703dd84 from qemu	2019-05-16 16:19:44 -04:00
Richard Henderson	79b9dc559e	tcg: Add gvec expanders for vector shift by scalar Allow expansion either via shift by scalar or by replicating the scalar for shift by vector. Backports commit b4578cd91cda4cef1c413304353ca6dc5b957b60 from qemu	2019-05-16 16:17:58 -04:00
Richard Henderson	0217ee7b24	tcg/aarch64: Support vector variable shift opcodes Backports commit 79525dfd08262d8de10d271f17e5a4096ef96d16 from qemu	2019-05-16 15:58:54 -04:00
Richard Henderson	f793ec847d	tcg/i386: Support vector variable shift opcodes Backports commit a2ce146a06807fe1d1a81e878b8f249ff1e14038 from qemu	2019-05-16 15:53:33 -04:00
Richard Henderson	8c17687934	tcg: Add gvec expanders for variable shift The gvec expanders perform a modulo on the shift count. If the target requires alternate behaviour, then it cannot use the generic gvec expanders anyway, and will have to have its own custom code. Backports commit 5ee5c14cacda27e904cd6b0d9e7ffe1acff42838 from qemu	2019-05-16 15:51:09 -04:00
Richard Henderson	66e6bea084	tcg: Add INDEX_op_dupm_vec Allow the backend to expand dup from memory directly, instead of forcing the value into a temp first. This is especially important if integer/vector register moves do not exist. Note that officially tcg_out_dupm_vec is allowed to fail. If it did, we could fix this up relatively easily: VECE == 32/64: Load the value into a vector register, then dup. Both of these must work. VECE == 8/16: If the value happens to be at an offset such that an aligned load would place the desired value in the least significant end of the register, go ahead and load w/garbage in high bits. Load the value w/INDEX_op_ld{8,16}_i32. Attempt a move directly to vector reg, which may fail. Store the value into the backing store for OTS. Load the value into the vector reg w/TCG_TYPE_I32, which must work. Duplicate from the vector reg into itself, which must work. All of which is well and good, except that all supported hosts can support dupm for all vece, so all of the failure paths would be dead code and untestable. Backports commit 37ee55a081b7863ffab2151068dd1b2f11376914 from qemu	2019-05-16 15:38:02 -04:00
Richard Henderson	fd7a67e4a7	tcg/aarch64: Implement tcg_out_dupm_vec The LD1R instruction does all the work. Note that the only useful addressing mode is a base register with no offset. Backports commit f23e5e15edfd49d5dd72cab2ed2d85ac354b2eeb from qemu	2019-05-16 15:29:04 -04:00
Richard Henderson	a6fd4e2345	tcg/i386: Implement tcg_out_dupm_vec At the same time, improve tcg_out_dupi_vec wrt broadcast from the constant pool. Backports commit 1e262b49b5331441f697461e4305fe06719758a7 from qemu	2019-05-16 15:27:15 -04:00
Richard Henderson	d4e7c6a8c5	tcg: Add tcg_out_dupm_vec to the backend interface Currently stubbed out in all backends that support vectors. Backports commit d6ecb4a978b718dbe108a9fa9ecccc8b7f7cb579 from qemu	2019-05-16 15:24:48 -04:00
Richard Henderson	cf238d3544	tcg: Manually expand INDEX_op_dup_vec This case is similar to INDEX_op_mov_* in that we need to do different things depending on the current location of the source. Backports commit bab1671f0fa928fd678a22f934739f06fd5fd035 from qemu	2019-05-16 15:22:29 -04:00
Richard Henderson	3d20e1678c	tcg: Promote tcg_out_{dup,dupi}_vec to backend interface The i386 backend already has these functions, and the aarch64 backend could easily split out one. Nothing is done with these functions yet, but this will aid register allocation of INDEX_op_dup_vec in a later patch. Adjust the aarch64 tcg_out_dupi_vec signature to match the new interface. Backports commit e7632cfa8b76cdbbc1c76e8737338ef5844e7d60 from qemu	2019-05-16 15:18:48 -04:00
Richard Henderson	d58d9ad16e	tcg: Support cross-class moves without instruction support PowerPC Altivec does not support direct moves between vector registers and general registers. So when tcg_out_mov fails, we can use the backing memory for the temporary to perform the move. Backports commit 240c08d0998f402c325fce489de0d14831048128 from qemu	2019-05-16 15:16:23 -04:00
Richard Henderson	f86bd1c5d6	tcg: Return bool success from tcg_out_mov This patch merely changes the interface, aborting on all failures, of which there are currently none. Backports commit 78113e83e0007e869c9f0cb4c0497a77538988e3 from qemu	2019-05-16 15:14:42 -04:00
Richard Henderson	f7d9ee8451	tcg/arm: Use tcg_out_mov_reg in tcg_out_mov We have a function that takes an additional condition parameter over the standard backend interface. It already takes care of eliding no-op moves. Backports commit c16f52b2c5d91c36e121795bd3b386cea0b7573c from qemu	2019-05-16 15:10:52 -04:00
Richard Henderson	fef5700c9c	tcg: Assert fixed_reg is read-only The only fixed_reg is cpu_env, and it should not be modified during any TB. Therefore code that tries to special-case moves into a fixed_reg is dead. Remove it. Backports commit d63e3b6e694ad6c887be135dddb9cd4893f1a844 from qemu	2019-05-16 15:09:37 -04:00
Richard Henderson	c54b2776f6	tcg: Specify optional vector requirements with a list Replace the single opcode in .opc with a null-terminated array in .opt_opc. We still require that all opcodes be used with the same .vece. Validate the contents of this list with CONFIG_DEBUG_TCG. All tcg_gen_*_vec functions will check any list active during .fniv expansion. Swap the active list in and out as we expand other opcodes, or take control away from the front-end function. Convert all existing vector aware front ends. Backports commit 53229a7703eeb2bbe101a19a33ef22aaf960c65b from qemu	2019-05-16 15:05:02 -04:00
Richard Henderson	37762fd92b	tcg: Allow add_vec, sub_vec, neg_vec, not_vec to be expanded PowerPC Altivec does not support add and subtract of 64-bit elements. Prepare for that configuration by not assuming the operation is universally supported. Backports commit ce27c5d1a38e93da38653af71fb468c5eded4c7b from qemu	2019-05-16 14:33:18 -04:00
Richard Henderson	9a9b681b38	tcg: Do not recreate INDEX_op_neg_vec unless supported Use tcg_can_emit_vec_op instead of just TCG_TARGET_HAS_neg_vec, so that we check the type and vece for the actual operation. Backports commit ac383dde33405106469d04a78de1d76f1a730cb1 from qemu	2019-05-16 14:28:41 -04:00
David Hildenbrand	f3b4a64d27	tcg: Implement tcg_gen_gvec_3i() Let's add tcg_gen_gvec_3i(), similar to tcg_gen_gvec_2i(), however without introducing "gen_helper_gvec_3i *fnoi", as it isn't needed for now. Backports commit e1227bb6e59173117f094a6a13b998587b45c928 from qemu	2019-05-16 14:26:50 -04:00
Markus Armbruster	1b2c8c44d5	Clean up ill-advised or unusual header guards Leading underscores are ill-advised because such identifiers are reserved. Trailing underscores are merely ugly. Strip both. Our header guards commonly end in _H. Normalize the exceptions. Done with scripts/clean-header-guards.pl. Backports commit a8b991b52dcde75ab5065046653626951aac666d from qemu	2019-05-14 08:02:53 -04:00
Richard Henderson	9a02741c13	cputlb: Do unaligned store recursion to outermost function This is less tricky than for loads, because we always fall back to single byte stores to implement unaligned stores. Backports commit 4601f8d10d7628bcaf2a8179af36e04b42879e91 from qemu	2019-05-14 07:45:15 -04:00
Richard Henderson	bcab6f1719	cputlb: Do unaligned load recursion to outermost function If we attempt to recurse from load_helper back to load_helper, even via intermediary, we do not get all of the constants expanded away as desired. But if we recurse back to the original helper (or a shim that has a consistent function signature), the operands are folded away as desired. Backports commit 2dd926067867c2dd19e66d31a7990e8eea7258f6 from qemu	2019-05-14 07:43:31 -04:00
Richard Henderson	f12f36aebd	cputlb: Drop attribute flatten Going to approach this problem via __attribute__((always_inline)) instead, but full conversion will take several steps. Backports commit fc1bc777910dc14a3db4e2ad66f3e536effc297d from qemu	2019-05-14 07:33:39 -04:00
Richard Henderson	7991cd601f	cputlb: Move TLB_RECHECK handling into load/store_helper Having this in io_readx/io_writex meant that we forgot to re-compute index after tlb_fill. It also means we can use the normal aligned memory load path. It also fixes a bug in that we had cached a use of index across a tlb_fill. Backports commit f1be36969de2fb9b6b64397db1098f115210fcd9 from qemu	2019-05-14 07:28:15 -04:00
Alex Bennée	ccee796272	accel/tcg: demacro cputlb Instead of expanding a series of macros to generate the load/store helpers we move stuff into common functions and rely on the compiler to eliminate the dead code for each variant. Backports commit eed5664238ea5317689cf32426d9318686b2b75c from qemu	2019-05-14 07:28:11 -04:00
Peter Maydell	26cb1b8767	target/arm: Stop using variable length array in dc_zva Currently the dc_zva helper function uses a variable length array. In fact we know (as the comment above remarks) that the length of this array is bounded because the architecture limits the block size and QEMU limits the target page size. Use a fixed array size and assert that we don't run off it. Backports commit 63159601fb3e396b28da14cbb71e50ed3f5a0331 from qemu	2019-05-09 17:48:25 -04:00
Peter Maydell	7861820e94	target/arm: Implement XPSR GE bits In the M-profile architecture, if the CPU implements the DSP extension then the XPSR has GE bits, in the same way as the A-profile CPSR. When we added DSP extension support we forgot to add support for reading and writing the GE bits, which are stored in env->GE. We did put in the code to add XPSR_GE to the mask of bits to update in the v7m_msr helper, but forgot it in v7m_mrs. We also must not allow the XPSR we pull off the stack on exception return to set the nonexistent GE bits. Correct these errors: * read and write env->GE in xpsr_read() and xpsr_write() * only set GE bits on exception return if DSP present * read GE bits for MRS if DSP present Backports commit f1e2598c46d480c9e21213a244bc514200762828 from qemu	2019-05-09 17:46:31 -04:00
Cao Jiaxi	bcb1270f23	osdep: Fix mingw compilation regarding stdio formats I encountered the following compilation error on mingw: /mnt/d/qemu/include/qemu/osdep.h:97:9: error: '__USE_MINGW_ANSI_STDIO' macro redefined [-Werror,-Wmacro-redefined] \#define __USE_MINGW_ANSI_STDIO 1 ^ /mnt/d/llvm-mingw/aarch64-w64-mingw32/include/_mingw.h:433:9: note: previous definition is here \#define __USE_MINGW_ANSI_STDIO 0 /* was not defined so it should be 0 */ It turns out that __USE_MINGW_ANSI_STDIO must be set before any system headers are included, not just before stdio.h. Backports commit 946376c21be1cd9dcc3c7936b204b113781603f7 from qemu	2019-05-09 17:44:14 -04:00
Cao Jiaxi	3922118434	util/cacheinfo: Use uint64_t on LLP64 model to satisfy Windows ARM64 Windows ARM64 uses LLP64 model, which breaks current assumptions. Backports commit 8041336ef74e19ca607c1601016333c986de8f9c from qemu	2019-05-09 17:43:27 -04:00
Lioncash	a71c027063	decodetree: Add DisasContext argument to !function expanders This does require adjusting all existing users. Backports commit 451e4ffdb0003ab5ed0d98bd37b385c076aba183 from qemu	2019-05-09 17:40:45 -04:00
Richard Henderson	9030870a8f	decodetree: Expand a decode_load function Read the instruction, loading no more bytes than necessary. Backports commit 70e0711ab18fa48279cd2c8cc570b57f38648598 from qemu	2019-05-09 17:35:20 -04:00
Richard Henderson	a98e70e791	decodetree: Initial support for variable-length ISAs Assuming that the ISA clearly describes how to determine the length of the instruction, and the ISA has a reasonable maximum instruction length, the input to the decoder can be right-justified in an appropriate insn word. This is not 100% convenient, as out-of-line %fields are numbered relative to the maximum instruction length, but this appears to still be usable. Backports commit 17560e9349ff1fcce814184b37993f92378cf0c4 from qemu	2019-05-09 17:32:38 -04:00
Richard Henderson	8fdd009a9d	tcg: Remove CF_IGNORE_ICOUNT Now that we have curr_cflags, we can include CF_USE_ICOUNT early and then remove it as necessary. Backports commit 416986d3f97329655e30da7271a2d11c6d707b06 from qemu	2019-05-06 00:57:09 -04:00
Richard Henderson	12f9def3a2	tcg: Add CF_LAST_IO + CF_USE_ICOUNT to CF_HASH_MASK These flags are used by target/*/translate.c, and affect code generation. Backports commit 0cf8a44c2f56ba884c2f6db47d27fbb24975daa3 from qemu	2019-05-06 00:53:35 -04:00
Emilio G. Cota	b1b069e8ad	cpu-exec: lookup/generate TB outside exclusive region during step_atomic Now that all code generation has been converted to check CF_PARALLEL, we can generate !CF_PARALLEL code without having yet set !parallel_cpus -- and therefore without having to be in the exclusive region during cpu_exec_step_atomic. While at it, merge cpu_exec_step into cpu_exec_step_atomic. Backports commit ac03ee5331612e44beb393df2b578c951d27dc0d from qemu	2019-05-06 00:52:43 -04:00
Emilio G. Cota	c1e26c4e35	tcg: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. The tb->cflags field is not passed to tcg generation functions. So we add a field to TCGContext, storing there a copy of tb->cflags. Most architectures have <= 32 registers, which results in a 4-byte hole in TCGContext. Use this hole for the new field. Backports commit e82d5a2460b0e176128027651ff9b104e4bdf5cc from qemu	2019-05-06 00:52:08 -04:00
Emilio G. Cota	175a5223ad	target/sparc: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. Backports commit 87d757d60d66d5ee1608460b0f1e07e2b758db9c from qemu	2019-05-06 00:43:21 -04:00
Emilio G. Cota	77ccb4918d	target/m68k: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. Backports commit f0ddf11b23260f0af84fb529486a8f9ba2d19401 from qemu	2019-05-06 00:42:16 -04:00
Emilio G. Cota	ad2a4edd76	target/i386: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. Backports commit b5e3b4c2aca8eb5a9cfeedfb273af623f17c3731 from qemu	2019-05-04 22:45:49 -04:00
Emilio G. Cota	1715f382b4	target/arm: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. Backports commit 2399d4e7cec22ecf1c51062d2ebfd45220dbaace from qemu	2019-05-04 22:44:32 -04:00
Richard Henderson	4a858100f4	tcg: Include CF_COUNT_MASK in CF_HASH_MASK Backports commit cdfef1715c779eb528d633e8b76cbc8a10e71ac8 from qemu	2019-05-04 22:31:32 -04:00
Richard Henderson	30c0950567	tcg: Add CPUState cflags_next_tb We were generating code during tb_invalidate_phys_page_range, check_watchpoint, cpu_io_recompile, and (seemingly) discarding the TB, assuming that it would magically be picked up during the next iteration through the cpu_exec loop. Instead, record the desired cflags in CPUState so that we request the proper TB so that there is no more magic. Backports commit 9b990ee5a3cc6aa38f81266fb0c6ef37a36c45b9 from qemu	2019-05-04 22:30:22 -04:00
Richard Henderson	ee1ddf4a92	tcg: define CF_PARALLEL and use it for TB hashing along with CF_COUNT_MASK This will enable us to decouple code translation from the value of parallel_cpus at any given time. It will also help us minimize TB flushes when generating code via EXCP_ATOMIC. Note that the declaration of parallel_cpus is brought to exec-all.h to be able to define there the "curr_cflags" inline. Backports commit 4e2ca83e71b51577b06b1468e836556912bd5b6e from qemu	2019-05-04 22:22:06 -04:00
Lioncash	cc37db76b6	tcg: Synchronize with qemu	2019-05-04 21:40:23 -04:00
Daniel P. Berrangé	aec899b73e	configure: automatically pick python3 is available Unless overridden via an env var or configure arg, QEMU will only look for the 'python' binary in $PATH. This is unhelpful on distros which are only shipping Python 3.x (eg Fedora) in their default install as, if they comply with PEP 394, the bare 'python' binary won't exist. This changes configure so that by default it will search for all three common python binaries, preferring to find Python 3.x versions. Backports commit faf441429adfe5767be52c5dcdb8bc03161d064f from qemu	2019-05-03 11:36:36 -04:00
Thomas Huth	73176e89ce	configure: Remove old -config-devices.mak.d files when running configure When running "make" in a build directory from the pre-Kconfig merge time, the build process currently fails with: make: ** No rule to make target `.../default-configs/pci.mak', needed by `aarch64-softmmu/config-devices.mak'. Stop. To make sure that this problem at least goes away when the user runs "configure" (or "sh config.status") again, we have to make sure that we re-generate the .mak.d files. Thus remove the old stale files while running the configure script. Backports commit 9c79024225af6b3ae04ea2dd94a5e5c4132a9e65 from qemu	2019-05-03 11:33:56 -04:00
Thomas Huth	7107f72cc6	configure: Add -Wno-typedef-redefinition to CFLAGS (for Clang) Without the -Wno-typedef-redefinition option, clang complains if a typedef gets redefined in gnu99 mode (since this is officially a C11 feature). This used to also happen with older versions of GCC, but since we've bumped our minimum GCC version to 4.8, all versions of GCC that we support do not seem to issue this warning in gnu99 mode anymore. So this has become a common problem for people who only test their code with GCC - they do not notice the issue until they submit their patches and suddenly patchew or a maintainer complains. Now that we do not urgently need to keep the code clean from typedef redefintions anymore with recent versions of GCC, we can ease the situation with clang, too, and simply shut these warnings off for good. Backports commit e6e90feedb706b1b92827a5977b37e1e8defb8ef from qemu	2019-05-03 11:33:06 -04:00
Eduardo Habkost	42c35d968a	accel: Remove unused AccelClass::available field The field is not used anymore, we can remove it. Backports commit 8d006d4bc2ab4f72877d8bd47cba9aa8d24b54d0 from qemu	2019-05-03 11:31:27 -04:00
Peter Maydell	d4549fccfb	target/arm: Enable FPU for Cortex-M4 and Cortex-M33 Enable the FPU by default for the Cortex-M4 and Cortex-M33. Backports commit 14fd0c31e26b88a2189b3f459b864d5e1faf302a from qemu	2019-04-30 11:29:26 -04:00
Peter Maydell	77ae3982b4	target/arm: Implement VLLDM for v7M CPUs with an FPU Implement the VLLDM instruction for v7M for the FPU present cas. Backports commit 956fe143b4f254356496a0a1c479fa632376dfec from qemu	2019-04-30 11:27:54 -04:00

1 2 3 4 5 ...

5829 commits