unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-23 15:15:38 +00:00

Author	SHA1	Message	Date
Richard Henderson	a124110db4	tcg/s390: Remove retranslation code There is no longer a need for preserving branch offset operands, as we no longer re-translate. Backports commit 3661612fc3e4b65be03482bf6bafd116101881e1 from qemu	2018-12-18 05:21:03 -05:00
Richard Henderson	85485dc20e	tcg/ppc: Fold away noaddr branch routines There is no longer a need for preserving branch offset operands, as we no longer re-translate. Backports commit f9c7246faa279237200a2a53beacaa8100ea1900 from qemu	2018-12-18 05:18:59 -05:00
Richard Henderson	b49a353adb	tcg/arm: Fold away noaddr branch routines There are one use apiece for these. There is no longer a need for preserving branch offset operands, as we no longer re-translate. Backports commit 37ee93a974c49ab9edfcd1db0aad3838b0395b14 from qemu	2018-12-18 05:17:22 -05:00
Richard Henderson	1167aa481d	tcg/arm: Remove reloc_pc24_atomic It is unused since 3fb53fb4d12f2e7833bd1659e6013237b130ef20. Backports commit 2672ccc7eee742e23928f4bf60a13a77d64f540d from qemu	2018-12-18 05:16:29 -05:00
Richard Henderson	0a8bc142d3	tcg/aarch64: Fold away noaddr branch routines There are one use apiece for these. There is no longer a need for preserving branch offset operands, as we no longer re-translate. Backports commit 733589b3382afcb0ae9f43e72e083a5ddd38abd5 from qemu	2018-12-18 05:15:41 -05:00
Richard Henderson	cbe1065e83	tcg/aarch64: Remove reloc_pc26_atomic It is unused since b68686bd4bfeb70040b4099df993dfa0b4f37b03. Backports commit 90d6cb781130891f96eb54f8315e29fbd4e99a71 from qemu	2018-12-18 05:14:22 -05:00
Richard Henderson	091b4fa1ff	tcg/i386: Move TCG_REG_CALL_STACK from define to enum Backports commit 66c0285df4270d184afce5ac8b97ac175c89562f from qemu	2018-12-18 05:13:47 -05:00
Richard Henderson	f3a8a4a306	tcg/i386: Always use %ebp for TCG_AREG0 For x86_64, this can remove a REX prefix resulting in smaller code when manipulating globals of type i32, as we move them between backing store via cpu_env, aka TCG_AREG0. Backports commit 5740d9f714835964873325d1210b26811252843f from qemu	2018-12-18 05:13:05 -05:00
Richard Henderson	7ab51fc012	target/sparc: Remove the constant pool Partially reverts ab20bdc1162. The 14-bit displacement that we allowed to reach the constant pool is not always sufficient. Retain the tb-relative addressing, as that is how most return values from the tb are computed. Backports commit f6823cbe3787aa47db62deede6683077e3da9a2c from qemu	2018-12-18 05:12:11 -05:00
Thomas Huth	3ba2114043	tcg/tcg.h: Remove GCC check for tcg_debug_assert() macro Both GCC v4.8 and Clang v3.4 (our minimum versions) support __builtin_unreachable(), so we can remove the version check here now. Backports commit 6fa2cef205a60b5c5c3b058f53852416b885c455 from qemu	2018-12-18 03:53:56 -05:00
Peter Maydell	78906db067	tcg/tcg-op.h: Add multiple include guard The tcg-op.h header was missing the usual guard against multiple inclusion; add it. (Spotted by lgtm.com's static analyzer.) Backports commit a7ce790a029bd94eb320d8c69f38900f5233997e from qemu	2018-11-11 08:51:51 -05:00
Craig Janeczek	58dc377890	target/mips: Introduce MXU registers Define and initialize the 16 MXU registers - 15 general computational register, and 1 control register). There is also a zero register, but it does not have any corresponding variable. Backports commit eb5559f67dc8dc12335dd996877bb6daaea32eb2 from qemu.	2018-11-11 05:50:52 -05:00
Richard Henderson	d74e00a30a	tcg: Split CONFIG_ATOMIC128 GCC7+ will no longer advertise support for 16-byte __atomic operations if only cmpxchg is supported, as for x86_64. Fortunately, x86_64 still has support for __sync_compare_and_swap_16 and we can make use of that. AArch64 does not have, nor ever has had such support, so open-code it. Backports commit e6cd4bb59b8154fa00da611200beef7eb4e8ec56 from qemu	2018-10-23 15:17:39 -04:00
Emilio G. Cota	e5b43d2794	tcg: plug holes in struct TCGProfile This plugs two 4-byte holes in 64-bit. Backports commit dd1d7da23b0abef87f46d9ab39ba9b0974eaec04 from qemu	2018-10-23 14:38:16 -04:00
Emilio G. Cota	223975ada0	tcg: fix use of uninitialized variable under CONFIG_PROFILER We forgot to initialize n in commit 15fa08f845 ("tcg: Dynamically allocate TCGOps", 2017-12-29). Backports commit c1f543b739086733024e31d74a52d9e41553f316 from qemu	2018-10-23 14:37:37 -04:00
Richard Henderson	e01deeb9ba	tcg: Implement CPU_LOG_TB_NOCHAIN during expansion Rather than test NOCHAIN before linking, do not emit the goto_tb opcode at all. We already do this for goto_ptr. Backports commit d7f425fdea991f052241c6479acd9feae834063b from qemu	2018-10-23 14:35:12 -04:00
Lioncash	cc3d618e61	tcg: Remove unnecessary MSVC ifdef All relevant arrays have at least one member in them now, making this unnecessary.	2018-10-06 05:08:17 -04:00
Lioncash	766c70f608	arm: Move cpu_M0 to DisasContext	2018-10-06 03:32:39 -04:00
Lioncash	787fd448b1	arm: Move cpu_V1 to DisasContext	2018-10-06 03:28:42 -04:00
Lioncash	1aa20da917	arm: Move cpu_V0 to DisasContext	2018-10-06 03:26:52 -04:00
Lioncash	06c21baaa4	arm: Move cpu_F1d to DisasContext	2018-10-06 03:11:54 -04:00
Lioncash	5f3dd68f9c	arm: Move cpu_F0d to DisasContext	2018-10-06 03:07:42 -04:00
Lioncash	e457ce8ccc	arm: Move cpu_F1s to DisasContext	2018-10-06 03:02:06 -04:00
Lioncash	97a5955a2a	tcg: Remove leftover unused variable from TCGContext This was previously used by the i386 target, however all of the locals were moved to the DisasContext struct, leaving this unused.	2018-10-06 02:46:27 -04:00
Emilio G. Cota	b9bb6cead9	target/i386: move x86_64_hregs to DisasContext And convert it to a bool to use an existing hole in the struct. Backports commit 1dbe15ef57abdf7b6a26c8e638abf6413a4b9d0c from qemu	2018-10-04 04:02:50 -04:00
Emilio G. Cota	04530acab2	target/i386: move cpu_tmp3_i32 to DisasContext Backports commit 4f82446de695f080ed148a0e47fc141e928665af from qemu	2018-10-04 03:56:05 -04:00
Emilio G. Cota	781e6bde41	target/i386: move cpu_tmp2_i32 to DisasContext Backports commit 6bd48f6f206b6f32a5bbeebc3ae6886d4f587981 from qemu	2018-10-04 03:53:31 -04:00
Emilio G. Cota	c13337d1bc	target/i386: move cpu_ptr1 to DisasContext Backports commit 6387e8303ffb26cfb40b0f93372f1519229b4d2c from qemu	2018-10-04 03:48:09 -04:00
Emilio G. Cota	3e442d4480	target/i386: move cpu_ptr0 to DisasContext Backports commit 2ee2646491a293a92d1c85e90e12419a8c199ed0 from qemu	2018-10-04 03:46:53 -04:00
Emilio G. Cota	cc872aa711	target/i386: move cpu_tmp4 to DisasContext Backports commit 5022f28f1e4033eb369b744ad61b96d086beca1b from qemu	2018-10-04 03:45:28 -04:00
Emilio G. Cota	d2752ebc42	target/i386: move cpu_tmp0 to DisasContext Backports commit fbd80f02df3fe272ba0f4825df27b8459dafbc14 from qemu	2018-10-04 03:41:13 -04:00
Emilio G. Cota	b704b6c205	target/i386: move cpu_T1 to DisasContext Backports commit b48597b0eda32d4c7ade2ba3f98f06f62289e3e2 from qemu	2018-10-04 03:35:10 -04:00
Emilio G. Cota	70b327dc82	target/i386: move cpu_T0 to DisasContext Backports commit c66f97273f677d76afaaeb0e688eb08499701b1b from qemu	2018-10-04 03:29:13 -04:00
Emilio G. Cota	c1d70758ea	target/i386: move cpu_A0 to DisasContext Backports commit 6b672b5d6b14422c131969c5725f738751e12847 from qemu	2018-10-04 01:16:35 -04:00
Emilio G. Cota	30c66bcca3	target/i386: move cpu_cc_srcT to DisasContext Backports commit 93a3e108eb6a9bb781ab7db6e92d91528e482030 from qemu	2018-10-04 00:59:00 -04:00
Roman Kapl	33e69342e3	tcg/i386: fix vector operations on 32-bit hosts The TCG backend uses LOWREGMASK to get the low 3 bits of register numbers. This was defined as no-op for 32-bit x86, with the assumption that we have eight registers anyway. This assumption is not true once we have xmm regs. Since LOWREGMASK was a no-op, xmm register indidices were wrong in opcodes and have overflown into other opcode fields, wreaking havoc. To trigger these problems, you can try running the "movi d8, #0x0" AArch64 instruction on 32-bit x86. "vpxor %xmm0, %xmm0, %xmm0" should be generated, but instead TCG generated "vpxor %xmm0, %xmm0, %xmm2". Fixes: 770c2fc7bb ("Add vector operations") Backports commit 93bf9a42733321fb632bcb9eafd049ef0e3d9417 from qemu	2018-10-02 04:22:35 -04:00
Richard Henderson	9e8c8a617b	tcg/optimize: Do not skip default processing of dup_vec If we do not opimize away dup_vec, we must mark its output as changed. Backports commit 1fb57da72ae0886eba1234a2d98ddd10e88a9efc from qemu	2018-08-09 00:53:07 -04:00
Richard Henderson	a4c2dbef3e	tcg/i386: Mark xmm registers call-clobbered When host vector registers and operations were introduced, I failed to mark the registers call clobbered as required by the ABI. Fixes: 770c2fc7bb7 Backports commit 672189cd586ea38a2c1d8ab91eb1f9dcff5ceb05 from qemu	2018-07-23 20:00:26 -04:00
Alex Bennée	11948dd1cc	tcg/aarch64: limit mul_vec size In AdvSIMD we can only do 32x32 integer multiples although SVE is capable of larger 64 bit multiples. As a result we can end up generating invalid opcodes. Fix this by only reprting we can emit mul vector ops if the size is small enough. Fixes a crash on: sve-all-short-v8.3+sve@vq3/insn_mul_z_zi___INC.risu.bin When running on AArch64 hardware. Backports commit e65a5f227d77a5dbae7a7123c3ee915ee4bd80cf from qemu	2018-07-21 14:15:59 -04:00
Richard Henderson	f1aaf5be62	tcg: Restrict check_size_impl to multiples of the line size Normally this is automatic in the size restrictions that are placed on vector sizes coming from the implementation. However, for the legitimate size tuple [oprsz=8, maxsz=32], we need to clear the final 24 bytes of the vector register. Without this check, do_dup selects TCG_TYPE_V128 and clears only 16 bytes. Backports commit 499748d7683198a765d17b4fdf6901ab9dca920c from qemu	2018-07-09 16:41:53 -04:00
John Arbuckle	22c3206738	tcg/i386: Use byte form of xgetbv instruction The assembler in most versions of Mac OS X is pretty old and does not support the xgetbv instruction. To go around this problem, the raw encoding of the instruction is used instead. Backports commit 1019242af11400252f6735ca71a35f81ac23a66d from qemu	2018-06-28 13:23:32 -05:00
Richard Henderson	10e2b13650	tcg: Pass tb and index to tcg_gen_exit_tb separately Do the cast to uintptr_t within the helper, so that the compiler can type check the pointer argument. We can also do some more sanity checking of the index argument. Backports commit 07ea28b41830f946de3841b0ac61a3413679feb9 from qemu	2018-06-07 11:56:32 -04:00
Emilio G. Cota	7e8902eccc	tcg: fix s/compliment/complement/ typos Backports commit 1d349821551c2da4dfefe36c6ac17319f33ebbd5 from qemu	2018-05-22 00:29:51 -04:00
Richard Henderson	de1708aadc	tcg: Introduce atomic helpers for integer min/max Given that this atomic operation will be used by both risc-v and aarch64, let's not duplicate code across the two targets. Backports commit 5507c2bf35aa6b4705939349184e71afd5e058b2 from qemu	2018-05-14 08:06:42 -04:00
Richard Henderson	eef66443b2	tcg: Introduce helpers for integer min/max These operations are re-invented by several targets so far. Several supported hosts have insns for these, so place the expanders out-of-line for a future introduction of tcg opcodes. Backports commit b87fb8cd9f9a0ba599ff79e7bf03222da02e5724 from qemu	2018-05-14 07:31:50 -04:00
Richard Henderson	f417df19b7	tcg: Limit the number of ops in a TB In 6001f7729e12 we partially attempt to address the branch displacement overflow caused by 15fa08f845. However, gcc/testsuite/gcc.target/aarch64/advsimd-intrinsics/vqtbX.c is a testcase that contains a TB so large as to overflow anyway. The limit here of 8000 ops produces a maximum output TB size of 24112 bytes on a ppc64le host with that test case. This is still much less than the maximum forward branch distance of 32764 bytes. Backports commit abebf92597186be2bc48d487235da28b1127860f from qemu	2018-05-11 11:25:01 -04:00
Richard Henderson	33f7f6f09a	tcg/i386: Fix dup_vec in non-AVX2 codepath The VPUNPCKLD* instructions are all "non-destructive source", indicated by "NDS" in the encoding string in the x86 ISA manual. This means that they take two source operands, one of which is encoded in the VEX.vvvv field. We were incorrectly treating them as if they were destructive-source and passing 0 as the 'v' argument of tcg_out_vex_modrm(). This meant we were always using %xmm0 as one of the source operands, causing incorrect results if the register allocator happened to want to use something else. For instance the input AArch64 insn: DUP v26.16b, w21 which becomes TCG IR ops: dup_vec v128,e8,tmp2,x21 st_vec v128,e8,tmp2,env,$0xa40 was assembled to: 0x607c568c: c4 c1 7a 7e 86 e8 00 00 vmovq 0xe8(%r14), %xmm0 0x607c5694: 00 0x607c5695: c5 f9 60 c8 vpunpcklbw %xmm0, %xmm0, %xmm1 0x607c5699: c5 f9 61 c9 vpunpcklwd %xmm1, %xmm0, %xmm1 0x607c569d: c5 f9 70 c9 00 vpshufd $0, %xmm1, %xmm1 0x607c56a2: c4 c1 7a 7f 8e 40 0a 00 vmovdqu %xmm1, 0xa40(%r14) 0x607c56aa: 00 when the vpunpcklwd insn should be "%xmm1, %xmm1, %xmm1". This resulted in our incorrectly setting the output vector to q26=0000320000003200:0000320000003200 when given an input of x21 == 0000000002803200 rather than the expected all-zeroes. Pass the correct source register number to tcg_out_vex_modrm() for these insns. Backports commit 7eb30ef0ba2eb59e7430d4848ae8d4bf4e50f768 from qemu	2018-05-11 11:22:38 -04:00
Laurent Vivier	ec12091943	tcg: workaround branch instruction overflow in tcg_out_qemu_ld/st ppc64 uses a BC instruction to call the tcg_out_qemu_ld/st slow path. BC instruction uses a relative address encoded on 14 bits. The slow path functions are added at the end of the generated instructions buffer, in the reverse order of the callers. So more we have slow path functions more the distance between the caller (BC) and the function increases. This patch changes the behavior to generate the functions in the same order of the callers. Backports commit 6001f7729e12dd1d810291e4cbf83cee8e07441d from qemu	2018-05-03 15:09:07 -04:00
Richard Henderson	2150745db4	tcg: Improve TCGv_ptr support Drop TCGV_PTR_TO_NAT and TCGV_NAT_TO_PTR internal macros. Add tcg_temp_local_new_ptr, tcg_gen_brcondi_ptr, tcg_gen_ext_i32_ptr, tcg_gen_trunc_i64_ptr, tcg_gen_extu_ptr_i64, tcg_gen_trunc_ptr_i32. Use inlines instead of macros where possible. Backports commit 5bfa803448638a45542441fd6b7cc1241403ea72 from qemu	2018-05-03 15:05:43 -04:00
Richard Henderson	4fa9ea2ae1	tcg: Allow wider vectors for cmp and mul In db432672, we allow wide inputs for operations such as add. However, in 212be173 and 3774030a we didn't do the same for compare and multiply. Backports commit 9a938d86b04025ac605db0ea9819e5896bf576ec from qemu	2018-05-03 14:42:57 -04:00
Henry Wertz	090e2e9d0e	tcg/arm: Fix memory barrier encoding I found with qemu 2.11.x or newer that I would get an illegal instruction error running some Intel binaries on my ARM chromebook. On investigation, I found it was quitting on memory barriers. qemu instruction: mb $0x31 was translating as: 0x604050cc: 5bf07ff5 blpl #0x600250a8 After patch it gives: 0x604050cc: f57ff05b dmb ish In short, I found INSN_DMB_ISH (memory barrier for ARMv7) appeared to be correct based on online docs, but due to some endian-related shenanigans it had to be byte-swapped to suit qemu; it appears INSN_DMB_MCR (memory barrier for ARMv6) also should be byte swapped (and this patch does so). I have not checked for correctness of aarch64's barrier instruction. Backports commit 3f814b803797c007abfe5c4041de754e01723031 from qemu	2018-05-03 14:41:36 -04:00
Richard Henderson	16a55143dc	tcg: Document INDEX_mul[us]h_* Backports commit d103021269ca9307ed7ca0d845d2b9e6c387509a from qemu	2018-05-03 14:40:49 -04:00
Peter Maydell	778d0c47df	tcg/mips: Handle large offsets from target env to tlb_table The MIPS TCG target makes the assumption that the offset from the target env pointer to the tlb_table is less than about 64K. This used to be true, but gradual addition of features to the Arm target means that it's no longer true there. This results in the build-time assertion failing: In file included from /home/pm215/qemu/include/qemu/osdep.h:36:0, from /home/pm215/qemu/tcg/tcg.c:28: /home/pm215/qemu/tcg/mips/tcg-target.inc.c: In function ‘tcg_out_tlb_load’: /home/pm215/qemu/include/qemu/compiler.h:90:36: error: static assertion failed: "not expecting: offsetof(CPUArchState, tlb_table[NB_MMU_MODES - 1][1]) > 0x7ff0 + 0x7fff" ^ /home/pm215/qemu/include/qemu/compiler.h:98:30: note: in expansion of macro ‘QEMU_BUILD_BUG_MSG’ ^ /home/pm215/qemu/tcg/mips/tcg-target.inc.c:1236:9: note: in expansion of macro ‘QEMU_BUILD_BUG_ON’ QEMU_BUILD_BUG_ON(offsetof(CPUArchState, ^ /home/pm215/qemu/rules.mak:66: recipe for target 'tcg/tcg.o' failed An ideal long term approach would be to rearrange the CPU state so that the tlb_table was not so far along it, but this is tricky because it would move it from the "not cleared on CPU reset" part of the struct to the "cleared on CPU reset" part. As a simple fix for the 2.12 release, make the MIPS TCG target handle an arbitrary offset by emitting more add instructions. This will mean an extra instruction in the fastpath for TCG loads and stores for the affected guests (currently just aarch64-softmmu) Backports commit 161dfd1e7fad1203840c0390f235030eba3fd23c from qemu	2018-04-16 13:44:39 -04:00
Richard Henderson	49476ebf5e	tcg: Introduce tcg_set_insn_start_param The parameters for tcg_gen_insn_start are target_ulong, which may be split into two TCGArg parameters for storage in the opcode on 32-bit hosts. Fixes the ARM target and its direct use of tcg_set_insn_param, which would set the wrong argument in the 64-on-32 case. Backports commit 9743cd5736263e90d312b2c33bd739ffe1eae70d from qemu	2018-04-11 19:34:18 -04:00
Richard Henderson	c2e46f2931	tcg: Mark muluh_i64 and mulsh_i64 as 64-bit ops Failure to do so results in the tcg optimizer sign-extending any constant fold from 32-bits. This turns out to be visible in the RISC-V testsuite using a host that emits these opcodes (e.g. any non-x86_64). Backports commit f2f1dde75160cac6ede330f3db50dc817d01a2d6 from qemu	2018-03-29 14:03:00 -04:00
Lioncash	6bdfeb35ec	tcg/i386: Perform comparison pass against qemu Ensures formatting and code are consistent.	2018-03-20 06:29:06 -04:00
Richard Henderson	0dcb2d20ed	tcg: Add choose_vector_size This unifies 5 copies of checks for supported vector size, and in the process fixes a missing check in tcg_gen_gvec_2s. This lead to an assertion failure for 64-bit vector multiply, which is not available in the AVX instruction set. Bakports commit adb196cbd5cff26547bc32a208074f03f4c4a627 from qemu	2018-03-17 20:22:31 -04:00
Richard Henderson	2310bd4887	tcg/i386: Support INDEX_op_dup2_vec for -m32 Unknown why -m32 was passing with gcc but not clang; it should have failed for both. This would be used for tcg_gen_dup_i64_vec, and visible with the right TB and an aarch64 guest. Backports commit 7f34ed4bcdfda55f978f51aadca64aa970c9f4b6 from qemu	2018-03-17 20:22:24 -04:00
Richard Henderson	e9eee21efd	tcg: Improve tcg_gen_muli_i32/i64 Convert multiplication by power of two to left shift. Backports commit b2e3ae9452fa55eb036739ec39c33f0782a97504 from qemu	2018-03-17 20:22:10 -04:00
Richard Henderson	31e93018f3	tcg: Allow 6 arguments to TCG helpers We already handle this in the backends, and the lifetime datum for the TCGOp is already large enough. Backports commit 1df3caa946e08b387511dfba3a37d78910e51796 from qemu	2018-03-17 18:29:04 -04:00
Lioncash	035f1afa7d	tcg: move tcg backend files into accel/tcg/ move tcg-runtime.c, translate-all.(ch) and translate-common.c into accel/tcg/ subdirectory and updated related trace-events file. Backports commit 244f144134d0dd182f1af8654e7f9a79fe770368 and applies relevant changes made in db432672dc50ed86dda17ac821b7eb07411a90af and d9bb58e51068dfc48746c6af0179926c8dc05bce from qemu	2018-03-13 11:48:15 -04:00
Lioncash	99dbbf1571	tcg/optimize: Perform comparison pass with qemu Keeps formatting and code synced	2018-03-12 18:06:29 -04:00
Lioncash	21b0afe218	tcg: Perform comparison pass with qemu Makes formatting and code consistent with qemu	2018-03-12 18:03:06 -04:00
Lioncash	b28c64ed34	tcg/i386: Amend bad merge	2018-03-12 10:11:03 -04:00
Richard Henderson	a16ee979fc	tcg/i386: Always use TZCNT when available I think this is cleaner than sometimes using BSF. Backports commit 39f099ec9d6d420b6fe6f7f4f8ed80ae29c65ff2 from qemu	2018-03-12 05:11:42 -04:00
Richard Henderson	7e327aaf84	util: Introduce include/qemu/cpuid.h Clang 3.9 passes the CONFIG_AVX2_OPT configure test. However, the supplied <cpuid.h> does not contain the bit_AVX2 define that we use when detecting whether the routine can be enabled. Introduce a qemu-specific header that uses the compiler's definition of __cpuid et al, but supplies any missing bit_* definitions needed. This avoids introducing any extra ifdefs to util/bufferiszero.c, and allows quite a few to be removed from tcg/i386/tcg-target.inc.c. Backports commit 5dd8990841a9e331d9d4838a116291698208cbb6 from qemu	2018-03-09 12:12:00 -05:00
Richard Henderson	d1da0b8f6d	tcg/aarch64: Add vector operations Backports commit 14e4c1e2355473ccb2939afc69ac8f25de103b92 from qemu	2018-03-07 08:07:58 -05:00
Richard Henderson	b3e89e9996	tcg/i386: Add vector operations The x86 vector instruction set is extremely irregular. With newer editions, Intel has filled in some of the blanks. However, we don't get many 64-bit operations until SSE4.2, introduced in 2009. The subsequent edition was for AVX1, introduced in 2011, which added three-operand addressing, and adjusts how all instructions should be encoded. Given the relatively narrow 2 year window between possible to support and desirable to support, and to vastly simplify code maintainence, I am only planning to support AVX1 and later cpus. Backports commit 770c2fc7bb70804ae9869995fd02dadd6d7656ac from qemu	2018-03-07 08:07:40 -05:00
Richard Henderson	7f55d6ed69	tcg/optimize: Handle vector opcodes during optimize Trivial move and constant propagation. Some identity and constant function folding, but nothing that requires knowledge of the size of the vector element. Backports commit 170ba88f45bd7b1c5593021ed8e174f663b0bd1a from qemu	2018-03-06 16:10:09 -05:00
Richard Henderson	ac4d051b05	tcg: Add generic vector helpers with a scalar operand Use dup to convert a non-constant scalar to a third vector. Add addition, multiplication, and logical operations with an immediate. Add addition, subtraction, multiplication, and logical operations with a non-constant scalar. Allow for the front-end to build operations in which the scalar operand comes first. Backports commit 22fc3527034678489ec554e82fd52f8a7f05418e from qemu	2018-03-06 16:10:09 -05:00
Richard Henderson	57bdf0faa2	tcg: Add generic helpers for saturating arithmetic No vector ops as yet. SSE only has direct support for 8- and 16-bit saturation; handling 32- and 64-bit saturation is much more expensive. Backports commit f49b12c6e6a75a5bd109bcbbda072b24e5fb8dfd from qemu	2018-03-06 16:10:09 -05:00
Richard Henderson	ab8579123e	tcg: Add generic vector ops for multiplication Backports commit 3774030a3e523689df24a7ed22854ce7a06b0116 from qemu	2018-03-06 16:10:08 -05:00
Richard Henderson	f9c4930ecd	tcg: Add generic vector ops for comparisons Backports commit 212be173f01e85e6589fd76676827953a84a732b from qemu	2018-03-06 16:09:38 -05:00
Richard Henderson	577ee114c3	tcg: Add generic vector ops for constant shifts Opcodes are added for scalar and vector shifts, but considering the varied semantics of these do not expose them to the front ends. Do go ahead and provide them in case they are needed for backend expansion. Backports commit d0ec97967f940bbc11dced83422b39c224127f1e from qemu	2018-03-06 14:03:30 -05:00
Richard Henderson	64365612bf	tcg: Add generic vector expanders Backports commit db432672dc50ed86dda17ac821b7eb07411a90af from qemu	2018-03-06 13:42:52 -05:00
Richard Henderson	12fb906688	tcg: Standardize integral arguments to expanders Some functions use intN_t arguments, some use uintN_t, some just used "unsigned". To aid putting function pointers in tables, we need consistency. Backports commit 474b2e8f0f765515515b495e6872b5e18a660baf from qemu	2018-03-06 12:18:28 -05:00
Richard Henderson	b9cd924fa5	tcg: Add types and basic operations for host vectors Nothing uses or enables them yet. Backports commit d2fd745fe8b9ac574d28b7ac63c39f6529749bd2 from qemu	2018-03-06 12:13:32 -05:00
Richard Henderson	9ef32fc039	tcg: Allow multiple word entries into the constant pool This will be required for storing vector constants. Backports commit da73a4abca6acefc4bb55d30bd0242bdaddb6045 from qemu	2018-03-06 11:43:21 -05:00
Lioncash	02eee6d5f7	tcg/ppc: Update to commit 030ffe39dd4128eb90483af82a5b23b23054a466	2018-03-06 09:16:37 -05:00
Richard Henderson	6212981120	tcg/ppc: Support tlb offsets larger than 64k AArch64 with SVE has an offset of 80k to the 8th TLB. Backports commit 4a64e0fd6876e45b34cd87b700ee30ef5c10c87a from qemu	2018-03-06 09:14:05 -05:00
Richard Henderson	c4f6a7d06d	tcg/arm: Support tlb offsets larger than 64k AArch64 with SVE has an offset of 80k to the 8th TLB. Backports commit 71f9cee9d0a36dc4c00dfeeeca1301f265268f62 from qemu	2018-03-06 09:13:17 -05:00
Richard Henderson	9cd6985799	tcg/arm: Fix double-word comparisons The code sequence we were generating was only good for unsigned comparisons. For signed comparisions, use the sequence from gcc. Fixes booting of ppc64 firmware, with a patch changing the code sequence for ppc comparisons. Backports commit 7170ac33135e6ecf89752d3949bcecf9b9766d1c from qemu	2018-03-06 09:12:14 -05:00
Richard Henderson	bbd87f9d73	tcg: Add tcg_signed_cond Complimenting the existing tcg_unsigned_cond. Backports commit 923ed1750186591b04d7d61399f6d68b4e0608f2 from qemu	2018-03-05 16:55:17 -05:00
Richard Henderson	140058221d	tcg: Generalize TCGOp parameters We had two fields specific to INDEX_op_call. Rename these and add some macros so that the fields may be reused for other opcodes. Backports commit cd9090aa9dbba30db8aec9a2fc103aaf1ab0f5a7 from qemu	2018-03-05 16:53:50 -05:00
Richard Henderson	7fe5f620df	tcg: Dynamically allocate TCGOps With no fixed array allocation, we can't overflow a buffer. This will be important as optimizations related to host vectors may expand the number of ops used. Use QTAILQ to link the ops together. Backports commit 15fa08f8451babc88d733bd411d4c94976f9d0f8 from qemu	2018-03-05 16:34:40 -05:00
Richard Henderson	5f074f09ab	tcg: Remove TCGV_UNUSED* and TCGV_IS_UNUSED* These are now trivial sets and tests against NULL. Unwrap. Backports commit f764718d0cb30af9f1f8e1d6a33622cc05ca4155 from qemu	2018-03-05 15:58:15 -05:00
Richard Henderson	5ef155a68f	tcg/s390x: Use constant pool for prologue Rather than have separate code only used for guest_base, rely on a recent change to handle constant pool entries. Backports commit ba2c747992f8c315c2fbddba196ce9137430d61d from qemu	2018-03-05 11:28:39 -05:00
Richard Henderson	ef3f552229	tcg: Allow constant pool entries in the prologue Both ARMv6 and AArch64 currently may drop complex guest_base values into the constant pool. But generic code wasn't expecting that, and the pool is not emitted. Correct that. Backports commit 5b38ee31616d1532c3c3a6dc644a9160d608ed2f from qemu	2018-03-05 11:25:56 -05:00
Richard Henderson	ab9df6244c	tcg: Use offsets not indices for TCGv_* Using the offset of a temporary, relative to TCGContext, rather than its index means that we don't use 0. That leaves offset 0 free for a NULL representation without having to leave index 0 unused. Backports commit e89b28a63501c0ad6d2501fe851d0c5202055e70 from qemu	2018-03-05 10:12:08 -05:00
Richard Henderson	4d9c8583fa	tcg: Remove TCGV_EQUAL* When we used structures for TCGv_*, we needed a macro in order to perform a comparison. Now that we use pointers, this is just clutter Backports commit 11f4e8f8bfaa2caaab24bef6bbbb8a0205015119 from qemu	2018-03-05 09:16:07 -05:00
Richard Henderson	d450156414	tcg: Remove GET_TCGV_* and MAKE_TCGV_* The GET and MAKE functions weren't really specific enough. We now have a full complement of functions that convert exactly between temporaries, arguments, tcgv pointers, and indices. The target/sparc change is also a bug fix, which would have affected a host that defines TCG_TARGET_HAS_extr[lh]_i64_i32, i.e. MIPS64. Backports commit dc41aa7d34989b552efe712ffe184236216f960b from qemu	2018-03-05 09:12:26 -05:00
Richard Henderson	960eb3f4f9	tcg: Introduce temp_tcgv_{i32,i64,ptr} Backports commit 085272b35e0644fea373c33b5265c1818b7a978c from qemu	2018-03-05 08:55:52 -05:00
Richard Henderson	2bb5011b18	tcg: Introduce tcgv_{i32,i64,ptr}_{arg,temp} Transform TCGv_* to an "argument" or a temporary. For now, an argument is simply the temporary index. Backports commit ae8b75dc6ec808378487064922f25f1e7ea7a9be from qemu	2018-03-05 08:46:12 -05:00
Richard Henderson	9f8c6a456b	tcg: Use per-temp state data in optimize While we're touching many of the lines anyway, adjust the naming of the functions to better distinguish when "TCGArg" vs "TCGTemp" should be used. Backports commit 6349039d0b06eda59820629b934944246b14a1c1 from qemu	2018-03-05 08:24:06 -05:00
Richard Henderson	387060ccf5	tcg: Remove unused TCG_CALL_DUMMY_TCGV Backports commit 54534d7cfd3bdff1aa1f6c9472d94243d2303656 from qemu	2018-03-05 07:52:35 -05:00
Richard Henderson	d104b792a6	tcg: Change temp_allocate_frame arg to TCGTemp Backports commit 2272e4a791b7e1a01ffac143616ba4ece9a5762d from qemu	2018-03-05 07:51:40 -05:00
Richard Henderson	35a7a9c9a4	tcg: Avoid loops against variable bounds Copy s->nb_globals or s->nb_temps to a local variable for the purposes of iteration. This should allow the compiler to use low-overhead looping constructs on some hosts. Backports commit ac3b88911ebc6fc841f28898ee8aed40839debe2 from qemu	2018-03-05 07:50:06 -05:00
Richard Henderson	1f4ac863bf	tcg: Use per-temp state data in liveness This avoids having to allocate external memory for each temporary. Backports commit b83eabeac06e38706738bd5e92b1ba117a1b554d from qemu	2018-03-05 07:47:51 -05:00
Richard Henderson	87f2067aac	tcg: Introduce temp_arg, export temp_idx At the same time, drop the TCGContext argument and use tcg_ctx instead. Backports commit 1807f4c40098070008eb84b2032e25b7ac42569e from qemu	2018-03-05 07:24:17 -05:00
Richard Henderson	a659a03ff5	tcg: Return NULL temp for TCG_CALL_DUMMY_ARG Backports commit c6c7d84df8889b9d6298466999b88a8a42e5f976 from qemu	2018-03-05 07:22:38 -05:00
Richard Henderson	010ded3088	tcg: Add temp_global bit to TCGTemp This avoids needing to test the index of a temp against nb_globals. Backports commit fa477d25470187030614288d35bc734edffa41ee from qemu	2018-03-05 07:21:10 -05:00
Richard Henderson	a9c46ad7a0	tcg: Introduce arg_temp Backports commit 434391390ba99996af1591b427a73b3f5c05065e from qemu	2018-03-05 07:17:44 -05:00
Richard Henderson	c8f0f6901e	tcg: Propagate TCGOp down to allocators Backports commit dd186292017641d5b31fc13225a420677e1d20d3 from qemu	2018-03-05 07:12:48 -05:00
Richard Henderson	f1e2ea6847	tcg: Propagate args to op->args in tcg.c Backports commit efee3746fa471852daba7674b0d34f8c88be7559 from qemu	2018-03-05 07:06:50 -05:00
Richard Henderson	845cfc2ae9	tcg: Propagate args to op->args in optimizer Backports commit acd937019bdaf933fcf1a7b57679ba07119c89b7 from qemu	2018-03-05 06:56:06 -05:00
Richard Henderson	eb488f5bd6	tcg: Merge opcode arguments into TCGOp Rather than have a separate buffer of 10*max_ops entries, give each opcode 10 entries. The result is actually a bit smaller and should have slightly more cache locality. Backports commit 75e8b9b7aa0b95a761b9add7e2f09248b101a392 from qemu	2018-03-05 04:45:20 -05:00
Jiang Biao	60ef6d016d	tcg/mips: delete commented out extern keyword Backports commit 8df8d529ed958de4e23dcbf38bd34eff1a4716f2 from qemu	2018-03-05 03:24:25 -05:00
Emilio G. Cota	239e9771df	tcg: define TCG_HIGHWATER Will come in handy very soon. Backports commit a505785cd221994dd3713bde860861869a059940 from qemu	2018-03-05 03:22:27 -05:00
Emilio G. Cota	8552d95c52	exec-all: extract tb->tc_* into a separate struct tc_tb In preparation for adding tc.size to be able to keep track of TB's using the binary search tree implementation from glib. Backports commit e7e168f41364c6e83d0f75fc1b3ce7f9c41ccf76 from qemu	2018-03-05 02:57:22 -05:00
Emilio G. Cota	5fae6dd433	tcg: remove addr argument from lookup_tb_ptr It is unlikely that we will ever want to call this helper passing an argument other than the current PC. So just remove the argument, and use the pc we already get from cpu_get_tb_cpu_state. This change paves the way to having a common "tb_lookup" function. Backports commit 7f11636dbee89b0e4d03e9e2b96e14649a7db778 from qemu	2018-03-05 02:16:34 -05:00
Emilio G. Cota	f1d6630893	tcg/mips: constify tcg_target_callee_save_regs Backports commit d453ec78251d03cbd4ffc28dbf6070931c8ae469 from qemu	2018-03-05 02:08:36 -05:00
Emilio G. Cota	3cf23eb256	tcg/i386: constify tcg_target_callee_save_regs Backports commit e268f4c036d2b47a4f8bf293c1371b328e03ca04 from qemu	2018-03-05 02:08:02 -05:00
Richard Henderson	7168f72d4d	tcg/mips: Fully convert tcg_target_op_def Backports commit 89b2e37e6506d92b00ac478e7953be6ddd7a86a9 from qemu	2018-03-04 23:54:26 -05:00
Richard Henderson	24c5be0472	tcg/sparc: Fully convert tcg_target_op_def Backports commit 9be44a16c258287aab5a3accda153d3a5144359f from qemu	2018-03-04 23:52:18 -05:00
Richard Henderson	d3b1c8d5a4	tcg/ppc: Fully convert tcg_target_op_def Backports commit 6cb3658a04149b2c1fb92e2ea9d2e2f6cecc0014 from qemu	2018-03-04 23:50:58 -05:00
Richard Henderson	3094e7927e	tcg/arm: Fully convert tcg_target_op_def Backports commit 7536b82d28876d1ffe0359667b28c93d49386fa0 from qemu	2018-03-04 23:48:55 -05:00
Richard Henderson	47ed20fdd4	tcg/aarch64: Fully convert tcg_target_op_def Backports commit 1897cc2eb8be2d8be23380b45a2d3c1a2808723f from qemu	2018-03-04 23:46:38 -05:00
Richard Henderson	fe632c4df8	tcg: Fix types in tcg_regset_{set,reset}_reg There was a potential problem here with an ILP32 host with 64 host registers. Backports commit 80a8b9a910e14d4a1937f70dce944891990f3441 from qemu	2018-03-04 23:44:13 -05:00
Richard Henderson	fc8b4316a9	tcg: Remove tcg_regset_set32 It's not even clear what the interface REG and VAL32 were supposed to mean. All uses had REG = 0 and VAL32 was the bitset assigned to the destination. Backports commit f46934df662182097dce07d57ec00f37e4d2abf1 from qemu	2018-03-04 23:42:59 -05:00
Richard Henderson	9a9c2ede4a	tcg: Remove tcg_regset_{or,and,andnot,not} Backports commit 07ddf036fa66bca279590c09fe1c46bcdcc5bcff from qemu	2018-03-04 23:34:16 -05:00
Richard Henderson	7ba6f6f5e6	tcg: Remove tcg_regset_set Backports commit d21369f5fb41299d5e7b032ec6da12da7f95f72f from qemu	2018-03-04 23:31:35 -05:00
Richard Henderson	49d09d6888	tcg: Remove tcg_regset_clear Backports commit ccb1bb66ea2a42e773bfa04178d8b383ff86d4d8 from qemu	2018-03-04 23:24:45 -05:00
Richard Henderson	7b68a8f0ca	tcg: Add tcg_op_supported Backports commit be0f34b5840312bbe9627c2b9f68a25f32903dae from qemu	2018-03-04 23:20:28 -05:00
Lioncash	3c5f8b2800	tcg/ppc: Update to commit 53c89efd02cef626040165cc8f06b5cf2c15355d	2018-03-04 23:00:03 -05:00
Lioncash	c786137691	tcg/arm: Update to commit afe74dbd6a58031741b68e99843c1f1d390996b2	2018-03-04 22:58:36 -05:00
Richard Henderson	504bdad70d	tcg/arm: Tighten tlb indexing offset test We are not going to use ldrd for loading the comparator for 32-bit guests, so don't limit cmp_off to 8 bits then. This eliminates one insn in the tlb load for some guests. Backports commit 95ede84f4de18747d03d79c148013cff99acd60b from qemu	2018-03-04 22:57:04 -05:00
Richard Henderson	e4d05c2567	tcg/arm: Improve tlb load for armv7 Use UBFX to avoid limitation on CPU_TLB_BITS. Since we're dropping the initial shift, we need to replace the page masking. We can use MOVW+BIC to do this without shifting. The result is the same size as the armv6 path with one less conditional instruction. Backports commit 647ab96aaf5defeb138e48d610f7f633c587b40d from qemu	2018-03-04 22:56:27 -05:00
Richard Henderson	b3fd6a8c8c	tcg/sparc: Use constant pool for movi Backports commit e9823b4c3347370414b63010ec4a2a4754e4abb5 from qemu	2018-03-04 22:53:59 -05:00
Richard Henderson	b786e2d27e	tcg/sparc: Introduce TCG_REG_TB Backports commit ab20bdc11624837bd0c8aea83c603b66f0406e8b from qemu	2018-03-04 22:51:38 -05:00
Richard Henderson	0c3781e7eb	tcg/aarch64: Use constant pool for movi Backports commit 55129955e92ec164ee2d778f20070dc214109bc6 from qemu	2018-03-04 22:46:50 -05:00
Richard Henderson	5150970625	tcg/s390: Use constant pool for cmpi Also use CHI/CGHI for 16-bit signed constants. Backports commit a534bb15f30ff7e420434b3e5746bcad595c5429 from qemu	2018-03-04 22:44:26 -05:00
Richard Henderson	c08620b984	tcg/s390: Use constant pool for xori Backports commit 5bf67a9217a31512f35b036924e1db1baf2f9ebf from qemu	2018-03-04 22:39:14 -05:00
Lioncash	35d3118469	tcg/s390: Use constant pool for ori	2018-03-04 22:35:27 -05:00
Richard Henderson	bdadfa7520	tcg/s390: Use constant pool for andi Backports commit bdcd5d1926a7ae42c060efdcaa15074930a92ebb from qemu	2018-03-04 22:33:08 -05:00
Richard Henderson	bc23bab79d	tcg/s390: Use constant pool for movi Split out maybe_out_small_movi for use with other operations that want to add to the constant pool. Backports commit 28eef8aaece5e83df4568d9842ab9611ec130b2c from qemu	2018-03-04 22:32:04 -05:00
Richard Henderson	ba1563eb2f	tcg/s390: Fix sign of patch_reloc addend We were passing in -2 instead of +2, but then ignoring the actual contents of addend in the calculation. Backports commit e692a3492d04500355bcf23575eed7cf137b38d5 from qemu	2018-03-04 22:28:24 -05:00
Richard Henderson	2fff7d54cb	tcg/s390: Introduce TCG_REG_TB Backports commit 829e1376d94009a7ccacc0535bffcc679f7bb507 from qemu	2018-03-04 22:26:52 -05:00
Richard Henderson	b96f53e8a3	tcg/i386: Store out-of-range call targets in constant pool Already it saves 2 bytes per call, but also the constant pool entry may well be shared across multiple calls. Backports commit 4e45f23943c0bb91588627de3801826546155ad8 from qemu	2018-03-04 22:22:49 -05:00
Richard Henderson	e9d8cef430	tcg: Infrastructure for managing constant pools A new shared header tcg-pool.inc.c adds new_pool_label, for registering a tcg_target_ulong to be emitted after the generated code, plus relocation data to install a pointer to the data. A new pointer is added to the TCGContext, so that we dump the constant pool as data, not code. Backports commit 57a269469dbf70013dab3a176e1735636010a772 from qemu	2018-03-04 22:17:33 -05:00
Richard Henderson	f96514a99c	tcg: Rearrange ldst label tracking Dispense with TCGBackendData, as it has never been used for more than holding a single pointer. Use a define in the cpu/tcg-target.h to signal requirement for TCGLabelQemuLdst, so that we can drop the no-op tcg-be-null.h stubs. Rename tcg-be-ldst.h to tcg-ldst.inc.c. Backports commit 659ef5cbb893872d25e9d95191cc23b16546c8a1 from qemu	2018-03-04 22:13:13 -05:00
Richard Henderson	3c8cdb237a	tcg: Use tcg_malloc to allocate TCGLabelQemuLdst Pre-allocating 640 of them per TB is a waste. Backports commit 686461c96254f34bcce67a949c72867ab6ec3fcf from qemu	2018-03-04 22:00:24 -05:00
Richard Henderson	31b8b67cd3	tcg: Move USE_DIRECT_JUMP discriminator to tcg/cpu/tcg-target.h Replace the USE_DIRECT_JUMP ifdef with a TCG_TARGET_HAS_direct_jump boolean test. Replace the tb_set_jmp_target1 ifdef with an unconditional function tb_target_set_jmp_target. While we're touching all backends, add a parameter for tb->tc_ptr; we're going to need it shortly for some backends. Move tb_set_jmp_target and tb_add_jump from exec-all.h to cpu-exec.c. Backports commit a85833933628384d74ec412024d55cf012640287 from qemu	2018-03-04 21:52:35 -05:00
Richard Henderson	1642f7d404	tcg/s390: Use slbgr for setcond le and leu Backports commit 4609190b5f7f68a5e2a8738029594f45a062d4c9 from qemu	2018-03-04 13:48:42 -05:00
Richard Henderson	83e703d2bd	tcg/s390: Use load-on-condition-2 facility This allows LOAD HALFWORD IMMEDIATE ON CONDITION, eliminating one insn in some common cases. Backports commit 7af525af01b9615c4f4df5da2e8a50f2fe00b023 from qemu	2018-03-04 13:46:06 -05:00
Richard Henderson	d87e7126c3	tcg/s390: Use distinct-operands facility This allows using a 3-operand insn form for some arithmetic, logicals and shifts. Backports commit c2097136ad6e3f476fd177fc3d2e48fa6bffacfd from qemu	2018-03-04 13:42:56 -05:00
Richard Henderson	3df9d84459	tcg/s390: Merge ori+xori facilities check to tcg_target_op_def Backports commit e42349cbd6afd1f6838e719184e3d07190c02de7 from qemu	2018-03-04 13:36:20 -05:00
Richard Henderson	becadbe755	tcg/s390: Merge add2i facilities check to tcg_target_op_def Backports commit ba18b07dc689a21caa31feee922c165e90b4c28b from qemu	2018-03-04 13:34:16 -05:00
Richard Henderson	a1b4fa71cf	tcg/s390: Merge muli facilities check to tcg_target_op_def Backports commit a8f0269e9edde143d831b4a016b1e86c1f175123 from qemu	2018-03-04 13:32:29 -05:00
Richard Henderson	168ebcce61	tcg/s390: Merge cmpi facilities check to tcg_target_op_def Backports commit 07952d9570add4c78594b46605825408d956b2ad from qemu	2018-03-04 13:30:57 -05:00
Richard Henderson	9a29afcb50	tcg/s390: Fully convert tcg_target_op_def Use a switch instead of searching a table. Backports commit 9b5500b697b61460f433f0e3a30619ace2c32ca6 from qemu	2018-03-04 13:28:01 -05:00
Pranith Kumar	902886cc45	tcg: Implement implicit ordering semantics Currently, we cannot use mttcg for running strong memory model guests on weak memory model hosts due to missing ordering semantics. We implicitly generate fence instructions for stronger guests if an ordering mismatch is detected. We generate fences only for the orders for which fence instructions are necessary, for example a fence is not necessary between a store and a subsequent load on x86 since its absence in the guest binary tells that ordering need not be ensured. Also note that if we find multiple subsequent fence instructions in the generated IR, we combine them in the TCG optimization pass. This patch allows us to boot an x86 guest on ARM64 hosts using mttcg. Backports commit b32dc3370a666e237b2099c22166b15e58cb6df8 from qemu	2018-03-04 13:24:27 -05:00
Pranith Kumar	862bbef07d	tcg: Add tcg target default memory ordering Backports commit 71650df7b0ee0600308810a267a123b971b3d533 from qemu	2018-03-04 13:22:41 -05:00
Richard Henderson	b33f2b40e8	tcg: Increase minimum alignment from tcg_malloc to 8 For a 64-bit ILP32 host, aligning to sizeof(long) is not enough. Guess the minimum for any host is 8, as that covers uint64_t. Qemu doesn't use a host long double or host vectors, except in extremely limited circumstances. Fixes a bus error for a sparc v8plus host. Backports commit 13aaef678ed377b12b76dc7fb9e615b2f2f9047b from qemu	2018-03-04 01:36:59 -05:00
Richard Henderson	29ea0681d0	tcg/arm: Fix runtime overalignment test Patch 85aa80813dd changed the IF emitting the TST instruction, but failed to change the ?: converting CMP to CMPEQ, so the result of the TST is ignored. Backports commit ca671de8af96798e0f493378240034620a3a04ee from qemu	2018-03-04 01:36:20 -05:00
Jiang Biao	f1211b1c88	tcg/mips: reserve a register for the guest_base. Reserve a register for the guest_base using ppc code for reference. By doing so, we do not have to recompute it for every memory load. Backports commit 4df9cac57f5220c17d856292e90fce455f708421 from qemu	2018-03-03 23:04:55 -05:00
Jiang Biao	60703a4f57	tcg/mips: Bugfix for crash when running program with qemu-i386. When running a helloworld program with qemu-i386 in linux-user mode on Loongson 3A3000, it will crash. This patch fix the bug. Backports commit 8b8d768f19037a825a0bc81654492caa7c8fab8b from qemu	2018-03-03 22:06:26 -05:00
Pranith Kumar	57f8eec080	tcg/aarch64: Enable indirect jump path using LDR (literal) This patch enables the indirect jump path using an LDR (literal) instruction. It will be interesting to test and see which performs better among the two paths. Backports commit 2acee8b2b5e6bba2935bb6ce5be92d0f0f9799cb from qemu	2018-03-03 22:03:39 -05:00
Pranith Kumar	5e9e39cafd	tcg/aarch64: Use ADRP+ADD to compute target address We use ADRP+ADD to compute the target address for goto_tb. This patch introduces the NOP instruction which is used to align the above instruction pair so that we can use one atomic instruction to patch the destination offsets. Backports commit b68686bd4bfeb70040b4099df993dfa0b4f37b03 from qemu	2018-03-03 22:01:38 -05:00
Pranith Kumar	0998ba8259	tcg/aarch64: Introduce and use long branch to register We can use a branch to register instruction for exit_tb for offsets greater than 128MB. Backports commit 23b7aa1d2af04ba57cc94f74d9f0ab25dce72fa0 from qemu	2018-03-03 21:59:58 -05:00
Laurent Vivier	1c6b1e2b9f	target-m68k: use floatx80 internally Coldfire uses float64, but 680x0 use floatx80. This patch introduces the use of floatx80 internally and enables 680x0 80bits FPU. Backports commit f83311e4764f1f25a8abdec2b32c64483be1759b from qemu	2018-03-03 19:35:17 -05:00
Richard Henderson	9ec975448b	tcg/arm: Use ldr (literal) for goto_tb The new placement of the TB means that we can use one insn to load the goto_tb destination directly from the TB. Backports commit 308714e6bc945389c64faf1b9213e2c0d3f03391 from qemu	2018-03-03 17:14:27 -05:00
Richard Henderson	c99edca63b	tcg/arm: Try pc-relative addresses for movi Backports commit 9c39b94f1448770e7e573e9516d2483816785d1b from qemu	2018-03-03 17:13:31 -05:00
Richard Henderson	68275ba6f3	tcg/arm: Use indirect branch for goto_tb Backports commit 3fb53fb4d12f2e7833bd1659e6013237b130ef20 from qemu	2018-03-03 17:11:18 -05:00
Richard Henderson	9a85cb0a26	tcg/aarch64: Use ADR in tcg_out_movi The new placement of the TB means that we can use one insn to load the return value for exit_tb returning the TB pointer. Backports commit cc74d332ff9a78684374847375ef63fc4bd10436 from qemu	2018-03-03 17:09:42 -05:00
Emilio G. Cota	d3ada2feb5	tcg: allocate TB structs before the corresponding translated code Allocating an arbitrarily-sized array of tbs results in either (a) a lot of memory wasted or (b) unnecessary flushes of the code cache when we run out of TB structs in the array. An obvious solution would be to just malloc a TB struct when needed, and keep the TB array as an array of pointers (recall that tb_find_pc() needs the TB array to run in O(log n)). Perhaps a better solution, which is implemented in this patch, is to allocate TB's right before the translated code they describe. This results in some memory waste due to padding to have code and TBs in separate cache lines--for instance, I measured 4.7% of padding in the used portion of code_gen_buffer when booting aarch64 Linux on a host with 64-byte cache lines. However, it can allow for optimizations in some host architectures, since TCG backends could safely assume that the TB and the corresponding translated code are very close to each other in memory. See this message by rth for a detailed explanation: https://lists.gnu.org/archive/html/qemu-devel/2017-03/msg05172.html Subject: Re: GSoC 2017 Proposal: TCG performance enhancements Backports commit 6e3b2bfd6af488a896f7936e99ef160f8f37e6f2 from qemu	2018-03-03 17:05:49 -05:00
Aurelien Jarno	0e9d3d1943	tcg/mips: implement goto_ptr Backports commit 5786e0683c4f8170dd05a550814b8809d8ae6d86 from qemu	2018-03-03 14:19:46 -05:00
Richard Henderson	1d6c4f1a42	tcg/arm: Implement goto_ptr Backports commit 085c648bef7301eabe7d4a3301c8d012ae4423b8 from qemu	2018-03-03 14:18:41 -05:00
Richard Henderson	3b02642372	tcg/arm: Clarify tcg_out_bx for arm4 host In theory this would re-enable usage of QEMU on an armv4 host. Whether this is worthwhile is debatable -- we've been unconditionally issuing the armv5t BX instruction in the prologue since 2011 without complaint. Possibly we should simply require an armv6 host. Backports commit 702a947484eb3e615183dafc93de590ab0679f60 from qemu	2018-03-03 14:17:13 -05:00
Richard Henderson	d496bb6150	tcg/s390: Implement goto_ptr Backports commit 46644483cae978c734460131bb1d9071f813b287 from qemu	2018-03-03 14:16:03 -05:00
Richard Henderson	f0420c3427	tcg/sparc: Implement goto_ptr Backports commit 38f81dc5938fb7025531c5ed602afd41fef799a7 from qemu	2018-03-03 14:14:32 -05:00
Richard Henderson	81f1aae572	tcg/aarch64: Implement goto_ptr Measurements: SPECint06 (test set), x86_64-linux-user. Host: APM 64-bit ARMv8 (Atlas/A57) @ 2.4 GHz 1.45x +-+-------------------------------------------------------------------------------------------------------------+-+ \| ***** \| \| +++ * * +goto-ptr \| 1.4x +-+...****...................................................................................................+-+ \| +++* * * +++ \| 1.35x +-+................................................................****....................................+-+ \| * * * +++ \| \| * * * * * * \| 1.3x +-+.......................................................................................................+-+ \| * * * * * * \| \| * * * * * * ***** \| 1.25x +-+.................****.........................................................***.................+-+ \| * * * * * * * +++ * * \| 1.2x +-+.................................................................................................+-+ \| * * * * * * * * * * * * \| \| * * * * * * * * * * * * ***** \| 1.15x +-+...............................................................................................+-+ \| * * * * * * * * +++ * * * * * * \| \| * * * * * * * * ***** * * * * * * \| 1.1x +-+........................****.........***..................................................+-+ \| * * * * * * * * * * * * * * * * * * * \| 1.05x +-+.........................................................................................+-+ \| * * ***** * * * * * * * * * * * * * * * * * * \| \| * * * * * * * * * * * * *** *** * * * * * * * * * * \| 1x +-+---***---*---*----*---*---*---*---*---*---*----*---*---***---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjenxalancbmk hmean png: http://imgur.com/en9HE8L Backports commit b19f0c2e7d344d4d62daf554951acdb6c94a34b0 from qemu	2018-03-03 14:13:09 -05:00
Emilio G. Cota	e4dfb7f807	tcg/i386: implement goto_ptr Backports commit 5cb4ef80f65252dd85b86fa7f3c985015423d670 from qemu	2018-03-02 21:08:38 -05:00
Emilio G. Cota	8f4f15e5f5	tcg: Introduce goto_ptr opcode and tcg_gen_lookup_and_goto_ptr Instead of exporting goto_ptr directly to TCG frontends, export tcg_gen_lookup_and_goto_ptr(), which calls goto_ptr with the pointer returned by the lookup_tb_ptr() helper. This is the only use case we have for goto_ptr and lookup_tb_ptr, so having this function is very convenient. Furthermore, it trivially allows us to avoid calling the lookup helper if goto_ptr is not implemented by the backend. Backports commit cedbcb01529cb6cf9a2289cdbebbc63f6149fc18 from qemu	2018-03-02 21:05:18 -05:00
Aurelien Jarno	00ebbae128	tcg/mips: fix field extraction opcode The "msb" argument should correspond to (len - 1). Backports commit 2f5a5f5774d95baacf86c03aa8a77a2d0390f2b2 from qemu	2018-03-02 18:59:12 -05:00
Richard Henderson	69116abafc	tcg: Initialize return value after exit_atomic Users of tcg_gen_atomic_cmpxchg and do_atomic_op rightfully utilize the output. Even though this code is dead, it gets translated, and without the initialization we encounter a tcg_error. Backports commit 79b1af906245558c30e0a5faf26cb52b63f83cce from qemu	2018-03-02 18:59:11 -05:00
Peter Maydell	b8b70dfcd2	Drop QEMU_GNUC_PREREQ() checks for gcc older than 4.1 We already require gcc 4.1 or newer (for the atomic support), so the fallback codepaths for older gcc versions than that are now dead code and we can just delete them. NB: clang reports itself as gcc 4.2 (regardless of clang version), so clang won't be using the fallbacks either. Backports commit fa54abb8c298f892639ffc4bc2f61448ac3be4a1 from qemu	2018-03-02 18:59:05 -05:00
Peter Maydell	008a235b5e	tcg/sparc: Zero extend address argument to ld/st helpers The C store helper functions take the address argument as a target_ulong type; if this is 32 bit but the host is 64 bit then the SPARC calling convention requires that the caller must zero extend the value. We weren't doing this, which meant we could pass values to the caller with high bits set and QEMU would crash if it was compiled with optimizations. In particular, the i386 BIOS would not start. Backports commit 5c32be5baf41aec4f4675d2bf24f9948756abf3c from qemu	2018-03-02 14:25:17 -05:00
Peter Maydell	40718df109	tcg/sparc: Zero extend data argument to store helpers The C store helper functions take the data argument as a uint8_t, uint16_t, etc depending on the store size. The SPARC calling convention requires that data types smaller than the register size must be extended by the caller. We weren't doing this, which meant that if QEMU was compiled with optimizations enabled we could end up storing incorrect values to guest memory. (In particular the i386 guest BIOS would crash on startup.) Add code to the trampolines that call the store helpers to do the zero extension as required. Backports commit 709a340d679d95a0c6cbb9b5f654498f04345b50 from qemu	2018-03-02 14:24:24 -05:00
Pranith Kumar	ee609fa59f	aarch64: Change ext type to TCGType to fix warnings To fix the following warnings: In file included from /users/pranith/qemu/tcg/tcg.c:255: /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:879:24: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_cmp(s, ext, a, b, b_const); ~~~~~~~~~~~ ^~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:893:36: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_insn(s, 3201, CBZ, ext, a, offset); ~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn' glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__) ^ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:895:37: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_insn(s, 3201, CBNZ, ext, a, offset); ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn' glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__) ^ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:1610:27: warning: implicit conversion from enumeration type 'TCGType' (aka 'enum TCGType') to different enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') [-Wenum-conversion] tcg_out_brcond(s, ext, a2, a0, a1, const_args[1], arg_label(args[3])); ~~~~~~~~~~~~~~ ^~~ backports commit dc1eccd661ada3b746ca4438e444993c36a0f04f from qemu	2018-03-02 10:48:56 -05:00
Alex Bennée	caba238b5a	tcg: enable MTTCG by default for ARM on x86 hosts This enables the multi-threaded system emulation by default for ARMv7 and ARMv8 guests using the x86_64 TCG backend. This is because on the guest side: - The ARM translate.c/translate-64.c have been converted to - use MTTCG safe atomic primitives - emit the appropriate barrier ops - The ARM machine has been updated to - hold the BQL when modifying shared cross-vCPU state - defer powerctl changes to async safe work All the host backends support the barrier and atomic primitives but need to provide same-or-better support for normal load/store operations. Backports commit ca759f9e387db87e1719911f019bc60c74be9ed8 from qemu	2018-03-02 10:32:47 -05:00
KONRAD Frederic	c5730ff194	tcg: add options for enabling MTTCG We know there will be cases where MTTCG won't work until additional work is done in the front/back ends to support. It will however be useful to be able to turn it on. As a result MTTCG will default to off unless the combination is supported. However the user can turn it on for the sake of testing. Backports commit 8d4e9146b3568022ea5730d92841345d41275d66 from qemu	2018-03-02 09:25:01 -05:00
Alex Bennée	8c89344517	tcg: move TCG_MO/BAR types into own file We'll be using the memory ordering definitions to define values for both the host and guest. To avoid fighting with circular header dependencies just move these types into their own minimal header. Backports commit 20937143145b8f5a4194e5c407731ba38797864e from qemu	2018-03-02 09:08:44 -05:00
Richard Henderson	4bec129626	tcg/i386: Handle ctpop opcode Backports commit 993508e43e6d180e9ba9b747a9657eac69aec5bb from qemu	2018-03-01 18:49:43 -05:00
Richard Henderson	3a0fba32f3	tcg/ppc: Handle ctpop opcode Backports commit 33e75fb9c8cc44165c8dad9093762ba728cc7596 from qemu	2018-03-01 18:46:43 -05:00
Richard Henderson	6d4fc1319a	tcg/ppc: Handle ctz and clz opcodes Backports commit d0b07481fabb4dc4ed05d56d09718758f5f7a136 from qemu	2018-03-01 18:44:54 -05:00
Richard Henderson	ff3512a045	tcg: Use ctpop to generate ctz if needed Particularly when andc is also available, this is two insns shorter than using clz to compute ctz. Backports commit 14e99210f6c6cede461a54b2e0f9b4cd55175f00 from qemu	2018-03-01 18:39:20 -05:00
Richard Henderson	5f6e7bbdbd	tcg: Add opcode for ctpop The number of actual invocations of ctpop itself does not warrent an opcode, but it is very helpful for POWER7 to use in generating an expansion for ctz. Backports commit a768e4e99247911f00c5c0267c12d4e207d5f6cc from qemu	2018-03-01 18:26:41 -05:00
Richard Henderson	fff7ca4617	tcg: Add helpers for clrsb The number of actual invocations does not warrent an opcode, and the backends generating it. But at least we can eliminate redundant helpers. Backports commit 086920c2c8008f125fd38781072fa25c3ad158ea from qemu	2018-03-01 18:14:11 -05:00
Richard Henderson	246d891668	tcg/i386: Handle ctz and clz opcodes Backports commit bbf25f90ba802a286fd72be9175a860ae5fec726 from qemu	2018-03-01 16:56:08 -05:00
Richard Henderson	73ab332185	tcg/i386: Allow bmi2 shiftx to have non-matching operands Previously we could not have different constraints for different ISA levels, which prevented us from eliding the matching constraint for shifts. We do now have to make sure that the operands match for constant shifts. We can also handle some small left shifts via lea. Backports commit 6a5aed4bdc7078838a8098336588d56c9ce09d1d from qemu	2018-03-01 16:45:04 -05:00
Richard Henderson	9e3feebbfb	tcg/i386: Hoist common arguments in tcg_out_op Backports commit 42d5b514928a8a0d2f55a4c243d1333f9675815b from qemu	2018-03-01 16:42:30 -05:00
Richard Henderson	142ca07077	tcg/i386: Fuly convert tcg_target_op_def Use a switch instead of searching a table. Share constraints between 32-bit and 64-bit, when at all possible. Backports commit cd26449a505f808e479af4fdd539e05767e09c06 from qemu	2018-03-01 16:32:31 -05:00
Richard Henderson	54ca83b900	tcg/s390: Handle clz opcode Backports commit ce411066f4886cf3a4981fc0a070042a221a5fc8 from qemu	2018-03-01 16:24:29 -05:00
Richard Henderson	a90e026c18	tcg/mips: Handle clz opcode Backports commit 2a1d9d41aedd722d674b2a94d9b7dbea61469cac from qemu	2018-03-01 16:22:52 -05:00
Richard Henderson	303fc987ed	tcg/arm: Handle ctz and clz opcodes Backports commit cc0fec8a4d2a8546fe236a09bfd80150af9cbe6b from qemu	2018-03-01 16:20:46 -05:00
Richard Henderson	2b87ddda35	tcg/aarch64: Handle ctz and clz opcodes Backports commit 53c76c19904983d2c81e4f5e77027c241918a479 from qemu	2018-03-01 16:19:34 -05:00
Richard Henderson	2cf34e1b55	tcg: Add clz and ctz opcodes Backports commit 0e28d0063bbd9e59a981ea2d20f82f30c5d956a8 from qemu	2018-03-01 16:04:11 -05:00
Richard Henderson	b4b173615c	tcg: Allow an operand to be matching or a constant This allows an output operand to match an input operand only when the input operand needs a register. Backports commit 17280ff4a5f264e01e55ae514ee6d3586f9577b2 from qemu	2018-03-01 15:49:05 -05:00
Richard Henderson	3f38611159	tcg: Pass the opcode width to target_parse_constraint This will let us choose how to interpret a given constraint depending on whether the opcode is 32- or 64-bit. Which will let us share more constraint combinations between opcodes. At the same time, change the interface to return the advanced pointer instead of passing it in/out by reference. Backports commit 069ea736b50b75fdec99c9b8cc603b97bd98419e from qemu	2018-03-01 15:45:40 -05:00
Richard Henderson	b8c93597b4	tcg: Transition flat op_defs array to a target callback This will allow the target to tailor the constraints to the auto-detected ISA extensions. Backports commit f69d277ece43c42c7ab0144c2ff05ba740f6706b from qemu	2018-03-01 15:40:11 -05:00
Richard Henderson	551ef0a9f7	tcg: Add markup for output requires new register This is the same concept as, and same markup as, the early clobber markup in gcc. Backports commit 82790a870992bd87d5fd9e607f40859dcf4f82ac from qemu	2018-03-01 15:24:58 -05:00
Richard Henderson	199b3859c4	tcg/optimize: Fold movcond 0/1 into setcond Backports commit 333b21b809fc80ce67c8f6a7d1c7cc66437d9791 from qemu	2018-03-01 14:41:38 -05:00
Richard Henderson	f0781470b4	tcg/s390: Support deposit into zero Since we can no longer use matching constraints, this does mean we must handle that data movement by hand. Backports commit 752b1be94757de906b9c24ebc8f5e6aa54b96b23 from qemu	2018-03-01 13:47:20 -05:00
Richard Henderson	a7462cc7bf	tcg/s390: Implement field extraction opcodes Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu	2018-03-01 13:45:33 -05:00
Richard Henderson	ab8871ea82	tcg/s390: Implement field extraction opcodes Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu	2018-03-01 13:43:46 -05:00
Richard Henderson	348802286c	tcg/s390: Expose host facilities to tcg-target.h This lets us expose facilities to TCG_TARGET_HAS_* defines directly, rather than hiding behind function calls. Backports commit b2c98d9d392c87c9b9e975d30f79924719d9cbbe from qemu	2018-03-01 13:43:00 -05:00
Richard Henderson	db41c6f1d0	tcg/ppc: Implement field extraction opcodes Backports commit c05021c3c8d6c976e4677d3010b9ef01488a4434 from qemu	2018-03-01 13:38:42 -05:00
Richard Henderson	b10a4a9ee6	tcg/mips: Implement field extraction opcodes Backports commit befbb3ced5869003ee2e806c4f36e306918d2374 from qemu	2018-03-01 13:37:24 -05:00
Richard Henderson	7a7a5c640d	tcg/i386: Implement field extraction opcodes Backports commit 78fdbfb94616f0391834d2eccabd16ea29e37da5 from qemu	2018-03-01 13:35:41 -05:00
Richard Henderson	cabb6f71a0	tcg/arm: Implement field extraction opcodes Backports commit ec903af18418e0870af84f6036d7aca1e6a5dc0a from qemu	2018-03-01 13:33:55 -05:00
Richard Henderson	c4f56ec541	tcg/arm: Move isa detection to tcg-target.h This allows us to use this detection within the TCG_TARGET_HAS_* macros, instead of requiring a function call into tcg-target.inc.c. Backports commit 40b2ccb156534f5d5f1d110a6ce008d87ee10af1 from qemu	2018-03-01 13:32:39 -05:00
Richard Henderson	fbea4130fc	tcg/aarch64: Implement field extraction opcodes Backports commit e2179f94a17bf0933df29ce1b4f6bc93cbe7dbd3 from qemu	2018-03-01 13:30:55 -05:00
Richard Henderson	9f2fcaaf27	tcg: Add deposit_z expander While we don't require a new opcode, it is handy to have an expander that knows the first source is zero. Backports commit 07cc68d52852bf47dea7c402b46ddd28248d4212 from qemu	2018-03-01 13:29:24 -05:00
Richard Henderson	8e0585dcb1	tcg: Add field extraction primitives Adds tcg_gen_extract_* and tcg_gen_sextract_* for extraction of fixed position bitfields, much like we already have for deposit. Backports commit 7ec8bab3deae643b1ce579c2d65a244f30708330 from qemu	2018-03-01 13:21:30 -05:00
Jin Guojie	4ed2a37f6d	tcg-mips: Adjust qemu_ld/st for mips64 Backports commit f0d703314ecb0415d51425727ed73ad2c6e3238a from qemu	2018-03-01 13:01:05 -05:00
Jin Guojie	25b4e11814	tcg-mips: Adjust calling conventions for mips64 Backports commit 999b941633cabf2487d9bc77ce382b3fde3cd66d from qemu	2018-03-01 12:53:42 -05:00
Jin Guojie	3de761976c	tcg-mips: Adjust prologue for mips64 Take stack frame parameters out from the function body. Backports commit 0973b1cff8b66f3561befb1f467b2ab4d1a7d55a from qemu	2018-03-01 12:51:36 -05:00
Jin Guojie	b55b7403a8	tcg-mips: Adjust load/store functions for mips64 tcg_out_ldst: using a generic ALIAS_PADD to avoid ifdefs tcg_out_ld: generates LD or LW tcg_out_st: generates SD or SW Backports commit 32b69707df3365aadaad1d058044a7704397ec62 from qemu	2018-03-01 12:50:12 -05:00
Jin Guojie	022ff3580e	tcg-mips: Adjust move functions for mips64 tcg_out_mov: using OPC_OR as most mips assemblers do; tcg_out_movi: extended to 64-bit immediate. Backports commit 2294d05dab503d11664e73712c7f250fd0bf9e3b from qemu	2018-03-01 12:49:19 -05:00
Jin Guojie	00ccf9cec7	tcg-mips: Add bswap32u and bswap64 Without the mips32r2 instructions to perform swapping, bswap is quite large, dominating the size of each reverse-endian qemu_ld/qemu_st operation. Create two subroutines in the prologue block. The subroutines require extra reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that we need not place additional restrictions on the qemu_ld outputs. Backports commit 7f54eaa3b78d71cb57e45a719980f9b5ff06d21c from qemu	2018-03-01 12:47:45 -05:00
Jin Guojie	397db1b046	tcg-mips: Support 64-bit opcodes Bulk patch adding 64-bit opcodes into tcg_out_op. Note that mips64 is as yet neither complete nor enabled. Backports commit 0119b1927d531f3fac22b9b4da01dafc23644973 from qemu	2018-03-01 12:46:18 -05:00
Jin Guojie	286f3a9f70	tcg-mips: Add mips64 opcodes Since the mips manual tables are in octal, reorg all of the opcodes into that format for clarity. Note that the 64-bit opcodes are as yet unused. Backports commit 57a701fc2b34902310d4dbd1411088055616938a from qemu	2018-03-01 12:36:20 -05:00
Jin Guojie	d2aa49e9d3	tcg-mips: Move bswap code to a subroutine Without the mips32r2 instructions to perform swapping, bswap is quite large, dominating the size of each reverse-endian qemu_ld/qemu_st operation. Create a subroutine in the prologue block. The subroutine requires extra reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that we need not place additional restrictions on the qemu_ld outputs. Backports commit bb08afe9f0aee1a3f5c23508e2511b882ca31e1b from qemu	2018-03-01 12:35:20 -05:00
Laurent Vivier	77b8b2f3b8	target-m68k: add 680x0 divu/divs variants Update helper to set the throwing location in case of div-by-0. Cleanup divX.w and add quad word variants of divX.l. Backports commit 0ccb9c1d8128a020720d5c6abf99a470742a1b94 from qemu	2018-03-01 11:38:53 -05:00
Richard Henderson	fcc05dc1ce	tcg/s390: Remove 'R' constraint Since R0 is reserved, we don't need a special case constraint. Backports commit e45d4ef6e345831c8d67a5bffe0d057efc20f4ff from qemu	2018-03-01 11:05:57 -05:00
Richard Henderson	7852cc600d	tcg/s390: Fix setcond expansion We can't use LOAD AND TEST for unsigned data and then expect to extract the result with ADD LOGICAL WITH CARRY. Fall through to using COMPARE LOGICAL IMMEDIATE instead. Backports commit 65839b56b9a740e6b898b5d81afc160502bd2935 from qemu	2018-03-01 11:04:40 -05:00
Richard Henderson	6820964e2f	tcg/aarch64: Fix tcg_out_movi There were some patterns, like 0x0000_ffff_ffff_00ff, for which we would select to begin a multi-insn sequence with MOVN, but would fail to set the 0x0000 lane back from 0xffff. Backports commit 50b468d42107a2c646b1c566ed17d9ec362c51c4 from qemu	2018-03-01 09:15:34 -05:00
Richard Henderson	a03666f2f2	tcg/aarch64: Fix addsub2 for 0+C When al == xzr, we cannot use addi/subi because that encodes xsp. Force a zero into the temp register for that (rare) case. Backports commit 028fbea47713f909d6ea761a457779a82b276247 from qemu	2018-03-01 09:13:54 -05:00
Joseph Myers	7ff441826c	tcg: correct 32-bit tcg_gen_ld8s_i64 sign-extension The version of tcg_gen_ld8s_i64 for 32-bit systems does a load into the low part of the return value - then attempts a sign extension into the high part, but wrongly sets the high part to a sign extension of itself rather than of the low part. This results in TCG internal errors from the use of the uninitialized high part (in some GCC tests of AArch64 NEON shift intrinsics, in particular). This patch corrects the sign-extension logic, making it match other functions such as tcg_gen_ld16s_i64. Backports commit 3ff91d7e85176f8b4b131163d7fd801757a2c949 from qemu	2018-03-01 08:41:23 -05:00
Peter Maydell	f9c5c1a604	tcg/tcg.h: Improve documentation of TCGv_i32 etc types The typedefs we use for the TCGv_i32, TCGv_i64 and TCGv_ptr types are somewhat confusing, because we define them as pointers to structs, but the structs themselves are never defined. Explain in the comments a bit more clearly why this is OK and what is going on under the hood. Backports commit a40d4701bc9f6e6a3bbfb7b4fbe756a5b72b5df1 from qemu	2018-03-01 08:40:35 -05:00
Richard Henderson	f5a35908da	tcg: Add tcg_gen_mulsu2_{i32,i64,tl} This multiply has one signed input and one unsigned input, producing the full double-width result. Backports commit 5087abfb7dfd1d368ae6939420057036b4d8e509 from qemu	2018-03-01 08:39:37 -05:00
Paolo Bonzini	9d64a89acf	tcg: comment on which functions have to be called with tb_lock held softmmu requires more functions to be thread-safe, because translation blocks can be invalidated from e.g. notdirty callbacks. Probably the same holds for user-mode emulation, it's just that no one has ever tried to produce a coherent locking there. This patch will guide the introduction of more tb_lock and tb_unlock calls for system emulation. Note that after this patch some (most) of the mentioned functions are still called outside tb_lock/tb_unlock. The next one will rectify this. Backports commit 7d7500d99895f888f97397ef32bb536bb0df3b74 from qemu	2018-02-28 10:26:28 -05:00
Richard Henderson	b48508a6c1	tcg: Emit barriers with parallel_cpus Backports commit 91682118aa330aff7e8ef0cc685c32d101f49940 from qemu	2018-02-27 22:28:33 -05:00
Richard Henderson	064543a415	tcg: Add CONFIG_ATOMIC64 Allow qemu to build on 32-bit hosts without 64-bit atomic ops. Even if we only allow 32-bit hosts to multi-thread emulate 32-bit guests, we still need some way to handle the 32-bit guest using a 64-bit atomic operation. Do so by dropping back to single-step. Backports commit df79b996a7b21c6ea7847f7927a2e1a294b86c72 from qemu	2018-02-27 22:25:36 -05:00
Richard Henderson	da01e53757	tcg: Add atomic128 helpers Force the use of cmpxchg16b on x86_64. Wikipedia suggests that only very old AMD64 (circa 2004) did not have this instruction. Further, it's required by Windows 8 so no new cpus will ever omit it. If we truely care about these, then we could check this at startup time and then avoid executing paths that use it. Backports commit 7ebee43ee3e2fcd7b5063058b7ef74bc43216733 from qemu	2018-02-27 21:43:48 -05:00
Richard Henderson	5c0ce1b99c	tcg: Add atomic helpers Add all of cmpxchg, op_fetch, fetch_op, and xchg. Handle both endian-ness, and sizes up to 8. Handle expanding non-atomically, when emulating in serial. Backports commit c482cb117cc418115ca9c6d21a7a2315414c0a40 from qemu	2018-02-27 15:57:47 -05:00
Richard Henderson	4e498cc54d	target-m68k: Reorg flags handling Separate all ccr bits. Continue to batch updates via cc_op. Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu	2018-02-27 10:02:02 -05:00
Paolo Bonzini	8734e13a73	tcg: try sti when moving a constant into a dead memory temp This comes from free from unifying tcg_reg_alloc_mov and tcg_reg_alloc_movi's handling of TEMP_VAL_CONST. It triggers often on moves to cc_dst, such as the following translation of "sub $0x3c,%esp": before: after: subl $0x3c,%ebp subl $0x3c,%ebp movl %ebp,0x10(%r14) movl %ebp,0x10(%r14) movl $0x3c,%ebx movl $0x3c,0x2c(%r14) movl %ebx,0x2c(%r14) Backports commit 0fe4fca4e1a5e06a270127dd80bb753d4dda61c6 from qemu	2018-02-26 10:08:47 -05:00
Alex Bennée	bf72733576	tcg/optimize: move default return out of if statement This is to appease sanitizer builds which complain that: "error: control reaches end of non-void function" Backports commit 550276ae0a88851edda2cb7fcdd64256dbb8e314 from qemu	2018-02-26 05:05:21 -05:00
Richard Henderson	2ab4b8fa4d	tcg/i386: Extend TARGET_PAGE_MASK to the proper type TARGET_PAGE_MASK, as defined, has type "int". We need to extend that to the proper target width before oring in an "unsigned". Backports commit ebb90a005da67147245cd38fb04a965a87a961b7 from qemu	2018-02-26 03:32:38 -05:00
Pranith Kumar	16d71f0f10	tcg: Optimize fence instructions This commit optimizes fence instructions. Two optimizations are currently implemented: (1) unnecessary duplicate fence instructions, and (2) merging weaker fences into a stronger fence. [rth: Merge tcg_optimize_mb back into tcg_optimize, so that we only loop over the opcode stream once. Merge "unrelated" weaker barriers into one stronger barrier.] Backports commit 34f939218ce78163171addd63750e1e0300376ab from qemu	2018-02-26 03:29:59 -05:00
Pranith Kumar	65a73763e3	tcg/sparc: Add support for fence Backports commit f8f03b3707b49898052fb8cd75ee31d19c8161fc from qemu	2018-02-26 03:20:39 -05:00
Pranith Kumar	a6fdc24e28	tcg/s390: Add support for fence Backports commit c9314d610e0e5da4d2cd5a36f3563d102b3294e0 from qemu	2018-02-26 03:19:41 -05:00
Pranith Kumar	bdd9cad15c	tcg/ppc: Add support for fence Backports commit 7b4af5ee8a1336bc39714b6de47924ee71fba761 from qemu	2018-02-26 03:18:43 -05:00
Pranith Kumar	5f10101245	tcg/mips: Add support for fence Backports commit 6f0b99104a396905870edc3049310ece29b6b8d6 from qemu	2018-02-26 03:17:34 -05:00
Pranith Kumar	e29cbe9640	tcg/arm: Add support for fence Backports commit 40f191ab8226fdada185efa49c44b60d8f494890 from qemu	2018-02-26 03:13:17 -05:00
Pranith Kumar	907060b865	tcg/aarch64: Add support for fence Backports commit c7a59c2a92592e556b9361437c9c4229917bd1e3 from qemu	2018-02-26 03:11:03 -05:00
Pranith Kumar	d49bd55f52	tcg/i386: Add support for fence Generate a 'lock orl $0,0(%esp)' instruction for ordering instead of mfence which has similar ordering semantics. Backports commit a7d00d4effb58889ac6df64f98ac50c9d1594149 from qemu	2018-02-26 03:10:58 -05:00
Pranith Kumar	5e44ce9be8	Introduce TCGOpcode for memory barrier This commit introduces the TCGOpcode for memory barrier instruction. This opcode takes an argument which is the type of memory barrier which should be generated. Backports commit f65e19bc2c9e8358e634d309606144ac2a3c2936 from qemu	2018-02-26 03:02:41 -05:00
Richard Henderson	91f5cf0417	tcg: Support arbitrary size + alignment Previously we allowed fully unaligned operations, but not operations that are aligned but with less alignment than the operation size. In addition, arm32, ia64, mips, and sparc had been omitted from the previous overalignment patch, which would have led to that alignment being enforced. Backports commit 85aa80813dd9f5c1f581c743e45678a3bee220f8 from qemu	2018-02-26 02:47:26 -05:00

... 3 4 5 6 7 ...

651 commits