unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-09-10 18:17:19 +00:00

Author	SHA1	Message	Date
Aurelien Jarno	59909fe549	tcg/optimize: track const/copy status separately Instead of using an enum which could be either a copy or a const, track them separately. This will be used in the next patch. Constants are tracked through a bool. Copies are tracked by initializing temp's next_copy and prev_copy to itself, allowing to simplify the code a bit. Backports commit b41059dd9deec367a4ccd296659f0bc5de2dc705 from qemu	2018-02-10 22:15:43 -05:00
Aurelien Jarno	134a7dfe82	tcg/optimize: add temp_is_const and temp_is_copy functions Add two accessor functions temp_is_const and temp_is_copy, to make the code more readable and make code change easier. Backports commit d9c769c60948815ee03b2684b1c1c68ee4375149 from qemu	2018-02-10 22:07:02 -05:00
Aurelien Jarno	b450b79622	tcg/optimize: optimize temps tracking The tcg_temp_info structure uses 24 bytes per temp. Now that we emulate vector registers on most guests, it's not uncommon to have more than 100 used temps. This means we have initialize more than 2kB at least twice per TB, often more when there is a few goto_tb. Instead used a TCGTempSet bit array to track which temps are in used in the current basic block. This means there are only around 16 bytes to initialize. This improves the boot time of a MIPS guest on an x86-64 host by around 7% and moves out tcg_optimize from the the top of the profiler list. Backports commit 1208d7dd5fddc1fbd98de800d17429b4e5578848 from qemu	2018-02-10 21:51:46 -05:00
Aurelien Jarno	5f67ab74e7	tcg/optimize: fix constant signedness By convention, on a 64-bit host TCG internally stores 32-bit constants as sign-extended. This is not the case in the optimizer when a 32-bit constant is folded. This doesn't seem to have more consequences than suboptimal code generation. For instance the x86 backend assumes sign-extended constants, and in some rare cases uses a 32-bit unsigned immediate 0xffffffff instead of a 8-bit signed immediate 0xff for the constant -1. This is with a ppc guest: before ------ ---- 0x9f29cc movi_i32 tmp1,$0xffffffff movi_i32 tmp2,$0x0 add2_i32 tmp0,CA,CA,tmp2,r6,tmp2 add2_i32 tmp0,CA,tmp0,CA,tmp1,tmp2 mov_i32 r10,tmp0 0x7fd8c7dfe90c: xor %ebp,%ebp 0x7fd8c7dfe90e: mov %ebp,%r11d 0x7fd8c7dfe911: mov 0x18(%r14),%r9d 0x7fd8c7dfe915: add %r9d,%r10d 0x7fd8c7dfe918: adc %ebp,%r11d 0x7fd8c7dfe91b: add $0xffffffff,%r10d 0x7fd8c7dfe922: adc %ebp,%r11d 0x7fd8c7dfe925: mov %r11d,0x134(%r14) 0x7fd8c7dfe92c: mov %r10d,0x28(%r14) after ----- ---- 0x9f29cc movi_i32 tmp1,$0xffffffffffffffff movi_i32 tmp2,$0x0 add2_i32 tmp0,CA,CA,tmp2,r6,tmp2 add2_i32 tmp0,CA,tmp0,CA,tmp1,tmp2 mov_i32 r10,tmp0 0x7f37010d490c: xor %ebp,%ebp 0x7f37010d490e: mov %ebp,%r11d 0x7f37010d4911: mov 0x18(%r14),%r9d 0x7f37010d4915: add %r9d,%r10d 0x7f37010d4918: adc %ebp,%r11d 0x7f37010d491b: add $0xffffffffffffffff,%r10d 0x7f37010d491f: adc %ebp,%r11d 0x7f37010d4922: mov %r11d,0x134(%r14) 0x7f37010d4929: mov %r10d,0x28(%r14) Backports commit 29f3ff8d6cbc28f79933aeaa25805408d0984a8f from qemu	2018-02-10 21:40:20 -05:00
Aurelien Jarno	e273acf87a	tcg/optimize: fix tcg_opt_gen_movi Due to a copy&paste, the new op value is tested against mov_i32 instead of movi_i32. The test is therefore always false. Fix that. Backports commit 961521261a3d600b0695b2e6d2b0f490076f7e90 from qemu	2018-02-10 21:38:09 -05:00
Aurelien Jarno	42dd2addbe	tcg/optimize: rename tcg_constant_folding The tcg_constant_folding folding ends up doing all the optimizations (which is a good thing to avoid looping on all ops multiple time), so make it clear and just rename it tcg_optimize. Backports commit 36e60ef6ac5d8a262d0fbeedfdb2b588514cb1ea from qemu	2018-02-10 21:36:34 -05:00
Aurelien Jarno	7b0055d742	tcg/optimize: fold constant test in tcg_opt_gen_mov Most of the calls to tcg_opt_gen_mov are preceeded by a test to check if the source temp is a constant. Fold that into the tcg_opt_gen_mov function. Backports commit 97a79eb70dd35a24fda87d86196afba5e6f21c5d from qemu	2018-02-10 21:34:00 -05:00
Aurelien Jarno	517fac57c3	tcg/optimize: fold temp copies test in tcg_opt_gen_mov Each call to tcg_opt_gen_mov is preceeded by a test to check if the source and destination temps are copies. Fold that into the tcg_opt_gen_mov function. Backports commit 5365718a9afeeabde3784d82a542f8ad909b18cf from qemu	2018-02-10 21:27:06 -05:00
Aurelien Jarno	d21f474c39	tcg/optimize: remove opc argument from tcg_opt_gen_mov We can get the opcode using the TCGOp pointer. It needs to be dereferenced, but it's anyway done a few lines below to write the new value. Backports commit 8d6a91602ea824ef4435ea38fd475387eecc098c from qemu	2018-02-10 21:23:34 -05:00
Aurelien Jarno	0fd0afad13	tcg/optimize: remove opc argument from tcg_opt_gen_movi We can get the opcode using the TCGOp pointer. It needs to be dereferenced, but it's anyway done a few lines below to write the new value. Backports commit ebd27391b00cdafc81e0541a940686137b3b48df from qemu	2018-02-10 21:21:13 -05:00
Richard Henderson	dafc44c0a5	target-mips: Use CPU_LOG_INT for logging related to interrupts There are now no unconditional uses of qemu_log in the subdirectory. Backports commit c85570163bdf1ba29cb52a63f22ff1c48f1b9398 from qemu	2018-02-10 21:12:41 -05:00
Richard Henderson	6f66fb4bd5	target-mips: Copy restrictions from ext/ins to dext/dins The checks in dins is required to avoid triggering an assertion in tcg_gen_deposit_tl. The check in dext is just for completeness. Fold the other D cases in via fallthru. Backports commit b7f26e523914b982a1c1bfa8295f77ff9787c33c from qemu	2018-02-10 21:09:26 -05:00
Richard Henderson	f5e38ea71e	tcg/aarch64: use 32-bit offset for 32-bit softmmu emulation Similar to the same fix for user-mode, except this instance occurs on the softmmu path. Again, the tlb addend must be the base register, while the guest address is the index. Backports commit 80adb8fcad4778376a11d394a9e01516819e2327 from qemu	2018-02-10 20:59:13 -05:00
Paolo Bonzini	cfc9356a8e	tcg/aarch64: use 32-bit offset for 32-bit user-mode emulation Thanks to the previous patch, it is now easy for tcg_out_qemu_ld and tcg_out_qemu_st to use a 32-bit zero extended offset. However, the guest base register x28 must be the base and addr_reg must be the index. Backports commit ffc6372851d8631a9f9fa56ec613b3244dc635b9 from qemu	2018-02-10 20:55:51 -05:00
Paolo Bonzini	85bac3c96d	tcg/aarch64: add ext argument to tcg_out_insn_3310 The new argument lets you pick uxtw or uxtx mode for the offset register. For now, all callers pass TCG_TYPE_I64 so that uxtx is generated. The bits for uxtx are removed from I3312_TO_I3310. Backports commit 6c0f0c0f124718650a8d682ba275044fc02f6fe2 from qemu	2018-02-10 20:51:37 -05:00
Richard Henderson	95e666c547	tcg/i386: Extend addresses for 32-bit guests Removing the ??? comment explaining why it (mostly) worked. Backports commit ee8ba9e4d8458b8bba5455a7ae704620c4f2ef4b from qemu	2018-02-10 20:42:33 -05:00
Richard Henderson	17c1f027c1	tcg: Handle MO_AMASK in tcg_dump_ops Backports commit 59c4b7e8dfab0cdc41434fedbf2686222f541e57 from qemu	2018-02-10 20:32:52 -05:00
Richard Henderson	c5a2a50c06	tcg: Mask TCGMemOp appropriately for indexing The addition of MO_AMASK means that places that used inverted masks need to be changed to use positive masks, and places that failed to mask the intended bits need updating. Backports commit 2b7ec66f025263a5331f37d5ad78a625496fd7bd from qemu	2018-02-10 20:29:36 -05:00
Richard Henderson	336833c11e	tcg: Add MO_ALIGN, MO_UNALN These modifiers control, on a per-memory-op basis, whether unaligned memory accesses are allowed. The default setting reflects the target's definition of ALIGNED_ONLY. Backports commit dfb36305626636e2e07e0c5acd3a002a5419399e from qemu	2018-02-10 20:18:53 -05:00
Richard Henderson	ac713c7034	tcg: Push merged memop+mmu_idx parameter to softmmu routines The extra information is not yet used but it is now available. This requires minor changes through all of the tcg backends. Backports commit 3972ef6f830d65e9bacbd31257abedc055fd6dc8 from qemu	2018-02-10 20:03:22 -05:00
Richard Henderson	6234d07489	tcg: Merge memop and mmu_idx parameters to qemu_ld/st At the tcg opcode level, not at the tcg-op.h generator level. This requires minor changes through all of the tcg backends, but none of the cpu translators. Backports commit 59227d5d45bb3c31dc2118011691c35b3c00879c from qemu	2018-02-10 19:01:49 -05:00
Richard Henderson	7532c92358	tcg/optimize: Handle or r,a,a with constant a Backports commit 2374c4b8375072da1f401c6daccc68ae76c73e63 from qemu	2018-02-09 14:56:12 -05:00
Richard Henderson	e0d99a1a06	tcg: Complete handling of ALWAYS and NEVER Missing from movcond, and brcondi_i32 (but not brcondi_i64). Backports commit 37ed3bf1ee07bb1a26adca0df8718f601f231c0b from qemu	2018-02-09 14:52:21 -05:00
Richard Henderson	6bd102ba86	tcg: Use tcg_malloc to allocate TCGLabel Pre-allocating 512 of them per TB is a waste. Backports commit 51e3972c41598adc91fe3f4767057f5198dcc15c from qemu	2018-02-09 14:48:20 -05:00
Richard Henderson	00b0a50f47	tcg: Change generator-side labels to a pointer This is less about improved type checking than enabling a subsequent change to the representation of labels. Backports commit bec1631100323fac0900aea71043d5c4e22fc2fa from qemu	2018-02-09 14:40:59 -05:00
Richard Henderson	232632e76c	tcg: Change translator-side labels to a pointer This is improved type checking for the translators -- it's no longer possible to accidentally swap arguments to the branch functions. Note that the code generating backends still manipulate labels as int. With notable exceptions, the scope of the change is just a few lines for each target, so it's not worth building extra machinery to do this change in per-target increments. Backports commit 42a268c241183877192c376d03bd9b6d527407c7 from qemu	2018-02-09 14:17:56 -05:00
Richard Henderson	255a160c66	tcg: Remove unused opcodes We no longer need INDEX_op_end to terminate the list, nor do we need 5 forms of nop, since we just remove the TCGOp instead. Backports commit 15fc7daa770764cc795158cbb525569f156f3659 from qemu	2018-02-09 13:20:41 -05:00
Richard Henderson	70f28c8bd5	tcg: Implement insert_op_before Rather reserving space in the op stream for optimization, let the optimizer add ops as necessary. Backports commit a4ce099a7a4b4734c372f6bf28f3362e370f23c1 from qemu	2018-02-09 13:11:50 -05:00
Richard Henderson	4fcaabf38c	tcg: Remove opcodes instead of noping them out With the linked list scheme we need not leave nops in the stream that we need to process later. Backports commit 0c627cdca20155753a536c51385abb73941a59a0 from qemu	2018-02-09 13:03:58 -05:00
Lioncash	0273e6ae18	tcg: Put opcodes in a linked list The previous setup required ops and args to be completely sequential, and was error prone when it came to both iteration and optimization.	2018-02-09 12:54:05 -05:00
Richard Henderson	a41b9acc0c	tcg: Introduce tcg_op_buf_count and tcg_op_buf_full The method by which we count the number of ops emitted is going to change. Abstract that away into some inlines. Backports commit fe700adb3db5b028b504423b946d4ee5200a8f2f from qemu.	2018-02-09 09:31:17 -05:00
Richard Henderson	78378289e3	tcg: Move emit of INDEX_op_end into gen_tb_end Backports commit 0a7df5da986bd7ee0789f2d7b8611f2e8eee5046 from qemu	2018-02-09 08:51:01 -05:00
Richard Henderson	4d46959c3b	tcg: Reduce ifdefs in tcg-op.c Almost completely eliminates the ifdefs in this file, improving confidence in the lesser used 32-bit builds. Backports commit 3a13c3f34ce2058e0c2decc3b0f9f56be24c9400 from qemu	2018-02-09 08:35:52 -05:00
Richard Henderson	500c546444	tcg: Move some opcode generation functions out of line Some of these functions are really quite large. We have a number of things that ought to be circularly dependent, but we duplicated code to break that chain for the inlines. This saved 25% of the code size of one of the translators I examined.	2018-02-09 08:10:00 -05:00
Richard Henderson	cb7b19ad26	tcg: Change ts->mem_reg to ts->mem_base Chain the temporaries together via pointers intstead of indices. The mem_reg value is now mem_base->reg. This will be important later. This does require that the frame pointer have a global temporary allocated for it. This is simple bar the existing reserved_regs check. Backports commit b3a62939561e07bc34493444fa926b6137cba4e8 from qemu	2018-02-08 13:04:48 -05:00
Richard Henderson	6b4b493dae	tcg: Change tcg_global_mem_new_* to take a TCGv_ptr Thus, use cpu_env as the parameter, not TCG_AREG0 directly. Update all uses in the translators. Backports commit e1ccc05444676b92c63708096e36582be27fbee1 from qemu	2018-02-08 12:33:33 -05:00
Richard Henderson	afb67fc002	target/arm: Fix aa64 ldp register writeback Backports commit 3e4d91b94ce400326fae0850578d9e9f30a71adb from qemu	2018-02-08 08:29:51 -05:00
Eric Blake	37cdcbf771	maint: Fix macros with broken 'do/while(0); ' usage	2018-02-07 20:27:37 -05:00
Lioncash	0f453b0595	target/arm: Add aa{32, 64}_vfp_{dreg, qreg} helpers Backports commit 9a2b5256ea1f68c89d5da4b54f180f576c2c82d6 from qemu	2018-02-07 10:09:26 -05:00
Lioncash	dd577f5ea5	target/arm: Change the type of vfp.regs Backports commit 3f68b8a5a6862f856524bb347bf348ae364dd43c from qemu	2018-02-07 09:57:43 -05:00
Lioncash	ef07c136b6	target/arm: Add fp16 support to vfp_expand_imm Backports commit 8081796a75414f9ed5ec3d97158e543ed45908ec from qemu.	2018-02-07 09:47:04 -05:00
Lioncash	b55f35ba92	target/arm: Split out vfp_expand_imm Backports commit e90a99fe6bde9b85bff8c052ade51520f20d9bce from qemu.	2018-02-07 09:44:52 -05:00
Lioncash	4c165ed788	translate-a64: Silence unused variable warning	2018-02-06 08:38:01 -05:00
Merry	29d38d7c22	Merge pull request #10 from lioncash/el-busto-ldst-exclusive translate-a64: Backport fix for incorrect load/store exclusive unallocated checks	2018-02-05 20:59:25 +00:00
Merry	b7bb608197	Merge pull request #9 from lioncash/ia64 tcg: Drop ia64 host support	2018-02-05 20:59:18 +00:00
Merry	82c4212ce3	Merge pull request #8 from lioncash/optimize Backport REV16 optimizations from qemu	2018-02-05 20:58:58 +00:00
Lioncash	1e451b386a	translate-a64: Backport fix for incorrect load/store exclusive unallocated checks Backports commit e14f0eb12f920fd96b9f79d15cedd437648e8667 from qemu	2018-02-04 23:17:45 -05:00
Lioncash	7f665d8c1e	tcg: Drop ia64 host support Backports commit a46c1244a0d65d5f37fc12e4d42f2479eac87b52 from qemu	2018-02-04 18:33:02 -05:00
Lioncash	5a37b8c28e	Backport optimizations to AArch32's REV16 handling Backports commit 68cedf733ae32363ccf54f0b52c8a424d5ec98ed from qemu	2018-02-04 14:53:28 -05:00
Lioncash	4a8a92bad2	Backport optimizations to AArch64's REV16 handling Backports commits abb1066df313602ef0ca631126bd342d399d5359 and e4256c3cbf7eefebc0bc6e1f472c47c6dd20b996 from qemu.	2018-02-04 14:45:39 -05:00

... 2 3 4 5 6 ...

2002 commits