unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-11-17 10:25:00 +00:00

Author	SHA1	Message	Date
Richard Henderson	31e93018f3	tcg: Allow 6 arguments to TCG helpers We already handle this in the backends, and the lifetime datum for the TCGOp is already large enough. Backports commit 1df3caa946e08b387511dfba3a37d78910e51796 from qemu	2018-03-17 18:29:04 -04:00
Lioncash	a81439c7ca	exec: Drop unnecessary code for unicorn The dirty memory code isn't strictly necessary	2018-03-12 10:11:46 -04:00
Alexey Kardashevskiy	b90333a531	memory: Share special empty FlatView This shares a cached empty FlatView among address spaces. The empty FV is used every time when a root MR renders into a FV without memory sections which happens when MR or its children are not enabled or zero-sized. The empty_view is not NULL to keep the rest of memory API intact; it also has a dispatch tree for the same reason. On POWER8 with 255 CPUs, 255 virtio-net, 40 PCI bridges guest this halves the amount of FlatView's in use (557 -> 260) and dispatch tables (~800000 -> ~370000). In an unrelated experiment with 112 non-virtio devices on x86 ("-M pc"), only 4 FlatViews are alive, and about ~2000 are created at startup. Backports commit 092aa2fc65b7a35121616aad8f39d47b8f921618 from qemu	2018-03-11 22:34:28 -04:00
Alexey Kardashevskiy	1fd8b64072	memory: Get rid of address_space_init_shareable Since FlatViews are shared now and ASes not, this gets rid of address_space_init_shareable(). This should cause no behavioural change. Backports commit b516572f31c0ea0937cd9d11d9bd72dd83809886 from qemu	2018-03-11 22:12:38 -04:00
Alexey Kardashevskiy	d9bc1bcc8c	memory: Rename mem_begin/mem_commit/mem_add helpers This renames some helpers to reflect better what they do. This should cause no behavioural change. Backports commit 8629d3fcb77e9775e44d9051bad0fb5187925eae from qemu	2018-03-11 21:36:50 -04:00
Alexey Kardashevskiy	aa2b76b4e8	memory: Switch memory from using AddressSpace to FlatView FlatView's will be shared between AddressSpace's and subpage_t and MemoryRegionSection cannot store AS anymore, hence this change. In particular, for: typedef struct subpage_t { MemoryRegion iomem; - AddressSpace *as; + FlatView *fv; hwaddr base; uint16_t sub_section[]; } subpage_t; struct MemoryRegionSection { MemoryRegion *mr; - AddressSpace *address_space; + FlatView *fv; hwaddr offset_within_region; Int128 size; hwaddr offset_within_address_space; bool readonly; }; This should cause no behavioural change. Backports commit 166206845f7fd75e720e6feea0bb01957c8da07f from qemu	2018-03-11 21:21:37 -04:00
Lioncash	1591f208c0	memory: Move AddressSpaceDispatch from AddressSpace to FlatView As we are going to share FlatView's between AddressSpace's, and AddressSpaceDispatch is a structure to perform quick lookup in FlatView, this moves ASD to FlatView. After previously open coded ASD rendering, we can also remove as->next_dispatch as the new FlatView pointer is stored on a stack and set to an AS atomically. flatview_destroy() is executed under RCU instead of address_space_dispatch_free() now. This makes mem_begin/mem_commit to work with ASD and mem_add with FV as later on mem_add will be taking FV as an argument anyway. This should cause no behavioural change. Backports commit 66a6df1dc6d5b28cc3e65db0d71683fbdddc6b62 from qemu	2018-03-11 20:40:24 -04:00
Alex Bennée	e56ed38819	include/exec/helper-head.h: support f16 in helper calls This allows us to explicitly pass float16 to helpers rather than assuming uint32_t and dealing with the result. Of course they will be passed in i32 sized registers by default. Backports commit 35737497008aeabce5dc381a41d3827bec486192 from qemu	2018-03-08 12:28:05 -05:00
Paolo Bonzini	c88064b52c	memory: remove memory_region_test_and_clear_dirty It is unused after g364fb has been converted to use DirtyBitmapSnapshot. Backports commit 77302fb5df05ffca9f41b5b54e3b67c601719d57 from qemu	2018-03-08 09:02:06 -05:00
Laurent Vivier	0aecb15f3b	accel/tcg: add size parameter in tlb_fill() The MC68040 MMU provides the size of the access that triggers the page fault. This size is set in the Special Status Word which is written in the stack frame of the access fault exception. So we need the size in m68k_cpu_unassigned_access() and m68k_cpu_handle_mmu_fault(). To be able to do that, this patch modifies the prototype of handle_mmu_fault handler, tlb_fill() and probe_write(). do_unassigned_access() already includes a size parameter. This patch also updates handle_mmu_fault handlers and tlb_fill() of all targets (only parameter, no code change). Backports commit 98670d47cd8d63a529ff230fd39ddaa186156f8c from qemu	2018-03-06 10:56:34 -05:00
Richard Henderson	7fe5f620df	tcg: Dynamically allocate TCGOps With no fixed array allocation, we can't overflow a buffer. This will be important as optimizations related to host vectors may expand the number of ops used. Use QTAILQ to link the ops together. Backports commit 15fa08f8451babc88d733bd411d4c94976f9d0f8 from qemu	2018-03-05 16:34:40 -05:00
Peter Xu	1bb34aadf9	cpu: refactor cpu_address_space_init() Normally we create an address space for that CPU and pass that address space into the function. Let's just do it inside to unify address space creations. It'll simplify my next patch to rename those address spaces. Backports commit 80ceb07a83375e3a0091591f96bd47bce2f640ce from qemu	2018-03-05 14:39:25 -05:00
Marc-André Lureau	ffa45adb57	memory: remove unused memory_region_set_global_locking() This was never used since its introduction in commit 196ea13104f8 ("memory: Add global-locking property to memory regions"). Backports commit e2fbe20851ceec5ccd7b539a89db0420393fb85d from qemu	2018-03-05 14:14:43 -05:00
Richard Henderson	d450156414	tcg: Remove GET_TCGV_* and MAKE_TCGV_* The GET and MAKE functions weren't really specific enough. We now have a full complement of functions that convert exactly between temporaries, arguments, tcgv pointers, and indices. The target/sparc change is also a bug fix, which would have affected a host that defines TCG_TARGET_HAS_extr[lh]_i64_i32, i.e. MIPS64. Backports commit dc41aa7d34989b552efe712ffe184236216f960b from qemu	2018-03-05 09:12:26 -05:00
Richard Henderson	2bb5011b18	tcg: Introduce tcgv_{i32,i64,ptr}_{arg,temp} Transform TCGv_* to an "argument" or a temporary. For now, an argument is simply the temporary index. Backports commit ae8b75dc6ec808378487064922f25f1e7ea7a9be from qemu	2018-03-05 08:46:12 -05:00
Emilio G. Cota	8552d95c52	exec-all: extract tb->tc_* into a separate struct tc_tb In preparation for adding tc.size to be able to keep track of TB's using the binary search tree implementation from glib. Backports commit e7e168f41364c6e83d0f75fc1b3ce7f9c41ccf76 from qemu	2018-03-05 02:57:22 -05:00
Emilio G. Cota	5fc83f3eb2	exec-all: introduce TB_PAGE_ADDR_FMT And fix the following warning when DEBUG_TB_INVALIDATE is enabled in translate-all.c: CC mipsn32-linux-user/accel/tcg/translate-all.o /data/src/qemu/accel/tcg/translate-all.c: In function ‘tb_alloc_page’: /data/src/qemu/accel/tcg/translate-all.c:1201:16: error: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘tb_page_addr_t {aka unsigned int}’ [-Werror=format=] printf("protecting code page: 0x" TARGET_FMT_lx "\n", ^ cc1: all warnings being treated as errors /data/src/qemu/rules.mak:66: recipe for target 'accel/tcg/translate-all.o' failed make[1]: *** [accel/tcg/translate-all.o] Error 1 Makefile:328: recipe for target 'subdir-mipsn32-linux-user' failed make: *** [subdir-mipsn32-linux-user] Error 2 cota@flamenco:/data/src/qemu/build ((18f3fe1...) *$)$ Backports commit 67a5b5d2f6eb6d3b980570223ba5c478487ddb6f from qemu	2018-03-05 02:49:44 -05:00
Emilio G. Cota	b4a7d8b773	exec-all: bring tb->invalid into tb->cflags This gets rid of a hole in struct TranslationBlock. Backports commit 84f1c148da2b35fbb5a436597872765257e8914e from qemu	2018-03-05 02:46:21 -05:00
Emilio G. Cota	210d13ec49	tcg: consolidate TB lookups in tb_lookup__cpu_state This avoids duplicating code. cpu_exec_step will also use the new common function once we integrate parallel_cpus into tb->cflags. Note that in this commit we also fix a race, described by Richard Henderson during review. Think of this scenario with threads A and B: (A) Lookup succeeds for TB in hash without tb_lock (B) Sets the TB's tb->invalid flag (B) Removes the TB from tb_htable (B) Clears all CPU's tb_jmp_cache (A) Store TB into local tb_jmp_cache Given that order of events, (A) will keep executing that invalid TB until another flush of its tb_jmp_cache happens, which in theory might never happen. We can fix this by checking the tb->invalid flag every time we look up a TB from tb_jmp_cache, so that in the above scenario, next time we try to find that TB in tb_jmp_cache, we won't, and will therefore be forced to look it up in tb_htable. Performance-wise, I measured a small improvement when booting debian-arm. 
Note that inlining pays off: Performance counter stats for 'taskset -c 0 qemu-system-arm \ -machine type=virt -nographic -smp 1 -m 4096 \ -netdev user,id=unet,hostfwd=tcp::2222-:22 \ -device virtio-net-device,netdev=unet \ -drive file=jessie.qcow2,id=myblock,index=0,if=none \ -device virtio-blk-device,drive=myblock \ -kernel kernel.img -append console=ttyAMA0 root=/dev/vda1 \ -name arm,debug-threads=on -smp 1' (10 runs): Before: 18714.917392 task-clock # 0.952 CPUs utilized ( +- 0.95% ) 23,142 context-switches # 0.001 M/sec ( +- 0.50% ) 1 CPU-migrations # 0.000 M/sec 10,558 page-faults # 0.001 M/sec ( +- 0.95% ) 53,957,727,252 cycles # 2.883 GHz ( +- 0.91% ) [83.33%] 24,440,599,852 stalled-cycles-frontend # 45.30% frontend cycles idle ( +- 1.20% ) [83.33%] 16,495,714,424 stalled-cycles-backend # 30.57% backend cycles idle ( +- 0.95% ) [66.66%] 76,267,572,582 instructions # 1.41 insns per cycle 12,692,186,323 branches # 678.186 M/sec ( +- 0.92% ) [83.35%] 263,486,879 branch-misses # 2.08% of all branches ( +- 0.73% ) [83.34%] 19.648474449 seconds time elapsed ( +- 0.82% ) After, w/ inline (this patch): 18471.376627 task-clock # 0.955 CPUs utilized ( +- 0.96% ) 23,048 context-switches # 0.001 M/sec ( +- 0.48% ) 1 CPU-migrations # 0.000 M/sec 10,708 page-faults # 0.001 M/sec ( +- 0.81% ) 53,208,990,796 cycles # 2.881 GHz ( +- 0.98% ) [83.34%] 23,941,071,673 stalled-cycles-frontend # 44.99% frontend cycles idle ( +- 0.95% ) [83.34%] 16,161,773,848 stalled-cycles-backend # 30.37% backend cycles idle ( +- 0.76% ) [66.67%] 75,786,269,766 instructions # 1.42 insns per cycle 12,573,617,143 branches # 680.708 M/sec ( +- 1.34% ) [83.33%] 260,235,550 branch-misses # 2.07% of all branches ( +- 0.66% ) [83.33%] 19.340502161 seconds time elapsed ( +- 0.56% ) After, w/o inline: 18791.253967 task-clock # 0.954 CPUs utilized ( +- 0.78% ) 23,230 context-switches # 0.001 M/sec ( +- 0.42% ) 1 CPU-migrations # 0.000 M/sec 10,563 page-faults # 0.001 M/sec ( +- 1.27% ) 54,168,674,622 
cycles # 2.883 GHz ( +- 0.80% ) [83.34%] 24,244,712,629 stalled-cycles-frontend # 44.76% frontend cycles idle ( +- 1.37% ) [83.33%] 16,288,648,572 stalled-cycles-backend # 30.07% backend cycles idle ( +- 0.95% ) [66.66%] 77,659,755,503 instructions # 1.43 insns per cycle 12,922,780,045 branches # 687.702 M/sec ( +- 1.06% ) [83.34%] 261,962,386 branch-misses # 2.03% of all branches ( +- 0.71% ) [83.35%] 19.700174670 seconds time elapsed ( +- 0.56% ) Backports commit f6bb84d53110398f4899c19dab4e0fe9908ec060 from qemu	2018-03-05 02:42:46 -05:00
Emilio G. Cota	68ddc0cb08	exec-all: fix typos in TranslationBlock's documentation Backports commit eb5e2b9e3b141de0c435eedc31c26cbbdefbee1b from qemu	2018-03-05 02:10:28 -05:00
Richard Henderson	31b8b67cd3	tcg: Move USE_DIRECT_JUMP discriminator to tcg/cpu/tcg-target.h Replace the USE_DIRECT_JUMP ifdef with a TCG_TARGET_HAS_direct_jump boolean test. Replace the tb_set_jmp_target1 ifdef with an unconditional function tb_target_set_jmp_target. While we're touching all backends, add a parameter for tb->tc_ptr; we're going to need it shortly for some backends. Move tb_set_jmp_target and tb_add_jump from exec-all.h to cpu-exec.c. Backports commit a85833933628384d74ec412024d55cf012640287 from qemu	2018-03-04 21:52:35 -05:00
Lluís Vilanova	ed7225e685	tcg: Add generic translation framework Backports commit bb2e0039dc07177f928f9fe24758967da02d60a2 from qemu	2018-03-04 14:31:16 -05:00
Paolo Bonzini	6997a5a090	gen-icount: check cflags instead of use_icount global Backports commit cd42d5b23691ad73edfd6dbcfc935a960a9c5a65 from qemu	2018-03-04 14:26:26 -05:00
Lluís Vilanova	3a196c62ae	target: [tcg] Use a generic enum for DISAS_ values Used later. An enum makes expected values explicit and bounds the value space of switches. Backports commit 77fc6f5e28667634916f114ae04c6029cd7b9c45 from qemu	2018-03-04 14:08:43 -05:00
Richard Henderson	b8a16f841a	tcg: Add generic DISAS_NORETURN This will allow some amount of cleanup to happen before switching the backends over to enum DisasJumpType. Backports commit 5dc66895b0113034cd37fd5e65911d7959fc26a9 from qemu	2018-03-04 13:49:18 -05:00
Peter Maydell	26c8f31d9e	memory.h: Move MemTxResult type to memattrs.h Move the MemTxResult type to memattrs.h. We're going to want to use it in cpu/qom.h, which doesn't want to include all of memory.h. In practice MemTxResult and MemTxAttrs are pretty closely linked since both are used for the new-style read_with_attrs and write_with_attrs callbacks, so memattrs.h is a reasonable home for this rather than creating a whole new header file for it. Backports commit 3114d092b1740f9db9aa559aeb48ee387011e1da from qemu	2018-03-04 13:10:47 -05:00
Alexey Kardashevskiy	e723b8dd49	memory: Open code FlatView rendering We are going to share FlatView's between AddressSpace's and per-AS memory listeners won't suit the purpose anymore so open code the dispatch tree rendering. Since there is a good chance that dispatch_listener was the only listener, this avoids address_space_update_topology_pass() if there are no registered listeners; this should improve starting time. This should cause no behavioural change. Backports commit 1b04a1580917d9e41fd37ca62cbff9b4bf061e96 from qemu	2018-03-04 02:06:48 -05:00
Lluís Vilanova	32b3c3815d	tcg: Pass generic CPUState to gen_intermediate_code() Needed to implement a target-agnostic gen_intermediate_code() in the future. Backports commit 9c489ea6bed134fecfd556b439c68bba48fbe102 from qemu	2018-03-03 23:34:18 -05:00
Richard Henderson	fc52eea5e2	tcg: Expand glue macros before stringifying helper names Backports commit 44368ac62dc5ba014b68b2c1a8ec6fedc3242a5d from qemu	2018-03-03 23:07:21 -05:00
Alex Bennée	7d02489baf	include/exec/exec-all: document common exit conditions As a precursor to later patches attempt to come up with a more concrete wording for what each of the common exit cases would be. Backports commit df0311e634828fdc99ca59352aef68503d631aad from qemu	2018-03-03 22:31:28 -05:00
Peter Maydell	3bd5694a0a	memory: Rename memory_region_init_rom() and _rom_device() to _nomigrate() Rename memory_region_init_rom() to memory_region_init_rom_nomigrate() and memory_region_init_rom_device() to memory_region_init_rom_device_nomigrate(). Backports commit b59821a95bd1d7cb4697fd7748725c910582e0e7 from qemu	2018-03-03 22:29:01 -05:00
Peter Maydell	7b0027a828	memory: Rename memory_region_init_ram() to memory_region_init_ram_nomigrate() Rename memory_region_init_ram() to memory_region_init_ram_nomigrate(). This leaves the way clear for us to provide a memory_region_init_ram() which does handle migration. Backports commit 1cfe48c1ce219b60a9096312f7a61806fae64ab3 from qemu	2018-03-03 22:25:39 -05:00
Peter Maydell	152c56f6a9	memory: Document that the RAM MR initializers do not handle migration The various functions for initializing RAM MemoryRegions do not do anything to cause the data in the MemoryRegion to be migrated. Note in their documentation comments that this is the responsibility of the caller. (We will shortly add a new function that does do this for you.) Backports commit a5c0234bb2754f5248e67929a34c843dbe039da5 from qemu	2018-03-03 22:20:32 -05:00
Pranith Kumar	d0a70720a3	Revert "exec.c: Fix breakpoint invalidation race" Now that we have proper locking after MTTCG patches have landed, we can revert the commit. This reverts commit a9353fe897ca2687e5b3385ed39e3db3927a90e0. Backports commit 406bc339b0505fcfc2ffcbca1f05a3756e338a65 from qemu	2018-03-03 22:14:35 -05:00
Yang Zhong	1135db176f	tcg: add CONFIG_TCG guards in headers Add CONFIG_TCG around TLB-related functions and structure declarations. Some of these functions are defined in ./accel/tcg/cputlb.c, which will not be linked in if TCG is disabled, and have no stubs; therefore, their callers will also be compiled out for --disable-tcg. Backports commit b11ec7f2e44b285a3967d629b55d1a6970b06787 from qemu	2018-03-03 21:37:52 -05:00
Yang Zhong	d70c141675	tcg: move page_size_init() function translate-all.c will be disabled if tcg is disabled in the build, so page_size_init() function and related variables will be moved to exec.c file. Backports commit a0be0c585f5dcc4d50a37f6a20d3d625c5ef3a2c from qemu	2018-03-03 21:30:08 -05:00
Thomas Huth	cf5d583ef0	cpu: Introduce a wrapper for tlb_flush() that can be used in common code Commit 1f5c00cfdb8114c ("qom/cpu: move tlb_flush to cpu_common_reset") moved the call to tlb_flush() from the target-specific reset handlers into the common code qom/cpu.c file, and protected the call with "#ifdef CONFIG_SOFTMMU" to avoid that it is called for linux-user only targets. But since qom/cpu.c is common code, CONFIG_SOFTMMU is never defined here, so the tlb_flush() was simply never executed anymore. Fix it by introducing a wrapper for tlb_flush() in a file that is re-compiled for each target, i.e. in translate-all.c. Backports commit 2cd53943115be5118b5b2d4b80ee0a39c94c4f73 from qemu	2018-03-03 21:24:55 -05:00
Emilio G. Cota	1a4e5da043	gen-icount: use tcg_ctx.tcg_env instead of cpu_env We are relying on cpu_env being defined as a global, yet most targets (i.e. all but arm/a64) have it defined as a local variable. Luckily all of them use the same "cpu_env" name, but really compilation shouldn't break if the name of that local variable changed. Fix it by using tcg_ctx.tcg_env, which all targets set in their translate_init function. This change also helps paving the way for the upcoming "translation loop common to all targets" work. Backports commit 53f6672bcf57d82b794a2cc3a3469be7d35c8653 from qemu	2018-03-03 21:08:58 -05:00
Richard Henderson	68275ba6f3	tcg/arm: Use indirect branch for goto_tb Backports commit 3fb53fb4d12f2e7833bd1659e6013237b130ef20 from qemu	2018-03-03 17:11:18 -05:00
Emilio G. Cota	d3ada2feb5	tcg: allocate TB structs before the corresponding translated code Allocating an arbitrarily-sized array of tbs results in either (a) a lot of memory wasted or (b) unnecessary flushes of the code cache when we run out of TB structs in the array. An obvious solution would be to just malloc a TB struct when needed, and keep the TB array as an array of pointers (recall that tb_find_pc() needs the TB array to run in O(log n)). Perhaps a better solution, which is implemented in this patch, is to allocate TB's right before the translated code they describe. This results in some memory waste due to padding to have code and TBs in separate cache lines--for instance, I measured 4.7% of padding in the used portion of code_gen_buffer when booting aarch64 Linux on a host with 64-byte cache lines. However, it can allow for optimizations in some host architectures, since TCG backends could safely assume that the TB and the corresponding translated code are very close to each other in memory. See this message by rth for a detailed explanation: https://lists.gnu.org/archive/html/qemu-devel/2017-03/msg05172.html Subject: Re: GSoC 2017 Proposal: TCG performance enhancements Backports commit 6e3b2bfd6af488a896f7936e99ef160f8f37e6f2 from qemu	2018-03-03 17:05:49 -05:00
Emilio G. Cota	7d0440dec4	tb-hash: improve tb_jmp_cache hash function in user mode Optimizations to cross-page chaining and indirect branches make performance more sensitive to the hit rate of tb_jmp_cache. The constraint of reserving some bits for the page number lowers the achievable quality of the hashing function. However, user-mode does not have this requirement. Thus, with this change we use for user-mode a hashing function that is both faster and of better quality than the previous one. Measurements: Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0. - SPECint06 (test set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz [ASCII bar chart not reproduced: speedup over baseline, y-axis 0.8x to 2.2x; series jr, jr+multhash, jr+hash; x-axis astar, bzip2, gcc, gobmk, h264ref, hmmer, libquantum, mcf, omnetpp, perlbench, sjeng, xalancbmk, hmean] png: http://imgur.com/4UXTrEc Here I also tried the hash function suggested by Paolo ("multhash"): return ((uint64_t) (pc * 2654435761) >> 32) & (TB_JMP_CACHE_SIZE - 1); As you can see it is just as good as the other new function ("hash"), which is what I ended up going with. - SPECint06 (train set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz [ASCII bar chart not reproduced: speedup over baseline, y-axis 1x to 2.6x; series jr, jr+hash; same x-axis benchmarks as the test set] png: http://imgur.com/ArCbHqo - NBench, x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz [ASCII bar chart not reproduced: speedup over baseline, y-axis 0.96x to 1.12x; series jr, jr+hash; x-axis labels include ASSIGNMENT, BITFIELD, FOURIER, FP EMULATION, HUFFMAN, LU DECOMPOSITION, NEURAL NET, NUMERIC SORT, STRING SORT, hmean] png: http://imgur.com/ZXFX0hJ - NBench, arm-linux-user. Host: Intel i7-4790K @ 4.00GHz [ASCII bar chart not reproduced: speedup over baseline, y-axis 0.9x to 1.3x; series jr, jr+hash; same x-axis labels as the x86_64 NBench chart] png: http://imgur.com/FfD27ey Backports commit 6f1653180f5701c6a8f1b35b89a80b1e3260928e from qemu	2018-03-03 14:11:29 -05:00
Emilio G. Cota	8f4f15e5f5	tcg: Introduce goto_ptr opcode and tcg_gen_lookup_and_goto_ptr Instead of exporting goto_ptr directly to TCG frontends, export tcg_gen_lookup_and_goto_ptr(), which calls goto_ptr with the pointer returned by the lookup_tb_ptr() helper. This is the only use case we have for goto_ptr and lookup_tb_ptr, so having this function is very convenient. Furthermore, it trivially allows us to avoid calling the lookup helper if goto_ptr is not implemented by the backend. Backports commit cedbcb01529cb6cf9a2289cdbebbc63f6149fc18 from qemu	2018-03-02 21:05:18 -05:00
Peter Xu	fce1b469e5	memory: tune last param of iommu_ops.translate() This patch converts the old "is_write" bool into IOMMUAccessFlags. The difference is that "is_write" can only express either read/write, but sometimes what we really want is "none" here (neither read nor write). Replay is a good example - during replay, we should not check any RW permission bits since that's not an actual IO at all. Backports commit bf55b7afce53718ef96f4e6616da62c0ccac37dd from qemu	2018-03-02 18:59:12 -05:00
Paolo Bonzini	c27870520a	exec: revert MemoryRegionCache MemoryRegionCache did not know about virtio support for IOMMUs (because the two features were developed at the same time). Revert MemoryRegionCache to "normal" address_space_* operations for 2.9, as it is simpler than undoing the virtio patches. Backports commit 90c4fe5fc517a045e7a7cf2f23472e114042ca29 from qemu	2018-03-02 14:30:41 -05:00
Dr. David Alan Gilbert	55d79cf4c0	RAMBlocks: qemu_ram_is_shared Provide a helper to say whether a RAMBlock was created as a shared mapping. Backports commit 463a4ac23bcf0f0b65c850fa66f5ae6e43edd243 from qemu	2018-03-02 13:05:35 -05:00
Dr. David Alan Gilbert	5dfbee8930	memory_region: Fix name comments The 'name' parameter to memory_region_init_* had been marked as debug only, however vmstate_region_ram uses it as a parameter to qemu_ram_set_idstr to set RAMBlock names and these form part of the migration stream. Backports commit e8f5fe2de125a0bfbefbaa6a69af81f4817cb7a0 from qemu	2018-03-02 13:01:23 -05:00
Yongji Xie	23f5b17a08	memory: Introduce DEVICE_HOST_ENDIAN for ram device At the moment ram device's memory regions are DEVICE_NATIVE_ENDIAN. It's incorrect. This memory region is backed by a MMIO area in host, so the uint64_t data that MemoryRegionOps read from/write to this area should be host-endian rather than target-endian. Hence, current code does not work when target and host endianness are different which is the most common case on PPC64. To fix it, this introduces DEVICE_HOST_ENDIAN for the ram device. This has been tested on PPC64 BE/LE host/guest in all possible combinations including TCG. Backports commit c99a29e702528698c0ce2590f06ca7ff239f7c39 from qemu	2018-03-02 11:24:32 -05:00
Alex Bennée	454932263c	cputlb and arm/sparc targets: convert mmuidx flushes from varg to bitmap While the vargs approach was flexible the original MTTCG ended up having to munge the bits to a bitmap so the data could be used in deferred work helpers. Instead of hiding that in cputlb we push the change to the API to make it take a bitmap of MMU indexes instead. For ARM some of the resulting flushes end up being quite long so to aid readability I've tended to move the index shifting to a new line so all the bits being or-ed together line up nicely, for example: tlb_flush_page_by_mmuidx(other_cs, pageaddr, (1 << ARMMMUIdx_S1SE1) | (1 << ARMMMUIdx_S1SE0)); Backports commit 0336cbf8532935d8e23c2aabf3e2ce2c0697b6ac from qemu	2018-03-02 10:12:40 -05:00
Alex Bennée	e3e57ca08e	cputlb: drop flush_global flag from tlb_flush We have never had the concept of global TLB entries which would avoid the flush so we never actually use this flag. Drop it and make clear that tlb_flush is the sledge-hammer it has always been. Backports commit d10eb08f5d8389c814b554d01aa2882ac58221bf from qemu	2018-03-01 19:36:04 -05:00
Jason Wang	29932d0719	memory: handle alias in memory_region_is_iommu() Backports commit 12d37882f0c0def5dee1c21be5d8fea9c21baada from qemu	2018-03-01 13:06:18 -05:00
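
The "multhash" variant quoted in the tb-hash commit message (7d0440dec4) can be illustrated with a small standalone sketch. This is not code from the repository: the `sketch_` names and the 12-bit cache size are assumptions chosen for illustration, and the function only mirrors the one-line expression quoted in the message (multiply the pc by 2654435761, keep bits from the upper half of the 64-bit product, mask to the cache size).

```c
#include <stdint.h>

/* Hypothetical cache size for this sketch only; the real
 * TB_JMP_CACHE_SIZE is defined by the emulator, not here. */
#define SKETCH_JMP_CACHE_BITS 12
#define SKETCH_JMP_CACHE_SIZE (1u << SKETCH_JMP_CACHE_BITS)

/* Multiplicative hash in the shape quoted in the commit message:
 * ((uint64_t)(pc * 2654435761) >> 32) & (SIZE - 1).
 * The multiplication wraps modulo 2^64, which is well defined
 * for unsigned types in C. */
static inline unsigned sketch_multhash(uint64_t pc)
{
    uint64_t product = pc * UINT64_C(2654435761);
    return (unsigned)(product >> 32) & (SKETCH_JMP_CACHE_SIZE - 1);
}
```

A caller would use the result as an index into a table of SKETCH_JMP_CACHE_SIZE entries. Note that the commit message says the author ended up going with the other new function ("hash"), not this one.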

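The move from a variadic list of MMU indexes to a bitmap, described in the cputlb commit (454932263c), can be sketched as follows. Everything here is hypothetical: the enum values, the `sketch_` function, and the `uint16_t` map width are assumptions for illustration and do not reproduce the actual cputlb API. The sketch only shows the idea of or-ing `1 << index` bits together and having the callee walk the set bits.

```c
#include <stdint.h>

/* Hypothetical MMU index numbering, for this sketch only. */
enum {
    SKETCH_MMU_IDX_A = 0,
    SKETCH_MMU_IDX_B = 1,
    SKETCH_MMU_IDX_C = 2,
    SKETCH_NB_MMU_MODES = 3
};

/* Walk the bitmap and record which modes would be flushed.
 * Returns the number of modes selected by idxmap. */
static unsigned sketch_flush_by_mmuidx(uint16_t idxmap,
                                       int flushed[SKETCH_NB_MMU_MODES])
{
    unsigned count = 0;
    for (int i = 0; i < SKETCH_NB_MMU_MODES; i++) {
        flushed[i] = (idxmap >> i) & 1;   /* 1 if bit i is set */
        count += (unsigned)flushed[i];
    }
    return count;
}
```

A call such as `sketch_flush_by_mmuidx((1 << SKETCH_MMU_IDX_A) | (1 << SKETCH_MMU_IDX_C), flushed)` selects two modes. Because the argument is a plain integer rather than a varargs list, it can be stored and handed to deferred work, which is the motivation the commit message gives.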

217 commits