unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-23 18:15:30 +00:00

Author	SHA1	Message	Date
Richard Henderson	2a4a7b9391	tcg: Use tlb_fill probe from tlb_vaddr_to_host Most of the existing users would continue around a loop which would fault the tlb entry in via a normal load/store. But for AArch64 SVE we have an existing emulation bug wherein we would mark the first element of a no-fault vector load as faulted (within the FFR, not via exception) just because we did not have its address in the TLB. Now we can properly only mark it as faulted if there really is no valid, readable translation, while still not raising an exception. (Note that beyond the first element of the vector, the hardware may report a fault for any reason whatsoever; with at least one element loaded, forward progress is guaranteed.) Backports commit 4811e9095c0491bc6f5450e5012c9c4796b9e59d from qemu	2019-05-16 18:27:03 -04:00
Laurent Vivier	8cdfed1032	linux-user: fix 32bit g2h()/h2g() sparc32plus has 64bit long type but only 32bit virtual address space. For instance, "apt-get upgrade" failed because of a mmap()/msync() sequence. mmap() returned 0xff252000 but msync() used g2h(0xffffffffff252000) to find the host address. The "(target_ulong)" in g2h() doesn't fix the address because it is 64bit long. This patch introduces an "abi_ptr" that is set to uint32_t if the virtual address space is addressed using 32bit in the linux-user case. It stays set to target_ulong with softmmu case. Backports commit 3e23de15237c81fe7af7c3ffa299a6ae5fec7d43 from qemu	2019-05-16 18:20:55 -04:00
Richard Henderson	dab0061a0d	tcg: Use CPUClass::tlb_fill in cputlb.c We can now use the CPUClass hook instead of a named function. Create a static tlb_fill function to avoid other changes within cputlb.c. This also isolates the asserts within. Remove the named tlb_fill function from all of the targets. Backports commit c319dc13579a92937bffe02ad2c9f1a550e73973 from qemu	2019-05-16 17:35:37 -04:00
Richard Henderson	8fdd009a9d	tcg: Remove CF_IGNORE_ICOUNT Now that we have curr_cflags, we can include CF_USE_ICOUNT early and then remove it as necessary. Backports commit 416986d3f97329655e30da7271a2d11c6d707b06 from qemu	2019-05-06 00:57:09 -04:00
Richard Henderson	12f9def3a2	tcg: Add CF_LAST_IO + CF_USE_ICOUNT to CF_HASH_MASK These flags are used by target/*/translate.c, and affect code generation. Backports commit 0cf8a44c2f56ba884c2f6db47d27fbb24975daa3 from qemu	2019-05-06 00:53:35 -04:00
Richard Henderson	4a858100f4	tcg: Include CF_COUNT_MASK in CF_HASH_MASK Backports commit cdfef1715c779eb528d633e8b76cbc8a10e71ac8 from qemu	2019-05-04 22:31:32 -04:00
Richard Henderson	ee1ddf4a92	tcg: define CF_PARALLEL and use it for TB hashing along with CF_COUNT_MASK This will enable us to decouple code translation from the value of parallel_cpus at any given time. It will also help us minimize TB flushes when generating code via EXCP_ATOMIC. Note that the declaration of parallel_cpus is brought to exec-all.h to be able to define there the "curr_cflags" inline. Backports commit 4e2ca83e71b51577b06b1468e836556912bd5b6e from qemu	2019-05-04 22:22:06 -04:00
Richard Henderson	bca82cde84	tcg: Hoist max_insns computation to tb_gen_code In order to handle TB's that translate to too much code, we need to place the control of the length of the translation in the hands of the code gen master loop. Backports commit 8b86d6d25807e13a63ab6ea879f976b9f18cc45a from qemu	2019-04-30 09:49:57 -04:00
Lioncash	5daabe55a4	cputlb: Synchronize with qemu Synchronizes the code with Qemu to reduce a few differences.	2019-04-26 15:48:45 -04:00
Lioncash	70836028eb	exec/helper-*: Synchronize with qemu	2019-04-22 08:22:49 -04:00
Lioncash	0379335677	cpu_ldst: Remove unused macros	2019-04-22 08:17:20 -04:00
Peter Maydell	ff9c67b8f0	cpu_ldst.h: Don't define helpers if MMU_MODE_SUFFIX not defined Not all targets define a full set of suffix strings for the NB_MMU_MODES that they have. In this situation, don't define any helper functions for that mode, rather than defining helper functions with no suffix at all. The MMU mode is still functional; it is merely not directly accessible via cpu_ld_MODE from target helper functions. Also add an "NB_MMU_MODES >= 2" check to the definition of the mode 1 helpers -- some targets only define one MMU mode. Backports commit de5ee4a888667ca0a198f0743d70075d70564117 from qemu	2019-04-22 07:44:32 -04:00
Lioncash	e75b32ca4b	cpu_ldst.h, cpu-all.h, bswap.h: Update documentation on ld/st accessors Add documentation of what the cpu__ accessors look like. Correct some minor errors in the existing documentation of the direct _p accessor family. Remove the near-duplicate comment on the _p accessors from cpu-all.h and replace it with a reference to the comment in bswap.h. Backports commit db5fd8d709fd57f4d4f11edfca9f421f657f4508 from qemu	2019-04-22 07:39:13 -04:00
Peter Maydell	84eafc0cf6	cpu_ldst_template.h: Drop unused cpu_ldfq/stfq/ldfl/stfl accessors The cpu_ldfq/stfq/ldfl/stfl accessors for loading and storing float32 and float64 are completely unused, so delete them. (The union they use for converting from the float32/float64 type to uint32_t or uint64_t is the wrong way to do it anyway: they should be using make_float* and float*_val.) Backports commit 82f11917c99e3c7fa3d6aa98572ecc98c7324c2f from qemu	2019-04-22 07:21:03 -04:00
Peter Maydell	32650e7816	cpu_ldst.h: Drop unused _raw macros, saddr() and laddr() The _raw macros and their helpers saddr() and laddr() are now totally unused -- delete them. Backports commit 800e2ecc896beb6b79e7333c762da163b6a9135a from qemu	2019-04-22 07:19:20 -04:00
Peter Maydell	f1a1f3c642	cpu_ldst_template.h: Use ld_p directly rather than via ld_raw macros The ld_raw and st_raw macros are now only used within the code produced by cpu_ldst_template.h, and only in three places. Expand these out to just call the ld_p and st_p functions directly. Note that in all the callsites the address argument is a uintptr_t, so we can drop that part of the double-cast used in the saddr() and laddr() macros. Backports commit 355392329e4a843580e53cb027ed85e0cbebb640 from qemu	2019-04-22 07:11:50 -04:00
Peter Maydell	1a880ef99b	cpu_ldst.h: Use inline functions for usermode cpu_ld/st accessors Use inline functions rather than macros for cpu_ld/st accessors for the -user configurations, as we already do for softmmu. This has a two advantages: we can actually typecheck our arguments * we don't need to leak the _raw macros everywhere Since the _kernel functions were only used by target-i386/seg_helper.c, put the definitions for them in that file too. (It already has the similar template include code to define them for the softmmu case, so it makes sense to have it deal with defining them for user-only.) Backports commit 9220fe54c679d145232a28df6255e166ebf91bab from qemu	2019-04-22 07:08:39 -04:00
Peter Maydell	4fe3b4f95c	cpu_ldst.h: Remove unused very short ld/st defines The very short ld/st defines are now not used anywhere; delete them. Backports commit 177ea79f65c90b3bc84d59565b7519e47ea02f63 from qemu	2019-04-22 06:57:28 -04:00
Peter Maydell	36cd9f0df0	cpu_ldst.h: Drop unused ld/st_kernel defines The ld_kernel and st*_kernel defines are not used anywhere; delete them. Backports commit 5a0826f7d2f9bea6e02157985b103d0a4c458aaa from qemu	2019-04-22 06:54:26 -04:00
Lioncash	830756a725	gen-icount: Use tcg_ctx where applicable in commented out code If this is ever used in the future, it'll already be able to be used.	2019-04-22 06:17:10 -04:00
Lioncash	d844d7cc9d	exec: Backport tb_cflags accessor	2019-04-22 06:12:59 -04:00
Lioncash	9f0e469142	gen-icount: Synchronize with qemu	2019-04-22 05:53:46 -04:00
David Hildenbrand	8583c8f1f6	include/exec/helper-head.h: support "const void " in helper calls Especially when dealing with out-of-line gvec helpers, it is often helpful to specify some vector pointers as constant. E.g. when we have two inputs and one output, marking the two inputs as consts pointers helps to avoid bugs. Const pointers can be specified via "cptr", however behave in TCG just like ordinary pointers. We can specify helpers like: DEF_HELPER_FLAGS_4(gvec_vbperm, TCG_CALL_NO_RWG, void, ptr, cptr, cptr, i32) void HELPER(gvec_vbperm)(void v1, const void v2, const void v3, uint32_t desc) And make sure that here, only v1 will be written (as long as const is not casted away, of course). Backports commit 8c6edfdd90522caa4fc429144d393aba5b99f584 from qemu	2019-02-22 19:12:09 -05:00
Emilio G. Cota	1b44fd94ac	exec-all: document that tlb_fill can trigger a TLB resize Backports commit ae56a2ff92ac73782279abf8857585c34b15f509 from qemu	2019-02-12 11:38:28 -05:00
Richard Henderson	9c2a5963d0	exec: Add target-specific tlb bits to MemTxAttrs These bits can be used to cache target-specific data in cputlb read from the page tables. Backports commit d3765835ed02f91f0c6cbb452874209a6af4a730 from qemu	2019-02-05 17:00:56 -05:00
Lioncash	29d84a9296	target: Resolve repeated typedef warnings	2019-01-22 20:27:35 -05:00
Richard Henderson	80b4bef1cc	tcg: Add TCG_CALL_NO_RETURN Remember which helpers have been marked noreturn. Backports commit 15d7409260498505e991e7b9d87118627165e613 from qemu	2019-01-05 06:35:21 -05:00
Emilio G. Cota	308f4c1e0c	include: move exec/tb-hash-xx.h to qemu/xxhash.h Backports commit fe656e3185fa10973d43492c867643e80fa433cd from qemu	2018-12-18 06:07:55 -05:00
Emilio G. Cota	63082a4d20	exec: introduce qemu_xxhash{2,4,5,6,7} Before moving them all to include/qemu/xxhash.h. Backports commit c971d8fa73ff92996d751fa87d90f220cf3c8194 from qemu	2018-12-18 06:04:57 -05:00
Peter Maydell	1301becdab	tcg: Support MMU protection regions smaller than TARGET_PAGE_SIZE Add support for MMU protection regions that are smaller than TARGET_PAGE_SIZE. We do this by marking the TLB entry for those pages with a flag TLB_RECHECK. This flag causes us to always take the slow-path for accesses. In the slow path we can then special case them to always call tlb_fill() again, so we have the correct information for the exact address being accessed. This change allows us to handle reading and writing from small regions; we cannot deal with execution from the small region. Backports commit 55df6fcf5476b44bc1b95554e686ab3e91d725c5 from qemu	2018-11-16 21:35:54 -05:00
Lioncash	3a0ab1a64a	Partial backport of: exec.c: Handle IOMMUs in address_space_translate_for_iotlb() We just want the parameter changes here. Partial backport of commit 1f871c5e6b0f30644a60a81a6a7aadb3afb030ac from qemu	2018-11-16 21:24:55 -05:00
Marc-André Lureau	fc354aa464	memory: learn about non-volatile memory region Add a new flag to mark memory region that are used as non-volatile, by NVDIMM for example. That bit is propagated down to the flat view, and reflected in HMP info mtree with a "nv-" prefix on the memory type. This way, guest_phys_blocks_region_add() can skip the NV memory regions for dumps and TCG memory clear in a following patch. Backports commit c26763f8ec70b1011098cab0da9178666d8256a5 from qemu	2018-11-11 08:50:39 -05:00
Li Qiang	b79f16c331	memory.h: fix typos in comments Backports commit 847b31f0d608bfcbc9ea11d5013ae62e956f32cd from qemu	2018-11-11 07:31:35 -05:00
Emilio G. Cota	1677898a09	cputlb: read CPUTLBEntry.addr_write atomically Updates can come from other threads, so readers that do not take tlb_lock must use atomic_read to avoid undefined behaviour (UB). This completes the conversion to tlb_lock. This conversion results on average in no performance loss, as the following experiments (run on an Intel i7-6700K CPU @ 4.00GHz) show. 1. aarch64 bootup+shutdown test: - Before: Performance counter stats for 'taskset -c 0 ../img/aarch64/die.sh' (10 runs): 7487.087786 task-clock (msec) # 0.998 CPUs utilized ( +- 0.12% ) 31,574,905,303 cycles # 4.217 GHz ( +- 0.12% ) 57,097,908,812 instructions # 1.81 insns per cycle ( +- 0.08% ) 10,255,415,367 branches # 1369.747 M/sec ( +- 0.08% ) 173,278,962 branch-misses # 1.69% of all branches ( +- 0.18% ) 7.504481349 seconds time elapsed ( +- 0.14% ) - After: Performance counter stats for 'taskset -c 0 ../img/aarch64/die.sh' (10 runs): 7462.441328 task-clock (msec) # 0.998 CPUs utilized ( +- 0.07% ) 31,478,476,520 cycles # 4.218 GHz ( +- 0.07% ) 57,017,330,084 instructions # 1.81 insns per cycle ( +- 0.05% ) 10,251,929,667 branches # 1373.804 M/sec ( +- 0.05% ) 173,023,787 branch-misses # 1.69% of all branches ( +- 0.11% ) 7.474970463 seconds time elapsed ( +- 0.07% ) 2. SPEC06int: SPEC06int (test set) [Y axis: Speedup over master] 1.15 +-+----+------+------+------+------+------+-------+------+------+------+------+------+------+----+-+ \| \| 1.1 +-+.................................+++.............................+ tlb-lock-v2 (m+++x) +-+ \| +++ \| +++ tlb-lock-v3 (spinl\|ck) \| \| +++ \| \| +++ +++ \| \| \| 1.05 +-+....+++...........####.........\|####.+++.\|......\|.....###....+++...........+++....###.........+-+ \| ### ++#\| # \|# \|# *### +++### +++#+# \| +++ \| #\|# ### \| 1 +-++++#++++####+++#++#++++++++++#++#++++#++++#+#+**+#++++###++++###++++###++++#+#++++#+#+++-+ \| +* # #++# * # #### * # * ++# **+# \| * # ***\|# \|# # #\|# #+# # # \| 0.95 +-+....#....#..#.\|..#...#..#.\|..#....#.\|..#.++.#.+++#.**.#....#+#....#.#..++#.#..+-+ \| * # # # \| # # # \| # * * # ++ # * * # * * # * \|* # ++# # # # *** # \| \| * * # ++# # + # # # \| # * * # * * # * * # * * # ++ # **** # ++# # * * # \| 0.9 +-+....#...\|#..#....#.++#..#.\|..#....#....#....#....#....#..\|.#...\|#.#....#..+-+ \| * * # *** # * * # \|# # + # * * # * * # * * # * * # * * # ++ # \|# # * * # \| 0.85 +-+....#..\|..#....#.**..#....#....#....#....#....#....#....#.**.#....#..+-+ \| * # + # * * # \| # * * # * * # * * # * * # * * # * * # * * # * \|* # * * # \| \| * * # * * # * * # + # * * # * * # * * # * * # * * # * * # * * # * \|* # * * # \| 0.8 +-+....#.....#....#....#....#....#....#....#....#....#....#.++.#....#..+-+ \| * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.75 +-+--*##--###-###-###-###-###-*##-##-##-##-##-##--*##--+-+ 400.perlben401.bzip2403.gcc429.m445.gob456.hmme45462.libqua464.h26471.omnet473483.xalancbmkgeomean png: https://imgur.com/a/BHzpPTW Notes: - tlb-lock-v2 corresponds to an implementation with a mutex. - tlb-lock-v3 corresponds to the current implementation, i.e. a spinlock and a single lock acquisition in tlb_set_page_with_attrs. Backports commit 403f290c0603f35f2d09c982bf5549b6d0803ec1 from qemu	2018-10-23 15:37:43 -04:00
Richard Henderson	c911ea7128	tcg: Add tlb_index and tlb_entry helpers Isolate the computation of an index from an address into a helper before we change that function. Backports commit 383beda9cf32f795616c3b93f7d6154d70372d4b from qemu	2018-10-23 15:04:27 -04:00
Emilio G. Cota	dfb3954571	exec: introduce tlb_init Paves the way for the addition of a per-TLB lock. Backports commit 5005e2537d090bee87aca3b924dcd17920fd146a from qemu	2018-10-23 14:41:29 -04:00
Peter Maydell	01683fe97e	memory: Remove old_mmio accessors Now that all the users of old_mmio MemoryRegion accessors have been converted, we can remove the core code support. Backports commit 62a0db942dec6ebfec19aac2b604737d3c9a2d75 from qemu	2018-10-04 04:45:30 -04:00
Junyan He	6ead2c3d1f	memory, exec: Expose all memory block related flags. We need to use these flags in other files rather than just in exec.c, For example, RAM_SHARED should be used when create a ram block from file. We expose them the exec/memory.h Backports commit b0e5de93811077254a536c23b713b49e12efb742 from qemu	2018-08-22 13:00:05 -04:00
Peter Maydell	6543f9ea26	tcg: Define and use new tlb_hit() and tlb_hit_page() functions The condition to check whether an address has hit against a particular TLB entry is not completely trivial. We do this in various places, and in fact in one place (get_page_addr_code()) we have got the condition wrong. Abstract it out into new tlb_hit() and tlb_hit_page() inline functions (one for a known-page-aligned address and one for an arbitrary address), and use them in all the places where we had the condition correct. This is a no-behaviour-change patch; we leave fixing the buggy code in get_page_addr_code() to a subsequent patch Backports commit 334692bce7f0653a93b8d84ecde8c847b08dec38 from qemu	2018-07-03 19:21:36 -04:00
Peter Maydell	8295b228e3	bswap: Add new stn__p() and ldn__p() memory access functions There's a common pattern in QEMU where a function needs to perform a data load or store of an N byte integer in a particular endianness. At the moment this is handled by doing a switch() on the size and calling the appropriate ld_p or st_p function for each size. Provide a new family of functions ldn__p() and stn__p() which take the size as an argument and do the switch() themselves. Backports commit afa4f6653dca095f63f3fe7f2001e9334f5676c1 from qemu	2018-06-15 12:17:21 -04:00
Peter Maydell	61a7ac6948	cpu-defs.h: Document CPUIOTLBEntry 'addr' field The 'addr' field in the CPUIOTLBEntry struct has a rather non-obvious use; add a comment documenting it (reverse-engineered from what the code that sets it is doing). Backports commit ace4109011b4912b24e76f152e2cf010e78819c5 from qemu	2018-06-15 12:07:39 -04:00
Peter Maydell	7a6ae26346	cputlb: Pass cpu_transaction_failed() the correct physaddr The API for cpu_transaction_failed() says that it takes the physical address for the failed transaction. However we were actually passing it the offset within the target MemoryRegion. We don't currently have any target CPU implementations of this hook that require the physical address; fix this bug so we don't get confused if we ever do add one. Backports commit 2d54f19401bc54b3b56d1cc44c96e4087b604b97 from qemu	2018-06-15 12:03:23 -04:00
Richard Henderson	10e2b13650	tcg: Pass tb and index to tcg_gen_exit_tb separately Do the cast to uintptr_t within the helper, so that the compiler can type check the pointer argument. We can also do some more sanity checking of the index argument. Backports commit 07ea28b41830f946de3841b0ac61a3413679feb9 from qemu	2018-06-07 11:56:32 -04:00
Richard Henderson	533a3f6a6c	tcg: Fix helper function vs host abi for float16 Depending on the host abi, float16, aka uint16_t, values are passed and returned either zero-extended in the host register or with garbage at the top of the host register. The tcg code generator has so far been assuming garbage, as that matches the x86 abi, but this is incorrect for other host abis. Further, target/arm has so far been assuming zero-extended results, so that it may store the 16-bit value into a 32-bit slot with the high 16-bits already clear. Rectify both problems by mapping "f16" in the helper definition to uint32_t instead of (a typedef for) uint16_t. This forces the host compiler to assume garbage in the upper 16 bits on input and to zero-extend the result on output. Backports commit 6c2be133a7478e443c99757b833d0f265c48e0a6 from qemu	2018-06-02 10:10:12 -04:00
Richard Henderson	1730d3cff0	target/arm: Implement SVE Integer Multiply-Add Group Backports commit 96a36e4a44bbf296ac212ed68ebf4e48d3dfb1f0 from qemu	2018-05-20 04:35:36 -04:00
Emilio G. Cota	d26bf1d446	translator: merge max_insns into DisasContextBase While at it, use int for both num_insns and max_insns to make sure we have same-type comparisons. Backports commit b542683d77b4f56cef0221b267c341616d87bce9 from qemu	2018-05-11 13:59:17 -04:00
Pavel Dovgalyuk	b4bf3c776b	icount: fix cpu_restore_state_from_tb for non-tb-exit cases In icount mode, instructions that access io memory spaces in the middle of the translation block invoke TB recompilation. After recompilation, such instructions become last in the TB and are allowed to access io memory spaces. When the code includes instruction like i386 'xchg eax, 0xffffd080' which accesses APIC, QEMU goes into an infinite loop of the recompilation. This instruction includes two memory accesses - one read and one write. After the first access, APIC calls cpu_report_tpr_access, which restores the CPU state to get the current eip. But cpu_restore_state_from_tb resets the cpu->can_do_io flag which makes the second memory access invalid. Therefore the second memory access causes a recompilation of the block. Then these operations repeat again and again. This patch moves resetting cpu->can_do_io flag from cpu_restore_state_from_tb to cpu_loop_exit* functions. It also adds a parameter for cpu_restore_state which controls restoring icount. There is no need to restore icount when we only query CPU state without breaking the TB. Restoring it in such cases leads to the incorrect flow of the virtual time. In most cases new parameter is true (icount should be recalculated). But there are two cases in i386 and openrisc when the CPU state is only queried without the need to break the TB. This patch fixes both of these cases. Backports commit afd46fcad2dceffda35c0586f5723c127b6e09d8 from qemu	2018-04-11 20:05:40 -04:00
Alex Bennée	4074587775	accel/tcg/translate-all: expand cpu_restore_state addr check We are still seeing signals during translation time when we walk over a page protection boundary. This expands the check to ensure the host PC is inside the code generation buffer. The original suggestion was to check versus tcg_ctx.code_gen_ptr but as we now segment the translation buffer we have to settle for just a general check for being inside. I've also fixed up the declaration to make it clear it can deal with invalid addresses. A later patch will fix up the call sites. Backports commit d25f2a72272b9ffe0d06710d6217d1169bc2cc7d from qemu	2018-04-11 19:53:57 -04:00
Bharata B Rao	e373c001fa	cpu: Add Error argument to cpu_exec_init() Add an Error argument to cpu_exec_init() to let users collect the error. This is in preparation to change the CPU enumeration logic in cpu_exec_init(). With the new enumeration logic, cpu_exec_init() can fail if cpu_index values corresponding to max_cpus have already been handed out. Since all current callers of cpu_exec_init() are from instance_init, use error_abort Error argument to abort in case of an error. Backports commit 5a790cc4b942e651fec7edc597c19b637fad5a76 from qemu	2018-03-21 07:50:33 -04:00
Peter Crosthwaite	ce1831bfb4	target-*: Don't redefine cpu_exec() This function needs to be converted to QOM hook and virtualised for multi-arch. This rename interferes, as cpu-qom will not have access to the renaming causing name divergence. This rename doesn't really do anything anyway so just delete it. Backports commit 8642c1b81e0418df066a7960a7426d85a923a253 from qemu	2018-03-20 07:02:47 -04:00

1 2 3 4 5 ...

267 commits