unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-24 17:35:39 +00:00

Author	SHA1	Message	Date
Richard Henderson	f0781470b4	tcg/s390: Support deposit into zero Since we can no longer use matching constraints, this does mean we must handle that data movement by hand. Backports commit 752b1be94757de906b9c24ebc8f5e6aa54b96b23 from qemu	2018-03-01 13:47:20 -05:00
Richard Henderson	a7462cc7bf	tcg/s390: Implement field extraction opcodes Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu	2018-03-01 13:45:33 -05:00
Richard Henderson	ab8871ea82	tcg/s390: Implement field extraction opcodes Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu	2018-03-01 13:43:46 -05:00
Richard Henderson	348802286c	tcg/s390: Expose host facilities to tcg-target.h This lets us expose facilities to TCG_TARGET_HAS_* defines directly, rather than hiding behind function calls. Backports commit b2c98d9d392c87c9b9e975d30f79924719d9cbbe from qemu	2018-03-01 13:43:00 -05:00
Richard Henderson	db41c6f1d0	tcg/ppc: Implement field extraction opcodes Backports commit c05021c3c8d6c976e4677d3010b9ef01488a4434 from qemu	2018-03-01 13:38:42 -05:00
Richard Henderson	b10a4a9ee6	tcg/mips: Implement field extraction opcodes Backports commit befbb3ced5869003ee2e806c4f36e306918d2374 from qemu	2018-03-01 13:37:24 -05:00
Richard Henderson	7a7a5c640d	tcg/i386: Implement field extraction opcodes Backports commit 78fdbfb94616f0391834d2eccabd16ea29e37da5 from qemu	2018-03-01 13:35:41 -05:00
Richard Henderson	cabb6f71a0	tcg/arm: Implement field extraction opcodes Backports commit ec903af18418e0870af84f6036d7aca1e6a5dc0a from qemu	2018-03-01 13:33:55 -05:00
Richard Henderson	c4f56ec541	tcg/arm: Move isa detection to tcg-target.h This allows us to use this detection within the TCG_TARGET_HAS_* macros, instead of requiring a function call into tcg-target.inc.c. Backports commit 40b2ccb156534f5d5f1d110a6ce008d87ee10af1 from qemu	2018-03-01 13:32:39 -05:00
Richard Henderson	fbea4130fc	tcg/aarch64: Implement field extraction opcodes Backports commit e2179f94a17bf0933df29ce1b4f6bc93cbe7dbd3 from qemu	2018-03-01 13:30:55 -05:00
Richard Henderson	9f2fcaaf27	tcg: Add deposit_z expander While we don't require a new opcode, it is handy to have an expander that knows the first source is zero. Backports commit 07cc68d52852bf47dea7c402b46ddd28248d4212 from qemu	2018-03-01 13:29:24 -05:00
Richard Henderson	8e0585dcb1	tcg: Add field extraction primitives Adds tcg_gen_extract_* and tcg_gen_sextract_* for extraction of fixed position bitfields, much like we already have for deposit. Backports commit 7ec8bab3deae643b1ce579c2d65a244f30708330 from qemu	2018-03-01 13:21:30 -05:00
Jason Wang	29932d0719	memory: handle alias in memory_region_is_iommu() Backports commit 12d37882f0c0def5dee1c21be5d8fea9c21baada from qemu	2018-03-01 13:06:18 -05:00
Jason Wang	fdca6292a1	exec: introduce address_space_get_iotlb_entry() This patch introduces a helper to query the iotlb entry for a possible iova. This will be used by later device IOTLB API to enable the capability for a dataplane (e.g vhost) to query the IOTLB. Backports commit 052c8fa9983f553fdfa0d61034774070dd639c2b from qemu	2018-03-01 13:05:08 -05:00
Richard Henderson	efad2631d2	translate-all: Avoid -Werror=switch-bool gcc 5.3.0 diagnoses translate-all.c: In function ‘alloc_code_gen_buffer’: translate-all.c:756:17: error: switch condition has boolean value switch (buf2 != MAP_FAILED) { ^ Backports commit f68808c7494b38764e1895a9852b994638b86536 from qemu	2018-03-01 13:01:50 -05:00
Jin Guojie	4ed2a37f6d	tcg-mips: Adjust qemu_ld/st for mips64 Backports commit f0d703314ecb0415d51425727ed73ad2c6e3238a from qemu	2018-03-01 13:01:05 -05:00
Jin Guojie	25b4e11814	tcg-mips: Adjust calling conventions for mips64 Backports commit 999b941633cabf2487d9bc77ce382b3fde3cd66d from qemu	2018-03-01 12:53:42 -05:00
Jin Guojie	3de761976c	tcg-mips: Adjust prologue for mips64 Take stack frame parameters out from the function body. Backports commit 0973b1cff8b66f3561befb1f467b2ab4d1a7d55a from qemu	2018-03-01 12:51:36 -05:00
Jin Guojie	b55b7403a8	tcg-mips: Adjust load/store functions for mips64 tcg_out_ldst: using a generic ALIAS_PADD to avoid ifdefs tcg_out_ld: generates LD or LW tcg_out_st: generates SD or SW Backports commit 32b69707df3365aadaad1d058044a7704397ec62 from qemu	2018-03-01 12:50:12 -05:00
Jin Guojie	022ff3580e	tcg-mips: Adjust move functions for mips64 tcg_out_mov: using OPC_OR as most mips assemblers do; tcg_out_movi: extended to 64-bit immediate. Backports commit 2294d05dab503d11664e73712c7f250fd0bf9e3b from qemu	2018-03-01 12:49:19 -05:00
Jin Guojie	00ccf9cec7	tcg-mips: Add bswap32u and bswap64 Without the mips32r2 instructions to perform swapping, bswap is quite large, dominating the size of each reverse-endian qemu_ld/qemu_st operation. Create two subroutines in the prologue block. The subroutines require extra reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that we need not place additional restrictions on the qemu_ld outputs. Backports commit 7f54eaa3b78d71cb57e45a719980f9b5ff06d21c from qemu	2018-03-01 12:47:45 -05:00
Jin Guojie	397db1b046	tcg-mips: Support 64-bit opcodes Bulk patch adding 64-bit opcodes into tcg_out_op. Note that mips64 is as yet neither complete nor enabled. Backports commit 0119b1927d531f3fac22b9b4da01dafc23644973 from qemu	2018-03-01 12:46:18 -05:00
Jin Guojie	286f3a9f70	tcg-mips: Add mips64 opcodes Since the mips manual tables are in octal, reorg all of the opcodes into that format for clarity. Note that the 64-bit opcodes are as yet unused. Backports commit 57a701fc2b34902310d4dbd1411088055616938a from qemu	2018-03-01 12:36:20 -05:00
Jin Guojie	d2aa49e9d3	tcg-mips: Move bswap code to a subroutine Without the mips32r2 instructions to perform swapping, bswap is quite large, dominating the size of each reverse-endian qemu_ld/qemu_st operation. Create a subroutine in the prologue block. The subroutine requires extra reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that we need not place additional restrictions on the qemu_ld outputs. Backports commit bb08afe9f0aee1a3f5c23508e2511b882ca31e1b from qemu	2018-03-01 12:35:20 -05:00
Laurent Vivier	5d4a5e9ba7	target-m68k: free TCG variables that are not This is a cleanup patch. It adds call to tcg_temp_free() when it is missing. Backports commit 2b5e2170678af36df48ab4b05dff81fe40b41a65 from qemu	2018-03-01 12:27:43 -05:00
Laurent Vivier	74beed5a4d	target-m68k: add rol/ror/roxl/roxr instructions Backports commit 0194cf31cfc84516d10eca354146673150e10410 from qemu	2018-03-01 12:21:02 -05:00
Richard Henderson	cf9424d60d	target-m68k: Inline shifts Also manage word and byte operands and fix the computation of overflow in the case of M68000 arithmetic shifts. Backports commit 367790cce8e14131426f5190dfd7d1bdbf656e4d from qemu	2018-03-01 12:11:10 -05:00
Richard Henderson	0c00b036be	target-m68k: Do not cpu_abort on undefined insns Report this properly via exception and, importantly, allow the disassembler the chance to tell us what insn is not handled. Backports commit 72d2e4b6a437f11f97d3138f6b2ec177b78210c7 from qemu	2018-03-01 12:02:39 -05:00
Laurent Vivier	90b0b6d867	target-m68k: Implement 680x0 movem 680x0 movem can load/store words and long words and can use more addressing modes. Coldfire can only use long words with (Ax) and (d16,Ax) addressing modes. Backports commit 7b542eb96d7d5d9266a9c0425f05d49c8e6df2f9 from qemu	2018-03-01 12:01:46 -05:00
Laurent Vivier	2d318da080	target-m68k: add cas/cas2 ops Implement CAS using cmpxchg. Implement CAS2 using helper and either cmpxchg when the 32bit addresses are consecutive, or with parallel_cpus+cpu_loop_exit_atomic() otherwise. Backports commit 14f944063affbcc7bd6df42b060793dbfee8a822 from qemu	2018-03-01 11:56:26 -05:00
Laurent Vivier	f602f9d40c	target-m68k: add abcd/sbcd/nbcd Backports commit fb5543d820018a46b713911e7653594be727ca98 from qemu	2018-03-01 11:51:49 -05:00
Laurent Vivier	90b87b3580	target-m68k: add 680x0 divu/divs variants Update helper to set the throwing location in case of div-by-0. Cleanup divX.w and add quad word variants of divX.l. Backports commit 0ccb9c1d8128a020720d5c6abf99a470742a1b94 from qemu	2018-03-01 11:42:30 -05:00
Laurent Vivier	77b8b2f3b8	target-m68k: add 680x0 divu/divs variants Update helper to set the throwing location in case of div-by-0. Cleanup divX.w and add quad word variants of divX.l. Backports commit 0ccb9c1d8128a020720d5c6abf99a470742a1b94 from qemu	2018-03-01 11:38:53 -05:00
Laurent Vivier	f3990e8f87	target-m68k: add 64bit mull Backports commit 8be95defd6ab10d2c9f986879a0afa82417cb8e5 from qemu	2018-03-01 11:32:59 -05:00
Laurent Vivier	e1c7d37556	target-m68k: add cmpm Backports commit 817af1c72d227fd5759ef882bef61acee40679b1 from qemu	2018-03-01 11:29:35 -05:00
Richard Henderson	e6ca471dda	target-m68k: Split gen_lea and gen_ea Provide gen_lea_mode and gen_ea_mode, where the mode can be specified manually, rather than taken from the instruction. Backports commit f84aab269ddab8509b77408b886e9071bf5c48fb from qemu	2018-03-01 11:27:43 -05:00
Richard Henderson	5541553e8d	target-m68k: Delay autoinc writeback Backports commit 8a1e52b69d2cd1c633f3c473a213d575931bf46d from qemu	2018-03-01 11:18:36 -05:00
Cédric Le Goater	c7ab1e782b	target-arm: Add VBAR support to ARM1176 CPUs ARM1176 CPUs have TrustZone support and can use the Vector Base Address Register, but currently, qemu only adds VBAR support to ARMv7 CPUs. Fix this by adding a new feature ARM_FEATURE_VBAR which can used for ARMv7 and ARM1176 CPUs. The VBAR feature is always set for ARMv7 because some legacy boards require it even if this is not architecturally correct. Backports commit 91db4642f868cf2e591b62d31a19d35b02ea791e from qemu	2018-03-01 11:12:29 -05:00
Peter Maydell	554ad1f34e	target-arm: Log AArch64 exception returns We already log exception entry; add logging of the AArch64 exception return path as well. Backports commit c9b61d9aa1ad234b0961f8add023cdc999cda3da from qemu	2018-03-01 11:08:35 -05:00
Julian Brown	86670028f7	Correct value of ARM Cortex-A8 MVFR1 register. The value of the MVFR1 (Media and VFP Feature Register 1) register for the Cortex-A8 appears to be incorrect (according to the TRM, DDI0344K), with the "full denormal arithmetic" and "propagation of NaN" fields holding both 0 instead of both 1. I had a go tracing the history of the use of this value, and it seems it's always just been wrong in QEMU: maybe it was derived from early documentation, or guessed based on the use of a "VFP Lite" implementation in the Cortex-A8. Depending on the startup/early-boot code in use, this can manifest as failure to perform denormal arithmetic properly: in our case, selecting a Cortex-A8 CPU when using QEMU as an instruction-set simulator for bare-metal GCC testing caused tests using denormal arithmetic to fail. Problems might be masked (or not occur) when using a full OS kernel with suitable trap handlers (I'm not sure). Backports commit 0f1944735b6bac810b067e8a7a5154744536fd59 from qemu	2018-03-01 11:07:08 -05:00
Richard Henderson	fcc05dc1ce	tcg/s390: Remove 'R' constraint Since R0 is reserved, we don't need a special case constraint. Backports commit e45d4ef6e345831c8d67a5bffe0d057efc20f4ff from qemu	2018-03-01 11:05:57 -05:00
Richard Henderson	7852cc600d	tcg/s390: Fix setcond expansion We can't use LOAD AND TEST for unsigned data and then expect to extract the result with ADD LOGICAL WITH CARRY. Fall through to using COMPARE LOGICAL IMMEDIATE instead. Backports commit 65839b56b9a740e6b898b5d81afc160502bd2935 from qemu	2018-03-01 11:04:40 -05:00
Kirill A. Shutemov	eb489625b5	x86: implement la57 paging mode The new paging more is extension of IA32e mode with more additional page table level. It brings support of 57-bit vitrual address space (128PB) and 52-bit physical address space (4PB). The structure of new page table level is identical to pml4. The feature is enumerated with CPUID.(EAX=07H, ECX=0):ECX[bit 16]. CR4.LA57[bit 12] need to be set when pageing enables to activate 5-level paging mode. Backports commit 6c7c3c21f95dd9af8a0691c0dd29b07247984122 from qemu	2018-03-01 11:02:07 -05:00
Doug Evans	7c874b1b2b	target-i386: Fix eflags.TF/#DB handling of syscall/sysret insns The syscall and sysret instructions behave a bit differently: TF is checked after the instruction completes. This allows the o/s to disable #DB at a syscall by adding TF to FMASK. And then when the sysret is executed the #DB is taken "as if" the syscall insn just completed. Backports commit c52ab08aee6f7d4717fc6b517174043126bd302f from qemu	2018-03-01 10:56:22 -05:00
Yi Sun	f6e624d97b	target-i386: Add Intel SHA_NI instruction support. Add SHA_NI feature bit. Its spec can be found at: https://software.intel.com/sites/default/files/managed/39/c5/325462-sdm-vol-1-2abcd-3abcd.pdf Backports commit 638cbd452d3a92a2ab18caee73078483d90f64eb from qemu	2018-03-01 10:52:54 -05:00
Paolo Bonzini	81ad780e5e	exec: introduce MemoryRegionCache Device models often have to perform multiple access to a single memory region that is known in advance, but would to use "DMA-style" functions instead of address_space_map/unmap. This can happen for example when the data has to undergo endianness conversion. Introduce a new data structure to cache the result of address_space_translate without forcing usage of a host address like address_space_map does. Backports commit 1f4e496e1fc2eb6c8bf377a0f9695930c380bfd3 from qemu	2018-03-01 10:50:30 -05:00
Paolo Bonzini	03a9bbb3d3	exec: introduce address_space_extend_translation This extracts the common part of address_space_map and address_space_cache_init into a new function. Backports commit 715c31ec8e12107f47ac74b464c97e813c76f898 from qemu	2018-03-01 10:06:43 -05:00
Paolo Bonzini	88ad0f4f6e	exec: introduce memory_ldst.inc.c Templatize the address_space_* and *_phys functions, so that we can add similar functions in the next patch that work with a lightweight, cache-like version of address_space_map/unmap. Backports commit 0ce265ffef87f19f4dd1ff0663e09a63d66ae408 from qemu	2018-03-01 09:59:34 -05:00
Paolo Bonzini	f3cc489a72	exec: optimize remaining address_space_* cases Do them right before the next patch generalizes them into a multi-included file. Backports commit 2651efe7f5f9d6dc89c8e54d7d63952b7b22597d from qemu	2018-03-01 09:40:29 -05:00
Richard Henderson	0f94929fa7	target-arm: Fix aarch64 vec_reg_offset Since CPUARMState.vfp.regs is not 16 byte aligned, the ^ 8 fixup used for a big-endian host doesn't do what's intended. Fix this by adding in the vfp.regs offset after computing the inter-register offset. Backports commit d437262fa8edd0d9fbe038a515dda3dbf7c5bb54 from qemu	2018-03-01 09:36:03 -05:00
Richard Henderson	ce9dca9c5e	target-arm: Fix aarch64 disas_ldst_single_struct We add s->be_data within do_vec_ld/st. Adding it here means that we have the wrong bits set in SIZE for a big-endian host, leading to g_assert_not_reached in write_vec_element and read_vec_element. Backports commit 74b13f92c2428abae41a61c46a5cf47545da5fcb from qemu	2018-03-01 09:34:34 -05:00
Paolo Bonzini	560515941a	target-i386: correctly propagate retaddr into SVM helpers Commit 2afbdf8 ("target-i386: exception handling for memory helpers", 2015-09-15) changed tlb_fill's cpu_restore_state+raise_exception_err to raise_exception_err_ra. After this change, the cpu_restore_state and raise_exception_err's cpu_loop_exit are merged into raise_exception_err_ra's cpu_loop_exit_restore. This actually fixed some bugs, but when SVM is enabled there is a second path from raise_exception_err_ra to cpu_loop_exit. This is the VMEXIT path, and now cpu_vmexit is called without a cpu_restore_state before. The fix is to pass the retaddr to cpu_vmexit (via cpu_svm_check_intercept_param). All helpers can now use GETPC() to pass the correct retaddr, too. Backports commit 823fb688ebc52a7d79c1308acb28c92b56820167 from qemu	2018-03-01 09:31:16 -05:00
Richard Henderson	f6a72d4dca	target/sparc: Restore ldstub of odd asis Fixes the booting of ss20 roms. Backports commit f61f76cb3ff9fea6ecbcb9696ed82b3e2c5b7364 from qemu	2018-03-01 09:21:44 -05:00
Paolo Bonzini	9404dbf74e	cpu-exec: fix icount out-of-bounds access When icount is active, tb_add_jump is surprisingly called with an out of bounds basic block index. I have no idea how that can work, but it does not seem like a good idea. Clear *last_tb for all TB_EXIT_ICOUNT_EXPIRED cases, even when all you have to do is refill icount_extra. Backports commit d8dea6fbcbed177ca5d23ab77b3834a9437f0e88 from qemu	2018-03-01 09:17:26 -05:00
Richard Henderson	6820964e2f	tcg/aarch64: Fix tcg_out_movi There were some patterns, like 0x0000_ffff_ffff_00ff, for which we would select to begin a multi-insn sequence with MOVN, but would fail to set the 0x0000 lane back from 0xffff. Backports commit 50b468d42107a2c646b1c566ed17d9ec362c51c4 from qemu	2018-03-01 09:15:34 -05:00
Richard Henderson	a03666f2f2	tcg/aarch64: Fix addsub2 for 0+C When al == xzr, we cannot use addi/subi because that encodes xsp. Force a zero into the temp register for that (rare) case. Backports commit 028fbea47713f909d6ea761a457779a82b276247 from qemu	2018-03-01 09:13:54 -05:00
Roman Kapl	337f57dd2c	exec: Add missing rcu_read_unlock rcu_read_unlock was not called if the address_space_access_valid result is negative. This caused (at least) a problem when qemu on PPC/E500+TAP failed to terminate properly and instead got stuck in a deadlock. Backports commit 662a97d74f9b34cafe9aeb6d96620a97d768a1fa from qemu	2018-03-01 09:12:27 -05:00
Peter Maydell	488b6cc82b	exec.c: Fix breakpoint invalidation race A bug (1647683) was reported showing a crash when removing breakpoints. The reproducer was bisected to 3359baad when tb_flush was finally made thread safe. While in MTTCG the locking in breakpoint_invalidate would have prevented any problems, but currently tb_lock() is a NOP for system emulation. The race is between a tb_flush from the gdbstub and the tb_invalidate_phys_addr() in breakpoint_invalidate(). Ideally we'd have actual locking here; for the moment the simple fix is to do a full tb_flush() for a bp invalidate, since that is thread-safe even if no lock is taken. Backports commit a9353fe897ca2687e5b3385ed39e3db3927a90e0 from qemu	2018-03-01 09:10:59 -05:00
Alex Bennée	f4c65739f3	target-arm/translate-a64: fix gen_load_exclusive While testing rth's latest TCG patches with risu I found ldaxp was broken. Investigating further I found it was broken by 1dd089d0 when the cmpxchg atomic work was merged. As part of that change the code attempted to be clever by doing a single 64 bit load and then shuffle the data around to set the two 32 bit registers. As I couldn't quite follow the endian magic I've simply partially reverted the change to the original code gen_load_exclusive code. This doesn't affect the cmpxchg functionality as that is all done on in gen_store_exclusive part which is untouched. I've also restored the comment that was removed (with a slight tweak to mention cmpxchg). Backports commit 5460da501a57cd72eda6fec736d76539122e2f99 from qemu	2018-03-01 09:09:16 -05:00
Marc-André Lureau	b0e5d04813	qapi: add missing colon-ending for section name The documentation parser we are going to add expects a section name to end with ':', otherwise the comment is treated as free-form text body. Backports commit 5072f7b38b1b9b26b8fbe1a89086386a420aded8 from qemu	2018-03-01 09:07:10 -05:00
Yongbok Kim	8575514f4c	target-mips: fix bad shifts in {dextp\|dextpdp} Fixed issues in the MIPSDSP64 instructions dextp and dextpdp. Shifting can go out of 32 bit range. https://bugs.launchpad.net/qemu/+bug/1631625 Backports commit e6e2784cacd4cfec149a7690976b9ff15e541c4d from qemu	2018-03-01 09:04:41 -05:00
Heiher	b5468b4b22	target-mips: Fix Loongson multimedia instructions. Needed to emit FPU exception on Loongson multimedia instructions executing if Status:CU1 is clear. or FPR changes may be missed on Linux. Backports commit b5a587b613f6151c2ce164552579ae64f2ddfd1c from qemu	2018-03-01 09:03:49 -05:00
Heiher	12a8570cbe	target-mips: Fix Loongson multimedia 'or' instruction. Backports commit bb7cab5f3466540f5603b209c0df2e27a02fbb95 from qemu	2018-03-01 09:03:10 -05:00
Heiher	ba39cb4fcb	target-mips: Fix Loongson pandn instruction. pandn FD, FS, FT Operation: FD = ((NOT FS) AND FT) Backports commit 9099a36b4bb81f84004b77f08e58ac2c67eed0e7 from qemu	2018-03-01 09:02:34 -05:00
Laurent Vivier	7055c38183	target-m68k: fix muluw/mulsw "The multiplier and multiplicand are both word operands, and the result is a long-word operand." So compute flags on a long-word result, not on a word result. Backports commit 4a18cd44f3c905d443c26e26bb9b09932606d1a3 from qemu	2018-03-01 08:59:42 -05:00
Laurent Vivier	527c68f40e	target-m68k: Fix cmpa operand size "The size of the operation can be specified as word or long. Word length source operands are sign-extended to 32 bits for comparison." So comparison is always done using OS_LONG. Backports commit 5436c29d78957a6825a93f0eb79dfab388641017 from qemu	2018-03-01 08:58:59 -05:00
Laurent Vivier	69687e1824	target-m68k: fix EXG instruction opcodes of "EXG Ax,Ay" and "EXG Dx,Dy" have been swapped Backports commit c090c97d925ce751d8834d5c5a404952598f67c0 from qemu	2018-03-01 08:57:45 -05:00
Bobby Bingham	d46e52d9d0	cpu_ldst.h: use correct guest address parameter In the user emulation code path, tlb_vaddr_to_host erronesously passed vaddr as the guest address to be translated, instead of addr, the parameter which actually contained the guest address. This resulted in incorrect addresses being used when emulating block copy (mvc/mvpg) and block clear (xc) instructions for the s390x target. Backports commit c2a85316902e67530da9d6548139fcce73c0cac6 from qemu	2018-03-01 08:56:37 -05:00
Ed Maste	6deaf98a8b	Fix FreeBSD (10.x) build after 7dc9ae43 Include sys/user.h for declaration of 'struct kinfo_proc'. Add -lutil to qemu-ga link for kinfo_getproc. Backports commit a7764f1548ef9946af30a8f96be9cef10761f0c1 from qemu	2018-03-01 08:55:43 -05:00
Luwei Kang	57533d1adc	x86: add AVX512_4VNNIW and AVX512_4FMAPS features The spec can be found in Intel Software Developer Manual or in Instruction Set Extensions Programming Reference. Backports commit 95ea69fb46266aaa46d0c8b7f0ba8c4903dbe4e3 from qemu	2018-03-01 08:51:09 -05:00
Alex Bennée	1e4154af83	exec.c: ensure all AddressSpaceDispatch updates under RCU The memory_dispatch field is meant to be protected by RCU so we should use the correct primitives when accessing it. This race was flagged up by the ThreadSanitizer. Backports commit f35e44e7645edbb08e35b111c10c2fc57e2905c7 from qemu	2018-03-01 08:44:19 -05:00
Joseph Myers	7ff441826c	tcg: correct 32-bit tcg_gen_ld8s_i64 sign-extension The version of tcg_gen_ld8s_i64 for 32-bit systems does a load into the low part of the return value - then attempts a sign extension into the high part, but wrongly sets the high part to a sign extension of itself rather than of the low part. This results in TCG internal errors from the use of the uninitialized high part (in some GCC tests of AArch64 NEON shift intrinsics, in particular). This patch corrects the sign-extension logic, making it match other functions such as tcg_gen_ld16s_i64. Backports commit 3ff91d7e85176f8b4b131163d7fd801757a2c949 from qemu	2018-03-01 08:41:23 -05:00
Peter Maydell	f9c5c1a604	tcg/tcg.h: Improve documentation of TCGv_i32 etc types The typedefs we use for the TCGv_i32, TCGv_i64 and TCGv_ptr types are somewhat confusing, because we define them as pointers to structs, but the structs themselves are never defined. Explain in the comments a bit more clearly why this is OK and what is going on under the hood. Backports commit a40d4701bc9f6e6a3bbfb7b4fbe756a5b72b5df1 from qemu	2018-03-01 08:40:35 -05:00
Richard Henderson	f5a35908da	tcg: Add tcg_gen_mulsu2_{i32,i64,tl} This multiply has one signed input and one unsigned input, producing the full double-width result. Backports commit 5087abfb7dfd1d368ae6939420057036b4d8e509 from qemu	2018-03-01 08:39:37 -05:00
Richard Henderson	f9d91a81b5	target-sparc: Use tcg_gen_atomic_cmpxchg_tl Backports commit 5a7267b6a9e94c264ca77a7ca5a239e70dac81da from qemu	2018-03-01 08:34:35 -05:00
Richard Henderson	47313adedd	target-sparc: Use tcg_gen_atomic_xchg_tl Backports commit da1bcae65288bdd51e0a7203d1e6c9cde1be5b3d from qemu	2018-03-01 08:32:10 -05:00
Richard Henderson	6b09040e23	target-sparc: Remove MMU_MODE*_SUFFIX The functions that these generate are no longer used. Backports commit 47b2696b975b794c6fa7b9fa8ae4699e749d662c from qemu	2018-03-01 08:30:27 -05:00
Richard Henderson	00fc847229	target-sparc: Allow 4-byte alignment on fp mem ops The cpu is allowed to require stricter alignment on these 8- and 16-byte operations, and the OS is required to fix up the accesses as necessary, so the previous code was not wrong. However, we can easily handle this misalignment for all direct 8-byte operations and for direct 16-byte loads. We must retain 16-byte alignment for 16-byte stores, so that we don't have to probe for writability of a second page before performing the first of two 8-byte stores. We also retain 8-byte alignment for no-fault loads, since they are rare and it's not worth extending the helpers for this. Backports commit cb21b4da6cca1bb4e3f5fefb698fb9e4d00c8f66 from qemu	2018-03-01 08:29:11 -05:00
Richard Henderson	eec264526e	target-sparc: Implement ldqf and stqf inline At the same time, fix a problem with stqf_asi, when a write might access two pages. Backports commit f939ffe5a022a8798824e2720ed5a14186fca6b6 from qemu	2018-03-01 08:20:36 -05:00
Richard Henderson	3a25695841	target-sparc: Remove asi helper code handled inline Now that we never call out to helpers when direct accesses can handle an asi, remove the corresponding code in those helpers. For ldda, this removes the entire helper. Backports commit 918d9a2c9d36378a3cf6636018900a4731c83b9d from qemu	2018-03-01 08:14:31 -05:00
Richard Henderson	15c8bf0b42	target-sparc: Implement BCOPY/BFILL inline Backports commit 34810610acbde7a0745be3a88e99f2ef9282260f from qemu	2018-02-28 12:54:10 -05:00
Richard Henderson	3c48eb4aaf	target-sparc: Implement cas_asi/casx_asi inline Backports commit 7268adebfda6548b8ae6865dc8337f116a5d266d from qemu	2018-02-28 12:47:26 -05:00
Richard Henderson	b28b5cd3d3	target-sparc: Implement ldstub_asi inline Backports commit fbb4bbb62e5603c991b880e25dc4bb30d342b944 from qemu	2018-02-28 12:42:26 -05:00
Richard Henderson	adf9faf075	target-sparc: Implement swap_asi inline Backports commit 4fb554bc6c88eb45270a3ad3cf6e6e2ad476aede from qemu	2018-02-28 12:39:55 -05:00
Richard Henderson	ebc292c174	target-sparc: Handle more twinx asis As used by HelenOS, presumably for ultra 2 and 3, prior to the sun4v platform and the current twinx names. Backports commit 34a6e13da70b2c798630a8dbd03d09f201c0198f from qemu	2018-02-28 12:28:08 -05:00
Richard Henderson	ecbeea7c56	target-sparc: Use MMU_PHYS_IDX for bypass asis Backports commit 7f87c90527d7363e8cecf1c6b5ad3d4cc85d3d28 from qemu	2018-02-28 12:26:29 -05:00
Richard Henderson	15eea419e5	target-sparc: Add MMU_PHYS_IDX It's handy to have a mmu idx for physical addresses, so that mmu disabled and physical access asis can use the same path as normal accesses. Backports commit af7a06bac7d3abb2da48ef3277d2a415772d2ae8 from qemu	2018-02-28 12:24:17 -05:00
Richard Henderson	9e60a8e432	target-sparc: Introduce cpu_raise_exception_ra Several helpers call helper_raise_exception directly, which requires in turn that their callers have performed save_state. The new function allows a TCG return address to be passed in so that we can restore PC + NPC + flags data from that. This fixes a bug in the usage of helper_check_align, whose callers had not been calling save_state. It fixes another bug in which the divide helpers used GETPC at a level other than the direct callee from TCG. This allows the translator to avoid save_state prior to SAVE, RESTORE, and FLUSHW instructions. Backports commit 2f9d35fc4006122bad33f9ae3e2e51d2263e98ee from qemu	2018-02-28 12:15:06 -05:00
Richard Henderson	62ae2a5102	target-sparc: Use overalignment flags for twinx and block asis This allows us to enforce 16 and 64-byte alignment without any extra overhead. Backports commit 808832277af11dafee5a55da2b9e41d019b879ca from qemu	2018-02-28 12:01:50 -05:00
Alex Bennée	da124da4b1	tcg: move locking for tb_invalidate_phys_page_range up In the linux-user case all things that involve ''l1_map' and PageDesc tweaks are protected by the memory lock (mmpa_lock). For SoftMMU mode we previously relied on single threaded behaviour, with MTTCG we now use the tb_lock(). As a result we need to do a little re-factoring and push the taking of this lock up the call tree. This requires a slightly different entry for the SoftMMU and user-mode cases from tb_invalidate_phys_range. This also means user-mode breakpoint insertion needs to take two locks but it hadn't taken any previously so this is an improvement. Backpoirts commit ba051fb5e56d5ff5e4fa672d37954452e58543b2 from qemu	2018-02-28 10:35:41 -05:00
Paolo Bonzini	9d64a89acf	tcg: comment on which functions have to be called with tb_lock held softmmu requires more functions to be thread-safe, because translation blocks can be invalidated from e.g. notdirty callbacks. Probably the same holds for user-mode emulation, it's just that no one has ever tried to produce a coherent locking there. This patch will guide the introduction of more tb_lock and tb_unlock calls for system emulation. Note that after this patch some (most) of the mentioned functions are still called outside tb_lock/tb_unlock. The next one will rectify this. Backports commit 7d7500d99895f888f97397ef32bb536bb0df3b74 from qemu	2018-02-28 10:26:28 -05:00
Alex Bennée	7aab0bd9a6	translate-all: add DEBUG_LOCKING asserts This adds asserts to check the locking on the various translation engines structures. There are two sets of structures that are protected by locks. The first the l1map and PageDesc structures used to track which translation blocks are associated with which physical addresses. In user-mode this is covered by the mmap_lock. The second case are TB context related structures which are protected by tb_lock which is also user-mode only. Currently the asserts do nothing in SoftMMU mode but this will change for MTTCG. Backports commit 301e40ed8005306c009978be295ed9a4b725178b from qemu	2018-02-28 08:56:15 -05:00
Alex Bennée	075aaad106	translate_all: DEBUG_FLUSH -> DEBUG_TB_FLUSH Make the debug define consistent with the others. The flush operation is all about invalidating TranslationBlocks on flush events. Also fix up the commenting on the other DEBUG for the benefit of checkpatch. Backports commit 955939a2b51f72bea1c200b559ea39985df5a633 from qemu	2018-02-28 08:53:38 -05:00
Anand J	8278af45cd	clean-up: removed duplicate #includes Some files contain multiple #includes of the same header file. Removed most of those unnecessary duplicate entries using scripts/clean-includes. Backports commit 814bb12a561d36aeb5ae4440ad43d2b0761d76da from qemu	2018-02-28 08:51:56 -05:00
Wei Huang	bceed21d23	arm: Add an option to turn on/off vPMU support This patch adds a pmu=[on/off] option to enable/disable vPMU support in guest vCPU. It allows virt tools, such as libvirt, to determine the exsitence of vPMU and configure it. Note this option is only available for cortex-a57/cortex-53/ host CPUs, but unavailable on ARMv7 and other processors. Also even though "pmu=" option is available for TCG mode, setting it doesn't turn PMU on. Backports commit 929e754d5a621cd53f30e69b766ccf381b58d124 from qemu	2018-02-28 08:49:23 -05:00
Laurent Vivier	5daf91ea48	target-m68k: immediate ops manage word and byte operands Backports commit 92c62548f69cb4ba739d7d046e9caf9ea75753e4 from qemu	2018-02-28 08:42:22 -05:00
Laurent Vivier	f7c29f73b3	target-m68k: cmp manages word and bytes operands Backports commit ff99b952c8280853801fe14f7ae62d0f87464f7d from qemu	2018-02-28 08:37:46 -05:00
Laurent Vivier	fc28e8127f	target-m68k: add/sub manage word and byte operands Backports commit 8a370c6cb770b618f7eb66628116c25e84588df8 from qemu	2018-02-28 07:18:25 -05:00
Laurent Vivier	bc27695926	target-m68k: add addressing modes to neg Backports commit 227de713e0f4224a82c32991b4e4c4973381426b from qemu	2018-02-28 07:07:28 -05:00
Laurent Vivier	3558b93f11	target-m68k: introduce byte and word cc_ops Backports commit db3d7945ae7992c91cc5705dccf60fec79b24dc4 from qemu	2018-02-28 06:52:16 -05:00
Laurent Vivier	4e257ffda9	target-m68k: some bit ops cleanup Backports commit 3c980d2ef664e6d5a1a0c98aca4d11d33b17ca59 from qemu	2018-02-28 01:25:58 -05:00
Laurent Vivier	cfab571859	target-m68k: suba/adda can manage word operand Backports commit 415f4b62eb4629bd3702e6fb8aa51437a92983ff from qemu	2018-02-28 01:20:23 -05:00
Laurent Vivier	99c297efe3	target-m68k: and can manage word and byte operands Backports commit 52dc23c5956159a79a4e2d4193e44d2c4cf3883c from qemu	2018-02-28 01:19:02 -05:00
Laurent Vivier	41372b0cc9	target-m68k: or can manage word and byte operands Backports commit 020a4659208a6f9a985881504fd4d3b44ab589be from qemu	2018-02-28 01:15:27 -05:00
Laurent Vivier	e140aac281	target-m68k: eor can manage word and byte operands Backports commit eec37aec85af9f5fd59b534d20c86a775b8e7973 from qemu	2018-02-28 01:05:21 -05:00
Laurent Vivier	bc52777b00	target-m68k: add addressing modes to not Backports commit ea4f2a844132c81f1e6b51fed7019686ce4e3bc5 from qemu	2018-02-28 01:03:38 -05:00
Richard Henderson	549e31cc72	target-m68k: Inline addx, subx, negx And add opcodes for 680x0 Backports commit a665a820e5d46b1611f409fbc7a540fe1c6bf5c8 from qemu	2018-02-28 01:02:31 -05:00
Laurent Vivier	b796f934ff	target-m68k: add dbcc Backports commit beff27ab3a60d8abab4a166670ca79b3c0970005 from qemu	2018-02-28 00:45:39 -05:00
Laurent Vivier	977c3fe6c4	target-m68k: add addressing modes to scc Backports commit d5a3cf33f2f65069d2f79a6e349f0d8140f02bb4 from qemu	2018-02-28 00:43:30 -05:00
Laurent Vivier	77b1754376	target-m68k: add exg ops Backports commit 29cf437da4eeacb46cd7076014d06c85ca47c91d from qemu	2018-02-28 00:37:41 -05:00
Laurent Vivier	56882899be	target-m68k: add linkl Backports commit c630e436c0ed3adc3a858c328119daf6d1b3357f from qemu	2018-02-28 00:31:27 -05:00
Laurent Vivier	59d6a1a744	target-m68k: add bkpt instruction Backports commit 71600eda7cc48f03ea306bc69ed7e52ef1d9dd91 from qemu	2018-02-28 00:29:41 -05:00
Emilio G. Cota	22be035e60	target-arm: remove EXCP_STREX + cpu_exclusive_{test, info} The exception is not emitted anymore; remove it and the associated TCG variables. Backports commit 05188cc72f0399e99c92f608a8e7ca4c8e552c4b from qemu	2018-02-28 00:24:20 -05:00
Emilio G. Cota	cb92eea81a	target-arm: emulate aarch64's LL/SC using cmpxchg helpers Emulating LL/SC with cmpxchg is not correct, since it can suffer from the ABA problem. Portable parallel code, however, is written assuming only cmpxchg--and not LL/SC--is available. This means that in practice emulating LL/SC with cmpxchg is a viable alternative. The appended emulates LL/SC pairs in aarch64 with cmpxchg helpers. This works in both user and system mode. In usermode, it avoids pausing all other CPUs to perform the LL/SC pair. The subsequent performance and scalability improvement is significant, as the plots below show. They plot the throughput of atomic_add-bench compiled for ARM and executed on a 64-core x86 machine. Hi-res plots: http://imgur.com/a/JVc8Y atomic_add-bench: 1000000 ops/thread, [0,1] range 18 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 16 ++master +-H--+ ++ \|\| \| 14 ++ ++ \| \| \| 12 ++\| ++ \| \| \| 10 ++++ ++ 8 ++E ++ \|+++ \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| \| \| 2 +H++E+--- ++ + \| +E++----+E+---+--+E+----++E+------+E+------+E++----+E+---+--+E\| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,2] range 18 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 16 ++master +-H--+ ++ \| \| \| 14 ++E ++ \| \| \| 12 ++\| ++ \|+++ \| 10 ++ \| ++ 8 ++ \| ++ \| \| \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| +E+--- \| 2 +H+ +E+-----+++ +++ +++ ---+E+-----+E+------+++ +++ + +E+---+--+E+----++E+------+E+--- ++++ +++ + +E\| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,128] range 70 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 60 ++master +-H--+ +++ ---+E+-----+E+------+E+ \| +E+------E-------+E+--- \| \| --- +++ \| 50 ++ +++--- ++ \| -+E+ \| 40 ++ +++---- ++ \| E- \| \| --\| \| 30 ++ -- +++ ++ \| +E+ \| 20 ++E+ ++ \|E+ \| \| \| 10 ++ ++ + + + + + + + \| 0 +HH-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,1024] range 160 ++---------+---------+----------+---------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 140 ++master +-H--+ +++ +++ \| -+E+-----+E+-------E\| 120 ++ +++ ---- +++ \| +++ ----E-- \| 100 ++ --E--- +++ ++ \| +++ ---- +++ \| 80 ++ --E-- ++ \| ---- +++ \| \| -+E+ \| 60 ++ ---- +++ ++ \| +E+- \| 40 ++ -- ++ \| +E+ \| 20 +EE+ ++ +++ + + + + + + \| 0 +HH-H---H--+-----H---+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads Backports commit 1dd089d0eec060dcd8478735114d98421d414805 from qemu	2018-02-28 00:21:27 -05:00
Emilio G. Cota	3546558f66	target-arm: emulate SWP with atomic_xchg helper Backports commit cf12bce088f22b92bf62ffa0d7f6a3e951e355a9 from qemu	2018-02-28 00:11:23 -05:00
Emilio G. Cota	ec14a00925	target-arm: emulate LL/SC using cmpxchg helpers Emulating LL/SC with cmpxchg is not correct, since it can suffer from the ABA problem. Portable parallel code, however, is written assuming only cmpxchg--and not LL/SC--is available. This means that in practice emulating LL/SC with cmpxchg is a viable alternative. The appended emulates LL/SC pairs in ARM with cmpxchg helpers. This works in both user and system mode. In usermode, it avoids pausing all other CPUs to perform the LL/SC pair. The subsequent performance and scalability improvement is significant, as the plots below show. They plot the throughput of atomic_add-bench compiled for ARM and executed on a 64-core x86 machine. Hi-res plots: http://imgur.com/a/aNQpB atomic_add-bench: 1000000 ops/thread, [0,1] range 9 ++---------+----------+----------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 8 +Emaster +-H--+ ++ \| \| \| 7 ++E ++ \| \| \| 6 ++++ ++ \| \| \| 5 ++ \| ++ 4 ++ \| ++ \| \| \| 3 ++ \| ++ \| \| \| 2 ++ \| ++ \|H++E+--- +++ ---+E+------+E+------+E\| 1 +++ +E+-----+E+------+E+------+E+------+E+-- +++ +++ ++ ++H+ + +++ + +++ ++++ + + + \| 0 ++--H----H-+-----H----+----------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,2] range 16 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 14 ++master +-H--+ ++ \| \| \| 12 ++\| ++ \| E \| 10 ++\| ++ \| \| \| 8 ++++ ++ \|E+\| \| \| \| \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| +E+--- +++ +++ +++ ---+E+------+E\| 2 +H+ +E+------E-------+E+-----+E+------+E+------+E+-- +++ + \| + +++ + ++++ + + + \| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,128] range 70 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + ++++ + \| 60 ++master +-H--+ ----E------+E+-------++ \| -+E+--- +++ +++ +E\| \| +++ ---- +++ ++\| 50 ++ +++ ---+E+- ++ \| -E--- \| 40 ++ ---+++ ++ \| +++--- \| \| -+E+ \| 30 ++ +++---- ++ \| +E+ \| 20 ++ +++-- ++ \| +E+ \| \|+E+ \| 10 +E+ ++ + + + + + + + \| 0 +HH-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,1024] range 120 ++---------+---------+----------+---------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| \| master +-H--+ ++\| 100 ++ ----E+ \| +++ ---+E+--- ++\| \| --E--- +++ \| 80 ++ ---- +++ ++ \| ---+E+- \| 60 ++ -+E+-- ++ \| +++ ---- +++ \| \| -+E+- \| 40 ++ +++---- ++ \| +++ ---+E+ \| \| -+E+--- \| 20 ++ +E+ ++ \|+E+++ \| +E+ + + + + + + \| 0 +HH-H---H--+-----H---+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads Backports commit 354161b37c6465a32073eac5f16fa35939af2bb4 from qemu	2018-02-28 00:07:44 -05:00
Richard Henderson	fd9933fbd5	target-arm: Rearrange aa32 load and store functions Stop specializing on TARGET_LONG_BITS == 32; unconditionally allocate a temp and expand with tcg_gen_extu_i32_tl. Split out gen_aa32_addr, gen_aa32_frob64, gen_aa32_ld_i32 and gen_aa32_st_i32 as separate interfaces. Backports commit 7f5616f53896a4e08ad37de3ac50d3a4cc8eff7a from qemu	2018-02-27 23:59:16 -05:00
Emilio G. Cota	3dc16ebca3	target-i386: remove helper_lock() It's been superseded by the atomic helpers. The use of the atomic helpers provides a significant performance and scalability improvement. Below is the result of running the atomic_add-test microbenchmark with: $ x86_64-linux-user/qemu-x86_64 tests/atomic_add-bench -o 5000000 -r $r -n $n , where $n is the number of threads and $r is the allowed range for the additions. The scenarios measured are: - atomic: implements x86' ADDL with the atomic_add helper (i.e. this patchset) - cmpxchg: implement x86' ADDL with a TCG loop using the cmpxchg helper - master: before this patchset Results sorted in ascending range, i.e. descending degree of contention. Y axis is Throughput in Mops/s. Tests are run on an AMD machine with 64 Opteron 6376 cores. atomic_add-bench: 5000000 ops/thread, [0,1] range 25 ++---------+----------+---------+----------+----------+----------+---++ + atomic +-E--+ + + + + + \| \|cmpxchg +-H--+ \| 20 +Emaster +-N--+ ++ \|\| \| \|++ \| \|\| \| 15 +++ ++ \|N\| \| \|+\| \| 10 ++\| ++ \|+\|+ \| \| \| -+E+------ +++ ---+E+------+E+------+E+-----+E+------+E\| \|+E+E+- +++ +E+------+E+-- \| 5 ++\|+ ++ \|+N+H+--- +++ \| ++++N+--+H++----+++ + +++ --++H+------+H+------+H++----+H+---+--- \| 0 ++---------+-----H----+---H-----+----------+----------+----------+---H+ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,2] range 25 ++---------+----------+---------+----------+----------+----------+---++ ++atomic +-E--+ + + + + + \| \|cmpxchg +-H--+ \| 20 ++master +-N--+ ++ \|E\| \| \|++ \| \|\|E \| 15 ++\| ++ \|N\|\| \| \|+\|\| ---+E+------+E+-----+E+------+E\| 10 ++\| \| ---+E+------+E+-----+E+--- +++ +++ \|\|H+E+--+E+-- \| \|+++++ \| \| \|\| \| 5 ++\|+H+-- +++ ++ \|+N+ - ---+H+------+H+------ \| + +N+--+H++----+H+---+--+H+----++H+--- + + +H+---+--+H\| 0 ++---------+----------+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,8] range 40 ++---------+----------+---------+----------+----------+----------+---++ ++atomic +-E--+ + + + + + \| 35 +cmpxchg +-H--+ ++ \| master +-N--+ ---+E+------+E+------+E+-----+E+------+E\| 30 ++\| ---+E+-- +++ ++ \| \| -+E+--- \| 25 ++E ---- +++ ++ \|+++++ -+E+ \| 20 +E+ E-- +++ ++ \|H\|+++ \| \|+\| +H+------- \| 15 ++H+ ---+++ +H+------ ++ \|N++H+-- +++--- +H+------++\| 10 ++ +++ - +++ ---+H+ +++ +H+ \| \| +H+-----+H+------+H+-- \| 5 ++\| +++ ++ ++N+N+--+N++ + + + + + \| 0 ++---------+----------+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,128] range 160 ++---------+---------+----------+---------+----------+----------+---++ + atomic +-E--+ + + + + + \| 140 +cmpxchg +-H--+ +++ +++ ++ \| master +-N--+ E--------E------+E+------++\| 120 ++ --\| \| +++ E+ \| -- +++ +++ ++\| 100 ++ - ++ \| +++- +++ ++\| 80 ++ -+E+ -+H+------+H+------H--------++ \| ---- ---- +++ H\| \| ---+E+-----+E+- ---+H+ ++\| 60 ++ +E+--- +++ ---+H+--- ++ \| --+++ ---+H+-- \| 40 ++ +E+-+H+--- ++ \| +H+ \| 20 +EE+ ++ +N+ + + + + + + \| 0 ++N-N---N--+---------+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,1024] range 350 ++---------+---------+----------+---------+----------+----------+---++ + atomic +-E--+ + + + + + \| 300 +cmpxchg +-H--+ +++ \| master +-N--+ +++ \|\| \| +++ \| ----E\| 250 ++ \| ----E---- ++ \| ----E--- \| ---+H\| 200 ++ -+E+--- +++ ---+H+--- ++ \| ---- -+H+-- \| \| +E+ +++ ---- +++ \| 150 ++ ---+++ ---+H+- ++ \| --- -+H+-- \| 100 ++ ---+E+ ---- +++ ++ \| +++ ---+E+-----+H+- \| \| -+E+------+H+-- \| 50 ++ +E+ ++ +EE+ + + + + + + \| 0 ++N-N---N--+---------+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads hi-res: http://imgur.com/a/fMRmq For master I stopped measuring master after 8 threads, because there is little point in measuring the well-known performance collapse of a contended lock. Backports commit 37b995f6e7a1cb6fa378c5cd4217b9dd9e1fc98b from qemu	2018-02-27 23:43:22 -05:00
Emilio G. Cota	9d9b7dedac	target-i386: emulate XCHG using atomic helper Backports commit ea97ebe89f7a879ea9aba90140e40c29b5cbd653 from qemu	2018-02-27 23:40:20 -05:00
Emilio G. Cota	8f96b6beb9	target-i386: emulate LOCK'ed BTX ops using atomic helpers Backports commit cfe819d309d472f75fd129faf1d1064a2498326c from qemu	2018-02-27 23:39:21 -05:00
Emilio G. Cota	089965fa8d	target-i386: emulate LOCK'ed XADD using atomic helper Backports commit f53b01817f95781d2bcc8a82e057d1416601e13b from qemu	2018-02-27 23:06:28 -05:00
Emilio G. Cota	f9ed728f27	target-i386: emulate LOCK'ed NEG using cmpxchg helper Backports commit 8eb8c7385608b99bed6055a22d897ff727a6cb8e from qemu	2018-02-27 23:03:28 -05:00
Emilio G. Cota	fedeb0f93e	target-i386: emulate LOCK'ed NOT using atomic helper Backports commit 2a5fe8ae145ef7a3ab480922116d27efcc97b85d from qemu	2018-02-27 23:00:33 -05:00
Emilio G. Cota	05c94546d5	target-i386: emulate LOCK'ed INC using atomic helper Backports commit 60e573462fcdb83aa1a41e66a9f31dc8a4364399 from qemu	2018-02-27 22:56:05 -05:00
Emilio G. Cota	7c7b0fe746	target-i386: emulate LOCK'ed OP instructions using atomic helpers Backports commit a7cee522f3529c2fc85379237b391ea98823271e from qemu	2018-02-27 22:53:46 -05:00
Emilio G. Cota	a386368f82	target-i386: emulate LOCK'ed cmpxchg using cmpxchg helpers The diff here is uglier than necessary. All this does is to turn FOO into: if (s->prefix & PREFIX_LOCK) { BAR } else { FOO } where FOO is the original implementation of an unlocked cmpxchg. Backports commit ae03f8de45427042ecd10b0941a005f21ecc064c from qemu	2018-02-27 22:38:37 -05:00
Richard Henderson	b48508a6c1	tcg: Emit barriers with parallel_cpus Backports commit 91682118aa330aff7e8ef0cc685c32d101f49940 from qemu	2018-02-27 22:28:33 -05:00
Richard Henderson	064543a415	tcg: Add CONFIG_ATOMIC64 Allow qemu to build on 32-bit hosts without 64-bit atomic ops. Even if we only allow 32-bit hosts to multi-thread emulate 32-bit guests, we still need some way to handle the 32-bit guest using a 64-bit atomic operation. Do so by dropping back to single-step. Backports commit df79b996a7b21c6ea7847f7927a2e1a294b86c72 from qemu	2018-02-27 22:25:36 -05:00
Richard Henderson	da01e53757	tcg: Add atomic128 helpers Force the use of cmpxchg16b on x86_64. Wikipedia suggests that only very old AMD64 (circa 2004) did not have this instruction. Further, it's required by Windows 8 so no new cpus will ever omit it. If we truely care about these, then we could check this at startup time and then avoid executing paths that use it. Backports commit 7ebee43ee3e2fcd7b5063058b7ef74bc43216733 from qemu	2018-02-27 21:43:48 -05:00
Richard Henderson	5c0ce1b99c	tcg: Add atomic helpers Add all of cmpxchg, op_fetch, fetch_op, and xchg. Handle both endian-ness, and sizes up to 8. Handle expanding non-atomically, when emulating in serial. Backports commit c482cb117cc418115ca9c6d21a7a2315414c0a40 from qemu	2018-02-27 15:57:47 -05:00
Richard Henderson	0245f93c02	cputlb: Remove includes from softmmu_template.h We already include exec/address-spaces.h and exec/memory.h in cputlb.c; the include of qemu/timer.h appears to be a fossil. Backports commit 40978428853e2f7b4597ab2a9ffeb187333802dc from qemu	2018-02-27 12:40:43 -05:00
Richard Henderson	5c79851143	cputlb: Tidy some macros TGT_LE and TGT_BE are not size dependent and do not need to be redefined. The others are no longer used at all. Backports commit c86c6e4c80fee4d9423bedb10ba9e9c4aa68f861 from qemu	2018-02-27 12:36:25 -05:00
Richard Henderson	4da1cfb902	cputlb: Move most of iotlb code out of line Saves 2k code size off of a cold path. Backports commit 82a45b96a203a7403427183f1afd3d295222ff7d from qemu	2018-02-27 12:34:19 -05:00
Richard Henderson	5df7c9eec7	cputlb: Move probe_write out of softmmu_template.h Backports commit 3b08f0a92545ba06fbdeaae929a5172480300c33 from qemu	2018-02-27 12:25:24 -05:00
Yongbok Kim	79e4c001a9	softmmu: Add probe_write() Probe for whether the specified guest write access is permitted. If it is not permitted then an exception will be taken in the same way as if this were a real write access (and we will not return). Otherwise the function will return, and there will be a valid entry in the TLB for this access. Backports commit 3b4afc9e75ab1a95f33e41f462921093f8a109c4 from qemu	2018-02-27 12:20:50 -05:00
Richard Henderson	1c9c8d3f10	cputlb: Replace SHIFT with DATA_SIZE Backports commit dea2198201b3e0151d75b42774c51cf2ffe2ca4b from qemu	2018-02-27 12:00:33 -05:00
Richard Henderson	e35aacd5ae	tcg: Add EXCP_ATOMIC When we cannot emulate an atomic operation within a parallel context, this exception allows us to stop the world and try again in a serial context. Backports commit fdbc2b5722f6092e47181a947c90fd4bdcc1c121 from qemu Also backports parts of commit 02d57ea115b7669f588371c86484a2e8ebc369be	2018-02-27 11:57:58 -05:00
Richard Henderson	d5510a546f	int128: Add int128_make128 Allows Int128 to be used more generally, rather than having to begin with 64-bit inputs and accumulate. Backports commit 1edaeee0955fba7d834b7c8f4e372e7eae030745 from qemu	2018-02-27 11:06:33 -05:00
Richard Henderson	9084e5fe1b	int128: Use __int128 if available Backports commit 0846beb36641e8f0c3ee55a5bb84d468b653c852 from qemu	2018-02-27 11:03:06 -05:00
Richard Henderson	4fdbe94eea	exec: Avoid direct references to Int128 parts Backports commit 258dfaaad05a5fbe32a142b794e1df3e16501d0e from qemu	2018-02-27 11:01:43 -05:00
Richard Henderson	ba1c63572e	atomics: Add __nocheck atomic operations While the check against sizeof(void *) is appropriate for normal usage within qemu, there are places in which we want wider operaions and have checked for their existance. Backports commit 84bca3927b36fb1d9a2ca85cbbdf9023d2b84678 from qemu	2018-02-27 11:00:20 -05:00
Lioncash	a59eef391e	atomic: MSVC compatible equivalents to some functions	2018-02-27 10:56:04 -05:00
Emilio G. Cota	c837d76a86	atomics: add atomic_op_fetch variants This paves the way for upcoming work. Backports commit 83d0c719f837724d9e3963b078211b2242bdd2a5 from qemu	2018-02-27 10:28:27 -05:00
Emilio G. Cota	102a53aa50	atomics: add atomic_xor This paves the way for upcoming work. Backports commit 61696ddbdc74263ddb6869856772cfe355a5d3bd from qemu	2018-02-27 10:23:31 -05:00
Richard Henderson	3fe8d46a15	atomics: Add parameters to macros Making these functional rather than object macros will prevent later problems with complex macro expansion. Backports commit d1a9f2d12fcfc942924956fbe321aedf4226ccb7 from qemu	2018-02-27 10:21:35 -05:00
Richard Henderson	4168095fed	target-m68k: Optimize gen_flush_flags Backports commit 36f0399d46f2ccf4f6e7451ba46b1e8d0e9ab341 from qemu	2018-02-27 10:19:54 -05:00
Richard Henderson	7403e63f2f	target-m68k: Optimize some comparisons Backports commit 9d896621c1820fd8f437fac26fd7d2e0921091c3 from qemu	2018-02-27 10:15:20 -05:00
Richard Henderson	672a28173f	target-m68k: Use setcond for scc Backports commit b459e3eccfae7fe83e30187c391de00bccf4f51d from qemu	2018-02-27 10:11:35 -05:00
Richard Henderson	ed6feb9329	target-m68k: Introduce DisasCompare Backports commit 6a432295d73df91890dc70c4a94dcc4ba88ad1c3 from qemu	2018-02-27 10:08:32 -05:00
Richard Henderson	4e498cc54d	target-m68k: Reorg flags handling Separate all ccr bits. Continue to batch updates via cc_op. Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu	2018-02-27 10:02:02 -05:00
Richard Henderson	121309a4d0	target-m68k: Reorg flags handling Separate all ccr bits. Continue to batch updates via cc_op. Signed-off-by: Richard Henderson <rth@twiddle.net> Fix gen_logic_cc() to really extend the size of the result. Fix gen_get_ccr(): update cc_op as it is used by the helper. Factorize flags computing and src/ccr cleanup Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu	2018-02-27 09:30:32 -05:00
Richard Henderson	61ab9a42cd	target-m68k: Remove incorrect clearing of cc_x The CF docs certainly doesnt suggest this is true. Backports commit 18dd87f26bed46f22bb1b9536329c02de500f407 from qemu	2018-02-27 09:21:26 -05:00
Richard Henderson	187c2a9807	target-m68k: Some fixes to SR and flags management Backports commit 99c514485b1d7922c4ca1ed767fd45525de4701f from qemu	2018-02-27 09:19:21 -05:00
Richard Henderson	9493b29399	target-m68k: Print flags properly Backports commit 8e394ccabdb1e439aab092de6b9d2f26432e962f from qemu	2018-02-27 09:17:44 -05:00
Laurent Vivier	57ea90a91f	target-m68k: update CPU flags management Copied from target-i386 Backports commit 9fdb533fb129b19610941bd1e5dd93e7471a18f5 from qemu	2018-02-27 09:15:29 -05:00
Laurent Vivier	125675e334	target-m68k: don't update cc_dest in helpers Backports commit 91f90d7191f862ab27528dbdf76cee55c77f79cf from qemu	2018-02-27 09:04:51 -05:00
Laurent Vivier	12f9ba3fe4	target-m68k: update move to/from ccr/sr Backports commit 7c0eb318bdcc3667a861e7b0f140df0b6d9895e2 from qemu	2018-02-27 08:57:05 -05:00
Laurent Vivier	b8366d5b31	target-m68k: remove m68k_cpu_exec_enter() and m68k_cpu_exec_exit() Update cc_op directly from tcg_gen_insn_start() and restore_state_to_opc() Copied from target-i386 Backports commit 20a8856eba0980fbe9d2b8ed2b33ecdb9c9fe5ad from qemu	2018-02-27 08:53:02 -05:00
Laurent Vivier	a521f4f41d	target-m68k: Replace helper_xflag_lt with setcond Backports commit f9083519034aaa5ad5cd2c5727bd61c29bf60bc5 from qemu	2018-02-27 08:50:44 -05:00
Laurent Vivier	b079255576	target-m68k: allow to update flags with operation on words and bytes Backports commit 5dbb6784b7e2b833c036b4df58aa07067e35f476 from qemu	2018-02-27 08:47:12 -05:00
Laurent Vivier	f069762b61	target-m68k: REG() macro cleanup Backports commit bcc098b0c23b4dd902ff56987d769bd839677331 from qemu	2018-02-27 08:37:25 -05:00
Laurent Vivier	3d59fe56b3	target-m68k: set PAGE_BITS to 12 for m68k Backports commit 2b04e85a3401e13cb19b1de197e6c211eaadca4c from qemu	2018-02-27 08:36:09 -05:00
Laurent Vivier	292fc83c86	target-m68k: define operand sizes Backports commit 7ef25cdd6cee4fa468d6cb913fa064a6689faf7d from qemu	2018-02-27 08:35:13 -05:00
Laurent Vivier	2653165c63	target-m68k: introduce read_imXX() functions Read a 8, 16 or 32bit immediat constant. An immediate constant is stored in the instruction opcode and can be in one or two extension words. Backports commit 28b68cd79ef01e8b1f5bd26718cd8c09a12c625f from qemu	2018-02-27 08:32:04 -05:00
Laurent Vivier	d29cbb70b3	target-m68k: manage scaled index Scaled index is not supported by 68000, 68008, and 68010. EA = (bd + PC) + Xn.SIZE*SCALE + od Ignore it: M68000 FAMILY PROGRAMMER’S REFERENCE MANUAL 2.4 BRIEF EXTENSION WORD FORMAT COMPATIBILITY "If the MC68000 were to execute an instruction that encoded a scaling factor, the scaling factor would be ignored and would not access the desired memory address. The earlier microprocessors do not recognize the brief extension word formats implemented by newer processors. Although they can detect illegal instructions, they do not decode invalid encodings of the brief extension word formats as exceptions." Backports commit d8633620a112296fcf6a6ae9a1cbba614c0ca502 from qemu	2018-02-27 08:27:20 -05:00
Laurent Vivier	fa4a71a1bf	target-m68k: define m680x0 CPUs and features This patch defines height new features: - M68K_FEATURE_SCALED_INDEX, scaled address index register - M68K_FEATURE_LONG_MULDIV, 32bit multiply/divide - M68K_FEATURE_QUAD_MULDIV, 64bit multiply/divide - M68K_FEATURE_BCCL, long conditional branches - M68K_FEATURE_BITFIELD, bit field instructions - M68K_FEATURE_FPU, FPU instructions - M68K_FEATURE_CAS, cas instruction - M68K_FEATURE_BKPT, bkpt instruction Backports commit f076803bbf6ad1618f493f543faff97f3dd0c970 from qemu	2018-02-27 08:26:06 -05:00
John Paul Adrian Glaubitz	2fd7779aa5	target-m68k: Build the opcode table only once to avoid multithreading issues Backports commit b208525797b031c1be4121553e21746686318a38 from qemu	2018-02-27 08:14:35 -05:00
Laurent Vivier	fd84549b3e	target-m68k: fix DEBUG_DISPATCH Backports commit a1ff19302007986fa081738e88905a715bd68e2e from qemu	2018-02-27 08:07:21 -05:00
Daniel P. Berrange	83a5bf2d25	qapi: rename QmpOutputVisitor to QObjectOutputVisitor The QmpOutputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one wants a QObject. Rename it to better reflect its functionality as a generic QAPI to QObject converter. The commit before previous renamed the files, this one renames C identifiers. Backports commit 7d5e199ade76c53ec316ab6779800581bb47c50a from qemu	2018-02-27 08:05:33 -05:00
Daniel P. Berrange	2949a90977	qapi: rename QmpInputVisitor to QObjectInputVisitor The QmpInputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one has a QObject. Rename it to better reflect its functionality as a generic QObject to QAPI converter. The previous commit renamed the files, this one renames C identifiers. Backports commit 09e68369a88d7de0f988972bf28eec1b80cc47f9 from qemu	2018-02-26 15:54:15 -05:00
Daniel P. Berrange	228f122248	qapi: rename qmp--visitor* to qobject--visitor* The QMP visitors have no direct dependency on QMP. It is valid to use them anywhere that one has a QObject. Rename them to better reflect their functionality as a generic QObject to QAPI converter. This is the first of three parts: rename the files. The next two parts will rename C identifiers. The split is necessary to make git rename detection work. Backports commit b3db211f3c80bb996a704d665fe275619f728bd4 from qemu	2018-02-26 15:42:37 -05:00
Peter Maydell	1a850bcb19	target-arm: Implement new HLT trap for semihosting Version 2.0 of the semihosting specification introduces new trap instructions for AArch32: HLT 0xF000 for A32 and HLT 0x3C for T32. Implement these (in the same way we implement the existing HLT semihosting trap for A64). The old traps via SVC and BKPT are unaffected. Backports commit 19a6e31c9d2701ef648b70ddcfc3bf64cec8c37e from qemu	2018-02-26 15:28:45 -05:00
Peter Maydell	db8b0a82b1	cpu: Support a target CPU having a variable page size Support target CPUs having a page size which isn't knownn at compile time. To use this, the CPU implementation should: * define TARGET_PAGE_BITS_VARY * not define TARGET_PAGE_BITS * define TARGET_PAGE_BITS_MIN to the smallest value it might possibly want for TARGET_PAGE_BITS * call set_preferred_target_page_bits() in its realize function to indicate the actual preferred target page size for the CPU (and report any error from it) In CONFIG_USER_ONLY, the CPU implementation should continue to define TARGET_PAGE_BITS appropriately for the guest OS page size. Machines which want to take advantage of having the page size something larger than TARGET_PAGE_BITS_MIN must set the MachineClass minimum_page_bits field to a value which they guarantee will be no greater than the preferred page size for any CPU they create. Note that changing the target page size by setting minimum_page_bits is a migration compatibility break for that machine. For debugging purposes, attempts to use TARGET_PAGE_SIZE before it has been finally confirmed will assert. Backports commit 20bccb82ff3ea09bcb7c4ee226d3160cab15f7da from qemu	2018-02-26 12:29:08 -05:00
Vijaya Kumar K	a7229cc08a	translate-all.c: Compute L1 page table properties at runtime Remove L1 page mapping table properties computing statically using macros which is dependent on TARGET_PAGE_BITS. Drop macros V_L1_SIZE, V_L1_SHIFT, V_L1_BITS macros and replace with variables which are computed at early stage of VM boot. Removing dependency can help to make TARGET_PAGE_BITS dynamic. Backports commit 66ec9f49399f0a9fa13ee77c472caba0de2773fc from qemu	2018-02-26 11:46:58 -05:00
Vijaya Kumar K	3082b4e4ec	exec.c: Remove static allocation of sub_section of sub_page Allocate sub_section dynamically. Remove dependency on TARGET_PAGE_SIZE to make run-time page size detection for arm platforms. Backports commit 2615fabd42ea0078dd9e659bdb21a5b7a1f87a9a from qemu	2018-02-26 10:50:04 -05:00
Paolo Bonzini	eb75004013	memory: add a per-AddressSpace list of listeners This speeds up MEMORY_LISTENER_CALL noticeably. Right now, with many PCI devices you have N regions added to M AddressSpaces (M = # PCI devices with bus-master enabled) and each call looks up the whole listener list, with at least M listeners in it. Because most of the regions in N are BARs, which are also roughly proportional to M, the whole thing is O(M^3). This changes it to O(M^2), which is the best we can do without rewriting the whole thing. Backports commit 9a54635dcb51a3fcf7507af630168f514a8cd4e7 from qemu	2018-02-26 10:46:50 -05:00
Paolo Bonzini	4b06e8bbb7	memory: eliminate global MemoryListeners There is none, so just drop the code. Backports commit d45fa784cd0c111131696808d1168259d66b7519 from qemu	2018-02-26 10:19:28 -05:00
Paolo Bonzini	8734e13a73	tcg: try sti when moving a constant into a dead memory temp This comes from free from unifying tcg_reg_alloc_mov and tcg_reg_alloc_movi's handling of TEMP_VAL_CONST. It triggers often on moves to cc_dst, such as the following translation of "sub $0x3c,%esp": before: after: subl $0x3c,%ebp subl $0x3c,%ebp movl %ebp,0x10(%r14) movl %ebp,0x10(%r14) movl $0x3c,%ebx movl $0x3c,0x2c(%r14) movl %ebx,0x2c(%r14) Backports commit 0fe4fca4e1a5e06a270127dd80bb753d4dda61c6 from qemu	2018-02-26 10:08:47 -05:00
Paolo Bonzini	be00a3e100	target-i386: fix 32-bit addresses in LEA This was found with test-i386. The issue is that instructions such as addr32 lea (%eax), %rax did not perform a 32-bit extension, because the LEA translation skipped the gen_lea_v_seg step. That step does not just add segments, it also takes care of extending from address size to pointer size. Backports commit 620abfb004543404bef1953e25da2ad77352941a from qemu	2018-02-26 10:06:08 -05:00
Paolo Bonzini	8b239bd48b	atomic: base mb_read/mb_set on load-acquire and store-release This introduces load-acquire and store-release operations in QEMU. For now, just use them as an implementation detail of atomic_mb_read and atomic_mb_set. Since docs/atomics.txt documents that atomic_mb_read only synchronizes with an atomic_mb_set of the same variable, we can use the new implementation everywhere instead of seq-cst loads and stores. Backports commit 803cf26a9e019b5d2256a8edeb22e3538c4f3261 from qemu	2018-02-26 10:02:46 -05:00
Paolo Bonzini	fd7ef4c184	atomic: introduce smp_mb_acquire and smp_mb_release	2018-02-26 09:58:22 -05:00
Eduardo Habkost	b41bb81737	target-i386: Don't use cpu->migratable when filtering features When explicitly enabling unmigratable flags using "-cpu host" (e.g. "-cpu host,+invtsc"), the requested feature won't be enabled because cpu->migratable is true by default. This is inconsistent with all other CPU models, which don't have the "migratable" option, making "+invtsc" work without the need for extra options. This happens because x86_cpu_filter_features() uses cpu->migratable as an argument for x86_cpu_get_supported_feature_word(). This is not useful because: 2) on "-cpu host" it only makes QEMU disable features that were explicitly enabled in the command-line; 1) on all the other CPU models, cpu->migratable is already false. The fix is to just use 'false' as an argument to x86_cpu_get_supported_feature_word() in x86_cpu_filter_features(). Note that: * This won't change anything for people using using "-cpu host" or "-cpu host,migratable=<on\|off>" (with no extra features) because the x86_cpu_get_supported_feature_word() call on the cpu->host_features check uses cpu->migratable as argument. * This won't change anything for any CPU model except "host" because they all have cpu->migratable == false (and only "host" has the "migratable" property that allows it to be changed). * This will only change things for people using "-cpu host,+<feature>", where <feature> is a non-migratable feature. The only existing named non-migratable feature is "invtsc". In other words, this change will only affect people using "-cpu host,+invtsc" (that will now get what they asked for: the invtsc flag will be enabled). All other use cases are unaffected. Backports commit 46c032f3afcc05a0123914609f1003906ba63fda from qemu	2018-02-26 09:51:14 -05:00
Eduardo Habkost	4096ce0184	target-i386: x86_cpu_load_features() function When probing for CPU model information, we need to reuse the code that initializes CPUID fields, but not the remaining side-effects of x86_cpu_realizefn(). Move that code to a separate function that can be reused later. Backports commit 41f3d4d69a423dadb8431fda65d8d7c68c0de0fc from qemu	2018-02-26 09:49:34 -05:00
Eduardo Habkost	aa98c8a93f	target-i386: Move warning code outside x86_cpu_filter_features() x86_cpu_filter_features() will be reused by code that shouldn't print any warning. Move the warning code to a new x86_cpu_report_filtered_features() function, and call it from x86_cpu_realizefn(). Backports commit 8ca30e8673aff9bfcf8f969f8db4266b5f62e49c from qemu	2018-02-26 09:40:11 -05:00
Eduardo Habkost	08bfa41e1b	target-i386: xsave: Add FP and SSE bits to x86_ext_save_areas Instead of treating the FP and SSE bits as special cases, add them to the x86_ext_save_areas array. This will simplify the code that calculates the supported xsave components and the size of the xsave area. Backports commit e3c9022b4e2b6a4deb6518361d2bbf33522b9198 from qemu	2018-02-26 09:37:48 -05:00
Eduardo Habkost	54bd827472	target-i386: Register properties for feature aliases manually Instead of keeping the aliases inside the feature name arrays and require parsing the strings, just register alias properties manually. This simplifies the code for property registration and lookup. Backports commit 16d2fcaa509b1ca56eb2fcd8fe877279cf65cccc from qemu	2018-02-26 09:34:52 -05:00
Eduardo Habkost	b508b9e02a	target-i386: Remove underscores from feat_names arrays Instead of translating the feature name entries when adding property names, store the actual property names in the feature name array. For reference, here is the full list of functions that use FeatureWordInfo::feat_names: * x86_cpu_get_migratable_flags(): not affected, as it just check for non-NULL values. * report_unavailable_features(): informative only. It will start printing feature names with hyphens. * x86_cpu_list(): informative only. It will start printing feature names with hyphens * x86_cpu_register_feature_bit_props(): not affected, as it was already calling feat2prop(). Now we can remove the feat2prop() calls safely. So, the only user-visible effect of this patch are the new names being used in help and error messages for users. Backports commit fc7dfd205f3287893c436d932a167bffa30579c8 from qemu	2018-02-26 09:33:15 -05:00
Eduardo Habkost	6d1a7bccb5	target-i386: Disable VME by default with TCG VME is already disabled automatically when using TCG. So, instead of pretending it is there when reporting CPU model data on query-cpu-* QMP commands (making every CPU model to be reported as not runnable), we can disable it by default on all CPU models when using TCG. Do that by adding a tcg_default_props array that will work like kvm_default_props. Backports commit 04d99c3c61f4bdc0450dbeb6512b6dd743baca65 from qemu	2018-02-26 08:23:44 -05:00
Eduardo Habkost	594cbeaa06	target-i386: List CPU models using subclass list Instead of using the builtin_x86_defs array, use the QOM subclass list to list CPU models on "-cpu ?" and "query-cpu-definitions". Backports commit ee465a3ef77c2b2975ffa71c72208c05b3f3970d from qemu	2018-02-26 08:17:04 -05:00
Peter Maydell	200771d0ba	target-arm: Add trace events for the generic timers Backports commit 194cbc492bcc8f3f1868ec97a35146bc99c3c71c from qemu	2018-02-26 08:15:42 -05:00
Peter Maydell	158bfc109a	target-arm: Implement dummy MDCCINT_EL1 MDCCINT_EL1 is part of the DCC debugger communication channel between the CPU and an attached external debugger. QEMU doesn't implement this, but since Linux may try to access this register we need to provide at least a dummy implementation. Backports commit 5dbdc4342f479d799a1970dd5fd22e64c9dcd50d from qemu	2018-02-26 08:11:54 -05:00
Peter Maydell	f2dcb81b27	Fix masking of PC lower bits when doing exception returns In commit 9b6a3ea7a699594 store_reg() was changed to mask both bits 0 and 1 of the new PC value when in ARM mode. Unfortunately this broke the exception return code paths when doing a return from ARM mode to Thumb mode: in some of these we write a new CPSR including new Thumb mode bit via gen_helper_cpsr_write_eret(), and then use store_reg() to write the new PC. In this case if the new CPSR specified Thumb mode then masking bit 1 of the PC is incorrect (these code paths correspond to the v8 ARM ARM pseudocode function AArch32.ExceptionReturn(), which always aligns the new PC appropriately for the new instruction set state). Instead of using store_reg() in exception-return code paths, call a new store_pc_exc_ret() which stores the raw new PC value to env->regs[15], and then mask it appropriately in the subsequent helper_cpsr_write_eret() where the new env->thumb state is available. This fixes a bug introduced by 9b6a3ea7a699594 which caused crashes/hangs or otherwise bad behaviour for Linux when userspace was using Thumb. Backports commit fb0e8e79a9d77ee240dbca036fa8698ce654e5d1 from qemu	2018-02-26 08:09:28 -05:00
Thomas Hanson	c69ae10ca7	target-arm: Comments added to identify cases in a switch 3 cases in a switch in disas_exc() require reference to the ARM ARM spec in order to determine what case they're handling. Backports commit 957956b3013c8122a749dfe61a41aef8b4100e31 from qemu	2018-02-26 08:05:49 -05:00
Thomas Hanson	00d1803436	target-arm: Code changes to implement overwrite of tag field on PC load For BR, BLR and RET instructions, if tagged addresses are enabled, the tag field in the address must be cleared out prior to loading the address into the PC. Depending on the current EL, it will be set to either all 0's or all 1's. Backports commit 6feecb8b941f2d21e5645d0b6e0cdb776998121b from qemu	2018-02-26 08:04:00 -05:00
Thomas Hanson	2af4ca54e9	target-arm: Infrastucture changes to enable handling of tagged address loading into PC When capturing the current CPU state for the TB, extract the TBI0 and TBI1 values from the correct TCR for the current EL and then add them to the TB flags field. Then, at the start of code generation for the block, copy the TBI fields into the DisasContext structure. Backports commit 86fb3fa4ed5873b021a362ea26a021f4aeab1bb4 from qemu	2018-02-26 07:58:17 -05:00
Marc-André Lureau	be6e25bcc7	qapi: return a 'missing parameter' error The 'old' dispatch code returned a QERR_MISSING_PARAMETER for missing parameters, but the qapi qmp_dispatch() code uses QERR_INVALID_PARAMETER_TYPE. Improve qapi code to return QERR_MISSING_PARAMETER where appropriate. Fix expected error message in iotests. Backports commit 1382d4abdf9619985e4078e37e49e487cea9935e from qemu	2018-02-26 05:19:53 -05:00
Marc-André Lureau	ddc25c8aaf	qapi: assert list entry has a value This helps to figure out the expectations. Backports commit eac8e79ff749fc15e1dca4caccf1f38664ab4915 from qemu	2018-02-26 05:15:32 -05:00
Marc-André Lureau	bd469af15f	qapi: add assert about root value qiv->root should not be null, make that clearer with some assert. Backports commit 5d0cbbcfeb59e1e3f5ee7d26b8a215382f6d9abd from qemu	2018-02-26 05:15:01 -05:00
Marc-André Lureau	1a138915a5	qapi: Fix crash when 'any' or 'null' parameter is missing Unlike the other visit methods, visit_type_any() and visit_type_null() neglect to check whether qmp_input_get_object() succeeded. They crash when it fails. Reproducer: { "execute": "qom-set", "arguments": { "path": "/machine", "property": "rtc-time" } } Will crash with: qapi/qapi-visit-core.c:277: visit_type_any: Assertion `!err != !*obj' failed Broken in commit 5c678ee. Fix by adding the missing error checks. Backports commit c489780203f9b22aca5539ec7589b7140bdc951f from qemu	2018-02-26 05:13:54 -05:00
Alex Bennée	fbf6fb1e25	atomic.h: fix __SANITIZE_THREAD__ build Only very modern GCC's actually set this define when building with the ThreadSanitizer so this little typo slipped though. Backports commit 23ea7f57949f2f5934f4d5bbc29fe321b3a7067b from qemu	2018-02-26 05:12:17 -05:00
Alex Bennée	d4cb954102	cpu: atomically modify cpu->exit_request ThreadSanitizer picks up potential races although we already use barriers to ensure things are in the correct order when processing exit requests. For true C11 defined behaviour across threads we need to use relaxed atomic_set/atomic_read semantics to reassure tsan. Backports commit 027d9a7d2911e993cdcbd21c7c35d1dd058f05bb from qemu	2018-02-26 05:11:18 -05:00
Alex Bennée	e1cf9ca84a	qom/cpu: atomically clear the tb_jmp_cache The ThreadSanitizer rightly complains that something initialised with a normal access is later updated and read atomically. Backports commit ce7cf6a973f4b614162b9518954d441fa5e32fc6 from qemu	2018-02-26 05:09:05 -05:00
Alex Bennée	12d7e946a1	qom/object: update class cache atomically The idiom CPU_GET_CLASS(cpu) is fairly extensively used in various threads and trips of ThreadSanitizer due to the fact it updates obj->class->object_cast_cache behind the scenes. As this is just a fast-path cache there is no need to lock updates. However to ensure defined C11 behaviour across threads we need to use the plain atomic_read/set primitives and keep the sanitizer happy. Backports commit b6b3ccfda015dcd5ab50f70c189ee5cc6c622e91 from qemu	2018-02-26 05:06:40 -05:00
Alex Bennée	bf72733576	tcg/optimize: move default return out of if statement This is to appease sanitizer builds which complain that: "error: control reaches end of non-void function" Backports commit 550276ae0a88851edda2cb7fcdd64256dbb8e314 from qemu	2018-02-26 05:05:21 -05:00
Alex Bennée	4046235e92	atomic.h: comment on use of atomic_read/set Add some notes on the use of the relaxed atomic access helpers and their importance for defined behaviour in C11's multi-threaded memory model. Backports commit e653bc6b0ff645c25b8a2eb607c18a5c98b59db6 from qemu	2018-02-26 05:03:59 -05:00
Peter Maydell	f48d1fe391	target-arm: Correctly handle 'sub pc, pc, 1' for ARMv6 In the ARM v6 architecture, 'sub pc, pc, 1' is not an interworking branch, so the computed new value is written to r15 as a normal value. The architecture says that in this case, bits [1:0] of the value written must be ignored if we are in ARM mode (or bit [0] ignored if in Thumb mode); this is a change from the ARMv4/v5 specification that behaviour is UNPREDICTABLE. Use the correct mask on the PC value when doing a non-interworking store to PC. A popular library used on RaspberryPi uses this instruction as part of a trick to determine whether it is running on ARMv6 or ARMv7, and we were mishandling the sequence. Fixes bug: https://bugs.launchpad.net/bugs/1625295 Backports commit 9b6a3ea7a699594162ed3d11e4e04b98568dc5c0 from qemu	2018-02-26 05:02:32 -05:00
Edgar E. Iglesias	dedab81d68	target-arm: A64: Fix decoding of iss_sf in disas_ld_lit Fix the decoding of iss_sf in disas_ld_lit. The SF (Sixty-Four) field in the ISS (Instruction Specific Syndrome) is a bit that specifies the width of the register that the instruction loads to. If cleared it specifies 32 bits. If set it specifies 64 bits. Backports commit 173ff58580b383a7841b18fddb293038c9d40d1c from qemu	2018-02-26 05:01:33 -05:00
Evgeny Yakovlev	fa9d708fbd	target-i386: Correct family/model/stepping for Opteron_G3 Current CPU definition for AMD Opteron third generation includes features like SSE4a and LAHF_LM support in emulated CPUID. These features are present in K8 rev.E or K10 CPUs and later. However, current G3 family and model describe 2nd generation K8 cores instead. This is incorrect but was considered harmless until our tests found a problem with linux kernels >= 3.10 (and maybe earlier) which specifically check for Opteron K8 model when parsing CPUID leaf 0x80000001: http://lxr.free-electrons.com/source/arch/x86/kernel/cpu/amd.c?v=3.16#L552 This code will disable LAHF_LM feature in /proc/cpuinfo if model number is inconsistent. This change sets Opteron_G3 family/model/stepping to 16/2/3 which is a proper Opteron 3rd generation 2350 CPU. Backports commit 339892d758efb2d0954160d41736a0eac9875d67 from qemu	2018-02-26 04:59:18 -05:00
Eduardo Habkost	b7f434373b	target-i386: Report known CPUID[EAX=0xD,ECX=0]:EAX bits as migratable A regression was introduced by commit 96193c22a "target-i386: Move xsave component mask to features array": all CPUID[EAX=0xD,ECX=0]:EAX bits were being reported as unmigratable because they don't have feature names defined. This broke "-cpu host" because it enables only migratable features by default. This adds a new field to FeatureWordInfo: migratable_flags, which will make those features be reported as migratable even if they don't have a property name defined. Backports commit 6fb2fff75dceed1716e757882a6dfbadd9042407 from qemu	2018-02-26 04:58:05 -05:00
Alex Bennée	33589eb75f	cpus: pass CPUState to run_on_cpu helpers CPUState is a fairly common pointer to pass to these helpers. This means if you need other arguments for the async_run_on_cpu case you end up having to do a g_malloc to stuff additional data into the routine. For the current users this isn't a massive deal but for MTTCG this gets cumbersome when the only other parameter is often an address. This adds the typedef run_on_cpu_func for helper functions which has an explicit CPUState * passed as the first parameter. All the users of run_on_cpu and async_run_on_cpu have had their helpers updated to use CPUState where available. Backports commit e0eeb4a21a3ca4b296220ce4449d8acef9de9049 from qemu	2018-02-26 04:54:55 -05:00
Felipe Franciosi	0ed8880525	compiler: Swap 'public domain' header for license As discussed on the list [1], having a comment stating that this file is "public domain" is arguably wrong and not legally binding. This patch replaces that comment with a clear GPLv2+ license as proposed in [2]. [1] http://lists.nongnu.org/archive/html/qemu-devel/2016-09/msg06151.html [2] http://lists.nongnu.org/archive/html/qemu-devel/2016-09/msg06217.html Worth noting, compiler.h was originally created on 5c026320 by splitting qemu-common.h. At the time, qemu-common.h was already GPLv2+. Backports commit cc9d8a3b2c41c22fb09f90f3085e6036c199c3ca from qemu	2018-02-26 04:49:45 -05:00
Eduardo Habkost	49c04d7104	target-i386: Clear KVM CPUID features if KVM is disabled This will ensure all checks for features[FEAT_KVM] in the code will be correct in case the KVM CPUID leaf is completely disabled. Backports commit aec661de86894e914d2d82431d9cefa9a9a40213 from qemu	2018-02-26 04:47:05 -05:00
Eduardo Habkost	f29384c810	target-i386: Move xsave component mask to features array This will reuse the existing check/enforce logic in x86_cpu_filter_features() to check the xsave component bits against GET_SUPPORTED_CPUID. Backports commit 96193c22ab39ea24f81e386ad7883260ff24f5fd from qemu	2018-02-26 04:45:35 -05:00
Eduardo Habkost	3fb3e6672b	target-i386: xsave: Calculate set of xsave components on realize Instead of doing complex calculations and calling kvm_arch_get_supported_cpuid() inside cpu_x86_cpuid(), calculate the set of required XSAVE components earlier, at realize time. Backports commit 2ca8a8becc2eeb5262e478ce502f5daa53f3d0bc from qemu	2018-02-26 04:40:41 -05:00
Eduardo Habkost	28f002cbaf	target-i386: xsave: Helper function to calculate xsave area size Move the xsave area size calculation from cpu_x86_cpuid() inside its own function. While doing it, change it to use the XSAVE area struct sizes for the initial size, instead of the magic 0x240 number. Backports commit 1fda6198e4126af9988754c8824cfc9928649890 from qemu	2018-02-26 04:36:27 -05:00
Eduardo Habkost	c35e9eb9af	target-i386: xsave: Simplify CPUID[0xD,0].{EAX,EDX} calculation Instead of assigning individual bits in a loop, just copy the values from ena_mask. Backports commit 8057c621b1b17cbcb35fe67d1a09ada9055873a9 from qemu	2018-02-26 04:35:14 -05:00
Eduardo Habkost	c7195afd32	target-i386: xsave: Calculate enabled components only once Instead of checking both env->features and ena_mask at two different places in the CPUID code, initialize ena_mask based on the features that are enabled for the CPU, and then clear unsupported bits based on kvm_arch_get_supported_cpuid(). The results should be exactly the same, but it will make it easier to move the mask calculation elsewhare, and reuse x86_cpu_filter_features() for the kvm_arch_get_supported_cpuid() check. Backports commit 4928cd6de6b4211a79f98c8dc39115be1e815c2b from qemu	2018-02-26 04:33:18 -05:00
Eduardo Habkost	c3a0cba5b1	target-i386: Don't try to enable PT State xsave component The code that calculates the set of supported XSAVE components on CPUID looks at ext_save_areas to find out which components should be enabled. However, if there are zeroed entries in the ext_save_areas array, the ((env->features[esa->feature] & esa->bits) == esa->bits) check will always succeed and QEMU will unconditionally try to enable the component. Luckily this never caused any problems because the only missing entry in ext_save_areas is the PT State component (bit 8), and KVM currently doesn't support it (so it was cleared on ena_mask). But the code was still incorrect and would break if KVM starts returning CPUID[EAX=0xD,ECX=0].EAX[bit 8] as supported on GET_SUPPORTED_CPUID. Fix the problem by changing the code to not enable a XSAVE component if ExtSaveArea::bits is zero. Backports commit 9646f4927faf68e8690588c2fd6dc9834c440b58 from qemu	2018-02-26 04:30:35 -05:00
Eduardo Habkost	6188c6d6e4	target-i386: Move feature name arrays inside FeatureWordInfo It makes it easier to guarantee the arrays are the right size, and to find information when looking at the code. Backports commit 2d5312da566e4424a807d078da05f92ee7be3eec from qemu	2018-02-26 04:29:47 -05:00
Eduardo Habkost	74ae087743	target-i386: Enable CPUID[0x8000000A] if SVM is enabled SVM needs CPUID[0x8000000A] to be available. So if SVM is enabled in a CPU model or explicitly in the command-line, adjust CPUID xlevel to expose the CPUID[0x8000000A] leaf. Backports commit 0c3d7c0051576d220e6da0a8ac08f2d8482e2f0b from qemu	2018-02-26 04:05:47 -05:00
Eduardo Habkost	37406874ea	target-i386: Automatically set level/xlevel/xlevel2 when needed Instead of requiring users and management software to be aware of required CPUID level/xlevel/xlevel2 values for each feature, automatically increase those values when features need them. This was already done for CPUID[7].EBX, and is now made generic for all CPUID feature flags. Unit test included, to make sure we don't break ABI on older machine-types and don't mess with the CPUID level values if they are explicitly set by the user. Backports commit c39c0edf9bb3b968ba95484465a50c7b19f4aa3a from qemu	2018-02-26 04:03:09 -05:00
Eduardo Habkost	6861fe80cf	target-i386: Add a marker to end of the region zeroed on reset Instead of using cpuid_level, use an empty struct as a marker (like we already did with {start,end}_init_save). This will avoid accidentaly resetting the wrong fields if we change the field ordering on CPUX86State. Backports commit 5e992a8e337e710ea2d02f35668ac55a80e15f99 from qemu	2018-02-26 03:59:03 -05:00
Eduardo Habkost	c78d24b93c	target-i386: Remove unused X86CPUDefinition::xlevel2 field No CPU model in builtin_x86_defs has xlevel2 set, so it is always zero. Delete the field. Note that this is not an user-visible change. It doesn't remove the ability to set xlevel2 on the command-line, it just removes an unused field in builtin_x86_defs. Backports commit 0456441b5eb6694a561ad5bb8dad52483e6a08d0 from qemu	2018-02-26 03:57:02 -05:00
Leon Alrae	f60eca6930	target-mips: generate fences Make use of memory barrier TCG opcode in MIPS front end. Backports commit d208ac0c2e4cb43b74153bd584fc63c7b8a93ed6 from qemu	2018-02-26 03:52:35 -05:00
André Draszik	f14ece4aa1	target-mips: add 24KEc CPU definition Define a new CPU definition supporting 24KEc cores, similar to the existing 24Kc, but with added support for DSP instructions and MIPS16e (and without FPU). Backports commit e9deaad8a58c899dc32e9fdeff9e533070e79dca from qemu	2018-02-26 03:50:22 -05:00
Andrey Yurovsky	e24890a580	arm: add Cortex A7 CPU parameters Add the "cortex-a7" CPU with features and registers matching the Cortex-A7 MPCore Technical Reference Manual and the Cortex-A7 Floating-Point Unit Technical Reference Manual. The A7 is very similar to the A15. Backports commit dcf578ed8cec89543158b103940e854ebd21a8cf from qemu	2018-02-26 03:44:24 -05:00
Richard Henderson	552ef4b3e6	target-i386: Use struct X86XSaveArea in fpu_helper.c This avoids a double hand-full of magic numbers in the xsave and xrstor helper functions. Backports commit 3f32bd21df655e62eb271182a5c63280d631c7b3 from qemu	2018-02-26 03:38:53 -05:00
Richard Henderson	2ab4b8fa4d	tcg/i386: Extend TARGET_PAGE_MASK to the proper type TARGET_PAGE_MASK, as defined, has type "int". We need to extend that to the proper target width before oring in an "unsigned". Backports commit ebb90a005da67147245cd38fb04a965a87a961b7 from qemu	2018-02-26 03:32:38 -05:00
Pranith Kumar	16d71f0f10	tcg: Optimize fence instructions This commit optimizes fence instructions. Two optimizations are currently implemented: (1) unnecessary duplicate fence instructions, and (2) merging weaker fences into a stronger fence. [rth: Merge tcg_optimize_mb back into tcg_optimize, so that we only loop over the opcode stream once. Merge "unrelated" weaker barriers into one stronger barrier.] Backports commit 34f939218ce78163171addd63750e1e0300376ab from qemu	2018-02-26 03:29:59 -05:00
Pranith Kumar	533e083495	target-i386: Generate fences for x86 Backports commit cc19e497a047193db5083425957d7292c8dd3226 from qemu	2018-02-26 03:28:31 -05:00
Pranith Kumar	32b7cee81e	target-aarch64: Generate fences for aarch64 Backports commit ce1bd93f94e8d4b7117744e49652d2f907bed99f from qemu	2018-02-26 03:26:35 -05:00
Pranith Kumar	7849f8d72a	target-arm: Generate fences in ARMv7 frontend Backports commit 61e4c432ab26526bab0f3ef746c1861415b6da29 from qemu	2018-02-26 03:22:53 -05:00
Pranith Kumar	65a73763e3	tcg/sparc: Add support for fence Backports commit f8f03b3707b49898052fb8cd75ee31d19c8161fc from qemu	2018-02-26 03:20:39 -05:00
Pranith Kumar	a6fdc24e28	tcg/s390: Add support for fence Backports commit c9314d610e0e5da4d2cd5a36f3563d102b3294e0 from qemu	2018-02-26 03:19:41 -05:00
Pranith Kumar	bdd9cad15c	tcg/ppc: Add support for fence Backports commit 7b4af5ee8a1336bc39714b6de47924ee71fba761 from qemu	2018-02-26 03:18:43 -05:00
Pranith Kumar	5f10101245	tcg/mips: Add support for fence Backports commit 6f0b99104a396905870edc3049310ece29b6b8d6 from qemu	2018-02-26 03:17:34 -05:00
Pranith Kumar	e29cbe9640	tcg/arm: Add support for fence Backports commit 40f191ab8226fdada185efa49c44b60d8f494890 from qemu	2018-02-26 03:13:17 -05:00
Pranith Kumar	907060b865	tcg/aarch64: Add support for fence Backports commit c7a59c2a92592e556b9361437c9c4229917bd1e3 from qemu	2018-02-26 03:11:03 -05:00
Pranith Kumar	d49bd55f52	tcg/i386: Add support for fence Generate a 'lock orl $0,0(%esp)' instruction for ordering instead of mfence which has similar ordering semantics. Backports commit a7d00d4effb58889ac6df64f98ac50c9d1594149 from qemu	2018-02-26 03:10:58 -05:00
Pranith Kumar	5e44ce9be8	Introduce TCGOpcode for memory barrier This commit introduces the TCGOpcode for memory barrier instruction. This opcode takes an argument which is the type of memory barrier which should be generated. Backports commit f65e19bc2c9e8358e634d309606144ac2a3c2936 from qemu	2018-02-26 03:02:41 -05:00
Richard Henderson	66d79ac959	tcg: Merge GETPC and GETRA The return address argument to the softmmu template helpers was confused. In the legacy case, we wanted to indicate that there is no return address, and so passed in NULL. However, we then immediately subtracted GETPC_ADJ from NULL, resulting in a non-zero value, indicating the presence of an (invalid) return address. Push the GETPC_ADJ subtraction down to the only point it's required: immediately before use within cpu_restore_state_from_tb, after all NULL pointer checks have been completed. This makes GETPC and GETRA identical. Remove GETRA as the lesser used macro, replacing all uses with GETPC. Backports commit 01ecaf438b1eb46abe23392c8ce5b7628b0c8cf5 from qemu	2018-02-26 02:54:44 -05:00
Richard Henderson	91f5cf0417	tcg: Support arbitrary size + alignment Previously we allowed fully unaligned operations, but not operations that are aligned but with less alignment than the operation size. In addition, arm32, ia64, mips, and sparc had been omitted from the previous overalignment patch, which would have led to that alignment being enforced. Backports commit 85aa80813dd9f5c1f581c743e45678a3bee220f8 from qemu	2018-02-26 02:47:26 -05:00
Stanislav Shmarov	5f9552657e	target-i386: Fixed syscall posssible segfault In user-mode emulation env->idt.base memory is allocated in linux-user/main.c with size 8512 = 4096 (for 64-bit). When fake interrupt EXCP_SYSCALL is thrown do_interrupt_user checks destination privilege level for this fake exception, and tries to read 4 bytes at address base + (256 2^4)=4096, that causes segfault. Privlege level was checked only for int's, so lets read dpl from memory only for this case. Backports commit 885b7c44e4f8b7a012a92770a0dba8b238662caa from qemu	2018-02-26 02:36:09 -05:00
Paolo Bonzini	d8d0d08262	target-i386: fix ordering of fields in CPUX86State Make sure reset zeroes TSC_AUX, XCR0, PKRU. Move XSTATE_BV from the "vmstate only" section to the "KVM only" section. Backports commit 7616f1c2da1c0f336a474a56ad6d32e15ccd666e from qemu	2018-02-26 02:34:22 -05:00
Ladi Prosek	7acc14da16	Remove unused function declarations Unused function declarations were found using a simple gcc plugin and manually verified by grepping the sources. Backports commit d4b84d564ee3eb7a58e4585d671fb3c220b6c3b9 from qemu	2018-02-26 02:31:46 -05:00
Thomas Huth	b581d4033f	tcg: Remove duplicate header includes host-utils.h and timer.h are included twice in tcg.c. One time should be enough. Backports commit 347519eb9d68303a6c23a7663c0fa6c20a225191 from qemu	2018-02-26 02:29:38 -05:00
Lioncash	1ff9724b46	cutils: Remove unused vector ifdef block	2018-02-26 02:28:50 -05:00
Andrew Dutcher	26b36e5ff8	fpu: add mechanism to check for invalid long double formats All operations that take a floatx80 as an operand need to have their inputs checked for malformed encodings. In all of these cases, use the function floatx80_invalid_encoding to perform the check. If an invalid operand is found, raise an invalid operation exception, and then return either NaN (for fp-typed results) or the integer indefinite value (the minimum representable signed integer value, for int-typed results). For the non-quiet comparison operations, this touches adjacent code in order to pass style checks. Backports cast correction portion of commit d1eb8f2acba579830cf3798c3c15ce51be852c56m from qemu	2018-02-26 02:27:40 -05:00
Pranith Kumar	9e6fec8741	atomics: Use __atomic__n() variant primitives Use the __atomic__n() primitives which take the value as argument. It is not necessary to store the value locally before calling the primitive, hence saving us a stack store and load. Backports commit 89943de17c4e276f2c47f05b4604e8816a6a636c from qemu	2018-02-26 02:16:48 -05:00
Fam Zheng	1a2c30abbf	rules.mak: Don't extract libs from .mo-libs in link command For module build, .mo objects are passed to LINK and consumed in process-archive-undefs. The reason behind that is documented in the comment above process-archive-undefs. Similarly, extract-libs should be called with .mo filtered out too. Otherwise, the .mo-libs are added to the link command incorrectly, spoiling the purpose of modularization. Currently we don't have any .mo-libs usage, but it will be used soon when we modularize more multi-source objects, like sdl and gtk. Backports commit 5b1b6dbd94e2e2e98920f886cb32fcf4a1520b50 from qemu	2018-02-26 02:08:03 -05:00
Sergey Fedorov	58ff618708	tcg: rename tb_find_physical() In fact, this function does not exactly perform a lookup by physical address as it is descibed for comment on get_page_addr_code(). Thus it may be a bit confusing to have "physical" in it's name. So rename it to tb_htable_lookup() to better reflect its actual functionality. Backports commit b34de45fc40d01c14b31d3a682e284180a2ed8c5 from qemu	2018-02-26 02:07:06 -05:00
Sergey Fedorov	ab0c87bc6f	tcg: Merge tb_find_slow() and tb_find_fast() These functions are not too big and can be merged together. This makes locking scheme more clear and easier to follow. Backports commit bd2710d5da06ad7706d4864f65b3f0c9f7cb4d7f from qemu	2018-02-26 02:05:19 -05:00
Sergey Fedorov	9b6f287488	tcg: Avoid bouncing tb_lock between tb_gen_code() and tb_add_jump() Backports commit 74d356dd48b64eaa2a6104ac1493ca64cb31fa16 from qemu	2018-02-26 02:01:40 -05:00
Alex Bennée	09c3ef656e	tcg: cpu-exec: remove tb_lock from the hot-path Lock contention in the hot path of moving between existing patched TranslationBlocks is the main drag in multithreaded performance. This patch pushes the tb_lock() usage down to the two places that really need it: - code generation (tb_gen_code) - jump patching (tb_add_jump) The rest of the code doesn't really need to hold a lock as it is either using per-CPU structures, atomically updated or designed to be used in concurrent read situations (qht_lookup). To keep things simple I removed the #ifdef CONFIG_USER_ONLY stuff as the locks become NOPs anyway until the MTTCG work is completed. Backports commit 518615c6503ad78d3bb67ddf1cd848c4a41de02e from qemu	2018-02-26 01:58:33 -05:00
Alex Bennée	62aa0abd02	tcg: set up tb->page_addr before insertion This ensures that if we find the TB on the slow path that tb->page_addr is correctly set before being tested. Backports commit 2e1ae44a4f4a6149fbb9dc812243522f07284700 from qemu	2018-02-26 01:50:04 -05:00
Paolo Bonzini	30845ae475	tcg: Prepare TB invalidation for lockless TB lookup When invalidating a translation block, set an invalid flag into the TranslationBlock structure first. It is also necessary to check whether the target TB is still valid after acquiring 'tb_lock' but before calling tb_add_jump() since TB lookup is to be performed out of 'tb_lock' in future. Note that we don't have to check 'last_tb'; an already invalidated TB will not be executed anyway and it is thus safe to patch it. Backports commit 6d21e4208f382dd8ca1f7995a6dd9ea7ca281163 from qemu	2018-02-26 01:48:13 -05:00
Sergey Fedorov	c0dda5fbe9	tcg: Prepare safe access to tb_flushed out of tb_lock Ensure atomicity and ordering of CPU's 'tb_flushed' access for future translation block lookup out of 'tb_lock'. This field can only be touched from another thread by tb_flush() in user mode emulation. So the only access to be sequential atomic is: * a single write in tb_flush(); * reads/writes out of 'tb_lock'. In future, before enabling MTTCG in system mode, tb_flush() must be safe and this field becomes unnecessary. Backports commit 118b07308a8cedc16ef63d7ab243a95f1701db40 from qemu	2018-02-25 23:33:58 -05:00
Sergey Fedorov	9eb02a540d	tcg: Prepare safe tb_jmp_cache lookup out of tb_lock Ensure atomicity of CPU's 'tb_jmp_cache' access for future translation block lookup out of 'tb_lock'. Note that this patch does not make CPU's TLB invalidation safe if it is done from some other thread while the CPU is in its execution loop. Backports commit 89a16b1e4294e3664667a151c2f70c84dfac6fd9 from qemu	2018-02-25 23:29:18 -05:00
Sergey Fedorov	371101a184	tcg: Pass last_tb by value to tb_find_fast() This is a small clean up. tb_find_fast() is a final consumer of this variable so no need to pass it by reference. 'last_tb' is always updated by subsequent cpu_loop_exec_tb() in cpu_exec(). This change also simplifies calling cpu_exec_nocache() in cpu_handle_exception(). Backports commit 4b7e69509df2fcbfdab8c62c294dbfcfdab8a6e1 from qemu	2018-02-25 23:23:22 -05:00
Cao jin	cc45b82472	timer/cpus: fix some typos and update some comments Backports commit 3224e8786fcbe531746f1530c37210c425625213 from qemu	2018-02-25 23:21:57 -05:00
Paolo Bonzini	57fff7a94b	target-m68k: fix get_mac_extf helper val is assigned twice; the second one should be combined with "\|". Reported by Coverity. Backports commit 5ce747cfac697f61668ab4fa4a71c1dba15cc272 from qemu	2018-02-25 23:21:05 -05:00
Thomas Huth	aed5df31b7	sparc: Use g_memdup() instead of g_new0() + memcpy() There is no need to make sure that the memory is zeroed after the allocation if we also immediatly fill the whole buffer afterwards with memcpy(). Thus g_new0 should be g_new instead. But since we are also doing a memcpy() here, we can also simply replace both with g_memdup() instead. Backports commit a337f295defad7eb977da4d6317cf70f7f2fa4b4 from qemu	2018-02-25 23:19:44 -05:00
Peter Maydell	eb77f61bea	configure: Always compile with -fwrapv QEMU's code relies on left shifts of signed integers always being defined behaviour with the obvious 2s-complement semantics. The only way to tell the compiler (and any associated undefined-behaviour sanitizer) that we require a C dialect with these semantics is to use the -fwrapv option. This is a bit of a heavy hammer for the job as it also gives us guaranteed semantics on integer arithmetic overflow which in theory we don't require. In an ideal world this would allow us to drop the warning flag -Wno-shift-negative-value, but we must retain this to avoid spurious warnings on clang versions predating the fix to https://llvm.org/bugs/show_bug.cgi?id=25552. Backports commit 2d31515bc0880a1cea86ce638d2a109f4f4e6f7d from qemu	2018-02-25 23:17:41 -05:00
Longpeng(Mike)	8b5400d675	target-i386: present virtual L3 cache info for vcpus Some software algorithms are based on the hardware's cache info, for example, for x86 linux kernel, when cpu1 want to wakeup a task on cpu2, cpu1 will trigger a resched IPI and told cpu2 to do the wakeup if they don't share low level cache. Oppositely, cpu1 will access cpu2's runqueue directly if they share llc. The relevant linux-kernel code as bellow: static void ttwu_queue(struct task_struct p, int cpu) { struct rq rq = cpu_rq(cpu); ...... if (... && !cpus_share_cache(smp_processor_id(), cpu)) { ...... ttwu_queue_remote(p, cpu); /* will trigger RES IPI / return; } ...... ttwu_do_activate(rq, p, 0); / access target's rq directly / ...... } In real hardware, the cpus on the same socket share L3 cache, so one won't trigger a resched IPIs when wakeup a task on others. But QEMU doesn't present a virtual L3 cache info for VM, then the linux guest will trigger lots of RES IPIs under some workloads even if the virtual cpus belongs to the same virtual socket. For KVM, there will be lots of vmexit due to guest send IPIs. The workload is a SAP HANA's testsuite, we run it one round(about 40 minuates) and observe the (Suse11sp3)Guest's amounts of RES IPIs which triggering during the period: No-L3 With-L3(applied this patch) cpu0: 363890 44582 cpu1: 373405 43109 cpu2: 340783 43797 cpu3: 333854 43409 cpu4: 327170 40038 cpu5: 325491 39922 cpu6: 319129 42391 cpu7: 306480 41035 cpu8: 161139 32188 cpu9: 164649 31024 cpu10: 149823 30398 cpu11: 149823 32455 cpu12: 164830 35143 cpu13: 172269 35805 cpu14: 179979 33898 cpu15: 194505 32754 avg: 268963.6 40129.8 The VM's topology is "1socket 8cores 2threads". After present virtual L3 cache info for VM, the amounts of RES IPIs in guest reduce 85%. For KVM, vcpus send IPIs will cause vmexit which is expensive, so it can cause severe performance degradation. We had tested the overall system performance if vcpus actually run on sparate physical socket. With L3 cache, the performance improves 7.2%~33.1%(avg:15.7%). Backports commit 14c985cffa6cb177fc01a163d8bcf227c104718c from qemu	2018-02-25 23:16:14 -05:00
Lioncash	2d87095858	glib_compat: Amend header guard	2018-02-25 23:12:20 -05:00
Sergey Sorokin	a882118050	target-arm: Fix lpae bit in FSR on an alignment fault If an alignment fault occurred and target EL is using AArch32, then DFSR/IFSR bit LPAE[9] must be set correctly. Backports commit e0fe723c24562c8f909bb40f131bfdbe75650677 from qemu	2018-02-25 23:10:29 -05:00
Luwei Kang	af7b3995dd	target-i386: Add more Intel AVX-512 instructions support Add more AVX512 feature bits, include AVX512DQ, AVX512IFMA, AVX512BW, AVX512VL, AVX512VBMI. Its spec can be found at: https://software.intel.com/sites/default/files/managed/b4/3a/319433-024.pdf Backports commit cc728d1493eee3e20c1547191862e43d3f55e714 from qemu	2018-02-25 23:09:18 -05:00
Alex Williamson	fe66c2e088	memory: Don't use memcpy for ram_device regions With a vfio assigned device we lay down a base MemoryRegion registered as an IO region, giving us read & write accessors. If the region supports mmap, we lay down a higher priority sub-region MemoryRegion on top of the base layer initialized as a RAM device pointer to the mmap. Finally, if we have any quirks for the device (ie. address ranges that need additional virtualization support), we put another IO sub-region on top of the mmap MemoryRegion. When this is flattened, we now potentially have sub-page mmap MemoryRegions exposed which cannot be directly mapped through KVM. This is as expected, but a subtle detail of this is that we end up with two different access mechanisms through QEMU. If we disable the mmap MemoryRegion, we make use of the IO MemoryRegion and service accesses using pread and pwrite to the vfio device file descriptor. If the mmap MemoryRegion is enabled and results in one of these sub-page gaps, QEMU handles the access as RAM, using memcpy to the mmap. Using either pread/pwrite or the mmap directly should be correct, but using memcpy causes us problems. I expect that not only does memcpy not necessarily honor the original width and alignment in performing a copy, but it potentially also uses processor instructions not intended for MMIO spaces. It turns out that this has been a problem for Realtek NIC assignment, which has such a quirk that creates a sub-page mmap MemoryRegion access. To resolve this, we disable memory_access_is_direct() for ram_device regions since QEMU assumes that it can use memcpy for those regions. Instead we access through MemoryRegionOps, which replaces the memcpy with simple de-references of standard sizes to the host memory. With this patch we attempt to provide unrestricted access to the RAM device, allowing byte through qword access as well as unaligned access. The assumption here is that accesses initiated by the VM are driven by a device specific driver, which knows the device capabilities. If unaligned accesses are not supported by the device, we don't want them to work in a VM by performing multiple aligned accesses to compose the unaligned access. A down-side of this philosophy is that the xp command from the monitor attempts to use the largest available access weidth, unaware of the underlying device. Using memcpy had this same restriction, but at least now an operator can dump individual registers, even if blocks of device memory may result in access widths beyond the capabilities of a given device (RTL NICs only support up to dword). Backports commit 1b16ded6a512809f99c133a97f19026fe612b2de from qemu	2018-02-25 23:06:36 -05:00
Alex Williamson	5db45219c9	memory: Replace skip_dump flag with ram_device Setting skip_dump on a MemoryRegion allows us to modify one specific code path, but the restriction we're trying to address encompasses more than that. If we have a RAM MemoryRegion backed by a physical device, it not only restricts our ability to dump that region, but also affects how we should manipulate it. Here we recognize that MemoryRegions do not change to sometimes allow dumps and other times not, so we replace setting the skip_dump flag with a new initializer so that we know exactly the type of region to which we're applying this behavior. Backports commit ca83f87a66d19fdaabf23d4f5ebb49396fe232c1 from qemu	2018-02-25 23:00:45 -05:00
Pranith Kumar	1b19fe260a	softfloat: Fix warn about implicit conversion from int to int8_t Change the flag type to 'uint8_t' to fix the implicit conversion error. Backports commit dfd607671037ff46d5b16ade10e10efdf0d260be from qemu	2018-02-25 22:54:39 -05:00
Pranith Kumar	4c880fba9d	target-arm: Fix warn about implicit conversion Clang warns about an implicit conversion as follows: /mnt/devops/code/qemu/target-arm/neon_helper.c:1075:1: warning: implicit conversion from 'int' to 'int8_t' (aka 'signed char') changes value from 128 to -128 [-Wconstant-conversion] NEON_VOP_ENV(qrshl_s8, neon_s8, 4) ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /mnt/devops/code/qemu/target-arm/neon_helper.c:116:83: note: expanded from macro 'NEON_VOP_ENV' uint32_t HELPER(glue(neon_,name))(CPUARMState env, uint32_t arg1, uint32_t arg2) \ ^ /mnt/devops/code/qemu/target-arm/neon_helper.c:106:5: note: expanded from macro '\ NEON_VOP_BODY' NEON_DO##n; \ ^~~~~~~~~~ <scratch space>:21:1: note: expanded from here NEON_DO4 ^~~~~~~~ /mnt/devops/code/qemu/target-arm/neon_helper.c:93:5: note: expanded from macro 'NEON_DO4' NEON_FN(vdest.v1, vsrc1.v1, vsrc2.v1); \ ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /mnt/devops/code/qemu/target-arm/neon_helper.c:1054:23: note: expanded from macro 'NEON_FN' dest = (1 << (sizeof(src1) 8 - 1)); \ ~ ~~^~~~~~~~~~~~~~~~~~~~~~~~~ Fix it by casting to appropriate type. Backports commit 6bbbb0ac136102098a70b97ab0c07bc7bf53131c from qemu	2018-02-25 22:44:43 -05:00
Richard Henderson	ede1cae3dc	tcg: Lower indirect registers in a separate pass Rather than rely on recursion during the middle of register allocation, lower indirect registers to loads and stores off the indirect base into plain temps. For an x86_64 host, with sufficient registers, this results in identical code, modulo the actual register assignments. For an i686 host, with insufficient registers, this means that temps can be (temporarily) spilled to the stack in order to satisfy an allocation. This as opposed to the possibility of not being able to spill, to allocate a register for the indirect base, in order to perform a spill. Backports commit 5a18407f55ade924aa6397c9a043a9ffd59645fe from qemu	2018-02-25 22:32:28 -05:00
Richard Henderson	8a012ff6d3	tcg: Require liveness analysis Backports commit c0ef05b5e62ab0c291a94022f14104e61e306f03 from qemu	2018-02-25 22:20:42 -05:00
Lioncash	541601edc4	util: Move qemu-log to utils	2018-02-25 22:17:44 -05:00
Richard Henderson	2aa46dd9a1	tcg: Include liveness info in the dumps Backports commit bdfb460ef77500f7b186759b585f06ff2120929d from qemu	2018-02-25 22:13:08 -05:00
Richard Henderson	e973e89a57	tcg: Compress dead_temps and mem_temps into a single array We only need two bits per temporary. Fold the two bytes into one, and reduce the memory and cachelines required during compilation. Backports commit c70fbf0a9938baf3b4f843355a77c17a7e945b98 from qemu	2018-02-25 22:07:08 -05:00
Richard Henderson	690985a582	tcg: Fold life data into TCGOp Reduce the size of other bitfields to make room. This reduces the cache footprint of compilation. Backports commit bee158cb4dde35c41632a3a129c869f14a32f8f0 from qemu	2018-02-25 21:49:42 -05:00
Lioncash	b5e765d562	target-mips: Silence unused function warning	2018-02-25 21:47:22 -05:00
Richard Henderson	1547048a22	tcg: Reorg TCGOp chaining Instead of using -1 as end of chain, use 0, and link through the 0 entry as a fully circular double-linked list. Backports commit dcb8e75870e2de199db853697f8839cb603beefe from qemu	2018-02-25 21:44:50 -05:00
Richard Henderson	b2e6e351c2	tcg: Compress liveness data to 16 bits This reduces both memory usage and per-insn cacheline usage during code generation. Backports commit a1b3c48d2b23d6eaeb4529d3e1183d2648731bf8 from qemu	2018-02-25 21:27:24 -05:00
Eric Blake	30cbcafc05	osdep: Document differences in rounding macros Make it obvious which macros are safe in which situations. Useful since QEMU_ALIGN_UP and ROUND_UP both purport to do the same thing, but differ on whether the alignment must be a power of 2.	2018-02-25 21:05:21 -05:00
Leon Alrae	bc434da124	target-mips: fix EntryHi.EHINV being cleared on TLB exception While implementing TLB invalidation feature we forgot to modify part of code responsible for updating EntryHi during TLB exception. Consequently EntryHi.EHINV is unexpectedly cleared on the exception. Backports commit 701074a6fc7470d0ed54e4a4bcd4d491ad8da22e from qemu	2018-02-25 21:02:31 -05:00
Igor Mammedov	943b9fc261	qdev: Fix object reference leak in case device.realize() fails If device doesn't have parent assined before its realize is called, device_set_realized() will implicitly set parent to '/machine/unattached'. However device_set_realized() may fail after that point at several other points leaving not realized object dangling in '/machine/unattached' and as result caller of obj = object_new() obj->ref == 1 object_property_set_bool(obj,..., true, "realized",...) obj->ref == 2 if (fail) object_unref(obj); obj->ref == 1 will get object leak instead of expected object destruction. Fix it by making device_set_realized() to cleanup after itself in case of failure. Backports commit 69382d8b3e8600b349c191394d761dcb480502cf from qemu	2018-02-25 21:00:26 -05:00
Igor Mammedov	62c89b9cd4	exec: Reduce CONFIG_USER_ONLY ifdeffenery Backports commit 1bc7e522d9cf1b58f2de9c8f1737be0bb5129c35 from qemu	2018-02-25 20:57:48 -05:00
Igor Mammedov	d30410dc9a	target-i386: Add x86_cpu_unrealizefn() First remove VCPU from exec loop and only then remove lapic. Backports commit c884776e9dc947105827bd6c22192863f97267d2 from qemu	2018-02-25 20:54:13 -05:00
Igor Mammedov	298b0e6529	target-i386: Fix apic object leak when CPU is deleted Backports commit 67e55caa6dcb91c80428cee6fe463f8dd8a755ab from qemu	2018-02-25 20:48:40 -05:00
Igor Mammedov	e15fb246ab	target-i386: cpu: Do not ignore error and fix apic parent object_property_add_child() silently fails with error that it can't create duplicate propery 'apic' as we already have 'apic' property registered for 'apic' feature. As result generic device_realize puts apic into unattached container. As it's programming error, abort if name collision happens in future and fix property name for apic_state to 'lapic', this way apic is a child of cpu instance. Backports commit 6816b1b3811e839540df22855d975b6d76ae438b from qemu	2018-02-25 20:47:46 -05:00
Paolo Bonzini	403021183d	target-i386: Add support for UMIP and RDPID CPUID bits These are both stored in CPUID[EAX=7,EBX=0].ECX. KVM is going to be able to emulate both (albeit with a performance loss in the case of RDPID, which therefore will be in KVM_GET_EMULATED_CPUID rather than KVM_GET_SUPPORTED_CPUID). It's also possible to implement both in TCG, but this is for 2.8. Backports commit c2f193b538032accb9db504998bf2ea7c0ef65af from qemu	2018-02-25 20:46:40 -05:00
Igor Mammedov	6714284211	target-i386: Add socket/core/thread properties to X86CPU These properties will be used by as address where to plug CPU with help -device/device_add commands. Backports commit d89c2b8b98e097b9cad5104b0f178bde1cfa011b from qemu	2018-02-25 20:45:35 -05:00
Igor Mammedov	2ac9df3633	target-i386: Replace custom apic-id setter/getter with static property Custom apic-id setter/getter doesn't do any property specific checks anymore, so clean it up and use more compact static property DEFINE_PROP_UINT32 instead. Backports commit 2da00e3176abac34ca7a6aab1f5bbb94a0d03fc5 from qemu	2018-02-25 20:44:18 -05:00
Igor Mammedov	0525a9c9fa	pc: cpu: Consolidate apic-id validity checks in pc_cpu_pre_plug() Machine code knows about all possible APIC IDs so use that instead of hack which does O(n^2) complexity duplicate checks, interating over global CPUs list. As result duplicate check is done only once with O(log n) complexity. Backports commit 4ec60c76d5ab513e375f17b043d2b9cb849adf6c from qemu	2018-02-25 20:38:43 -05:00
Dr. David Alan Gilbert	9ee1a82185	target-i386: Set physical address bits based on host Add the host-phys-bits boolean property, if true, take phys-bits from the hosts physical bits value, overriding either the default or the user specified value. We can also use the value we read from the host to check the users explicitly set value and warn them if it doesn't match. Note: a) We only read the hosts value in KVM mode (because on non-x86 we get an abort if we try) b) We don't warn about trying to use host-phys-bits in TCG mode, we just fall back to the TCG default. This allows the machine type to set the host-phys-bits flag if it wants and then to work in both TCG and KVM. Backports commit 11f6fee576680a2d482123535da920f8ceb33eb5 from qemu	2018-02-25 20:36:12 -05:00
Igor Mammedov	95cced34fb	pc: Add x86_topo_ids_from_apicid() It's reverse of apicid_from_topo_ids() and will be used in follow up patches to fill in data structures for query-hotpluggable-cpus and for user friendly error reporting. Backports commit 9f3aab58539b4cc716e42e772be8116dc2e7d159 from qemu	2018-02-25 20:31:36 -05:00
Igor Mammedov	bc8dbd862d	target-i386: Use uint32_t for X86CPU.apic_id Redo 9886e834 (target-i386: Require APIC ID to be explicitly set before CPU realize) in another way that doesn't use int64_t to detect if apic-id property has been set. Use the fact that 0xFFFFFFFF is the broadcast value that a CPU can't have and set default uint32_t apic_id to it instead of using int64_t. Later uint32_t apic_id will be used to drop custom property setter/getter in favor of static property. Backports commit d9c84f196970f78d4b55ab87e03cbcad7c65f86f from qemu	2018-02-25 20:30:31 -05:00
Dr. David Alan Gilbert	54851f7d74	target-i386: Fill high bits of mtrr mask Fill the bits between 51..number-of-physical-address-bits in the MTRR_PHYSMASKn variable range mtrr masks so that they're consistent in the migration stream irrespective of the physical address space of the source VM in a migration. Backports commit fcc35e7ccaed771790940524f3b0eef7aebfc9b1 from qemu	2018-02-25 20:29:20 -05:00
Dr. David Alan Gilbert	78254267ff	target-i386: Allow physical address bits to be set Currently QEMU sets the x86 number of physical address bits to the magic number 40. This is only correct on some small AMD systems; Intel systems tend to have 36, 39, 46 bits, and large AMD systems tend to have 48. Having the value different from your actual hardware is detectable by the guest and in principal can cause problems; The current limit of 40 stops TB VMs being created by those lucky enough to have that much. This patch lets you set the physical bits by a cpu property but defaults to the same 40bits which matches TCGs setup. I've removed the ancient warning about the 42 bit limit in exec.c; I can't find that limit in there and no one else seems to know where it is. We use a magic value of 0 as the property default so that we can later distinguish between the default and a user set value. Backports commit af45907a132857cfd47acc998bf5f7c26cd13071 from qemu	2018-02-25 20:28:38 -05:00
Dr. David Alan Gilbert	7cb359cc19	target-i386: Provide TCG_PHYS_ADDR_BITS Provide a constant for the number of address bits supported under TCG. Backports commit 709787ee997f0a0ccab78e0edaf10d48929151ee from qemu	2018-02-25 20:23:25 -05:00
Eric Blake	23ab6d81f9	qapi: Implement boxed types for commands/events Turn on the ability to pass command and event arguments in a single boxed parameter, which must name a non-empty type (although the type can be a struct with all optional members). For structs, it makes it possible to pass a single qapi type instead of a breakout of all struct members (useful if the arguments are already in a struct or if the number of members is large); for other complex types, it is now possible to use a union or alternate as the data for a command or event. The empty type may be technically feasible if needed down the road, but it's easier to forbid it now and relax things to allow it later, than it is to allow it now and have to special case how the generated 'q_empty' type is handled (see commit 7ce106a9 for reasons why nothing is generated for the empty type). An alternate type is never considered empty, but now that a boxed type can be either an object or an alternate, we have to provide a trivial QAPISchemaAlternateType.is_empty(). The new call to arg_type.is_empty() during QAPISchemaCommand.check() requires that we first check the type in question; but there is no chance of introducing a cycle since objects do not refer back to commands. We still have a split in syntax checking between ad-hoc parsing up front (merely validates that 'boxed' has a sane value) and during .check() methods (if 'boxed' is set, then 'data' must name a non-empty user-defined type). Generated code is unchanged, as long as no client uses the new feature. Backports commit c818408e449ea55371253bd4def1c1dc87b7bb03 from qemu	2018-02-25 20:22:03 -05:00
Eric Blake	c65f056fbe	qapi: Plumb in 'boxed' to qapi generator lower levels The next patch will add support for passing a qapi union type as the 'data' of a command. But to do that, the user function for implementing the command, as called by the generated marshal command, must take the corresponding C struct as a single boxed pointer, rather than a breakdown into one parameter per member. Even without a union, being able to use a C struct rather than a list of parameters can make it much easier to handle coding with QAPI. This patch adds the internal plumbing of a 'boxed' flag associated with each command and event. In several cases, this means adding indentation, with one new dead branch and the remaining branch being the original code more deeply nested; this was done so that the new implementation in the next patch is easier to review without also being mixed with indentation changes. For this patch, no behavior or generated output changes, other than the testsuite outputting the value of the new flag (always False for now). Backports commit 48825ca419fd9c8140d4fecb24e982d68ebca74f from qemu	2018-02-25 20:17:01 -05:00
Eric Blake	6ff318b839	qapi-event: Simplify visit of non-implicit data Commit 7ce106a9 documented why we don't generated a visit_type_FOO() for implicit types; and therefore events with an anonymous type for 'data' have to open-code a visit. Note that the open-coded visit in qapi-event.c is slightly different from what is done in qapi-visit.c for normal types, in part because we don't have to check for obj being NULL or free things on error. But where the type is not implicit, it is nicer to reuse the normal visit instead of open-coding a duplicate. At the moment, the only event with a non-implicit 'data' is in the testsuite, where test-qapi-event.c changes as follows: \|@@ -155,6 +155,7 @@ void qapi_event_send___org_qemu_x_event( \| __org_qemu_x_Struct param = { \| __org_qemu_x_member1, (char )__org_qemu_x_member2, has_q_wchar_t, q_wchar_t \| }; \|+ __org_qemu_x_Struct *arg = &param; \| \| emit = qmp_event_get_func_emit(); \| if (!emit) { \|@@ -164,16 +165,7 @@ void qapi_event_send___org_qemu_x_event( \| qmp = qmp_event_build_dict("__ORG.QEMU_X-EVENT"); \| \| v = qmp_output_visitor_new(&obj); \|- \|- visit_start_struct(v, "__ORG.QEMU_X-EVENT", NULL, 0, &err); \|- if (err) { \|- goto out; \|- } \|- visit_type___org_qemu_x_Struct_members(v, &param, &err); \|- if (!err) { \|- if (!err) { \|- visit_check_struct(v, &err); \|- } \|- visit_end_struct(v, NULL); \|+ visit_type___org_qemu_x_Struct(v, "__ORG.QEMU_X-EVENT", &arg, &err); \| if (err) { \| goto out; \| } Backports commit 4d0b268fdb17a1fed10fe980e77fd388e5427bfd from qemu	2018-02-25 20:12:34 -05:00
Eric Blake	b5220a6867	qapi: Drop useless gen_err_check() Ever since commit 12f254f removed the last parameterization of gen_err_check(), it no longer makes sense to hide the three lines of generated C code behind a macro call. Just inline it into the remaining users. No change to generated code. Backports commit fa274ed6fb788866ed3a2cfd54a2ddf78f04f2c0 from qemu	2018-02-25 20:10:45 -05:00
Eric Blake	d7014c66df	qapi: Add type.is_empty() helper In the near future, we want to lift our artificial restriction of no variants at the top level of an event, at which point the currently open-coded check for empty members will become insufficient. Factor it out into a new helper method is_empty() now, and future-proof it by checking variants, too, along with an assert that it is not used prior to the completion of .check(). Update places that were checking for (non-)empty .members to use the new helper. All of the current callers assert that there are no variants (either directly, or by qapi.py asserting that base types have no variants), so this is not a semantic change. No change to generated code. Backports commit b6167706829c6e0d3572daa2b6769594ced276f7 from qemu	2018-02-25 20:07:43 -05:00
Eric Blake	4b39eaae33	qapi: Hide tag_name data member of variants Clean up the only remaining external use of the tag_name field of QAPISchemaObjectTypeVariants, by explicitly listing the generated 'type' tag for all variants in the testsuite (you can still tell simple unions by the -wrapper types). Then we can mark the tag_name field as private by adding a leading underscore to prevent any further use. Backports commit da9cb19385fc66b2cb2584bbbbcbf50246d057e2 from qemu	2018-02-25 20:06:15 -05:00
Eric Blake	febeea5f4b	qapi: Special case c_name() for empty type Commit 7ce106a rendered QAPISchemaObjectType.c_name() redundant, since it now does nothing more than delegate to its superclass. However, rather than deleting it, we can restore part of the assertion that was removed in that commit, to prove that we never emit the empty type directly in generated code, but rather special-case it as a built-in that makes other aspects of code generation easier to reason about. Backports commit cd50a2564560986e865ff64fa73b59d2564076f0 from qemu	2018-02-25 20:05:16 -05:00
Eric Blake	8ccfff95fe	qapi: Require all branches of flat union enum to be covered We were previously enforcing that all flat union branches were found in the corresponding enum, but not that all enum values were covered by branches. The resulting generated code would abort() if the user passes the uncovered enum value. We don't automatically treat non-present branches in a flat union as empty types, for symmetry with simple unions (there, the enum type is generated from the list of all branches, so there is no way to omit a branch but still have it be part of the union). A later patch will add shorthand so that branches that are empty in flat unions can be declared as 'branch':{} instead of 'branch':'Empty', to avoid the need for an otherwise useless explicit empty type. [Such shorthand for simple unions is a bit harder to justify, since we would still have to generate a wrapper type that parses 'data':{}, rather than truly being an empty branch with no additional siblings to the 'type' member.] Backports commit d0b182392d0281ef780e3effcb82677a004f1f97 from qemu	2018-02-25 20:04:18 -05:00
Paolo Bonzini	674805745b	qapi: change QmpInputVisitor to QSLIST This saves a lot of memory compared to a statically-sized array, or at least 24kb could be considered a lot on an Atari ST. It also makes the code more similar to QmpOutputVisitor. This removes the limit on the depth of a QObject that can be processed into a QAPI tree. This is not a problem because QObjects can be considered trusted; the text received on the QMP wire is untrusted input, but the JSON parser already takes pains to limit the QObject tree it creates. We don't need the QMP input visitor to limit it again. Backports commit 3d344c2aabb7bc9b414321e3c52872901edebdda from qemu	2018-02-25 20:02:09 -05:00
Paolo Bonzini	b14f1d7a80	qapi: change QmpOutputVisitor to QSLIST This saves a little memory compared to the doubly-linked QTAILQ. Backports commit fc76ae8b38783e82c109834573ba5d6f080440b5 from qemu	2018-02-25 19:59:16 -05:00
Sergey Fedorov	e39b9d0391	target-i386: Remove redundant HF_SOFTMMU_MASK 'HF_SOFTMMU_MASK' is only set when 'CONFIG_SOFTMMU' is defined. So there's no need in this flag: test 'CONFIG_SOFTMMU' instead. Backports commit da6d48e3348bbc266896cf8adf0c33f1eaf5b31f from qemu	2018-02-25 19:59:15 -05:00
Sergey Sorokin	4a904baaf5	target-arm: Add missed AArch32 TLBI sytem registers Some PL2 related TLBI system registers are missed in AArch32 implementation. The patch fixes it. Backports commit 541ef8c2e73fb99d173b125bef7c262fdd2fe33c from qemu	2018-02-25 19:59:15 -05:00
Peter Lieven	799bf1c3a5	exec: avoid realloc in phys_map_node_reserve this is the first step in reducing the brk heap fragmentation created by the map->nodes memory allocation. Since the introduction of RCU the freeing of the PhysPageMaps is delayed so that sometimes several hundred are allocated at the same time. Even worse the memory for map->nodes is allocated and shortly afterwards reallocated. Since the number of nodes it grows to in the end is the same for all PhysPageMaps remember this value and at least avoid the reallocation. The large number of simultaneous allocations (about 450 x 70kB in my configuration) has to be addressed later. Backports commit 101420b886eec36990419bc9ed5b503622af8a0d from qemu	2018-02-25 19:32:40 -05:00
Paolo Bonzini	a47c68164d	compiler: never omit assertions if using a static analysis tool Assertions help both Coverity and the clang static analyzer avoid false positives, but on the other hand both are confused when the condition is compiled as (void)(x != FOO). Always expand assertion macros when using Coverity or clang, through a new QEMU_STATIC_ANALYSIS preprocessor symbol. This fixes a couple false positives in TCG. Backports commit 8bff06a0bbf257a2083223534c1607bf87d913e6 from qemu	2018-02-25 19:19:28 -05:00
Vijay	5608b53b6f	target-arm: Use Neon for zero checking Use Neon instructions to perform zero checking of buffer. This is helps in reducing total migration time. Use case: Idle VM live migration with 4 VCPUS and 8GB ram running CentOS 7. Without Neon, the Total migration time is 3.5 Sec Migration status: completed total time: 3560 milliseconds downtime: 33 milliseconds setup: 5 milliseconds transferred ram: 297907 kbytes throughput: 685.76 mbps remaining ram: 0 kbytes total ram: 8519872 kbytes duplicate: 2062760 pages skipped: 0 pages normal: 69808 pages normal bytes: 279232 kbytes dirty sync count: 3 With Neon, the total migration time is 2.9 Sec Migration status: completed total time: 2960 milliseconds downtime: 65 milliseconds setup: 4 milliseconds transferred ram: 299869 kbytes throughput: 830.19 mbps remaining ram: 0 kbytes total ram: 8519872 kbytes duplicate: 2064313 pages skipped: 0 pages normal: 70294 pages normal bytes: 281176 kbytes dirty sync count: 3 Backports commit 7069532e3b944c25707d4f69998e68a739eabff9 from qemu	2018-02-25 19:17:38 -05:00
Richard Henderson	d17dc29d2e	target-sparc: Elide duplicate updates to fprs Backports commit f9c816c00cf4242542472ae6b2a579b11b7e86f1 from qemu	2018-02-25 19:14:59 -05:00
Richard Henderson	2215ef7e21	target-sparc: Use cpu_loop_exit_restore from helper_check_ieee_exceptions This avoids needing to save state before every FP operation. Backports commit 02c79d78853f07d519b3272d06e43041eb4a4105 from qemu	2018-02-25 19:12:36 -05:00
Richard Henderson	524e4af5ca	target-sparc: Use cpu_fsr in stfsr Backports commit ba2397d1ca6546e8cf5bd9e2939923546ac3091a from qemu	2018-02-25 19:10:27 -05:00
Lioncash	17c54e2702	header_gen: alphabetize general symbols	2018-02-25 19:07:20 -05:00
Lioncash	4b8cae3f61	header_gen: alphabetize ARM symbols	2018-02-25 19:00:31 -05:00
Lioncash	fa10382007	header_gen: alphabetize aarch64 symbols	2018-02-25 19:00:01 -05:00
Lioncash	3f8802fcf5	header_gen: alphabetize MIPS symbols	2018-02-25 18:59:49 -05:00
Richard Henderson	12eecc4939	target-sparc: Use explicit writes to cpu_fsr By arranging for explicit writes to cpu_fsr after floating point operations, we are able to mark the helpers as not writing to tcg globals, which means that we don't need to invalidate the integer register set across said calls. Backports commit 7385aed20db5d83979f683b9d0048674411e963c from qemu	2018-02-25 18:55:07 -05:00
Richard Henderson	2e24c09db3	target-sparc: Remove helper_ldf_asi, helper_stf_asi We've now implemented all fp asis inline, except for the no-fault memory reads. The latter can be passed directly to helper_ld_asi. Backports commit f2fe396f0fae6b389169f65abf294df9ae6cfee5 from qemu	2018-02-25 18:32:35 -05:00
Richard Henderson	a921273a6c	target-sparc: Directly implement block and short ldf/stf asis Backports commit ca5ce5723fb1ee3445f690004f63c209c15fb813 from qemu	2018-02-25 18:27:52 -05:00
Richard Henderson	333d88c9e6	target-sparc: Directly implement easy ldf/stf asis Backports commit 7705091ca4a20c8c2d20e2af5d0a1bcb17296657 from qemu	2018-02-25 18:23:45 -05:00
Richard Henderson	9d47cda44c	target-sparc: Pass TCGMemOp constants to helper_ld/st_asi Reduces the argument count for helper_ld_asi; do helper_st_asi for consistency. Backports commit 6850811e7c56403b0d225a1bffd096abf2ff06f9 from qemu	2018-02-25 18:19:42 -05:00
Richard Henderson	950aa89c7a	target-sparc: Fix obvious error in ASI_M_BFILL Backports commit c095b83f9836cef80f64b32603fea240762a824b from qemu	2018-02-25 18:08:40 -05:00
Richard Henderson	eb285aa281	target-sparc: Directly implement easy ldd/std asis Backports commit e4dc0052a40d3e7b00ca0b008f345e2ed644aa20 from qemu	2018-02-25 18:07:51 -05:00
Richard Henderson	1ed7df7720	target-sparc: Introduce gen_check_align Backports commit 35e94905ce4b39b358a673995f9bee11f46ec8be from qemu	2018-02-25 17:59:47 -05:00
Richard Henderson	cef4ae5ca8	target-sparc: Use QT0 to return results from ldda Also implement a few more twinx asis. Backports commit 3f4288ebf6fca7b266fa42a74d9d99b961ba6844 from qemu	2018-02-25 17:56:08 -05:00
Richard Henderson	9e402493a9	target-sparc: Directly implement easy ld/st asis Backports commit f0913be04be13cfb4f9341ae79e035fc8479fd28 from qemu	2018-02-25 17:49:16 -05:00
Richard Henderson	e2d0ee1286	target-sparc: Use defines from asi.h Backports commit 0cc1f4bf76a20c7fee0bab5c9bba9ad7302198b5 from qemu	2018-02-25 17:44:36 -05:00
Richard Henderson	bd3b7a2537	target-sparc: Add UA2005 defines to asi.h Backports commit 1d854963ea340855efe3f8a5b99c95a75bd717ae from qemu	2018-02-25 17:32:46 -05:00
Richard Henderson	b9a65e0e79	target-sparc: Import linux/arch/sparc/include/uapi/asm/asi.h Copied from tag v4.2, 64291f7db5bd8150a74ad2036f1037e6a0428df2. Backports commit 68a03b8c8853c66724c6f200af3f821ae0d7e934 from qemu	2018-02-25 17:29:51 -05:00
Richard Henderson	c509a5562d	target-sparc: Pass TCGMemOp to gen_ld/st_asi Backports commit 1d65b0f5bb8f32500bbce09d922d226bb7cf4c68 from qemu	2018-02-25 17:26:34 -05:00
Richard Henderson	4bc53f223c	target-sparc: Introduce get_asi Replace gen_get_asi, and use it for both 32-bit and 64-bit. For v8, do supervisor and immediate checks here. Also, move save_state and TB ending into the respective subroutines, out of disas_sparc_insn. Backports commit 7ec1e5ea4bd0700fa48da86bffa2fcc6146c410a from qemu	2018-02-25 17:23:20 -05:00
Richard Henderson	1dcd14d434	target-sparc: Store %asi in TB flags Knowing the value of %asi at translation time means that we can handle the common settings without a function call. The steady state appears to be %asi == ASI_P, so that sparcv9 code can use offset forms of lda/sta. The %asi register gets pushed and popped on entry to certain functions, but it rarely takes on values other than ASI_P or ASI_AIUP. Therefore we're unlikely to be expanding the set of TBs created. Backports commit a6d567e523ed7e928861f3caa5d49368af3f330d from qemu	2018-02-25 05:17:21 -05:00
Richard Henderson	080281bc9c	target-sparc: Unify asi handling between 32 and 64-bit We now have a single copy of gen_ld_asi, gen_st_asi, gen_swap_asi, and everything uses gen_get_asi. Backports commit 22e700607aeaff5f5e139d0fdc3d861e5502040c from qemu	2018-02-25 05:11:52 -05:00
Richard Henderson	847d65258b	target-sparc: Create gen_exception This unifies quite a few duplicate code fragments. Backports commit 4fbe00679000f9fd0c509c2d548d957b08ec6057 from qemu	2018-02-25 04:55:16 -05:00
Richard Henderson	39d1657fc3	target-sparc: Store mmu index in TB flags Doing this instead of saving the raw PS_PRIV and TL. This means that all nucleus mode TBs (TL > 0) can be shared. This fixes a bug in that we didn't include HS_PRIV in the TB flags, and so could produce incorrect TB matches for hypervisor state. The LSU and DMMU states were unused by the translator. Including them in TB flags meant unnecessary mismatches from tb_find_fast. Backports commit 99a230638a3674e921224dbe628159c867d734b1 from qemu	2018-02-25 04:51:50 -05:00
Richard Henderson	395e00cdc5	target-sparc: Remove softint as a TCG global The global is only ever read for one insn; we can just as well use a load from env instead and generate the same code. This also allows us to indicate the the associated helpers do not touch TCG globals. Backports commit e86ceb0d652baa5738e05a59ee0e7989dafbeaa1 from qemu	2018-02-25 04:49:27 -05:00
Richard Henderson	dcd1d6f8ce	target-sparc: Mark more flags for helpers Quite a few helpers do not modify tcg globals but did not so indicate. Backports commit be72f9fcca742c5e9a949f5eac901ed6cc26a2a0 from qemu	2018-02-25 04:28:54 -05:00
Markus Armbruster	c2ffbc575d	Clean up decorations and whitespace around header guards Cleaned up with scripts/clean-header-guards.pl. Backports commit 175de52487ce0b0c78daa4cdf41a5a465a168a25 from qemu	2018-02-25 04:26:02 -05:00
Markus Armbruster	1275b9b459	Clean up ill-advised or unusual header guards Cleaned up with scripts/clean-header-guards.pl. Backports commit 2a6a4076e117113ebec97b1821071afccfdfbc96 from qemu	2018-02-25 04:22:46 -05:00
Markus Armbruster	9ae2fc4d9e	Clean up header guards that don't match their file name Header guard symbols should match their file name to make guard collisions less likely. Offenders found with scripts/clean-header-guards.pl -vn. Cleaned up with scripts/clean-header-guards.pl, followed by some renaming of new guard symbols picked by the script to better ones. Backports commit 121d07125bb6d7079c7ebafdd3efe8c3a01cc440 from qemu	2018-02-25 04:18:42 -05:00
Markus Armbruster	25ec9ab016	tcg: Clean up tcg-target.h header guards These use guard symbols like TCG_TARGET_$target. scripts/clean-header-guards.pl doesn't like them because they don't match their file name (they should, to make guard collisions less likely). Clean them up: use guard symbol $target_TCG_TARGET_H for tcg/$target/tcg-target.h. Backports commit 14e54f8ecfe9c5e17348f456781344737ed10b3b from qemu	2018-02-25 04:15:08 -05:00
Markus Armbruster	2b65f98538	target-*: Clean up cpu.h header guards Most of them use guard symbols like CPU_$target_H, but we also have __MIPS_CPU_H__ and __TRICORE_CPU_H__. They all upset scripts/clean-header-guards.pl. The script dislikes CPU_$target_H because they don't match their file name (they should, to make guard collisions less likely). The others are reserved identifiers. Clean them all up: use guard symbol $target_CPU_H for target-$target/cpu.h. Backports commit 07f5a258750b3b9a6e10fd5ec3e29c9a943b650e from qemu	2018-02-25 04:12:46 -05:00
Markus Armbruster	60e8836b74	Use #include "..." for our own headers, <...> for others Tracked down with an ugly, brittle and probably buggy Perl script. Also move includes converted to <...> up so they get included before ours where that's obviously okay. Backports commit a9c94277f07d19d3eb14f199c3e93491aa3eae0e from qemu	2018-02-25 04:10:33 -05:00
Peter Maydell	f6f843b4d4	bswap.h: Document cpu_to_* and *_to_cpu conversion functions Add a documentation comment describing the functions for converting between the cpu and little or bigendian formats. Backports commit 7d820b766a2049f33ca7e078aa51018f2335f8c5 from qemu	2018-02-25 04:06:28 -05:00
Peter Maydell	1d7f813942	bswap.h: Remove unused cpu_to_w() and _to_cpup() Now that all uses of cpu_to_w() and _to_cpup() have been replaced with either ld_p()/st_p() or by doing direct dereferences and using the cpu_to_()/_to_cpu() byteswap functions, we can remove the unused implementations. Backports commit f76bde702916d0230bf359d478bcac8d7f3b30ae from qemu	2018-02-25 04:04:46 -05:00
Sergey Sorokin	d1e4ac0451	Fix confusing argument names in some common functions There are functions tlb_fill(), cpu_unaligned_access() and do_unaligned_access() that are called with access type and mmu index arguments. But these arguments are named 'is_write' and 'is_user' in their declarations. The patches fix the arguments to avoid a confusion. Backports commit b35399bb4e9968296a12303b00f9f2066470e987 from qemu	2018-02-25 03:58:27 -05:00
Leon Alrae	a465707a47	target-mips: enable 10-bit ASIDs in I6400 CPU Backports commit cdc46fab07a122dfcc8a1054510a68d936ae3440 from qemu	2018-02-25 03:50:58 -05:00
Paul Burton	002b392a15	target-mips: support CP0.Config4.AE bit The read-only Config4.AE bit set denotes extended 10 bits ASID. Backports commit a0c8060841f2d56fb3504292c18522b957972e4c from qemu	2018-02-25 03:49:36 -05:00
Paul Burton	ba4dcc8c2f	target-mips: change ASID type to hold more than 8 bits ASID currently has uint8_t type which is too small since some processors support more than 8 bits ASID. Therefore change its type to uint16_t. Backports commit 2d72e7b047d800c9f99262466f65a98684ecca14 from qemu	2018-02-25 03:48:10 -05:00
Paul Burton	ac27c881ff	target-mips: add ASID mask field and replace magic values Backports commit 6ec98bd7b64ad75870c8e9d87a90fcd1a64b4942 from qemu	2018-02-25 03:44:26 -05:00
Leon Alrae	7e589c117b	target-mips: replace MIPS64R6-generic with the real I6400 CPU model MIPS64R6-generic gradually gets closer to I6400 CPU, feature-wise. Rename it to make it clear which MIPS processor it is supposed to emulate. Backports commit 8f95ad1c79b4166350b982a6defe0e21faa04dac from qemu	2018-02-25 03:35:55 -05:00
Leon Alrae	c0b3938b88	target-mips: add exception base to MIPS CPU Replace hardcoded 0xbfc00000 with exception_base which is initialized with this default address so there is no functional change here. However, it is now exposed and consequently it will be possible to modify it from outside of the CPU. Backports commit 89777fd10fc3dd573c3b4d1b2efdd10af823c001 from qemu	2018-02-25 03:22:10 -05:00
Stanislav Shmarov	6f20d35cd1	translate-all: Fix user-mode self-modifying code in 2 page long TB In user-mode emulation Translation Block can consist of 2 guest pages. In that case QEMU also mprotects 2 host pages that are dedicated for guest memory, containing instructions. QEMU detects self-modifying code with SEGFAULT signal processing. In case if instruction in 1st page is modifying memory of 2nd page (or vice versa) QEMU will mark 2nd page with PAGE_WRITE, invalidate TB, generate new TB contatining 1 guest instruction and exit to CPU loop. QEMU won't call mprotect, and new TB will cause same SEGFAULT. Page will have both PAGE_WRITE_ORG and PAGE_WRITE flags, so QEMU will handle the signal as guest binary problem, and exit with guest SEGFAULT. Solution is to do following: In case if current TB was invalidated continue to invalidate TBs from remaining guest pages and mark pages as PAGE_WRITE. After that disable host page protection with mprotect. If current tb was invalidated longjmp to main loop. That is more efficient, since we won't get SEGFAULT when executing new TB. Backports commit 7399a337e4126f7c8c8af3336726f001378c4798 from qemu	2018-02-25 03:14:22 -05:00
Samuel Damashek	670d81367b	cputlb: Fix for self-modifying writes across page boundaries As it currently stands, QEMU does not properly handle self-modifying code when the write is unaligned and crosses a page boundary. The procedure for handling a write to the current translation block is to write-protect the current translation block, catch the write, split up the translation block into the current instruction (which remains write-protected so that the current instruction is not modified) and the remaining instructions in the translation block, and then restore the CPU state to before the write occurred so the write will be retried and successfully executed. However, since unaligned writes across pages are split into one-byte writes for simplicity, writes to the second page (which is not the current TB) may succeed before a write to the current TB is attempted, and since these writes are not invalidated before resuming state after splitting the TB, these writes will be performed a second time, thus corrupting the second page. Credit goes to Patrick Hulin for discovering this. In recent 64-bit versions of Windows running in emulated mode, this results in either being very unstable (a BSOD after a couple minutes of uptime), or being entirely unable to boot. Windows performs one or more 8-byte unaligned self-modifying writes (xors) which intersect the end of the current TB and the beginning of the next TB, which runs into the aforementioned issue. This commit fixes that issue by making the unaligned write loop perform the writes in forwards order, instead of reverse order. This way, QEMU immediately tries to write to the current TB, and splits the TB before any write to the second page is executed. The write then proceeds as intended. With this patch applied, I am able to boot and use Windows 7 64-bit and Windows 10 64-bit in QEMU without KVM. Per Richard Henderson's input, this patch also ensures the second page is in the TLB before executing the write loop, to ensure the second page is mapped. The original discussion of the issue is located at http://lists.nongnu.org/archive/html/qemu-devel/2014-08/msg02161.html. Backports commit 81daabaf7a572f138a8b88ba6eea556bdb0cce46 from qemu	2018-02-25 03:12:11 -05:00
Samuel Damashek	04c423b081	cputlb: Add address parameter to VICTIM_TLB_HIT Backports commit a390284b80d2b6581143cdb40666674e60e635ae from qemu	2018-02-25 03:03:36 -05:00
Richard Henderson	9e2422032a	cputlb: Move VICTIM_TLB_HIT out of line There are currently 22 invocations of this function, and we're about to increase that number. Backports commit 7e9a7c50d9a400ef51242d661a261123c2cc9485 from qemu	2018-02-25 02:58:47 -05:00
Haozhong Zhang	2893a1c381	target-i386: Publish advised value of MSR_IA32_FEATURE_CONTROL via fw_cfg It's a prerequisite that certain bits of MSR_IA32_FEATURE_CONTROL should be set before some features (e.g. VMX and LMCE) can be used, which is usually done by the firmware. This patch adds a fw_cfg file "etc/msr_feature_control" which contains the advised value of MSR_IA32_FEATURE_CONTROL and can be used by guest firmware (e.g. SeaBIOS). Backports commit 217f1b4a72153cf8d556e9d45919e9222c38d25e from qemu	2018-02-25 02:49:42 -05:00
Ashok Raj	b58f1fccce	target-i386: kvm: Add basic Intel LMCE support This patch adds the support to inject SRAR and SRAO as LMCE, i.e. they are injected to only one VCPU rather than broadcast to all VCPUs. As KVM reports LMCE support on Intel platforms, this features is only available on Intel platforms. LMCE is disabled by default and can be enabled/disabled by cpu option 'lmce=on/off'. Backports commit 87f8b626041ceaea9adcfdbd549359f0ca7b871d from qemu	2018-02-25 02:48:22 -05:00
Evgeny Yakovlev	49fdd75329	target-i386: Report hyperv feature words through qom This change adds hyperv feature words report through qom rpc. When VM is configured with hyperv features enabled libvirt will check that required feature words are set in cpuid leaf 40000003 through qom request. Currently qemu does not report hyperv feature words which prevents windows guests from starting with libvirt. To avoid conflicting with current hyperv properties all added feature words cannot be set directly with -cpu +feature yet. Backports commit c35bd19a5c9140bce8b913cc5cefe6f071135bdb from qemu	2018-02-25 02:45:20 -05:00
Paolo Bonzini	f39cc9e3b9	target-i386: Avoid using locals outside their scope x86_cpu_parse_featurestr has a "val = num;" assignment just before num goes out of scope. Push num up to fix the issue. Backports commit cf2887c9738451eb989c6c102af070dee2dc172a from qemu	2018-02-25 02:30:06 -05:00
Paolo Bonzini	1be92ac243	target-i386: TCG can support CPUID.07H:EBX.erms ERMS just says "rep movsb" and "rep stosb" are fast. It does not imply any new instruction, so we can support it easily. Backports commit 7eb24386dbfb0b66464c7f856c1074c606efccda from qemu	2018-02-25 02:29:00 -05:00
Igor Mammedov	78f9128dbb	target-sparc: Use sparc_cpu_parse_features() directly Make SPARC target use sparc_cpu_parse_features() directly so it won't get in the way of switching other propertified targets to handling features as global properties. Backports commit fb02d56e96d553088c5b4267a3c954a3e952a50a from qemu	2018-02-25 02:27:23 -05:00
Sergey Sorokin	e4d123caa9	tcg: Improve the alignment check infrastructure Some architectures (e.g. ARMv8) need the address which is aligned to a size more than the size of the memory access. To support such check it's enough the current costless alignment check implementation in QEMU, but we need to support an alignment size specifying. Backports commit 1f00b27f17518a1bcb4cedca49eaec96a4d560bd from qemu	2018-02-25 02:23:28 -05:00
Richard Henderson	23586e2674	tcg: Optimize spills of constants While we can store constants via constrants on INDEX_op_st_i32 et al, we weren't able to spill constants to backing store. Add a new backend interface, tcg_out_sti, which may store the constant (and is allowed to fail). Rearrange the temp_* helpers so that we only attempt to directly store a constant when the temp is becoming dead/free. Backports commit 59d7c14eeff8d2ad7f61aed86ce5a176113bc153 from qemu	2018-02-25 01:45:29 -05:00
Richard Henderson	64fda683b1	tcg: Fix name for high-half register	2018-02-25 01:36:35 -05:00
Lioncash	532f840dc3	qapi: Add new clone visitor We have a couple places in the code base that want to deep-clone one QAPI object into another, and they were resorting to serializing the struct out to QObject then reparsing it. A much more efficient version can be done by adding a new clone visitor. Since cloning is still relatively uncommon, expose the use of the new visitor via a QAPI_CLONE() macro that takes care of type-punning the underlying function pointer, rather than generating lots of unused functions for types that won't be cloned. And yes, we're relying on the compiler treating all pointers equally, even though a strict C program cannot portably do so - but we're not the first one in the qemu code base to expect it to work (hello, glib!). The choice of adding a fourth visitor type deserves some explanation. On the surface, the clone visitor is mostly an input visitor (it takes arbitrary input - in this case, another QAPI object - and creates a new QAPI object during the course of the visit). But ever since commit da72ab0 consolidated enum visits based on the visitor type, using VISITOR_INPUT would cause us to run visit_type_str(), even though for cloning there is nothing to do (we just copy the enum value across, without regards to its mapping to strings). Also, since our input happens to be a QAPI object, we can also satisfy the internal checks for VISITOR_OUTPUT. So in the end, I settled with a new VISITOR_CLONE, and chose its value such that many internal checks can use 'v->type & mask', sticking to 'v->type == value' where the difference matters. Note that we can only clone objects (including alternates) and lists, not built-ins or enums. The visitor core hides integer width from the actual visitor (since commit 04e070d), and as long as that's the case, we can't clone top-level integers. Then again, those can always be cloned by direct copy, since they are not objects with deep pointers, so it's no real loss. And restricting cloning to just objects and lists is cleaner than restricting it to non-integers. As such, I documented that the clone visitor is for direct use only by code internal to QAPI, and should not be used on incomplete objects (other than a hack to work around the fact that we allow NULL in place of "" in visit_type_str() in other output visitors). Note that as written, the clone visitor will never fail on a complete object. Scalars (including enums) not at the root of the clone copy just fine with no additional effort while visiting the scalar, by virtue of a g_memdup() each time we push another struct onto the stack. Cloning a string requires deduplication of a pointer, which means it can also provide the guarantee of an input visitor of never producing NULL even when still accepting NULL in place of "" the way the QMP output visitor does. Cloning an 'any' type could be possible by incrementing the QObject refcnt, but it's not obvious whether that is better than implementing a QObject deep clone. So for now, we document it as unsupported, and intentionally omit the .type_any() callback to let a developer know their usage needs implementation. Add testsuite coverage for several different clone situations, to ensure that the code is working. I also tested that valgrind was happy with the test. Backports commit a15fcc3cf69ee3d408f60d6cc316488d2b0249b4 from qemu	2018-02-25 01:34:12 -05:00
Eric Blake	85af4b2030	qapi: Add new visit_complete() function Making each output visitor provide its own output collection function was the only remaining reason for exposing visitor sub-types to the rest of the code base. Add a polymorphic visit_complete() function which is a no-op for input visitors, and which populates an opaque pointer for output visitors. For maximum type-safety, also add a parameter to the output visitor constructors with a type-correct version of the output pointer, and assert that the two uses match. This approach was considered superior to either passing the output parameter only during construction (action at a distance during visit_free() feels awkward) or only during visit_complete() (defeating type safety makes it easier to use incorrectly). Most callers were function-local, and therefore a mechanical conversion; the testsuite was a bit trickier, but the previous cleanup patch minimized the churn here. The visit_complete() function may be called at most once; doing so lets us use transfer semantics rather than duplication or ref-count semantics to get the just-built output back to the caller, even though it means our behavior is not idempotent. Generated code is simplified as follows for events: \|@@ -26,7 +26,7 @@ void qapi_event_send_acpi_device_ost(ACP \| QDict qmp; \| Error err = NULL; \| QMPEventFuncEmit emit; \|- QmpOutputVisitor qov; \|+ QObject obj; \| Visitor v; \| q_obj_ACPI_DEVICE_OST_arg param = { \| info \|@@ -39,8 +39,7 @@ void qapi_event_send_acpi_device_ost(ACP \| \| qmp = qmp_event_build_dict("ACPI_DEVICE_OST"); \| \|- qov = qmp_output_visitor_new(); \|- v = qmp_output_get_visitor(qov); \|+ v = qmp_output_visitor_new(&obj); \| \| visit_start_struct(v, "ACPI_DEVICE_OST", NULL, 0, &err); \| if (err) { \|@@ -55,7 +54,8 @@ void qapi_event_send_acpi_device_ost(ACP \| goto out; \| } \| \|- qdict_put_obj(qmp, "data", qmp_output_get_qobject(qov)); \|+ visit_complete(v, &obj); \|+ qdict_put_obj(qmp, "data", obj); \| emit(QAPI_EVENT_ACPI_DEVICE_OST, qmp, &err); and for commands: \| { \| Error err = NULL; \|- QmpOutputVisitor qov = qmp_output_visitor_new(); \| Visitor v; \| \|- v = qmp_output_get_visitor(qov); \|+ v = qmp_output_visitor_new(ret_out); \| visit_type_AddfdInfo(v, "unused", &ret_in, &err); \|- if (err) { \|- goto out; \|+ if (!err) { \|+ visit_complete(v, ret_out); \| } \|- *ret_out = qmp_output_get_qobject(qov); \|- \|-out: \| error_propagate(errp, err); Backports commit 3b098d56979d2f7fd707c5be85555d114353a28d from qemu	2018-02-25 01:20:03 -05:00
Eric Blake	ec53301cda	qmp-output-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need qmp_output_visitor_cleanup(); however, we still need to expose the subtype for qmp_output_get_qobject(). Backports commit 1830f22a6777cedaccd67a08f675d30f7a85ebfd from qemu	2018-02-25 01:12:27 -05:00
Eric Blake	f008d93ac0	qmp-input-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need qmp_input_visitor_cleanup(); which in turn means we no longer need to return a subtype from qmp_input_visitor_new() nor a public upcast function. Generated code changes to qmp-marshal.c look like: \|@@ -52,11 +52,10 @@ void qmp_marshal_add_fd(QDict args, QOb \| { \| Error err = NULL; \| AddfdInfo retval; \|- QmpInputVisitor qiv = qmp_input_visitor_new(QOBJECT(args), true); \| Visitor *v; \| q_obj_add_fd_arg arg = {0}; \| \|- v = qmp_input_get_visitor(qiv); \|+ v = qmp_input_visitor_new(QOBJECT(args), true); \| visit_start_struct(v, NULL, NULL, 0, &err); \| if (err) { \| goto out; Backports commit b70ce1018a251c0c33498d9c927a07cade655a5e from qemu	2018-02-25 01:10:53 -05:00
Eric Blake	e88a7e260b	string-input-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need string_input_visitor_cleanup(); which in turn means we no longer need to return a subtype from string_input_visitor_new() nor a public upcast function. Backports commit 7a0525c7be6b38d32d586e3fd12e7377ded21faa from qemu	2018-02-25 01:08:04 -05:00
Eric Blake	7f741a6c9b	qapi: Add new visit_free() function Making each visitor provide its own (awkwardly-named) FOO_cleanup() is unusual, when we can instead have a polymorphic visit_free() interface. Over the next few patches, we can use the polymorphic functions to eliminate the need for a FOO_get_visitor() function for accessing specific visitor functionality, once everything can be accessed directly through the Visitor* interfaces. The dealloc visitor is the first one converted to completely use the new entry point, since qapi_dealloc_visitor_cleanup() was the only reason that qapi_dealloc_get_visitor() existed, and only generated and testsuite code was even using it. With the new visit_free() entry point in place, we no longer need to expose the QapiDeallocVisitor subtype through qapi_dealloc_visitor_new(), and can get by with less generated code, with diffs that look like: \| void qapi_free_ACPIOSTInfo(ACPIOSTInfo obj) \| { \|- QapiDeallocVisitor qdv; \| Visitor *v; \| \| if (!obj) { \| return; \| } \| \|- qdv = qapi_dealloc_visitor_new(); \|- v = qapi_dealloc_get_visitor(qdv); \|+ v = qapi_dealloc_visitor_new(); \| visit_type_ACPIOSTInfo(v, NULL, &obj, NULL); \|- qapi_dealloc_visitor_cleanup(qdv); \|+ visit_free(v); \|} Backports commit 2c0ef9f411ae6081efa9eca5b3eab2dbeee45a6c from qemu	2018-02-25 01:05:41 -05:00
Eric Blake	37ae4dfdfd	qapi: Add parameter to visit_end_* Rather than making the dealloc visitor track of stack of pointers remembered during visit_start_* in order to free them during visit_end_, it's a lot easier to just make all callers pass the same pointer to visit_end_. The generated code has access to the same pointer, while all other users are doing virtual walks and can pass NULL. The dealloc visitor is then greatly simplified. All three visit_end_() functions intentionally take a void, even though the visit_start_() functions differ between void, GenericList, and GenericAlternate*. This is done for several reasons: when doing a virtual walk, passing NULL doesn't care what the type is, but when doing a generated walk, we already have to cast the caller's specific FOO to call visit_start, while using void** lets us use visit_end without a cast. Also, an upcoming patch will add a clone visitor that wants to use the same implementation for all three visit_end callbacks, which is made easier if all three share the same signature. For visitors with already track per-object state (the QMP visitors via a stack, and the string visitors which do not allow nesting), add an assertion that the caller is indeed passing the same pointer to paired calls. Backports commit 1158bb2a058fcdd0c8fc3e60dc77f7a57ddbb271 from qemu	2018-02-25 00:57:54 -05:00
Changlong Xie	2ca07642f1	qom: Fix comment typo It's qom_unref, not qdef_unref. Backports commit ada03a0e8423ef8950e30d216f56a9661a4070e2 from qemu	2018-02-25 00:46:15 -05:00
Markus Armbruster	eeef227560	range: Replace internal representation of Range Range represents a range as follows. Member @start is the inclusive lower bound, member @end is the exclusive upper bound. Zero @end is special: if @start is also zero, the range is empty, else @end is to be interpreted as 2^64. No other empty ranges may occur. The range [0,2^64-1] cannot be represented. If you try to create it with range_set_bounds1(), you get the empty range instead. If you try to create it with range_set_bounds() or range_extend(), assertions fail. Before range_set_bounds() existed, the open-coded creation usually got you the empty range instead. Open deathtrap. Moreover, the code dealing with the janus-faced @end is too clever by half. Dumb this down to a more pedestrian representation: members @lob and @upb are inclusive lower and upper bounds. The empty range is encoded as @lob = 1, @upb = 0. Backports commit 6dd726a2bf1b800289d90a84d5fcb5ce7b78a8e1 from qemu	2018-02-25 00:44:36 -05:00
Markus Armbruster	8b2a0c4ece	range: Eliminate direct Range member access Users of struct Range mess liberally with its members, which makes refactoring hard. Create a set of methods, and convert all users to call them instead of accessing members. The methods have carefully worded contracts, and use assertions to check them. Backports commit a0efbf16604770b9d805bcf210ec29942321134f from qemu	2018-02-25 00:39:43 -05:00
Alistair Francis	fbb0645fb3	bitops: Add MAKE_64BIT_MASK macro Add a macro that creates a 64bit value which has length number of ones shifted across by the value of shift. Backports commit ae2923b5c20a21c6457680330506a9c13873485c from qemu	2018-02-25 00:30:39 -05:00
Peter Maydell	efc6cc2b83	memory: Assert that memory_region_init_rom_device() ops aren't NULL It doesn't make sense to pass a NULL ops argument to memory_region_init_rom_device(), because the effect will be that if the guest tries to write to the memory region then QEMU will segfault. Catch the bug earlier by sanity checking the arguments to this function, and remove the misleading documentation that suggests that passing NULL might be sensible. Backports commit 39e0b03dec518254fabd2acff29548d3f1d2b754 from qemu	2018-02-25 00:29:52 -05:00
Peter Maydell	334e951ec1	memory: Provide memory_region_init_rom() Provide a new helper function memory_region_init_rom() for memory regions which are read-only (and unlike those created by memory_region_init_rom_device() don't have special behaviour for writes). This has the same behaviour as calling memory_region_init_ram() and then memory_region_set_readonly() (which is what we do today in boards with pure ROMs) but is a more easily discoverable API for the purpose. Backports commit a1777f7f6462c66e1ee6e98f0d5c431bfe988aa5 from qemu	2018-02-25 00:28:17 -05:00
Alexey Kardashevskiy	7187d77cfa	memory: Add MemoryRegionIOMMUOps.notify_started/stopped callbacks The IOMMU driver may change behavior depending on whether a notifier client is present. In the case of POWER, this represents a change in the visibility of the IOTLB, for other drivers such as intel-iommu and future AMD-Vi emulation, notifier support is not yet enabled and this provides the opportunity to flag that incompatibility. Backports commit d22d8956b185c002b50a4d0883aff61f857347ef from qemu	2018-02-25 00:23:00 -05:00
Eric Blake	c14d8226ab	qapi: Fix memleak in string visitors on int lists Commit 7f8f9ef1 introduced the ability to store a list of integers as a sorted list of ranges, but when merging ranges, it leaks one or more ranges. It was also using range_get_last() incorrectly within range_compare() (a range is a start/end pair, but range_get_last() is for start/len pairs), and will also mishandle a range ending in UINT64_MAX (remember, we document that no range covers 2*64 bytes, but that ranges that end on UINT64_MAX have end < begin). The whole merge algorithm was rather complex, and included unnecessary passes over data within glib functions, and enough indirection to make it hard to easily plug the data leaks. Since we are already hard-coding things to a list of ranges, just rewrite the thing to open-code the traversal and comparisons, by making the range_compare() helper function give us an answer that is easier to use, at which point we avoid the need to pass any callbacks to g_list_(). Then by reusing range_extend() instead of duplicating effort with range_merge(), we cover the corner cases correctly. Drop the now-unused range_merge() and ranges_can_merge(). Doing this lets test-string-{input,output}-visitor pass under valgrind without leaks. Backports commit db486cc334aafd3dbdaf107388e37fc3d6d3e171 from qemu	2018-02-25 00:20:34 -05:00
Eric Blake	ef357d06bc	qapi: Simplify use of range.h Calling our function g_list_insert_sorted_merged is a misnomer, since we are NOT writing a glib function. Furthermore, we are making every caller pass the same comparator function of range_merge(): any caller that would try otherwise would break in weird ways since our internal call to ranges_can_merge() is hard-coded to operate only on ranges, rather than paying attention to the caller's comparator. Better is to fix things so that callers don't have to care about our internal comparator, by picking a function name and updating the parameter type away from a gratuitous use of void*, to make it obvious that we are operating specifically on a list of ranges and not a generic list. Plus, refactoring the code here will make it easier to plug a memory leak in the next patch. range_compare() is now internal only, and moves to the .c file. Backports commit 7c47959d0cb05db43014141a156ada0b6d53a750 from qemu	2018-02-25 00:02:42 -05:00
Eric Blake	5e22c7e180	range: Create range.c for code that should not be inline g_list_insert_sorted_merged() is rather large to be an inline function; move it to its own file. range_merge() and ranges_can_merge() can likewise move, as they are only used internally. Also, it becomes obvious that the condition within range_merge() is already satisfied by its caller, and that the return value is not used. The diffstat is misleading, because of the copyright boilerplate. Backports commit fec0fc0a13ac7f1a1130433a6740cd850c3db34a from qemu	2018-02-24 23:59:13 -05:00
Eric Blake	ebeb0e46f8	qapi: Fix crash on missing alternate member of QAPI struct If a QAPI struct has a mandatory alternate member which is not present on input, the input visitor reports an error for the missing alternate without setting the discriminator, but the cleanup code for the struct still tries to use the dealloc visitor to clean up the alternate. Commit dbf11922 changed visit_start_alternate to set obj to NULL when an error occurs, where it was previously left untouched. Thus, before the patch, the dealloc visitor is blindly trying to cleanup whatever branch corresponds to (obj)->type == 0 (that is, QTYPE_NONE, because obj still pointed to zeroed memory), which selects the default branch of the switch and sets an error, but this second error is ignored by the way the dealloc visitor is used; but after the patch, the attempt to switch dereferences NULL. When cleaning up after a partial object parse, we specifically check for !obj after visit_start_struct() (see gen_visit_object()); doing the same for alternates fixes the crash. Enhance the testsuite to give coverage for both missing struct and missing alternate members. Also add an abort - we expect visit_start_alternate() to either set an error or to set (obj)->type to a valid QType that corresponds to actual user input, and QTYPE_NONE should never be reachable from valid input. Had the abort() been in place earlier, we might have noticed the dealloc visitor dereferencing bogus zeroed memory prior to when commit dbf11922 forced our hand by setting obj to NULL and causing a fault. Test case: {'execute':'blockdev-add', 'arguments':{'options':{'driver':'raw'}}} The choice of 'driver':'raw' selects a BlockdevOptionsGenericFormat struct, which has a mandatory 'file':'BlockdevRef' in QAPI. Since 'file' is missing as a sibling of 'driver', this should report a graceful error rather than fault. After this patch, we are back to: {"error": {"class": "GenericError", "desc": "Parameter 'file' is missing"}} Generated code in qapi-visit.c changes as: \|@@ -2444,6 +2444,9 @@ void visit_type_BlockdevRef(Visitor v, \| if (err) { \| goto out; \| } \|+ if (!obj) { \|+ goto out_obj; \|+ } \| switch ((obj)->type) { \| case QTYPE_QDICT: \| visit_start_struct(v, name, NULL, 0, &err); \|@@ -2459,10 +2462,13 @@ void visit_type_BlockdevRef(Visitor v, \| case QTYPE_QSTRING: \| visit_type_str(v, name, &(*obj)->u.reference, &err); \| break; \|+ case QTYPE_NONE: \|+ abort(); \| default: \| error_setg(&err, QERR_INVALID_PARAMETER_TYPE, name ? name : "null", \| "BlockdevRef"); \| } \|+out_obj: \| visit_end_alternate(v); Backports commit 9b4e38fe6a35890bb1d995316d7be08de0b30ee5 from qemu	2018-02-24 23:53:29 -05:00
Aleksandar Markovic	f95e0e9e98	target-mips: Add FCR31's FS bit definition Add preprocessor definition of FCR31's FS bit, and update related code for setting this bit. Backports commit 77be419980114d75605811e1681115d0919cfa1a from qemu	2018-02-24 21:32:10 -05:00
Aleksandar Markovic	4a540f88de	target-mips: Implement FCR31's R/W bitmask and related functionalities This patch implements read and write access rules for Mips floating point control and status register (FCR31). The change can be divided into following parts: - Add fields that will keep FCR31's R/W bitmask in procesor definitions and processor float_status structure. - Add appropriate value for FCR31's R/W bitmask for each supported processor. - Add function for setting snan_bit_is_one, and integrate it in appropriate places. - Modify handling of CTC1 (case 31) instruction to use FCR31's R/W bitmask. - Modify handling user mode executables for Mips, in relation to the bit EF_MIPS_NAN2008 from ELF header, that is in turn related to reading and writing to FCR31. - Modify gdb behavior in relation to FCR31. Backports commit 599bc5e89c46f95f86ccad0d747d041c89a28806 from qemu	2018-02-24 21:30:24 -05:00
Aleksandar Markovic	84b516d9db	target-mips: Add nan2008 flavor of <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D> New set of helpers for handling nan2008-syle versions of instructions <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D>, for Mips R6. All involved instructions have float operand and integer result. Their core functionality is implemented via invocations of appropriate SoftFloat functions. The problematic cases are when the operand is a NaN, and also when the operand (float) is out of the range of the result. Here one can distinguish three cases: CASE MIPS-A: (FCR31.NAN2008 == 1) 1. Operand is a NaN, result should be 0; 2. Operand is larger than INT_MAX, result should be INT_MAX; 3. Operand is smaller than INT_MIN, result should be INT_MIN. CASE MIPS-B: (FCR31.NAN2008 == 0) 1. Operand is a NaN, result should be INT_MAX; 2. Operand is larger than INT_MAX, result should be INT_MAX; 3. Operand is smaller than INT_MIN, result should be INT_MAX. CASE SoftFloat: 1. Operand is a NaN, result is INT_MAX; 2. Operand is larger than INT_MAX, result is INT_MAX; 3. Operand is smaller than INT_MIN, result is INT_MIN. Current implementation of <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D> implements case MIPS-B. This patch relates to case MIPS-A. For case MIPS-A, only return value for NaN-operands should be corrected after appropriate SoftFloat library function is called. Related MSA instructions FTRUNC_S and FTINT_S already handle well all cases, in the fashion similar to the code from this patch. Backports commit 87552089b62fa229d2ff86906e4e779177fb5835 from qemu	2018-02-24 21:14:04 -05:00
Aleksandar Markovic	a411a12170	target-mips: Add abs2008 flavor of <ABS\|NEG>.<S\|D> Updated handling of instructions <ABS\|NEG>.<S\|D>. Note that legacy (pre-abs2008) ABS and NEG instructions are arithmetic (and, therefore, any NaN operand causes signaling invalid operation), while abs2008 ones are non-arithmetic, always and only changing the sign bit, even for NaN-like operands. Details on these instructions are documented in [1] p. 35 and 359. Implementation-wise, abs2008 versions are implemented without helpers, for simplicity and performance sake. [1] "MIPS Architecture For Programmers Volume II-A: The MIPS64 Instruction Set Reference Manual", Imagination Technologies LTD, Revision 6.04, November 13, 2015 Backports commit 6be77480052b1a71557081896e7080363a8a2f95 from qemu	2018-02-24 20:45:06 -05:00
Aleksandar Markovic	ef9f33a345	target-mips: Activate IEEE 754-2008 signaling NaN bit meaning for MSA Function msa_reset() is updated so that flag snan_bit_is_one is properly set to 0. By applying this patch, a number of incorrect MSA behaviors that require IEEE 754-2008 compliance will be fixed. Those are behaviors that (up to the moment of applying this patch) did not get the desired functionality from SoftFloat library with respect to distinguishing between quiet and signaling NaN, getting default NaN values (both quiet and signaling), establishing if a floating point number is NaN or not, etc. Two examples: * FMAX, FMIN will now correctly detect and propagate NaNs. * FCLASS.D ans FCLASS.S will now correcty detect NaN flavors Backports commit 40bd6dd456e61a36e454fb9dd2cc739b67c224cf from qemu	2018-02-24 20:41:48 -05:00
Aleksandar Markovic	3e9325f1e9	softfloat: Handle snan_bit_is_one == 0 in MIPS pickNaNMulAdd() Only for Mips platform, and only for cases when snan_bit_is_one is 0, correct the order of argument comparisons in pickNaNMulAdd(). For more info, see [1], page 53, section "3.5.3 NaN Propagation". [1] "MIPS Architecture for Programmers Volume IV-j: The MIPS32 SIMD Architecture Module", Imagination Technologies LTD, Revision 1.12, February 3, 2016 Backports commit c27644f0e9659471e1c9355da5b667960d311937 from qemu	2018-02-24 20:40:11 -05:00
Aleksandar Markovic	33833b6605	softfloat: For Mips only, correct default NaN values Only for Mips platform, and only for cases when snan_bit_is_one is 0, correct default NaN values (in their 16-, 32-, and 64-bit flavors). For more info, see [1], page 84, Table 6.3 "Value Supplied When a New Quiet NaN Is Created", and [2], page 52, Table 3.7 "Default NaN Encodings". [1] "MIPS Architecture For Programmers Volume II-A: The MIPS64 Instruction Set Reference Manual", Imagination Technologies LTD, Revision 6.04, November 13, 2015 [2] "MIPS Architecture for Programmers Volume IV-j: The MIPS32 SIMD Architecture Module", Imagination Technologies LTD, Revision 1.12, February 3, 2016 Backports commit a7c04d545a97126c9df9d96623747d8613aaf7db from qemu	2018-02-24 20:38:23 -05:00
Aleksandar Markovic	33ee9429b2	softfloat: Clean code format in fpu/softfloat-specialize.h fpu/softfloat-specialize.h is the most critical file in SoftFloat library, since it handles numerous differences between platforms in relation to floating point arithmetics. This patch makes the code in this file more consistent format-wise, and hopefully easier to debug and maintain. Backports commit a59eaea64686c8966b7653303660f8c26f285c77 from qemu	2018-02-24 20:35:05 -05:00
Aleksandar Markovic	6eb4fa54f6	softfloat: Implement run-time-configurable meaning of signaling NaN bit This patch modifies SoftFloat library so that it can be configured in run-time in relation to the meaning of signaling NaN bit, while, at the same time, strictly preserving its behavior on all existing platforms. Background: In floating-point calculations, there is a need for denoting undefined or unrepresentable values. This is achieved by defining certain floating-point numerical values to be NaNs (which stands for "not a number"). For additional reasons, virtually all modern floating-point unit implementations use two kinds of NaNs: quiet and signaling. The binary representations of these two kinds of NaNs, as a rule, differ only in one bit (that bit is, traditionally, the first bit of mantissa). Up to 2008, standards for floating-point did not specify all details about binary representation of NaNs. More specifically, the meaning of the bit that is used for distinguishing between signaling and quiet NaNs was not strictly prescribed. (IEEE 754-2008 was the first floating-point standard that defined that meaning clearly, see [1], p. 35) As a result, different platforms took different approaches, and that presented considerable challenge for multi-platform emulators like QEMU. Mips platform represents the most complex case among QEMU-supported platforms regarding signaling NaN bit. Up to the Release 6 of Mips architecture, "1" in signaling NaN bit denoted signaling NaN, which is opposite to IEEE 754-2008 standard. From Release 6 on, Mips architecture adopted IEEE standard prescription, and "0" denotes signaling NaN. On top of that, Mips architecture for SIMD (also known as MSA, or vector instructions) also specifies signaling bit in accordance to IEEE standard. MSA unit can be implemented with both pre-Release 6 and Release 6 main processor units. QEMU uses SoftFloat library to implement various floating-point-related instructions on all platforms. The current QEMU implementation allows for defining meaning of signaling NaN bit during build time, and is implemented via preprocessor macro called SNAN_BIT_IS_ONE. On the other hand, the change in this patch enables SoftFloat library to be configured in run-time. This configuration is meant to occur during CPU initialization, at the moment when it is definitely known what desired behavior for particular CPU (or any additional FPUs) is. The change is implemented so that it is consistent with existing implementation of similar cases. This means that structure float_status is used for passing the information about desired signaling NaN bit on each invocation of SoftFloat functions. The additional field in float_status is called snan_bit_is_one, which supersedes macro SNAN_BIT_IS_ONE. IMPORTANT: This change is not meant to create any change in emulator behavior or functionality on any platform. It just provides the means for SoftFloat library to be used in a more flexible way - in other words, it will just prepare SoftFloat library for usage related to Mips platform and its specifics regarding signaling bit meaning, which is done in some of subsequent patches from this series. Further break down of changes: 1) Added field snan_bit_is_one to the structure float_status, and correspondent setter function set_snan_bit_is_one(). 2) Constants <float16\|float32\|float64\|floatx80\|float128>_default_nan (used both internally and externally) converted to functions <float16\|float32\|float64\|floatx80\|float128>_default_nan(float_status). This is necessary since they are dependent on signaling bit meaning. At the same time, for the sake of code cleanup and simplicity, constants <floatx80\|float128>_default_nan_<low\|high> (used only internally within SoftFloat library) are removed, as not needed. 3) Added a float_status argument to SoftFloat library functions XXX_is_quiet_nan(XXX a_), XXX_is_signaling_nan(XXX a_), XXX_maybe_silence_nan(XXX a_). This argument must be present in order to enable correct invocation of new version of functions XXX_default_nan(). (XXX is <float16\|float32\|float64\|floatx80\|float128> here) 4) Updated code for all platforms to reflect changes in SoftFloat library. This change is twofolds: it includes modifications of SoftFloat library functions invocations, and an addition of invocation of function set_snan_bit_is_one() during CPU initialization, with arguments that are appropriate for each particular platform. It was established that all platforms zero their main CPU data structures, so snan_bit_is_one(0) in appropriate places is not added, as it is not needed. [1] "IEEE Standard for Floating-Point Arithmetic", IEEE Computer Society, August 29, 2008. Backports commit af39bc8c49224771ec0d38f1b693ea78e221d7bc from qemu	2018-02-24 20:27:12 -05:00
Alexey Kardashevskiy	096ca207af	memory: Add reporting of supported page sizes Every IOMMU has some granularity which MemoryRegionIOMMUOps::translate uses when translating, however this information is not available outside the translate context for various checks. This adds a get_min_page_size callback to MemoryRegionIOMMUOps and a wrapper for it so IOMMU users (such as VFIO) can know the minimum actual page size supported by an IOMMU. As IOMMU MR represents a guest IOMMU, this uses TARGET_PAGE_SIZE as fallback. This removes vfio_container_granularity() and uses new helper in memory_region_iommu_replay() when replaying IOMMU mappings on added IOMMU memory region. Backports the relevant parts of commit f682e9c244af7166225f4a50cc18ff296bb9d43e from qemu	2018-02-24 19:23:28 -05:00
Lluís Vilanova	2297527755	exec: [tcg] Track which vCPU is performing translation and execution Information is tracked inside the TCGContext structure, and later used by tracing events with the 'tcg' and 'vcpu' properties. The 'cpu' field is used to check tracing of translation-time events ("_trans"). The 'tcg_env' field is used to pass it to execution-time events ("_exec"). Backports commit 7c2550432abe62f53e6df878ceba6ceaf71f0e7e from qemu	2018-02-24 19:21:39 -05:00
Eduardo Habkost	0f6513ef62	error: Remove unnecessary local_err variables This patch simplifies code that uses a local_err variable just to immediately use it for an error_propagate() call. Coccinelle patch used to perform the changes added to scripts/coccinelle/remove_local_err.cocci. Backports commit 6b62d961373e0327f2af8fb77d6d5d6308864180 from qemu	2018-02-24 19:12:25 -05:00
Peter Maydell	5ae787f895	target-arm: Provide hook to tell GICv3 about changes of security state The GICv3 CPU interface needs to know when the CPU it is attached to makes an exception level or mode transition that changes the security state, because whether it is asserting IRQ or FIQ can change depending on these things. Provide a mechanism for letting the GICv3 device register a hook to be called on such changes. Backports commit bd7d00fc50c9960876dd194ebf0c88889b53e765 from qemu	2018-02-24 19:09:22 -05:00
Peter Maydell	eec3a5f843	target-arm: Define new arm_is_el3_or_mon() function The GICv3 system registers need to know if the CPU is AArch64 in EL3 or AArch32 in Monitor mode. This happens to be the first part of the check for arm_is_secure(), so factor it out into a new arm_is_el3_or_mon() function that the GIC can also use. Backports commit 712058764da29b2908f6fbf56760ca4f15980709 from qemu	2018-02-24 19:04:27 -05:00

... 6 7 8 9 10 ...

2509 commits