Commit graph

3608 commits

Peter Maydell 9056a93c9a
target/arm: Don't store M profile PRIMASK and FAULTMASK in daif
We currently store the M profile CPU register state PRIMASK and
FAULTMASK in the daif field of the CPU state in its I and F
bits. This is a legacy from the original implementation, which
tried to share the cpu_exec_interrupt code between A profile
and M profile. We've since separated out the two cases because
they are significantly different, so now there is no common
code between M and A profile which looks at env->daif: all the
uses are either in A-only or M-only code paths. Sharing the state
fields now is just confusing, and will make things awkward
when we implement v8M, where the PRIMASK and FAULTMASK
registers are banked between security states.

Switch M profile over to using v7m.faultmask and v7m.primask
fields for these registers.

Backports commit e6ae5981ea4b0f6feb223009a5108582e7644f8f from qemu
2018-03-04 12:56:29 -05:00
Peter Maydell 5d6b031550
target/arm: Define and use XPSR bit masks
The M profile XPSR is almost the same format as the A profile CPSR,
but not quite. Define some XPSR_* macros and use them where we are
definitely dealing with an XPSR rather than reusing the CPSR ones.
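For reference, a sketch of what such XPSR_* masks look like; the values
follow the architectural M-profile XPSR layout, and the exact macro set in
the commit may differ:

    /* Sketch only: masks based on the architectural XPSR layout. */
    #define XPSR_EXCP   0x1ffU        /* exception number field */
    #define XPSR_T      (1U << 24)    /* Thumb state bit */
    #define XPSR_Q      (1U << 27)
    #define XPSR_V      (1U << 28)
    #define XPSR_C      (1U << 29)
    #define XPSR_Z      (1U << 30)
    #define XPSR_N      (1U << 31)
    #define XPSR_NZCV   (XPSR_N | XPSR_Z | XPSR_C | XPSR_V)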

Backports commit 987ab45e108953c1c98126c338c2119c243c372b from qemu
2018-03-04 12:54:41 -05:00
Peter Maydell 64c6727e4a
target/arm: Fix outdated comment about exception exit
When we switched our handling of exception exit to detect
the magic addresses at translate time rather than via
a do_unassigned_access hook, we forgot to update a
comment; correct the omission.

Backports commit 9d17da4b68a05fc78daa47f0f3d914eea5d802ea from qemu
2018-03-04 12:52:34 -05:00
Peter Maydell 219b3e8a08
target/arm: Remove incorrect comment about MPU_CTRL
Remove the comment that claims that some MPU_CTRL bits are stored
in sctlr_el[1]. This has never been true since MPU_CTRL was added
in commit 29c483a50607 -- the comment is a leftover from
Michael Davidsaver's original implementation, which I modified
not to use sctlr_el[1]; I forgot to delete the comment then.

Backports commit 59e4972c3fc63d981e8b613ebb3bb01a05848075 from qemu
2018-03-04 12:52:02 -05:00
Peter Maydell 108cff5e61
target/arm: Tighten up Thumb decode where new v8M insns will be
Tighten up the T32 decoder in the places where new v8M instructions
will be:
* TT/TTT/TTA/TTAT are in what was nominally LDREX/STREX r15, ...
  which is UNPREDICTABLE: make the UNPREDICTABLE behaviour be to UNDEF
* BXNS/BLXNS are distinguished from BX/BLX via the low 3 bits, which
  in previous architectural versions are SBZ: enforce the SBZ via UNDEF
  rather than ignoring it, and move the "ARCH(5)" UNDEF case up so we
  don't leak a TCG temporary
* SG is in the encoding which would be LDRD/STRD with rn = r15; this is
  UNPREDICTABLE and we currently UNDEF: move this check further up the
  code so that we don't leak TCG temporaries in the UNDEF case and have
  a better place to put the SG decode.

This means that if a v8M binary is accidentally run on v7M, or if a
test case hits something that we haven't implemented yet, the behaviour
will be obvious (UNDEF) rather than obscure (ploughing on and treating
it as a different instruction).

In the process, add some comments about the instruction patterns
at these points in the decode. Our Thumb and ARM decoders are
very difficult to understand currently, but gradually adding
comments like this should help to clarify what exactly has
been decoded when.

Backports commit ebfe27c593e5b222aa2a1fc545b447be3d995faa from qemu
2018-03-04 12:51:08 -05:00
Peter Maydell 6f4afe1a13
target/arm: Consolidate PMSA handling in get_phys_addr()
Currently get_phys_addr() has PMSAv7 handling before the
"is translation disabled?" check, and then PMSAv5 after it.
Tidy this up by making the PMSAv5 code handle the "MPU disabled"
case itself, so that we have all the PMSA code in one place.
This will make adding the PMSAv8 code slightly cleaner, and
also means that pre-v7 PMSA cores benefit from the MPU lookup
logging that the PMSAv7 codepath had.

Backports commit 3279adb95e34dd3d67c66d729458f7784747cf8d from qemu
2018-03-04 12:48:22 -05:00
Peter Maydell f85f301316
target/arm: Don't trap WFI/WFE for M profile
M profile cores can never trap on WFI or WFE instructions. Check for
M profile in check_wfx_trap() to ensure this.

The existing code will do the right thing for v7M cores because
the hcr_el2 and scr_el3 registers will be all-zeroes and so we
won't attempt to trap, but when we start setting ARM_FEATURE_V8
for v8M cores the v8A handling of SCTLR.nTWE and .nTWI will not
give the right results.

Backports commit 0e2845689ebdb4ea7174f96f6797e2d8942bd114 from qemu
2018-03-04 12:46:37 -05:00
Peter Maydell 2c9a196efe
target/arm: Use MMUAccessType enum rather than int
In the ARM get_phys_addr() code, switch to using the MMUAccessType
enum and its MMU_* values rather than int and literal 0/1/2.
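For context, the enum in question maps the old literal values onto named
constants, roughly as follows:

    /* Named constants replacing the old literal 0/1/2 access-type values. */
    typedef enum MMUAccessType {
        MMU_DATA_LOAD  = 0,
        MMU_DATA_STORE = 1,
        MMU_INST_FETCH = 2
    } MMUAccessType;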

Backports commit 03ae85f858fc46495258a5dd4551fff2c34bd495 from qemu
2018-03-04 12:45:56 -05:00
Brijesh Singh b9c18f22cd
target-i386/cpu: Add new EPYC CPU model
Add a new base CPU model called 'EPYC' to model processors from AMD EPYC
family (which includes EPYC 76xx, 75xx, 74xx, 73xx and 72xx).

The following feature bits have been added/removed compared to Opteron_G5:

Added: monitor, movbe, rdrand, mmxext, ffxsr, rdtscp, cr8legacy, osvw,
fsgsbase, bmi1, avx2, smep, bmi2, rdseed, adx, smap, clflushopt, sha,
xsaveopt, xsavec, xgetbv1, arat

Removed: xop, fma4, tbm

Backports commit 2e2efc7dbe2b0adc1200b5aa286cdbed729f6751 from qemu
2018-03-04 12:22:27 -05:00
Eduardo Habkost 382022929e
cpu: cpu_by_arch_id() helper
The helper can be used for CPU object lookup using the CPU's
arch-specific ID (the one returned by CPUClass::get_arch_id()).
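A hedged usage sketch; the signature is assumed from the description above,
and apic_id is a hypothetical caller-side variable:

    /* Sketch: look up a CPU by its arch-specific ID (the value returned
     * by CPUClass::get_arch_id()). */
    CPUState *cs = cpu_by_arch_id(apic_id);
    if (cs == NULL) {
        /* no CPU with this arch-specific ID exists */
    }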

Backports commit 5ce46cb34eecec0bc94a4b1394763f9a1bbe20c3 from qemu
2018-03-04 12:16:39 -05:00
Alexey Kardashevskiy 75afdffa45
memory: Move FlatView allocation to a helper
This moves a FlatView allocation and initialization to a helper.
While we are here, replace g_new with g_new0 so that we do not have to
worry about initializing any new fields added in the future.

This should cause no behavioural change.
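A minimal sketch of such a helper, assuming a flatview_init() initializer
already exists in memory.c (QEMU form shown; the unicorn port may thread
extra context parameters):

    /* Sketch only: allocate and initialize a FlatView in one place.
     * g_new0 zero-fills the struct, so future fields start out cleared. */
    static FlatView *flatview_new(void)
    {
        FlatView *view = g_new0(FlatView, 1);
        flatview_init(view);
        return view;
    }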

Backports commit de7e6815b84c797cbda56dc96fcacaf5f37d3a20 from qemu
2018-03-04 02:08:37 -05:00
Alexey Kardashevskiy e723b8dd49
memory: Open code FlatView rendering
We are going to share FlatViews between AddressSpaces, and per-AS
memory listeners won't suit the purpose anymore, so open-code
the dispatch tree rendering.

Since there is a good chance that dispatch_listener was the only
listener, this avoids address_space_update_topology_pass() if there are
no registered listeners; this should improve start-up time.

This should cause no behavioural change.

Backports commit 1b04a1580917d9e41fd37ca62cbff9b4bf061e96 from qemu
2018-03-04 02:06:48 -05:00
Alexey Kardashevskiy f74fcb194f
exec: Explicitly export target AS from address_space_translate_internal
This adds an AS** parameter to address_space_do_translate()
to make it easier for the next patch to share FlatViews.

This should cause no behavioural change.

Backports commit 6424975ce912061ac9e4a375237b0c89d83d93e3 from qemu
2018-03-04 01:56:13 -05:00
Eric Blake be742759b0
osdep: Fix ROUND_UP(64-bit, 32-bit)
When using bit-wise operations that exploit the power-of-two
nature of the second argument of ROUND_UP(), we still need to
ensure that the mask is as wide as the first argument (done
by using a ternary to force proper arithmetic promotion).
Unpatched, ROUND_UP(2ULL*1024*1024*1024*1024, 512U) produces 0,
instead of the intended 2TiB, because negation of an unsigned
32-bit quantity followed by widening to 64-bits does not
sign-extend the mask.
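An illustrative sketch of the problem and the ternary-based fix; the macro
bodies below are an approximation of osdep.h, with the _OLD/_NEW names
invented here for comparison:

    #define ROUND_UP_OLD(n, d)  (((n) + (d) - 1) & -(d))
    #define ROUND_UP_NEW(n, d)  (((n) + (d) - 1) & -(0 ? (n) : (d)))

    /* With n = 2ULL*1024*1024*1024*1024 and d = 512U:
     *   -(d) is computed as a 32-bit unsigned value and then zero-extended,
     *   so the high 32 bits of the mask are 0 and ROUND_UP_OLD yields 0.
     *   The ternary promotes the mask to the 64-bit type of n first,
     *   so ROUND_UP_NEW yields the intended 2TiB. */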

Broken since its introduction in commit 292c8e50 (v1.5.0).
Callers that passed the same width type to both macro parameters,
or that had other code to ensure the first parameter's maximum
runtime value did not exceed the second parameter's width, are
unaffected, but I did not audit to see which (if any) existing
clients of the macro could trigger incorrect behavior (I found
the bug while adding a new use of the macro).

While preparing the patch, checkpatch complained about poor
spacing, so I also fixed that here and in the nearby DIV_ROUND_UP.

Backports commit 33a599667a9e70588483a31286dfff8cfc27d513 from qemu
2018-03-04 01:54:09 -05:00
Alistair Francis 5d742aad0b
target/arm: Require alignment for load exclusive
According to the ARM ARM, exclusive loads require the same alignment as
exclusive stores. Let's update the memops used for the load to match
that of the store. This adds the alignment requirement to the memops.

Backports commit 4a2fdb78e794c1ad93aa9e160235d6a61a2125de from qemu
2018-03-04 01:53:04 -05:00
Richard Henderson 4a8f556c29
target/arm: Correct load exclusive pair atomicity
We are not providing the required single-copy atomic semantics for
the 64-bit operation that is the 32-bit paired load.

At the same time, leave the entire 64-bit value in cpu_exclusive_val
and stop writing to cpu_exclusive_high. This means that we do not
have to re-assemble the 64-bit quantity when it comes time to store.

At the same time, drop a redundant temporary and perform all loads
directly into the cpu_exclusive_* globals.

Backports commit 19514cde3b92938df750acaecf2caaa85e1d36a6 from qemu
2018-03-04 01:49:35 -05:00
Alistair Francis 009a52dd13
target/arm: Correct exclusive store cmpxchg memop mask
When we perform the atomic_cmpxchg operation we want to perform the
operation on a pair of 32-bit registers. Previously we were just passing
in the register size, which was set to MO_32. This would result in the
high register being ignored. To fix this issue we hardcode the size to
64 bits when operating on 32-bit pairs.

Backports commit 955fd0ad5d610f62ba2f4ce46a872bf50434dcf8 from qemu
2018-03-04 01:43:55 -05:00
Michael S. Tsirkin fd472c53c6
Revert "cpu: add APIs to allocate/free CPU environment"
This reverts commit e2a7f28693aea7e194ec1435697ec4feb24f8a6f.

This was not supposed to go upstream yet. Reverting.

Backports commit cde0a63ad721dbb538419a00f9405587680be436 from qemu
2018-03-04 01:42:49 -05:00
Joseph Myers e5b84c6d59
target/i386: set rip_offset for some SSE4.1 instructions
When emulating various SSE4.1 instructions such as pinsrd, the address
of a memory operand is computed without allowing for the 8-bit
immediate operand located after the memory operand, meaning that the
memory operand uses the wrong address in the case where it is
rip-relative. This patch adds the required rip_offset setting for
those instructions, so fixing some GCC test failures (13 in the gcc
testsuite in my GCC 6-based testing) when testing with a default CPU
setting enabling those instructions.
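A hedged illustration of the kind of fix described; field and call names are
based on the i386 translator, but treat the details as approximate:

    /* Sketch: before computing the modrm memory operand of an SSE4.1 insn
     * that carries an imm8 after it, record that one immediate byte follows,
     * so rip-relative addressing points past it. */
    s->rip_offset = 1;
    gen_lea_modrm(env, s, modrm);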

Backports commit ab6ab3e9972a49a359f59895a88bed311472ca97 from qemu
2018-03-04 01:41:43 -05:00
Michael S. Tsirkin 71bf994214
cpu: add APIs to allocate/free CPU environment
These will be implemented and then used by follow-up patches.

Backports commit e2a7f28693aea7e194ec1435697ec4feb24f8a6f from qemu
2018-03-04 01:39:09 -05:00
Richard Henderson b33f2b40e8
tcg: Increase minimum alignment from tcg_malloc to 8
For a 64-bit ILP32 host, aligning to sizeof(long) is not enough.
Guess the minimum for any host is 8, as that covers uint64_t.
Qemu doesn't use a host long double or host vectors, except in
extremely limited circumstances.

Fixes a bus error for a sparc v8plus host.

Backports commit 13aaef678ed377b12b76dc7fb9e615b2f2f9047b from qemu
2018-03-04 01:36:59 -05:00
Richard Henderson 29ea0681d0
tcg/arm: Fix runtime overalignment test
Patch 85aa80813dd changed the IF emitting the TST instruction,
but failed to change the ?: converting CMP to CMPEQ, so the
result of the TST is ignored.

Backports commit ca671de8af96798e0f493378240034620a3a04ee from qemu
2018-03-04 01:36:20 -05:00
James Hogan 4cc63bac09
target/mips: Fix RDHWR CC with icount
RDHWR CC reads the CPU timer like MFC0 CP0_Count, so with icount enabled
it must set can_do_io while it calls the helper to avoid the "Bad icount
read" error. It should also break out of the translation loop to ensure
that timer interrupts are immediately handled.
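The usual pattern for such a fix looks roughly like this (a sketch of the
QEMU-side change, not the exact patch; the unicorn backport may thread a
TCGContext argument through the gen_* calls):

    /* Bracket the timer-read helper with gen_io_start()/gen_io_end() so
     * can_do_io is set, then end the TB so a raised timer interrupt is
     * taken promptly. */
    if (ctx->tb->cflags & CF_USE_ICOUNT) {
        gen_io_start();
    }
    gen_helper_rdhwr_cc(t0, cpu_env);
    if (ctx->tb->cflags & CF_USE_ICOUNT) {
        gen_io_end();
    }
    ctx->bstate = BS_STOP;   /* leave the translation loop */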

Backports commit d673a68db6963e86536b125af464bb6ed03eba33 from qemu
2018-03-04 01:35:25 -05:00
James Hogan cb20fdce64
target/mips: Drop redundant gen_io_start/stop()
DMTC0 CP0_Cause does a redundant gen_io_start() and gen_io_end() pair,
even though this is done for all DMTC0 operations outside of the switch
statement. Remove these redundant calls.

Backports commit 51ca717b079dccae5b6cc9f45153f5044abd34f0 from qemu
2018-03-04 01:33:54 -05:00
James Hogan 0afa0c8ddc
target/mips: Use BS_EXCP where interrupts are expected
Commit e350d8ca3ac7 ("target/mips: optimize indirect branches") made
indirect branches able to directly find the next TB and jump straight to
it without breaking out of translated code and going around the main
execution loop. This breaks the assumption in target/mips/translate.c
that BS_STOP is sufficient to cause pending interrupts to be handled,
since interrupts are only checked in the main loop.

Fix a few of these assumptions by using gen_save_pc to update the saved
PC and using BS_EXCP instead of BS_STOP:

- [D]MFC0 CP0_Count may trigger a timer interrupt which should be
immediately handled.

- [D]MTC0 CP0_Cause may trigger an interrupt (but in fact translation
was only ever being stopped in the DMTC0 case).

- [D]MTC0 CP0_<any> when icount is used, since it is assumed this could
potentially cause interrupts.

- EI may trigger an interrupt which was pending. I specifically hit
this case when running KVM nested in mipsel-softmmu. A timer
interrupt while the 2nd guest was executing is caught by KVM which
switches back to the normal Linux exception base and re-enables
interrupts with EI. Since the above commit QEMU doesn't leave
translated code until the nested KVM has already restored the KVM
exception base and returned to the 2nd guest, at which point it is
too late to check for pending interrupts and it gets stuck in an
infinite loop of unhandled interrupts.

Something similar was needed for ARM in commit b29fd33db578
("target/arm: use DISAS_EXIT for eret handling").

Backports commit b74cddcbf6063f684725e3f8bca49a68e30cba71 from qemu
2018-03-04 01:32:24 -05:00
Leon Alrae 4a1ec3bb80
target-mips: apply CP0.PageMask before writing into TLB entry
PFN0 and PFN1 have to be masked out with PageMask_Mask.

Backports commit 2d1847ec1ca47fe82f1d8122409cedffdd3925d5 from qemu
2018-03-04 01:27:51 -05:00
James Hogan 7cf1a4276e
mips: Improve segment defs for KVM T&E guests
Improve the segment definitions used by get_physical_address() to yield
target_ulong types, e.g. 0xffffffff80000000 instead of 0x80000000. This
is in preparation for enabling emulation of MIPS KVM T&E segments in TCG
MIPS targets, which unlike KVM could potentially have 64-bit
target_ulong. In such a case the offset guest KSEG0 address ends up at
e.g. 0x000000008xxxxxxx instead of 0xffffffff8xxxxxxx.

This also allows the casts to int32_t that force sign extension to be
removed, which removes any confusion due to relational comparison of
unsigned (target_ulong) and signed (int32_t) types.

Backports commit 6743334568933199927af4992a04bfb3c30610f5 from qemu
2018-03-04 01:26:42 -05:00
James Hogan 987401c4d4
target-mips: Don't stop on [d]mtc0 DESAVE/KScratch
Writing to the MIPS DESAVE register (and now the KScratch registers)
will stop translation, supposedly due to risk of execution mode
switches. However these registers are basically RW scratch registers
with no side effects so there is no risk of them triggering execution
mode changes.

Drop the bstate = BS_STOP for these registers for both mtc0 and dmtc0.

Backports commit cb539fd241900f51de7d21244f7a55422ad0d40a from qemu
2018-03-04 01:25:27 -05:00
Anthony PERARD 567bc68803
exec: Add lock parameter to qemu_ram_ptr_length
Commit 04bf2526ce87f21b32c9acba1c5518708c243ad0 (exec: use
qemu_ram_ptr_length to access guest ram) started using qemu_ram_ptr_length
instead of qemu_map_ram_ptr, but when used with Xen, the behaviour of the
two functions is different. They both call xen_map_cache, but one with
"lock", meaning the mapping of guest memory is never released
implicitly, and the second one without, which means the mapping can be
released later, when needed.

In the context of address_space_{read,write}_continue, the pointer to
those mappings should not be locked, because it is used immediately and
never used again.

The lock parameter makes it explicit in which context qemu_ram_ptr_length
is called.

Backports commit f5aa69bdc3418773f26747ca282c291519626ece from qemu
2018-03-04 01:23:14 -05:00
Peter Maydell d72175d671
target/arm: Move PMSAv7 reset into arm_cpu_reset() so M profile MPUs get reset
When the PMSAv7 implementation was originally added it was for R profile
CPUs only, and reset was handled using the cpreg .resetfn hooks.
Unfortunately for M profile cores this doesn't work, because they do
not register any cpregs. Move the reset handling into arm_cpu_reset(),
where it will work for both R profile and M profile cores.

Backports commit 69ceea64bf565559a2b865ffb2a097d2caab805b from qemu
2018-03-04 01:20:57 -05:00
Peter Maydell 6add2f0f65
target/arm: Rename cp15.c6_rgnr to pmsav7.rnr
Almost all of the PMSAv7 state is in the pmsav7 substruct of
the ARM CPU state structure. The exception is the region
number register, which is in cp15.c6_rgnr. This exception
is a bit odd for M profile, which otherwise generally does
not store state in the cp15 substruct.

Rename cp15.c6_rgnr to pmsav7.rnr accordingly.

Backports commit 8531eb4f614a60e6582d4832b15eee09f7d27874 from qemu
2018-03-04 01:18:53 -05:00
Peter Maydell 266885f50f
target/arm: Don't allow guest to make System space executable for M profile
For an M profile v7PMSA, the system space (0xe0000000 - 0xffffffff) can
never be executable, even if the guest tries to set the MPU registers
up that way. Enforce this restriction.

Backports commit bf446a11dfb17ae7d8ed2b61a2444804eb458075 from qemu
2018-03-04 01:17:01 -05:00
Peter Maydell 34b9740081
target/arm: Don't do MPU lookups for addresses in M profile PPB region
The M profile PMSAv7 specification says that if the address being looked
up is in the PPB region (0xe0000000 - 0xe00fffff) then we do not use
the MPU regions but always use the default memory map. Implement this
(we were previously behaving like an R profile PMSAv7, which does not
special case this).

Backports commit 38aaa60ca464b48e6feef346709e97335d01b289 from qemu
2018-03-04 01:14:22 -05:00
Peter Maydell 4dc69f4b26
target/arm: Correct MPU trace handling of write vs execute
Correct off-by-one bug in the PMSAv7 MPU tracing where it would print
a write access as "reading", an insn fetch as "writing", and a read
access as "execute".

Since we have an MMUAccessType enum now, we can make the code clearer
in the process by using that rather than the raw 0/1/2 values.

Backports commit 709e4407add7acacc593cb6cdac026558c9a8fb6 from qemu
2018-03-04 01:13:19 -05:00
James Hogan b35fb57c84
target/mips: Enable CP0_EBase.WG on MIPS64 CPUs
Enable the CP0_EBase.WG (write gate) on the I6400 and MIPS64R2-generic
CPUs. This allows 64-bit guests to run KVM itself, which uses
CP0_EBase.WG to point CP0_EBase at XKPhys.

Backports commit bad63a8008a0aaefcd00542c89bee01623d7c9de from qemu
2018-03-04 01:09:47 -05:00
James Hogan 16d97568e2
target/mips: Add EVA support to P5600
Add the Enhanced Virtual Addressing (EVA) feature to the P5600 core
configuration, along with the related Segmentation Control (SC) feature
and writable CP0_EBase.WG bit.

This allows it to run Malta EVA kernels.

Backports commit 574da58e4678b3c09048f268821295422d8cde6d from qemu
2018-03-04 01:08:19 -05:00
James Hogan 1ef8c8bd48
target/mips: Implement segmentation control
Implement the optional segmentation control feature in the virtual to
physical address translation code.

The fixed legacy segment and xkphys handling is replaced with a dynamic
layout based on the segmentation control registers (which should be set
up even when the feature is not exposed to the guest).

Backports commit 480e79aedd322fcfac17052caff21626ea7c78e2 from qemu
2018-03-04 01:06:13 -05:00
James Hogan ddbea9422c
target/mips: Add segmentation control registers
The optional segmentation control registers CP0_SegCtl0, CP0_SegCtl1 &
CP0_SegCtl2 control the behaviour and required privilege of the legacy
virtual memory segments.

Add them to the CP0 interface so they can be read and written when
CP0_Config3.SC=1, and initialise them to describe the standard legacy
layout so they can be used in future patches regardless of whether they
are exposed to the guest.

Backports commit cec56a733dd2c3fa81dbedbecf03922258747f7d from qemu
2018-03-04 01:00:42 -05:00
James Hogan 7e9b84ca1a
target/mips: Add an MMU mode for ERL
The segmentation control feature allows a legacy memory segment to
become unmapped uncached at error level (according to CP0_Status.ERL),
and in fact the user segment is already treated in this way by QEMU.

Add a new MMU mode for this state so that QEMU's mappings don't persist
between ERL=0 and ERL=1.

Backports commit 42c86612d507c2a8789f2b8d920a244693c4ef7b from qemu
2018-03-04 00:47:19 -05:00
James Hogan f285157856
target/mips: Abstract mmu_idx from hflags
The MIPS mmu_idx is sometimes calculated from hflags without an env
pointer available as cpu_mmu_index() requires.

Create a common hflags_mmu_index() for the purpose of this calculation
which can operate on any hflags, not just with an env pointer, and
update cpu_mmu_index() itself and gen_intermediate_code() to use it.
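A minimal sketch of the idea, assuming the privilege mode lives in the
MIPS_HFLAG_KSU bits of hflags (the actual helper may handle more cases):

    /* Compute the MMU index from any hflags value, and make
     * cpu_mmu_index() a thin wrapper around it. */
    static inline int hflags_mmu_index(uint32_t hflags)
    {
        return hflags & MIPS_HFLAG_KSU;
    }

    static inline int cpu_mmu_index(CPUMIPSState *env, bool ifetch)
    {
        return hflags_mmu_index(env->hflags);
    }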

Also update debug_post_eret() and helper_mtc0_status() to log the MMU
mode with the status change (SM, UM, or nothing for kernel mode) based
on cpu_mmu_index() rather than directly testing hflags.

This will also allow the logic to be more easily updated when a new MMU
mode is added.

Backports commit b0fc6003224543d2bdb172eca752656a6223e4a1 from qemu
2018-03-04 00:45:00 -05:00
James Hogan 8595d11eb4
target/mips: Check memory permissions with mem_idx
When performing virtual to physical address translation, check the
required privilege level based on the mem_idx rather than the mode in
the hflags. This will allow EVA loads & stores to operate safely only on
user memory from kernel mode.

For the cases where the mmu_idx doesn't need to be overridden
(mips_cpu_get_phys_page_debug() and cpu_mips_translate_address()), we
calculate the required mmu_idx using cpu_mmu_index(). Note that this
only tests the MIPS_HFLAG_KSU bits rather than MIPS_HFLAG_MODE, so we
don't test the debug mode hflag MIPS_HFLAG_DM any longer. This should be
fine as get_physical_address() only compares against MIPS_HFLAG_UM and
MIPS_HFLAG_SM, neither of which should get set by compute_hflags() when
MIPS_HFLAG_DM is set.

Backports commit 9fbf4a58c90183b30bb2c8ad971ccce7e6716a16 from qemu
2018-03-04 00:40:22 -05:00
James Hogan 54b349aee5
target/mips: Decode microMIPS EVA load & store instructions
Implement decoding of microMIPS EVA load and store instruction groups in
the POOL31C pool. These use the same gen_ld(), gen_st(), gen_st_cond()
helpers as the MIPS32 decoding, passing the equivalent MIPS32 opcodes as
opc.

Backports commit 8fffc64696783b1ff1d17262d098976479895660 from qemu
2018-03-04 00:37:39 -05:00
Leon Alrae 8fadc55db3
target-mips: make ITC Configuration Tags accessible to the CPU
Add CP0.ErrCtl register with WST, SPR and ITC bits. In 34K and interAptiv
processors these bits are used to enable CACHE instruction access to
different arrays. When WST=0, SPR=0 and ITC=1 the CACHE instruction will
access ITC tag values.

Generally we do not model caches and we have been treating the CACHE
instruction as a NOP. But since CACHE can operate on ITC Tags, a new
MIPS_HFLAG_ITC_CACHE hflag is introduced to generate the helper only when
CACHE is in the ITC Access mode.

Backports commit 0d74a222c27e26fc40f4f6120c61c3f9ceaa3776 from qemu
2018-03-04 00:34:30 -05:00
Leon Alrae a338e9c855
target-mips: enable CM GCR in MIPS64R6-generic CPU 2018-03-04 00:24:09 -05:00
James Hogan 22ca920e40
target/mips: Decode MIPS32 EVA load & store instructions
Implement decoding of MIPS32 EVA loads and stores. These access the user
address space from kernel mode when implemented, so for each instruction
we need to check that EVA is available from Config5.EVA & check for
sufficient COP0 privilege (with the new check_eva()), and then override
the mem_idx used for the operation.

Unfortunately some Loongson 2E instructions use overlapping encodings,
so we must be careful not to prevent those from being decoded when EVA
is absent.

Backports commit 7696414729b2d0f870c80ad1dd637d854bc78847 from qemu
2018-03-04 00:20:09 -05:00
James Hogan 42a5534ade
target/mips: Prepare loads/stores for EVA
EVA load and store instructions access the user mode address map, so
they need to use mem_idx of MIPS_HFLAG_UM. Update the various utility
functions to allow mem_idx to be more easily overridden from the
decoding logic.

Specifically we add a mem_idx argument to the op_ld/st_* helpers used
for atomics, and a mem_idx local variable to gen_ld(), gen_st(), and
gen_st_cond().

Backports commit dd4096cd2ccc19384770f336c930259da7a54980 from qemu
2018-03-04 00:14:09 -05:00
James Hogan 152323fe35
target/mips: Add CP0_Ebase.WG (write gate) support
Add support for the CP0_EBase.WG bit, which allows upper bits to be
written (bits 31:30 on MIPS32, or bits 63:30 on MIPS64), along with the
CP0_Config5.CV bit to control whether the exception vector for Cache
Error exceptions is forced into KSeg1.

This is necessary on MIPS32 to support Segmentation Control and Enhanced
Virtual Addressing (EVA) extensions (where KSeg1 addresses may not
represent an unmapped uncached segment).

It is also useful on MIPS64 to allow the exception base to reside in
XKPhys, and possibly out of range of KSEG0 and KSEG1.

Backports commit 74dbf824a1313b6064bbebb981a7440951d70896 from qemu
2018-03-03 23:55:09 -05:00
James Hogan 72677eadd0
target/mips: Weaken TLB flush on UX,SX,KX,ASID changes
There is no need to invalidate any shadow TLB entries when the ASID
changes or when access to one of the 64-bit segments has been disabled,
since doing so doesn't reveal to software whether any TLB entries have
been evicted into the shadow half of the TLB.

Therefore weaken the tlb flushes in these cases to only flush the QEMU
TLB.

Backports commit 9658e4c342e6ae0d775101f8f6bb6efb16789af1 from qemu
2018-03-03 23:40:37 -05:00
James Hogan 310e3f0a1d
target/mips: Fix TLBWI shadow flush for EHINV,XI,RI
Writing specific TLB entries with TLBWI flushes shadow TLB entries
unless an existing entry is having its access permissions upgraded. This
is necessary as software would from then on expect the previous mapping
in that entry to no longer be in effect (even if QEMU has quietly
evicted it to the shadow TLB on a TLBWR).

However it won't do this if only EHINV, XI, or RI bits have been set,
even if that results in a reduction of permissions, so add the necessary
checks to invoke the flush when these bits are set.

Backports commit eff6ff9431aa9776062a5f4a08d1f6503ca9995a from qemu
2018-03-03 23:39:18 -05:00
James Hogan fe0de45a26
target/mips: Fix MIPS64 MFC0 UserLocal on BE host
Using MFC0 to read CP0_UserLocal uses tcg_gen_ld32s_tl, however
CP0_UserLocal is a target_ulong. On a big endian host with a MIPS64
target this reads and sign extends the more significant half of the
64-bit register.

Fix this by using ld_tl to load the whole target_ulong and ext32s_tl to
sign extend it, as done for various other target_ulong COP0 registers.
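The described fix boils down to something like the following sketch of the
TCG ops involved (QEMU form shown; the unicorn backport may thread a
TCGContext argument):

    /* Load the full target_ulong and explicitly sign-extend the low
     * 32 bits, instead of a 32-bit load that picks the wrong half on
     * big-endian hosts. */
    tcg_gen_ld_tl(arg, cpu_env,
                  offsetof(CPUMIPSState, active_tc.CP0_UserLocal));
    tcg_gen_ext32s_tl(arg, arg);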

Backports commit e40df9a80bb7cdb0a4ca650985fa9fe572097fa7 from qemu
2018-03-03 23:37:41 -05:00
Lluís Vilanova 32b3c3815d
tcg: Pass generic CPUState to gen_intermediate_code()
Needed to implement a target-agnostic gen_intermediate_code()
in the future.

Backports commit 9c489ea6bed134fecfd556b439c68bba48fbe102 from qemu
2018-03-03 23:34:18 -05:00
Philippe Mathieu-Daudé 382dcb2deb
target/sparc: optimize gen_op_mulscc() using deposit op
Backports commit 08d64e0db02e826b063d2b0d8b84f1cb1f7306c9 from qemu
2018-03-03 23:21:28 -05:00
Philippe Mathieu-Daudé 3827b167e2
target/sparc: optimize various functions using extract op
Done with the Coccinelle semantic patch
scripts/coccinelle/tcg_gen_extract.cocci.

Backports commit 0b1183e315cce99102898bda54f69b685157a507 from qemu
2018-03-03 23:11:29 -05:00
Philippe Mathieu-Daudé e5486b636b
target/m68k: optimize bcd_flags() using extract op
Done with the Coccinelle semantic patch
scripts/coccinelle/tcg_gen_extract.cocci.

Backports commit 0d9acef24062844b96c671b4379d9fb03c3ea606 from qemu
2018-03-03 23:09:13 -05:00
Richard Henderson fc52eea5e2
tcg: Expand glue macros before stringifying helper names
Backports commit 44368ac62dc5ba014b68b2c1a8ec6fedc3242a5d from qemu
2018-03-03 23:07:21 -05:00
Philippe Mathieu-Daudé b7ab3c861d
util/cacheinfo: Add missing include for ppc linux
This include was forgotten when splitting cacheinfo.c out of
tcg/ppc/tcg-target.inc.c (see commit b255b2c8).

For a Centos7 host, the include path

<signal.h>
<bits/sigcontext.h>
<asm/sigcontext.h>
<asm/elf.h>
<asm/auxvec.h>

implicitly pulls in the desired AT_* defines.
Not so for Debian Jessie.

Backports commit 810d5cad4087236236e00fd3046a16adf26e9060 from qemu
2018-03-03 23:05:44 -05:00
Jiang Biao f1211b1c88
tcg/mips: reserve a register for the guest_base.
Reserve a register for the guest_base using ppc code for reference.
By doing so, we do not have to recompute it for every memory load.

Backports commit 4df9cac57f5220c17d856292e90fce455f708421 from qemu
2018-03-03 23:04:55 -05:00
Boqun Feng (Intel) 53242e647d
i386: add Skylake-Server cpu model
Introduce the Skylake-Server cpu model, which inherits the features from
Skylake-Client and supports some additional features, namely: AVX512,
CLWB and PDPE1GB.

Backports commit 53f9a6f45fb214540cb40af45efc11ac40ac454c from qemu
2018-03-03 23:02:30 -05:00
Eduardo Habkost 8f04fd8b8a
i386: Update comment about XSAVES on Skylake-Client
Backports commit cf70879f14d83287d0d6af3b0d7ba7a322ea9ece from qemu
2018-03-03 22:57:07 -05:00
Daniel P. Berrange abf3c71af2
i386: expose TCGTCGTCGTCG in the 0x40000000 CPUID leaf
Currently when running KVM, we expose "KVMKVMKVM\0\0\0" in
the 0x40000000 CPUID leaf. Other hypervisors (VMWare,
HyperV, Xen, BHyve) all do the same thing, which leaves
TCG as the odd one out.

The CPUID signature is used by software to detect which
virtual environment they are running in and (potentially)
change behaviour in certain ways. For example, systemd
supports a ConditionVirtualization= setting in unit files.
The virt-what command can also report the virt type it is running on.

Currently both these apps have to resort to custom hacks, like looking
for a 'fw-cfg' entry under /proc/device-tree, to identify TCG.

This change thus proposes a signature "TCGTCGTCGTCG" to be
reported when running under TCG.

To hide this, the -cpu option tcg-cpuid=off can be used.
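For context, a guest can read this signature with a plain CPUID on leaf
0x40000000; a minimal guest-side sketch (not part of the patch):

    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        uint32_t eax, regs[3];
        char sig[13] = { 0 };

        /* Hypervisor signature is returned in EBX, ECX, EDX. */
        __asm__ volatile("cpuid"
                         : "=a"(eax), "=b"(regs[0]), "=c"(regs[1]), "=d"(regs[2])
                         : "a"(0x40000000));
        (void)eax;  /* max hypervisor leaf, unused here */
        memcpy(sig, regs, 12);
        printf("hypervisor signature: %s\n", sig);  /* e.g. "TCGTCGTCGTCG" */
        return 0;
    }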

Backports commits 4ed3d478c63dc65a02eba774c35116618ea5ff10 and  1ce36bfe6424243082d3d7c2330e1a0a4ff72a43 from qemu
2018-03-03 22:56:32 -05:00
Eduardo Habkost 570c064065
qom: Fix ambiguous path detection when ambiguous=NULL
object_resolve_path*() ambiguous path detection breaks when
ambiguous==NULL and the object tree has 3 objects of the same type and
only 2 of them are under the same parent. e.g.:

/container/obj1 (TYPE_FOO)
/container/obj2 (TYPE_FOO)
/obj2 (TYPE_FOO)

With the above tree, object_resolve_path_type("", TYPE_FOO, NULL) will
incorrectly return /obj2, because the search inside "/container" will
return NULL, and the match at "/obj2" won't be detected as ambiguous.

Fix that by always calling object_resolve_partial_path() with a non-NULL
ambiguous parameter.

Backports commit ebcc479eee740937e70a94a468effcf2126a572b from qemu
2018-03-03 22:49:21 -05:00
Aurelien Jarno 1c0169842d
target/mips: optimize WSBH, DSBH and DSHD
Use the same mask to avoid having to load two different constants.

Backports commit 06a57e5cc7ee5292a4915117ebf951e310a28264 from qemu
2018-03-03 22:47:39 -05:00
Pavel Dovgalyuk 342fa7135d
mips: set CP0 Debug DExcCode for SDBBP instruction
This patch fixes setting the DExcCode field of the CP0 Debug register
when the SDBBP instruction is executed. According to the EJTAG specification,
this field must be set to the value 9 (Bp).

Backports commit c6c2c0fc32362ba234ae3bdad1a55c2d6aefaa12 from qemu
2018-03-03 22:45:08 -05:00
Alex Bennée 0bd8dc4e0a
target/arm: use DISAS_EXIT for eret handling
Previously DISAS_JUMP did ensure this but with the optimisation of
8a6b28c7 (optimize indirect branches) we might not leave the loop.
This means if any pending interrupts are cleared by changing IRQ flags
we might never get around to servicing them. You usually notice this
by seeing the lookup_tb_ptr() helper gainfully chaining TBs together
while cpu->interrupt_request remains high and the exit_request has not
been set.

This breaks amongst other things the OPTEE test suite which executes
an eret from the secure world after a non-secure world IRQ has gone
pending which then never gets serviced.

Instead of using the previously implied semantics of DISAS_JUMP we use
DISAS_EXIT which will always exit the run-loop.

Backports commit b29fd33db578decacd14f34933b29aece3e7c25e from qemu
2018-03-03 22:43:16 -05:00
Alex Bennée 65356210a8
target/arm: use gen_goto_tb for ISB handling
While an ISB will ensure any raised IRQs happen on the next
instruction it doesn't cause any to get raised by itself. We can
therefore use a simple tb exit for ISB instructions and rely on the
exit_request check at the top of each TB to deal with exiting if
needed.

Backports commit 0b609cc128ba5ef16cc841bcade898d1898f1dc3 from qemu
2018-03-03 22:42:33 -05:00
Alex Bennée 0f8d216d67
target/arm/translate: ensure gen_goto_tb sets exit flags
As the gen_goto_tb function can do both static and dynamic jumps it
should also set the is_jmp field. This matches the behaviour of the
a64 code.

Backports commit 4cae8f56fbab2798586576a56cc669f0127d04fb from qemu
2018-03-03 22:38:12 -05:00
Alex Bennée bffa25cc07
target/arm/translate.h: expand comment on DISAS_EXIT
We already have an exit condition, DISAS_UPDATE, which will exit the
run-loop. Expand on the difference with DISAS_EXIT in the comments.

Backports commit abd1fb0ee2c58b99f4b2d15718f1825fe4984e12 from qemu
2018-03-03 22:38:11 -05:00
Alex Bennée 63d40e1a55
target/arm/translate: make DISAS_UPDATE match declared semantics
DISAS_UPDATE should be used when the wider CPU state other than just
the PC has been updated and we should therefore exit the TCG runtime
and return to the main execution loop rather than assuming DISAS_JUMP would
do that.

Backports commit e8d5230221851e8933811f1579fd13371f576955 from qemu
2018-03-03 22:38:07 -05:00
Alex Bennée 7d02489baf
include/exec/exec-all: document common exit conditions
As a precursor to later patches, attempt to come up with a more
concrete wording for what each of the common exit cases would be.

Backports commit df0311e634828fdc99ca59352aef68503d631aad from qemu
2018-03-03 22:31:28 -05:00
Peter Maydell e31653de84
target/arm: Make Cortex-M3 and M4 default to 8 PMSA regions
The Cortex-M3 and M4 CPUs always have 8 PMSA MPU regions (this isn't
a configurable option for the hardware). Make the default value of
the pmsav7-dregion property be set per-cpu, so we don't need to have
every user of these CPUs set it manually. (The existing default of
16 is correct for the other PMSAv7 core, the Cortex-R5.)

This fixes a bug where we were creating the M3 and M4 with
too many regions; most guest software would not notice or
care, though, since it would just not use the registers
associated with the unexpected extra regions.

Backports commit 8d92e26b452f8961ec90df3f93cf5f3b7a9d158f from qemu
2018-03-03 22:30:32 -05:00
Peter Maydell 3bd5694a0a
memory: Rename memory_region_init_rom() and _rom_device() to _nomigrate()
Rename memory_region_init_rom() to memory_region_init_rom_nomigrate()
and memory_region_init_rom_device() to
memory_region_init_rom_device_nomigrate().

Backports commit b59821a95bd1d7cb4697fd7748725c910582e0e7 from qemu
2018-03-03 22:29:01 -05:00
Peter Maydell 7b0027a828
memory: Rename memory_region_init_ram() to memory_region_init_ram_nomigrate()
Rename memory_region_init_ram() to memory_region_init_ram_nomigrate().
This leaves the way clear for us to provide a memory_region_init_ram()
which does handle migration.

Backports commit 1cfe48c1ce219b60a9096312f7a61806fae64ab3 from qemu
2018-03-03 22:25:39 -05:00
Peter Maydell 152c56f6a9
memory: Document that the RAM MR initializers do not handle migration
The various functions for initializing RAM MemoryRegions do not do
anything to cause the data in the MemoryRegion to be migrated.
Note in their documentation comments that this is the responsibility
of the caller.

(We will shortly add a new function that *does* do this for you.)

Backports commit a5c0234bb2754f5248e67929a34c843dbe039da5 from qemu
2018-03-03 22:20:32 -05:00
Peter Maydell 3c2d3d8363
include/hw/boards.h: Document memory_region_allocate_system_memory()
Add a documentation comment for memory_region_allocate_system_memory().

In particular, the reason for this function's existence and the
requirement on board code to call it exactly once are non-obvious.

Backports commit 09ad643823dcda0a86eddce1291c28d0ccb09a3b from qemu
2018-03-03 22:18:49 -05:00
Igor Mammedov fe4152c6a5
qom: enforce readonly nature of link's check callback
A link's check callback is supposed to verify/permit setting it;
however, currently nothing restricts the callback from misusing it
and modifying the target object from within.
Make sure that readonly semantics are checked by the compiler
to prevent the callback's misuse.

Backports commit 8f5d58ef2c92d7b82d9a6eeefd7c8854a183ba4a from qemu
2018-03-03 22:17:20 -05:00
Pranith Kumar d0a70720a3
Revert "exec.c: Fix breakpoint invalidation race"
Now that we have proper locking after MTTCG patches have landed, we
can revert the commit. This reverts commit

a9353fe897ca2687e5b3385ed39e3db3927a90e0.

Backports commit 406bc339b0505fcfc2ffcbca1f05a3756e338a65 from qemu
2018-03-03 22:14:35 -05:00
Paolo Bonzini 7b337b9c07
build: add -Wexpansion-to-defined
This warning is included in -Wall by clang, but not by GCC (which only
enables it for -Wextra). Include it in the list of warnings we enable
to minimize the differences between the compilers:

Backports commit b98fcfd8840f290c406c32301340e96f00238a93 from qemu
2018-03-03 22:12:31 -05:00
Marc-André Lureau 9926281c05
scripts: use build_ prefix for string not piped through cgen()
The gen_ prefix is awkward. Generated C should go through cgen()
exactly once (see commit 1f9a7a1). The common way to get this wrong is
passing a foo=gen_foo() keyword argument to mcgen(). I'd like us to
adopt a naming convention where gen_ means "something that's been piped
through cgen(), and thus must not be passed to cgen() or mcgen()".
Requires renaming gen_params(), gen_marshal_proto() and
gen_event_send_proto().

Backports commit 086ee7a6200fa5ad795b12110b5b3d5a93dcac3e from qemu
2018-03-03 22:11:28 -05:00
Miodrag Dinic 8daabd339e
target/mips: fix msa copy_[s|u]_df rd = 0 corner case
This patch fixes the msa copy_[s|u]_df instruction emulation when
the destination register rd is zero. Without this patch the zero
register would get clobbered, which should never happen because it
is supposed to be hardwired to 0.

Fix this corner case by explicitly checking for rd == 0 and effectively
making these instructions' emulation a no-op in that case.

Backports commit cab4888136a92250fdd401402622824994f7ce0b from qemu
2018-03-03 22:08:12 -05:00
Jiang Biao 60703a4f57
tcg/mips: Bugfix for crash when running program with qemu-i386.
When running a helloworld program with qemu-i386 in linux-user
mode on Loongson 3A3000, it will crash. This patch fixes the bug.

Backports commit 8b8d768f19037a825a0bc81654492caa7c8fab8b from qemu
2018-03-03 22:06:26 -05:00
Pranith Kumar 2141c777f1
util/cacheinfo: Fix warning generated by clang
Clang generates the following warning on aarch64 host:

CC util/cacheinfo.o
/home/pranith/qemu/util/cacheinfo.c:121:48: warning: value size does not match register size specified by the constraint and modifier [-Wasm-operand-widths]
asm volatile("mrs\t%0, ctr_el0" : "=r"(ctr));
^
/home/pranith/qemu/util/cacheinfo.c:121:28: note: use constraint modifier "w"
asm volatile("mrs\t%0, ctr_el0" : "=r"(ctr));
^~
%w0

Constraint modifier 'w' is not (yet?) accepted by gcc. Fix this by increasing the ctr size.
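The fix amounts to widening the variable holding the system register,
roughly as follows (a sketch, not the exact patch):

    /* Use a 64-bit variable so the "=r" constraint matches the X-register
     * width that mrs writes. */
    uint64_t ctr;
    asm volatile("mrs\t%0, ctr_el0" : "=r"(ctr));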

Backports commit 2ae96c157ab3155baf6595c08cf5d3fe3c023a60 from qemu
2018-03-03 22:04:12 -05:00
Pranith Kumar 57f8eec080
tcg/aarch64: Enable indirect jump path using LDR (literal)
This patch enables the indirect jump path using an LDR (literal)
instruction. It will be interesting to test and see which performs
better among the two paths.

Backports commit 2acee8b2b5e6bba2935bb6ce5be92d0f0f9799cb from qemu
2018-03-03 22:03:39 -05:00
Pranith Kumar 5e9e39cafd
tcg/aarch64: Use ADRP+ADD to compute target address
We use ADRP+ADD to compute the target address for goto_tb. This patch
introduces the NOP instruction which is used to align the above
instruction pair so that we can use one atomic instruction to patch
the destination offsets.

Backports commit b68686bd4bfeb70040b4099df993dfa0b4f37b03 from qemu
2018-03-03 22:01:38 -05:00
Pranith Kumar 0998ba8259
tcg/aarch64: Introduce and use long branch to register
We can use a branch to register instruction for exit_tb for offsets
greater than 128MB.

Backports commit 23b7aa1d2af04ba57cc94f74d9f0ab25dce72fa0 from qemu
2018-03-03 21:59:58 -05:00
Yang Zhong 1e0745b31a
target/i386: add the CONFIG_TCG into Makefiles
Add the CONFIG_TCG for frontend and backend's files in the related
Makefiles.

Backports commit 44eff673411381062b826d048ba9d6630d2b2bdb from qemu
2018-03-03 21:57:22 -05:00
Yang Zhong a16bcbdac0
target/i386: add the tcg_enabled() in target/i386/
Add the tcg_enabled() where the x86 target needs to disable
TCG-specific code.

Backports commit 79c664f62d75cfba89a5bbe998622c8d5fdf833b from qemu
2018-03-03 21:56:31 -05:00
Yang Zhong 0c739344d3
target/i386: split cpu_set_mxcsr() and make cpu_set_fpuc() inline
Split the cpu_set_mxcsr() and make cpu_set_fpuc() inline with specific
tcg code.

Backports commit 1d8ad165b688759bbf00e40431ee9fde8817d190 from qemu
2018-03-03 21:52:29 -05:00
Yang Zhong 24225cb6fa
target/i386: make cpu_get_fp80()/cpu_set_fp80() static
Move cpu_get_fp80()/cpu_set_fp80() from fpu_helper.c to
machine.c because fpu_helper.c will be disabled if tcg is
disabled in the build.

Backports commit db573d2cf7ae6b5a4fc324be6f55e078fc218464 from qemu.
In unicorn's case, they can be moved into unicorn.c
2018-03-03 21:44:09 -05:00
Yang Zhong 35e0595d1c
target/i386: move cpu_sync_bndcs_hflags() function
Move cpu_sync_bndcs_hflags() function from mpx_helper.c
to helper.c because mpx_helper.c needs to be disabled when
tcg is disabled.

Backports commit ab0a19d4f08d924e052eb369420d264240872f8a from qemu
2018-03-03 21:41:26 -05:00
Yang Zhong 7e32537efa
tcg: add the CONFIG_TCG into Makefiles
Add the CONFIG_TCG for frontend and backend's files in the related
Makefiles.

Backports commit e4b4b6428ca45cb1374dab98ab1d23a213a5db9a from qemu
2018-03-03 21:39:30 -05:00
Yang Zhong 1135db176f
tcg: add CONFIG_TCG guards in headers
Add CONFIG_TCG around TLB-related functions and structure declarations.
Some of these functions are defined in ./accel/tcg/cputlb.c, which will
not be linked in if TCG is disabled, and have no stubs; therefore, their
callers will also be compiled out for --disable-tcg.

Backports commit b11ec7f2e44b285a3967d629b55d1a6970b06787 from qemu
2018-03-03 21:37:52 -05:00
Lioncash 0f4ebf07d8
qom/cpu: Silence an unused variable warning 2018-03-03 21:37:04 -05:00
Paolo Bonzini 4964bdcc29
configure: add --disable-tcg configure option
This lets you build without TCG (hardware acceleration or qtest only). When
this flag is passed to configure, it will automatically filter out the target
list to only those that support KVM or Xen or HAX.

Backports commit b3f6ea7e55e8228d6f84d5cee7cb11cae917ba95 from qemu
2018-03-03 21:35:30 -05:00
Yang Zhong d70c141675
tcg: move page_size_init() function
translate-all.c will be disabled if tcg is disabled in the build,
so the page_size_init() function and related variables will be moved
to the exec.c file.

Backports commit a0be0c585f5dcc4d50a37f6a20d3d625c5ef3a2c from qemu
2018-03-03 21:30:08 -05:00
Thomas Huth cf5d583ef0
cpu: Introduce a wrapper for tlb_flush() that can be used in common code
Commit 1f5c00cfdb8114c ("qom/cpu: move tlb_flush to cpu_common_reset")
moved the call to tlb_flush() from the target-specific reset handlers
into the common code qom/cpu.c file, and protected the call with
"#ifdef CONFIG_SOFTMMU" to avoid that it is called for linux-user
only targets. But since qom/cpu.c is common code, CONFIG_SOFTMMU is
*never* defined here, so the tlb_flush() was simply never executed
anymore. Fix it by introducing a wrapper for tlb_flush() in a file
that is re-compiled for each target, i.e. in translate-all.c.
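A sketch of such a wrapper; because translate-all.c is compiled per target,
CONFIG_SOFTMMU is meaningful there, unlike in common code. The function name
below follows the referenced commit but should be treated as illustrative:

    void tcg_flush_softmmu_tlb(CPUState *cs)
    {
    #ifdef CONFIG_SOFTMMU
        tlb_flush(cs);
    #endif
    }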

Backports commit 2cd53943115be5118b5b2d4b80ee0a39c94c4f73 from qemu
2018-03-03 21:24:55 -05:00
Paolo Bonzini f944cf4255
target/i386: simplify handling of conforming code segments on interrupt
Move the handling of conforming code segments before the handling
of stack switch.

Because dpl == cpl after the new "if", it's now unnecessary to check
the C bit when testing dpl < cpl. Furthermore, dpl > cpl is checked
slightly above the modified code, so the final "else" is unreachable
and we can remove it.
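Schematically, the reordered logic looks something like this (a sketch of
the control flow, not the exact code):

    /* Handle conforming code segments first, so dpl == cpl holds by the
     * time the stack-switch decision is made. */
    if (e2 & DESC_C_MASK) {
        dpl = cpl;              /* conforming: stay at current privilege */
    }
    if (dpl < cpl) {
        /* inner-privilege change: switch to the stack from the TSS */
    } else {
        /* same privilege: no stack switch */
    }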

Backports commit 1110bfe6f5600017258fa6578f9c17ec25b32277 from qemu
2018-03-03 21:19:48 -05:00
Wu Xiang a8de2d4748
target/i386: fix interrupt CPL error when using ist in x86-64
In do_interrupt64(), when the interrupt stack table (IST) is enabled
and the target code segment is conforming (e2 & DESC_C_MASK), the
old implementation always set the new CPL to 0, and SS.RPL to 0.

This is incorrect: when CPL3 code accesses a CPL0 conforming code
segment, the CPL should remain unchanged. Otherwise higher privileged
code can be compromised.

The patch fixes this by always setting dpl = cpl when the target code
segment is conforming, and by modifying the last parameter `flags`,
which contains the correct new CPL, in cpu_x86_load_seg_cache().

Backports commit e95e9b88ba5f4a6c17f4d0c3a3a6bf3f648bb328 from qemu
2018-03-03 21:18:22 -05:00
Lioncash 0ef338aa71
Fix building for multi-arch targets 2018-03-03 21:14:08 -05:00
Emilio G. Cota f66e74d65b
tcg: consistently access cpu->tb_jmp_cache atomically
Some code paths can lead to atomic accesses racing with memset()
on cpu->tb_jmp_cache, which can result in torn reads/writes
and is undefined behaviour in C11.

These torn accesses are unlikely to show up as bugs, but from code
inspection they seem possible. For example, tb_phys_invalidate does:
    /* remove the TB from the hash list */
    h = tb_jmp_cache_hash_func(tb->pc);
    CPU_FOREACH(cpu) {
        if (atomic_read(&cpu->tb_jmp_cache[h]) == tb) {
            atomic_set(&cpu->tb_jmp_cache[h], NULL);
        }
    }

Here atomic_set might race with a concurrent memset (such as the
ones scheduled via "unsafe" async work, e.g. tlb_flush_page) and
therefore we might end up with a torn pointer (or who knows what,
because we are under undefined behaviour).

This patch converts parallel accesses to cpu->tb_jmp_cache to use
atomic primitives, thereby bringing these accesses back to defined
behaviour. The price to pay is to potentially execute more instructions
when clearing cpu->tb_jmp_cache, but given how infrequently they happen
and the small size of the cache, the performance impact I have measured
is within noise range when booting debian-arm.

Note that under "safe async" work (e.g. do_tb_flush) we could use memset
because no other vcpus are running. However I'm keeping these accesses
atomic as well to keep things simple and to avoid confusing analysis
tools such as ThreadSanitizer.

Backports commit f3ced3c59287dabc253f83f0c70aa4934470c15e from qemu
2018-03-03 21:12:36 -05:00
Emilio G. Cota 1a4e5da043
gen-icount: use tcg_ctx.tcg_env instead of cpu_env
We are relying on cpu_env being defined as a global, yet most
targets (i.e. all but arm/a64) have it defined as a local variable.
Luckily all of them use the same "cpu_env" name, but really
compilation shouldn't break if the name of that local variable
changed.

Fix it by using tcg_ctx.tcg_env, which all targets set in their
translate_init function. This change also helps paving the way
for the upcoming "translation loop common to all targets" work.

Backports commit 53f6672bcf57d82b794a2cc3a3469be7d35c8653 from qemu
2018-03-03 21:08:58 -05:00
Laurent Vivier 8a7f7242cc
target/m68k: add fmovem
Backports commit a1e58ddcb3eed7ec4a158512b9dae46f90492c1b from qemu
2018-03-03 21:05:56 -05:00
Laurent Vivier 50b639098c
target/m68k: add explicit single and double precision operations (part 2)
Add fsabs, fdabs, fsneg, fdneg, fsmove and fdmove.

The value is converted using the new floatx80_round() function.

Backports commit 77bdb2292492fafc4bc0fbb4d8c44fdd0ef1fa8e from qemu
2018-03-03 21:02:52 -05:00
Laurent Vivier 1d5e30f30c
target/m68k: add fsglmul and fsgldiv
fsglmul and fsgldiv truncate data to single precision before computing
results.

Backports commit 2f77995cebc8027851b8ea8f02c097fb8cdf668a from qemu
2018-03-03 20:59:20 -05:00
Laurent Vivier 4e8e8572c3
softfloat: define floatx80_round()
Add a function to round a floatx80 to the defined precision
(floatx80_rounding_precision)

Backports commit 0f72129281765ed64d26353284059f2bdcde7a23 from qemu
2018-03-03 20:57:27 -05:00
Laurent Vivier 20b610390d
target/m68k: add explicit single and double precision operations
Add fssqrt, fdsqrt, fsadd, fdadd, fssub, fdsub, fsmul, fdmul,
fsdiv, fddiv.

The precision is managed using set_floatx80_rounding_precision().

Backports commit a51b6bc38bb9b73a40e9486b52be12c810c6f2d9 from qemu
2018-03-03 20:55:41 -05:00
Laurent Vivier 0b62df7f30
target/m68k: add fmovecr
fmovecr moves a floating point constant from the
FPU ROM to a floating point register.

Backports commit 9d403660d91229922c2786e81c23cc9dd8e644f1 from qemu
2018-03-03 20:51:21 -05:00
Laurent Vivier ed3e8ab460
target/m68k: add fscc.
use DisasCompare with FPU conditions in fscc and fbcc.

Backports commit dd337bf86214e2436833d9442c995df95b136190 from qemu
2018-03-03 20:43:08 -05:00
Greg Kurz a125b35f1f
qapi: add explicit null to string input and output visitors
This may be used for deprecated object properties that are kept for
backwards compatibility.

Backports commit a733371214b68881d84725a3c71f60e2faf3b8e2 from qemu
2018-03-03 20:32:50 -05:00
KONRAD Frederic 18020c2c79
cputlb: cleanup get_page_addr_code to use VICTIM_TLB_HIT
This replaces the env1 and page_index variables with env and index
so we can use the VICTIM_TLB_HIT macro later.

Backports commit 3416343255cbe01fbe12e5e36cd4bb5042425b27 from qemu
2018-03-03 19:54:13 -05:00
Laurent Vivier f7ef6b49a8
target-m68k: add FPCR and FPSR
Backports commit ba62494483ab51ee31c70952b6ce5171a31860b1 from qemu
2018-03-03 19:51:31 -05:00
Laurent Vivier 1c6b1e2b9f
target-m68k: use floatx80 internally
Coldfire uses float64, but 680x0 use floatx80.
This patch introduces the use of floatx80 internally
and enables the 680x0 80-bit FPU.

Backports commit f83311e4764f1f25a8abdec2b32c64483be1759b from qemu
2018-03-03 19:35:17 -05:00
Laurent Vivier 92555a1134
target-m68k: initialize FPU registers
On reset, set FP registers to NaN and control registers to 0.

Backports commit f4a6ce5155aab2a7ed7b9032a72187b37b3bfffe from qemu
2018-03-03 18:51:37 -05:00
Laurent Vivier d92621522a
target-m68k: move fmove CR to a function
Move code of fmove to/from control register to a function

Backports commit 860b9ac779615fe9315cd58165652052ac165a92 from qemu
2018-03-03 18:49:49 -05:00
Marc-André Lureau ca25248ecd
object: add uint property setter/getter
Backports commit 3152779cd63ba41331ef41659406f65b03e7911a from qemu
2018-03-03 18:43:17 -05:00
Marc-André Lureau fef464c4cb
qapi: update the qobject visitor to use QNUM_U64
Switch to use QNum/uint where appropriate to remove i64 limitation.

The input visitor will cast i64 input to u64 for compatibility
reasons (existing json QMP clients already use negative i64 for large
u64, and expect an implicit cast in qemu).

Note: before the patch, uint64_t values above INT64_MAX are sent over
json QMP as negative values, e.g. UINT64_MAX is sent as -1. After the
patch, they are sent unmodified. Clearly a bug fix, but we have to
consider compatibility issues anyway. libvirt should cope fine,
because its parsing of unsigned integers accepts negative values
modulo 2^64. There's hope that other clients will, too.

Backports commit 5923f85fb82df7c8c60a89458a5ae856045e5ab1 from qemu
2018-03-03 18:40:51 -05:00
Marc-André Lureau 6ca6050206
qnum: add uint type
In order to store integer values between INT64_MAX and UINT64_MAX, add
a uint64_t internal representation.
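Conceptually, the internal representation grows a third variant, along
these lines (a sketch; names mirror the description but details are
illustrative):

    typedef enum {
        QNUM_I64,
        QNUM_U64,
        QNUM_DOUBLE,
    } QNumKind;

    struct QNum {
        QObject base;
        QNumKind kind;
        union {
            int64_t i64;
            uint64_t u64;
            double dbl;
        } u;
    };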

Backports commit 61a8f418b26a2d974e38e4ae55020aca8d402d88 from qemu
2018-03-03 18:37:56 -05:00
Marc-André Lureau a57d8a5b50
qapi: Remove visit_start_alternate() parameter promote_int
Before the previous commit, parameter promote_int = true made
visit_start_alternate() with an input visitor avoid QTYPE_QINT
variants and create QTYPE_QFLOAT variants instead. This was used
where QTYPE_QINT variants were invalid.

The previous commit fused QTYPE_QINT with QTYPE_QFLOAT, rendering
promote_int useless and unused.

Backports commit 60390d2dc85ffade8981ca41e02335cb07353a6d from qemu
2018-03-03 18:34:35 -05:00
Lioncash a6623ce754
qapi: Update scripts to commit 01b2ffcedd94ad7b42bc870e4c6936c87ad03429 2018-03-03 18:32:12 -05:00
Marc-André Lureau dd77730d49
qapi: merge QInt and QFloat in QNum
We would like to use a same QObject type to represent numbers, whether
they are int, uint, or floats. Getters will allow some compatibility
between the various types if the number fits other representations.

Add a few more tests while at it.

Backports commit 01b2ffcedd94ad7b42bc870e4c6936c87ad03429 from qemu
2018-03-03 18:16:28 -05:00
Marc-André Lureau f1dbfe6be6
qapi: Clean up qobject_input_type_number() control flow
Use the more common pattern to error out.

Backports commit 58634047b7deeab36e4b07c4744e44d698975561 from qemu
2018-03-03 17:40:45 -05:00
Markus Armbruster d70f3bfc6b
qobject-input-visitor: Document full_name_nth()
Backports commit 6c02258e143700314ebf268dae47eb23db17d1cf from qemu
2018-03-03 17:39:09 -05:00
Markus Armbruster 0d433af617
qobject-input-visitor: Catch misuse of end_struct vs. end_list
Backports commit 8b2e41d733850ec6a67a85743138e023cbb8921b from qemu
2018-03-03 17:38:16 -05:00
Markus Armbruster e9174563be
qapi: Document intended use of @name within alternate visits
Backports commit ed0ba0f47e8cb6d924db0a54090bbb7b095fe9ea from qemu
2018-03-03 17:37:12 -05:00
Markus Armbruster 5ab0d5af81
qapi: New QAPI_CLONE_MEMBERS()
QAPI_CLONE() returns a newly allocated QAPI object. Inconvenient when
we want to clone into an existing object. QAPI_CLONE_MEMBERS() does
exactly that.

Backports commit 4626a19c86c30d96cedbac2bd44ef8103303cb37 from qemu
2018-03-03 17:36:02 -05:00
Eric Blake 734778da93
qobject: Add helper macros for common scalar insertions
Rather than making lots of callers wrap a scalar in a QInt, QString,
or QBool, provide helper macros that do the wrapping automatically.

Update the Coccinelle script to make mass conversions easy, although
the conversion itself will be done as a separate patches to ease
review and backport efforts.

Backports commit a92c21591b5bb9543996538f14854ca6b528318b from qemu
2018-03-03 17:33:30 -05:00
Markus Armbruster 09efe97bfd
qapi: Fix string input visitor regression for empty lists
Visiting a list when input is the empty string should result in an
empty list, not an error. Noticed when commit 3d089ce belatedly added
tests, but simply accepted as weird then. It's actually a regression:
broken in commit 74f24cb, v2.7.0. Fix it, and throw in another test
case for empty string.

Backports commit d2788227c6185c72d88ef3127e9fed41686f8e39 from qemu
2018-03-03 17:30:42 -05:00
Markus Armbruster 247a511c4a
qapi: Factor out common part of qobject input visitor creation
Backports commit abe81bc21a6996c62e66ed2d051373c0df24f870 from qemu
2018-03-03 17:26:27 -05:00
Marc-André Lureau c4e0911f95
object: fix potential leak in getters
If the property is not of the requested type, the getters will leak a
QObject.

Backports commit 560f19f162529d691619ac69ed032321c7f5f1fb from qemu
2018-03-03 17:22:32 -05:00
Richard Henderson 42bb73fa96
target/arm: Exit after clearing aarch64 interrupt mask
Exit to cpu loop so we reevaluate cpu_arm_hw_interrupts.

Backports commit 8da54b2507c1cabf60c2de904cf0383b23239231 from qemu
2018-03-03 17:19:40 -05:00
Richard Henderson dd1473f582
tcg: Increase hit rate of lookup_tb_ptr
We can call tb_htable_lookup even when the tb_jmp_cache is completely
empty. Therefore, un-nest most of the code dependent on tb != NULL
from the read from the cache.

This improves the hit rate of lookup_tb_ptr; for instance, when booting
and immediately shutting down debian-arm, the hit rate improves from
93.2% to 99.4%.

Backports commit b97a879de980e99452063851597edb98e7e8039c from qemu
2018-03-03 17:16:23 -05:00
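
The shape of the change can be pictured with a standalone two-level lookup
sketch: read the small direct-mapped cache first and, on any miss (including
a completely empty cache), still fall back to the full table and refill the
cache entry. Identifiers here are illustrative, not the real
tb_jmp_cache/tb_htable code:

    #include <stddef.h>
    #include <stdint.h>

    #define JMP_CACHE_SIZE (1u << 12)          /* assumed size, power of two */

    typedef struct TB { uintptr_t pc; /* ... host code ptr, flags ... */ } TB;

    static TB *jmp_cache[JMP_CACHE_SIZE];

    /* Stand-in for the full hash-table lookup (the slow path). */
    static TB *full_table_lookup(uintptr_t pc) { (void)pc; return NULL; }

    static TB *lookup_tb(uintptr_t pc)
    {
        unsigned idx = (pc >> 2) & (JMP_CACHE_SIZE - 1);
        TB *tb = jmp_cache[idx];

        if (tb == NULL || tb->pc != pc) {
            /* Miss: do the full lookup anyway, and refill the cache slot
             * on success so the next lookup for this pc is a hit. */
            tb = full_table_lookup(pc);
            if (tb != NULL) {
                jmp_cache[idx] = tb;
            }
        }
        return tb;
    }

    int main(void) { return lookup_tb(0x1000) != NULL; }
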
Richard Henderson 9ec975448b
tcg/arm: Use ldr (literal) for goto_tb
The new placement of the TB means that we can use one insn
to load the goto_tb destination directly from the TB.

Backports commit 308714e6bc945389c64faf1b9213e2c0d3f03391 from qemu
2018-03-03 17:14:27 -05:00
Richard Henderson c99edca63b
tcg/arm: Try pc-relative addresses for movi
Backports commit 9c39b94f1448770e7e573e9516d2483816785d1b from qemu
2018-03-03 17:13:31 -05:00
Richard Henderson a5133ccaa1
tcg/arm: Remove limit on code buffer size
Since we're no longer using a direct branch, we have no
limit on the branch distance.

Backports commit acb0b292b6d0f49972dc98f742e79ed53973e438 from qemu
2018-03-03 17:11:47 -05:00
Richard Henderson 68275ba6f3
tcg/arm: Use indirect branch for goto_tb
Backports commit 3fb53fb4d12f2e7833bd1659e6013237b130ef20 from qemu
2018-03-03 17:11:18 -05:00
Richard Henderson 9a85cb0a26
tcg/aarch64: Use ADR in tcg_out_movi
The new placement of the TB means that we can use one insn
to load the return value for exit_tb returning the TB pointer.

Backports commit cc74d332ff9a78684374847375ef63fc4bd10436 from qemu
2018-03-03 17:09:42 -05:00
Emilio G. Cota f50e6cfa11
translate-all: consolidate tb init in tb_gen_code
We are partially initializing tb in tb_alloc. Instead, fully
initialize it in tb_gen_code, which is tb_alloc's only caller.

This saves an unnecessary write to tb->cflags.

Backports commit 2b48e10f888059a98043b4816769fa2a326a1d2c from qemu
2018-03-03 17:08:21 -05:00
Emilio G. Cota d3ada2feb5
tcg: allocate TB structs before the corresponding translated code
Allocating an arbitrarily-sized array of tbs results in either
(a) a lot of memory wasted or (b) unnecessary flushes of the code
cache when we run out of TB structs in the array.

An obvious solution would be to just malloc a TB struct when needed,
and keep the TB array as an array of pointers (recall that tb_find_pc()
needs the TB array to run in O(log n)).

Perhaps a better solution, which is implemented in this patch, is to
allocate TBs right before the translated code they describe. This
results in some memory waste due to padding to have code and TBs in
separate cache lines--for instance, I measured 4.7% of padding in the
used portion of code_gen_buffer when booting aarch64 Linux on a
host with 64-byte cache lines. However, it can allow for optimizations
in some host architectures, since TCG backends could safely assume that
the TB and the corresponding translated code are very close to each
other in memory. See this message by rth for a detailed explanation:

https://lists.gnu.org/archive/html/qemu-devel/2017-03/msg05172.html
Subject: Re: GSoC 2017 Proposal: TCG performance enhancements

Backports commit 6e3b2bfd6af488a896f7936e99ef160f8f37e6f2 from qemu
2018-03-03 17:05:49 -05:00
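
A standalone sketch of the layout this commit describes: carve the TB header
out of the code buffer itself, then start the translated code on the next
cache-line boundary so the header and its code stay adjacent. The constants
and names are illustrative:

    #include <stddef.h>
    #include <stdint.h>
    #include <stdio.h>

    #define CODE_ALIGN 64    /* assumed host cache-line size */
    #define ALIGN_UP(v, a) (((v) + ((a) - 1)) & ~(uintptr_t)((a) - 1))

    typedef struct TB { uint64_t pc; uint32_t cflags; } TB;

    static uint8_t code_buffer[1 << 20];
    static uint8_t *code_ptr = code_buffer;    /* simple bump allocator */

    /* Allocate a TB header immediately before the code it will describe. */
    static TB *tb_alloc_with_code(void **code_out)
    {
        uintptr_t p = ALIGN_UP((uintptr_t)code_ptr, _Alignof(TB));
        uintptr_t code = ALIGN_UP(p + sizeof(TB), CODE_ALIGN);

        *code_out = (void *)code;
        code_ptr = (uint8_t *)code;            /* translation writes from here */
        return (TB *)p;
    }

    int main(void)
    {
        void *code;
        TB *tb = tb_alloc_with_code(&code);
        printf("padding between TB and code: %zu bytes\n",
               (size_t)((uint8_t *)code - (uint8_t *)(tb + 1)));
        return 0;
    }
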
Emilio G. Cota 8e58c67968
util: add cacheinfo
Add helpers to gather cache info from the host at init-time.

For now, only export the host's I/D cache line sizes, which we
will use to improve cache locality to avoid false sharing.

Backports commit b255b2c8a5484742606e8760870ba3e14d0c9605 from qemu
2018-03-03 16:58:28 -05:00
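
On Linux, one way to obtain those line sizes at init time is sysconf(); a
minimal sketch only (the real util/cacheinfo.c also covers Windows/macOS and
has more fallbacks):

    #include <stdio.h>
    #include <unistd.h>

    int main(void)
    {
        /* glibc-specific sysconf names; other platforms need other probes. */
        long dline = sysconf(_SC_LEVEL1_DCACHE_LINESIZE);
        long iline = sysconf(_SC_LEVEL1_ICACHE_LINESIZE);

        /* Fall back to a common default if the kernel/libc reports 0 or -1. */
        if (dline <= 0) dline = 64;
        if (iline <= 0) iline = dline;

        printf("dcache line: %ld, icache line: %ld\n", dline, iline);
        return 0;
    }
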
Laurent Vivier da4d407317
target-m68k: define ext_opsize
Backports commit 69e698220f68a17ce9584b068f68ed09e527a6ad from qemu
2018-03-03 15:05:55 -05:00
Laurent Vivier 409369a7ce
target-m68k: move FPU helpers to fpu_helper.c
Backports commit c88f8107b14456d514b00571b0675cb532e82cad from qemu
2018-03-03 15:04:05 -05:00
Laurent Vivier 199c62ea01
softfloat: define 680x0 specific values
Backports commit e5b0cbe8e8744b57faf0c62d023525cd466f5ab8 from qemu
2018-03-03 15:01:16 -05:00
Laurent Vivier 68c9ab9b77
target/m68k: fix V flag for CC_OP_SUBx
V flag for subtraction is:

v = (res ^ src1) & (src1 ^ src2)

(see COMPUTE_CCR() in target/m68k/helper.c)

But gen_flush_flags() uses:

v = (res ^ src2) & (src1 ^ src2)

The problem has been found with the following program:

.global _start
_start:
    move.l  #-2147483648,%d0
    subq.l  #1,%d0
    jvc     1f
    move.l  #1,%d1
    move.l  #1,%d0
    trap    #0
1:
    move.l  #0,%d1
    move.l  #1,%d0
    trap    #0

It works fine (exit(1)) on real hardware, and with "-singlestep".

"-singlestep" uses gen_helper_flush_flags(), whereas
without "-singlestep", V flag is computed directly in
gen_flush_flags().

This patch updates gen_flush_flags() to have the same result
as with gen_helper_flush_flags().

Backports commit 043b936ef6fe53396b3c6b8f5562ea3e238a071d from qemu
2018-03-03 14:59:20 -05:00
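
The two formulas can be compared directly on the failing case from the test
program above (0x80000000 - 1), taking src1 as the minuend and src2 as the
subtrahend; a standalone C check:

    #include <stdint.h>
    #include <stdio.h>

    int main(void)
    {
        uint32_t src1 = 0x80000000u;      /* d0 before subq.l #1 */
        uint32_t src2 = 1;
        uint32_t res  = src1 - src2;      /* 0x7fffffff */

        /* V is the sign bit of these expressions. */
        uint32_t v_ok  = (res ^ src1) & (src1 ^ src2);  /* correct (helper.c)        */
        uint32_t v_bad = (res ^ src2) & (src1 ^ src2);  /* old gen_flush_flags() bug */

        printf("correct V = %u, broken V = %u\n", v_ok >> 31, v_bad >> 31);
        /* prints: correct V = 1, broken V = 0 */
        return 0;
    }
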
Mihail Abakumov e1c2fac129
i386: fix read/write cr with icount option
Running Windows with icount causes a crash when an instruction writes to
a control register. This patch fixes it.

Reading and writing CR trigger an icount read because the
cpu_get_apic_tpr and cpu_set_apic_tpr functions are called, so
gen_io_start()/gen_io_end() calls are needed.

Backports commit 5b003a40bb1ab14d0398e91f03393d3c6b9577cd from qemu
2018-03-03 14:56:18 -05:00
Paolo Bonzini 741ff79e23
target/i386: use multiple CPU AddressSpaces
This speeds up SMM switches. Later on it may remove the need to take
the BQL, and it may also allow reusing code between TCG and KVM.

Backports commit f8c45c6550b9ff1e1f0b92709ff3213a79870879 from qemu
2018-03-03 14:53:47 -05:00
Paolo Bonzini 710f393c13
target/i386: enable A20 automatically in system management mode
Ignore env->a20_mask when running in system management mode.

Backports commit c8bc83a4dd29a9a33f5be81686bfe6e2e628097b from qemu
2018-03-03 14:33:09 -05:00
Peter Xu fb8d3e2f6a
exec: simplify phys_page_find() params
It really only plays with the dispatchers, so the parameter list does
not need that complexity. This helps readability at least.

Backports commit 003a0cf2cd1828a1141a874428571267b117f765 from qemu
2018-03-03 14:28:25 -05:00
Laurent Vivier ce25609ed3
target/m68k: implement rtd
Add "Return and Deallocate" (rtd) instruction.

RTD #d

(SP) -> PC
SP + 4 + d -> SP

Backports commit 18059c9e1648bf4fc5c7c1bae6f54690742b05ba from qemu
2018-03-03 14:27:01 -05:00
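
Purely as an illustration of the notation above, the same semantics in
standalone C (the extra displacement is what lets a callee pop its
caller-pushed arguments in one instruction); values are made up:

    #include <stdint.h>
    #include <stdio.h>

    int main(void)
    {
        uint32_t return_addr = 0x00401000;   /* the value sitting at (SP)   */
        uint32_t sp = 0x00ff0000;            /* stack pointer before rtd    */
        uint32_t d  = 8;                     /* displacement from "rtd #8"  */

        uint32_t pc = return_addr;           /* (SP) -> PC                  */
        sp = sp + 4 + d;                     /* SP + 4 + d -> SP            */

        printf("PC = 0x%08x, new SP = 0x%08x\n", pc, sp);
        return 0;
    }
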
Aurelien Jarno 2c49a6b2f6
target/mips: optimize indirect branches
Backports commit e350d8ca3ac7e31c6af71a4ab74d2442dfefc697 from qemu
2018-03-03 14:23:58 -05:00
Aurelien Jarno 8ce8d4fe20
target/mips: optimize cross-page direct jumps in softmmu
Backports commit d9a9acde64b862107933f9e9a01435e51bf8f91b from qemu
2018-03-03 14:23:25 -05:00
Emilio G. Cota baa0983ae3
target/aarch64: optimize indirect branches
Measurements:

[Baseline performance is that before applying this and the previous commit]

- NBench, aarch64-softmmu. Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'cross' and
'cross+jr' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/qO9ubtk
NB. cross here represents the previous commit.

- SPECint06 (test set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr'
configuration, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/3Dp4vvq

- SPECint06 (train set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr'
configuration, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/vRrdc9j

Backports commit e75449a346bf558296966a44277bfd93412c6da6 from qemu
2018-03-03 14:22:12 -05:00
Emilio G. Cota 83ea5b72f2
target/aarch64: optimize cross-page direct jumps in softmmu
Perf numbers in next commit's log.

Backports commit e78722368c721f3c5b8109ed525adac1653ae97b from qemu
2018-03-03 14:20:55 -05:00
Aurelien Jarno 0e9d3d1943
tcg/mips: implement goto_ptr
Backports commit 5786e0683c4f8170dd05a550814b8809d8ae6d86 from qemu
2018-03-03 14:19:46 -05:00
Richard Henderson 1d6c4f1a42
tcg/arm: Implement goto_ptr
Backports commit 085c648bef7301eabe7d4a3301c8d012ae4423b8 from qemu
2018-03-03 14:18:41 -05:00
Richard Henderson 3b02642372
tcg/arm: Clarify tcg_out_bx for arm4 host
In theory this would re-enable usage of QEMU on an armv4 host.
Whether this is worthwhile is debatable -- we've been unconditionally
issuing the armv5t BX instruction in the prologue since 2011 without
complaint. Possibly we should simply require an armv6 host.

Backports commit 702a947484eb3e615183dafc93de590ab0679f60 from qemu
2018-03-03 14:17:13 -05:00
Richard Henderson d496bb6150
tcg/s390: Implement goto_ptr
Backports commit 46644483cae978c734460131bb1d9071f813b287 from qemu
2018-03-03 14:16:03 -05:00
Richard Henderson f0420c3427
tcg/sparc: Implement goto_ptr
Backports commit 38f81dc5938fb7025531c5ed602afd41fef799a7 from qemu
2018-03-03 14:14:32 -05:00
Richard Henderson 81f1aae572
tcg/aarch64: Implement goto_ptr
Measurements:

SPECint06 (test set), x86_64-linux-user. Host: APM 64-bit ARMv8 (Atlas/A57) @ 2.4 GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the '+goto-ptr'
configuration, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/en9HE8L

Backports commit b19f0c2e7d344d4d62daf554951acdb6c94a34b0 from qemu
2018-03-03 14:13:09 -05:00
Emilio G. Cota 7d0440dec4
tb-hash: improve tb_jmp_cache hash function in user mode
Optimizations to cross-page chaining and indirect branches make
performance more sensitive to the hit rate of tb_jmp_cache.
The constraint of reserving some bits for the page number
lowers the achievable quality of the hashing function.

However, user-mode does not have this requirement. Thus, with this
change user-mode uses a hashing function that is both faster and of
better quality than the previous one.

Measurements:

Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0.

- SPECint06 (test set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr',
'jr+multhash' and 'jr+hash' configurations, plus hmean; the full plot is at
the png link below.)
png: http://imgur.com/4UXTrEc

Here I also tried the hash function suggested by Paolo ("multhash"):

return ((uint64_t) (pc * 2654435761) >> 32) & (TB_JMP_CACHE_SIZE - 1);

As you can see it is just as good as the other new function ("hash"),
which is what I ended up going with.

- SPECint06 (train set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr' and
'jr+hash' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/ArCbHqo

- NBench, x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr' and
'jr+hash' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/ZXFX0hJ

- NBench, arm-linux-user. Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'jr' and
'jr+hash' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/FfD27ey

Backports commit 6f1653180f5701c6a8f1b35b89a80b1e3260928e from qemu
2018-03-03 14:11:29 -05:00
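
The "multhash" variant quoted in the message (not the function that was
ultimately chosen) is easy to reproduce standalone; the cache size below is
assumed. Nearby PCs land in distinct, spread-out slots rather than in
page-constrained ones:

    #include <inttypes.h>
    #include <stdio.h>

    #define TB_JMP_CACHE_SIZE (1u << 12)     /* assumed size, power of two */

    /* Multiplicative hash: spread the PC bits, keep the high ones. */
    static unsigned tb_jmp_cache_hash(uint64_t pc)
    {
        return (unsigned)((pc * 2654435761u) >> 32) & (TB_JMP_CACHE_SIZE - 1);
    }

    int main(void)
    {
        for (uint64_t pc = 0x400000; pc < 0x400020; pc += 4) {
            printf("pc=0x%" PRIx64 " -> slot %u\n", pc, tb_jmp_cache_hash(pc));
        }
        return 0;
    }
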
Emilio G. Cota 2d16da435e
target/i386: optimize indirect branches
Speed up indirect branches by jumping to the target if it is valid.

Softmmu measurements (see later commit for user-mode numbers):

Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0.

- SPECint06 (test set), x86_64-softmmu (Ubuntu 16.04 guest). Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'cross' and
'cross+jr' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/DU36YFU

NB. 'cross' represents the previous commit.

Backports commit b4aa297781ceddef79deb0e99da7817551fa89f8 from qemu
2018-03-03 14:10:14 -05:00
Emilio G. Cota 3895eea3b4
target/i386: optimize cross-page direct jumps in softmmu
Instead of unconditionally exiting to the exec loop, use the
gen_jr helper to jump to the target if it is valid.

Perf impact: see next commit's log.

Backports commit fe62089563ffc6a42f16ff28a6b6be34d2697766 from qemu
2018-03-03 14:08:27 -05:00
Emilio G. Cota baa017d29b
target/i386: introduce gen_jr helper to generate lookup_and_goto_ptr
This helper will be used by subsequent changes.

Backports commit 1ebb1af1b8068fca36f48f738eb7146ecdf03625 from qemu
2018-03-03 14:06:05 -05:00
Emilio G. Cota 9aaad9ed27
target/arm: optimize indirect branches
Speed up indirect branches by jumping to the target if it is valid.

Softmmu measurements (see later commit for user-mode results):

Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0.

- Impact on Boot time

| setup | ARM debian jessie boot+shutdown time | stddev |
|--------+--------------------------------------+--------|
| v2.9.0 | 8.84 | 0.07 |
| +cross | 8.85 | 0.03 |
| +jr | 8.83 | 0.06 |

- NBench, arm-softmmu (debian jessie guest). Host: Intel i7-4790K @ 4.00GHz

(ASCII gnuplot chart elided: per-benchmark speedup for the 'cross' and
'cross+jr' configurations, plus hmean; the full plot is at the png link below.)
png: http://imgur.com/eOLmZNR

NB. 'cross' represents the previous commit.

Backports commit 8a6b28c7b5104263344508df0f4bce97f22cfcaf from qemu
2018-03-02 21:18:15 -05:00
Emilio G. Cota 5a42602b92
target/arm: optimize cross-page direct jumps in softmmu
Instead of unconditionally exiting to the exec loop, use the
lookup_and_goto_ptr helper to jump to the target if it is valid.

Perf impact: see next commit's log.

Backports commit 7ad55b4ffd982c80f26f7f3658138d94cdc678e8 from qemu
2018-03-02 21:09:44 -05:00
Emilio G. Cota e4dfb7f807
tcg/i386: implement goto_ptr
Backports commit 5cb4ef80f65252dd85b86fa7f3c985015423d670 from qemu
2018-03-02 21:08:38 -05:00
Emilio G. Cota 8f4f15e5f5
tcg: Introduce goto_ptr opcode and tcg_gen_lookup_and_goto_ptr
Instead of exporting goto_ptr directly to TCG frontends, export
tcg_gen_lookup_and_goto_ptr(), which calls goto_ptr with the pointer
returned by the lookup_tb_ptr() helper. This is the only use case
we have for goto_ptr and lookup_tb_ptr, so having this function is
very convenient. Furthermore, it trivially allows us to avoid calling
the lookup helper if goto_ptr is not implemented by the backend.

Backports commit cedbcb01529cb6cf9a2289cdbebbc63f6149fc18 from qemu
2018-03-02 21:05:18 -05:00
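
A standalone model of the control flow goto_ptr enables: each "block" hands
back the next guest PC, and the dispatcher either continues straight into the
next looked-up block or, when the lookup fails, exits the way generated code
would fall back to the epilogue. Function pointers stand in for generated
code here; none of this is real TCG output:

    #include <stdio.h>

    typedef unsigned (*block_fn)(unsigned pc);

    static unsigned block_a(unsigned pc) { return pc + 4; }      /* at 0x1000 */
    static unsigned block_b(unsigned pc) { (void)pc; return 0; } /* at 0x1004 */

    /* Stand-in for the lookup helper: guest PC -> translated code, or NULL. */
    static block_fn lookup_tb_ptr(unsigned pc)
    {
        if (pc == 0x1000) return block_a;
        if (pc == 0x1004) return block_b;
        return NULL;
    }

    int main(void)
    {
        unsigned pc = 0x1000;
        for (;;) {
            block_fn fn = lookup_tb_ptr(pc);
            if (fn == NULL) {
                printf("no translation for 0x%x: exit to the exec loop\n", pc);
                break;                       /* the goto_ptr fallback path */
            }
            pc = fn(pc);                     /* chain directly into the block */
            if (pc == 0) {
                printf("guest halted\n");
                break;
            }
        }
        return 0;
    }
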
Richard Henderson 23d8f5fba2
qemu/atomic: Loosen restrictions for 64-bit ILP32 hosts
We need to coordinate with the TCG_OVERSIZED_GUEST test in cputlb.c,
and allow 64-bit atomics even though sizeof(void *) == 4.

Backports commit 374aae653499f4d405caf32b7fff0c8639113fe4 from qemu
2018-03-02 20:06:39 -05:00
Luc MICHEL 393019de26
target/arm: add data cache invalidation cp15 instruction to cortex-r5
The cp15 instruction with CRn=15, opc1=0, CRm=5, opc2=0 invalidates the
entire data cache on the Cortex-R5. Implement it as a NOP.

Backports commit 95e9a242e2a393c7d4e5cc04340e39c3a9420f03 from qemu
2018-03-02 20:04:20 -05:00
Peter Maydell 565626ca63
armv7m: Raise correct kind of UsageFault for attempts to execute ARM code
M profile doesn't implement the ARM (A32) instruction set, and the
architecturally required behaviour for attempts to execute with the
Thumb bit clear is to generate a UsageFault with the CFSR INVSTATE bit
set. We were
incorrectly implementing this as generating an UNDEFINSTR UsageFault;
fix this.

Backports commit e13886e3a790b52f0b2e93cb5e84fdc2ada5471a from qemu
2018-03-02 20:00:58 -05:00
Peter Maydell fbfeca93b3
armv7m: Check exception return consistency
Implement the exception return consistency checks
described in the v7M pseudocode ExceptionReturn().

Inspired by a patch from Michael Davidsaver's series, but
this is a reimplementation from scratch based on the
ARM ARM pseudocode.

Backports commit aa488fe3bb5460c6675800ccd80f6dccbbd70159 from qemu
2018-03-02 19:59:18 -05:00
Peter Maydell 0736054d6d
armv7m: Extract "exception taken" code into functions
Extract the code from the tail end of arm_v7m_do_interrupt() which
enters the exception handler into a pair of utility functions
v7m_exception_taken() and v7m_push_stack(), which correspond roughly
to the pseudocode PushStack() and ExceptionTaken().

This also requires us to move the arm_v7m_load_vector() utility
routine up so we can call it.

Handling illegal exception returns has some cases where we want to
take a UsageFault either on an existing stack frame or with a new
stack frame but with a specific LR value, so we want to be able to
call these without having to go via arm_v7m_cpu_do_interrupt().

Backports commit 39ae2474e337247e5930e8be783b689adc9f6215 from qemu
2018-03-02 19:54:46 -05:00
Michael Davidsaver 5b9f53bd27
armv7m: Simpler and faster exception start
All the places in armv7m_cpu_do_interrupt() which pend an
exception in the NVIC are doing so for synchronous
exceptions. We know that we will always take some
exception in this case, so we can just acknowledge it
immediately, rather than returning and then immediately
being called again because the NVIC has raised its outbound
IRQ line.

Backports commit a25dc805e2e63a55029e787a52335e12dabf07dc from qemu
2018-03-02 19:52:01 -05:00
Peter Maydell 43ba76cb28
armv7m: Fix condition check for taking exceptions
The M profile condition for when we can take a pending exception or
interrupt is not the same as that for A/R profile. The code
originally copied from the A/R profile version of the
cpu_exec_interrupt function only worked by chance for the
very simple case of exceptions being masked by PRIMASK.
Replace it with a call to a function in the NVIC code that
correctly compares the priority of the pending exception
against the current execution priority of the CPU.

Backports commit 7ecdaa4a9635f1ded0dfa9218c25273b6d4dcd44 from qemu
2018-03-02 19:50:05 -05:00
Peter Maydell 5470bd1763
armv7m: Remove unused armv7m_nvic_acknowledge_irq() return value
Having armv7m_nvic_acknowledge_irq() return the new value of
env->v7m.exception and its one caller assign the return value
back to env->v7m.exception is pointless. Just make the return
type void instead.

Backports commit a5d8235545e98c1ce02560d5f4f57552d937efe9 from qemu
2018-03-02 19:36:07 -05:00
Peter Maydell 50c956db7e
arm: Implement HFNMIENA support for M profile MPU
Implement HFNMIENA support for the M profile MPU. This bit controls
whether the MPU is treated as enabled when executing at execution
priorities of less than zero (in NMI, HardFault or with the FAULTMASK
bit set).

Doing this requires us to use a different MMU index for "running
at execution priority < 0", because we will have different
access permissions for that case versus the normal case.

Backports commit 3bef7012560a7f0ea27b265105de5090ba117514 from qemu
2018-03-02 19:33:24 -05:00
Michael Davidsaver 611a711f7b
arm: add MPU support to M profile CPUs
The M series MPU is almost the same as the already implemented R
profile MPU (v7 PMSA). So all we need to implement here is the MPU
register interface in the system register space.

This implementation has the same restriction as the R profile MPU
that it doesn't permit regions to be sized down smaller than 1K.

We also do not yet implement support for MPU_CTRL.HFNMIENA; when zero,
this bit should disable use of the MPU when running in HardFault, NMI,
or with FAULTMASK set to 1 (i.e. at an execution priority of less than
zero) -- if the MPU is enabled we don't yet treat these cases any
differently.

Backports commit 29c483a506070e8f554c77d22686f405e30b9114 from qemu
2018-03-02 19:30:20 -05:00
Michael Davidsaver 09d69209a0
armv7m: Classify faults as MemManage or BusFault
The general logic is that operations stopped by the MPU are MemManage
faults, and those which pass the MPU but are caught by the unassigned
access handler are BusFaults. Distinguish these by looking at the
exception.fsr values, and set the CFSR bits and (if appropriate)
fill in the BFAR or MMFAR with the exception address.

Backports commit 5dd0641d234e355597be62e5279d8a519c831625 from qemu
2018-03-02 19:28:21 -05:00
Peter Maydell 9bc3050c51
arm: All M profile cores are PMSA
All M profile CPUs are PMSA, so set the feature bit.
(We haven't actually implemented the M profile MPU register
interface yet, but setting this feature bit gives us closer
to correct behaviour for the MPU-disabled case.)

Backports commit 790a11503cfb5e1dcd031ea2212bbebae4ca3cec from qemu
2018-03-02 19:26:41 -05:00
Michael Davidsaver 4d8ae4a2b2
armv7m: Implement M profile default memory map
Add support for the M profile default memory map which is used
if the MPU is not present or disabled.

The main behavioural difference from implementing this correctly is
that we now set the PAGE_EXEC attribute on the right regions of memory,
so that device regions are not executable.

Backports commit 3a00d560bcfca7ad04327062c1986a016c104b1f from qemu
2018-03-02 19:25:02 -05:00
Michael Davidsaver 7c845dabe8
armv7m: Improve "-d mmu" tracing for PMSAv7 MPU
Improve the "-d mmu" tracing for the PMSAv7 MPU translation
process as an aid in debugging guest MPU configurations:
* fix a missing newline for a guest-error log
* report the region number with guest-error or unimp
logs of bad region register values
* add a log message for the overall result of the lookup
* print "0x" prefix for hex values

Backports commit c9f9f1246d630960bce45881e9c0d27b55be71e2 from qemu
2018-03-02 19:17:05 -05:00
Peter Maydell bfe99e9a0b
arm: Remove unnecessary check on cpu->pmsav7_dregion
Now that we enforce both:
* pmsav7_dregion == 0 implies has_mpu == false
* PMSA with has_mpu == false means SCTLR.M cannot be set
we can remove a check on pmsav7_dregion from get_phys_addr_pmsav7(),
because we can only reach this code path if the MPU is enabled
(and so region_translation_disabled() returned false).

Backports commit e9235c6983b261e04e897e8ff900b2b7a391e644 from qemu
2018-03-02 19:14:50 -05:00
Peter Maydell 349227bb05
arm: Don't let no-MPU PMSA cores write to SCTLR.M
If the CPU is a PMSA config with no MPU implemented, then the
SCTLR.M bit should be RAZ/WI, so that the guest can never
turn on the non-existent MPU.

Backports commit 06312febfb2d35367006ef23608ddd6a131214d4 from qemu
2018-03-02 19:13:37 -05:00
Peter Maydell e564ed6311
arm: Don't clear ARM_FEATURE_PMSA for no-mpu configs
Fix the handling of QOM properties for PMSA CPUs with no MPU:

Allow no-MPU to be specified by either:
* has-mpu = false
* pmsav7_dregion = 0
and make setting one imply the other. Don't clear the PMSA
feature bit in this situation.

Backports commit f50cd31413d8bc9d1eef8edd1f878324543bf65d from qemu
2018-03-02 19:12:20 -05:00
Peter Maydell 6614ba9615
arm: Clean up handling of no-MPU PMSA CPUs
ARM CPUs come in two flavours:
* proper MMU ("VMSA")
* only an MPU ("PMSA")
For PMSA, the MPU may be implemented, or not (in which case there
is default "always acts the same" behaviour, but it isn't guest
programmable).

QEMU is a bit confused about how we indicate this: we have an
ARM_FEATURE_MPU, but it's not clear whether this indicates
"PMSA, not VMSA" or "PMSA and MPU present" , and sometimes we
use it for one purpose and sometimes the other.

Currently trying to implement a PMSA-without-MPU core won't
work correctly because we turn off the ARM_FEATURE_MPU bit
and then a lot of things which should still exist get
turned off too.

As the first step in cleaning this up, rename the feature
bit to ARM_FEATURE_PMSA, which indicates a PMSA CPU (with
or without MPU).

Backports commit 452a095526a0537f16c271516a2200877a272ea8 from qemu
2018-03-02 19:05:31 -05:00
Peter Maydell b50d2da03c
arm: Use different ARMMMUIdx values for M profile
Make M profile use completely separate ARMMMUIdx values from
those that A profile CPUs use. This is a prelude to adding
support for the MPU and for v8M, which together will require
6 MMU indexes which don't map cleanly onto the A profile
uses:
non secure User
non secure Privileged
non secure Privileged, execution priority < 0
secure User
secure Privileged
secure Privileged, execution priority < 0

Backports commit e7b921c2d9efc249f99b9feb0e7dca82c96aa5c4 from qemu
2018-03-02 19:01:42 -05:00
Michael Davidsaver f532e80749
armv7m: Escalate exceptions to HardFault if necessary
The v7M exception architecture requires that if a synchronous
exception cannot be taken immediately (because it is disabled
or at too low a priority) then it should be escalated to
HardFault (and the HardFault exception is then taken).
Implement this escalation logic.

Backports commit a73c98e159d18155445d29b6044be6ad49fd802f from qemu
2018-03-02 18:59:13 -05:00
Peter Maydell b7bf752d3c
arm: Add support for M profile CPUs having different MMU index semantics
The M profile CPU's MPU has an awkward corner case which we
would like to implement with a different MMU index.

We can avoid having to bump the number of MMU modes ARM
uses, because some of our existing MMU indexes are only
used by non-M-profile CPUs, so we can borrow one.
To avoid that getting too confusing, clean up the code
to try to keep the two meanings of the index separate.

Instead of ARMMMUIdx enum values being identical to core QEMU
MMU index values, they are now the core index values with some
high bits set. Any particular CPU always uses the same high
bits (so eventually A profile cores and M profile cores will
use different bits). New functions arm_to_core_mmu_idx()
and core_to_arm_mmu_idx() convert between the two.

In general core index values are stored in 'int' types, and
ARM values are stored in ARMMMUIdx types.

Backports commit 8bd5c82030b2cb09d3eef6b444f1620911cc9fc5 from qemu
2018-03-02 18:59:13 -05:00
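
A sketch of the encoding scheme described above; the bit values below are
assumptions for illustration (the real constants and the full enum live in
target/arm/cpu.h):

    #include <assert.h>

    #define ARM_MMU_IDX_A            0x10   /* A-profile tag (assumed value) */
    #define ARM_MMU_IDX_M            0x40   /* M-profile tag (assumed value) */
    #define ARM_MMU_IDX_COREIDX_MASK 0x7    /* low bits = core QEMU index    */

    typedef enum ARMMMUIdx {
        ARMMMUIdx_S12NSE0 = 0 | ARM_MMU_IDX_A,   /* A-profile, user       */
        ARMMMUIdx_S12NSE1 = 1 | ARM_MMU_IDX_A,   /* A-profile, privileged */
        ARMMMUIdx_MUser   = 0 | ARM_MMU_IDX_M,   /* M-profile, user       */
        ARMMMUIdx_MPriv   = 1 | ARM_MMU_IDX_M,   /* M-profile, privileged */
    } ARMMMUIdx;

    static int arm_to_core_mmu_idx(ARMMMUIdx idx)
    {
        return idx & ARM_MMU_IDX_COREIDX_MASK;   /* strip the profile tag */
    }

    int main(void)
    {
        /* Different ARMMMUIdx values can share the same core MMU index. */
        assert(arm_to_core_mmu_idx(ARMMMUIdx_S12NSE1) ==
               arm_to_core_mmu_idx(ARMMMUIdx_MPriv));
        return 0;
    }
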
Wei Huang 19335c32c9
target/arm: clear PMUVER field of AA64DFR0 when vPMU=off
The PMUv3 driver of the Linux kernel (in arch/arm64/kernel/perf_event.c)
relies on the PMUVER field of id_aa64dfr0_el1 to decide whether PMU
support is present. This patch clears the PMUVER field under TCG mode
when vPMU=off. Without it, PMUv3 would be initialized inside guest VMs
even with vPMU=off. This patch also removes a redundant line inside the
if-statement.

Backports commit 2b3ffa929249b15a75d8bde3e8e57a744f52aff0 from qemu
2018-03-02 18:59:12 -05:00
Peter Maydell 4789e49c4d
arm: Use the mmu_idx we're passed in arm_cpu_do_unaligned_access()
When identifying the DFSR format for an alignment fault, use
the mmu index that we are passed, rather than calling cpu_mmu_index()
to get the mmu index for the current CPU state. This doesn't actually
make any difference since the only cases where the current MMU index
differs from the index used for the load are the "unprivileged
load/store" instructions, and in that case the mmu index may
differ but the translation regime is the same (apart from the
"use from Hyp mode" case which is UNPREDICTABLE).
However it's the more logical thing to do.

Backports commit e517d95b63427fae9f03958dbc005c36b4ebf2cf from qemu
2018-03-02 18:59:12 -05:00
Peter Xu fce1b469e5
memory: tune last param of iommu_ops.translate()
This patch converts the old "is_write" bool into IOMMUAccessFlags. The
difference is that "is_write" can only express read or write, but
sometimes what we really want here is "none" (neither read nor write).
Replay is a good example - during replay, we should not check any RW
permission bits since that's not an actual IO at all.

Backports commit bf55b7afce53718ef96f4e6616da62c0ccac37dd from qemu
2018-03-02 18:59:12 -05:00
Peter Xu 5621c7e09f
exec: abstract address_space_do_translate()
This function is an abstraction helper for address_space_translate() and
address_space_get_iotlb_entry(). It does the lookup of an address into a
memory region section, then does proper IOMMU translation if necessary.
Refactor the two existing functions to use it.

This fixes vhost when the IOMMU is disabled by the guest.

Backports commit a764040cc831cfe5b8bf1c80e8341b9bf2de3ce8 from qemu
2018-03-02 18:59:12 -05:00
Nikunj A Dadhania d907423bac
cputlb: handle first atomic write to the page
In the case where the conditional write is the first write to the page,
TLB_NOTDIRTY will be set and stop_the_world is triggered. Handle this as
a special case by setting the dirty bit; after that, fall through to the
actual atomic instruction below.

Backports commit 7f9af1abdcc69fd1d3d8d2be68464329600616d6 from qemu
2018-03-02 18:59:12 -05:00
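
A toy model of that special case (identifiers are illustrative, not the real
cputlb.c code): when the TLB entry still carries the not-dirty flag, set the
page's dirty bit and clear the flag, then continue into the atomic operation
instead of escalating to stop-the-world:

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define TLB_NOTDIRTY (1u << 0)    /* illustrative flag bit */

    typedef struct { uintptr_t addr_write; } TLBEntry;

    static bool page_dirty[16];

    static void atomic_write_prologue(TLBEntry *e, unsigned page)
    {
        if (e->addr_write & TLB_NOTDIRTY) {
            /* First write to this page: mark it dirty here rather than
             * taking the stop-the-world slow path. */
            page_dirty[page] = true;
            e->addr_write &= ~(uintptr_t)TLB_NOTDIRTY;
        }
        /* ...fall through to the actual atomic instruction... */
    }

    int main(void)
    {
        TLBEntry e = { .addr_write = TLB_NOTDIRTY };
        atomic_write_prologue(&e, 3);
        printf("page 3 dirty: %d, flag cleared: %d\n",
               page_dirty[3], !(e.addr_write & TLB_NOTDIRTY));
        return 0;
    }
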
Aurelien Jarno 00ebbae128
tcg/mips: fix field extraction opcode
The "msb" argument should correspond to (len - 1).

Backports commit 2f5a5f5774d95baacf86c03aa8a77a2d0390f2b2 from qemu
2018-03-02 18:59:12 -05:00
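
The off-by-one is easier to see next to the operation EXT performs: for a
field of len bits starting at bit pos, the encoded msb operand is len - 1. A
standalone illustration (extract32 here mirrors the usual helper of that
name):

    #include <inttypes.h>
    #include <stdio.h>

    /* Extract a 'len'-bit field starting at bit 'pos'. */
    static uint32_t extract32(uint32_t value, unsigned pos, unsigned len)
    {
        return (value >> pos) & ((1u << len) - 1);
    }

    int main(void)
    {
        uint32_t v = 0x12345678;
        unsigned pos = 8, len = 12;

        printf("field = 0x%" PRIx32 " (encoded msb operand would be %u)\n",
               extract32(v, pos, len), len - 1);
        return 0;
    }
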
Richard Henderson 69116abafc
tcg: Initialize return value after exit_atomic
Users of tcg_gen_atomic_cmpxchg and do_atomic_op rightfully utilize
the output. Even though this code is dead, it gets translated, and
without the initialization we encounter a tcg_error.

Backports commit 79b1af906245558c30e0a5faf26cb52b63f83cce from qemu
2018-03-02 18:59:11 -05:00
Gerd Hoffmann 108354cc4a
bitmap: add bitmap_copy_and_clear_atomic
Backports commit d6eb1413920affb7be3df9982682dd183a805dd7 from qemu
2018-03-02 18:59:11 -05:00
Peter Maydell b8b70dfcd2
Drop QEMU_GNUC_PREREQ() checks for gcc older than 4.1
We already require gcc 4.1 or newer (for the atomic
support), so the fallback codepaths for older gcc
versions than that are now dead code and we can
just delete them.

NB: clang reports itself as gcc 4.2 (regardless of
clang version), so clang won't be using the fallbacks
either.

Backports commit fa54abb8c298f892639ffc4bc2f61448ac3be4a1 from qemu
2018-03-02 18:59:05 -05:00
Peter Maydell 2935a9af7a
arm: Remove workarounds for old M-profile exception return implementation
Now that we've rewritten M-profile exception return so that the magic
PC values are not visible to other parts of QEMU, we can delete the
special casing of them elsewhere.

Backports commit f4e8e4edda875cab9df91dc4ae9767f7cb1f50aa from qemu
2018-03-02 15:02:14 -05:00
Peter Maydell 44bf8985e5
arm: Implement M profile exception return properly
On M profile, return from exceptions happen when code in Handler mode
executes one of the following function call return instructions:
* POP or LDM which loads the PC
* LDR to PC
* BX register
and the new PC value is 0xFFxxxxxx.

QEMU tries to implement this by not treating the instruction
specially but then catching the attempt to execute from the magic
address value. This is not ideal, because:
* there are guest visible differences from the architecturally
specified behaviour (for instance jumping to 0xFFxxxxxx via a
different instruction should not cause an exception return but it
will in the QEMU implementation)
* we have to account for it in various places (like refusing to take
an interrupt if the PC is at a magic value, and making sure that
the MPU doesn't deny execution at the magic value addresses)

Drop these hacks, and instead implement exception return the way the
architecture specifies -- by having the relevant instructions check
for the magic value and raise the 'do an exception return' QEMU
internal exception immediately.

The effect on the generated code is minor:

bx lr, old code (and new code for Thread mode):
TCG:
mov_i32 tmp5,r14
movi_i32 tmp6,$0xfffffffffffffffe
and_i32 pc,tmp5,tmp6
movi_i32 tmp6,$0x1
and_i32 tmp5,tmp5,tmp6
st_i32 tmp5,env,$0x218
exit_tb $0x0
set_label $L0
exit_tb $0x7f2aabd61993
x86_64 generated code:
0x7f2aabe87019: mov %ebx,%ebp
0x7f2aabe8701b: and $0xfffffffffffffffe,%ebp
0x7f2aabe8701e: mov %ebp,0x3c(%r14)
0x7f2aabe87022: and $0x1,%ebx
0x7f2aabe87025: mov %ebx,0x218(%r14)
0x7f2aabe8702c: xor %eax,%eax
0x7f2aabe8702e: jmpq 0x7f2aabe7c016

bx lr, new code when in Handler mode:
TCG:
mov_i32 tmp5,r14
movi_i32 tmp6,$0xfffffffffffffffe
and_i32 pc,tmp5,tmp6
movi_i32 tmp6,$0x1
and_i32 tmp5,tmp5,tmp6
st_i32 tmp5,env,$0x218
movi_i32 tmp5,$0xffffffffff000000
brcond_i32 pc,tmp5,geu,$L1
exit_tb $0x0
set_label $L1
movi_i32 tmp5,$0x8
call exception_internal,$0x0,$0,env,tmp5
x86_64 generated code:
0x7fe8fa1264e3: mov %ebp,%ebx
0x7fe8fa1264e5: and $0xfffffffffffffffe,%ebx
0x7fe8fa1264e8: mov %ebx,0x3c(%r14)
0x7fe8fa1264ec: and $0x1,%ebp
0x7fe8fa1264ef: mov %ebp,0x218(%r14)
0x7fe8fa1264f6: cmp $0xff000000,%ebx
0x7fe8fa1264fc: jae 0x7fe8fa126509
0x7fe8fa126502: xor %eax,%eax
0x7fe8fa126504: jmpq 0x7fe8fa122016
0x7fe8fa126509: mov %r14,%rdi
0x7fe8fa12650c: mov $0x8,%esi
0x7fe8fa126511: mov $0x56095dbeccf5,%r10
0x7fe8fa12651b: callq *%r10

which is a difference of one cmp/branch-not-taken. This will
be lost in the noise of having to exit generated code and
look up the next TB anyway.

Backports commit 3bb8a96f5348913ee130169504f3642f501b113e from qemu
2018-03-02 14:58:14 -05:00
Peter Maydell cfc1611d6f
arm: Track M profile handler mode state in TB flags
For M profile exception-return handling we'd like to generate different
code for some instructions depending on whether we are in Handler
mode or Thread mode. This isn't the same as "are we privileged
or user", so we need an extra bit in the TB flags to distinguish.

Backports commit 064c379c99b835bdcc478d21a3849507ea07d53a from qemu
2018-03-02 14:54:16 -05:00
Peter Maydell 8233756382
arm: Move condition-failed codepath generation out of if()
Move the code to generate the "condition failed" instruction
codepath out of the if (singlestepping) {} else {}. This
will allow adding support for handling a new is_jmp type
which can't be neatly split into "singlestepping case"
versus "not singlestepping case".

Backports commit f021b2c4627890d82fbcc300db3bd782b37b7f8a from qemu

arm: Abstract out "are we singlestepping" test to utility function

We now test for "are we singlestepping" in several places and
it's not a trivial check because we need to care about both
architectural singlestep and QEMU gdbstub singlestep. We're
also about to add another place that needs to make this check,
so pull the condition out into a function.

Backports commit b636649f5a2e108413dd171edaf320f781f57942 from qemu
2018-03-02 14:52:30 -05:00
Peter Maydell 43d6e73fea
arm: Move gen_set_condexec() and gen_set_pc_im() up in the file
Move the utility routines gen_set_condexec() and gen_set_pc_im()
up in the file, as we will want to use them from a function
placed earlier in the file than their current location.

Backports commit 4d5e8c969a74c86124fc2284ea603cc6dd3c5dfa from qemu
2018-03-02 14:48:36 -05:00
Peter Maydell 23141d7620
arm: Factor out "generate right kind of step exception"
We currently have two places that do:
    if (dc->ss_active) {
        gen_step_complete_exception(dc);
    } else {
        gen_exception_internal(EXCP_DEBUG);
    }

Factor this out into its own function, as we're about to add
a third place that needs the same logic.

Backports commit 5425415ebba5fa20558e1ef25e1997a6f5ea4c7c from qemu
2018-03-02 14:45:30 -05:00
Peter Maydell ddfe550411
arm: Thumb shift operations should not permit interworking branches
In Thumb mode, the only instructions which can cause an interworking
branch by writing the PC are BLX, BX, BXJ, LDR, POP and LDM. Unlike
ARM mode, data processing instructions which target the PC do not
cause interworking branches.

When we added support for doing interworking branches on writes to
PC from data processing instructions in commit 21aeb3430ce7ba, we
accidentally changed a Thumb instruction to have interworking
branch behaviour for writes to PC. (MOV, MOVS register-shifted
register, encoding T2; this is the standard encoding for
LSL/LSR/ASR/ROR (register).)

For this encoding, behaviour with Rd == R15 is specified as
UNPREDICTABLE, so allowing an interworking branch is within
spec, but it's confusing and differs from our handling of this
class of UNPREDICTABLE for other Thumb ALU operations. Make
it perform a simple (non-interworking) branch like the others.

Backports commit bedb8a6b09c1754c3b9f155750c62dc087706698 from qemu
2018-03-02 14:42:40 -05:00
Peter Maydell 9f938da9e1
arm: Don't implement BXJ on M-profile CPUs
For M-profile CPUs, the BXJ instruction does not exist at all, and
the encoding should always UNDEF. We were accidentally implementing
it to behave like A-profile BXJ; correct the error.

Backports commit 9d7c59c84d4530d05e8702b1c3a31e6da00a397e from qemu
2018-03-02 14:42:04 -05:00
Peter Maydell e9d507a193
target/arm: Add assertion about FSC format for syndrome registers
In tlb_fill() we construct a syndrome register value from a
fault status register value which is filled in by arm_tlb_fill().
arm_tlb_fill() returns FSR values which might be in the format
used with short-format page descriptors, or the format used
with long-format (LPAE) descriptors. The syndrome register
always uses LPAE-format FSR status codes.

It isn't actually possible to end up delivering a syndrome
register value to the guest for a fault which is reported
with a short-format FSR (that kind of stage 1 fault will only
happen for an AArch32 translation regime which doesn't have
a syndrome register, and can never be redirected to an AArch64
or Hyp exception level). Add an assertion which checks this,
and adjust the code so that we construct a syndrome with
an invalid status code, rather than allowing set bits in
the FSR input to randomly corrupt other fields in the syndrome.

Backports commit 65ed2ed90d9d81fd4b639029be850ea5651f919f from qemu
2018-03-02 14:41:07 -05:00
Peter Maydell 1cf80d7536
arm: Move excnames[] array into arm_log_exceptions()
The excnames[] array is defined in internals.h because we used
to use it from two different source files for handling logging
of AArch32 and AArch64 exception entry. Refactoring means that
it's now used only in arm_log_exception() in helper.c, so move
the array into that function.

Backports commit 2c4a7cc5afb1bfc1728a39abd951ddd7714c476e from qemu
2018-03-02 14:39:37 -05:00
Peter Maydell 1af1944903
target/arm: Add missing entries to excnames[] for log strings
Recent changes have added new EXCP_ values to ARM but forgot
to update the excnames[] array which is used to provide
human-readable strings when printing information about the
exception for debug logging. Add the missing entries, and
add a comment to the list of #defines to help avoid the mistake
being repeated in future.

Backports commit 32b81e620ea562d56ab2733421b5da1082b237a2 from qemu
2018-03-02 14:38:23 -05:00
Richard Henderson 13242af398
target/arm: Fix aa64 ldp register writeback
For "ldp x0, x1, [x0]", if the second load is on a second page and
the second page is unmapped, the exception would be raised with x0
already modified. This means the instruction couldn't be restarted.

Backports commit 2d1bbf51c2cb948da4b6fd5f91cf3ecc80b28156 from qemu
2018-03-02 14:35:46 -05:00
Paolo Bonzini c27870520a
exec: revert MemoryRegionCache
MemoryRegionCache did not know about virtio support for IOMMUs (because the
two features were developed at the same time). Revert MemoryRegionCache
to "normal" address_space_* operations for 2.9, as it is simpler than
undoing the virtio patches.

Backports commit 90c4fe5fc517a045e7a7cf2f23472e114042ca29 from qemu
2018-03-02 14:30:41 -05:00
Peter Maydell 008a235b5e
tcg/sparc: Zero extend address argument to ld/st helpers
The C store helper functions take the address argument as a
target_ulong type; if this is 32 bit but the host is 64 bit
then the SPARC calling convention requires that the caller
must zero extend the value. We weren't doing this, which
meant we could pass values to the caller with high bits set
and QEMU would crash if it was compiled with optimizations.
In particular, the i386 BIOS would not start.

Backports commit 5c32be5baf41aec4f4675d2bf24f9948756abf3c from qemu
2018-03-02 14:25:17 -05:00
Peter Maydell 40718df109
tcg/sparc: Zero extend data argument to store helpers
The C store helper functions take the data argument as a uint8_t,
uint16_t, etc depending on the store size. The SPARC calling
convention requires that data types smaller than the register
size must be extended by the caller. We weren't doing this,
which meant that if QEMU was compiled with optimizations enabled
we could end up storing incorrect values to guest memory.
(In particular the i386 guest BIOS would crash on startup.)

Add code to the trampolines that call the store helpers to
do the zero extension as required.

Backports commit 709a340d679d95a0c6cbb9b5f654498f04345b50 from qemu
2018-03-02 14:24:24 -05:00
Eduardo Habkost e71c7b7819
i386: Don't override -cpu options on -cpu host/max
The existing code for "host" and "max" CPU models overrides every
single feature in the CPU object at realize time, even the ones
that were explicitly enabled or disabled by the user using
"feat=on" or "feat=off", while features set using +feat/-feat are
kept.

This means "-cpu host,+invtsc" works as expected, while
"-cpu host,invtsc=on" doesn't.

This was a known bug, already documented in a comment inside
x86_cpu_expand_features(). What makes this bug worse now is that
libvirt 3.0.0 and newer now use "feat=on|off" instead of
+feat/-feat when it detects a QEMU version that supports it (see
libvirt commit d47db7b16dd5422c7e487c8c8ee5b181a2f9cd66).

Change the feature property getter/setter to set a
env->user_features field, to keep track of features that were
explicitly changed using QOM properties. Then make the
max_features code not override user features when handling "-cpu
host" and "-cpu max".

This will also allow us to remove the plus_features/minus_features
hack in the future, but I plan to do that after 2.9.0 is
released.

Backports commit d4a606b38b5d4b3689b86cc1575908e82179ecfb from qemu
2018-03-02 14:22:45 -05:00
Pranith Kumar 31b977ab3e
tcg/i386: Check the size of instruction being translated
This fixes the bug: 'user-to-root privesc inside VM via bad translation
caching' reported by Jann Horn here:
https://bugs.chromium.org/p/project-zero/issues/detail?id=1122

Backports commit 30663fd26c0307e414622c7a8607fbc04f92ec14 from qemu
2018-03-02 14:19:35 -05:00
Paolo Bonzini b750ec2363
configure: remove Cygwin
The Cygwin target is really compiling for native Win32 with -mno-cygwin.
Except, GCC 4.7.0 has finally removed the long deprecated -mno-cygwin
option, and that happened about five years ago.

Let it rest in peace.

Backports commit c8645752ce31cc044ecc5f969a986fdcb6aab590 from qemu
2018-03-02 14:17:41 -05:00
Yongbok Kim ce3aecf263
target/mips: fix delay slot detection in gen_msa_branch()
It is unnecessary to test for R6 in the delay/forbidden slot check
in gen_msa_branch().

https://bugs.launchpad.net/qemu/+bug/1663287

Backports commit 075a1fe788d36b271ec25507466c30b9a90b5d54 from qemu
2018-03-02 14:15:50 -05:00
Philippe Mathieu-Daudé d17c07b548
target-mips: replace break by goto cp0_unimplemented
This fixes many warnings like:

target/mips/translate.c:6253:13: warning: Value stored to 'rn' is never read
rn = "invalid sel";
^ ~~~~~~~~~~~~~

Backports commit 3570d7f6672836140f0a1ec9bf95dd5ea50a2aaa from qemu
2018-03-02 14:14:57 -05:00
Philippe Mathieu-Daudé 5f78f3cd80
target-mips: log bad coprocessor0 register accesses with LOG_UNIMP
Backports commit 965447eecb6b98d6dfc4dbd97f836093c7e398a0 from qemu
2018-03-02 14:12:29 -05:00
Philippe Mathieu-Daudé e7176e6c85
target-mips: remove old & unuseful comments
Backports commit 989f2aa9af7f05c323761b66c0e299059a19b7b1 from qemu
2018-03-02 14:11:20 -05:00
Philippe Mathieu-Daudé 65c69e6ccb
target-mips: fix compiler warnings (clang 5)
The static code analyzer complains:

target/mips/helper.c:453:5: warning: Function call argument is an uninitialized value
qemu_log_mask(CPU_LOG_MMU,
^~~~~~~~~~~~~~~~~~~~~~~~~~

'physical' and 'prot' are uninitialized if 'ret' is not TLBRET_MATCH.

Backports commit def74c0cf05722b2e502d4b4f1219966c5b0cbd3 from qemu
2018-03-02 14:09:55 -05:00
Peter Maydell 78303d4c1b
arm: Fix APSR writes via M profile MSR
Our implementation of writes to the APSR for M-profile via the MSR
instruction was badly broken.

First and worst, we had the sense wrong on the test of bit 2 of the
SYSm field -- this is supposed to request an APSR write if bit 2 is 0
but we were doing it if bit 2 was 1. This bug was introduced in
commit 58117c9bb429cd, so hasn't been in a QEMU release.

Secondly, the choice of exactly which parts of APSR should be written
is defined by bits in the 'mask' field. We were not passing these
through from instruction decode, making it impossible to check them
in the helper.

Pass the mask bits through from the instruction decode to the helper
function and process them appropriately; fix the wrong sense of the
SYSm bit 2 check.

Invalid mask values and invalid combinations of mask and register
number are UNPREDICTABLE; we choose to treat them as if the mask
values were valid.

Backports commit b28b3377d7e9ba35611d454d5a63ef50cab1f8c5 from qemu
2018-03-02 14:08:13 -05:00
Peter Maydell bb5819cbbc
armv7m: R14 should reset to 0xffffffff
For M profile (unlike A profile) the reset value of R14 is specified
as 0xffffffff. (The rationale is that this is an illegal exception
return value, so if guest code tries to return to it it will result
in a helpful exception.)

Registers r0 to r12 and the flags are architecturally UNKNOWN on
reset, so we leave those at zero.

Backports commit 056f43df9168413f304500b69c33158d66efb7cf from qemu
2018-03-02 13:56:36 -05:00
Michael Davidsaver f42f22ec02
armv7m: FAULTMASK should be 0 on reset
For M profile CPUs, FAULTMASK should be 0 on reset, like PRIMASK.
QEMU stores FAULTMASK in the PSTATE F bit, so (as with PRIMASK in the
I bit) we have to clear these to undo the A profile default of 1.

Update the comment accordingly and move it so that it's closer to the
code it's referring to.

Backports commit dc7abe4d65ad39390b2db120f5ad18f8f6576f8b from qemu
2018-03-02 13:55:59 -05:00
Peter Maydell 8a6d746aef
armv7m: Report no-coprocessor faults correctly
For v7M attempts to access a nonexistent coprocessor are reported
differently from plain undefined instructions (as UsageFaults of type
NOCP rather than type UNDEFINSTR). Split them out into a new
EXCP_NOCP so we can report the FSR value correctly.

Backports commit 7517748e3f71a3099e57915fba95c4c308e6d842 from qemu
2018-03-02 13:54:36 -05:00
Peter Maydell fce8138187
armv7m: Report no-coprocessor faults correctly
For v7M attempts to access a nonexistent coprocessor are reported
differently from plain undefined instructions (as UsageFaults of type
NOCP rather than type UNDEFINSTR). Split them out into a new
EXCP_NOCP so we can report the FSR value correctly.

Backports commit 7517748e3f71a3099e57915fba95c4c308e6d842 from qemu
2018-03-02 13:47:14 -05:00
Michael Davidsaver eaa080e232
armv7m: set CFSR.UNDEFINSTR on undefined instructions
When we take an exception for an undefined instruction, set the
appropriate CFSR bit.

Backports commit 81dd9648c69bb89afdd6f4bb3ed6f3efdac96524 from qemu
2018-03-02 13:45:56 -05:00
Michael Davidsaver 2297b8134b
armv7m: honour CCR.STACKALIGN on exception entry
The CCR.STACKALIGN bit controls whether the CPU is supposed to force
8-byte alignment of the stack pointer on entry to the exception handler.

Backports commit dc858c6633a9af8b80c1509cf6f825e4390d3ad1 from qemu
2018-03-02 13:45:06 -05:00
Peter Maydell 7870bcfcb0
armv7m: add state for v7M CCR, CFSR, HFSR, DFSR, MMFAR, BFAR
Add the structure fields, VMState fields, reset code and macros for
the v7M system control registers CCR, CFSR, HFSR, DFSR, MMFAR and
BFAR.

Backports commit 2c4da50d9477fb830d778bb5d6a11215aa359b44 from qemu
2018-03-02 13:43:55 -05:00
Michael Davidsaver 7f044bf8cc
armv7m: Clear FAULTMASK on return from non-NMI exceptions
FAULTMASK must be cleared on return from all
exceptions other than NMI.

Backports commit a20ee6005564590d33eabec11ed4dc7c432db36b from qemu
2018-03-02 13:41:16 -05:00
Michael Davidsaver cc9458cf59
armv7m: Explicit error for bad vector table
Give an explicit error and abort when a load
from the vector table fails. Architecturally this
should HardFault (which will then immediately
fail to load the HardFault vector and go into Lockup).
Since we don't model Lockup, just report this guest
error via cpu_abort(). This is more helpful than the
previous behaviour of reading a zero, which is the
address of the reset stack pointer and not a sensible
location to jump to.

Backports commit 1b9ea408fca1ce8caae67b792355b023c69c5ac5 from qemu
2018-03-02 13:38:08 -05:00
Michael Davidsaver 703489071f
armv7m: Replace armv7m.hack with unassigned_access handler
For v7m we need to catch attempts to execute from special
addresses at 0xfffffff0 and above. Previously we did this
with the aid of a hacky special purpose lump of memory
in the address space and a check in translate.c for whether
we were translating code at those addresses.

We can implement this more cleanly using a CPU
unassigned access handler which throws the exception
if the unassigned access is for one of the special addresses.

Backports commit 542b3478a00cb7ef51c259255b3ab1e2a7daada2 from qemu
2018-03-02 13:33:31 -05:00
Michael Davidsaver 8828b4e595
armv7m: MRS/MSR: handle unprivileged access
The MRS and MSR instruction handling has a number of flaws:
* unprivileged accesses should only be able to read
CONTROL and the xPSR subfields, and only write APSR
(others RAZ/WI)
* privileged access should not be able to write xPSR
subfields other than APSR
* accesses to unimplemented registers should log as
guest errors, not abort QEMU

Backports commit 58117c9bb429cd9552d998687aa99088eb1d8528 from qemu
2018-03-02 13:29:59 -05:00
Michael Davidsaver 2769c6ada0
armv7m: Fix reads of CONTROL register bit 1
The v7m CONTROL register bit 1 is SPSEL, which indicates
the stack being used. We were storing this information
not in v7m.control but in the separate v7m.other_sp
structure field. Unfortunately, the code handling reads
of the CONTROL register didn't take account of this, and
so if SPSEL was updated by an exception entry or exit then
a subsequent guest read of CONTROL would get the wrong value.

Using a separate structure field doesn't really gain us
anything in efficiency, so drop this unnecessary complexity
in favour of simply storing all the bits in v7m.control.

This is a migration compatibility break for M profile
CPUs only.

Backports commit abc24d86cc0364f402e438fae3acb14289b40734 from qemu
2018-03-02 13:26:38 -05:00
Peter Maydell d8eb259032
arm: Enforce should-be-1 bits in MRS decoding
The MRS instruction requires that bits [19..16] are all 1s, and for
A/R profile also that bits [7..0] are all 0s. At this point in the
decode tree we have checked all of the rest of the instruction but
were allowing these to be any value. If these bits are not set then
the result is architecturally UNPREDICTABLE, but choosing to UNDEF is
more helpful to the user and avoids unexpected odd behaviour if the
encodings are used for some purpose in future architecture versions.

Backports commit 3d54026fb06d1aea7ebb4e9825970b06bebcacac from qemu
2018-03-02 13:09:17 -05:00
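A sketch of the kind of encoding check described above; the masks simply follow from the stated bit positions and this is not the actual translate.c code:

    #include <stdbool.h>
    #include <stdint.h>

    /* MRS: bits [19:16] must all be 1 and, for A/R profiles, bits [7:0]
     * must all be 0.  Anything else is UNPREDICTABLE and is treated as
     * UNDEF by the change above. */
    static bool mrs_encoding_valid_a_profile(uint32_t insn)
    {
        return (insn & 0x000f0000) == 0x000f0000 &&
               (insn & 0x000000ff) == 0;
    }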
Peter Maydell dc44eded51
arm: Don't decode MRS(banked) or MSR(banked) for M profile
M profile doesn't have the MSR(banked) and MRS(banked) instructions
and uses the encodings for different kinds of M-profile MRS/MSR.
Guard the relevant bits of the decode logic to make sure we don't
fall into them by accident on M-profile.

(The bit being checked for this (bit 5) is part of the SYSm field on
M-profile, but since no currently allocated system registers have
encodings with bit 5 of SYSm set, this hasn't been a problem in
practice.)

Backports commit 43ac65742319ef5ac4461daf43316b189cd21e89 from qemu
2018-03-02 13:08:20 -05:00
Peter Maydell cc2a6a2728
arm: HVC and SMC encodings don't exist for M profile
M profile doesn't have the HVC or SMC encodings, so make them always
UNDEF rather than generating calls to helper functions that assume
A/R profile.

Backports commit 001b3cab51ebfcb13e8dd03ea25bfa3bd0c517a3 from qemu
2018-03-02 13:07:31 -05:00
Dr. David Alan Gilbert 55d79cf4c0
RAMBlocks: qemu_ram_is_shared
Provide a helper to say whether a RAMBlock was created as a
shared mapping.

Backports commit 463a4ac23bcf0f0b65c850fa66f5ae6e43edd243 from qemu
2018-03-02 13:05:35 -05:00
Dr. David Alan Gilbert 5dfbee8930
memory_region: Fix name comments
The 'name' parameter to memory_region_init_* had been marked as debug
only, however vmstate_region_ram uses it as a parameter to
qemu_ram_set_idstr to set RAMBlock names and these form part of the
migration stream.

Backports commit e8f5fe2de125a0bfbefbaa6a69af81f4817cb7a0 from qemu
2018-03-02 13:01:23 -05:00
Andrew Jones 0139cbc2cd
target/arm/arm-powerctl: Fix psci info return values
The power state spec section 5.1.5 AFFINITY_INFO defines the
affinity info return values as

0 ON
1 OFF
2 ON_PENDING

I grepped QEMU for power_state to ensure that no assumptions
of OFF=0 were being made.

Backports commit d5affb0d8677e1a8a8fe03fa25005b669e7cdc02 from qemu
2018-03-02 12:59:49 -05:00
Andrew Baumann 76cd64dd7e
target/arm: implement armv8 PMUSERENR (user-mode enable bits)
In armv8, this register implements more than a single bit, with
fine-grained enables for read access to event counters, cycles
counters, and write access to the software increment. This change
implements those checks using custom access functions for the relevant
registers.

Backports commit 6ecd0b6ba0591ef280ed984103924d4bdca5ac32 from qemu
2018-03-02 12:55:46 -05:00
Eduardo Habkost 9ddce7c01d
i386: Change stepping of Haswell to non-blacklisted value
glibc blacklists TSX on Haswell CPUs with model==60 and
stepping < 4. To make the Haswell CPU model more useful, make
those guests actually use TSX by changing CPU stepping to 4.

References:
* glibc commit 2702856bf45c82cf8e69f2064f5aa15c0ceb6359
https://sourceware.org/git/?p=glibc.git;a=commit;h=2702856bf45c82cf8e69f2064f5aa15c0ceb6359

Backports commit ec56a4a7b07e2943f49da273a31e3195083b1f2e from qemu
2018-03-02 12:53:11 -05:00
Eduardo Habkost f865b17639
i386: host_vendor_fms() helper function
Helper function for code that needs to check the host CPU
vendor/family/model/stepping values.

Backports commit 20271d484069f154fb262507e63adc3a37e885d2 from qemu
2018-03-02 12:51:44 -05:00
Alex Bennée b8caaac110
target/arm/helper: make it clear the EC field is also in hex
..just like the rest of the displayed ESR register. Otherwise people
might scratch their heads if a not obviously hex number is displayed
for the EC field.

Backports commit 6568da459b611845ef55526cd23afc9fa9f4647f from qemu
2018-03-02 12:50:33 -05:00
Paolo Bonzini bc7a9ccfbd
target-i386: defer VMEXIT to do_interrupt
Paths through the softmmu code during code generation now need to be audited
to check for double locking of tb_lock. In particular, VMEXIT can take tb_lock
through cpu_vmexit -> cpu_x86_update_cr4 -> tlb_flush.

To avoid this, split VMEXIT delivery in two parts, similar to what is done with
exceptions. cpu_vmexit only records the VMEXIT exit code and information, and
cc->do_interrupt can then deliver it when it is safe to take the lock.

Backports commit 10cde894b63146139f981857e4eedf756fa53dcb from qemu
2018-03-02 12:49:18 -05:00
Alex Bennée ad548f8110
translate-all: exit cpu_restore_state early if translating
The translation code uses cpu_ld*_code which can trigger a tlb_fill,
which, if it fails, will erroneously attempt a fault resolution. This
never works during translation as the TB being generated hasn't been
added yet. The target should have checked retaddr before calling
cpu_restore_state, but for those that have yet to be fixed we do it
here to avoid a recursive tb_lock() under MTTCG's new locking regime.

Backports commit d8b2239bcd8872a5c5f7534d1658fc2365caab2d from qemu
2018-03-02 12:46:16 -05:00
Alex Bennée a01496e6d9
target/i386/cpu.h: declare TCG_GUEST_DEFAULT_MO
This suppresses the incorrect warning when forcing MTTCG for x86
guests on x86 hosts. A future patch will still warn when
TARGET_SUPPORT_MTTCG hasn't been defined for the guest (which is still
pending for x86).

Backports commit 72c1701f62e8d44eb24a0583a958edc280105455 from qemu
2018-03-02 12:43:37 -05:00
Markus Armbruster 8a8dc93945
qapi: Improve qobject visitor documentation
Backports commit aa3a982e674b09ae32502940f93ba98b3a8ad50e from qemu
2018-03-02 12:24:21 -05:00
Markus Armbruster 67cb4b0900
qapi: Fix object input visit beyond end of list
Backports commit 1f41a645b65530859bf5984aa08e103bb452b473 from qemu
2018-03-02 12:22:50 -05:00
Markus Armbruster ac1a61af47
qapi: Make input visitors detect unvisited list tails
Fix the design flaw demonstrated in the previous commit: new method
check_list() lets input visitors report that unvisited input remains
for a list, exactly like check_struct() lets them report that
unvisited input remains for a struct or union.

Implement the method for the qobject input visitor (straightforward),
and the string input visitor (less so, due to the magic list syntax
there). The opts visitor's list magic is even more impenetrable, and
all I can do there today is a stub with a FIXME comment. No worse
than before.

Backports commit a4a1c70dc759e5b81627e96564f344ab43ea86eb from qemu
2018-03-02 12:21:04 -05:00
Markus Armbruster e0ee098c4a
qapi: Drop unused non-strict qobject input visitor
The split between tests/test-qobject-input-visitor.c and
tests/test-qobject-input-strict.c now makes less sense than ever. The
next commit will take care of that.

Backports commit 048abb7b20c9f822ad9d4b730bade73b3311a47a from qemu
2018-03-02 12:14:52 -05:00
Markus Armbruster 3e8b0c66a3
qom: Make object_property_set_qobject()'s input visitor strict
Commit 240f64b made all qobject input visitors created outside tests
strict, except for the one in object_property_set_qobject(). That one
was left behind only because Eric couldn't spare the time to figure
out whether making it strict would break anything, with a TODO
comment. Time to resolve it.

Strict makes a difference only for otherwise successful visits of QAPI
structs or unions. Let's examine what the callers of
object_property_set_qobject() visit:

* object_property_set_str(), object_property_set_bool(),
object_property_set_int() visit a QString, QBool, QInt,
respectively. Strictness can't matter.

* qmp_qom_set visits its @value argument. Comes straight from QMP and
can be anything ('any' in the QAPI schema). Strictness matters when
the property's set() method visits a struct or union QAPI type.

No such methods exist, thus switching to strict can't break
anything.

If we acquire such methods in the future, we'll *want* the visitor
to be strict, so that unexpected members get rejected as they should
be.

Switch to strict.

Backports commit 05601ed2de60df0e344d6b783a6bc0c1ff2b5d1f from qemu
2018-03-02 12:10:50 -05:00
Markus Armbruster 2b7daee13b
qapi: Make string input and opts visitor require non-null input
The string input visitor tries to cope with null input. Null input
isn't used anywhere, and isn't covered by tests. Unsurprisingly, it
doesn't fully work: start_list() crashes because it passes the input
via parse_str() to strtoll() unchecked.

Make string_input_visitor_new() assert its argument isn't null, and
drop the code trying to deal with null input.

The opts visitor crashes when you try to actually visit something with
null input. Make opts_visitor_new() assert its argument isn't null,
mostly for clarity.

qobject_input_visitor_new() already asserts its argument isn't null.

Backports commit f332e830e38b3ff3953ef02ac04e409ae53769c5 from qemu
2018-03-02 12:10:07 -05:00
Markus Armbruster 50e3cda49a
qapi: Drop string input visitor method optional()
visit_optional() is to be called only between visit_start_struct() and
visit_end_struct(). Visitors that don't support struct visits,
i.e. don't implement start_struct(), end_struct(), have no use for it.
Clarify documentation.

The string input visitor doesn't support struct visits. Its
parse_optional() is therefore useless. Drop it.

Backports commit a8aec6de2ac1a5e36989fdfba29067b361009b75 from qemu
2018-03-02 12:07:55 -05:00
Markus Armbruster 84e5261cdf
qapi: Improve qobject input visitor error reporting
Error messages refer to nodes of the QObject being visited by name.
Trouble is the names are sometimes less than helpful:

* The name of the root QObject is whatever @name argument got passed
to the visitor, except NULL gets mapped to "null". We commonly pass
NULL. Not good.

Avoiding errors "at the root" mitigates. For instance,
visit_start_struct() can only fail when the visited object is not a
dictionary, and we commonly ensure it is beforehand.

* The name of a QDict's member is the member key. Good enough only
when this happens to be unique.

* The name of a QList's member is "null". Not good.

Improve error messages by referring to nodes by path instead, as
follows:

* The path of the root QObject is whatever @name argument got passed
to the visitor, except NULL gets mapped to "<anonymous>".

* The path of a root QDict's member is the member key.

* The path of a root QList's member is "[%u]", where %u is the list
index, starting at zero.

* The path of a non-root QDict's member is the path of the QDict
concatenated with "." and the member key.

* The path of a non-root QList's member is the path of the QList
concatenated with "[%u]", where %u is the list index.

For example, the incorrect QMP command

{ "execute": "blockdev-add", "arguments": { "node-name": "foo", "driver": "raw", "file": {"driver": "file" } } }

now fails with

{"error": {"class": "GenericError", "desc": "Parameter 'file.filename' is missing"}}

instead of

{"error": {"class": "GenericError", "desc": "Parameter 'filename' is missing"}}

and

{ "execute": "input-send-event", "arguments": { "device": "bar", "events": [ [] ] } }

now fails with

{"error": {"class": "GenericError", "desc": "Invalid parameter type for 'events[0]', expected: object"}}

instead of

{"error": {"class": "GenericError", "desc": "Invalid parameter type for 'null', expected: QDict"}}

Aside: calling the thing "parameter" is suboptimal for QMP, because
the root object is "arguments" there.

The qobject output visitor doesn't have this problem because it should
not fail. Same for dealloc and clone visitors.

The string visitors don't have this problem because they visit just
one value, whose name needs to be passed to the visitor as @name. The
string output visitor shouldn't fail anyway.

The options visitor uses QemuOpts names. Their name space is flat, so
the use of QDict member keys as names is fine. NULL names used with
roots and lists could conceivably result in bad error messages. Left
for another day.

Backports commit a9fc37f6bc3f2ab90585cb16493da9f6dcfbfbcf from qemu
2018-03-02 12:05:53 -05:00
Markus Armbruster a5cf19858d
qapi: Make QObject input visitor set *list reliably
qobject_input_start_struct() sets *list, except when it fails because
qobject_input_get_object() fails, i.e. the input object doesn't exist.

All the other input visitor start_struct(), start_list(),
start_alternate() always set *obj / *list.

Change qobject_input_start_struct() to match.

Backports commit 58561c27669ddf1c6d39ff8ce25837c6f2d9d92c from qemu
2018-03-02 11:31:58 -05:00
Markus Armbruster fdf09c6d12
qapi: Clean up after commit 3d344c2
Drop unused QIV_STACK_SIZE and unused qobject_input_start_struct()
parameter errp.

Backports commit b8874fbfd329b5084463bcacd1418d493a93c383 from qemu
2018-03-02 11:30:38 -05:00
Markus Armbruster d7da652d4e
qapi: Improve a QObject input visitor error message
The QObject input visitor has three error message formats:

* Parameter '%s' is missing
* "Invalid parameter type for '%s', expected: %s"
* "QMP input object member '%s' is unexpected"

The '%s' are member names (or "null", but I'll fix that later).

The last error message calls the thing "QMP input object member"
instead of "parameter". Misleading when the visitor is used on
QObjects that don't come from QMP. Change it to "Parameter '%s' is
unexpected".

Backports commit 910f738b851a263396fc85b2052e47f884ffead3 from qemu
2018-03-02 11:29:02 -05:00
Markus Armbruster d07bcef231
qmp: Eliminate silly QERR_QMP_* macros
The QERR_ macros are leftovers from the days of "rich" error objects.

QERR_QMP_BAD_INPUT_OBJECT, QERR_QMP_BAD_INPUT_OBJECT_MEMBER,
QERR_QMP_EXTRA_MEMBER are used in just one place now, except for one
use that has crept into qobject-input-visitor.c.

Drop these macros, to make the (bad) error messages more visible.

Backports commit 99fb0c53c038105bae68b02a3d9f1cbf7951ba10 from qemu
2018-03-02 11:28:17 -05:00
Yongji Xie 23f5b17a08
memory: Introduce DEVICE_HOST_ENDIAN for ram device
At the moment ram device's memory regions are DEVICE_NATIVE_ENDIAN. It's
incorrect. This memory region is backed by a MMIO area in host, so the
uint64_t data that MemoryRegionOps read from/write to this area should be
host-endian rather than target-endian. Hence, current code does not work
when target and host endianness are different which is the most common case
on PPC64. To fix it, this introduces DEVICE_HOST_ENDIAN for the ram device.

This has been tested on PPC64 BE/LE host/guest in all possible combinations
including TCG.

Backports commit c99a29e702528698c0ce2590f06ca7ff239f7c39 from qemu
2018-03-02 11:24:32 -05:00
Paolo Bonzini 11709d0afa
cpu-exec: remove unnecessary check of cpu->exit_request
The cpu->exit_request check in cpu_loop_exec_tb is unnecessary,
because cpu->tcg_exit_req is always set after cpu->exit_request.
So let the TB exit and we will pick up the exit request later
in cpu_handle_interrupt.

Backports commit 55ac0a9bf4e1b1adfc7d73586a7aa085f58c9851 from qemu
2018-03-02 11:21:35 -05:00
Eduardo Habkost 33ab5f71c9
i386: Reorganize and document CPUID initialization steps
CPU runnability checks and CPU model expansion have slightly
different requirements. Document the steps involved in loading a
CPU model and realizing a CPU, so their requirements and purpose
are clearly defined.

This patch doesn't change any implementation. It just add
comments, rename the x86_cpu_load_features() function for clarity
(so it won't be confused with x86_cpu_load_def()), and move
x86_cpu_filter_features() closer to it.

Backports commit b8d834a00fa3ed4dad7d371e1a00938a126a54a0 from qemu
2018-03-02 10:55:00 -05:00
Eduardo Habkost be606acff9
i386: Rename X86CPU::host_features to X86CPU::max_features
Rename the field and add a small comment to make its purpose
clearer.

Backports commit 44bd8e530661be1d22ae0f461a5c9bdbcc3847ec from qemu
2018-03-02 10:51:40 -05:00
Pranith Kumar ee609fa59f
aarch64: Change ext type to TCGType to fix warnings
To fix the following warnings:

In file included from /users/pranith/qemu/tcg/tcg.c:255:
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:879:24: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType')
[-Wenum-conversion]
tcg_out_cmp(s, ext, a, b, b_const);
~~~~~~~~~~~ ^~~
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:893:36: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType')
[-Wenum-conversion]
tcg_out_insn(s, 3201, CBZ, ext, a, offset);
~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn'
glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__)
^
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:895:37: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType')
[-Wenum-conversion]
tcg_out_insn(s, 3201, CBNZ, ext, a, offset);
~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn'
glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__)
^
/users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:1610:27: warning: implicit conversion from enumeration type 'TCGType' (aka 'enum TCGType') to different enumeration type 'TCGMemOp' (aka 'enum TCGMemOp')
[-Wenum-conversion]
tcg_out_brcond(s, ext, a2, a0, a1, const_args[1], arg_label(args[3]));
~~~~~~~~~~~~~~ ^~~

backports commit dc1eccd661ada3b746ca4438e444993c36a0f04f from qemu
2018-03-02 10:48:56 -05:00
Peter Maydell e141ea5dd2
softfloat: Use correct type in float64_to_uint64_round_to_zero()
In float64_to_uint64_round_to_zero() a typo meant that we were
taking the uint64_t return value from float64_to_uint64() and
putting it into an int64_t variable before returning it as
uint64_t again. Use uint64_t instead of pointlessly casting it
back and forth to int64_t.

Backports commit d000b477f2693dbca97cd8ea751c2e0b71890662 from qemu
2018-03-02 10:44:10 -05:00
Peter Maydell 0c9ef6f4b3
cputlb: Don't assume do_unassigned_access() never returns
In get_page_addr_code(), if the guest PC doesn't correspond to RAM
then we currently run the CPU's do_unassigned_access() hook if it has
one, and otherwise we give up and exit QEMU with a more-or-less
useful message. This code assumes that the do_unassigned_access hook
will never return, because if it does then we'll plough on attempting
to use a non-RAM TLB entry to get a RAM address and will abort() in
qemu_ram_addr_from_host_nofail(). Unfortunately some CPU
implementations of this hook do return: Microblaze, SPARC and the ARM
v7M.

Change the code to call report_bad_exec() if the hook returns, as
well as if it didn't have one. This means we can tidy it up to use
the cpu_unassigned_access() function which wraps the "get the CPU
class and call the hook if it has one" work, since we aren't trying
to distinguish "no hook" from "hook existed and returned" any more.

This brings the handling of this hook into line with the handling
used for data accesses, where "hook returned" is treated the
same as "no hook existed" and gets you the default behaviour.

Backports commit 44d7ce0ef39cb45e13d384574d79799eb3d39834 from qemu
2018-03-02 10:42:35 -05:00
Nick Reilly 4114fb2c0e
Add missing fp_access_check() to aarch64 crypto instructions
The aarch64 crypto instructions for AES and SHA are missing the
check for if the FPU is enabled.

Backports commit a4f5c5b72380deeccd53a6890ea3782f10ca8054 from qemu
2018-03-02 10:39:16 -05:00
Alex Bennée caba238b5a
tcg: enable MTTCG by default for ARM on x86 hosts
This enables the multi-threaded system emulation by default for ARMv7
and ARMv8 guests using the x86_64 TCG backend. This is because on the
guest side:

- The ARM translate.c/translate-64.c have been converted to
- use MTTCG safe atomic primitives
- emit the appropriate barrier ops
- The ARM machine has been updated to
- hold the BQL when modifying shared cross-vCPU state
- defer powerctl changes to async safe work

All the host backends support the barrier and atomic primitives but
need to provide same-or-better support for normal load/store
operations.

Backports commit ca759f9e387db87e1719911f019bc60c74be9ed8 from qemu
2018-03-02 10:32:47 -05:00
Alex Bennée ff0ff28939
target-arm: don't generate WFE/YIELD calls for MTTCG
The WFE and YIELD instructions are really only hints and in TCG's case
they were useful to move the scheduling on from one vCPU to the next. In
the parallel context (MTTCG) this just causes an unnecessary cpu_exit
and contention of the BQL.

Backports commit c22edfebff29f63d793032e4fbd42a035bb73e27 from qemu
2018-03-02 10:27:36 -05:00
Alex Bennée 157efaa8a9
cputlb: tweak qemu_ram_addr_from_host_nofail reporting
This moves the helper function closer to where it is called and updates
the error message to report via error_report instead of the deprecated
fprintf.

Backports commit 857baec1d9e80947f0c1007c3a3d2331d62b4b53 from qemu
2018-03-02 10:24:03 -05:00
Alex Bennée 454932263c
cputlb and arm/sparc targets: convert mmuidx flushes from varg to bitmap
While the varargs approach was flexible, the original MTTCG ended up
having to munge the bits into a bitmap so the data could be used in
deferred work helpers. Instead of hiding that in cputlb we push the
change to the API to make it take a bitmap of MMU indexes instead.

For ARM some of the resulting flushes end up being quite long, so to aid
readability I've tended to move the index shifting to a new line so
all the bits being or-ed together line up nicely, for example:

tlb_flush_page_by_mmuidx(other_cs, pageaddr,
(1 << ARMMMUIdx_S1SE1) |
(1 << ARMMMUIdx_S1SE0));

Backports commit 0336cbf8532935d8e23c2aabf3e2ce2c0697b6ac from qemu
2018-03-02 10:12:40 -05:00
Alex Bennée d56a4b0be4
tcg: handle EXCP_ATOMIC exception for system emulation
The patch enables handling atomic code in the guest. This should be
preferably done in cpu_handle_exception(), but the current assumptions
regarding when we can execute atomic sections cause a deadlock.

The current mechanism discards the flags which were set in atomic
execution. We ensure they are properly saved by calling the
cc->cpu_exec_enter/leave() functions around the loop.

As we are running cpu_exec_step_atomic() from the outermost loop we
need to avoid an abort() when single stepping over atomic code since
the debug exception longjmp will point to the sigsetjmp in
cpu_exec(). We do this by setting a new jmp_env so that it jumps back
here on an exception.

Backports relevant parts of commit 08e73c48b053566bfe0c994f154f73991cd0ff0e from qemu
2018-03-02 09:56:43 -05:00
Alex Bennée 6760605e1c
tcg: enable thread-per-vCPU
There are a couple of changes that occur at the same time here:

- introduce a single vCPU qemu_tcg_cpu_thread_fn

One of these is spawned per vCPU with its own Thread and Condition
variables. qemu_tcg_rr_cpu_thread_fn is the new name for the old
single threaded function.

- the TLS current_cpu variable is now live for the lifetime of MTTCG
vCPU threads. This is for future work where async jobs need to know
the vCPU context they are operating in.

The user can now switch on multi-thread behaviour and spawn a thread
per-vCPU. For a simple test kvm-unit-test like:

./arm/run ./arm/locking-test.flat -smp 4 -accel tcg,thread=multi

Will now use 4 vCPU threads and have an expected FAIL (instead of the
unexpected PASS) as the default mode of the test has no protection when
incrementing a shared variable.

We enable the parallel_cpus flag to ensure we generate correct barrier
and atomic code if supported by the front and backends. This doesn't
automatically enable MTTCG until default_mttcg_enabled() is updated to
check the configuration is supported.

Backports relevant parts of commit 372579427a5040a26dfee78464b50e2bdf27ef26
2018-03-02 09:43:14 -05:00
Alex Bennée 632b853761
tcg: remove global exit_request
There are now only two uses of the global exit_request left.

The first ensures we exit the run_loop when we first start to process
pending work and in the kick handler. This is just as easily done by
setting the first_cpu->exit_request flag.

The second use is in the round robin kick routine. The global
exit_request ensured every vCPU would set its local exit_request and
cause a full exit of the loop. Now the iothread isn't being held while
running we can just rely on the kick handler to push us out as intended.

We lightly re-factor the main vCPU thread to ensure that cpu->exit_request
causes us to exit the main loop and process any IO requests that might
come along. As a cpu->exit_request may legitimately get squashed
while processing the EXCP_INTERRUPT exception we also check
cpu->queued_work_first to ensure queued work is expedited as soon as
possible.

Backports commit e5143e30fb87fbf179029387f83f98a5a9b27f19 from qemu
2018-03-02 09:38:08 -05:00
Alex Bennée 4d90497d14
tcg: rename tcg_current_cpu to tcg_current_rr_cpu
..and make the definition local to cpus. In preparation for MTTCG the
concept of a global tcg_current_cpu will no longer make sense. However
we still need to keep track of it in the single-threaded case to be able
to exit quickly when required.

qemu_cpu_kick_no_halt() moves and becomes qemu_cpu_kick_rr_cpu() to
emphasise its use-case. qemu_cpu_kick now kicks the relevant cpu as
well as qemu_kick_rr_cpu() which will become a no-op in MTTCG.

For the time being the setting of the global exit_request remains.

Backports commit 791158d93b27f22a17c2ada06621831d54f09a2c from qemu

Also atomically sets the unicorn equivalents
2018-03-02 09:28:51 -05:00
Lioncash 18a229a69f
Resolve symbol errors with softfloat 2018-03-02 09:25:05 -05:00
KONRAD Frederic c5730ff194
tcg: add options for enabling MTTCG
We know there will be cases where MTTCG won't work until additional work
is done in the front/back ends to support. It will however be useful to
be able to turn it on.

As a result MTTCG will default to off unless the combination is
supported. However the user can turn it on for the sake of testing.

Backports commit 8d4e9146b3568022ea5730d92841345d41275d66 from qemu
2018-03-02 09:25:01 -05:00
Alex Bennée 8c89344517
tcg: move TCG_MO/BAR types into own file
We'll be using the memory ordering definitions to define values for
both the host and guest. To avoid fighting with circular header
dependencies just move these types into their own minimal header.

Backports commit 20937143145b8f5a4194e5c407731ba38797864e from qemu
2018-03-02 09:08:44 -05:00
Pranith Kumar 616becc2dc
mttcg: translate-all: Enable locking debug in a debug build
Enable tcg lock debug asserts in a debug build by default instead of
relying on DEBUG_LOCKING. None of the other DEBUG_* macros have
asserts, so this patch removes DEBUG_LOCKING and enable these asserts
in a debug build.

Backports commit 6ac3d7e845549f08473f020c1c70f14b8911a67e from qemu
2018-03-02 09:00:58 -05:00
Markus Armbruster 89d8e58718
util/cutils: Change qemu_strtosz*() from int64_t to uint64_t
This will permit its use in parse_option_size().

Backports commit f46bfdbfc8f95cf65d7818ef68a801e063c40332 from qemu
2018-03-02 08:58:55 -05:00
Markus Armbruster 8650d0213c
util/cutils: Return qemu_strtosz*() error and value separately
This makes qemu_strtosz(), qemu_strtosz_mebi() and
qemu_strtosz_metric() similar to qemu_strtoi64(), except negative
values are rejected.

Backports commit f17fd4fdf0df3d2f3444399d04c38d22b9a3e1b7 from qemu
2018-03-02 08:57:16 -05:00
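A usage sketch for the interface that results from this qemu_strtosz series; the exact prototype is an assumption inferred from the commit descriptions in this log, not copied from the header:

    #include "qemu/cutils.h"   /* declares qemu_strtosz() in QEMU's tree */

    /* Assumed post-change signature:
     *   int qemu_strtosz(const char *nptr, const char **end, uint64_t *result);
     * returning 0 on success and a negative errno on failure. */
    static int parse_size_arg(const char *arg, uint64_t *bytes)
    {
        int ret = qemu_strtosz(arg, NULL, bytes);  /* NULL end: trailing junk rejected */
        if (ret < 0) {
            return ret;                            /* e.g. -EINVAL or -ERANGE */
        }
        return 0;
    }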
Markus Armbruster 6093e67947
util/cutils: Let qemu_strtosz*() optionally reject trailing crap
Change the qemu_strtosz() & friends to return -EINVAL when @endptr is
null and the conversion doesn't consume the string completely.
Matches how qemu_strtol() & friends work.

Only test_qemu_strtosz_simple() passes a null @endptr. No functional
change there, because its conversion consumes the string.

Simplify callers that use @endptr only to fail when it doesn't point
to '\0' to pass a null @endptr instead.

Backports commit 4fcdf65ae2c00ae69f7625f26ed41f37d77b403c from qemu
2018-03-02 08:54:53 -05:00
Markus Armbruster f9c9eb7334
util/cutils: Drop QEMU_STRTOSZ_DEFSUFFIX_* macros
Writing QEMU_STRTOSZ_DEFSUFFIX_* instead of '*' gains nothing. Get
rid of these eyesores.

Backports commit 17f942560e54f8ee72996bc3276c697503606d7b from qemu
2018-03-02 08:53:15 -05:00
Markus Armbruster 858acd4142
util/cutils: New qemu_strtosz()
Most callers of qemu_strtosz_suffix() pass QEMU_STRTOSZ_DEFSUFFIX_B.
Capture the pattern in new qemu_strtosz().

Inline qemu_strtosz_suffix() into its only remaining caller.

Backports commit 466dea14e677555dd24465aca75d00a3537ad062 from qemu
2018-03-02 08:50:56 -05:00
Markus Armbruster a3358798d6
util/cutils: Rename qemu_strtosz() to qemu_strtosz_MiB()
With qemu_strtosz(), no suffix means mebibytes. It's used rarely.
I'm going to add a similar function where no suffix means bytes.
Rename qemu_strtosz() to qemu_strtosz_MiB() to make the name
qemu_strtosz() available for the new function.

Backports commit e591591b323772eea733de6027f5e8b50692d0ff from qemu
2018-03-02 08:49:26 -05:00
Markus Armbruster f656cd91ec
util/cutils: New qemu_strtosz_metric()
To parse numbers with metric suffixes, we use

qemu_strtosz_suffix_unit(nptr, &eptr, QEMU_STRTOSZ_DEFSUFFIX_B, 1000)

Capture this in a new function for legibility:

qemu_strtosz_metric(nptr, &eptr)

Replace test_qemu_strtosz_suffix_unit() by test_qemu_strtosz_metric().

Rename qemu_strtosz_suffix_unit() to do_strtosz() and give it internal
linkage.

Backports commit d2734d2629266006b0413433778474d5801c60be from qemu
2018-03-02 08:47:40 -05:00
Markus Armbruster fb962d2e74
util/cutils: Clean up control flow around qemu_strtol() a bit
Reorder check_strtox_error() to make it obvious that we always store
through a non-null @endptr.

Transform

    if (some error) {
        error case ...
        err = value for error case;
    } else {
        normal case ...
        err = value for normal case;
    }
    return err;

to

    if (some error) {
        error case ...
        return value for error case;
    }
    normal case ...
    return value for normal case;

Backports commit 4baef2679e029c76707be1e2ed54bf3dd21693fe from qemu
2018-03-02 08:45:18 -05:00
Markus Armbruster 9236950e61
util/cutils: Clean up variable names around qemu_strtol()
Name same things the same, different things differently.

* qemu_strtol()'s parameter @nptr is called @p in
check_strtox_error(). Rename the latter.

* qemu_strtol()'s parameter @endptr is called @next in
check_strtox_error(). Rename the latter.

* qemu_strtol()'s variable @p is called @endptr in
check_strtox_error(). Rename both to @ep.

* qemu_strtol()'s variable @err is *negative* errno,
check_strtox_error()'s parameter @err is *positive*. Rename the
latter to @libc_errno.

Same for qemu_strtoul(), qemu_strtoi64(), qemu_strtou64(), of course.

Backports commit 717adf960933da0650d995f050d457063d591914 from qemu
2018-03-02 08:41:47 -05:00
Markus Armbruster 41c2e1168f
util/cutils: Rename qemu_strtoll(), qemu_strtoull()
The name qemu_strtoll() suggests conversion to long long, but it
actually converts to int64_t. Rename to qemu_strtoi64().

The name qemu_strtoull() suggests conversion to unsigned long long,
but it actually converts to uint64_t. Rename to qemu_strtou64().

Backports commit b30d188677456b17c1cd68969e08ddc634cef644 from qemu
2018-03-02 08:39:45 -05:00
Markus Armbruster ac34d92d09
util/cutils: Rewrite documentation of qemu_strtol() & friends
Fixes the following documentation bugs:

* Fails to document that null @nptr is safe.

* Fails to document that we return -EINVAL when no conversion could be
performed (commit 47d4be1).

* Confuses long long with int64_t, and unsigned long long with
uint64_t.

* Claims the unsigned conversions can underflow. They can't.

While there, mark problematic assumptions that int64_t is long long,
and uint64_t is unsigned long long with FIXME comments.

Backports commit 4295f879becfbbb9f4330489311586b96915d920 from qemu
2018-03-02 08:37:57 -05:00
Markus Armbruster 9d1937f25d
qdict: Make qdict_get_qlist() safe like qdict_get_qdict()
Commit 89cad9f changed qdict_get_qdict() to return NULL instead of
crash when the key doesn't exist or its value isn't a QDict.
Commit 2d6421a neglected to do the same for qdict_get_qlist().
Correct that, and update the function comments.

qdict_get_obj() is now unused, remove.

Backports commit b25f23e7dbc6bc0dcda010222a4f178669d1aedc from qemu
2018-03-02 08:35:17 -05:00
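A small usage sketch of the safer behaviour described above; the dictionary key is made up for illustration:

    #include "qapi/qmp/qdict.h"
    #include "qapi/qmp/qlist.h"

    static void inspect_events(QDict *args)
    {
        /* After this change the getter returns NULL instead of crashing when
         * "events" is absent or is not a list. */
        QList *events = qdict_get_qlist(args, "events");
        if (!events) {
            return;   /* missing or wrong type: handle gracefully */
        }
        /* ... walk the list ... */
    }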
Bharata B Rao 7fadaf0bc4
softfloat: Add float128_to_uint32_round_to_zero()
float128_to_uint32_round_to_zero() is needed by xscvqpuwz instruction
of PowerPC ISA 3.0.

Backports commit fd425037d25cecaaffdb3831697e0adc10ca2ba3 from qemu
2018-03-02 08:33:09 -05:00
Bharata B Rao 64d32a2237
softfloat: Add float128_to_uint64_round_to_zero()
Implement float128_to_uint64() and use that to implement
float128_to_uint64_round_to_zero()

This is required by xscvqpudz instruction of PowerPC ISA 3.0.

Backports commit 2e6d85683576c970c714c1cc071dca742835b9d4 from qemu
2018-03-02 08:32:02 -05:00
Bharata B Rao 80e522b499
softfloat: Add round-to-odd rounding mode
Power ISA 3.0 introduces a few quadruple precision floating point
instructions that support round-to-odd rounding mode. The
round-to-odd mode is explained as under:

Let Z be the intermediate arithmetic result or the operand of a convert
operation. If Z can be represented exactly in the target format, the
result is Z. Otherwise the result is either Z1 or Z2 whichever is odd.
Here Z1 and Z2 are the next larger and smaller numbers representable
in the target format respectively.

Backports commit 9ee6f678f473007e252934d6acd09c24490d9d42 from qemu
2018-03-02 08:25:00 -05:00
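A minimal sketch of the round-to-odd rule applied to a raw significand, not QEMU's softfloat implementation: truncate, and if any discarded bit was non-zero, force the least significant kept bit to 1 so the result is the odd neighbour.

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    static uint64_t round_to_odd(uint64_t sig, unsigned discard_bits)
    {
        uint64_t kept = sig >> discard_bits;
        bool inexact = (sig & ((1ULL << discard_bits) - 1)) != 0;
        return inexact ? (kept | 1) : kept;   /* kept is already odd, or becomes odd */
    }

    int main(void)
    {
        printf("%llu\n", (unsigned long long)round_to_odd(0x13, 2)); /* inexact: 4 -> 5 */
        printf("%llu\n", (unsigned long long)round_to_odd(0x10, 2)); /* exact: 4 */
        return 0;
    }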
Paul Burton 411ddd16cf
target-mips: Provide function to test if a CPU supports an ISA
Provide a new cpu_supports_isa function which allows callers to
determine whether a CPU supports one of the ISA_ flags, by testing
whether the associated struct mips_def_t sets the ISA flags in its
insn_flags field.

An example use of this is to allow boards which generate bootloader code
to determine the properties of the CPU that will be used, for example
whether the CPU is 64 bit or which architecture revision it implements.

Backports commit bed9e5ceb158c886d548fe59675a6eba18baeaeb from qemu
2018-03-02 08:20:19 -05:00
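A very rough sketch of the shape of such a predicate; the table contents, field types and lookup below are simplified assumptions rather than the actual target-mips code:

    #include <stdbool.h>
    #include <stdint.h>
    #include <string.h>

    typedef struct {
        const char *name;
        uint64_t insn_flags;       /* ISA_* bits the CPU model implements */
    } mips_def_t;

    static const mips_def_t mips_defs[] = {
        { "example-32", 0x1 },     /* made-up models and flag values */
        { "example-64", 0x3 },
    };

    static bool cpu_supports_isa(const char *cpu_model, uint64_t isa)
    {
        for (size_t i = 0; i < sizeof(mips_defs) / sizeof(mips_defs[0]); i++) {
            if (strcmp(mips_defs[i].name, cpu_model) == 0) {
                return (mips_defs[i].insn_flags & isa) != 0;
            }
        }
        return false;
    }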
Paolo Bonzini 37918ba5b0
exec: make address_space_cache_destroy idempotent
Clear cache->mr so that address_space_cache_destroy does nothing
the second time it is called.

Backports commit 91047df38dffa80222179f63fbb74c1dfefa25ed from qemu
2018-03-02 08:16:17 -05:00
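A sketch of the idempotence pattern described above, with simplified stand-in types rather than the actual exec.c code: clear the field the destructor keys on, so calling it a second time does nothing.

    #include <stdlib.h>

    typedef struct {
        void *mr;                  /* stand-in for cache->mr */
    } Cache;

    static void cache_destroy(Cache *cache)
    {
        if (!cache->mr) {
            return;                /* already destroyed: no-op */
        }
        free(cache->mr);           /* placeholder for the real release step */
        cache->mr = NULL;          /* makes a repeated destroy harmless */
    }

    int main(void)
    {
        Cache c = { .mr = malloc(16) };
        cache_destroy(&c);         /* releases and clears */
        cache_destroy(&c);         /* second call is safely a no-op */
        return 0;
    }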
Paolo Bonzini e66da21a56
cpu-exec: remove outermost infinite loop
Reorganize the sigsetjmp so that the restart case falls through
to cpu_handle_exception and the execution loop.

Backports commit 4515e58d60dc3aac53dbd5e53e4c3bec126967d8 from qemu
2018-03-02 08:13:43 -05:00
Paolo Bonzini af524401ad
cpu-exec: avoid repeated sigsetjmp on interrupts
The sigsetjmp only needs to be prepared once for the whole execution
of cpu_exec. This patch takes care of the "== 0" side, using a
nested loop so that cpu_handle_interrupt goes straight back to
cpu_handle_exception without doing another sigsetjmp.

Backports commit a42cf3f3f266a97ceb13e8b99bc7b13f7bf4192a from qemu
2018-03-02 08:09:50 -05:00
Paolo Bonzini 28b615a8b7
cpu-exec: avoid cpu_loop_exit in cpu_handle_interrupt
The siglongjmp goes straight back to the beginning of cpu_exec's
outermost loop. We do not need a siglongjmp, we can simply
leave the inner TB execution loop.

Backports commit 209b71b60ef3341246038e1c926c3b704969cdd3 from qemu
2018-03-02 08:03:18 -05:00
Paolo Bonzini b39acfc3c6
cpu-exec: tighten barrier on TCG_EXIT_REQUESTED
This seems to have worked just fine so far on weakly-ordered
architectures, but I don't see anything that prevents the
reordering from:

store 1 to exit_request
store 1 to tcg_exit_req
load tcg_exit_req
store 0 to tcg_exit_req
load exit_request
store 0 to exit_request
store 1 to exit_request
store 1 to tcg_exit_req

to this:

store 1 to exit_request
store 1 to tcg_exit_req
load tcg_exit_req
load exit_request
store 1 to exit_request
store 1 to tcg_exit_req
store 0 to tcg_exit_req
store 0 to exit_request

therefore losing a request. It's possible that other memory barriers
(e.g. in rcu_read_unlock) are hiding it, but better safe than
sorry.

Backports commit a70fe14b7dddcb944fbd6c9f3739cd3a22089af5 from qemu
2018-03-02 08:01:08 -05:00
Wei Huang c9bdf5e6c7
target-arm: Enable vPMU support under TCG mode
This patch contains several fixes to enable vPMU under TCG mode. It
first removes the checking of kvm_enabled() while unsetting
ARM_FEATURE_PMU. With it, the .pmu option can be used to turn on/off vPMU
under TCG mode. Secondly, the PMU node of the DT table is now created under TCG.
The last fix is to disable the masking of the PMUver field of ID_AA64DFR0_EL1.

Backports commit d6f02ce3b8a43ddd8f83553fe754a34b26fb273f from qemu
2018-03-02 07:58:48 -05:00
Wei Huang 5e3349a818
target-arm: Add support for PMU register PMINTENSET_EL1
This patch adds access support for PMINTENSET_EL1.

Backports commit e6ec54571e424bb1d6e50e32fe317c616cde3e05 from qemu
2018-03-02 07:57:40 -05:00
Wei Huang 3b34b7f0f9
target-arm: Add support for AArch64 PMU register PMXEVTYPER_EL0
In order to support Linux perf, which uses PMXEVTYPER register,
this patch adds read/write access support for PMXEVTYPER. The access
is CONSTRAINED UNPREDICTABLE when PMSELR is not 0x1f. Additionally
this patch adds support for PMXEVTYPER_EL0.

Backports commit fdb8665672ded05f650d18f8b62d5c8524b4385b from qemu
2018-03-02 07:53:05 -05:00
Wei Huang 1165020022
target-arm: Add support for PMU register PMSELR_EL0
This patch adds support for AArch64 register PMSELR_EL0. The existing
PMSELR definition is revised accordingly.

Backports commit 6b0407805d46bbeba70f4be426285d0a0e669750 from qemu
2018-03-02 07:39:43 -05:00
Peter Maydell bddeac4430
target/arm: A32, T32: Create Instruction Syndromes for Data Aborts
Add support for generating the ISS (Instruction Specific Syndrome)
for Data Abort exceptions taken from AArch32. These syndromes are
used by hypervisors for example to trap and emulate memory accesses.

This is the equivalent for AArch32 guests of the work done for AArch64
guests in commit aaa1f954d4cab243.

Backports commit 9bb6558a218bf7e466e5ac1100639517d8a30d33 from qemu
2018-03-02 00:37:06 -05:00
Peter Maydell 74d42aa939
target/arm: Abstract out pbit/wbit tests in ARM ldr/str decode
In the ARM ldr/str decode path, rather than directly testing
"insn & (1 << 21)" and "insn & (1 << 24)", abstract these
bits out into wbit and pbit local flags. (We will want to
do more tests against them to determine whether we need to
provide syndrome information.)

Backports commit 63f26fcfda8e19f94ce23336726d14805250a5b6 from qemu
2018-03-02 00:26:58 -05:00
Julian Brown cc217b0c90
arm: Correctly handle watchpoints for BE32 CPUs
In BE32 mode, sub-word size watchpoints can fail to trigger because the
address of the access is adjusted in the opcode helpers before being
compared with the watchpoint registers. This patch reverses the address
adjustment before performing the comparison with the help of a new CPUClass
hook.

This version of the patch augments and tidies up comments a little.

Backports commit 40612000599e52e792d23c998377a0fa429c4036 from qemu
2018-03-02 00:24:33 -05:00
Julian Brown 58059c3a35
Fix Thumb-1 BE32 execution and disassembly.
Thumb-1 code has some issues in BE32 mode (as currently implemented). In
short, since bytes are swapped within words at load time for BE32
executables, this also swaps pairs of adjacent Thumb-1 instructions.

This patch un-swaps those pairs of instructions again, both for execution,
and for disassembly. (The previous version of the patch always read four
bytes in arm_read_memory_func and then extracted the proper two bytes,
in a probably misguided attempt to match the behaviour of actual hardware
as described by e.g. the ARM9TDMI TRM, section 3.3 "Endian effects for
instruction fetches". It's less complicated to just read the correct
two bytes though.)

Backports commit f7478a92dd9ee2276bfaa5b7317140d3f9d6a53b from qemu
2018-03-02 00:20:11 -05:00
Julian Brown 1aedb26670
target/arm: Add cfgend parameter for ARM CPU selection.
Add a new "cfgend" property which selects whether the CPU resets into
big-endian mode or not. This setting affects whether we reset with
SCTLR_B (ARMv6 and earlier) or SCTLR_EE (ARMv7 and later) set.

Backports commit 3a062d5730266b2386eeda68b1a1c6e96451db31 from qemu
2018-03-02 00:18:18 -05:00
Bharata B Rao 4324d1e97e
softfloat: Fix the default qNAN for target-ppc
Currently float128_default_nan() returns 0xFFFF800000000000 in the
higher double word, but it should return 0x7FFF800000000000 which
is the correct higher double word for default qNAN on PowerPC.

Backports commit 5d51eaea84899d88cb161fab3f089168e3812e9e from qemu
2018-03-02 00:15:36 -05:00
Michael S. Tsirkin ad6873ec57
arm: better stub version for MISMATCH_CHECK
The stub version of MISMATCH_CHECK is empty, so it's easy to misuse for
people not building kvm on arm. Use QEMU_BUILD_BUG_ON similar to the
non-stub version to make it easier to catch bugs.

Backports commit 705ae59fecae341a4b1a45ce48b46de4b1bb3cf4 from qemu
2018-03-02 00:13:45 -05:00
Michael S. Tsirkin 4d1139f83f
arm: add trailing ; after MISMATCH_CHECK
Macro calls without a trailing ; look weird in C, this works as a side
effect of how QEMU_BUILD_BUG_ON is implemented. Fix this up.

Backports commit 1b28762a333bd238611103e9ed2348d7af93b0db from qemu
2018-03-02 00:12:04 -05:00
Michael S. Tsirkin 0455644974
ARRAY_SIZE: check that argument is an array
It's a familiar pattern: some code uses ARRAY_SIZE, then refactoring
changes the argument from an array to a pointer to a dynamically
allocated buffer. Code keeps compiling but any ARRAY_SIZE calls now
return the size of the pointer divided by element size.

Let's add build time checks to ARRAY_SIZE before we allow more
of these in the code-base.

Backports commit ed63ec0d22ccdce3b2222d9a514423b7fbba3a0d from qemu
2018-03-02 00:09:51 -05:00
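A hedged sketch of the array check (GCC/Clang extensions; not QEMU's exact macros): an array and a pointer to its first element have different types, so the helper only compiles when its argument really is an array.

    #include <stdio.h>

    /* Negative bit-field width is a compile error; multiplying by 0 keeps the
     * value of the surrounding expression unchanged when the check passes. */
    #define BUILD_BUG_ON_ZERO(cond) (sizeof(struct { int: (cond) ? -1 : 1; }) * 0)
    #define ARRAY_SIZE_CHECKED(x) \
        (sizeof(x) / sizeof((x)[0]) + \
         BUILD_BUG_ON_ZERO(__builtin_types_compatible_p(typeof(x), typeof(&(x)[0]))))

    int main(void)
    {
        int a[4];
        int *p = a;

        printf("%zu\n", ARRAY_SIZE_CHECKED(a));   /* prints 4 */
        /* printf("%zu\n", ARRAY_SIZE_CHECKED(p)); -- would fail to compile */
        (void)p;
        return 0;
    }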
Michael S. Tsirkin ac013df0a2
compiler: expression version of QEMU_BUILD_BUG_ON
QEMU_BUILD_BUG_ON uses a typedef in order to be safe
to use outside functions, but sometimes it's useful
to have a version that can be used within an expression.
Following what Linux does, introduce QEMU_BUILD_BUG_ON_ZERO
that return zero after checking condition at build time.

Backports commit d757573e69f2ef58a4a7b41f6c55d65fa1e1c5c2 from qemu
2018-03-02 00:07:33 -05:00
Michael S. Tsirkin 634a8094f1
compiler: rework BUG_ON using a struct
There are theoretical concerns that some compilers might not trigger
build failures on attempts to define an array of size (x ? -1 : 1) where
x is a variable, and make it a variable sized array instead. Let's rewrite
it using a struct with a negative bit field size instead, as there are no
dynamic bit field sizes. This is similar to what Linux does.

Backports commit f291887e8eef5d37d31484638f6e62401b4b99a2 from qemu
2018-03-02 00:05:07 -05:00
Michael S. Tsirkin 7f9fb3395c
QEMU_BUILD_BUG_ON: use __COUNTER__
Some headers use QEMU_BUILD_BUG_ON. This causes a problem
if the C file including that header happens to have
QEMU_BUILD_BUG_ON at the same line number.

Fix using a widely available extension: __COUNTER__.
If unavailable, provide a stub.

Backports commit 60abf0a5e05134187e274ce5f32524ccf0cae1a6 from qemu
2018-03-02 00:03:44 -05:00
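A hedged sketch combining the struct-based assert and the __COUNTER__ trick from the compiler.h commits above (illustrative names and GCC/Clang extensions, not the exact QEMU macros): a negative bit-field width cannot be compiled, and __COUNTER__ gives each expansion a unique typedef name so two asserts on the same line number in different headers no longer collide.

    #define CAT_(a, b) a##b
    #define CAT(a, b)  CAT_(a, b)
    #define BUILD_BUG_ON(cond) \
        typedef struct { int: (cond) ? -1 : 1; } CAT(build_bug_on_, __COUNTER__)

    BUILD_BUG_ON(sizeof(long) < sizeof(int));   /* condition is false: compiles */
    /* BUILD_BUG_ON(1); */                      /* a true condition would not compile */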
Michael S. Tsirkin beca05eb5f
compiler: drop ; after BUILD_BUG_ON
All users include the trailing ; anyway, let's require that -
it seems cleaner.

Backports commit f29831828441318c7916ae28e6e16e4a1c4a6795 from qemu
2018-03-02 00:01:44 -05:00
Ladi Prosek babf848b82
memory: don't sign-extend 32-bit writes
ldl_p has a signed return type so assigning it to uint64_t implicitly
sign-extends the value. This results in devices with min_access_size = 8
seeing unexpected values passed to their write handlers.

Example: guest performs a 32-bit write of 0x80000000 to an mmio region
and the handler receives 0xFFFFFFFF80000000 in its value argument.

Backports commit 6da67de6803e93cbb7e93ac3497865832f8c00ea from qemu
2018-03-02 00:00:22 -05:00
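A standalone C example of the sign-extension pitfall described above:

    #include <stdint.h>
    #include <stdio.h>

    int main(void)
    {
        int32_t  loaded  = (int32_t)0x80000000u;  /* what a signed 32-bit load returns */
        uint64_t widened = loaded;                /* sign-extends: 0xffffffff80000000 */
        uint64_t fixed   = (uint32_t)loaded;      /* zero-extends: 0x0000000080000000 */

        printf("%016llx\n%016llx\n",
               (unsigned long long)widened, (unsigned long long)fixed);
        return 0;
    }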
Peter Maydell 48825c1be2
target/arm: Drop IS_M() macro
We only use the IS_M() macro in two places, and it's a bit of a
namespace grab to put in cpu.h. Drop it in favour of just explicitly
calling arm_feature() in the places where it was used.

Backports commit 531c60a97ab51618b4b9ccef1c5fe00607079706 from qemu
2018-03-01 23:59:09 -05:00
Cao jin f2a5ddf5dc
util/mmap-alloc: refactor a little bit for readability
The 1st mmap returns *ptr*, which is aligned to the host page size:

    |<------------- size + align ------------->|
    ^
    ptr

The input param *align* could be 1M, 2M, or the host page size. After
QEMU_ALIGN_UP, offset will be >= 0.

The 2nd mmap uses the flag MAP_FIXED and returns ptr+offset, or else fails.
If it succeeds, we will have something like:

    |<-- offset -->|<------------ size ------------>|
    ^              ^
    ptr            ptr1

*ptr1* is what we really want to return; it equals ptr+offset.

Backports commit 6e4c890e15b23f078650499fbde11760b8eccf10 from qemu
2018-03-01 23:55:15 -05:00
Cao jin 217c14ad3e
util/mmap-alloc: check parameter before using
Backports commit 4a3ecf201a1a49a804e8506df5906e446707c3b1 from qemu
2018-03-01 23:53:45 -05:00
Eduardo Habkost f424e16f24
i386: Remove AMD feature flag aliases from Opteron models
When CPU vendor is set to AMD, the AMD feature alias bits on
CPUID[0x80000001].EDX are already automatically copied from CPUID[1].EDX
on x86_cpu_realizefn(). When CPU vendor is Intel, those bits are
reserved and should be zero. In either case, those bits shouldn't be set
in the CPU model table.

Commit 726a8ff68677d8d5fba17eb0ffb85076bfb598dc removed those
bits from most CPU models, but the Opteron_* entries still have
them. Remove the alias bits from Opteron_* too.

Add an assert() to x86_register_cpudef_type() to ensure we don't
make the same mistake again.

Backports commit 2a923a293df95334fa22634016efdd138f49da7f from qemu
2018-03-01 23:49:04 -05:00
He Chen b37fa358f3
x86: add AVX512_VPOPCNTDQ features
AVX512_VPOPCNTDQ: Vector POPCNT instructions for dwords and qwords.

Backports commit f77543772dcd38fa438470d9b80bafbd3a3ebbd7 from qemu
2018-03-01 23:44:32 -05:00
Richard Henderson 5c4f79ac62
target-hppa: Add softfloat specializations
Like the original MIPS, HPPA has the MSB of an SNaN set.
However, it has different rules for silencing an SNaN:
(1) msb is cleared and (2) msb-1 must be set if the fraction
is now zero, and (implementation defined) may be set always.
I haven't checked real hardware but chose the set always
alternative because it's easy and within spec.

Backports commit 005fa38d86257d471ac461c066a5409a9f5ebb02 from qemu
2018-03-01 23:42:09 -05:00
Sascha Silbe 11c66029b7
error: error_setg_errno(): errno gets preserved
C11 allows errno to be clobbered by pretty much any library function
call, so in general callers need to take care to save errno before
calling other functions.

However, for error reporting functions this is rather awkward and can
make the code on the caller side more complicated than
necessary. error_setg_errno() already takes care of preserving errno
and some functions rely on that, so just promise that we continue to
do so in the future.

Backports commit 98cb89af4df7e1776ce418ed6167b6e214a64435 from qemu
2018-03-01 23:38:25 -05:00
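A usage sketch of the guarantee described above; open_backing_file() is a made-up example function, while error_setg_errno() is QEMU's error API:

    #include <errno.h>
    #include <fcntl.h>
    #include "qapi/error.h"   /* Error, error_setg_errno() */

    static int open_backing_file(const char *path, Error **errp)
    {
        int fd = open(path, O_RDONLY);
        if (fd < 0) {
            error_setg_errno(errp, errno, "could not open '%s'", path);
            return -errno;    /* errno is still intact after the call above */
        }
        return fd;
    }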
Peter Maydell aca671b3b1
target-arm: Enable EL2 feature bit on A53 and A57
Enable the ARM_FEATURE_EL2 bit on Cortex-A53 and
Cortex-A57, since this is all now sufficiently implemented
to work with the GICv3. We provide the usual CPU property
to disable it for backwards compatibility with the older
virt boards.

In this commit, we disable the EL2 feature on the
virt and ZynqMP boards, so there is no overall effect.
Another commit will expose a board-level property to
allow the user to enable EL2.

Backports commit c25bd18a04c8bd0f19556d719864b7b08528222d from qemu
2018-03-01 23:36:44 -05:00
Peter Maydell a036c73de8
target/arm/psci.c: If EL2 implemented, start CPUs in EL2
The PSCI spec states that a CPU_ON call should cause the new
CPU to be started in the highest implemented Non-secure
exception level. We were incorrectly starting it at the
exception level of the caller, which happens to be correct
if EL2 is not implemented. Implement the correct logic
as described in the PSCI 1.0 spec section 6.4:
* if EL2 exists and SCR_EL3.HCE is set: start in EL2
* otherwise start in EL1

Backports commit 3f591a20221511c639cc7959755e570801a21cd2 from qemu
2018-03-01 23:34:57 -05:00
Jean-Christophe DUBOIS 0aa0b849c2
ARM: Factor out ARM on/off PSCI control functions
Split ARM on/off function from PSCI support code.

This will allow these functions to be reused in other code.

Backports commit 825482adde1f971cbddf27e15fb4453ab3fae994 from qemu
2018-03-01 23:31:47 -05:00
Peter Maydell 468e2849cd
target/arm: Implement DBGVCR32_EL2 system register
The DBGVCR_EL2 system register is needed to run a 32-bit
EL1 guest under a Linux EL2 64-bit hypervisor. Its only
purpose is to provide AArch64 with access to the state of
the DBGVCR AArch32 register. Since we only have a dummy
DBGVCR, implement a corresponding dummy DBGVCR32_EL2.

Backports commit 4d2ec4da1c2d60c9fd8bad137506870c2f980410 from qemu
2018-03-01 23:02:28 -05:00
Peter Maydell 0db334c0e4
target/arm: Handle VIRQ and VFIQ in arm_cpu_do_interrupt_aarch32()
To run a VM in 32-bit EL1 our AArch32 interrupt handling code
needs to be able to cope with VIRQ and VFIQ exceptions.
These behave like IRQ and FIQ except that we don't need to try
to route them to Monitor mode.

Backports commit 87a4b270348c69a446ebcddc039bfae31b1675cb from qemu
2018-03-01 22:59:08 -05:00
Lioncash ebae552174
mips: Build fix 2018-03-01 22:56:23 -05:00
Thomas Huth b2f1326437
Move target-* CPU file into a target/ folder
We've currently got 18 architectures in QEMU, and thus 18 target-xxx
folders in the root folder of the QEMU source tree. More architectures
(e.g. RISC-V, AVR) are likely to be included soon, too, so the main
folder of the QEMU sources slowly gets quite overcrowded with the
target-xxx folders.
To disburden the main folder a little bit, let's move the target-xxx
folders into a dedicated target/ folder, so that target-xxx/ simply
becomes target/xxx/ instead.

Backports commit fcf5ef2ab52c621a4617ebbef36bf43b4003f4c0 from qemu
2018-03-01 22:50:58 -05:00
Artyom Tarasenko 59ec6876bd
target-sparc: add ST_BLKINIT_ ASIs for UA2005+ CPUs
In OpenSPARC T1+ TWINX ASIs in store instructions are aliased
with Block Initializing Store ASIs.

"UltraSPARC T1 Supplement Draft D2.1, 14 May 2007" describes them
in the chapter "5.9 Block Initializing Store ASIs"

Integer stores of all sizes are allowed with these ASIs.

Backports commit 3390537b5df4014e24a30f9bdcfa05c2bd0cd6d8 from qemu
2018-03-01 22:29:21 -05:00
Artyom Tarasenko a6981c9b91
target-sparc: store the UA2005 entries in sun4u format
According to chapter 13.3 of the
UltraSPARC T1 Supplement to the UltraSPARC Architecture 2005,
only the sun4u format is available for data-access loads.

Store UA2005 entries in the sun4u format to simplify processing.

Backports commit 7285fba083de3f14f6e98abb4469173b56da9480 from qemu
2018-03-01 22:28:12 -05:00
Artyom Tarasenko aa24403d8a
target-sparc: implement UA2005 ASI_MMU (0x21)
Backports commit 7dd8c0760ee197420273a7dfeab13bf54f6bbd8d from qemu
2018-03-01 22:25:39 -05:00
Artyom Tarasenko aac6955197
target-sparc: add more registers to dump_mmu
Backports commit d00a2334433483d1751d94aabdf47985a68010d3 from qemu
2018-03-01 22:23:46 -05:00
Artyom Tarasenko 49e61dc62f
target-sparc: implement auto-demapping for UA2005 CPUs
Backports commit 70f44d2f4bce44fa04426def3290306fa8064b91 from qemu
2018-03-01 22:23:06 -05:00
Artyom Tarasenko b20b29fc8e
target-sparc: allow 256M sized pages
Backports commit 70f44d2f4bce44fa04426def3290306fa8064b91 from qemu
2018-03-01 22:22:50 -05:00
Lioncash 92730d9626
target-sparc: simplify ultrasparc_tsb_pointer 2018-03-01 22:18:25 -05:00
Artyom Tarasenko 76d1612dcb
target-sparc: implement UA2005 TSB Pointers
Backports commit 15f746cedc6db2cc8fc7bcfe7692e02263caeeca from qemu
2018-03-01 21:31:47 -05:00
Artyom Tarasenko f3d96d19e5
target-sparc: use SparcV9MMU type for sparc64 I/D-MMUs
Backports commit 96df2bc99f9bdaf7a2f13550111f219b72b73708 from qemu
2018-03-01 21:28:43 -05:00
Artyom Tarasenko c61e580b2d
target-sparc: replace the last tlb entry when no free entries left
Implement the behavior described in the chapter 13.9.11 of
UltraSPARC T1™ Supplement to the UltraSPARC Architecture 2005:

"If a TLB Data-In replacement is attempted with all TLB
entries locked and valid, the last TLB entry (entry 63) is
replaced."

Backports commit 4797a6851975c1239df440c5f01d8566e63717bb from qemu
2018-03-01 21:26:05 -05:00
Artyom Tarasenko c43a89b2bc
target-sparc: ignore writes to UA2005 CPU mondo queue register
Backports commit 2f1b52920205863024cc86007e88557f4c2c898e from qemu
2018-03-01 21:25:28 -05:00
Artyom Tarasenko 0c5a21230f
target-sparc: allow privileged ASIs in hyperprivileged mode
Backports commit 7cd39ef234a7e2eea45a08cd15f920da5f1ba008 from qemu
2018-03-01 21:24:10 -05:00
Artyom Tarasenko 3a5a9dd6cd
target-sparc: use direct address translation in hyperprivileged mode
Please note that QEMU doesn't implement Real->Physical address
translation. The "Real Address" is always the "Physical Address".

Backports commit 84f8f5876628963e67f66edde8a71208c4274ac8 from qemu
2018-03-01 21:24:09 -05:00
Artyom Tarasenko f07be0ac3f
target-sparc: fix immediate UA2005 traps
Backports commit 5c65df364af0a2cc60af318e5a3011ae5fce293a from qemu
2018-03-01 21:24:09 -05:00
Artyom Tarasenko 2f2bde32bf
target-sparc: implement UA2005 rdhpstate and wrhpstate instructions
Backports commit f7f17ef75c9c90db63c44d11dc16fc085ca2c474 from qemu
2018-03-01 21:24:09 -05:00
Artyom Tarasenko 0a124b2199
target-sparc: implement UA2005 GL register
Backports commit cbc3a6a4cc675516328a2b0d3602355d68b6302d from qemu
2018-03-01 21:24:09 -05:00
Artyom Tarasenko 05e80b59af
target-sparc: implement UA2005 hypervisor traps
Backports commit 6e040755f12eba34d2fa3d56b18de32d63fea631 from qemu
2018-03-01 21:24:09 -05:00
Artyom Tarasenko 8710ef1128
target-sparc: hypervisor mode takes over nucleus mode
According to UA2005, 9.3.3 "Address Space Identifiers",

"In hyperprivileged mode, all instruction fetches and loads and stores with implicit
ASIs use a physical address, regardless of the value of TL".

Backports commit 9a10756d1204c3528e47892195349bf882069846 from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko 204a4dc1d3
target-sparc: implement UltraSPARC-T1 Strand status ASR
Backports commit b8e31b3cc6315bc5c6ec686c363c088c4fb1d0ea from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko 2f329af7ef
target-sparc: implement UA2005 scratchpad registers
Backports commit 4ec3e34654990868ad73a5a452a46d7f9f9dd378 from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko ec74c31ebf
target-sparc: simplify replace_tlb_entry by using TTE_PGSIZE
Backports commit e4d06ca74b751e486ca2a57f586fd4b858a13085 from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko 926247a35e
target-sparc: on UA2005 don't deliver Interrupt_level_n IRQs in hypervisor mode
As described in Chapter 5.7.6 of the UltraSPARC Architecture 2005,
outstanding disrupting exceptions that are destined for privileged mode can only
cause a trap when the virtual processor is in nonprivileged or privileged mode and
PSTATE.ie = 1. At all other times, they are held pending.

Backports commit 1a2aefae6627170fdee689b394a65f76080c068a from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko f486053ae0
target-sparc: add UltraSPARC T1 TLB #defines
Backports commit 5b5352b2f41e460f213a515e087c24dac1322f49 from qemu
2018-03-01 21:24:08 -05:00
Artyom Tarasenko b3f7d376cc
target-sparc: add UA2005 TTE bit #defines
Backports commit c2c7f864df16ed6ef7ef21d255c5593dbeaec261 from qemu
2018-03-01 21:24:07 -05:00
Artyom Tarasenko c1c88e147d
target-sparc: use explicit mmu register pointers
Use explicit register pointers while accessing D/I-MMU registers.
Call cpu_unassigned_access on access to missing registers.

Backports commit 20395e63375358bf6dd147057aaf998abf7abdb9 from qemu
2018-03-01 21:24:07 -05:00
Artyom Tarasenko be8357f8b5
target-sparc: store cpu super- and hypervisor flags in TB
Backports commit c9b459aab8c5775a21dd913fc8820b736181e7be from qemu
2018-03-01 21:24:00 -05:00
Artyom Tarasenko 96af2cfb58
target-sparc: ignore MMU-faults if MMU is disabled in hypervisor mode
while IMMU/DMMU is disabled
- ignore MMU-faults in hypervisor mode or if CPU doesn't have hypervisor
- signal TT_INSN_REAL_TRANSLATION_MISS/TT_DATA_REAL_TRANSLATION_MISS otherwise

Backports commit 1ceca928538a3633b74a7dc718a05ce6767f2f76 from qemu
2018-03-01 20:25:32 -05:00
Lioncash d905278b86
Make unicorn happy with TLB execution 2018-03-01 20:13:37 -05:00
Alex Bennée e3e57ca08e
cputlb: drop flush_global flag from tlb_flush
We have never had the concept of global TLB entries which would avoid
the flush so we never actually use this flag. Drop it and make clear
that tlb_flush is the sledge-hammer it has always been.

Backports commit d10eb08f5d8389c814b554d01aa2882ac58221bf from qemu
2018-03-01 19:36:04 -05:00
Alex Bennée 7e2cc86ad2
cpu_common_reset: wrap TCG specific code in tcg_enabled()
Both the cpu->tb_jmp_cache and SoftMMU TLB structures are only used
when running TCG code so we might as well skip them for anything else.

Backports commit ba7d3d1858c257e39b47f7f12fa2016ffd960b11 from qemu
2018-03-01 19:29:57 -05:00
Alex Bennée 780ed8722e
qom/cpu: move tlb_flush to cpu_common_reset
It is a common thing amongst the various cpu reset functions to want to
flush the SoftMMU's TLB entries. This is done either by calling
tlb_flush directly or by way of a general memset of the CPU
structure (sometimes both).

This moves the tlb_flush call to the common reset function and
additionally ensures it is only done for the CONFIG_SOFTMMU case and
when tcg is enabled.

In some target cases we add an empty end_of_reset_fields structure to the
target vCPU structure so we have a clear end point for any memset which
resets values in the structure before CPU_COMMON (where the TLB
structures are).

While this is a nice clean-up in general it is also a precursor for
changes coming to cputlb for MTTCG where the clearing of entries
can't be done arbitrarily across vCPUs. Currently the cpu_reset
function is usually called from the context of another vCPU as the
architectural power up sequence is run. By using the cputlb API
functions we can ensure the right behaviour in the future.

Backports commit 1f5c00cfdb8114c1e3a13426588ceb64f82c9ddb from qemu
2018-03-01 19:21:07 -05:00
Laurent Vivier 770989f36f
target-m68k: increment/decrement with SP
On 680x0 family only.

Address Register indirect With postincrement:

When using the stack pointer (A7) with byte size data, the register
is incremented by two.

Address Register indirect With predecrement:

When using the stack pointer (A7) with byte size data, the register
is decremented by two.
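
As a sketch of the rule (an illustrative helper, not the translator code):
the stack pointer stays word aligned, so a byte-sized push or pop still moves
it by two.

    /* Register A7 is the stack pointer; byte accesses through it adjust the
     * register by 2 to keep it word aligned, all other cases use the size. */
    static int areg_adjust(int areg, int opsize_bytes)
    {
        return (areg == 7 && opsize_bytes == 1) ? 2 : opsize_bytes;
    }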

Backports commit 727d937b59f1f722f983e20f9cd23b0e7ef60165 from qemu
2018-03-01 19:16:22 -05:00
Laurent Vivier 6ff83aadab
target-m68k: CAS doesn't need aligned access
Backports commit b19578f42872aefef891e5804359af8d935a5487 from qemu
2018-03-01 19:15:20 -05:00
Laurent Vivier 636bf36272
target-m68k: manage pre-dec and post-inc in CAS
In these cases we must update the address register after
the operation.

Backports commit 308feb935249ad745ef763707e1db69bc10ba789 from qemu
2018-03-01 19:14:35 -05:00
Laurent Vivier 1197d778cc
target-m68k: fix gen_flush_flags()
gen_flush_flags() unconditionally sets cc_op_synced to 1
and s->cc_op to CC_OP_FLAGS, whereas env->cc_op can be set
to something else by a previous tcg fragment.

We fix that by not setting cc_op_synced to 1
(except for gen_helper_flush_flags() that updates env->cc_op)

FIX: https://github.com/vivier/qemu-m68k/issues/19

Backports commit 695576db2daaf2bdc63e7f6d36038b61caed622a from qemu
2018-03-01 19:13:35 -05:00
Laurent Vivier b3c3cf84a5
target-m68k: fix bit operation with immediate value
M680x0 bit operations with an immediate value use 9 bits of the 16bit
value, while coldfire ones use only 8 bits.

Backports commit fe53c2be8c12da345bd788b949e0b2360e4b3db3 from qemu
2018-03-01 19:12:20 -05:00
Richard Henderson 6f5081314b
target-m68k: Implement bfffo
Backports commit a45f1763cc501861ea4f5eed06e6f58aa681a082 from qemu
2018-03-01 19:10:59 -05:00
Richard Henderson 797e5d44e9
target-m68k: Implement bitfield ops for memory
Backports commit f2224f2c9a9ed63edaed77ae21ffb1e501d7f247 from qemu
2018-03-01 19:07:06 -05:00
Richard Henderson 4f481b2c5a
target-m68k: Implement bitfield ops for registers
Backports commit ac815f46a325b5dabe2ebd6561e4244767c0a603 from qemu
2018-03-01 18:58:47 -05:00
Doug Evans 7bd3170ea5
target/i386: Fix bad patch application to translate.c
In commit c52ab08aee6f7d4717fc6b517174043126bd302f,
the patch snippet for the "syscall" insn got applied to "iret".

Backports commit 410e98146ffde201ab4c778823ac8beaa74c4c3f from qemu
2018-03-01 18:52:10 -05:00
Richard Henderson 4bec129626
tcg/i386: Handle ctpop opcode
Backports commit 993508e43e6d180e9ba9b747a9657eac69aec5bb from qemu
2018-03-01 18:49:43 -05:00
Richard Henderson 3a0fba32f3
tcg/ppc: Handle ctpop opcode
Backports commit 33e75fb9c8cc44165c8dad9093762ba728cc7596 from qemu
2018-03-01 18:46:43 -05:00
Richard Henderson 6d4fc1319a
tcg/ppc: Handle ctz and clz opcodes
Backports commit d0b07481fabb4dc4ed05d56d09718758f5f7a136 from qemu
2018-03-01 18:44:54 -05:00
Richard Henderson ff3512a045
tcg: Use ctpop to generate ctz if needed
Particularly when andc is also available, this is two insns
shorter than using clz to compute ctz.
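
The identity behind this, sketched in plain C (the ~x & (x - 1) step is what
a single andc instruction provides):

    #include <stdint.h>

    /* ~x & (x - 1) keeps exactly the trailing zero bits of x as ones, so
     * counting its set bits counts the trailing zeros; for x == 0 the mask
     * is all ones and the count is the full width, 32. */
    static int ctz32_via_ctpop(uint32_t x)
    {
        return __builtin_popcount(~x & (x - 1));
    }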

Backports commit 14e99210f6c6cede461a54b2e0f9b4cd55175f00 from qemu
2018-03-01 18:39:20 -05:00
Richard Henderson 5ca8ac1aeb
qemu/host-utils.h: Reduce the operation count in the fallback ctpop
Backports commit 7bdcecb7b2d79c292d1256f7d6cf0f1da50d381f from qemu
2018-03-01 18:35:51 -05:00
Richard Henderson 8a62878523
target-i386: Use ctpop helper
Backports commit 4885c3c49531995d67e54907d01d5aa1350faaaf from qemu
2018-03-01 18:34:10 -05:00
Richard Henderson d072ea48e7
target-sparc: Use ctpop helper
Backports commit 08da3180dca8d41881b321d43944d97a838792fa from qemu
2018-03-01 18:28:54 -05:00
Richard Henderson 5f6e7bbdbd
tcg: Add opcode for ctpop
The number of actual invocations of ctpop itself does not warrant
an opcode, but it is very helpful for POWER7 to use in generating
an expansion for ctz.

Backports commit a768e4e99247911f00c5c0267c12d4e207d5f6cc from qemu
2018-03-01 18:26:41 -05:00
Richard Henderson 01b3c6273a
target-arm: Use clrsb helper
Backports commit bc21dbcc1203ae6bb536f832c46a3b5e22a73451 from qemu
2018-03-01 18:16:56 -05:00
Richard Henderson fff7ca4617
tcg: Add helpers for clrsb
The number of actual invocations does not warrant an opcode,
nor the backends generating it. But at least we can eliminate
redundant helpers.
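
For context, clrsb counts the leading redundant sign bits: the bits after the
sign bit that are equal to it. A portable 32-bit fallback might look like this
sketch, where clz of zero is taken to be 32:

    #include <stdint.h>

    static int clrsb32_sketch(int32_t x)
    {
        /* XOR-ing with a smear of the sign bit turns the redundant copies of
         * the sign into leading zeros; subtract 1 so the sign bit itself is
         * not counted. */
        uint32_t v = (uint32_t)x ^ (uint32_t)(x >> 31);
        return (v ? __builtin_clz(v) : 32) - 1;
    }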

Backports commit 086920c2c8008f125fd38781072fa25c3ad158ea from qemu
2018-03-01 18:14:11 -05:00
Richard Henderson 246d891668
tcg/i386: Handle ctz and clz opcodes
Backports commit bbf25f90ba802a286fd72be9175a860ae5fec726 from qemu
2018-03-01 16:56:08 -05:00
Richard Henderson 73ab332185
tcg/i386: Allow bmi2 shiftx to have non-matching operands
Previously we could not have different constraints for different ISA levels,
which prevented us from eliding the matching constraint for shifts.

We do now have to make sure that the operands match for constant shifts.
We can also handle some small left shifts via lea.

Backports commit 6a5aed4bdc7078838a8098336588d56c9ce09d1d from qemu
2018-03-01 16:45:04 -05:00
Richard Henderson 9e3feebbfb
tcg/i386: Hoist common arguments in tcg_out_op
Backports commit 42d5b514928a8a0d2f55a4c243d1333f9675815b from qemu
2018-03-01 16:42:30 -05:00
Richard Henderson 142ca07077
tcg/i386: Fully convert tcg_target_op_def
Use a switch instead of searching a table. Share constraints between
32-bit and 64-bit, when at all possible.

Backports commit cd26449a505f808e479af4fdd539e05767e09c06 from qemu
2018-03-01 16:32:31 -05:00
Richard Henderson 54ca83b900
tcg/s390: Handle clz opcode
Backports commit ce411066f4886cf3a4981fc0a070042a221a5fc8 from qemu
2018-03-01 16:24:29 -05:00
Richard Henderson a90e026c18
tcg/mips: Handle clz opcode
Backports commit 2a1d9d41aedd722d674b2a94d9b7dbea61469cac from qemu
2018-03-01 16:22:52 -05:00
Richard Henderson 303fc987ed
tcg/arm: Handle ctz and clz opcodes
Backports commit cc0fec8a4d2a8546fe236a09bfd80150af9cbe6b from qemu
2018-03-01 16:20:46 -05:00
Richard Henderson 2b87ddda35
tcg/aarch64: Handle ctz and clz opcodes
Backports commit 53c76c19904983d2c81e4f5e77027c241918a479 from qemu
2018-03-01 16:19:34 -05:00
Richard Henderson 22ebc5fcee
target-i386: Use clz and ctz opcodes
Backports commit e5143c90883cd32a432eb793cdcce6bee747834a from qemu
2018-03-01 16:17:42 -05:00
Richard Henderson 9cde8bfc44
target-arm: Use clz opcode
Backports commit 7539a012f614b724426ac9360238f3281d928a3f from qemu
2018-03-01 16:13:26 -05:00
Richard Henderson 9b2752b0a9
target-mips: Use clz opcode
Backports commit 1a0196c5c7f197fad7b079074d587b3204bcfb0f from qemu
2018-03-01 16:08:19 -05:00
Richard Henderson 2cf34e1b55
tcg: Add clz and ctz opcodes
Backports commit 0e28d0063bbd9e59a981ea2d20f82f30c5d956a8 from qemu
2018-03-01 16:04:11 -05:00
Richard Henderson b4b173615c
tcg: Allow an operand to be matching or a constant
This allows an output operand to match an input operand
only when the input operand needs a register.

Backports commit 17280ff4a5f264e01e55ae514ee6d3586f9577b2 from qemu
2018-03-01 15:49:05 -05:00
Richard Henderson 3f38611159
tcg: Pass the opcode width to target_parse_constraint
This will let us choose how to interpret a given constraint
depending on whether the opcode is 32- or 64-bit. Which will
let us share more constraint combinations between opcodes.

At the same time, change the interface to return the advanced
pointer instead of passing it in/out by reference.

Backports commit 069ea736b50b75fdec99c9b8cc603b97bd98419e from qemu
2018-03-01 15:45:40 -05:00
Richard Henderson b8c93597b4
tcg: Transition flat op_defs array to a target callback
This will allow the target to tailor the constraints to the
auto-detected ISA extensions.

Backports commit f69d277ece43c42c7ab0144c2ff05ba740f6706b from qemu
2018-03-01 15:40:11 -05:00
Richard Henderson 551ef0a9f7
tcg: Add markup for output requires new register
This is the same concept as, and same markup as, the
early clobber markup in gcc.

Backports commit 82790a870992bd87d5fd9e607f40859dcf4f82ac from qemu
2018-03-01 15:24:58 -05:00
Richard Henderson 199b3859c4
tcg/optimize: Fold movcond 0/1 into setcond
Backports commit 333b21b809fc80ce67c8f6a7d1c7cc66437d9791 from qemu
2018-03-01 14:41:38 -05:00
Richard Henderson b62743947f
target-mips: Use the new extract op
Use extract for EXT and DEXT.

Backports commit 6eebb7a438236fcf3fdadb013921ac597aaea911 from qemu
2018-03-01 14:39:20 -05:00
Richard Henderson e5acbeb86e
target-i386: Use new deposit and extract ops
A couple of places where it was easy to identify a right-shift
followed by an extract or and-with-immediate, and the obvious
sign-extract from a high byte register.

Backports commit 04fc2f1c8fc030a11e08e81bb926392c0991282a from qemu
2018-03-01 14:38:17 -05:00
Richard Henderson ce3c153bd8
target-arm: Use new deposit and extract ops
Use the new primitives for UBFX and SBFX.

Backports commits 59a71b4c5b4ef2ef6425b9e21c972dd5bf450275 and 86c9ab277615af4e0389eb80a83073873ff96c86 from qemu
2018-03-01 14:09:17 -05:00
Richard Henderson f0781470b4
tcg/s390: Support deposit into zero
Since we can no longer use matching constraints, this does
mean we must handle that data movement by hand.

Backports commit 752b1be94757de906b9c24ebc8f5e6aa54b96b23 from qemu
2018-03-01 13:47:20 -05:00
Richard Henderson a7462cc7bf
tcg/s390: Implement field extraction opcodes
Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu
2018-03-01 13:45:33 -05:00
Richard Henderson ab8871ea82
tcg/s390: Implement field extraction opcodes
Backports commit b0bf5fe82df93c180f69d439af59f1f546632f13 from qemu
2018-03-01 13:43:46 -05:00
Richard Henderson 348802286c
tcg/s390: Expose host facilities to tcg-target.h
This lets us expose facilities to TCG_TARGET_HAS_* defines
directly, rather than hiding behind function calls.

Backports commit b2c98d9d392c87c9b9e975d30f79924719d9cbbe from qemu
2018-03-01 13:43:00 -05:00
Richard Henderson db41c6f1d0
tcg/ppc: Implement field extraction opcodes
Backports commit c05021c3c8d6c976e4677d3010b9ef01488a4434 from qemu
2018-03-01 13:38:42 -05:00
Richard Henderson b10a4a9ee6
tcg/mips: Implement field extraction opcodes
Backports commit befbb3ced5869003ee2e806c4f36e306918d2374 from qemu
2018-03-01 13:37:24 -05:00
Richard Henderson 7a7a5c640d
tcg/i386: Implement field extraction opcodes
Backports commit 78fdbfb94616f0391834d2eccabd16ea29e37da5 from qemu
2018-03-01 13:35:41 -05:00
Richard Henderson cabb6f71a0
tcg/arm: Implement field extraction opcodes
Backports commit ec903af18418e0870af84f6036d7aca1e6a5dc0a from qemu
2018-03-01 13:33:55 -05:00
Richard Henderson c4f56ec541
tcg/arm: Move isa detection to tcg-target.h
This allows us to use this detection within the TCG_TARGET_HAS_*
macros, instead of requiring a function call into tcg-target.inc.c.

Backports commit 40b2ccb156534f5d5f1d110a6ce008d87ee10af1 from qemu
2018-03-01 13:32:39 -05:00
Richard Henderson fbea4130fc
tcg/aarch64: Implement field extraction opcodes
Backports commit e2179f94a17bf0933df29ce1b4f6bc93cbe7dbd3 from qemu
2018-03-01 13:30:55 -05:00
Richard Henderson 9f2fcaaf27
tcg: Add deposit_z expander
While we don't require a new opcode, it is handy to have an expander
that knows the first source is zero.

Backports commit 07cc68d52852bf47dea7c402b46ddd28248d4212 from qemu
2018-03-01 13:29:24 -05:00
Richard Henderson 8e0585dcb1
tcg: Add field extraction primitives
Adds tcg_gen_extract_* and tcg_gen_sextract_* for extraction of
fixed position bitfields, much like we already have for deposit.
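
Their semantics, as a sketch in plain C (demo helpers mirroring what
extract/sextract compute; valid for 0 < len and pos + len <= 32):

    #include <stdint.h>

    /* extract: len bits starting at pos, zero-extended. */
    static uint32_t extract32_demo(uint32_t x, int pos, int len)
    {
        return (x >> pos) & (~0u >> (32 - len));
    }

    /* sextract: the same field, sign-extended from its top bit. */
    static int32_t sextract32_demo(uint32_t x, int pos, int len)
    {
        return (int32_t)(x << (32 - len - pos)) >> (32 - len);
    }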

Backports commit 7ec8bab3deae643b1ce579c2d65a244f30708330 from qemu
2018-03-01 13:21:30 -05:00
Jason Wang 29932d0719
memory: handle alias in memory_region_is_iommu()
Backports commit 12d37882f0c0def5dee1c21be5d8fea9c21baada from qemu
2018-03-01 13:06:18 -05:00
Jason Wang fdca6292a1
exec: introduce address_space_get_iotlb_entry()
This patch introduces a helper to query the iotlb entry for a
possible iova. This will be used by later device IOTLB API to enable
the capability for a dataplane (e.g vhost) to query the IOTLB.
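
A usage sketch, with the signature approximated from the description (as, iova
and is_write are placeholders):

    /* Ask how a single iova would translate right now; the returned
     * IOMMUTLBEntry carries the translated address, mask and permissions. */
    IOMMUTLBEntry entry = address_space_get_iotlb_entry(as, iova, is_write);
    if (entry.perm != IOMMU_NONE) {
        /* entry.translated_addr / entry.addr_mask describe the mapping */
    }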

Backports commit 052c8fa9983f553fdfa0d61034774070dd639c2b from qemu
2018-03-01 13:05:08 -05:00
Richard Henderson efad2631d2
translate-all: Avoid -Werror=switch-bool
gcc 5.3.0 diagnoses

translate-all.c: In function ‘alloc_code_gen_buffer’:
translate-all.c:756:17: error: switch condition has boolean value
switch (buf2 != MAP_FAILED) {
^

Backports commit f68808c7494b38764e1895a9852b994638b86536 from qemu
2018-03-01 13:01:50 -05:00
Jin Guojie 4ed2a37f6d
tcg-mips: Adjust qemu_ld/st for mips64
Backports commit f0d703314ecb0415d51425727ed73ad2c6e3238a from qemu
2018-03-01 13:01:05 -05:00
Jin Guojie 25b4e11814
tcg-mips: Adjust calling conventions for mips64
Backports commit 999b941633cabf2487d9bc77ce382b3fde3cd66d from qemu
2018-03-01 12:53:42 -05:00
Jin Guojie 3de761976c
tcg-mips: Adjust prologue for mips64
Take stack frame parameters out from the function body.

Backports commit 0973b1cff8b66f3561befb1f467b2ab4d1a7d55a from qemu
2018-03-01 12:51:36 -05:00
Jin Guojie b55b7403a8
tcg-mips: Adjust load/store functions for mips64
tcg_out_ldst: using a generic ALIAS_PADD to avoid ifdefs
tcg_out_ld: generates LD or LW
tcg_out_st: generates SD or SW

Backports commit 32b69707df3365aadaad1d058044a7704397ec62 from qemu
2018-03-01 12:50:12 -05:00
Jin Guojie 022ff3580e
tcg-mips: Adjust move functions for mips64
tcg_out_mov: using OPC_OR as most mips assemblers do;
tcg_out_movi: extended to 64-bit immediate.

Backports commit 2294d05dab503d11664e73712c7f250fd0bf9e3b from qemu
2018-03-01 12:49:19 -05:00
Jin Guojie 00ccf9cec7
tcg-mips: Add bswap32u and bswap64
Without the mips32r2 instructions to perform swapping, bswap is quite large,
dominating the size of each reverse-endian qemu_ld/qemu_st operation.

Create two subroutines in the prologue block. The subroutines require extra
reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that
we need not place additional restrictions on the qemu_ld outputs.

Backports commit 7f54eaa3b78d71cb57e45a719980f9b5ff06d21c from qemu
2018-03-01 12:47:45 -05:00
Jin Guojie 397db1b046
tcg-mips: Support 64-bit opcodes
Bulk patch adding 64-bit opcodes into tcg_out_op. Note that
mips64 is as yet neither complete nor enabled.

Backports commit 0119b1927d531f3fac22b9b4da01dafc23644973 from qemu
2018-03-01 12:46:18 -05:00
Jin Guojie 286f3a9f70
tcg-mips: Add mips64 opcodes
Since the mips manual tables are in octal, reorg all of the opcodes
into that format for clarity. Note that the 64-bit opcodes are as
yet unused.

Backports commit 57a701fc2b34902310d4dbd1411088055616938a from qemu
2018-03-01 12:36:20 -05:00
Jin Guojie d2aa49e9d3
tcg-mips: Move bswap code to a subroutine
Without the mips32r2 instructions to perform swapping, bswap is quite large,
dominating the size of each reverse-endian qemu_ld/qemu_st operation.

Create a subroutine in the prologue block. The subroutine requires extra
reserved registers (TCG_TMP[2, 3]). Using these within qemu_ld means that
we need not place additional restrictions on the qemu_ld outputs.

Backports commit bb08afe9f0aee1a3f5c23508e2511b882ca31e1b from qemu
2018-03-01 12:35:20 -05:00
Laurent Vivier 5d4a5e9ba7
target-m68k: free TCG variables that are not
This is a cleanup patch. It adds calls to tcg_temp_free()
where they are missing.

Backports commit 2b5e2170678af36df48ab4b05dff81fe40b41a65 from qemu
2018-03-01 12:27:43 -05:00
Laurent Vivier 74beed5a4d
target-m68k: add rol/ror/roxl/roxr instructions
Backports commit 0194cf31cfc84516d10eca354146673150e10410 from qemu
2018-03-01 12:21:02 -05:00
Richard Henderson cf9424d60d
target-m68k: Inline shifts
Also manage word and byte operands and fix the computation of
overflow in the case of M68000 arithmetic shifts.

Backports commit 367790cce8e14131426f5190dfd7d1bdbf656e4d from qemu
2018-03-01 12:11:10 -05:00
Richard Henderson 0c00b036be
target-m68k: Do not cpu_abort on undefined insns
Report this properly via exception and, importantly, allow
the disassembler the chance to tell us what insn is not handled.

Backports commit 72d2e4b6a437f11f97d3138f6b2ec177b78210c7 from qemu
2018-03-01 12:02:39 -05:00
Laurent Vivier 90b0b6d867
target-m68k: Implement 680x0 movem
680x0 movem can load/store words and long words and can use more
addressing modes. Coldfire can only use long words with (Ax) and
(d16,Ax) addressing modes.

Backports commit 7b542eb96d7d5d9266a9c0425f05d49c8e6df2f9 from qemu
2018-03-01 12:01:46 -05:00
Laurent Vivier 2d318da080
target-m68k: add cas/cas2 ops
Implement CAS using cmpxchg.
Implement CAS2 using a helper and either cmpxchg when
the 32bit addresses are consecutive, or with
parallel_cpus+cpu_loop_exit_atomic() otherwise.

Backports commit 14f944063affbcc7bd6df42b060793dbfee8a822 from qemu
2018-03-01 11:56:26 -05:00
Laurent Vivier f602f9d40c
target-m68k: add abcd/sbcd/nbcd
Backports commit fb5543d820018a46b713911e7653594be727ca98 from qemu
2018-03-01 11:51:49 -05:00
Laurent Vivier 90b87b3580
target-m68k: add 680x0 divu/divs variants
Update helper to set the throwing location in case of div-by-0.
Cleanup divX.w and add quad word variants of divX.l.

Backports commit 0ccb9c1d8128a020720d5c6abf99a470742a1b94 from qemu
2018-03-01 11:42:30 -05:00
Laurent Vivier 77b8b2f3b8
target-m68k: add 680x0 divu/divs variants
Update helper to set the throwing location in case of div-by-0.
Cleanup divX.w and add quad word variants of divX.l.

Backports commit 0ccb9c1d8128a020720d5c6abf99a470742a1b94 from qemu
2018-03-01 11:38:53 -05:00
Laurent Vivier f3990e8f87
target-m68k: add 64bit mull
Backports commit 8be95defd6ab10d2c9f986879a0afa82417cb8e5 from qemu
2018-03-01 11:32:59 -05:00
Laurent Vivier e1c7d37556
target-m68k: add cmpm
Backports commit 817af1c72d227fd5759ef882bef61acee40679b1 from qemu
2018-03-01 11:29:35 -05:00
Richard Henderson e6ca471dda
target-m68k: Split gen_lea and gen_ea
Provide gen_lea_mode and gen_ea_mode, where the mode can be
specified manually, rather than taken from the instruction.

Backports commit f84aab269ddab8509b77408b886e9071bf5c48fb from qemu
2018-03-01 11:27:43 -05:00
Richard Henderson 5541553e8d
target-m68k: Delay autoinc writeback
Backports commit 8a1e52b69d2cd1c633f3c473a213d575931bf46d from qemu
2018-03-01 11:18:36 -05:00
Cédric Le Goater c7ab1e782b
target-arm: Add VBAR support to ARM1176 CPUs
ARM1176 CPUs have TrustZone support and can use the Vector Base
Address Register, but currently, qemu only adds VBAR support to ARMv7
CPUs. Fix this by adding a new feature ARM_FEATURE_VBAR which can be used
for ARMv7 and ARM1176 CPUs.

The VBAR feature is always set for ARMv7 because some legacy boards
require it even if this is not architecturally correct.

Backports commit 91db4642f868cf2e591b62d31a19d35b02ea791e from qemu
2018-03-01 11:12:29 -05:00
Peter Maydell 554ad1f34e
target-arm: Log AArch64 exception returns
We already log exception entry; add logging of the AArch64 exception
return path as well.

Backports commit c9b61d9aa1ad234b0961f8add023cdc999cda3da from qemu
2018-03-01 11:08:35 -05:00
Julian Brown 86670028f7
Correct value of ARM Cortex-A8 MVFR1 register.
The value of the MVFR1 (Media and VFP Feature Register 1) register for
the Cortex-A8 appears to be incorrect (according to the TRM, DDI0344K),
with the "full denormal arithmetic" and "propagation of NaN" fields
holding both 0 instead of both 1.

I had a go tracing the history of the use of this value, and it seems
it's always just been wrong in QEMU: maybe it was derived from early
documentation, or guessed based on the use of a "VFP Lite" implementation
in the Cortex-A8.

Depending on the startup/early-boot code in use, this can manifest as
failure to perform denormal arithmetic properly: in our case, selecting
a Cortex-A8 CPU when using QEMU as an instruction-set simulator for
bare-metal GCC testing caused tests using denormal arithmetic to
fail. Problems might be masked (or not occur) when using a full OS kernel
with suitable trap handlers (I'm not sure).

Backports commit 0f1944735b6bac810b067e8a7a5154744536fd59 from qemu
2018-03-01 11:07:08 -05:00
Richard Henderson fcc05dc1ce
tcg/s390: Remove 'R' constraint
Since R0 is reserved, we don't need a special case constraint.

Backports commit e45d4ef6e345831c8d67a5bffe0d057efc20f4ff from qemu
2018-03-01 11:05:57 -05:00
Richard Henderson 7852cc600d
tcg/s390: Fix setcond expansion
We can't use LOAD AND TEST for unsigned data and then expect to
extract the result with ADD LOGICAL WITH CARRY. Fall through to
using COMPARE LOGICAL IMMEDIATE instead.

Backports commit 65839b56b9a740e6b898b5d81afc160502bd2935 from qemu
2018-03-01 11:04:40 -05:00
Kirill A. Shutemov eb489625b5
x86: implement la57 paging mode
The new paging mode is an extension of IA32e mode with one additional page
table level.

It brings support for a 57-bit virtual address space (128PB) and a 52-bit
physical address space (4PB).

The structure of the new page table level is identical to pml4.

The feature is enumerated with CPUID.(EAX=07H, ECX=0):ECX[bit 16].

CR4.LA57[bit 12] needs to be set when paging is enabled to activate 5-level
paging mode.
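
As a sketch of how those two bits relate (the constant names below are
illustrative; the bit positions are the ones quoted above):

    #include <stdbool.h>
    #include <stdint.h>

    #define CPUID_7_0_ECX_LA57_BIT  (1u << 16)  /* CPUID.(EAX=07H,ECX=0):ECX[16] */
    #define CR4_LA57_BIT            (1u << 12)  /* CR4.LA57 */

    /* 5-level (57-bit VA) paging applies only when IA32e paging is enabled
     * and CR4.LA57 is set; otherwise the walk stays at 4 levels (48-bit VA). */
    static bool la57_active(uint32_t cr4, bool ia32e_paging_enabled)
    {
        return ia32e_paging_enabled && (cr4 & CR4_LA57_BIT);
    }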

Backports commit 6c7c3c21f95dd9af8a0691c0dd29b07247984122 from qemu
2018-03-01 11:02:07 -05:00
Doug Evans 7c874b1b2b
target-i386: Fix eflags.TF/#DB handling of syscall/sysret insns
The syscall and sysret instructions behave a bit differently:
TF is checked after the instruction completes.
This allows the o/s to disable #DB at a syscall by adding TF to FMASK.
And then when the sysret is executed the #DB is taken "as if" the
syscall insn just completed.

Backports commit c52ab08aee6f7d4717fc6b517174043126bd302f from qemu
2018-03-01 10:56:22 -05:00
Yi Sun f6e624d97b
target-i386: Add Intel SHA_NI instruction support.
Add SHA_NI feature bit. Its spec can be found at:
https://software.intel.com/sites/default/files/managed/39/c5/325462-sdm-vol-1-2abcd-3abcd.pdf

Backports commit 638cbd452d3a92a2ab18caee73078483d90f64eb from qemu
2018-03-01 10:52:54 -05:00
Paolo Bonzini 81ad780e5e
exec: introduce MemoryRegionCache
Device models often have to perform multiple accesses to a single
memory region that is known in advance, but would like to use "DMA-style"
functions instead of address_space_map/unmap. This can happen
for example when the data has to undergo endianness conversion.
Introduce a new data structure to cache the result of
address_space_translate without forcing usage of a host address
like address_space_map does.
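
A hedged usage sketch of the resulting API (as and desc_addr stand in for a
real address space and guest address; exact signatures approximated):

    /* Translate the region once, then issue cheap repeated accesses against
     * the cache instead of re-translating or mapping a host pointer. */
    MemoryRegionCache cache;
    uint8_t desc[16];

    if (address_space_cache_init(&cache, as, desc_addr, sizeof(desc), false) >= 0) {
        address_space_read_cached(&cache, 0, desc, sizeof(desc));
        /* ... byte-swap / pick fields out of desc ... */
        address_space_cache_destroy(&cache);
    }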

Backports commit 1f4e496e1fc2eb6c8bf377a0f9695930c380bfd3 from qemu
2018-03-01 10:50:30 -05:00
Paolo Bonzini 03a9bbb3d3
exec: introduce address_space_extend_translation
This extracts the common part of address_space_map and
address_space_cache_init into a new function.

Backports commit 715c31ec8e12107f47ac74b464c97e813c76f898 from qemu
2018-03-01 10:06:43 -05:00
Paolo Bonzini 88ad0f4f6e
exec: introduce memory_ldst.inc.c
Templatize the address_space_* and *_phys functions, so that we can add
similar functions in the next patch that work with a lightweight,
cache-like version of address_space_map/unmap.

Backports commit 0ce265ffef87f19f4dd1ff0663e09a63d66ae408 from qemu
2018-03-01 09:59:34 -05:00
Paolo Bonzini f3cc489a72
exec: optimize remaining address_space_* cases
Do them right before the next patch generalizes them into a multi-included
file.

Backports commit 2651efe7f5f9d6dc89c8e54d7d63952b7b22597d from qemu
2018-03-01 09:40:29 -05:00
Richard Henderson 0f94929fa7
target-arm: Fix aarch64 vec_reg_offset
Since CPUARMState.vfp.regs is not 16 byte aligned, the ^ 8 fixup used
for a big-endian host doesn't do what's intended. Fix this by adding
in the vfp.regs offset after computing the inter-register offset.

Backports commit d437262fa8edd0d9fbe038a515dda3dbf7c5bb54 from qemu
2018-03-01 09:36:03 -05:00
Richard Henderson ce9dca9c5e
target-arm: Fix aarch64 disas_ldst_single_struct
We add s->be_data within do_vec_ld/st. Adding it here means that
we have the wrong bits set in SIZE for a big-endian host, leading
to g_assert_not_reached in write_vec_element and read_vec_element.

Backports commit 74b13f92c2428abae41a61c46a5cf47545da5fcb from qemu
2018-03-01 09:34:34 -05:00
Paolo Bonzini 560515941a
target-i386: correctly propagate retaddr into SVM helpers
Commit 2afbdf8 ("target-i386: exception handling for memory helpers",
2015-09-15) changed tlb_fill's cpu_restore_state+raise_exception_err
to raise_exception_err_ra. After this change, the cpu_restore_state
and raise_exception_err's cpu_loop_exit are merged into
raise_exception_err_ra's cpu_loop_exit_restore.

This actually fixed some bugs, but when SVM is enabled there is a
second path from raise_exception_err_ra to cpu_loop_exit. This is
the VMEXIT path, and now cpu_vmexit is called without a
cpu_restore_state before.

The fix is to pass the retaddr to cpu_vmexit (via
cpu_svm_check_intercept_param). All helpers can now use GETPC() to pass
the correct retaddr, too.

Backports commit 823fb688ebc52a7d79c1308acb28c92b56820167 from qemu
2018-03-01 09:31:16 -05:00
Richard Henderson f6a72d4dca
target/sparc: Restore ldstub of odd asis
Fixes the booting of ss20 roms.

Backports commit f61f76cb3ff9fea6ecbcb9696ed82b3e2c5b7364 from qemu
2018-03-01 09:21:44 -05:00
Paolo Bonzini 9404dbf74e
cpu-exec: fix icount out-of-bounds access
When icount is active, tb_add_jump is surprisingly called with an
out of bounds basic block index. I have no idea how that can work,
but it does not seem like a good idea. Clear *last_tb for all
TB_EXIT_ICOUNT_EXPIRED cases, even when all you have to do is
refill icount_extra.

Backports commit d8dea6fbcbed177ca5d23ab77b3834a9437f0e88 from qemu
2018-03-01 09:17:26 -05:00
Richard Henderson 6820964e2f
tcg/aarch64: Fix tcg_out_movi
There were some patterns, like 0x0000_ffff_ffff_00ff, for which we
would select to begin a multi-insn sequence with MOVN, but would
fail to set the 0x0000 lane back from 0xffff.

Backports commit 50b468d42107a2c646b1c566ed17d9ec362c51c4 from qemu
2018-03-01 09:15:34 -05:00
Richard Henderson a03666f2f2
tcg/aarch64: Fix addsub2 for 0+C
When al == xzr, we cannot use addi/subi because that encodes xsp.
Force a zero into the temp register for that (rare) case.

Backports commit 028fbea47713f909d6ea761a457779a82b276247 from qemu
2018-03-01 09:13:54 -05:00
Roman Kapl 337f57dd2c
exec: Add missing rcu_read_unlock
rcu_read_unlock was not called if the address_space_access_valid result is
negative.

This caused (at least) a problem when qemu on PPC/E500+TAP failed to terminate
properly and instead got stuck in a deadlock.

Backports commit 662a97d74f9b34cafe9aeb6d96620a97d768a1fa from qemu
2018-03-01 09:12:27 -05:00
Peter Maydell 488b6cc82b
exec.c: Fix breakpoint invalidation race
A bug (1647683) was reported showing a crash when removing
breakpoints. The reproducer was bisected to 3359baad when tb_flush
was finally made thread safe. While in MTTCG the locking in
breakpoint_invalidate would have prevented any problems, but
currently tb_lock() is a NOP for system emulation.

The race is between a tb_flush from the gdbstub and the
tb_invalidate_phys_addr() in breakpoint_invalidate().

Ideally we'd have actual locking here; for the moment the
simple fix is to do a full tb_flush() for a bp invalidate,
since that is thread-safe even if no lock is taken.

Backports commit a9353fe897ca2687e5b3385ed39e3db3927a90e0 from qemu
2018-03-01 09:10:59 -05:00
Alex Bennée f4c65739f3
target-arm/translate-a64: fix gen_load_exclusive
While testing rth's latest TCG patches with risu I found ldaxp was
broken. Investigating further I found it was broken by 1dd089d0 when
the cmpxchg atomic work was merged. As part of that change the code
attempted to be clever by doing a single 64 bit load and then shuffle
the data around to set the two 32 bit registers.

As I couldn't quite follow the endian magic I've simply partially
reverted the change to the original code gen_load_exclusive code. This
doesn't affect the cmpxchg functionality as that is all done on in
gen_store_exclusive part which is untouched.

I've also restored the comment that was removed (with a slight tweak
to mention cmpxchg).

Backports commit 5460da501a57cd72eda6fec736d76539122e2f99 from qemu
2018-03-01 09:09:16 -05:00
Marc-André Lureau b0e5d04813
qapi: add missing colon-ending for section name
The documentation parser we are going to add expects a section name to
end with ':', otherwise the comment is treated as free-form text body.

Backports commit 5072f7b38b1b9b26b8fbe1a89086386a420aded8 from qemu
2018-03-01 09:07:10 -05:00
Yongbok Kim 8575514f4c
target-mips: fix bad shifts in {dextp|dextpdp}
Fixed issues in the MIPSDSP64 instructions dextp and dextpdp.
Shifting can go out of 32 bit range.

https://bugs.launchpad.net/qemu/+bug/1631625

Backports commit e6e2784cacd4cfec149a7690976b9ff15e541c4d from qemu
2018-03-01 09:04:41 -05:00
Heiher b5468b4b22
target-mips: Fix Loongson multimedia instructions.
We need to emit an FPU exception when Loongson multimedia instructions
execute while Status.CU1 is clear, or FPR changes may be missed
on Linux.

Backports commit b5a587b613f6151c2ce164552579ae64f2ddfd1c from qemu
2018-03-01 09:03:49 -05:00
Heiher 12a8570cbe
target-mips: Fix Loongson multimedia 'or' instruction.
Backports commit bb7cab5f3466540f5603b209c0df2e27a02fbb95 from qemu
2018-03-01 09:03:10 -05:00
Heiher ba39cb4fcb
target-mips: Fix Loongson pandn instruction.
pandn FD, FS, FT
Operation: FD = ((NOT FS) AND FT)

Backports commit 9099a36b4bb81f84004b77f08e58ac2c67eed0e7 from qemu
2018-03-01 09:02:34 -05:00
Laurent Vivier 7055c38183
target-m68k: fix muluw/mulsw
"The multiplier and multiplicand are both word operands, and the result
is a long-word operand."

So compute flags on a long-word result, not on a word result.
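
In C terms, the point is roughly this (a sketch for muls.w; mulu.w is the
same with unsigned operands):

    #include <stdint.h>

    static uint32_t muls_w(int16_t a, int16_t b, int *n, int *z)
    {
        int32_t res = (int32_t)a * (int32_t)b;  /* full long-word product */
        *n = res < 0;                           /* flags from all 32 bits ... */
        *z = res == 0;                          /* ... not from (uint16_t)res */
        return (uint32_t)res;
    }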

Backports commit 4a18cd44f3c905d443c26e26bb9b09932606d1a3 from qemu
2018-03-01 08:59:42 -05:00
Laurent Vivier 527c68f40e
target-m68k: Fix cmpa operand size
"The size of the operation can be specified as word or long.
Word length source operands are sign-extended to 32 bits for
comparison."

So comparison is always done using OS_LONG.

Backports commit 5436c29d78957a6825a93f0eb79dfab388641017 from qemu
2018-03-01 08:58:59 -05:00
Laurent Vivier 69687e1824
target-m68k: fix EXG instruction
opcodes of "EXG Ax,Ay" and "EXG Dx,Dy" have been swapped

Backports commit c090c97d925ce751d8834d5c5a404952598f67c0 from qemu
2018-03-01 08:57:45 -05:00
Bobby Bingham d46e52d9d0
cpu_ldst.h: use correct guest address parameter
In the user emulation code path, tlb_vaddr_to_host erroneously passed
vaddr as the guest address to be translated, instead of addr, the parameter
which actually contained the guest address.

This resulted in incorrect addresses being used when emulating block copy
(mvc/mvpg) and block clear (xc) instructions for the s390x target.

Backports commit c2a85316902e67530da9d6548139fcce73c0cac6 from qemu
2018-03-01 08:56:37 -05:00
Ed Maste 6deaf98a8b
Fix FreeBSD (10.x) build after 7dc9ae43
Include sys/user.h for declaration of 'struct kinfo_proc'.
Add -lutil to qemu-ga link for kinfo_getproc.

Backports commit a7764f1548ef9946af30a8f96be9cef10761f0c1 from qemu
2018-03-01 08:55:43 -05:00
Luwei Kang 57533d1adc
x86: add AVX512_4VNNIW and AVX512_4FMAPS features
The spec can be found in Intel Software Developer Manual or in
Instruction Set Extensions Programming Reference.

Backports commit 95ea69fb46266aaa46d0c8b7f0ba8c4903dbe4e3 from qemu
2018-03-01 08:51:09 -05:00
Alex Bennée 1e4154af83
exec.c: ensure all AddressSpaceDispatch updates under RCU
The memory_dispatch field is meant to be protected by RCU so we should
use the correct primitives when accessing it. This race was flagged up
by the ThreadSanitizer.

Backports commit f35e44e7645edbb08e35b111c10c2fc57e2905c7 from qemu
2018-03-01 08:44:19 -05:00
Joseph Myers 7ff441826c
tcg: correct 32-bit tcg_gen_ld8s_i64 sign-extension
The version of tcg_gen_ld8s_i64 for 32-bit systems does a load into
the low part of the return value - then attempts a sign extension into
the high part, but wrongly sets the high part to a sign extension of
itself rather than of the low part. This results in TCG internal
errors from the use of the uninitialized high part (in some GCC tests
of AArch64 NEON shift intrinsics, in particular). This patch corrects
the sign-extension logic, making it match other functions such as
tcg_gen_ld16s_i64.
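
On a 32-bit host the i64 result lives in a low/high register pair; in effect
the corrected expansion is (a sketch, arg2/offset as in the tcg-op interface):

    /* Load the byte sign-extended into the low half, then derive the high
     * half from the low half's sign bit, not from the high half itself. */
    tcg_gen_ld8s_i32(TCGV_LOW(ret), arg2, offset);
    tcg_gen_sari_i32(TCGV_HIGH(ret), TCGV_LOW(ret), 31);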

Backports commit 3ff91d7e85176f8b4b131163d7fd801757a2c949 from qemu
2018-03-01 08:41:23 -05:00
Peter Maydell f9c5c1a604
tcg/tcg.h: Improve documentation of TCGv_i32 etc types
The typedefs we use for the TCGv_i32, TCGv_i64 and TCGv_ptr
types are somewhat confusing, because we define them as
pointers to structs, but the structs themselves are never
defined. Explain in the comments a bit more clearly why
this is OK and what is going on under the hood.

Backports commit a40d4701bc9f6e6a3bbfb7b4fbe756a5b72b5df1 from qemu
2018-03-01 08:40:35 -05:00
Richard Henderson f5a35908da
tcg: Add tcg_gen_mulsu2_{i32,i64,tl}
This multiply has one signed input and one unsigned input,
producing the full double-width result.
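
The standard way to build it on top of an unsigned full multiply, sketched
with GCC's __int128 for brevity:

    #include <stdint.h>

    /* Treat the signed operand as unsigned, do the full unsigned multiply,
     * then subtract b from the high half when a was negative, because the
     * unsigned view of a is a + 2^64 in that case. */
    static void mulsu2_64(int64_t a, uint64_t b, uint64_t *lo, uint64_t *hi)
    {
        unsigned __int128 p = (unsigned __int128)(uint64_t)a * b;
        *lo = (uint64_t)p;
        *hi = (uint64_t)(p >> 64);
        if (a < 0) {
            *hi -= b;
        }
    }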

Backports commit 5087abfb7dfd1d368ae6939420057036b4d8e509 from qemu
2018-03-01 08:39:37 -05:00
Richard Henderson f9d91a81b5
target-sparc: Use tcg_gen_atomic_cmpxchg_tl
Backports commit 5a7267b6a9e94c264ca77a7ca5a239e70dac81da from qemu
2018-03-01 08:34:35 -05:00
Richard Henderson 47313adedd
target-sparc: Use tcg_gen_atomic_xchg_tl
Backports commit da1bcae65288bdd51e0a7203d1e6c9cde1be5b3d from qemu
2018-03-01 08:32:10 -05:00
Richard Henderson 6b09040e23
target-sparc: Remove MMU_MODE*_SUFFIX
The functions that these generate are no longer used.

Backports commit 47b2696b975b794c6fa7b9fa8ae4699e749d662c from qemu
2018-03-01 08:30:27 -05:00
Richard Henderson 00fc847229
target-sparc: Allow 4-byte alignment on fp mem ops
The cpu is allowed to require stricter alignment on these 8- and 16-byte
operations, and the OS is required to fix up the accesses as necessary,
so the previous code was not wrong.

However, we can easily handle this misalignment for all direct 8-byte
operations and for direct 16-byte loads.

We must retain 16-byte alignment for 16-byte stores, so that we don't have
to probe for writability of a second page before performing the first of
two 8-byte stores. We also retain 8-byte alignment for no-fault loads,
since they are rare and it's not worth extending the helpers for this.

Backports commit cb21b4da6cca1bb4e3f5fefb698fb9e4d00c8f66 from qemu
2018-03-01 08:29:11 -05:00
Richard Henderson eec264526e
target-sparc: Implement ldqf and stqf inline
At the same time, fix a problem with stqf_asi, when
a write might access two pages.

Backports commit f939ffe5a022a8798824e2720ed5a14186fca6b6 from qemu
2018-03-01 08:20:36 -05:00
Richard Henderson 3a25695841
target-sparc: Remove asi helper code handled inline
Now that we never call out to helpers when direct accesses can
handle an asi, remove the corresponding code in those helpers.
For ldda, this removes the entire helper.

Backports commit 918d9a2c9d36378a3cf6636018900a4731c83b9d from qemu
2018-03-01 08:14:31 -05:00
Richard Henderson 15c8bf0b42
target-sparc: Implement BCOPY/BFILL inline
Backports commit 34810610acbde7a0745be3a88e99f2ef9282260f from qemu
2018-02-28 12:54:10 -05:00
Richard Henderson 3c48eb4aaf
target-sparc: Implement cas_asi/casx_asi inline
Backports commit 7268adebfda6548b8ae6865dc8337f116a5d266d from qemu
2018-02-28 12:47:26 -05:00
Richard Henderson b28b5cd3d3
target-sparc: Implement ldstub_asi inline
Backports commit fbb4bbb62e5603c991b880e25dc4bb30d342b944 from qemu
2018-02-28 12:42:26 -05:00
Richard Henderson adf9faf075
target-sparc: Implement swap_asi inline
Backports commit 4fb554bc6c88eb45270a3ad3cf6e6e2ad476aede from qemu
2018-02-28 12:39:55 -05:00
Richard Henderson ebc292c174
target-sparc: Handle more twinx asis
As used by HelenOS, presumably for ultra 2 and 3,
prior to the sun4v platform and the current twinx names.

Backports commit 34a6e13da70b2c798630a8dbd03d09f201c0198f from qemu
2018-02-28 12:28:08 -05:00
Richard Henderson ecbeea7c56
target-sparc: Use MMU_PHYS_IDX for bypass asis
Backports commit 7f87c90527d7363e8cecf1c6b5ad3d4cc85d3d28 from qemu
2018-02-28 12:26:29 -05:00
Richard Henderson 15eea419e5
target-sparc: Add MMU_PHYS_IDX
It's handy to have a mmu idx for physical addresses, so
that mmu disabled and physical access asis can use the
same path as normal accesses.

Backports commit af7a06bac7d3abb2da48ef3277d2a415772d2ae8 from qemu
2018-02-28 12:24:17 -05:00
Richard Henderson 9e60a8e432
target-sparc: Introduce cpu_raise_exception_ra
Several helpers call helper_raise_exception directly, which requires
in turn that their callers have performed save_state. The new function
allows a TCG return address to be passed in so that we can restore
PC + NPC + flags data from that.

This fixes a bug in the usage of helper_check_align, whose callers had
not been calling save_state. It fixes another bug in which the divide
helpers used GETPC at a level other than the direct callee from TCG.

This allows the translator to avoid save_state prior to SAVE, RESTORE,
and FLUSHW instructions.

Backports commit 2f9d35fc4006122bad33f9ae3e2e51d2263e98ee from qemu
2018-02-28 12:15:06 -05:00
Richard Henderson 62ae2a5102
target-sparc: Use overalignment flags for twinx and block asis
This allows us to enforce 16 and 64-byte alignment
without any extra overhead.

Backports commit 808832277af11dafee5a55da2b9e41d019b879ca from qemu
2018-02-28 12:01:50 -05:00
Alex Bennée da124da4b1
tcg: move locking for tb_invalidate_phys_page_range up
In the linux-user case all things that involve 'l1_map' and PageDesc
tweaks are protected by the memory lock (mmap_lock). For SoftMMU mode
we previously relied on single threaded behaviour, with MTTCG we now use
the tb_lock().

As a result we need to do a little re-factoring and push the taking of
this lock up the call tree. This requires a slightly different entry for
the SoftMMU and user-mode cases from tb_invalidate_phys_range.

This also means user-mode breakpoint insertion needs to take two locks
but it hadn't taken any previously so this is an improvement.

Backports commit ba051fb5e56d5ff5e4fa672d37954452e58543b2 from qemu
2018-02-28 10:35:41 -05:00
Paolo Bonzini 9d64a89acf
tcg: comment on which functions have to be called with tb_lock held
softmmu requires more functions to be thread-safe, because translation
blocks can be invalidated from e.g. notdirty callbacks. Probably the
same holds for user-mode emulation, it's just that no one has ever
tried to produce a coherent locking there.

This patch will guide the introduction of more tb_lock and tb_unlock
calls for system emulation.

Note that after this patch some (most) of the mentioned functions are
still called outside tb_lock/tb_unlock. The next one will rectify this.

Backports commit 7d7500d99895f888f97397ef32bb536bb0df3b74 from qemu
2018-02-28 10:26:28 -05:00
Alex Bennée 7aab0bd9a6
translate-all: add DEBUG_LOCKING asserts
This adds asserts to check the locking on the various translation
engines structures. There are two sets of structures that are protected
by locks.

The first is the l1_map and PageDesc structures used to track which
translation blocks are associated with which physical addresses. In
user-mode this is covered by the mmap_lock.

The second is the TB context related structures, which are protected by
tb_lock; this is currently also user-mode only.

Currently the asserts do nothing in SoftMMU mode but this will change
for MTTCG.

Backports commit 301e40ed8005306c009978be295ed9a4b725178b from qemu
2018-02-28 08:56:15 -05:00
Alex Bennée 075aaad106
translate_all: DEBUG_FLUSH -> DEBUG_TB_FLUSH
Make the debug define consistent with the others. The flush operation is
all about invalidating TranslationBlocks on flush events.

Also fix up the commenting on the other DEBUG for the benefit of
checkpatch.

Backports commit 955939a2b51f72bea1c200b559ea39985df5a633 from qemu
2018-02-28 08:53:38 -05:00
Anand J 8278af45cd
clean-up: removed duplicate #includes
Some files contain multiple #includes of the same header file.
Removed most of those unnecessary duplicate entries using
scripts/clean-includes.

Backports commit 814bb12a561d36aeb5ae4440ad43d2b0761d76da from qemu
2018-02-28 08:51:56 -05:00
Wei Huang bceed21d23
arm: Add an option to turn on/off vPMU support
This patch adds a pmu=[on/off] option to enable/disable vPMU support
in guest vCPU. It allows virt tools, such as libvirt, to determine the
existence of vPMU and configure it. Note this option is only available
for cortex-a57/cortex-a53 host CPUs, but unavailable on ARMv7 and other
processors. Also even though "pmu=" option is available for TCG mode,
setting it doesn't turn PMU on.

Backports commit 929e754d5a621cd53f30e69b766ccf381b58d124 from qemu
2018-02-28 08:49:23 -05:00
Laurent Vivier 5daf91ea48
target-m68k: immediate ops manage word and byte operands
Backports commit 92c62548f69cb4ba739d7d046e9caf9ea75753e4 from qemu
2018-02-28 08:42:22 -05:00
Laurent Vivier f7c29f73b3
target-m68k: cmp manages word and byte operands
Backports commit ff99b952c8280853801fe14f7ae62d0f87464f7d from qemu
2018-02-28 08:37:46 -05:00
Laurent Vivier fc28e8127f
target-m68k: add/sub manage word and byte operands
Backports commit 8a370c6cb770b618f7eb66628116c25e84588df8 from qemu
2018-02-28 07:18:25 -05:00
Laurent Vivier bc27695926
target-m68k: add addressing modes to neg
Backports commit 227de713e0f4224a82c32991b4e4c4973381426b from qemu
2018-02-28 07:07:28 -05:00
Laurent Vivier 3558b93f11
target-m68k: introduce byte and word cc_ops
Backports commit db3d7945ae7992c91cc5705dccf60fec79b24dc4 from qemu
2018-02-28 06:52:16 -05:00
Laurent Vivier 4e257ffda9
target-m68k: some bit ops cleanup
Backports commit 3c980d2ef664e6d5a1a0c98aca4d11d33b17ca59 from qemu
2018-02-28 01:25:58 -05:00
Laurent Vivier cfab571859
target-m68k: suba/adda can manage word operand
Backports commit 415f4b62eb4629bd3702e6fb8aa51437a92983ff from qemu
2018-02-28 01:20:23 -05:00
Laurent Vivier 99c297efe3
target-m68k: and can manage word and byte operands
Backports commit 52dc23c5956159a79a4e2d4193e44d2c4cf3883c from qemu
2018-02-28 01:19:02 -05:00
Laurent Vivier 41372b0cc9
target-m68k: or can manage word and byte operands
Backports commit 020a4659208a6f9a985881504fd4d3b44ab589be from qemu
2018-02-28 01:15:27 -05:00
Laurent Vivier e140aac281
target-m68k: eor can manage word and byte operands
Backports commit eec37aec85af9f5fd59b534d20c86a775b8e7973 from qemu
2018-02-28 01:05:21 -05:00
Laurent Vivier bc52777b00
target-m68k: add addressing modes to not
Backports commit ea4f2a844132c81f1e6b51fed7019686ce4e3bc5 from qemu
2018-02-28 01:03:38 -05:00
Richard Henderson 549e31cc72
target-m68k: Inline addx, subx, negx
And add opcodes for 680x0

Backports commit a665a820e5d46b1611f409fbc7a540fe1c6bf5c8 from qemu
2018-02-28 01:02:31 -05:00
Laurent Vivier b796f934ff
target-m68k: add dbcc
Backports commit beff27ab3a60d8abab4a166670ca79b3c0970005 from qemu
2018-02-28 00:45:39 -05:00
Laurent Vivier 977c3fe6c4
target-m68k: add addressing modes to scc
Backports commit d5a3cf33f2f65069d2f79a6e349f0d8140f02bb4 from qemu
2018-02-28 00:43:30 -05:00
Laurent Vivier 77b1754376
target-m68k: add exg ops
Backports commit 29cf437da4eeacb46cd7076014d06c85ca47c91d from qemu
2018-02-28 00:37:41 -05:00
Laurent Vivier 56882899be
target-m68k: add linkl
Backports commit c630e436c0ed3adc3a858c328119daf6d1b3357f from qemu
2018-02-28 00:31:27 -05:00
Laurent Vivier 59d6a1a744
target-m68k: add bkpt instruction
Backports commit 71600eda7cc48f03ea306bc69ed7e52ef1d9dd91 from qemu
2018-02-28 00:29:41 -05:00
Emilio G. Cota 22be035e60
target-arm: remove EXCP_STREX + cpu_exclusive_{test, info}
The exception is not emitted anymore; remove it and the associated
TCG variables.

Backports commit 05188cc72f0399e99c92f608a8e7ca4c8e552c4b from qemu
2018-02-28 00:24:20 -05:00
Emilio G. Cota cb92eea81a
target-arm: emulate aarch64's LL/SC using cmpxchg helpers
Emulating LL/SC with cmpxchg is not correct, since it can
suffer from the ABA problem. Portable parallel code, however,
is written assuming only cmpxchg--and not LL/SC--is available.
This means that in practice emulating LL/SC with cmpxchg is
a viable alternative.

The appended emulates LL/SC pairs in aarch64 with cmpxchg helpers.
This works in both user and system mode. In usermode, it avoids
pausing all other CPUs to perform the LL/SC pair. The subsequent
performance and scalability improvement is significant, as the
plots below show. They plot the throughput of atomic_add-bench
compiled for ARM and executed on a 64-core x86 machine.

Hi-res plots: http://imgur.com/a/JVc8Y
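
The emulation scheme itself is small; roughly (a standalone sketch, not the
QEMU helpers):

    #include <stdbool.h>
    #include <stdint.h>

    static uintptr_t excl_addr;   /* remembered by the load-exclusive */
    static uint64_t  excl_val;

    static uint64_t emu_ldxr(uint64_t *p)
    {
        excl_val  = __atomic_load_n(p, __ATOMIC_ACQUIRE);
        excl_addr = (uintptr_t)p;
        return excl_val;
    }

    /* Returns 0 on success, like STXR. May spuriously succeed on ABA, which
     * is acceptable for code written against cmpxchg-style portable atomics. */
    static int emu_stxr(uint64_t *p, uint64_t newval)
    {
        uint64_t expected = excl_val;
        if ((uintptr_t)p != excl_addr) {
            return 1;
        }
        return !__atomic_compare_exchange_n(p, &expected, newval, false,
                                            __ATOMIC_ACQ_REL, __ATOMIC_ACQUIRE);
    }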

[ASCII throughput plots elided: four panels of atomic_add-bench (1000000 ops/thread)
for operand ranges [0,1], [0,2], [0,128] and [0,1024], throughput versus number of
threads (0-60), comparing the cmpxchg-based emulation against master; the cmpxchg
curves rise with thread count while master stays near the bottom of each panel.]

Backports commit 1dd089d0eec060dcd8478735114d98421d414805 from qemu
2018-02-28 00:21:27 -05:00
Emilio G. Cota 3546558f66
target-arm: emulate SWP with atomic_xchg helper
Backports commit cf12bce088f22b92bf62ffa0d7f6a3e951e355a9 from qemu
2018-02-28 00:11:23 -05:00
Emilio G. Cota ec14a00925
target-arm: emulate LL/SC using cmpxchg helpers
Emulating LL/SC with cmpxchg is not correct, since it can
suffer from the ABA problem. Portable parallel code, however,
is written assuming only cmpxchg--and not LL/SC--is available.
This means that in practice emulating LL/SC with cmpxchg is
a viable alternative.

The appended emulates LL/SC pairs in ARM with cmpxchg helpers.
This works in both user and system mode. In usermode, it avoids
pausing all other CPUs to perform the LL/SC pair. The subsequent
performance and scalability improvement is significant, as the
plots below show. They plot the throughput of atomic_add-bench
compiled for ARM and executed on a 64-core x86 machine.

Hi-res plots: http://imgur.com/a/aNQpB

[ASCII plots, layout lost in this copy: four panels of atomic_add-bench
throughput, 1000000 ops/thread, for the [0,1], [0,2], [0,128] and
[0,1024] ranges; x axis is number of threads (0-60), series are
cmpxchg and master.]

Backports commit 354161b37c6465a32073eac5f16fa35939af2bb4 from qemu
2018-02-28 00:07:44 -05:00
Richard Henderson fd9933fbd5
target-arm: Rearrange aa32 load and store functions
Stop specializing on TARGET_LONG_BITS == 32; unconditionally allocate
a temp and expand with tcg_gen_extu_i32_tl. Split out gen_aa32_addr,
gen_aa32_frob64, gen_aa32_ld_i32 and gen_aa32_st_i32 as separate interfaces.

Backports commit 7f5616f53896a4e08ad37de3ac50d3a4cc8eff7a from qemu
2018-02-27 23:59:16 -05:00
Emilio G. Cota 3dc16ebca3
target-i386: remove helper_lock()
It's been superseded by the atomic helpers.

The use of the atomic helpers provides a significant performance and scalability
improvement. Below is the result of running the atomic_add-test microbenchmark with:
$ x86_64-linux-user/qemu-x86_64 tests/atomic_add-bench -o 5000000 -r $r -n $n
, where $n is the number of threads and $r is the allowed range for the additions.

The scenarios measured are:
- atomic: implements x86's ADDL with the atomic_add helper (i.e. this patchset)
- cmpxchg: implements x86's ADDL with a TCG loop using the cmpxchg helper
- master: before this patchset

Results sorted in ascending range, i.e. descending degree of contention.
Y axis is Throughput in Mops/s. Tests are run on an AMD machine with 64
Opteron 6376 cores.

[ASCII plots, layout lost in this copy: five panels of atomic_add-bench
throughput, 5000000 ops/thread, for the [0,1], [0,2], [0,8], [0,128]
and [0,1024] ranges; x axis is number of threads (0-60), series are
atomic, cmpxchg and master.]

hi-res: http://imgur.com/a/fMRmq

I stopped measuring master after 8 threads, because there is little
point in measuring the well-known performance collapse of a contended lock.

Backports commit 37b995f6e7a1cb6fa378c5cd4217b9dd9e1fc98b from qemu
2018-02-27 23:43:22 -05:00
Emilio G. Cota 9d9b7dedac
target-i386: emulate XCHG using atomic helper
Backports commit ea97ebe89f7a879ea9aba90140e40c29b5cbd653 from qemu
2018-02-27 23:40:20 -05:00
Emilio G. Cota 8f96b6beb9
target-i386: emulate LOCK'ed BTX ops using atomic helpers
Backports commit cfe819d309d472f75fd129faf1d1064a2498326c from qemu
2018-02-27 23:39:21 -05:00
Emilio G. Cota 089965fa8d
target-i386: emulate LOCK'ed XADD using atomic helper
Backports commit f53b01817f95781d2bcc8a82e057d1416601e13b from qemu
2018-02-27 23:06:28 -05:00
Emilio G. Cota f9ed728f27
target-i386: emulate LOCK'ed NEG using cmpxchg helper
Backports commit 8eb8c7385608b99bed6055a22d897ff727a6cb8e from qemu
2018-02-27 23:03:28 -05:00
Emilio G. Cota fedeb0f93e
target-i386: emulate LOCK'ed NOT using atomic helper
Backports commit 2a5fe8ae145ef7a3ab480922116d27efcc97b85d from qemu
2018-02-27 23:00:33 -05:00
Emilio G. Cota 05c94546d5
target-i386: emulate LOCK'ed INC using atomic helper
Backports commit 60e573462fcdb83aa1a41e66a9f31dc8a4364399 from qemu
2018-02-27 22:56:05 -05:00
Emilio G. Cota 7c7b0fe746
target-i386: emulate LOCK'ed OP instructions using atomic helpers
Backports commit a7cee522f3529c2fc85379237b391ea98823271e from qemu
2018-02-27 22:53:46 -05:00
Emilio G. Cota a386368f82
target-i386: emulate LOCK'ed cmpxchg using cmpxchg helpers
The diff here is uglier than necessary. All this does is to turn

FOO

into:

if (s->prefix & PREFIX_LOCK) {
BAR
} else {
FOO
}

where FOO is the original implementation of an unlocked cmpxchg.

Backports commit ae03f8de45427042ecd10b0941a005f21ecc064c from qemu
2018-02-27 22:38:37 -05:00
Richard Henderson b48508a6c1
tcg: Emit barriers with parallel_cpus
Backports commit 91682118aa330aff7e8ef0cc685c32d101f49940 from qemu
2018-02-27 22:28:33 -05:00
Richard Henderson 064543a415
tcg: Add CONFIG_ATOMIC64
Allow qemu to build on 32-bit hosts without 64-bit atomic ops.

Even if we only allow 32-bit hosts to multi-thread emulate 32-bit
guests, we still need some way to handle the 32-bit guest using a
64-bit atomic operation. Do so by dropping back to single-step.

Backports commit df79b996a7b21c6ea7847f7927a2e1a294b86c72 from qemu
2018-02-27 22:25:36 -05:00
Richard Henderson da01e53757
tcg: Add atomic128 helpers
Force the use of cmpxchg16b on x86_64.

Wikipedia suggests that only very old AMD64 (circa 2004) did not have
this instruction. Further, it's required by Windows 8 so no new cpus
will ever omit it.

If we truly care about these, then we could check this at startup time
and then avoid executing paths that use it.
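
For reference, a 16-byte compare-and-swap can be sketched with GCC's
__int128 and __atomic builtins (illustrative only; on x86-64 an inline
cmpxchg16b needs -mcx16):

    #include <stdbool.h>

    /* Compare-and-swap a 16-byte value in one atomic operation. */
    bool cmpxchg128(unsigned __int128 *ptr, unsigned __int128 *expected,
                    unsigned __int128 desired)
    {
        return __atomic_compare_exchange_n(ptr, expected, desired, false,
                                           __ATOMIC_SEQ_CST,
                                           __ATOMIC_SEQ_CST);
    }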

Backports commit 7ebee43ee3e2fcd7b5063058b7ef74bc43216733 from qemu
2018-02-27 21:43:48 -05:00
Richard Henderson 5c0ce1b99c
tcg: Add atomic helpers
Add all of cmpxchg, op_fetch, fetch_op, and xchg.
Handle both endiannesses, and sizes up to 8.
Handle expanding non-atomically, when emulating in serial.
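
The fetch_op/op_fetch distinction is about which value is returned;
roughly, in terms of the GCC builtins (a hedged sketch, not the
generated helpers themselves):

    #include <stdint.h>

    /* fetch_add: returns the value the location held *before* the add. */
    uint32_t fetch_add_u32(uint32_t *p, uint32_t v)
    {
        return __atomic_fetch_add(p, v, __ATOMIC_SEQ_CST);
    }

    /* add_fetch: returns the value the location holds *after* the add. */
    uint32_t add_fetch_u32(uint32_t *p, uint32_t v)
    {
        return __atomic_add_fetch(p, v, __ATOMIC_SEQ_CST);
    }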

Backports commit c482cb117cc418115ca9c6d21a7a2315414c0a40 from qemu
2018-02-27 15:57:47 -05:00
Richard Henderson 0245f93c02
cputlb: Remove includes from softmmu_template.h
We already include exec/address-spaces.h and exec/memory.h in
cputlb.c; the include of qemu/timer.h appears to be a fossil.

Backports commit 40978428853e2f7b4597ab2a9ffeb187333802dc from qemu
2018-02-27 12:40:43 -05:00
Richard Henderson 5c79851143
cputlb: Tidy some macros
TGT_LE and TGT_BE are not size dependent and do not need to be
redefined. The others are no longer used at all.

Backports commit c86c6e4c80fee4d9423bedb10ba9e9c4aa68f861 from qemu
2018-02-27 12:36:25 -05:00
Richard Henderson 4da1cfb902
cputlb: Move most of iotlb code out of line
Saves 2k code size off of a cold path.

Backports commit 82a45b96a203a7403427183f1afd3d295222ff7d from qemu
2018-02-27 12:34:19 -05:00
Richard Henderson 5df7c9eec7
cputlb: Move probe_write out of softmmu_template.h
Backports commit 3b08f0a92545ba06fbdeaae929a5172480300c33 from qemu
2018-02-27 12:25:24 -05:00
Yongbok Kim 79e4c001a9
softmmu: Add probe_write()
Probe for whether the specified guest write access is permitted.
If it is not permitted then an exception will be taken in the same
way as if this were a real write access (and we will not return).
Otherwise the function will return, and there will be a valid
entry in the TLB for this access.

Backports commit 3b4afc9e75ab1a95f33e41f462921093f8a109c4 from qemu
2018-02-27 12:20:50 -05:00
Richard Henderson 1c9c8d3f10
cputlb: Replace SHIFT with DATA_SIZE
Backports commit dea2198201b3e0151d75b42774c51cf2ffe2ca4b from qemu
2018-02-27 12:00:33 -05:00
Richard Henderson e35aacd5ae
tcg: Add EXCP_ATOMIC
When we cannot emulate an atomic operation within a parallel
context, this exception allows us to stop the world and try
again in a serial context.

Backports commit fdbc2b5722f6092e47181a947c90fd4bdcc1c121 from qemu

Also backports parts of commit 02d57ea115b7669f588371c86484a2e8ebc369be
2018-02-27 11:57:58 -05:00
Richard Henderson d5510a546f
int128: Add int128_make128
Allows Int128 to be used more generally, rather than having to
begin with 64-bit inputs and accumulate.
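
A rough sketch of what such a constructor looks like when a native
128-bit type is available (illustrative; the real code presumably also
covers the no-__int128 case):

    #include <stdint.h>

    typedef unsigned __int128 Int128;

    /* Build a 128-bit value directly from its low and high halves. */
    static inline Int128 int128_make128(uint64_t lo, uint64_t hi)
    {
        return ((Int128)hi << 64) | lo;
    }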

Backports commit 1edaeee0955fba7d834b7c8f4e372e7eae030745 from qemu
2018-02-27 11:06:33 -05:00
Richard Henderson 9084e5fe1b
int128: Use __int128 if available
Backports commit 0846beb36641e8f0c3ee55a5bb84d468b653c852 from qemu
2018-02-27 11:03:06 -05:00
Richard Henderson 4fdbe94eea
exec: Avoid direct references to Int128 parts
Backports commit 258dfaaad05a5fbe32a142b794e1df3e16501d0e from qemu
2018-02-27 11:01:43 -05:00
Richard Henderson ba1c63572e
atomics: Add __nocheck atomic operations
While the check against sizeof(void *) is appropriate for
normal usage within qemu, there are places in which we want
wider operations and have checked for their existence.

Backports commit 84bca3927b36fb1d9a2ca85cbbdf9023d2b84678 from qemu
2018-02-27 11:00:20 -05:00
Lioncash a59eef391e
atomic: MSVC compatible equivalents to some functions 2018-02-27 10:56:04 -05:00
Emilio G. Cota c837d76a86
atomics: add atomic_op_fetch variants
This paves the way for upcoming work.

Backports commit 83d0c719f837724d9e3963b078211b2242bdd2a5 from qemu
2018-02-27 10:28:27 -05:00
Emilio G. Cota 102a53aa50
atomics: add atomic_xor
This paves the way for upcoming work.

Backports commit 61696ddbdc74263ddb6869856772cfe355a5d3bd from qemu
2018-02-27 10:23:31 -05:00
Richard Henderson 3fe8d46a15
atomics: Add parameters to macros
Making these functional rather than object macros will
prevent later problems with complex macro expansion.

Backports commit d1a9f2d12fcfc942924956fbe321aedf4226ccb7 from qemu
2018-02-27 10:21:35 -05:00
Richard Henderson 4168095fed
target-m68k: Optimize gen_flush_flags
Backports commit 36f0399d46f2ccf4f6e7451ba46b1e8d0e9ab341 from qemu
2018-02-27 10:19:54 -05:00
Richard Henderson 7403e63f2f
target-m68k: Optimize some comparisons
Backports commit 9d896621c1820fd8f437fac26fd7d2e0921091c3 from qemu
2018-02-27 10:15:20 -05:00
Richard Henderson 672a28173f
target-m68k: Use setcond for scc
Backports commit b459e3eccfae7fe83e30187c391de00bccf4f51d from qemu
2018-02-27 10:11:35 -05:00
Richard Henderson ed6feb9329
target-m68k: Introduce DisasCompare
Backports commit 6a432295d73df91890dc70c4a94dcc4ba88ad1c3 from qemu
2018-02-27 10:08:32 -05:00
Richard Henderson 4e498cc54d
target-m68k: Reorg flags handling
Separate all ccr bits. Continue to batch updates via cc_op.

Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu
2018-02-27 10:02:02 -05:00
Richard Henderson 121309a4d0
target-m68k: Reorg flags handling
Separate all ccr bits. Continue to batch updates via cc_op.

Signed-off-by: Richard Henderson <rth@twiddle.net>

Fix gen_logic_cc() to really extend the size of the result.
Fix gen_get_ccr(): update cc_op as it is used by the helper.
Factorize flags computing and src/ccr cleanup

Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu
2018-02-27 09:30:32 -05:00
Richard Henderson 61ab9a42cd
target-m68k: Remove incorrect clearing of cc_x
The CF docs certainly don't suggest this is true.

Backports commit 18dd87f26bed46f22bb1b9536329c02de500f407 from qemu
2018-02-27 09:21:26 -05:00
Richard Henderson 187c2a9807
target-m68k: Some fixes to SR and flags management
Backports commit 99c514485b1d7922c4ca1ed767fd45525de4701f from qemu
2018-02-27 09:19:21 -05:00
Richard Henderson 9493b29399
target-m68k: Print flags properly
Backports commit 8e394ccabdb1e439aab092de6b9d2f26432e962f from qemu
2018-02-27 09:17:44 -05:00
Laurent Vivier 57ea90a91f
target-m68k: update CPU flags management
Copied from target-i386

Backports commit 9fdb533fb129b19610941bd1e5dd93e7471a18f5 from qemu
2018-02-27 09:15:29 -05:00
Laurent Vivier 125675e334
target-m68k: don't update cc_dest in helpers
Backports commit 91f90d7191f862ab27528dbdf76cee55c77f79cf from qemu
2018-02-27 09:04:51 -05:00
Laurent Vivier 12f9ba3fe4
target-m68k: update move to/from ccr/sr
Backports commit 7c0eb318bdcc3667a861e7b0f140df0b6d9895e2 from qemu
2018-02-27 08:57:05 -05:00
Laurent Vivier b8366d5b31
target-m68k: remove m68k_cpu_exec_enter() and m68k_cpu_exec_exit()
Update cc_op directly from tcg_gen_insn_start() and
restore_state_to_opc()

Copied from target-i386

Backports commit 20a8856eba0980fbe9d2b8ed2b33ecdb9c9fe5ad from qemu
2018-02-27 08:53:02 -05:00
Laurent Vivier a521f4f41d
target-m68k: Replace helper_xflag_lt with setcond
Backports commit f9083519034aaa5ad5cd2c5727bd61c29bf60bc5 from qemu
2018-02-27 08:50:44 -05:00
Laurent Vivier b079255576
target-m68k: allow to update flags with operation on words and bytes
Backports commit 5dbb6784b7e2b833c036b4df58aa07067e35f476 from qemu
2018-02-27 08:47:12 -05:00
Laurent Vivier f069762b61
target-m68k: REG() macro cleanup
Backports commit bcc098b0c23b4dd902ff56987d769bd839677331 from qemu
2018-02-27 08:37:25 -05:00
Laurent Vivier 3d59fe56b3
target-m68k: set PAGE_BITS to 12 for m68k
Backports commit 2b04e85a3401e13cb19b1de197e6c211eaadca4c from qemu
2018-02-27 08:36:09 -05:00
Laurent Vivier 292fc83c86
target-m68k: define operand sizes
Backports commit 7ef25cdd6cee4fa468d6cb913fa064a6689faf7d from qemu
2018-02-27 08:35:13 -05:00
Laurent Vivier 2653165c63
target-m68k: introduce read_imXX() functions
Read an 8-, 16- or 32-bit immediate constant.

An immediate constant is stored in the instruction opcode and
can be in one or two extension words.

Backports commit 28b68cd79ef01e8b1f5bd26718cd8c09a12c625f from qemu
2018-02-27 08:32:04 -05:00
Laurent Vivier d29cbb70b3
target-m68k: manage scaled index
Scaled index is not supported by 68000, 68008, and 68010.

EA = (bd + PC) + Xn.SIZE*SCALE + od

Ignore it:

M68000 FAMILY PROGRAMMER’S REFERENCE MANUAL
2.4 BRIEF EXTENSION WORD FORMAT COMPATIBILITY

"If the MC68000 were to execute an instruction that
encoded a scaling factor, the scaling factor would be
ignored and would not access the desired memory address.
The earlier microprocessors do not recognize the brief
extension word formats implemented by newer processors.
Although they can detect illegal instructions, they do not
decode invalid encodings of the brief extension word formats
as exceptions."

Backports commit d8633620a112296fcf6a6ae9a1cbba614c0ca502 from qemu
2018-02-27 08:27:20 -05:00
Laurent Vivier fa4a71a1bf
target-m68k: define m680x0 CPUs and features
This patch defines eight new features:

- M68K_FEATURE_SCALED_INDEX, scaled address index register
- M68K_FEATURE_LONG_MULDIV, 32bit multiply/divide
- M68K_FEATURE_QUAD_MULDIV, 64bit multiply/divide
- M68K_FEATURE_BCCL, long conditional branches
- M68K_FEATURE_BITFIELD, bit field instructions
- M68K_FEATURE_FPU, FPU instructions
- M68K_FEATURE_CAS, cas instruction
- M68K_FEATURE_BKPT, bkpt instruction

Backports commit f076803bbf6ad1618f493f543faff97f3dd0c970 from qemu
2018-02-27 08:26:06 -05:00
John Paul Adrian Glaubitz 2fd7779aa5
target-m68k: Build the opcode table only once to avoid multithreading issues
Backports commit b208525797b031c1be4121553e21746686318a38 from qemu
2018-02-27 08:14:35 -05:00
Laurent Vivier fd84549b3e
target-m68k: fix DEBUG_DISPATCH
Backports commit a1ff19302007986fa081738e88905a715bd68e2e from qemu
2018-02-27 08:07:21 -05:00
Daniel P. Berrange 83a5bf2d25
qapi: rename QmpOutputVisitor to QObjectOutputVisitor
The QmpOutputVisitor has no direct dependency on QMP. It is
valid to use it anywhere that one wants a QObject. Rename it
to better reflect its functionality as a generic QAPI
to QObject converter.

The commit before previous renamed the files, this one renames C
identifiers.

Backports commit 7d5e199ade76c53ec316ab6779800581bb47c50a from qemu
2018-02-27 08:05:33 -05:00
Daniel P. Berrange 2949a90977
qapi: rename QmpInputVisitor to QObjectInputVisitor
The QmpInputVisitor has no direct dependency on QMP. It is
valid to use it anywhere that one has a QObject. Rename it
to better reflect its functionality as a generic QObject
to QAPI converter.

The previous commit renamed the files, this one renames C identifiers.

Backports commit 09e68369a88d7de0f988972bf28eec1b80cc47f9 from qemu
2018-02-26 15:54:15 -05:00
Daniel P. Berrange 228f122248
qapi: rename *qmp-*-visitor* to *qobject-*-visitor*
The QMP visitors have no direct dependency on QMP. It is
valid to use them anywhere that one has a QObject. Rename them
to better reflect their functionality as a generic QObject
to QAPI converter.

This is the first of three parts: rename the files. The next two
parts will rename C identifiers. The split is necessary to make git
rename detection work.

Backports commit b3db211f3c80bb996a704d665fe275619f728bd4 from qemu
2018-02-26 15:42:37 -05:00
Peter Maydell 1a850bcb19
target-arm: Implement new HLT trap for semihosting
Version 2.0 of the semihosting specification introduces new trap
instructions for AArch32: HLT 0xF000 for A32 and HLT 0x3C for T32.
Implement these (in the same way we implement the existing HLT
semihosting trap for A64).

The old traps via SVC and BKPT are unaffected.

Backports commit 19a6e31c9d2701ef648b70ddcfc3bf64cec8c37e from qemu
2018-02-26 15:28:45 -05:00
Peter Maydell db8b0a82b1
cpu: Support a target CPU having a variable page size
Support target CPUs having a page size which isn't known
at compile time. To use this, the CPU implementation should:
* define TARGET_PAGE_BITS_VARY
* not define TARGET_PAGE_BITS
* define TARGET_PAGE_BITS_MIN to the smallest value it
might possibly want for TARGET_PAGE_BITS
* call set_preferred_target_page_bits() in its realize
function to indicate the actual preferred target page
size for the CPU (and report any error from it)

In CONFIG_USER_ONLY, the CPU implementation should continue
to define TARGET_PAGE_BITS appropriately for the guest
OS page size.

Machines which want to take advantage of having the page
size something larger than TARGET_PAGE_BITS_MIN must
set the MachineClass minimum_page_bits field to a value
which they guarantee will be no greater than the preferred
page size for any CPU they create.

Note that changing the target page size by setting
minimum_page_bits is a migration compatibility break
for that machine.

For debugging purposes, attempts to use TARGET_PAGE_SIZE
before it has been finally confirmed will assert.

Backports commit 20bccb82ff3ea09bcb7c4ee226d3160cab15f7da from qemu
2018-02-26 12:29:08 -05:00
Vijaya Kumar K a7229cc08a
translate-all.c: Compute L1 page table properties at runtime
Stop computing the L1 page mapping table properties
statically using macros dependent on TARGET_PAGE_BITS.
Drop the V_L1_SIZE, V_L1_SHIFT and V_L1_BITS macros and
replace them with variables computed at an early stage
of VM boot.

Removing this dependency helps make TARGET_PAGE_BITS
dynamic.

Backports commit 66ec9f49399f0a9fa13ee77c472caba0de2773fc from qemu
2018-02-26 11:46:58 -05:00
Vijaya Kumar K 3082b4e4ec
exec.c: Remove static allocation of sub_section of sub_page
Allocate sub_section dynamically. Remove the dependency
on TARGET_PAGE_SIZE to allow run-time page size detection
for arm platforms.

Backports commit 2615fabd42ea0078dd9e659bdb21a5b7a1f87a9a from qemu
2018-02-26 10:50:04 -05:00
Paolo Bonzini eb75004013
memory: add a per-AddressSpace list of listeners
This speeds up MEMORY_LISTENER_CALL noticeably. Right now,
with many PCI devices you have N regions added to M AddressSpaces
(M = # PCI devices with bus-master enabled) and each call looks
up the whole listener list, with at least M listeners in it.
Because most of the regions in N are BARs, which are also roughly
proportional to M, the whole thing is O(M^3). This changes it
to O(M^2), which is the best we can do without rewriting the
whole thing.

Backports commit 9a54635dcb51a3fcf7507af630168f514a8cd4e7 from qemu
2018-02-26 10:46:50 -05:00
Paolo Bonzini 4b06e8bbb7
memory: eliminate global MemoryListeners
There is none, so just drop the code.

Backports commit d45fa784cd0c111131696808d1168259d66b7519 from qemu
2018-02-26 10:19:28 -05:00
Paolo Bonzini 8734e13a73
tcg: try sti when moving a constant into a dead memory temp
This comes for free from unifying tcg_reg_alloc_mov and
tcg_reg_alloc_movi's handling of TEMP_VAL_CONST. It triggers
often on moves to cc_dst, such as the following translation
of "sub $0x3c,%esp":

before:                          after:
subl  $0x3c,%ebp                 subl  $0x3c,%ebp
movl  %ebp,0x10(%r14)            movl  %ebp,0x10(%r14)
movl  $0x3c,%ebx                 movl  $0x3c,0x2c(%r14)
movl  %ebx,0x2c(%r14)

Backports commit 0fe4fca4e1a5e06a270127dd80bb753d4dda61c6 from qemu
2018-02-26 10:08:47 -05:00
Paolo Bonzini be00a3e100
target-i386: fix 32-bit addresses in LEA
This was found with test-i386. The issue is that instructions
such as

addr32 lea (%eax), %rax

did not perform a 32-bit extension, because the LEA translation
skipped the gen_lea_v_seg step. That step does not just add
segments, it also takes care of extending from address size to
pointer size.

Backports commit 620abfb004543404bef1953e25da2ad77352941a from qemu
2018-02-26 10:06:08 -05:00
Paolo Bonzini 8b239bd48b
atomic: base mb_read/mb_set on load-acquire and store-release
This introduces load-acquire and store-release operations in QEMU.
For now, just use them as an implementation detail of atomic_mb_read
and atomic_mb_set.

Since docs/atomics.txt documents that atomic_mb_read only synchronizes
with an atomic_mb_set of the same variable, we can use the new implementation
everywhere instead of seq-cst loads and stores.
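
In C11 terms the two building blocks look roughly like this (an
illustrative sketch, not the actual atomic.h macros):

    #include <stdatomic.h>

    /* Load-acquire: later accesses may not be reordered before this load. */
    int load_acquire(_Atomic int *p)
    {
        return atomic_load_explicit(p, memory_order_acquire);
    }

    /* Store-release: earlier accesses may not be reordered after this store. */
    void store_release(_Atomic int *p, int v)
    {
        atomic_store_explicit(p, v, memory_order_release);
    }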

Backports commit 803cf26a9e019b5d2256a8edeb22e3538c4f3261 from qemu
2018-02-26 10:02:46 -05:00
Paolo Bonzini fd7ef4c184
atomic: introduce smp_mb_acquire and smp_mb_release 2018-02-26 09:58:22 -05:00
Eduardo Habkost b41bb81737
target-i386: Don't use cpu->migratable when filtering features
When explicitly enabling unmigratable flags using "-cpu host"
(e.g. "-cpu host,+invtsc"), the requested feature won't be
enabled because cpu->migratable is true by default.

This is inconsistent with all other CPU models, which don't have
the "migratable" option, making "+invtsc" work without the need
for extra options.

This happens because x86_cpu_filter_features() uses
cpu->migratable as an argument for
x86_cpu_get_supported_feature_word(). This is not useful
because:
2) on "-cpu host" it only makes QEMU disable features that were
explicitly enabled in the command-line;
1) on all the other CPU models, cpu->migratable is already false.

The fix is to just use 'false' as an argument to
x86_cpu_get_supported_feature_word() in
x86_cpu_filter_features().

Note that:

* This won't change anything for people using
"-cpu host" or "-cpu host,migratable=<on|off>" (with no extra
features) because the x86_cpu_get_supported_feature_word() call
on the cpu->host_features check uses cpu->migratable as
argument.
* This won't change anything for any CPU model except "host"
because they all have cpu->migratable == false (and only "host"
has the "migratable" property that allows it to be changed).
* This will only change things for people using "-cpu host,+<feature>",
where <feature> is a non-migratable feature. The only existing
named non-migratable feature is "invtsc".

In other words, this change will only affect people using
"-cpu host,+invtsc" (that will now get what they asked for: the
invtsc flag will be enabled). All other use cases are unaffected.

Backports commit 46c032f3afcc05a0123914609f1003906ba63fda from qemu
2018-02-26 09:51:14 -05:00
Eduardo Habkost 4096ce0184
target-i386: x86_cpu_load_features() function
When probing for CPU model information, we need to reuse the code
that initializes CPUID fields, but not the remaining side-effects
of x86_cpu_realizefn(). Move that code to a separate function
that can be reused later.

Backports commit 41f3d4d69a423dadb8431fda65d8d7c68c0de0fc from qemu
2018-02-26 09:49:34 -05:00
Eduardo Habkost aa98c8a93f
target-i386: Move warning code outside x86_cpu_filter_features()
x86_cpu_filter_features() will be reused by code that shouldn't
print any warning. Move the warning code to a new
x86_cpu_report_filtered_features() function, and call it from
x86_cpu_realizefn().

Backports commit 8ca30e8673aff9bfcf8f969f8db4266b5f62e49c from qemu
2018-02-26 09:40:11 -05:00
Eduardo Habkost 08bfa41e1b
target-i386: xsave: Add FP and SSE bits to x86_ext_save_areas
Instead of treating the FP and SSE bits as special cases, add
them to the x86_ext_save_areas array. This will simplify the code
that calculates the supported xsave components and the size of
the xsave area.

Backports commit e3c9022b4e2b6a4deb6518361d2bbf33522b9198 from qemu
2018-02-26 09:37:48 -05:00
Eduardo Habkost 54bd827472
target-i386: Register properties for feature aliases manually
Instead of keeping the aliases inside the feature name arrays and
require parsing the strings, just register alias properties
manually. This simplifies the code for property registration and
lookup.

Backports commit 16d2fcaa509b1ca56eb2fcd8fe877279cf65cccc from qemu
2018-02-26 09:34:52 -05:00
Eduardo Habkost b508b9e02a
target-i386: Remove underscores from feat_names arrays
Instead of translating the feature name entries when adding
property names, store the actual property names in the feature
name array.

For reference, here is the full list of functions that use
FeatureWordInfo::feat_names:

* x86_cpu_get_migratable_flags(): not affected, as it just
checks for non-NULL values.
* report_unavailable_features(): informative only. It will
start printing feature names with hyphens.
* x86_cpu_list(): informative only. It will start printing
feature names with hyphens
* x86_cpu_register_feature_bit_props(): not affected, as it
was already calling feat2prop(). Now we can remove the
feat2prop() calls safely.

So, the only user-visible effect of this patch are the new names
being used in help and error messages for users.

Backports commit fc7dfd205f3287893c436d932a167bffa30579c8 from qemu
2018-02-26 09:33:15 -05:00
Eduardo Habkost 6d1a7bccb5
target-i386: Disable VME by default with TCG
VME is already disabled automatically when using TCG. So, instead
of pretending it is there when reporting CPU model data on
query-cpu-* QMP commands (making every CPU model to be reported
as not runnable), we can disable it by default on all CPU models
when using TCG.

Do that by adding a tcg_default_props array that will work like
kvm_default_props.

Backports commit 04d99c3c61f4bdc0450dbeb6512b6dd743baca65 from qemu
2018-02-26 08:23:44 -05:00
Eduardo Habkost 594cbeaa06
target-i386: List CPU models using subclass list
Instead of using the builtin_x86_defs array, use the QOM subclass
list to list CPU models on "-cpu ?" and "query-cpu-definitions".

Backports commit ee465a3ef77c2b2975ffa71c72208c05b3f3970d from qemu
2018-02-26 08:17:04 -05:00
Peter Maydell 200771d0ba
target-arm: Add trace events for the generic timers
Backports commit 194cbc492bcc8f3f1868ec97a35146bc99c3c71c from qemu
2018-02-26 08:15:42 -05:00
Peter Maydell 158bfc109a
target-arm: Implement dummy MDCCINT_EL1
MDCCINT_EL1 is part of the DCC debugger communication
channel between the CPU and an attached external debugger.
QEMU doesn't implement this, but since Linux may try
to access this register we need to provide at least
a dummy implementation.

Backports commit 5dbdc4342f479d799a1970dd5fd22e64c9dcd50d from qemu
2018-02-26 08:11:54 -05:00
Peter Maydell f2dcb81b27
Fix masking of PC lower bits when doing exception returns
In commit 9b6a3ea7a699594 store_reg() was changed to mask
both bits 0 and 1 of the new PC value when in ARM mode.
Unfortunately this broke the exception return code paths
when doing a return from ARM mode to Thumb mode: in some
of these we write a new CPSR including new Thumb mode
bit via gen_helper_cpsr_write_eret(), and then use store_reg()
to write the new PC. In this case if the new CPSR specified
Thumb mode then masking bit 1 of the PC is incorrect
(these code paths correspond to the v8 ARM ARM pseudocode
function AArch32.ExceptionReturn(), which always aligns the
new PC appropriately for the new instruction set state).

Instead of using store_reg() in exception-return code paths,
call a new store_pc_exc_ret() which stores the raw new PC
value to env->regs[15], and then mask it appropriately in
the subsequent helper_cpsr_write_eret() where the new
env->thumb state is available.

This fixes a bug introduced by 9b6a3ea7a699594 which caused
crashes/hangs or otherwise bad behaviour for Linux when
userspace was using Thumb.

Backports commit fb0e8e79a9d77ee240dbca036fa8698ce654e5d1 from qemu
2018-02-26 08:09:28 -05:00
Thomas Hanson c69ae10ca7
target-arm: Comments added to identify cases in a switch
3 cases in a switch in disas_exc() require reference to the
ARM ARM spec in order to determine what case they're handling.

Backports commit 957956b3013c8122a749dfe61a41aef8b4100e31 from qemu
2018-02-26 08:05:49 -05:00
Thomas Hanson 00d1803436
target-arm: Code changes to implement overwrite of tag field on PC load
For BR, BLR and RET instructions, if tagged addresses are enabled, the
tag field in the address must be cleared out prior to loading the
address into the PC. Depending on the current EL, it will be set to
either all 0's or all 1's.
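
Conceptually (a hedged sketch; the helper name and types are
illustrative, not the backported code):

    #include <stdint.h>

    /* Force the tag byte of a 64-bit virtual address to all zeros or all
     * ones before the address is written to the PC, as described above. */
    static uint64_t clear_address_tag(uint64_t addr, int set_ones)
    {
        return set_ones ? (addr | 0xff00000000000000ull)
                        : (addr & 0x00ffffffffffffffull);
    }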

Backports commit 6feecb8b941f2d21e5645d0b6e0cdb776998121b from qemu
2018-02-26 08:04:00 -05:00
Thomas Hanson 2af4ca54e9
target-arm: Infrastructure changes to enable handling of tagged address loading into PC
When capturing the current CPU state for the TB, extract the TBI0 and TBI1
values from the correct TCR for the current EL and then add them to the TB
flags field.

Then, at the start of code generation for the block, copy the TBI fields
into the DisasContext structure.

Backports commit 86fb3fa4ed5873b021a362ea26a021f4aeab1bb4 from qemu
2018-02-26 07:58:17 -05:00
Marc-André Lureau be6e25bcc7
qapi: return a 'missing parameter' error
The 'old' dispatch code returned a QERR_MISSING_PARAMETER for missing
parameters, but the qapi qmp_dispatch() code uses
QERR_INVALID_PARAMETER_TYPE.

Improve qapi code to return QERR_MISSING_PARAMETER where
appropriate.

Fix expected error message in iotests.

Backports commit 1382d4abdf9619985e4078e37e49e487cea9935e from qemu
2018-02-26 05:19:53 -05:00
Marc-André Lureau ddc25c8aaf
qapi: assert list entry has a value
This helps to figure out the expectations.

Backports commit eac8e79ff749fc15e1dca4caccf1f38664ab4915 from qemu
2018-02-26 05:15:32 -05:00
Marc-André Lureau bd469af15f
qapi: add assert about root value
qiv->root should not be null, make that clearer with some assert.

Backports commit 5d0cbbcfeb59e1e3f5ee7d26b8a215382f6d9abd from qemu
2018-02-26 05:15:01 -05:00
Marc-André Lureau 1a138915a5
qapi: Fix crash when 'any' or 'null' parameter is missing
Unlike the other visit methods, visit_type_any() and visit_type_null()
neglect to check whether qmp_input_get_object() succeeded. They crash
when it fails. Reproducer:

{ "execute": "qom-set",
"arguments": { "path": "/machine", "property": "rtc-time" } }

Will crash with:

qapi/qapi-visit-core.c:277: visit_type_any: Assertion `!err != !*obj'
failed

Broken in commit 5c678ee. Fix by adding the missing error checks.

Backports commit c489780203f9b22aca5539ec7589b7140bdc951f from qemu
2018-02-26 05:13:54 -05:00
Alex Bennée fbf6fb1e25
atomic.h: fix __SANITIZE_THREAD__ build
Only very modern GCCs actually set this define when building with the
ThreadSanitizer, so this little typo slipped through.

Backports commit 23ea7f57949f2f5934f4d5bbc29fe321b3a7067b from qemu
2018-02-26 05:12:17 -05:00
Alex Bennée d4cb954102
cpu: atomically modify cpu->exit_request
ThreadSanitizer picks up potential races although we already use
barriers to ensure things are in the correct order when processing exit
requests. For true C11 defined behaviour across threads we need to use
relaxed atomic_set/atomic_read semantics to reassure tsan.

Backports commit 027d9a7d2911e993cdcbd21c7c35d1dd058f05bb from qemu
2018-02-26 05:11:18 -05:00
Alex Bennée e1cf9ca84a
qom/cpu: atomically clear the tb_jmp_cache
The ThreadSanitizer rightly complains that something initialised with a
normal access is later updated and read atomically.

Backports commit ce7cf6a973f4b614162b9518954d441fa5e32fc6 from qemu
2018-02-26 05:09:05 -05:00
Alex Bennée 12d7e946a1
qom/object: update class cache atomically
The idiom CPU_GET_CLASS(cpu) is fairly extensively used in various
threads and trips up ThreadSanitizer due to the fact it updates
obj->class->object_cast_cache behind the scenes. As this is just a
fast-path cache there is no need to lock updates.

However to ensure defined C11 behaviour across threads we need to use
the plain atomic_read/set primitives and keep the sanitizer happy.

Backports commit b6b3ccfda015dcd5ab50f70c189ee5cc6c622e91 from qemu
2018-02-26 05:06:40 -05:00
Alex Bennée bf72733576
tcg/optimize: move default return out of if statement
This is to appease sanitizer builds which complain that:

"error: control reaches end of non-void function"

Backports commit 550276ae0a88851edda2cb7fcdd64256dbb8e314 from qemu
2018-02-26 05:05:21 -05:00
Alex Bennée 4046235e92
atomic.h: comment on use of atomic_read/set
Add some notes on the use of the relaxed atomic access helpers and their
importance for defined behaviour in C11's multi-threaded memory model.

Backports commit e653bc6b0ff645c25b8a2eb607c18a5c98b59db6 from qemu
2018-02-26 05:03:59 -05:00
Peter Maydell f48d1fe391
target-arm: Correctly handle 'sub pc, pc, 1' for ARMv6
In the ARM v6 architecture, 'sub pc, pc, 1' is not an interworking
branch, so the computed new value is written to r15 as a normal
value. The architecture says that in this case, bits [1:0] of
the value written must be ignored if we are in ARM mode (or
bit [0] ignored if in Thumb mode); this is a change from the
ARMv4/v5 specification that behaviour is UNPREDICTABLE.
Use the correct mask on the PC value when doing a non-interworking
store to PC.
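
The masking rule amounts to something like this sketch (illustrative
only):

    #include <stdint.h>

    /* Non-interworking write to the PC: ignore bits [1:0] in ARM state,
     * and only bit [0] in Thumb state. */
    static uint32_t masked_pc_write(uint32_t value, int thumb)
    {
        return value & (thumb ? ~1u : ~3u);
    }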

A popular library used on RaspberryPi uses this instruction
as part of a trick to determine whether it is running on
ARMv6 or ARMv7, and we were mishandling the sequence.

Fixes bug: https://bugs.launchpad.net/bugs/1625295

Backports commit 9b6a3ea7a699594162ed3d11e4e04b98568dc5c0 from qemu
2018-02-26 05:02:32 -05:00
Edgar E. Iglesias dedab81d68
target-arm: A64: Fix decoding of iss_sf in disas_ld_lit
Fix the decoding of iss_sf in disas_ld_lit.
The SF (Sixty-Four) field in the ISS (Instruction Specific Syndrome)
is a bit that specifies the width of the register that the
instruction loads to.

If cleared it specifies 32 bits.
If set it specifies 64 bits.

Backports commit 173ff58580b383a7841b18fddb293038c9d40d1c from qemu
2018-02-26 05:01:33 -05:00
Evgeny Yakovlev fa9d708fbd
target-i386: Correct family/model/stepping for Opteron_G3
Current CPU definition for AMD Opteron third generation includes
features like SSE4a and LAHF_LM support in emulated CPUID. These
features are present in K8 rev.E or K10 CPUs and later. However,
current G3 family and model describe 2nd generation K8 cores instead.

This is incorrect but was considered harmless until our tests found a
problem with linux kernels >= 3.10 (and maybe earlier) which specifically
check for Opteron K8 model when parsing CPUID leaf 0x80000001:
http://lxr.free-electrons.com/source/arch/x86/kernel/cpu/amd.c?v=3.16#L552
This code will disable LAHF_LM feature in /proc/cpuinfo if model number
is inconsistent.

This change sets Opteron_G3 family/model/stepping to 16/2/3 which is
a proper Opteron 3rd generation 2350 CPU.

Backports commit 339892d758efb2d0954160d41736a0eac9875d67 from qemu
2018-02-26 04:59:18 -05:00
Eduardo Habkost b7f434373b
target-i386: Report known CPUID[EAX=0xD,ECX=0]:EAX bits as migratable
A regression was introduced by commit 96193c22a "target-i386:
Move xsave component mask to features array": all
CPUID[EAX=0xD,ECX=0]:EAX bits were being reported as unmigratable
because they don't have feature names defined. This broke
"-cpu host" because it enables only migratable features by
default.

This adds a new field to FeatureWordInfo: migratable_flags, which
will make those features be reported as migratable even if they
don't have a property name defined.

Backports commit 6fb2fff75dceed1716e757882a6dfbadd9042407 from qemu
2018-02-26 04:58:05 -05:00
Alex Bennée 33589eb75f
cpus: pass CPUState to run_on_cpu helpers
CPUState is a fairly common pointer to pass to these helpers. This means
if you need other arguments for the async_run_on_cpu case you end up
having to do a g_malloc to stuff additional data into the routine. For
the current users this isn't a massive deal but for MTTCG this gets
cumbersome when the only other parameter is often an address.

This adds the typedef run_on_cpu_func for helper functions which has an
explicit CPUState * passed as the first parameter. All the users of
run_on_cpu and async_run_on_cpu have had their helpers updated to use
CPUState where available.

Backports commit e0eeb4a21a3ca4b296220ce4449d8acef9de9049 from qemu
2018-02-26 04:54:55 -05:00
Felipe Franciosi 0ed8880525
compiler: Swap 'public domain' header for license
As discussed on the list [1], having a comment stating that this file
is "public domain" is arguably wrong and not legally binding. This patch
replaces that comment with a clear GPLv2+ license as proposed in [2].

[1] http://lists.nongnu.org/archive/html/qemu-devel/2016-09/msg06151.html
[2] http://lists.nongnu.org/archive/html/qemu-devel/2016-09/msg06217.html

Worth noting, compiler.h was originally created on 5c026320 by splitting
qemu-common.h. At the time, qemu-common.h was already GPLv2+.

Backports commit cc9d8a3b2c41c22fb09f90f3085e6036c199c3ca from qemu
2018-02-26 04:49:45 -05:00
Eduardo Habkost 49c04d7104
target-i386: Clear KVM CPUID features if KVM is disabled
This will ensure all checks for features[FEAT_KVM] in the code
will be correct in case the KVM CPUID leaf is completely
disabled.

Backports commit aec661de86894e914d2d82431d9cefa9a9a40213 from qemu
2018-02-26 04:47:05 -05:00
Eduardo Habkost f29384c810
target-i386: Move xsave component mask to features array
This will reuse the existing check/enforce logic in
x86_cpu_filter_features() to check the xsave component bits
against GET_SUPPORTED_CPUID.

Backports commit 96193c22ab39ea24f81e386ad7883260ff24f5fd from qemu
2018-02-26 04:45:35 -05:00
Eduardo Habkost 3fb3e6672b
target-i386: xsave: Calculate set of xsave components on realize
Instead of doing complex calculations and calling
kvm_arch_get_supported_cpuid() inside cpu_x86_cpuid(), calculate
the set of required XSAVE components earlier, at realize time.

Backports commit 2ca8a8becc2eeb5262e478ce502f5daa53f3d0bc from qemu
2018-02-26 04:40:41 -05:00
Eduardo Habkost 28f002cbaf
target-i386: xsave: Helper function to calculate xsave area size
Move the xsave area size calculation from cpu_x86_cpuid() inside
its own function. While doing it, change it to use the XSAVE area
struct sizes for the initial size, instead of the magic 0x240
number.

Backports commit 1fda6198e4126af9988754c8824cfc9928649890 from qemu
2018-02-26 04:36:27 -05:00
Eduardo Habkost c35e9eb9af
target-i386: xsave: Simplify CPUID[0xD,0].{EAX,EDX} calculation
Instead of assigning individual bits in a loop, just copy the
values from ena_mask.

Backports commit 8057c621b1b17cbcb35fe67d1a09ada9055873a9 from qemu
2018-02-26 04:35:14 -05:00
Eduardo Habkost c7195afd32
target-i386: xsave: Calculate enabled components only once
Instead of checking both env->features and ena_mask at two
different places in the CPUID code, initialize ena_mask based on
the features that are enabled for the CPU, and then clear
unsupported bits based on kvm_arch_get_supported_cpuid().

The results should be exactly the same, but it will make it
easier to move the mask calculation elsewhere, and reuse
x86_cpu_filter_features() for the kvm_arch_get_supported_cpuid()
check.

Backports commit 4928cd6de6b4211a79f98c8dc39115be1e815c2b from qemu
2018-02-26 04:33:18 -05:00
Eduardo Habkost c3a0cba5b1
target-i386: Don't try to enable PT State xsave component
The code that calculates the set of supported XSAVE components on
CPUID looks at ext_save_areas to find out which components should
be enabled. However, if there are zeroed entries in the
ext_save_areas array, the
((env->features[esa->feature] & esa->bits) == esa->bits)
check will always succeed and QEMU will unconditionally try to
enable the component.

Luckily this never caused any problems because the only missing
entry in ext_save_areas is the PT State component (bit 8), and
KVM currently doesn't support it (so it was cleared on ena_mask).
But the code was still incorrect and would break if KVM starts
returning CPUID[EAX=0xD,ECX=0].EAX[bit 8] as supported on
GET_SUPPORTED_CPUID.

Fix the problem by changing the code to not enable a XSAVE
component if ExtSaveArea::bits is zero.

Backports commit 9646f4927faf68e8690588c2fd6dc9834c440b58 from qemu
2018-02-26 04:30:35 -05:00
Eduardo Habkost 6188c6d6e4
target-i386: Move feature name arrays inside FeatureWordInfo
It makes it easier to guarantee the arrays are the right size,
and to find information when looking at the code.

Backports commit 2d5312da566e4424a807d078da05f92ee7be3eec from qemu
2018-02-26 04:29:47 -05:00
Eduardo Habkost 74ae087743
target-i386: Enable CPUID[0x8000000A] if SVM is enabled
SVM needs CPUID[0x8000000A] to be available. So if SVM is enabled
in a CPU model or explicitly in the command-line, adjust CPUID
xlevel to expose the CPUID[0x8000000A] leaf.

Backports commit 0c3d7c0051576d220e6da0a8ac08f2d8482e2f0b from qemu
2018-02-26 04:05:47 -05:00
Eduardo Habkost 37406874ea
target-i386: Automatically set level/xlevel/xlevel2 when needed
Instead of requiring users and management software to be aware of
required CPUID level/xlevel/xlevel2 values for each feature,
automatically increase those values when features need them.

This was already done for CPUID[7].EBX, and is now made generic
for all CPUID feature flags. Unit test included, to make sure we
don't break ABI on older machine-types and don't mess with the
CPUID level values if they are explicitly set by the user.

Backports commit c39c0edf9bb3b968ba95484465a50c7b19f4aa3a from qemu
2018-02-26 04:03:09 -05:00
Eduardo Habkost 6861fe80cf
target-i386: Add a marker to end of the region zeroed on reset
Instead of using cpuid_level, use an empty struct as a marker
(like we already did with {start,end}_init_save). This will avoid
accidentally resetting the wrong fields if we change the field
ordering on CPUX86State.

Backports commit 5e992a8e337e710ea2d02f35668ac55a80e15f99 from qemu
2018-02-26 03:59:03 -05:00
Eduardo Habkost c78d24b93c
target-i386: Remove unused X86CPUDefinition::xlevel2 field
No CPU model in builtin_x86_defs has xlevel2 set, so it is always
zero. Delete the field.

Note that this is not a user-visible change. It doesn't remove
the ability to set xlevel2 on the command-line, it just removes
an unused field in builtin_x86_defs.

Backports commit 0456441b5eb6694a561ad5bb8dad52483e6a08d0 from qemu
2018-02-26 03:57:02 -05:00
Leon Alrae f60eca6930
target-mips: generate fences
Make use of memory barrier TCG opcode in MIPS front end.

Backports commit d208ac0c2e4cb43b74153bd584fc63c7b8a93ed6 from qemu
2018-02-26 03:52:35 -05:00
André Draszik f14ece4aa1
target-mips: add 24KEc CPU definition
Define a new CPU definition supporting 24KEc cores, similar to
the existing 24Kc, but with added support for DSP instructions
and MIPS16e (and without FPU).

Backports commit e9deaad8a58c899dc32e9fdeff9e533070e79dca from qemu
2018-02-26 03:50:22 -05:00
Andrey Yurovsky e24890a580
arm: add Cortex A7 CPU parameters
Add the "cortex-a7" CPU with features and registers matching the Cortex-A7
MPCore Technical Reference Manual and the Cortex-A7 Floating-Point Unit
Technical Reference Manual. The A7 is very similar to the A15.

Backports commit dcf578ed8cec89543158b103940e854ebd21a8cf from qemu
2018-02-26 03:44:24 -05:00
Richard Henderson 552ef4b3e6
target-i386: Use struct X86XSaveArea in fpu_helper.c
This avoids a double handful of magic numbers in the
xsave and xrstor helper functions.

Backports commit 3f32bd21df655e62eb271182a5c63280d631c7b3 from qemu
2018-02-26 03:38:53 -05:00
Richard Henderson 2ab4b8fa4d
tcg/i386: Extend TARGET_PAGE_MASK to the proper type
TARGET_PAGE_MASK, as defined, has type "int". We need to extend
that to the proper target width before oring in an "unsigned".

Backports commit ebb90a005da67147245cd38fb04a965a87a961b7 from qemu
2018-02-26 03:32:38 -05:00
Pranith Kumar 16d71f0f10
tcg: Optimize fence instructions
This commit optimizes fence instructions. Two optimizations are
currently implemented: (1) unnecessary duplicate fence instructions,
and (2) merging weaker fences into a stronger fence.

[rth: Merge tcg_optimize_mb back into tcg_optimize, so that we only
loop over the opcode stream once. Merge "unrelated" weaker barriers
into one stronger barrier.]

Backports commit 34f939218ce78163171addd63750e1e0300376ab from qemu
2018-02-26 03:29:59 -05:00
Pranith Kumar 533e083495
target-i386: Generate fences for x86
Backports commit cc19e497a047193db5083425957d7292c8dd3226 from qemu
2018-02-26 03:28:31 -05:00
Pranith Kumar 32b7cee81e
target-aarch64: Generate fences for aarch64
Backports commit ce1bd93f94e8d4b7117744e49652d2f907bed99f from qemu
2018-02-26 03:26:35 -05:00
Pranith Kumar 7849f8d72a
target-arm: Generate fences in ARMv7 frontend
Backports commit 61e4c432ab26526bab0f3ef746c1861415b6da29 from qemu
2018-02-26 03:22:53 -05:00
Pranith Kumar 65a73763e3
tcg/sparc: Add support for fence
Backports commit f8f03b3707b49898052fb8cd75ee31d19c8161fc from qemu
2018-02-26 03:20:39 -05:00
Pranith Kumar a6fdc24e28
tcg/s390: Add support for fence
Backports commit c9314d610e0e5da4d2cd5a36f3563d102b3294e0 from qemu
2018-02-26 03:19:41 -05:00
Pranith Kumar bdd9cad15c
tcg/ppc: Add support for fence
Backports commit 7b4af5ee8a1336bc39714b6de47924ee71fba761 from qemu
2018-02-26 03:18:43 -05:00
Pranith Kumar 5f10101245
tcg/mips: Add support for fence
Backports commit 6f0b99104a396905870edc3049310ece29b6b8d6 from qemu
2018-02-26 03:17:34 -05:00
Pranith Kumar e29cbe9640
tcg/arm: Add support for fence
Backports commit 40f191ab8226fdada185efa49c44b60d8f494890 from qemu
2018-02-26 03:13:17 -05:00
Pranith Kumar 907060b865
tcg/aarch64: Add support for fence
Backports commit c7a59c2a92592e556b9361437c9c4229917bd1e3 from qemu
2018-02-26 03:11:03 -05:00
Pranith Kumar d49bd55f52
tcg/i386: Add support for fence
Generate a 'lock orl $0,0(%esp)' instruction for ordering instead of
mfence which has similar ordering semantics.

Backports commit a7d00d4effb58889ac6df64f98ac50c9d1594149 from qemu
2018-02-26 03:10:58 -05:00
Pranith Kumar 5e44ce9be8
Introduce TCGOpcode for memory barrier
This commit introduces the TCGOpcode for memory barrier instruction.

This opcode takes an argument which is the type of memory barrier
which should be generated.

Backports commit f65e19bc2c9e8358e634d309606144ac2a3c2936 from qemu
2018-02-26 03:02:41 -05:00
Richard Henderson 66d79ac959
tcg: Merge GETPC and GETRA
The return address argument to the softmmu template helpers was
confused. In the legacy case, we wanted to indicate that there
is no return address, and so passed in NULL. However, we then
immediately subtracted GETPC_ADJ from NULL, resulting in a non-zero
value, indicating the presence of an (invalid) return address.

Push the GETPC_ADJ subtraction down to the only point it's required:
immediately before use within cpu_restore_state_from_tb, after all
NULL pointer checks have been completed.

This makes GETPC and GETRA identical. Remove GETRA as the lesser
used macro, replacing all uses with GETPC.

Backports commit 01ecaf438b1eb46abe23392c8ce5b7628b0c8cf5 from qemu
2018-02-26 02:54:44 -05:00
Richard Henderson 91f5cf0417
tcg: Support arbitrary size + alignment
Previously we allowed fully unaligned operations, but not operations
that are aligned but with less alignment than the operation size.

In addition, arm32, ia64, mips, and sparc had been omitted from the
previous overalignment patch, which would have led to that alignment
being enforced.

Backports commit 85aa80813dd9f5c1f581c743e45678a3bee220f8 from qemu
2018-02-26 02:47:26 -05:00
Stanislav Shmarov 5f9552657e
target-i386: Fixed syscall possible segfault
In user-mode emulation env->idt.base memory is
allocated in linux-user/main.c with
size 8*512 = 4096 (for 64-bit).
When the fake interrupt EXCP_SYSCALL is thrown,
do_interrupt_user checks the destination privilege level
for this fake exception, and tries to read 4 bytes
at address base + (256 * 2^4) = 4096, which causes a
segfault.

Privilege level was checked only for ints, so let's
read dpl from memory only for this case.

Backports commit 885b7c44e4f8b7a012a92770a0dba8b238662caa from qemu
2018-02-26 02:36:09 -05:00
Paolo Bonzini d8d0d08262
target-i386: fix ordering of fields in CPUX86State
Make sure reset zeroes TSC_AUX, XCR0, PKRU. Move XSTATE_BV from the
"vmstate only" section to the "KVM only" section.

Backports commit 7616f1c2da1c0f336a474a56ad6d32e15ccd666e from qemu
2018-02-26 02:34:22 -05:00
Ladi Prosek 7acc14da16
Remove unused function declarations
Unused function declarations were found using a simple gcc plugin and
manually verified by grepping the sources.

Backports commit d4b84d564ee3eb7a58e4585d671fb3c220b6c3b9 from qemu
2018-02-26 02:31:46 -05:00
Thomas Huth b581d4033f
tcg: Remove duplicate header includes
host-utils.h and timer.h are included twice in tcg.c.
One time should be enough.

Backports commit 347519eb9d68303a6c23a7663c0fa6c20a225191 from qemu
2018-02-26 02:29:38 -05:00
Lioncash 1ff9724b46
cutils: Remove unused vector ifdef block 2018-02-26 02:28:50 -05:00
Andrew Dutcher 26b36e5ff8
fpu: add mechanism to check for invalid long double formats
All operations that take a floatx80 as an operand need to have their
inputs checked for malformed encodings. In all of these cases, use the
function floatx80_invalid_encoding to perform the check. If an invalid
operand is found, raise an invalid operation exception, and then return
either NaN (for fp-typed results) or the integer indefinite value (the
minimum representable signed integer value, for int-typed results).

For the non-quiet comparison operations, this touches adjacent code in
order to pass style checks.

Backports cast correction portion of commit d1eb8f2acba579830cf3798c3c15ce51be852c56 from qemu
2018-02-26 02:27:40 -05:00
Pranith Kumar 9e6fec8741
atomics: Use __atomic_*_n() variant primitives
Use the __atomic_*_n() primitives which take the value as argument. It
is not necessary to store the value locally before calling the
primitive, hence saving us a stack store and load.
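
The difference between the two forms, as an illustrative sketch:

    #include <stdint.h>

    void store_both_forms(uint32_t *p, uint32_t v)
    {
        /* Generic form: the value lives in memory and is passed by address. */
        uint32_t tmp = v;
        __atomic_store(p, &tmp, __ATOMIC_SEQ_CST);

        /* _n form: the value is passed directly, avoiding the temporary. */
        __atomic_store_n(p, v, __ATOMIC_SEQ_CST);
    }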

Backports commit 89943de17c4e276f2c47f05b4604e8816a6a636c from qemu
2018-02-26 02:16:48 -05:00
Fam Zheng 1a2c30abbf
rules.mak: Don't extract libs from .mo-libs in link command
For module build, .mo objects are passed to LINK and consumed in
process-archive-undefs. The reason behind that is documented in the
comment above process-archive-undefs.

Similarly, extract-libs should be called with .mo filtered out too.
Otherwise, the .mo-libs are added to the link command incorrectly,
spoiling the purpose of modularization.

Currently we don't have any .mo-libs usage, but it will be used soon
when we modularize more multi-source objects, like sdl and gtk.

Backports commit 5b1b6dbd94e2e2e98920f886cb32fcf4a1520b50 from qemu
2018-02-26 02:08:03 -05:00
Sergey Fedorov 58ff618708
tcg: rename tb_find_physical()
In fact, this function does not exactly perform a lookup by physical
address as described in the comment on get_page_addr_code(). Thus
it may be a bit confusing to have "physical" in its name. So rename it
to tb_htable_lookup() to better reflect its actual functionality.

Backports commit b34de45fc40d01c14b31d3a682e284180a2ed8c5 from qemu
2018-02-26 02:07:06 -05:00
Sergey Fedorov ab0c87bc6f
tcg: Merge tb_find_slow() and tb_find_fast()
These functions are not too big and can be merged together. This makes
the locking scheme clearer and easier to follow.

Backports commit bd2710d5da06ad7706d4864f65b3f0c9f7cb4d7f from qemu
2018-02-26 02:05:19 -05:00
Sergey Fedorov 9b6f287488
tcg: Avoid bouncing tb_lock between tb_gen_code() and tb_add_jump()
Backports commit 74d356dd48b64eaa2a6104ac1493ca64cb31fa16 from qemu
2018-02-26 02:01:40 -05:00
Alex Bennée 09c3ef656e
tcg: cpu-exec: remove tb_lock from the hot-path
Lock contention in the hot path of moving between existing patched
TranslationBlocks is the main drag in multithreaded performance. This
patch pushes the tb_lock() usage down to the two places that really need
it:

- code generation (tb_gen_code)
- jump patching (tb_add_jump)

The rest of the code doesn't really need to hold a lock as it is either
using per-CPU structures, atomically updated or designed to be used in
concurrent read situations (qht_lookup).

To keep things simple I removed the #ifdef CONFIG_USER_ONLY stuff as the
locks become NOPs anyway until the MTTCG work is completed.

Backports commit 518615c6503ad78d3bb67ddf1cd848c4a41de02e from qemu
2018-02-26 01:58:33 -05:00
Alex Bennée 62aa0abd02
tcg: set up tb->page_addr before insertion
This ensures that if we find the TB on the slow path that tb->page_addr
is correctly set before being tested.

Backports commit 2e1ae44a4f4a6149fbb9dc812243522f07284700 from qemu
2018-02-26 01:50:04 -05:00
Paolo Bonzini 30845ae475
tcg: Prepare TB invalidation for lockless TB lookup
When invalidating a translation block, set an invalid flag in the
TranslationBlock structure first. It is also necessary to check whether
the target TB is still valid after acquiring 'tb_lock' but before calling
tb_add_jump(), since TB lookup is to be performed out of 'tb_lock' in the
future. Note that we don't have to check 'last_tb'; an already invalidated
TB will not be executed anyway and it is thus safe to patch it.

Backports commit 6d21e4208f382dd8ca1f7995a6dd9ea7ca281163 from qemu
2018-02-26 01:48:13 -05:00
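A minimal standalone sketch of the handshake described above, using C11 atomics and a pthread mutex in place of QEMU's tb_lock; the structure and names are illustrative only:

#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>

typedef struct TB {
    atomic_bool invalid;   /* set first when the TB is invalidated */
    /* ... code pointers, jump slots, etc. ... */
} TB;

static pthread_mutex_t tb_lock = PTHREAD_MUTEX_INITIALIZER;

static void tb_invalidate(TB *tb)
{
    /* Mark the TB invalid before unlinking it, so lock-free lookups
     * that already hold a pointer to it can notice. */
    atomic_store(&tb->invalid, true);
    /* ... remove from the hash table, reset jumps, etc. ... */
}

/* Called after a lock-free lookup found 'tb'; only patch the jump once
 * tb_lock is held and the TB is known to still be valid. */
static bool tb_try_add_jump(TB *last_tb, int n, TB *tb)
{
    bool ok;

    pthread_mutex_lock(&tb_lock);
    ok = !atomic_load(&tb->invalid);
    if (ok) {
        /* ... patch last_tb's jump slot 'n' to point at tb ... */
        (void)last_tb; (void)n;
    }
    pthread_mutex_unlock(&tb_lock);
    return ok;
}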
Sergey Fedorov c0dda5fbe9
tcg: Prepare safe access to tb_flushed out of tb_lock
Ensure atomicity and ordering of CPU's 'tb_flushed' access for future
translation block lookup out of 'tb_lock'.

This field can only be touched from another thread by tb_flush() in user
mode emulation. So the only accesses that need to be sequentially atomic are:
* a single write in tb_flush();
* reads/writes out of 'tb_lock'.

In the future, before enabling MTTCG in system mode, tb_flush() must be made
safe and this field will become unnecessary.

Backports commit 118b07308a8cedc16ef63d7ab243a95f1701db40 from qemu
2018-02-25 23:33:58 -05:00
Sergey Fedorov 9eb02a540d
tcg: Prepare safe tb_jmp_cache lookup out of tb_lock
Ensure atomicity of CPU's 'tb_jmp_cache' access for future translation
block lookup out of 'tb_lock'.

Note that this patch does *not* make CPU's TLB invalidation safe if it
is done from some other thread while the CPU is in its execution loop.

Backports commit 89a16b1e4294e3664667a151c2f70c84dfac6fd9 from qemu
2018-02-25 23:29:18 -05:00
Sergey Fedorov 371101a184
tcg: Pass last_tb by value to tb_find_fast()
This is a small clean up. tb_find_fast() is a final consumer of this
variable so no need to pass it by reference. 'last_tb' is always updated
by subsequent cpu_loop_exec_tb() in cpu_exec().

This change also simplifies calling cpu_exec_nocache() in
cpu_handle_exception().

Backports commit 4b7e69509df2fcbfdab8c62c294dbfcfdab8a6e1 from qemu
2018-02-25 23:23:22 -05:00
Cao jin cc45b82472
timer/cpus: fix some typos and update some comments
Backports commit 3224e8786fcbe531746f1530c37210c425625213 from qemu
2018-02-25 23:21:57 -05:00
Paolo Bonzini 57fff7a94b
target-m68k: fix get_mac_extf helper
val is assigned twice; the second one should be combined with "|".
Reported by Coverity.

Backports commit 5ce747cfac697f61668ab4fa4a71c1dba15cc272 from qemu
2018-02-25 23:21:05 -05:00
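The bug class is easy to see in isolation; the field layout below is made up, only the assignment pattern matters:

#include <stdint.h>

static uint32_t extract_buggy(uint32_t regval)
{
    uint32_t val;
    val = (regval >> 8) & 0xff00;   /* first half of the field */
    val = regval & 0x00ff;          /* bug: overwrites the first half */
    return val;
}

static uint32_t extract_fixed(uint32_t regval)
{
    uint32_t val;
    val = (regval >> 8) & 0xff00;
    val |= regval & 0x00ff;         /* combine with "|" as intended */
    return val;
}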
Thomas Huth aed5df31b7
sparc: Use g_memdup() instead of g_new0() + memcpy()
There is no need to make sure that the memory is zeroed after the
allocation if we immediately fill the whole buffer afterwards
with memcpy(). Thus g_new0 should be g_new instead. But since we
are also doing a memcpy() here, we can also simply replace both
with g_memdup() instead.

Backports commit a337f295defad7eb977da4d6317cf70f7f2fa4b4 from qemu
2018-02-25 23:19:44 -05:00
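Illustration of the transformation using the glib API; the struct here is just an example:

#include <glib.h>
#include <string.h>

typedef struct {
    guint32 vals[8];
} Regs;

/* Before: zero-allocate, then immediately overwrite everything. */
static Regs *copy_regs_old(const Regs *src)
{
    Regs *dst = g_new0(Regs, 1);
    memcpy(dst, src, sizeof(*dst));
    return dst;
}

/* After: g_memdup() allocates and copies in one call. */
static Regs *copy_regs_new(const Regs *src)
{
    return g_memdup(src, sizeof(*src));
}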
Peter Maydell eb77f61bea
configure: Always compile with -fwrapv
QEMU's code relies on left shifts of signed integers always
being defined behaviour with the obvious 2s-complement
semantics. The only way to tell the compiler (and any
associated undefined-behaviour sanitizer) that we require a
C dialect with these semantics is to use the -fwrapv option.
This is a bit of a heavy hammer for the job as it also gives
us guaranteed semantics on integer arithmetic overflow which
in theory we don't require.

In an ideal world this would allow us to drop the warning
flag -Wno-shift-negative-value, but we must retain this to
avoid spurious warnings on clang versions predating the
fix to https://llvm.org/bugs/show_bug.cgi?id=25552.

Backports commit 2d31515bc0880a1cea86ce638d2a109f4f4e6f7d from qemu
2018-02-25 23:17:41 -05:00
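A small example of the kind of code this is about; compile it with and without -fwrapv to see the difference in what the compiler (and any UB sanitizer) is allowed to assume:

#include <stdint.h>
#include <stdio.h>

int main(void)
{
    int32_t a = -1;
    /* Left-shifting a negative value is undefined in strict ISO C, but
     * 2's-complement code like QEMU's expects -16 here. */
    int32_t b = a << 4;

    int32_t x = INT32_MAX;
    /* Signed overflow: wraps to INT32_MIN under -fwrapv, UB otherwise. */
    int32_t c = x + 1;

    printf("%d %d\n", b, c);
    return 0;
}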
Longpeng(Mike) 8b5400d675
target-i386: present virtual L3 cache info for vcpus
Some software algorithms are based on the hardware's cache info. For example,
in the x86 Linux kernel, when cpu1 wants to wake up a task on cpu2, cpu1 will trigger
a resched IPI and tell cpu2 to do the wakeup if they don't share a low-level
cache. Conversely, cpu1 will access cpu2's runqueue directly if they share the llc.
The relevant linux-kernel code is as below:

static void ttwu_queue(struct task_struct *p, int cpu)
{
struct rq *rq = cpu_rq(cpu);
......
if (... && !cpus_share_cache(smp_processor_id(), cpu)) {
......
ttwu_queue_remote(p, cpu); /* will trigger RES IPI */
return;
}
......
ttwu_do_activate(rq, p, 0); /* access target's rq directly */
......
}

On real hardware, the cpus on the same socket share the L3 cache, so one won't
trigger resched IPIs when waking up a task on another. But QEMU doesn't present
virtual L3 cache info for the VM, so the Linux guest will trigger lots of RES IPIs
under some workloads even if the virtual cpus belong to the same virtual socket.

For KVM, there will be lots of vmexits due to the guest sending IPIs.
The workload is SAP HANA's testsuite; we run it for one round (about 40 minutes)
and observe the (Suse11sp3) guest's number of RES IPIs triggered during
that period:
        No-L3   With-L3 (with this patch applied)
cpu0:	363890	44582
cpu1:	373405	43109
cpu2:	340783	43797
cpu3:	333854	43409
cpu4:	327170	40038
cpu5:	325491	39922
cpu6:	319129	42391
cpu7:	306480	41035
cpu8:	161139	32188
cpu9:	164649	31024
cpu10:	149823	30398
cpu11:	149823	32455
cpu12:	164830	35143
cpu13:	172269	35805
cpu14:	179979	33898
cpu15:	194505	32754
avg:	268963.6	40129.8

The VM's topology is "1*socket 8*cores 2*threads".
After presenting virtual L3 cache info to the VM, the number of RES IPIs in the
guest is reduced by 85%.

For KVM, vcpus sending IPIs causes vmexits, which are expensive and can cause
severe performance degradation. We also tested the overall system performance with
vcpus actually running on separate physical sockets. With L3 cache, the performance
improves by 7.2%~33.1% (avg: 15.7%).

Backports commit 14c985cffa6cb177fc01a163d8bcf227c104718c from qemu
2018-02-25 23:16:14 -05:00
Lioncash 2d87095858
glib_compat: Amend header guard 2018-02-25 23:12:20 -05:00
Sergey Sorokin a882118050
target-arm: Fix lpae bit in FSR on an alignment fault
If an alignment fault occurred and target EL is using AArch32,
then DFSR/IFSR bit LPAE[9] must be set correctly.

Backports commit e0fe723c24562c8f909bb40f131bfdbe75650677 from qemu
2018-02-25 23:10:29 -05:00
Luwei Kang af7b3995dd
target-i386: Add more Intel AVX-512 instructions support
Add more AVX512 feature bits, including AVX512DQ, AVX512IFMA,
AVX512BW, AVX512VL and AVX512VBMI. The spec can be found at:
https://software.intel.com/sites/default/files/managed/b4/3a/319433-024.pdf

Backports commit cc728d1493eee3e20c1547191862e43d3f55e714 from qemu
2018-02-25 23:09:18 -05:00
Alex Williamson fe66c2e088
memory: Don't use memcpy for ram_device regions
With a vfio assigned device we lay down a base MemoryRegion registered
as an IO region, giving us read & write accessors. If the region
supports mmap, we lay down a higher priority sub-region MemoryRegion
on top of the base layer initialized as a RAM device pointer to the
mmap. Finally, if we have any quirks for the device (i.e. address
ranges that need additional virtualization support), we put another IO
sub-region on top of the mmap MemoryRegion. When this is flattened,
we now potentially have sub-page mmap MemoryRegions exposed which
cannot be directly mapped through KVM.

This is as expected, but a subtle detail of this is that we end up
with two different access mechanisms through QEMU. If we disable the
mmap MemoryRegion, we make use of the IO MemoryRegion and service
accesses using pread and pwrite to the vfio device file descriptor.
If the mmap MemoryRegion is enabled and results in one of these
sub-page gaps, QEMU handles the access as RAM, using memcpy to the
mmap. Using either pread/pwrite or the mmap directly should be
correct, but using memcpy causes us problems. I expect that not only
does memcpy not necessarily honor the original width and alignment in
performing a copy, but it potentially also uses processor instructions
not intended for MMIO spaces. It turns out that this has been a
problem for Realtek NIC assignment, which has such a quirk that
creates a sub-page mmap MemoryRegion access.

To resolve this, we disable memory_access_is_direct() for ram_device
regions since QEMU assumes that it can use memcpy for those regions.
Instead we access through MemoryRegionOps, which replaces the memcpy
with simple de-references of standard sizes to the host memory.

With this patch we attempt to provide unrestricted access to the RAM
device, allowing byte through qword access as well as unaligned
access. The assumption here is that accesses initiated by the VM are
driven by a device specific driver, which knows the device
capabilities. If unaligned accesses are not supported by the device,
we don't want them to work in a VM by performing multiple aligned
accesses to compose the unaligned access. A down-side of this
philosophy is that the xp command from the monitor attempts to use
the largest available access width, unaware of the underlying
device. Using memcpy had this same restriction, but at least now an
operator can dump individual registers, even if blocks of device
memory may result in access widths beyond the capabilities of a
given device (RTL NICs only support up to dword).

Backports commit 1b16ded6a512809f99c133a97f19026fe612b2de from qemu
2018-02-25 23:06:36 -05:00
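The shape of the replacement access path is roughly the following; this is a simplified sketch, not the actual MemoryRegionOps callbacks:

#include <stdint.h>

static uint64_t ram_device_read(void *hostptr, unsigned size)
{
    /* Service the access with a plain dereference of the requested
     * width instead of memcpy(). */
    switch (size) {
    case 1: return *(volatile uint8_t *)hostptr;
    case 2: return *(volatile uint16_t *)hostptr;
    case 4: return *(volatile uint32_t *)hostptr;
    case 8: return *(volatile uint64_t *)hostptr;
    default: return 0;   /* unsupported width */
    }
}

static void ram_device_write(void *hostptr, uint64_t data, unsigned size)
{
    switch (size) {
    case 1: *(volatile uint8_t *)hostptr = (uint8_t)data; break;
    case 2: *(volatile uint16_t *)hostptr = (uint16_t)data; break;
    case 4: *(volatile uint32_t *)hostptr = (uint32_t)data; break;
    case 8: *(volatile uint64_t *)hostptr = data; break;
    default: break;
    }
}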
Alex Williamson 5db45219c9
memory: Replace skip_dump flag with ram_device
Setting skip_dump on a MemoryRegion allows us to modify one specific
code path, but the restriction we're trying to address encompasses
more than that. If we have a RAM MemoryRegion backed by a physical
device, it not only restricts our ability to dump that region, but
also affects how we should manipulate it. Here we recognize that
MemoryRegions do not change to sometimes allow dumps and other times
not, so we replace setting the skip_dump flag with a new initializer
so that we know exactly the type of region to which we're applying
this behavior.

Backports commit ca83f87a66d19fdaabf23d4f5ebb49396fe232c1 from qemu
2018-02-25 23:00:45 -05:00
Pranith Kumar 1b19fe260a
softfloat: Fix warn about implicit conversion from int to int8_t
Change the flag type to 'uint8_t' to fix the implicit conversion error.

Backports commit dfd607671037ff46d5b16ade10e10efdf0d260be from qemu
2018-02-25 22:54:39 -05:00
Pranith Kumar 4c880fba9d
target-arm: Fix warn about implicit conversion
Clang warns about an implicit conversion as follows:

/mnt/devops/code/qemu/target-arm/neon_helper.c:1075:1: warning: implicit conversion from 'int' to 'int8_t' (aka 'signed char') changes value from 128 to -128 [-Wconstant-conversion]
NEON_VOP_ENV(qrshl_s8, neon_s8, 4)
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/mnt/devops/code/qemu/target-arm/neon_helper.c:116:83: note: expanded from macro 'NEON_VOP_ENV'
uint32_t HELPER(glue(neon_,name))(CPUARMState *env, uint32_t arg1, uint32_t arg2) \
^
/mnt/devops/code/qemu/target-arm/neon_helper.c:106:5: note: expanded from macro '\
NEON_VOP_BODY'
NEON_DO##n; \
^~~~~~~~~~
<scratch space>:21:1: note: expanded from here
NEON_DO4
^~~~~~~~
/mnt/devops/code/qemu/target-arm/neon_helper.c:93:5: note: expanded from macro 'NEON_DO4'
NEON_FN(vdest.v1, vsrc1.v1, vsrc2.v1); \
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/mnt/devops/code/qemu/target-arm/neon_helper.c:1054:23: note: expanded from macro 'NEON_FN'
dest = (1 << (sizeof(src1) * 8 - 1)); \
~ ~~^~~~~~~~~~~~~~~~~~~~~~~~~

Fix it by casting to the appropriate type.

Backports commit 6bbbb0ac136102098a70b97ab0c07bc7bf53131c from qemu
2018-02-25 22:44:43 -05:00
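Reduced example of the warning and the fix; the exact cast chosen upstream may differ, the point is to make the narrowing explicit:

#include <stdint.h>

static int8_t min_s8_buggy(void)
{
    int8_t dest;
    /* 1 << 7 is the int value 128, which does not fit in int8_t; clang
     * warns that the implicit conversion changes 128 to -128. */
    dest = (1 << (sizeof(int8_t) * 8 - 1));
    return dest;
}

static int8_t min_s8_fixed(void)
{
    int8_t dest;
    /* Make the narrowing explicit: the intent is the minimum int8_t
     * value, i.e. -128. */
    dest = (int8_t)(1 << (sizeof(int8_t) * 8 - 1));
    return dest;
}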
Richard Henderson ede1cae3dc
tcg: Lower indirect registers in a separate pass
Rather than rely on recursion during the middle of register allocation,
lower indirect registers to loads and stores off the indirect base into
plain temps.

For an x86_64 host, with sufficient registers, this results in identical
code, modulo the actual register assignments.

For an i686 host, with insufficient registers, this means that temps can
be (temporarily) spilled to the stack in order to satisfy an allocation.
This is as opposed to the possibility of not being able to spill, to allocate
a register for the indirect base, in order to perform a spill.

Backports commit 5a18407f55ade924aa6397c9a043a9ffd59645fe from qemu
2018-02-25 22:32:28 -05:00
Richard Henderson 8a012ff6d3
tcg: Require liveness analysis
Backports commit c0ef05b5e62ab0c291a94022f14104e61e306f03 from qemu
2018-02-25 22:20:42 -05:00
Lioncash 541601edc4
util: Move qemu-log to utils 2018-02-25 22:17:44 -05:00
Richard Henderson 2aa46dd9a1
tcg: Include liveness info in the dumps
Backports commit bdfb460ef77500f7b186759b585f06ff2120929d from qemu
2018-02-25 22:13:08 -05:00
Richard Henderson e973e89a57
tcg: Compress dead_temps and mem_temps into a single array
We only need two bits per temporary. Fold the two bytes into one,
and reduce the memory and cachelines required during compilation.

Backports commit c70fbf0a9938baf3b4f843355a77c17a7e945b98 from qemu
2018-02-25 22:07:08 -05:00
Richard Henderson 690985a582
tcg: Fold life data into TCGOp
Reduce the size of other bitfields to make room.
This reduces the cache footprint of compilation.

Backports commit bee158cb4dde35c41632a3a129c869f14a32f8f0 from qemu
2018-02-25 21:49:42 -05:00
Lioncash b5e765d562
target-mips: Silence unused function warning 2018-02-25 21:47:22 -05:00
Richard Henderson 1547048a22
tcg: Reorg TCGOp chaining
Instead of using -1 as the end of the chain, use 0, and link through the 0
entry as a fully circular doubly-linked list.

Backports commit dcb8e75870e2de199db853697f8839cb603beefe from qemu
2018-02-25 21:44:50 -05:00
Richard Henderson b2e6e351c2
tcg: Compress liveness data to 16 bits
This reduces both memory usage and per-insn cacheline usage
during code generation.

Backports commit a1b3c48d2b23d6eaeb4529d3e1183d2648731bf8 from qemu
2018-02-25 21:27:24 -05:00
Eric Blake 30cbcafc05
osdep: Document differences in rounding macros
Make it obvious which macros are safe in which situations.

Useful since QEMU_ALIGN_UP and ROUND_UP both purport to do
the same thing, but differ on whether the alignment must be
a power of 2.
2018-02-25 21:05:21 -05:00
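Illustrative restatement of the two macro styles being documented (these are not copied from osdep.h):

#include <stdio.h>

/* Mask-based round-up: only valid when 'align' is a power of 2. */
#define ROUND_UP_POW2(n, align)  (((n) + (align) - 1) & ~((align) - 1))

/* Division-based round-up: works for any non-zero multiple. */
#define ALIGN_UP_ANY(n, mult)    ((((n) + (mult) - 1) / (mult)) * (mult))

int main(void)
{
    printf("%d\n", ROUND_UP_POW2(13, 8));  /* 16 */
    printf("%d\n", ALIGN_UP_ANY(13, 8));   /* 16 */
    printf("%d\n", ALIGN_UP_ANY(7, 6));    /* 12 */
    printf("%d\n", ROUND_UP_POW2(7, 6));   /* 8: wrong, the mask form needs a power of 2 */
    return 0;
}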
Leon Alrae bc434da124
target-mips: fix EntryHi.EHINV being cleared on TLB exception
While implementing the TLB invalidation feature we forgot to modify the
part of the code responsible for updating EntryHi during a TLB exception.
Consequently EntryHi.EHINV is unexpectedly cleared on the exception.

Backports commit 701074a6fc7470d0ed54e4a4bcd4d491ad8da22e from qemu
2018-02-25 21:02:31 -05:00
Igor Mammedov 943b9fc261
qdev: Fix object reference leak in case device.realize() fails
If a device doesn't have a parent assigned before its realize
is called, device_set_realized() will implicitly set the parent
to '/machine/unattached'.

However device_set_realized() may fail after that point at
several other points, leaving an unrealized object dangling
in '/machine/unattached', and as a result the caller of

obj = object_new()
obj->ref == 1
object_property_set_bool(obj,..., true, "realized",...)
obj->ref == 2
if (fail)
object_unref(obj);
obj->ref == 1

will get an object leak instead of the expected object destruction.

Fix it by making device_set_realized() clean up after itself
in case of failure.

Backports commit 69382d8b3e8600b349c191394d761dcb480502cf from qemu
2018-02-25 21:00:26 -05:00
Igor Mammedov 62c89b9cd4
exec: Reduce CONFIG_USER_ONLY ifdeffenery
Backports commit 1bc7e522d9cf1b58f2de9c8f1737be0bb5129c35 from qemu
2018-02-25 20:57:48 -05:00
Igor Mammedov d30410dc9a
target-i386: Add x86_cpu_unrealizefn()
First remove VCPU from exec loop and only then remove lapic.

Backports commit c884776e9dc947105827bd6c22192863f97267d2 from qemu
2018-02-25 20:54:13 -05:00
Igor Mammedov 298b0e6529
target-i386: Fix apic object leak when CPU is deleted
Backports commit 67e55caa6dcb91c80428cee6fe463f8dd8a755ab from qemu
2018-02-25 20:48:40 -05:00
Igor Mammedov e15fb246ab
target-i386: cpu: Do not ignore error and fix apic parent
object_property_add_child() silently fails with an error that it can't
create the duplicate property 'apic', as we already have an 'apic' property
registered for the 'apic' feature. As a result the generic device_realize puts
the apic into the unattached container.

As it's a programming error, abort if a name collision happens in the future,
and fix the property name for apic_state to 'lapic'; this way the apic is
a child of the cpu instance.

Backports commit 6816b1b3811e839540df22855d975b6d76ae438b from qemu
2018-02-25 20:47:46 -05:00
Paolo Bonzini 403021183d
target-i386: Add support for UMIP and RDPID CPUID bits
These are both stored in CPUID[EAX=7,ECX=0].ECX. KVM is going to
be able to emulate both (albeit with a performance loss in the case
of RDPID, which therefore will be in KVM_GET_EMULATED_CPUID rather
than KVM_GET_SUPPORTED_CPUID).

It's also possible to implement both in TCG, but this is for 2.8.

Backports commit c2f193b538032accb9db504998bf2ea7c0ef65af from qemu
2018-02-25 20:46:40 -05:00
Igor Mammedov 6714284211
target-i386: Add socket/core/thread properties to X86CPU
These properties will be used as the address at which to plug a
CPU with the help of the -device/device_add commands.

Backports commit d89c2b8b98e097b9cad5104b0f178bde1cfa011b from qemu
2018-02-25 20:45:35 -05:00
Igor Mammedov 2ac9df3633
target-i386: Replace custom apic-id setter/getter with static property
The custom apic-id setter/getter doesn't do any property-specific
checks anymore, so clean it up and use the more compact static
property DEFINE_PROP_UINT32 instead.

Backports commit 2da00e3176abac34ca7a6aab1f5bbb94a0d03fc5 from qemu
2018-02-25 20:44:18 -05:00
Igor Mammedov 0525a9c9fa
pc: cpu: Consolidate apic-id validity checks in pc_cpu_pre_plug()
Machine code knows about all possible APIC IDs, so use that
instead of a hack which does duplicate checks with O(n^2)
complexity, iterating over the global CPUs list.
As a result the duplicate check is done only once, with O(log n) complexity.

Backports commit 4ec60c76d5ab513e375f17b043d2b9cb849adf6c from qemu
2018-02-25 20:38:43 -05:00
Dr. David Alan Gilbert 9ee1a82185
target-i386: Set physical address bits based on host
Add the host-phys-bits boolean property; if true, take phys-bits
from the host's physical bits value, overriding either the default
or the user-specified value.

We can also use the value we read from the host to check the user's
explicitly set value and warn them if it doesn't match.

Note:
a) We only read the hosts value in KVM mode (because on non-x86
we get an abort if we try)
b) We don't warn about trying to use host-phys-bits in TCG mode,
we just fall back to the TCG default. This allows the machine
type to set the host-phys-bits flag if it wants and then to
work in both TCG and KVM.

Backports commit 11f6fee576680a2d482123535da920f8ceb33eb5 from qemu
2018-02-25 20:36:12 -05:00
Igor Mammedov 95cced34fb
pc: Add x86_topo_ids_from_apicid()
It's the reverse of apicid_from_topo_ids() and will be used in follow-up
patches to fill in data structures for query-hotpluggable-cpus and
for user-friendly error reporting.

Backports commit 9f3aab58539b4cc716e42e772be8116dc2e7d159 from qemu
2018-02-25 20:31:36 -05:00
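The decomposition itself is simple bit slicing. A hedged sketch, assuming the usual apicid = (pkg << (smt_bits + core_bits)) | (core << smt_bits) | smt layout, with the field widths passed in rather than derived from the machine topology as the real helper does:

#include <stdint.h>

typedef struct {
    unsigned pkg_id;
    unsigned core_id;
    unsigned smt_id;
} TopoIds;

static TopoIds topo_ids_from_apicid(uint32_t apicid,
                                    unsigned smt_bits, unsigned core_bits)
{
    TopoIds topo;

    topo.smt_id  = apicid & ((1u << smt_bits) - 1);
    topo.core_id = (apicid >> smt_bits) & ((1u << core_bits) - 1);
    topo.pkg_id  = apicid >> (smt_bits + core_bits);
    return topo;
}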
Igor Mammedov bc8dbd862d
target-i386: Use uint32_t for X86CPU.apic_id
Redo 9886e834 (target-i386: Require APIC ID to be explicitly set before
CPU realize) in another way that doesn't use int64_t to detect
if apic-id property has been set.

Use the fact that 0xFFFFFFFF is the broadcast
value that a CPU can't have, and set the default
uint32_t apic_id to it instead of using int64_t.

Later uint32_t apic_id will be used to drop custom
property setter/getter in favor of static property.

Backports commit d9c84f196970f78d4b55ab87e03cbcad7c65f86f from qemu
2018-02-25 20:30:31 -05:00
Dr. David Alan Gilbert 54851f7d74
target-i386: Fill high bits of mtrr mask
Fill the bits between 51..number-of-physical-address-bits in the
MTRR_PHYSMASKn variable range mtrr masks so that they're consistent
in the migration stream irrespective of the physical address space
of the source VM in a migration.

Backports commit fcc35e7ccaed771790940524f3b0eef7aebfc9b1 from qemu
2018-02-25 20:29:20 -05:00
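A sketch of the mask adjustment; treating bit 51 as the architectural upper bound is an assumption of this example:

#include <stdint.h>

static uint64_t mtrr_physmask_fill(uint64_t mask, unsigned phys_bits)
{
    uint64_t arch_mask = (1ULL << 52) - 1;          /* bits 0..51 */
    uint64_t host_mask = (1ULL << phys_bits) - 1;   /* bits 0..phys_bits-1 */

    /* Force the bits above the source's physical address width (up to
     * bit 51) to 1, so the migrated value no longer depends on it. */
    return mask | (arch_mask & ~host_mask);
}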
Dr. David Alan Gilbert 78254267ff
target-i386: Allow physical address bits to be set
Currently QEMU sets the x86 number of physical address bits to the
magic number 40. This is only correct on some small AMD systems;
Intel systems tend to have 36, 39, 46 bits, and large AMD systems
tend to have 48.

Having the value different from your actual hardware is detectable
by the guest and in principle can cause problems;
the current limit of 40 stops TB-sized VMs being created by those lucky
enough to have that much.

This patch lets you set the physical bits via a cpu property but
defaults to the same 40 bits, which matches TCG's setup.

I've removed the ancient warning about the 42 bit limit in exec.c;
I can't find that limit in there and no one else seems to know where
it is.

We use a magic value of 0 as the property default so that we can
later distinguish between the default and a user set value.

Backports commit af45907a132857cfd47acc998bf5f7c26cd13071 from qemu
2018-02-25 20:28:38 -05:00
Dr. David Alan Gilbert 7cb359cc19
target-i386: Provide TCG_PHYS_ADDR_BITS
Provide a constant for the number of address bits supported under TCG.

Backports commit 709787ee997f0a0ccab78e0edaf10d48929151ee from qemu
2018-02-25 20:23:25 -05:00
Eric Blake 23ab6d81f9
qapi: Implement boxed types for commands/events
Turn on the ability to pass command and event arguments in
a single boxed parameter, which must name a non-empty type
(although the type can be a struct with all optional members).
For structs, it makes it possible to pass a single qapi type
instead of a breakout of all struct members (useful if the
arguments are already in a struct or if the number of members
is large); for other complex types, it is now possible to use
a union or alternate as the data for a command or event.

The empty type may be technically feasible if needed down the
road, but it's easier to forbid it now and relax things to allow
it later, than it is to allow it now and have to special case
how the generated 'q_empty' type is handled (see commit 7ce106a9
for reasons why nothing is generated for the empty type). An
alternate type is never considered empty, but now that a boxed
type can be either an object or an alternate, we have to provide
a trivial QAPISchemaAlternateType.is_empty(). The new call to
arg_type.is_empty() during QAPISchemaCommand.check() requires
that we first check the type in question; but there is no chance
of introducing a cycle since objects do not refer back to commands.

We still have a split in syntax checking between ad-hoc parsing
up front (merely validates that 'boxed' has a sane value) and
during .check() methods (if 'boxed' is set, then 'data' must name
a non-empty user-defined type).

Generated code is unchanged, as long as no client uses the
new feature.

Backports commit c818408e449ea55371253bd4def1c1dc87b7bb03 from qemu
2018-02-25 20:22:03 -05:00
Eric Blake c65f056fbe
qapi: Plumb in 'boxed' to qapi generator lower levels
The next patch will add support for passing a qapi union type
as the 'data' of a command. But to do that, the user function
for implementing the command, as called by the generated
marshal command, must take the corresponding C struct as a
single boxed pointer, rather than a breakdown into one
parameter per member. Even without a union, being able to use
a C struct rather than a list of parameters can make it much
easier to handle coding with QAPI.

This patch adds the internal plumbing of a 'boxed' flag
associated with each command and event. In several cases,
this means adding indentation, with one new dead branch and
the remaining branch being the original code more deeply
nested; this was done so that the new implementation in the
next patch is easier to review without also being mixed with
indentation changes.

For this patch, no behavior or generated output changes, other
than the testsuite outputting the value of the new flag
(always False for now).

Backports commit 48825ca419fd9c8140d4fecb24e982d68ebca74f from qemu
2018-02-25 20:17:01 -05:00
Eric Blake 6ff318b839
qapi-event: Simplify visit of non-implicit data
Commit 7ce106a9 documented why we don't generated a visit_type_FOO()
for implicit types; and therefore events with an anonymous type for
'data' have to open-code a visit. Note that the open-coded visit in
qapi-event.c is slightly different from what is done in
qapi-visit.c for normal types, in part because we don't have to
check for *obj being NULL or free things on error. But where the
type is not implicit, it is nicer to reuse the normal visit instead
of open-coding a duplicate.

At the moment, the only event with a non-implicit 'data' is in the
testsuite, where test-qapi-event.c changes as follows:

|@@ -155,6 +155,7 @@ void qapi_event_send___org_qemu_x_event(
| __org_qemu_x_Struct param = {
| __org_qemu_x_member1, (char *)__org_qemu_x_member2, has_q_wchar_t, q_wchar_t
| };
|+ __org_qemu_x_Struct *arg = &param;
|
| emit = qmp_event_get_func_emit();
| if (!emit) {
|@@ -164,16 +165,7 @@ void qapi_event_send___org_qemu_x_event(
| qmp = qmp_event_build_dict("__ORG.QEMU_X-EVENT");
|
| v = qmp_output_visitor_new(&obj);
|-
|- visit_start_struct(v, "__ORG.QEMU_X-EVENT", NULL, 0, &err);
|- if (err) {
|- goto out;
|- }
|- visit_type___org_qemu_x_Struct_members(v, &param, &err);
|- if (!err) {
|- if (!err) {
|- visit_check_struct(v, &err);
|- }
|- visit_end_struct(v, NULL);
|+ visit_type___org_qemu_x_Struct(v, "__ORG.QEMU_X-EVENT", &arg, &err);
| if (err) {
| goto out;
| }

Backports commit 4d0b268fdb17a1fed10fe980e77fd388e5427bfd from qemu
2018-02-25 20:12:34 -05:00
Eric Blake b5220a6867
qapi: Drop useless gen_err_check()
Ever since commit 12f254f removed the last parameterization
of gen_err_check(), it no longer makes sense to hide the three
lines of generated C code behind a macro call. Just inline it
into the remaining users.

No change to generated code.

Backports commit fa274ed6fb788866ed3a2cfd54a2ddf78f04f2c0 from qemu
2018-02-25 20:10:45 -05:00
Eric Blake d7014c66df
qapi: Add type.is_empty() helper
In the near future, we want to lift our artificial restriction of
no variants at the top level of an event, at which point the
currently open-coded check for empty members will become
insufficient. Factor it out into a new helper method is_empty()
now, and future-proof it by checking variants, too, along with an
assert that it is not used prior to the completion of .check().
Update places that were checking for (non-)empty .members to use
the new helper.

All of the current callers assert that there are no variants (either
directly, or by qapi.py asserting that base types have no variants),
so this is not a semantic change.

No change to generated code.

Backports commit b6167706829c6e0d3572daa2b6769594ced276f7 from qemu
2018-02-25 20:07:43 -05:00
Eric Blake 4b39eaae33
qapi: Hide tag_name data member of variants
Clean up the only remaining external use of the tag_name field of
QAPISchemaObjectTypeVariants, by explicitly listing the generated
'type' tag for all variants in the testsuite (you can still tell
simple unions by the -wrapper types). Then we can mark the
tag_name field as private by adding a leading underscore to prevent
any further use.

Backports commit da9cb19385fc66b2cb2584bbbbcbf50246d057e2 from qemu
2018-02-25 20:06:15 -05:00
Eric Blake febeea5f4b
qapi: Special case c_name() for empty type
Commit 7ce106a rendered QAPISchemaObjectType.c_name() redundant,
since it now does nothing more than delegate to its superclass.
However, rather than deleting it, we can restore part of the
assertion that was removed in that commit, to prove that we never
emit the empty type directly in generated code, but rather
special-case it as a built-in that makes other aspects of code
generation easier to reason about.

Backports commit cd50a2564560986e865ff64fa73b59d2564076f0 from qemu
2018-02-25 20:05:16 -05:00
Eric Blake 8ccfff95fe
qapi: Require all branches of flat union enum to be covered
We were previously enforcing that all flat union branches were
found in the corresponding enum, but not that all enum values
were covered by branches. The resulting generated code would
abort() if the user passes the uncovered enum value.

We don't automatically treat non-present branches in a flat
union as empty types, for symmetry with simple unions (there,
the enum type is generated from the list of all branches, so
there is no way to omit a branch but still have it be part of
the union).

A later patch will add shorthand so that branches that are empty
in flat unions can be declared as 'branch':{} instead of
'branch':'Empty', to avoid the need for an otherwise useless
explicit empty type. [Such shorthand for simple unions is a bit
harder to justify, since we would still have to generate a
wrapper type that parses 'data':{}, rather than truly being an
empty branch with no additional siblings to the 'type' member.]

Backports commit d0b182392d0281ef780e3effcb82677a004f1f97 from qemu
2018-02-25 20:04:18 -05:00
Paolo Bonzini 674805745b
qapi: change QmpInputVisitor to QSLIST
This saves a lot of memory compared to a statically-sized array,
or at least 24kb could be considered a lot on an Atari ST.
It also makes the code more similar to QmpOutputVisitor.

This removes the limit on the depth of a QObject that can be processed
into a QAPI tree. This is not a problem because QObjects can be
considered trusted; the text received on the QMP wire is untrusted
input, but the JSON parser already takes pains to limit the QObject tree
it creates. We don't need the QMP input visitor to limit it again.

Backports commit 3d344c2aabb7bc9b414321e3c52872901edebdda from qemu
2018-02-25 20:02:09 -05:00
Paolo Bonzini b14f1d7a80
qapi: change QmpOutputVisitor to QSLIST
This saves a little memory compared to the doubly-linked QTAILQ.

Backports commit fc76ae8b38783e82c109834573ba5d6f080440b5 from qemu
2018-02-25 19:59:16 -05:00
Sergey Fedorov e39b9d0391
target-i386: Remove redundant HF_SOFTMMU_MASK
'HF_SOFTMMU_MASK' is only set when 'CONFIG_SOFTMMU' is defined. So
there's no need for this flag: test 'CONFIG_SOFTMMU' instead.

Backports commit da6d48e3348bbc266896cf8adf0c33f1eaf5b31f from qemu
2018-02-25 19:59:15 -05:00
Sergey Sorokin 4a904baaf5
target-arm: Add missing AArch32 TLBI system registers
Some PL2-related TLBI system registers are missing from the AArch32
implementation. The patch fixes that.

Backports commit 541ef8c2e73fb99d173b125bef7c262fdd2fe33c from qemu
2018-02-25 19:59:15 -05:00
Peter Lieven 799bf1c3a5
exec: avoid realloc in phys_map_node_reserve
This is the first step in reducing the brk heap fragmentation
created by the map->nodes memory allocation. Since the introduction
of RCU, the freeing of the PhysPageMaps is delayed, so that sometimes
several hundred are allocated at the same time.

Even worse, the memory for map->nodes is allocated and shortly
afterwards reallocated. Since the number of nodes it grows
to in the end is the same for all PhysPageMaps, remember this value
and at least avoid the reallocation.

The large number of simultaneous allocations (about 450 x 70kB in
my configuration) has to be addressed later.

Backports commit 101420b886eec36990419bc9ed5b503622af8a0d from qemu
2018-02-25 19:32:40 -05:00
Paolo Bonzini a47c68164d
compiler: never omit assertions if using a static analysis tool
Assertions help both Coverity and the clang static analyzer avoid
false positives, but on the other hand both are confused when
the condition is compiled as (void)(x != FOO). Always expand
assertion macros when using Coverity or clang, through a new
QEMU_STATIC_ANALYSIS preprocessor symbol.

This fixes a couple false positives in TCG.

Backports commit 8bff06a0bbf257a2083223534c1607bf87d913e6 from qemu
2018-02-25 19:19:28 -05:00
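The mechanism looks roughly like this; the MY_* names are illustrative, not the actual macros:

#include <stdlib.h>

/* Both analyzers identify themselves with a predefined macro. */
#if defined(__COVERITY__) || defined(__clang_analyzer__)
#define MY_STATIC_ANALYSIS 1
#endif

#ifdef MY_STATIC_ANALYSIS
/* Keep the assertion as a real branch ending in abort(), so the
 * analyzer can assume the condition holds on the fall-through path. */
#define my_assert(cond)  ((cond) ? (void)0 : abort())
#else
/* A build might otherwise reduce the check to something like
 * (void)((cond) != 0), which the analyzers cannot reason about. */
#define my_assert(cond)  ((void)((cond) != 0))
#endif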
Vijay 5608b53b6f
target-arm: Use Neon for zero checking
Use Neon instructions to perform zero checking of
the buffer. This helps in reducing total migration time.

Use case: Idle VM live migration with 4 VCPUS and 8GB ram
running CentOS 7.

Without Neon, the Total migration time is 3.5 Sec

Migration status: completed
total time: 3560 milliseconds
downtime: 33 milliseconds
setup: 5 milliseconds
transferred ram: 297907 kbytes
throughput: 685.76 mbps
remaining ram: 0 kbytes
total ram: 8519872 kbytes
duplicate: 2062760 pages
skipped: 0 pages
normal: 69808 pages
normal bytes: 279232 kbytes
dirty sync count: 3

With Neon, the total migration time is 2.9 Sec

Migration status: completed
total time: 2960 milliseconds
downtime: 65 milliseconds
setup: 4 milliseconds
transferred ram: 299869 kbytes
throughput: 830.19 mbps
remaining ram: 0 kbytes
total ram: 8519872 kbytes
duplicate: 2064313 pages
skipped: 0 pages
normal: 70294 pages
normal bytes: 281176 kbytes
dirty sync count: 3

Backports commit 7069532e3b944c25707d4f69998e68a739eabff9 from qemu
2018-02-25 19:17:38 -05:00
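The core of such a check with NEON intrinsics looks like the sketch below; it assumes a 16-byte-aligned buffer whose length is a multiple of 16, and leaves out the head/tail and prefetch handling a real implementation needs:

#include <arm_neon.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

static bool buffer_is_zero_neon(const uint8_t *buf, size_t len)
{
    uint8x16_t acc = vdupq_n_u8(0);

    for (size_t i = 0; i < len; i += 16) {
        /* OR every 16-byte chunk into the accumulator. */
        acc = vorrq_u8(acc, vld1q_u8(buf + i));
    }

    /* The buffer was all zeroes iff no bit was ever set. */
    uint64x2_t acc64 = vreinterpretq_u64_u8(acc);
    return (vgetq_lane_u64(acc64, 0) | vgetq_lane_u64(acc64, 1)) == 0;
}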
Richard Henderson d17dc29d2e
target-sparc: Elide duplicate updates to fprs
Backports commit f9c816c00cf4242542472ae6b2a579b11b7e86f1 from qemu
2018-02-25 19:14:59 -05:00
Richard Henderson 2215ef7e21
target-sparc: Use cpu_loop_exit_restore from helper_check_ieee_exceptions
This avoids needing to save state before every FP operation.

Backports commit 02c79d78853f07d519b3272d06e43041eb4a4105 from qemu
2018-02-25 19:12:36 -05:00
Richard Henderson 524e4af5ca
target-sparc: Use cpu_fsr in stfsr
Backports commit ba2397d1ca6546e8cf5bd9e2939923546ac3091a from qemu
2018-02-25 19:10:27 -05:00
Lioncash 17c54e2702
header_gen: alphabetize general symbols 2018-02-25 19:07:20 -05:00
Lioncash 4b8cae3f61
header_gen: alphabetize ARM symbols 2018-02-25 19:00:31 -05:00
Lioncash fa10382007
header_gen: alphabetize aarch64 symbols 2018-02-25 19:00:01 -05:00
Lioncash 3f8802fcf5
header_gen: alphabetize MIPS symbols 2018-02-25 18:59:49 -05:00
Richard Henderson 12eecc4939
target-sparc: Use explicit writes to cpu_fsr
By arranging for explicit writes to cpu_fsr after floating point
operations, we are able to mark the helpers as not writing to
tcg globals, which means that we don't need to invalidate the
integer register set across said calls.

Backports commit 7385aed20db5d83979f683b9d0048674411e963c from qemu
2018-02-25 18:55:07 -05:00
Richard Henderson 2e24c09db3
target-sparc: Remove helper_ldf_asi, helper_stf_asi
We've now implemented all fp asis inline, except for the no-fault
memory reads. The latter can be passed directly to helper_ld_asi.

Backports commit f2fe396f0fae6b389169f65abf294df9ae6cfee5 from qemu
2018-02-25 18:32:35 -05:00
Richard Henderson a921273a6c
target-sparc: Directly implement block and short ldf/stf asis
Backports commit ca5ce5723fb1ee3445f690004f63c209c15fb813 from qemu
2018-02-25 18:27:52 -05:00
Richard Henderson 333d88c9e6
target-sparc: Directly implement easy ldf/stf asis
Backports commit 7705091ca4a20c8c2d20e2af5d0a1bcb17296657 from qemu
2018-02-25 18:23:45 -05:00
Richard Henderson 9d47cda44c
target-sparc: Pass TCGMemOp constants to helper_ld/st_asi
Reduces the argument count for helper_ld_asi; do helper_st_asi
for consistency.

Backports commit 6850811e7c56403b0d225a1bffd096abf2ff06f9 from qemu
2018-02-25 18:19:42 -05:00
Richard Henderson 950aa89c7a
target-sparc: Fix obvious error in ASI_M_BFILL
Backports commit c095b83f9836cef80f64b32603fea240762a824b from qemu
2018-02-25 18:08:40 -05:00
Richard Henderson eb285aa281
target-sparc: Directly implement easy ldd/std asis
Backports commit e4dc0052a40d3e7b00ca0b008f345e2ed644aa20 from qemu
2018-02-25 18:07:51 -05:00
Richard Henderson 1ed7df7720
target-sparc: Introduce gen_check_align
Backports commit 35e94905ce4b39b358a673995f9bee11f46ec8be from qemu
2018-02-25 17:59:47 -05:00
Richard Henderson cef4ae5ca8
target-sparc: Use QT0 to return results from ldda
Also implement a few more twinx asis.

Backports commit 3f4288ebf6fca7b266fa42a74d9d99b961ba6844 from qemu
2018-02-25 17:56:08 -05:00
Richard Henderson 9e402493a9
target-sparc: Directly implement easy ld/st asis
Backports commit f0913be04be13cfb4f9341ae79e035fc8479fd28 from qemu
2018-02-25 17:49:16 -05:00
Richard Henderson e2d0ee1286
target-sparc: Use defines from asi.h
Backports commit 0cc1f4bf76a20c7fee0bab5c9bba9ad7302198b5 from qemu
2018-02-25 17:44:36 -05:00
Richard Henderson bd3b7a2537
target-sparc: Add UA2005 defines to asi.h
Backports commit 1d854963ea340855efe3f8a5b99c95a75bd717ae from qemu
2018-02-25 17:32:46 -05:00
Richard Henderson b9a65e0e79
target-sparc: Import linux/arch/sparc/include/uapi/asm/asi.h
Copied from tag v4.2, 64291f7db5bd8150a74ad2036f1037e6a0428df2.

Backports commit 68a03b8c8853c66724c6f200af3f821ae0d7e934 from qemu
2018-02-25 17:29:51 -05:00
Richard Henderson c509a5562d
target-sparc: Pass TCGMemOp to gen_ld/st_asi
Backports commit 1d65b0f5bb8f32500bbce09d922d226bb7cf4c68 from qemu
2018-02-25 17:26:34 -05:00
Richard Henderson 4bc53f223c
target-sparc: Introduce get_asi
Replace gen_get_asi, and use it for both 32-bit and 64-bit.
For v8, do supervisor and immediate checks here.

Also, move save_state and TB ending into the respective
subroutines, out of disas_sparc_insn.

Backports commit 7ec1e5ea4bd0700fa48da86bffa2fcc6146c410a from qemu
2018-02-25 17:23:20 -05:00
Richard Henderson 1dcd14d434
target-sparc: Store %asi in TB flags
Knowing the value of %asi at translation time means that we
can handle the common settings without a function call.

The steady state appears to be %asi == ASI_P, so that sparcv9
code can use offset forms of lda/sta. The %asi register gets
pushed and popped on entry to certain functions, but it rarely
takes on values other than ASI_P or ASI_AIUP. Therefore we're
unlikely to be expanding the set of TBs created.

Backports commit a6d567e523ed7e928861f3caa5d49368af3f330d from qemu
2018-02-25 05:17:21 -05:00
Richard Henderson 080281bc9c
target-sparc: Unify asi handling between 32 and 64-bit
We now have a single copy of gen_ld_asi, gen_st_asi,
gen_swap_asi, and everything uses gen_get_asi.

Backports commit 22e700607aeaff5f5e139d0fdc3d861e5502040c from qemu
2018-02-25 05:11:52 -05:00
Richard Henderson 847d65258b
target-sparc: Create gen_exception
This unifies quite a few duplicate code fragments.

Backports commit 4fbe00679000f9fd0c509c2d548d957b08ec6057 from qemu
2018-02-25 04:55:16 -05:00
Richard Henderson 39d1657fc3
target-sparc: Store mmu index in TB flags
Doing this instead of saving the raw PS_PRIV and TL. This means
that all nucleus mode TBs (TL > 0) can be shared. This fixes a
bug in that we didn't include HS_PRIV in the TB flags, and so could
produce incorrect TB matches for hypervisor state.

The LSU and DMMU states were unused by the translator. Including
them in TB flags meant unnecessary mismatches from tb_find_fast.

Backports commit 99a230638a3674e921224dbe628159c867d734b1 from qemu
2018-02-25 04:51:50 -05:00
Richard Henderson 395e00cdc5
target-sparc: Remove softint as a TCG global
The global is only ever read for one insn; we can just as well
use a load from env instead and generate the same code. This
also allows us to indicate that the associated helpers do not
touch TCG globals.

Backports commit e86ceb0d652baa5738e05a59ee0e7989dafbeaa1 from qemu
2018-02-25 04:49:27 -05:00
Richard Henderson dcd1d6f8ce
target-sparc: Mark more flags for helpers
Quite a few helpers do not modify tcg globals but did not so indicate.

Backports commit be72f9fcca742c5e9a949f5eac901ed6cc26a2a0 from qemu
2018-02-25 04:28:54 -05:00
Markus Armbruster c2ffbc575d
Clean up decorations and whitespace around header guards
Cleaned up with scripts/clean-header-guards.pl.

Backports commit 175de52487ce0b0c78daa4cdf41a5a465a168a25 from qemu
2018-02-25 04:26:02 -05:00
Markus Armbruster 1275b9b459
Clean up ill-advised or unusual header guards
Cleaned up with scripts/clean-header-guards.pl.

Backports commit 2a6a4076e117113ebec97b1821071afccfdfbc96 from qemu
2018-02-25 04:22:46 -05:00
Markus Armbruster 9ae2fc4d9e
Clean up header guards that don't match their file name
Header guard symbols should match their file name to make guard
collisions less likely. Offenders found with
scripts/clean-header-guards.pl -vn.

Cleaned up with scripts/clean-header-guards.pl, followed by some
renaming of new guard symbols picked by the script to better ones.

Backports commit 121d07125bb6d7079c7ebafdd3efe8c3a01cc440 from qemu
2018-02-25 04:18:42 -05:00
Markus Armbruster 25ec9ab016
tcg: Clean up tcg-target.h header guards
These use guard symbols like TCG_TARGET_$target.
scripts/clean-header-guards.pl doesn't like them because they don't
match their file name (they should, to make guard collisions less
likely).

Clean them up: use guard symbol $target_TCG_TARGET_H for
tcg/$target/tcg-target.h.

Backports commit 14e54f8ecfe9c5e17348f456781344737ed10b3b from qemu
2018-02-25 04:15:08 -05:00
Markus Armbruster 2b65f98538
target-*: Clean up cpu.h header guards
Most of them use guard symbols like CPU_$target_H, but we also have
__MIPS_CPU_H__ and __TRICORE_CPU_H__. They all upset
scripts/clean-header-guards.pl.

The script dislikes CPU_$target_H because they don't match their file
name (they should, to make guard collisions less likely). The others
are reserved identifiers.

Clean them all up: use guard symbol $target_CPU_H for
target-$target/cpu.h.

Backports commit 07f5a258750b3b9a6e10fd5ec3e29c9a943b650e from qemu
2018-02-25 04:12:46 -05:00
Markus Armbruster 60e8836b74
Use #include "..." for our own headers, <...> for others
Tracked down with an ugly, brittle and probably buggy Perl script.

Also move includes converted to <...> up so they get included before
ours where that's obviously okay.

Backports commit a9c94277f07d19d3eb14f199c3e93491aa3eae0e from qemu
2018-02-25 04:10:33 -05:00
Peter Maydell f6f843b4d4
bswap.h: Document cpu_to_* and *_to_cpu conversion functions
Add a documentation comment describing the functions for
converting between the cpu and little or bigendian formats.

Backports commit 7d820b766a2049f33ca7e078aa51018f2335f8c5 from qemu
2018-02-25 04:06:28 -05:00
Peter Maydell 1d7f813942
bswap.h: Remove unused cpu_to_*w() and *_to_cpup()
Now that all uses of cpu_to_*w() and *_to_cpup() have been replaced
with either ld*_p()/st*_p() or by doing direct dereferences and
using the cpu_to_*()/*_to_cpu() byteswap functions, we can remove
the unused implementations.

Backports commit f76bde702916d0230bf359d478bcac8d7f3b30ae from qemu
2018-02-25 04:04:46 -05:00
Sergey Sorokin d1e4ac0451
Fix confusing argument names in some common functions
There are functions tlb_fill(), cpu_unaligned_access() and
do_unaligned_access() that are called with access type and mmu index
arguments. But these arguments are named 'is_write' and 'is_user' in their
declarations. The patch fixes the argument names to avoid confusion.

Backports commit b35399bb4e9968296a12303b00f9f2066470e987 from qemu
2018-02-25 03:58:27 -05:00
Leon Alrae a465707a47
target-mips: enable 10-bit ASIDs in I6400 CPU
Backports commit cdc46fab07a122dfcc8a1054510a68d936ae3440 from qemu
2018-02-25 03:50:58 -05:00
Paul Burton 002b392a15
target-mips: support CP0.Config4.AE bit
The read-only Config4.AE bit, when set, denotes an extended 10-bit ASID.

Backports commit a0c8060841f2d56fb3504292c18522b957972e4c from qemu
2018-02-25 03:49:36 -05:00
Paul Burton ba4dcc8c2f
target-mips: change ASID type to hold more than 8 bits
ASID currently has the uint8_t type, which is too small since some processors
support ASIDs wider than 8 bits. Therefore change its type to uint16_t.

Backports commit 2d72e7b047d800c9f99262466f65a98684ecca14 from qemu
2018-02-25 03:48:10 -05:00
Paul Burton ac27c881ff
target-mips: add ASID mask field and replace magic values
Backports commit 6ec98bd7b64ad75870c8e9d87a90fcd1a64b4942 from qemu
2018-02-25 03:44:26 -05:00
Leon Alrae 7e589c117b
target-mips: replace MIPS64R6-generic with the real I6400 CPU model
MIPS64R6-generic gradually gets closer to I6400 CPU, feature-wise. Rename
it to make it clear which MIPS processor it is supposed to emulate.

Backports commit 8f95ad1c79b4166350b982a6defe0e21faa04dac from qemu
2018-02-25 03:35:55 -05:00
Leon Alrae c0b3938b88
target-mips: add exception base to MIPS CPU
Replace hardcoded 0xbfc00000 with exception_base which is initialized with
this default address so there is no functional change here.
However, it is now exposed and consequently it will be possible to modify
it from outside of the CPU.

Backports commit 89777fd10fc3dd573c3b4d1b2efdd10af823c001 from qemu
2018-02-25 03:22:10 -05:00
Stanislav Shmarov 6f20d35cd1
translate-all: Fix user-mode self-modifying code in 2 page long TB
In user-mode emulation a Translation Block can consist of 2 guest pages.
In that case QEMU also mprotects 2 host pages that are dedicated for
guest memory, containing instructions. QEMU detects self-modifying code
with SEGFAULT signal processing.

If an instruction in the 1st page modifies memory in the 2nd
page (or vice versa), QEMU will mark the 2nd page with PAGE_WRITE,
invalidate the TB, generate a new TB containing 1 guest instruction and
exit to the CPU loop. QEMU won't call mprotect, and the new TB will cause
the same SEGFAULT. The page will have both PAGE_WRITE_ORG and PAGE_WRITE
flags, so QEMU will handle the signal as a guest binary problem,
and exit with a guest SEGFAULT.

The solution is the following: if the current TB was invalidated,
continue to invalidate TBs from the remaining guest pages and mark the pages
as PAGE_WRITE. After that, disable host page protection with mprotect.
If the current TB was invalidated, longjmp to the main loop. That is more
efficient, since we won't get a SEGFAULT when executing the new TB.

Backports commit 7399a337e4126f7c8c8af3336726f001378c4798 from qemu
2018-02-25 03:14:22 -05:00
Samuel Damashek 670d81367b
cputlb: Fix for self-modifying writes across page boundaries
As it currently stands, QEMU does not properly handle self-modifying code
when the write is unaligned and crosses a page boundary. The procedure
for handling a write to the current translation block is to write-protect
the current translation block, catch the write, split up the translation
block into the current instruction (which remains write-protected so that
the current instruction is not modified) and the remaining instructions
in the translation block, and then restore the CPU state to before the
write occurred so the write will be retried and successfully executed.
However, since unaligned writes across pages are split into one-byte
writes for simplicity, writes to the second page (which is not the
current TB) may succeed before a write to the current TB is attempted,
and since these writes are not invalidated before resuming state after
splitting the TB, these writes will be performed a second time, thus
corrupting the second page. Credit goes to Patrick Hulin for
discovering this.

In recent 64-bit versions of Windows running in emulated mode, this
results in either being very unstable (a BSOD after a couple minutes of
uptime), or being entirely unable to boot. Windows performs one or more
8-byte unaligned self-modifying writes (xors) which intersect the end
of the current TB and the beginning of the next TB, which runs into the
aforementioned issue. This commit fixes that issue by making the
unaligned write loop perform the writes in forwards order, instead of
reverse order. This way, QEMU immediately tries to write to the current
TB, and splits the TB before any write to the second page is executed.
The write then proceeds as intended. With this patch applied, I am able
to boot and use Windows 7 64-bit and Windows 10 64-bit in QEMU without
KVM.

Per Richard Henderson's input, this patch also ensures the second page
is in the TLB before executing the write loop, to ensure the second
page is mapped.

The original discussion of the issue is located at
http://lists.nongnu.org/archive/html/qemu-devel/2014-08/msg02161.html.

Backports commit 81daabaf7a572f138a8b88ba6eea556bdb0cce46 from qemu
2018-02-25 03:12:11 -05:00
Samuel Damashek 04c423b081
cputlb: Add address parameter to VICTIM_TLB_HIT
Backports commit a390284b80d2b6581143cdb40666674e60e635ae from qemu
2018-02-25 03:03:36 -05:00
Richard Henderson 9e2422032a
cputlb: Move VICTIM_TLB_HIT out of line
There are currently 22 invocations of this function,
and we're about to increase that number.

Backports commit 7e9a7c50d9a400ef51242d661a261123c2cc9485 from qemu
2018-02-25 02:58:47 -05:00
Haozhong Zhang 2893a1c381
target-i386: Publish advised value of MSR_IA32_FEATURE_CONTROL via fw_cfg
It's a prerequisite that certain bits of MSR_IA32_FEATURE_CONTROL should
be set before some features (e.g. VMX and LMCE) can be used, which is
usually done by the firmware. This patch adds a fw_cfg file
"etc/msr_feature_control" which contains the advised value of
MSR_IA32_FEATURE_CONTROL and can be used by guest firmware (e.g. SeaBIOS).

Backports commit 217f1b4a72153cf8d556e9d45919e9222c38d25e from qemu
2018-02-25 02:49:42 -05:00
Ashok Raj b58f1fccce
target-i386: kvm: Add basic Intel LMCE support
This patch adds the support to inject SRAR and SRAO as LMCE, i.e. they
are injected to only one VCPU rather than broadcast to all VCPUs. As KVM
reports LMCE support on Intel platforms, this feature is only available
on Intel platforms.

LMCE is disabled by default and can be enabled/disabled by cpu option
'lmce=on/off'.

Backports commit 87f8b626041ceaea9adcfdbd549359f0ca7b871d from qemu
2018-02-25 02:48:22 -05:00
Evgeny Yakovlev 49fdd75329
target-i386: Report hyperv feature words through qom
This change adds hyperv feature words report through qom rpc.

When VM is configured with hyperv features enabled
libvirt will check that required feature words are set
in cpuid leaf 40000003 through qom request.

Currently qemu does not report hyperv feature words
which prevents windows guests from starting with libvirt.

To avoid conflicting with current hyperv properties all added feature
words cannot be set directly with -cpu +feature yet.

Backports commit c35bd19a5c9140bce8b913cc5cefe6f071135bdb from qemu
2018-02-25 02:45:20 -05:00
Paolo Bonzini f39cc9e3b9
target-i386: Avoid using locals outside their scope
x86_cpu_parse_featurestr has a "val = num;" assignment just before num
goes out of scope. Push num up to fix the issue.

Backports commit cf2887c9738451eb989c6c102af070dee2dc172a from qemu
2018-02-25 02:30:06 -05:00
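The bug class, reduced to a standalone example (the real function differs; this only shows why pushing the buffer up a scope matters):

#include <stdio.h>

static void parse_buggy(int level)
{
    const char *val = NULL;

    if (level >= 0) {
        char num[32];                      /* block-scoped buffer */
        snprintf(num, sizeof(num), "%d", level);
        val = num;                         /* num's lifetime ends at the brace */
    }
    printf("%s\n", val ? val : "(none)");  /* may read a dead object */
}

static void parse_fixed(int level)
{
    const char *val = NULL;
    char num[32];                          /* pushed up to the outer scope */

    if (level >= 0) {
        snprintf(num, sizeof(num), "%d", level);
        val = num;                         /* now points at live storage */
    }
    printf("%s\n", val ? val : "(none)");
}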
Paolo Bonzini 1be92ac243
target-i386: TCG can support CPUID.07H:EBX.erms
ERMS just says "rep movsb" and "rep stosb" are fast. It does not
imply any new instruction, so we can support it easily.

Backports commit 7eb24386dbfb0b66464c7f856c1074c606efccda from qemu
2018-02-25 02:29:00 -05:00
Igor Mammedov 78f9128dbb
target-sparc: Use sparc_cpu_parse_features() directly
Make SPARC target use sparc_cpu_parse_features() directly
so it won't get in the way of switching other propertified
targets to handling features as global properties.

Backports commit fb02d56e96d553088c5b4267a3c954a3e952a50a from qemu
2018-02-25 02:27:23 -05:00
Sergey Sorokin e4d123caa9
tcg: Improve the alignment check infrastructure
Some architectures (e.g. ARMv8) need addresses to be aligned to a size
greater than the size of the memory access.
The current costless alignment check implementation in QEMU is almost
enough to support such a check, but we also need to support
specifying an alignment size.

Backports commit 1f00b27f17518a1bcb4cedca49eaec96a4d560bd from qemu
2018-02-25 02:23:28 -05:00
Richard Henderson 23586e2674
tcg: Optimize spills of constants
While we can store constants via constraints on INDEX_op_st_i32 et al,
we weren't able to spill constants to backing store.

Add a new backend interface, tcg_out_sti, which may store the constant
(and is allowed to fail). Rearrange the temp_* helpers so that we only
attempt to directly store a constant when the temp is becoming dead/free.

Backports commit 59d7c14eeff8d2ad7f61aed86ce5a176113bc153 from qemu
2018-02-25 01:45:29 -05:00
Richard Henderson 64fda683b1
tcg: Fix name for high-half register 2018-02-25 01:36:35 -05:00
Lioncash 532f840dc3
qapi: Add new clone visitor
We have a couple places in the code base that want to deep-clone
one QAPI object into another, and they were resorting to serializing
the struct out to QObject then reparsing it. A much more efficient
version can be done by adding a new clone visitor.

Since cloning is still relatively uncommon, expose the use of the
new visitor via a QAPI_CLONE() macro that takes care of type-punning
the underlying function pointer, rather than generating lots of
unused functions for types that won't be cloned. And yes, we're
relying on the compiler treating all pointers equally, even though
a strict C program cannot portably do so - but we're not the first
one in the qemu code base to expect it to work (hello, glib!).

The choice of adding a fourth visitor type deserves some explanation.
On the surface, the clone visitor is mostly an input visitor (it
takes arbitrary input - in this case, another QAPI object - and
creates a new QAPI object during the course of the visit). But
ever since commit da72ab0 consolidated enum visits based on the
visitor type, using VISITOR_INPUT would cause us to run
visit_type_str(), even though for cloning there is nothing to do
(we just copy the enum value across, without regards to its mapping
to strings). Also, since our input happens to be a QAPI object,
we can also satisfy the internal checks for VISITOR_OUTPUT. So in
the end, I settled with a new VISITOR_CLONE, and chose its value
such that many internal checks can use 'v->type & mask', sticking
to 'v->type == value' where the difference matters.

Note that we can only clone objects (including alternates) and lists,
not built-ins or enums. The visitor core hides integer width from
the actual visitor (since commit 04e070d), and as long as that's the
case, we can't clone top-level integers. Then again, those can
always be cloned by direct copy, since they are not objects with
deep pointers, so it's no real loss. And restricting cloning to
just objects and lists is cleaner than restricting it to non-integers.
As such, I documented that the clone visitor is for direct use only
by code internal to QAPI, and should not be used on incomplete objects
(other than a hack to work around the fact that we allow NULL in place
of "" in visit_type_str() in other output visitors). Note that as
written, the clone visitor will never fail on a complete object.

Scalars (including enums) not at the root of the clone copy just fine
with no additional effort while visiting the scalar, by virtue of a
g_memdup() each time we push another struct onto the stack. Cloning
a string requires deduplication of a pointer, which means it can also
provide the guarantee of an input visitor of never producing NULL
even when still accepting NULL in place of "" the way the QMP output
visitor does.

Cloning an 'any' type could be possible by incrementing the QObject
refcnt, but it's not obvious whether that is better than implementing
a QObject deep clone. So for now, we document it as unsupported,
and intentionally omit the .type_any() callback to let a developer
know their usage needs implementation.

Add testsuite coverage for several different clone situations, to
ensure that the code is working. I also tested that valgrind was
happy with the test.

Backports commit a15fcc3cf69ee3d408f60d6cc316488d2b0249b4 from qemu
2018-02-25 01:34:12 -05:00
Eric Blake 85af4b2030
qapi: Add new visit_complete() function
Making each output visitor provide its own output collection
function was the only remaining reason for exposing visitor
sub-types to the rest of the code base. Add a polymorphic
visit_complete() function which is a no-op for input visitors,
and which populates an opaque pointer for output visitors. For
maximum type-safety, also add a parameter to the output visitor
constructors with a type-correct version of the output pointer,
and assert that the two uses match.

This approach was considered superior to either passing the
output parameter only during construction (action at a distance
during visit_free() feels awkward) or only during visit_complete()
(defeating type safety makes it easier to use incorrectly).

Most callers were function-local, and therefore a mechanical
conversion; the testsuite was a bit trickier, but the previous
cleanup patch minimized the churn here.

The visit_complete() function may be called at most once; doing
so lets us use transfer semantics rather than duplication or
ref-count semantics to get the just-built output back to the
caller, even though it means our behavior is not idempotent.

Generated code is simplified as follows for events:

|@@ -26,7 +26,7 @@ void qapi_event_send_acpi_device_ost(ACP
| QDict *qmp;
| Error *err = NULL;
| QMPEventFuncEmit emit;
|- QmpOutputVisitor *qov;
|+ QObject *obj;
| Visitor *v;
| q_obj_ACPI_DEVICE_OST_arg param = {
| info
|@@ -39,8 +39,7 @@ void qapi_event_send_acpi_device_ost(ACP
|
| qmp = qmp_event_build_dict("ACPI_DEVICE_OST");
|
|- qov = qmp_output_visitor_new();
|- v = qmp_output_get_visitor(qov);
|+ v = qmp_output_visitor_new(&obj);
|
| visit_start_struct(v, "ACPI_DEVICE_OST", NULL, 0, &err);
| if (err) {
|@@ -55,7 +54,8 @@ void qapi_event_send_acpi_device_ost(ACP
| goto out;
| }
|
|- qdict_put_obj(qmp, "data", qmp_output_get_qobject(qov));
|+ visit_complete(v, &obj);
|+ qdict_put_obj(qmp, "data", obj);
| emit(QAPI_EVENT_ACPI_DEVICE_OST, qmp, &err);

and for commands:

| {
| Error *err = NULL;
|- QmpOutputVisitor *qov = qmp_output_visitor_new();
| Visitor *v;
|
|- v = qmp_output_get_visitor(qov);
|+ v = qmp_output_visitor_new(ret_out);
| visit_type_AddfdInfo(v, "unused", &ret_in, &err);
|- if (err) {
|- goto out;
|+ if (!err) {
|+ visit_complete(v, ret_out);
| }
|- *ret_out = qmp_output_get_qobject(qov);
|-
|-out:
| error_propagate(errp, err);

Backports commit 3b098d56979d2f7fd707c5be85555d114353a28d from qemu
2018-02-25 01:20:03 -05:00
Eric Blake ec53301cda
qmp-output-visitor: Favor new visit_free() function
Now that we have a polymorphic visit_free(), we no longer need
qmp_output_visitor_cleanup(); however, we still need to
expose the subtype for qmp_output_get_qobject().

Backports commit 1830f22a6777cedaccd67a08f675d30f7a85ebfd from qemu
2018-02-25 01:12:27 -05:00
Eric Blake f008d93ac0
qmp-input-visitor: Favor new visit_free() function
Now that we have a polymorphic visit_free(), we no longer need
qmp_input_visitor_cleanup(); which in turn means we no longer
need to return a subtype from qmp_input_visitor_new() nor a
public upcast function.

Generated code changes to qmp-marshal.c look like:

|@@ -52,11 +52,10 @@ void qmp_marshal_add_fd(QDict *args, QOb
| {
| Error *err = NULL;
| AddfdInfo *retval;
|- QmpInputVisitor *qiv = qmp_input_visitor_new(QOBJECT(args), true);
| Visitor *v;
| q_obj_add_fd_arg arg = {0};
|
|- v = qmp_input_get_visitor(qiv);
|+ v = qmp_input_visitor_new(QOBJECT(args), true);
| visit_start_struct(v, NULL, NULL, 0, &err);
| if (err) {
| goto out;

Backports commit b70ce1018a251c0c33498d9c927a07cade655a5e from qemu
2018-02-25 01:10:53 -05:00
Eric Blake e88a7e260b
string-input-visitor: Favor new visit_free() function
Now that we have a polymorphic visit_free(), we no longer need
string_input_visitor_cleanup(); which in turn means we no longer
need to return a subtype from string_input_visitor_new() nor a
public upcast function.

Backports commit 7a0525c7be6b38d32d586e3fd12e7377ded21faa from qemu
2018-02-25 01:08:04 -05:00
Eric Blake 7f741a6c9b
qapi: Add new visit_free() function
Making each visitor provide its own (awkwardly-named) FOO_cleanup()
is unusual, when we can instead have a polymorphic visit_free()
interface. Over the next few patches, we can use the polymorphic
functions to eliminate the need for a FOO_get_visitor() function
for accessing specific visitor functionality, once everything can
be accessed directly through the Visitor* interfaces.

The dealloc visitor is the first one converted to completely use
the new entry point, since qapi_dealloc_visitor_cleanup() was the
only reason that qapi_dealloc_get_visitor() existed, and only
generated and testsuite code was even using it. With the new
visit_free() entry point in place, we no longer need to expose
the QapiDeallocVisitor subtype through qapi_dealloc_visitor_new(),
and can get by with less generated code, with diffs that look like:

| void qapi_free_ACPIOSTInfo(ACPIOSTInfo *obj)
| {
|- QapiDeallocVisitor *qdv;
| Visitor *v;
|
| if (!obj) {
| return;
| }
|
|- qdv = qapi_dealloc_visitor_new();
|- v = qapi_dealloc_get_visitor(qdv);
|+ v = qapi_dealloc_visitor_new();
| visit_type_ACPIOSTInfo(v, NULL, &obj, NULL);
|- qapi_dealloc_visitor_cleanup(qdv);
|+ visit_free(v);
|}

Backports commit 2c0ef9f411ae6081efa9eca5b3eab2dbeee45a6c from qemu
2018-02-25 01:05:41 -05:00
Eric Blake 37ae4dfdfd
qapi: Add parameter to visit_end_*
Rather than making the dealloc visitor track a stack of pointers
remembered during visit_start_* in order to free them during
visit_end_*, it's a lot easier to just make all callers pass the
same pointer to visit_end_*. The generated code has access to the
same pointer, while all other users are doing virtual walks and
can pass NULL. The dealloc visitor is then greatly simplified.

All three visit_end_*() functions intentionally take a void**,
even though the visit_start_*() functions differ between void**,
GenericList**, and GenericAlternate**. This is done for several
reasons: when doing a virtual walk, passing NULL doesn't care
what the type is, but when doing a generated walk, we already
have to cast the caller's specific FOO* to call visit_start,
while using void** lets us use visit_end without a cast. Also,
an upcoming patch will add a clone visitor that wants to use
the same implementation for all three visit_end callbacks,
which is made easier if all three share the same signature.

For visitors that already track per-object state (the QMP visitors
via a stack, and the string visitors which do not allow nesting),
add an assertion that the caller is indeed passing the same
pointer to paired calls.
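
Concretely, a paired walk after this patch looks like this (a sketch;
Foo is a placeholder type and error handling is elided):

Foo *obj = NULL;

visit_start_struct(v, "foo", (void **)&obj, sizeof(*obj), &err);
/* ... visit the members of *obj ... */
visit_end_struct(v, (void **)&obj);  /* same pointer as visit_start_struct() */

/* A virtual walk has no object to track, so both calls just pass NULL: */
visit_start_struct(v, "bar", NULL, 0, &err);
visit_end_struct(v, NULL);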

Backports commit 1158bb2a058fcdd0c8fc3e60dc77f7a57ddbb271 from qemu
2018-02-25 00:57:54 -05:00
Changlong Xie 2ca07642f1
qom: Fix comment typo
It's qom_unref, not qdef_unref.

Backports commit ada03a0e8423ef8950e30d216f56a9661a4070e2 from qemu
2018-02-25 00:46:15 -05:00
Markus Armbruster eeef227560
range: Replace internal representation of Range
Range represents a range as follows. Member @start is the inclusive
lower bound, member @end is the exclusive upper bound. Zero @end is
special: if @start is also zero, the range is empty, else @end is to
be interpreted as 2^64. No other empty ranges may occur.

The range [0,2^64-1] cannot be represented. If you try to create it
with range_set_bounds1(), you get the empty range instead. If you try
to create it with range_set_bounds() or range_extend(), assertions
fail. Before range_set_bounds() existed, the open-coded creation
usually got you the empty range instead. Open deathtrap.

Moreover, the code dealing with the janus-faced @end is too clever by
half.

Dumb this down to a more pedestrian representation: members @lob and
@upb are inclusive lower and upper bounds. The empty range is encoded
as @lob = 1, @upb = 0.
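
Concretely, the new layout amounts to something like this (a sketch;
only the field names and the empty-range encoding come from this
change):

struct Range {
    uint64_t lob;    /* inclusive lower bound */
    uint64_t upb;    /* inclusive upper bound */
};

/* Empty is encoded as lob = 1, upb = 0, so emptiness is simply lob > upb,
 * and the full range [0, UINT64_MAX] becomes representable. */
static inline bool range_is_empty(const Range *range)
{
    return range->lob > range->upb;
}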

Backports commit 6dd726a2bf1b800289d90a84d5fcb5ce7b78a8e1 from qemu
2018-02-25 00:44:36 -05:00
Markus Armbruster 8b2a0c4ece
range: Eliminate direct Range member access
Users of struct Range mess liberally with its members, which makes
refactoring hard. Create a set of methods, and convert all users to
call them instead of accessing members. The methods have carefully
worded contracts, and use assertions to check them.

Backports commit a0efbf16604770b9d805bcf210ec29942321134f from qemu
2018-02-25 00:39:43 -05:00
Alistair Francis fbb0645fb3
bitops: Add MAKE_64BIT_MASK macro
Add a macro that creates a 64bit value which has length number of ones
shifted across by the value of shift.
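
One way to write such a macro is the following (a sketch; the in-tree
definition may differ in detail, and length must be between 1 and 64
to avoid an undefined shift):

#define MAKE_64BIT_MASK(shift, length) \
    (((~0ULL) >> (64 - (length))) << (shift))

/* Example: MAKE_64BIT_MASK(4, 8) == 0x0000000000000ff0ULL */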

Backports commit ae2923b5c20a21c6457680330506a9c13873485c from qemu
2018-02-25 00:30:39 -05:00
Peter Maydell efc6cc2b83
memory: Assert that memory_region_init_rom_device() ops aren't NULL
It doesn't make sense to pass a NULL ops argument to
memory_region_init_rom_device(), because the effect will
be that if the guest tries to write to the memory region
then QEMU will segfault. Catch the bug earlier by sanity
checking the arguments to this function, and remove the
misleading documentation that suggests that passing NULL
might be sensible.

Backports commit 39e0b03dec518254fabd2acff29548d3f1d2b754 from qemu
2018-02-25 00:29:52 -05:00
Peter Maydell 334e951ec1
memory: Provide memory_region_init_rom()
Provide a new helper function memory_region_init_rom() for memory
regions which are read-only (and unlike those created by
memory_region_init_rom_device() don't have special behaviour
for writes). This has the same behaviour as calling
memory_region_init_ram() and then memory_region_set_readonly()
(which is what we do today in boards with pure ROMs) but is a
more easily discoverable API for the purpose.
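
In other words, a board that today open-codes the two-step sequence can
switch to the single helper call (a sketch; rom, owner and size are
placeholders):

/* Today: */
memory_region_init_ram(rom, owner, "boot-rom", size, &error_fatal);
memory_region_set_readonly(rom, true);

/* With this patch: */
memory_region_init_rom(rom, owner, "boot-rom", size, &error_fatal);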

Backports commit a1777f7f6462c66e1ee6e98f0d5c431bfe988aa5 from qemu
2018-02-25 00:28:17 -05:00
Alexey Kardashevskiy 7187d77cfa
memory: Add MemoryRegionIOMMUOps.notify_started/stopped callbacks
The IOMMU driver may change behavior depending on whether a notifier
client is present. In the case of POWER, this represents a change in
the visibility of the IOTLB; for other drivers such as intel-iommu and
future AMD-Vi emulation, notifier support is not yet enabled and this
provides the opportunity to flag that incompatibility.

Backports commit d22d8956b185c002b50a4d0883aff61f857347ef from qemu
2018-02-25 00:23:00 -05:00
Eric Blake c14d8226ab
qapi: Fix memleak in string visitors on int lists
Commit 7f8f9ef1 introduced the ability to store a list of
integers as a sorted list of ranges, but when merging ranges,
it leaks one or more ranges. It was also using range_get_last()
incorrectly within range_compare() (a range is a start/end pair,
but range_get_last() is for start/len pairs), and will also
mishandle a range ending in UINT64_MAX (remember, we document
that no range covers 2**64 bytes, but that ranges that end on
UINT64_MAX have end < begin).

The whole merge algorithm was rather complex, and included
unnecessary passes over data within glib functions, and enough
indirection to make it hard to easily plug the data leaks.
Since we are already hard-coding things to a list of ranges,
just rewrite the thing to open-code the traversal and
comparisons, by making the range_compare() helper function give
us an answer that is easier to use, at which point we avoid the
need to pass any callbacks to g_list_*(). Then by reusing
range_extend() instead of duplicating effort with range_merge(),
we cover the corner cases correctly.

Drop the now-unused range_merge() and ranges_can_merge().

Doing this lets test-string-{input,output}-visitor pass under
valgrind without leaks.

Backports commit db486cc334aafd3dbdaf107388e37fc3d6d3e171 from qemu
2018-02-25 00:20:34 -05:00
Eric Blake ef357d06bc
qapi: Simplify use of range.h
Calling our function g_list_insert_sorted_merged is a misnomer,
since we are NOT writing a glib function. Furthermore, we are
making every caller pass the same comparator function of
range_merge(): any caller that would try otherwise would break
in weird ways since our internal call to ranges_can_merge() is
hard-coded to operate only on ranges, rather than paying
attention to the caller's comparator.

Better is to fix things so that callers don't have to care about
our internal comparator, by picking a function name and updating
the parameter type away from a gratuitous use of void*, to make
it obvious that we are operating specifically on a list of ranges
and not a generic list. Plus, refactoring the code here will
make it easier to plug a memory leak in the next patch.

range_compare() is now internal only, and moves to the .c file.

Backports commit 7c47959d0cb05db43014141a156ada0b6d53a750 from qemu
2018-02-25 00:02:42 -05:00
Eric Blake 5e22c7e180
range: Create range.c for code that should not be inline
g_list_insert_sorted_merged() is rather large to be an inline
function; move it to its own file. range_merge() and
ranges_can_merge() can likewise move, as they are only used
internally. Also, it becomes obvious that the condition within
range_merge() is already satisfied by its caller, and that the
return value is not used.

The diffstat is misleading, because of the copyright boilerplate.

Backports commit fec0fc0a13ac7f1a1130433a6740cd850c3db34a from qemu
2018-02-24 23:59:13 -05:00
Eric Blake ebeb0e46f8
qapi: Fix crash on missing alternate member of QAPI struct
If a QAPI struct has a mandatory alternate member which is not
present on input, the input visitor reports an error for the
missing alternate without setting the discriminator, but the
cleanup code for the struct still tries to use the dealloc
visitor to clean up the alternate.

Commit dbf11922 changed visit_start_alternate to set *obj to NULL
when an error occurs, where it was previously left untouched.
Thus, before the patch, the dealloc visitor is blindly trying to
cleanup whatever branch corresponds to (*obj)->type == 0 (that is,
QTYPE_NONE, because *obj still pointed to zeroed memory), which
selects the default branch of the switch and sets an error, but
this second error is ignored by the way the dealloc visitor is
used; but after the patch, the attempt to switch dereferences NULL.

When cleaning up after a partial object parse, we specifically
check for !*obj after visit_start_struct() (see gen_visit_object());
doing the same for alternates fixes the crash. Enhance the testsuite
to give coverage for both missing struct and missing alternate
members.

Also add an abort - we expect visit_start_alternate() to either set an
error or to set (*obj)->type to a valid QType that corresponds to
actual user input, and QTYPE_NONE should never be reachable from valid
input. Had the abort() been in place earlier, we might have noticed
the dealloc visitor dereferencing bogus zeroed memory prior to when
commit dbf11922 forced our hand by setting *obj to NULL and causing a
fault.

Test case:

{'execute':'blockdev-add', 'arguments':{'options':{'driver':'raw'}}}

The choice of 'driver':'raw' selects a BlockdevOptionsGenericFormat
struct, which has a mandatory 'file':'BlockdevRef' in QAPI. Since
'file' is missing as a sibling of 'driver', this should report a
graceful error rather than fault. After this patch, we are back to:

{"error": {"class": "GenericError", "desc": "Parameter 'file' is missing"}}

Generated code in qapi-visit.c changes as:

|@@ -2444,6 +2444,9 @@ void visit_type_BlockdevRef(Visitor *v,
| if (err) {
| goto out;
| }
|+ if (!*obj) {
|+ goto out_obj;
|+ }
| switch ((*obj)->type) {
| case QTYPE_QDICT:
| visit_start_struct(v, name, NULL, 0, &err);
|@@ -2459,10 +2462,13 @@ void visit_type_BlockdevRef(Visitor *v,
| case QTYPE_QSTRING:
| visit_type_str(v, name, &(*obj)->u.reference, &err);
| break;
|+ case QTYPE_NONE:
|+ abort();
| default:
| error_setg(&err, QERR_INVALID_PARAMETER_TYPE, name ? name : "null",
| "BlockdevRef");
| }
|+out_obj:
| visit_end_alternate(v);

Backports commit 9b4e38fe6a35890bb1d995316d7be08de0b30ee5 from qemu
2018-02-24 23:53:29 -05:00
Aleksandar Markovic f95e0e9e98
target-mips: Add FCR31's FS bit definition
Add preprocessor definition of FCR31's FS bit, and update related
code for setting this bit.

Backports commit 77be419980114d75605811e1681115d0919cfa1a from qemu
2018-02-24 21:32:10 -05:00
Aleksandar Markovic 4a540f88de
target-mips: Implement FCR31's R/W bitmask and related functionalities
This patch implements read and write access rules for Mips floating
point control and status register (FCR31). The change can be divided
into following parts:

- Add fields that will keep FCR31's R/W bitmask in processor
definitions and processor float_status structure.

- Add appropriate value for FCR31's R/W bitmask for each supported
processor.

- Add function for setting snan_bit_is_one, and integrate it in
appropriate places.

- Modify handling of CTC1 (case 31) instruction to use FCR31's R/W
bitmask.

- Modify handling user mode executables for Mips, in relation to the
bit EF_MIPS_NAN2008 from ELF header, that is in turn related to
reading and writing to FCR31.

- Modify gdb behavior in relation to FCR31.

Backports commit 599bc5e89c46f95f86ccad0d747d041c89a28806 from qemu
2018-02-24 21:30:24 -05:00
Aleksandar Markovic 84b516d9db
target-mips: Add nan2008 flavor of <CEIL|CVT|FLOOR|ROUND|TRUNC>.<L|W>.<S|D>
New set of helpers for handling nan2008-style versions of instructions
<CEIL|CVT|FLOOR|ROUND|TRUNC>.<L|W>.<S|D>, for Mips R6.

All involved instructions have float operand and integer result. Their
core functionality is implemented via invocations of appropriate SoftFloat
functions. The problematic cases are when the operand is a NaN, and also
when the operand (float) is out of the range of the result.

Here one can distinguish three cases:

CASE MIPS-A: (FCR31.NAN2008 == 1)

1. Operand is a NaN, result should be 0;
2. Operand is larger than INT_MAX, result should be INT_MAX;
3. Operand is smaller than INT_MIN, result should be INT_MIN.

CASE MIPS-B: (FCR31.NAN2008 == 0)

1. Operand is a NaN, result should be INT_MAX;
2. Operand is larger than INT_MAX, result should be INT_MAX;
3. Operand is smaller than INT_MIN, result should be INT_MAX.

CASE SoftFloat:

1. Operand is a NaN, result is INT_MAX;
2. Operand is larger than INT_MAX, result is INT_MAX;
3. Operand is smaller than INT_MIN, result is INT_MIN.

Current implementation of <CEIL|CVT|FLOOR|ROUND|TRUNC>.<L|W>.<S|D>
implements case MIPS-B. This patch relates to case MIPS-A. For case
MIPS-A, only return value for NaN-operands should be corrected after
appropriate SoftFloat library function is called.
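
For illustration, the correction for a CVT.W.S-style helper amounts to
something like this (a sketch; the helper names and FCR31 plumbing in
the actual patch differ):

/* nan2008 (case MIPS-A): SoftFloat already yields INT_MAX/INT_MIN for
 * out-of-range inputs; only a NaN operand needs patching to 0. */
static uint32_t cvt_w_s_2008(float32 fst0, float_status *status)
{
    uint32_t wt2 = float32_to_int32(fst0, status);

    if (float32_is_any_nan(fst0)) {
        wt2 = 0;
    }
    return wt2;
}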

Related MSA instructions FTRUNC_S and FTINT_S already handle well
all cases, in the fashion similar to the code from this patch.

Backports commit 87552089b62fa229d2ff86906e4e779177fb5835 from qemu
2018-02-24 21:14:04 -05:00
Aleksandar Markovic a411a12170
target-mips: Add abs2008 flavor of <ABS|NEG>.<S|D>
Updated handling of instructions <ABS|NEG>.<S|D>. Note that legacy
(pre-abs2008) ABS and NEG instructions are arithmetic (and, therefore,
any NaN operand causes signaling invalid operation), while abs2008
ones are non-arithmetic, always and only changing the sign bit, even
for NaN-like operands. Details on these instructions are documented
in [1] p. 35 and 359.

Implementation-wise, abs2008 versions are implemented without helpers,
for simplicity and performance sake.
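
For illustration, the abs2008 behaviour boils down to pure sign-bit
manipulation on the raw encoding, which is why no helper is needed
(a sketch, not the TCG code from the patch):

/* abs2008/neg2008: touch only the sign bit, even for NaN-like operands. */
static inline uint32_t float32_abs2008(uint32_t bits)
{
    return bits & 0x7fffffffu;
}

static inline uint32_t float32_neg2008(uint32_t bits)
{
    return bits ^ 0x80000000u;
}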

[1] "MIPS Architecture For Programmers Volume II-A:
The MIPS64 Instruction Set Reference Manual",
Imagination Technologies LTD, Revision 6.04, November 13, 2015

Backports commit 6be77480052b1a71557081896e7080363a8a2f95 from qemu
2018-02-24 20:45:06 -05:00
Aleksandar Markovic ef9f33a345
target-mips: Activate IEEE 754-2008 signaling NaN bit meaning for MSA
Function msa_reset() is updated so that flag snan_bit_is_one is
properly set to 0.

By applying this patch, a number of incorrect MSA behaviors that
require IEEE 754-2008 compliance will be fixed. Those are behaviors
that (up to the moment of applying this patch) did not get the desired
functionality from SoftFloat library with respect to distinguishing
between quiet and signaling NaN, getting default NaN values (both
quiet and signaling), establishing if a floating point number is NaN
or not, etc.

Two examples:

* FMAX, FMIN will now correctly detect and propagate NaNs.
* FCLASS.D and FCLASS.S will now correctly detect NaN flavors.

Backports commit 40bd6dd456e61a36e454fb9dd2cc739b67c224cf from qemu
2018-02-24 20:41:48 -05:00
Aleksandar Markovic 3e9325f1e9
softfloat: Handle snan_bit_is_one == 0 in MIPS pickNaNMulAdd()
Only for Mips platform, and only for cases when snan_bit_is_one is 0,
correct the order of argument comparisons in pickNaNMulAdd().

For more info, see [1], page 53, section "3.5.3 NaN Propagation".

[1] "MIPS Architecture for Programmers Volume IV-j:
The MIPS32 SIMD Architecture Module",
Imagination Technologies LTD, Revision 1.12, February 3, 2016

Backports commit c27644f0e9659471e1c9355da5b667960d311937 from qemu
2018-02-24 20:40:11 -05:00
Aleksandar Markovic 33833b6605
softfloat: For Mips only, correct default NaN values
Only for Mips platform, and only for cases when snan_bit_is_one is 0,
correct default NaN values (in their 16-, 32-, and 64-bit flavors).

For more info, see [1], page 84, Table 6.3 "Value Supplied When a New
Quiet NaN Is Created", and [2], page 52, Table 3.7 "Default NaN
Encodings".

[1] "MIPS Architecture For Programmers Volume II-A:
The MIPS64 Instruction Set Reference Manual",
Imagination Technologies LTD, Revision 6.04, November 13, 2015

[2] "MIPS Architecture for Programmers Volume IV-j:
The MIPS32 SIMD Architecture Module",
Imagination Technologies LTD, Revision 1.12, February 3, 2016

Backports commit a7c04d545a97126c9df9d96623747d8613aaf7db from qemu
2018-02-24 20:38:23 -05:00
Aleksandar Markovic 33ee9429b2
softfloat: Clean code format in fpu/softfloat-specialize.h
fpu/softfloat-specialize.h is the most critical file in SoftFloat
library, since it handles numerous differences between platforms in
relation to floating point arithmetics. This patch makes the code
in this file more consistent format-wise, and hopefully easier to
debug and maintain.

Backports commit a59eaea64686c8966b7653303660f8c26f285c77 from qemu
2018-02-24 20:35:05 -05:00
Aleksandar Markovic 6eb4fa54f6
softfloat: Implement run-time-configurable meaning of signaling NaN bit
This patch modifies SoftFloat library so that it can be configured in
run-time in relation to the meaning of signaling NaN bit, while, at the
same time, strictly preserving its behavior on all existing platforms.

Background:

In floating-point calculations, there is a need for denoting undefined or
unrepresentable values. This is achieved by defining certain floating-point
numerical values to be NaNs (which stands for "not a number"). For additional
reasons, virtually all modern floating-point unit implementations use two
kinds of NaNs: quiet and signaling. The binary representations of these two
kinds of NaNs, as a rule, differ only in one bit (that bit is, traditionally,
the first bit of mantissa).

Up to 2008, standards for floating-point did not specify all details about
binary representation of NaNs. More specifically, the meaning of the bit
that is used for distinguishing between signaling and quiet NaNs was not
strictly prescribed. (IEEE 754-2008 was the first floating-point standard
that defined that meaning clearly, see [1], p. 35) As a result, different
platforms took different approaches, and that presented considerable
challenge for multi-platform emulators like QEMU.

Mips platform represents the most complex case among QEMU-supported
platforms regarding signaling NaN bit. Up to the Release 6 of Mips
architecture, "1" in signaling NaN bit denoted signaling NaN, which is
opposite to IEEE 754-2008 standard. From Release 6 on, Mips architecture
adopted IEEE standard prescription, and "0" denotes signaling NaN. On top of
that, Mips architecture for SIMD (also known as MSA, or vector instructions)
also specifies signaling bit in accordance to IEEE standard. MSA unit can be
implemented with both pre-Release 6 and Release 6 main processor units.

QEMU uses SoftFloat library to implement various floating-point-related
instructions on all platforms. The current QEMU implementation allows for
defining meaning of signaling NaN bit during build time, and is implemented
via preprocessor macro called SNAN_BIT_IS_ONE.

On the other hand, the change in this patch enables SoftFloat library to be
configured in run-time. This configuration is meant to occur during CPU
initialization, at the moment when it is definitely known what desired
behavior for particular CPU (or any additional FPUs) is.

The change is implemented so that it is consistent with existing
implementation of similar cases. This means that structure float_status is
used for passing the information about desired signaling NaN bit on each
invocation of SoftFloat functions. The additional field in float_status is
called snan_bit_is_one, which supersedes macro SNAN_BIT_IS_ONE.

IMPORTANT:

This change is not meant to create any change in emulator behavior or
functionality on any platform. It just provides the means for SoftFloat
library to be used in a more flexible way - in other words, it will just
prepare SoftFloat library for usage related to Mips platform and its
specifics regarding signaling bit meaning, which is done in some of
subsequent patches from this series.

Further break down of changes:

1) Added field snan_bit_is_one to the structure float_status, and
correspondent setter function set_snan_bit_is_one().

2) Constants <float16|float32|float64|floatx80|float128>_default_nan
(used both internally and externally) converted to functions
<float16|float32|float64|floatx80|float128>_default_nan(float_status*).
This is necessary since they are dependent on signaling bit meaning.
At the same time, for the sake of code cleanup and simplicity, constants
<floatx80|float128>_default_nan_<low|high> (used only internally within
SoftFloat library) are removed, as not needed.

3) Added a float_status* argument to SoftFloat library functions
XXX_is_quiet_nan(XXX a_), XXX_is_signaling_nan(XXX a_),
XXX_maybe_silence_nan(XXX a_). This argument must be present in
order to enable correct invocation of new version of functions
XXX_default_nan(). (XXX is <float16|float32|float64|floatx80|float128>
here)

4) Updated code for all platforms to reflect changes in SoftFloat library.
This change is twofold: it modifies existing SoftFloat library function
invocations, and it adds a call to set_snan_bit_is_one() during CPU
initialization, with arguments appropriate for each particular platform
(see the sketch below). It was established that all platforms zero
their main CPU data structures, so explicit set_snan_bit_is_one(0)
calls are not added, as they are not needed.
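
For illustration, items 1) and 4) combine into a per-CPU call of this
shape during FPU initialization (a sketch; the field path shown is
illustrative, not the literal patch):

/* Pre-Release-6 MIPS FPU: "1" in the signaling NaN bit means signaling. */
set_snan_bit_is_one(1, &env->active_fpu.fp_status);
/* Units following IEEE 754-2008 want snan_bit_is_one == 0, which the
 * zero-initialized float_status already provides, so no call is needed. */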

[1] "IEEE Standard for Floating-Point Arithmetic",
IEEE Computer Society, August 29, 2008.

Backports commit af39bc8c49224771ec0d38f1b693ea78e221d7bc from qemu
2018-02-24 20:27:12 -05:00
Alexey Kardashevskiy 096ca207af
memory: Add reporting of supported page sizes
Every IOMMU has some granularity which MemoryRegionIOMMUOps::translate
uses when translating, however this information is not available outside
the translate context for various checks.

This adds a get_min_page_size callback to MemoryRegionIOMMUOps and
a wrapper for it so IOMMU users (such as VFIO) can know the minimum
actual page size supported by an IOMMU.

As IOMMU MR represents a guest IOMMU, this uses TARGET_PAGE_SIZE
as fallback.
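
The wrapper has roughly this shape (a sketch based on the description
above, not the literal patch):

uint64_t memory_region_iommu_get_min_page_size(MemoryRegion *mr)
{
    assert(memory_region_is_iommu(mr));

    if (mr->iommu_ops && mr->iommu_ops->get_min_page_size) {
        return mr->iommu_ops->get_min_page_size(mr);
    }
    return TARGET_PAGE_SIZE;    /* fallback for a guest IOMMU */
}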

This removes vfio_container_granularity() and uses new helper in
memory_region_iommu_replay() when replaying IOMMU mappings on added
IOMMU memory region.

Backports the relevant parts of commit f682e9c244af7166225f4a50cc18ff296bb9d43e from qemu
2018-02-24 19:23:28 -05:00
Lluís Vilanova 2297527755
exec: [tcg] Track which vCPU is performing translation and execution
Information is tracked inside the TCGContext structure, and later used
by tracing events with the 'tcg' and 'vcpu' properties.

The 'cpu' field is used to check tracing of translation-time
events ("*_trans"). The 'tcg_env' field is used to pass it to
execution-time events ("*_exec").

Backports commit 7c2550432abe62f53e6df878ceba6ceaf71f0e7e from qemu
2018-02-24 19:21:39 -05:00
Eduardo Habkost 0f6513ef62
error: Remove unnecessary local_err variables
This patch simplifies code that uses a local_err variable just to
immediately use it for an error_propagate() call.
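
The pattern being removed is of this shape (a sketch; foo() is a
placeholder callee):

/* Before: local_err exists only to feed error_propagate(). */
Error *local_err = NULL;
foo(arg, &local_err);
error_propagate(errp, local_err);

/* After: hand the caller's errp straight to the callee. */
foo(arg, errp);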

Coccinelle patch used to perform the changes added to
scripts/coccinelle/remove_local_err.cocci.

Backports commit 6b62d961373e0327f2af8fb77d6d5d6308864180 from qemu
2018-02-24 19:12:25 -05:00
Peter Maydell 5ae787f895
target-arm: Provide hook to tell GICv3 about changes of security state
The GICv3 CPU interface needs to know when the CPU it is attached
to makes an exception level or mode transition that changes the
security state, because whether it is asserting IRQ or FIQ can change
depending on these things. Provide a mechanism for letting the GICv3
device register a hook to be called on such changes.

Backports commit bd7d00fc50c9960876dd194ebf0c88889b53e765 from qemu
2018-02-24 19:09:22 -05:00
Peter Maydell eec3a5f843
target-arm: Define new arm_is_el3_or_mon() function
The GICv3 system registers need to know if the CPU is AArch64
in EL3 or AArch32 in Monitor mode. This happens to be the first
part of the check for arm_is_secure(), so factor it out into a
new arm_is_el3_or_mon() function that the GIC can also use.
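
The factored-out helper is essentially (a sketch of the shape described
above; the exact field accesses may differ):

static inline bool arm_is_el3_or_mon(CPUARMState *env)
{
    if (arm_feature(env, ARM_FEATURE_EL3)) {
        if (is_a64(env) && extract32(env->pstate, 2, 2) == 3) {
            return true;    /* AArch64 EL3 */
        }
        if (!is_a64(env) &&
            (env->uncached_cpsr & CPSR_M) == ARM_CPU_MODE_MON) {
            return true;    /* AArch32 Monitor mode */
        }
    }
    return false;
}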

Backports commit 712058764da29b2908f6fbf56760ca4f15980709 from qemu
2018-02-24 19:04:27 -05:00
Peter Maydell f893dacef0
bitops.h: Implement half-shuffle and half-unshuffle ops
A half-shuffle operation takes a word with zeros in the high half:
0000 0000 0000 0000 ABCD EFGH IJKL MNOP
and spreads the bits out so they are in every other bit of the word:
0A0B 0C0D 0E0F 0G0H 0I0J 0K0L 0M0N 0O0P
A half-unshuffle performs the reverse operation.

Provide functions in bitops.h which implement these operations
for 32-bit and 64-bit inputs, and add tests for them.
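
One classic way to implement the 32-bit half-shuffle is the Hacker's
Delight bit-spreading sequence (a sketch; the in-tree version may
differ in detail):

static inline uint32_t half_shuffle32(uint32_t x)
{
    /* Spread the low 16 bits of x into the even bit positions. */
    x = ((x & 0xFF00u) << 8) | (x & 0x00FFu);
    x = ((x << 4) | x) & 0x0F0F0F0Fu;
    x = ((x << 2) | x) & 0x33333333u;
    x = ((x << 1) | x) & 0x55555555u;
    return x;
}

Half-unshuffle runs the same masking steps in the reverse direction to
gather the even bits back into the low half.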

Backports commit b355438de52d0782983bf4bdc47936189a0c988b from qemu
2018-02-24 19:02:36 -05:00
Bharata B Rao 851dec945d
qom: API to get instance_size of a type
Add an API object_type_get_size(const char *typename) that returns the
instance_size of the given typename.

Backports commit 3f97b53a682d2595747c926c00d78b9d406f1be0 from qemu
2018-02-24 19:00:16 -05:00
Thomas Huth aee5c93f58
configure: Enable -Werror for MinGW builds, too
MinGW seems to compile currently without warnings, so it should
be safe to enable -Werror now for this environment, too.

Backports commit e4650c81b3d15ba67236815defbb475c4bdf8690 from qemu
2018-02-24 18:56:05 -05:00
Eduardo Habkost b918dd95f3
target-i386: Consolidate calls of object_property_parse() in x86_cpu_parse_featurestr
Backports commit f6750e959a397dea988efd4e488e1ff813011065 from qemu
2018-02-24 18:53:55 -05:00
Igor Mammedov 800b28483b
target-i386: Move features logic that requires CPUState to realize time
Making x86_cpu_parse_featurestr() a pure converter of the legacy
feature string into global properties requires it to be called before
a CPU instance is created, so the parser shouldn't modify CPUState
directly or access it at all. Hence, move the current hack that
directly pokes into CPUState to set/unset +-feats from the parser to
the CPU's realize method.

Backports commit dc15c0517b010a9444a2c05794dae980f2a2cbd9 from qemu
2018-02-24 18:47:46 -05:00
Eduardo Habkost b9ca5c4d33
target-i386: Remove xlevel & hv-spinlocks option fixups
The "fixup will be removed in future versions" warnings are
present since QEMU 1.7.0, at least, so users should have fixed
their scripts and configurations, already.

In the case of libvirt users, libvirt doesn't use the "xlevel"
option, and already rejects HyperV spinlock retry count < 0xFFF.

Backports commit c19b85216b5d47d922ac010931d4c7b2d79b2f68 from qemu
2018-02-24 18:33:32 -05:00
Radim Krčmář 610a52e9c7
target-i386: Implement CPUID[0xB] (Extended Topology Enumeration)
I looked at a dozen Intel CPU that have this CPUID and all of them
always had Core offset as 1 (a wasted bit when hyperthreading is
disabled) and Package offset at least 4 (wasted bits at <= 4 cores).

QEMU uses more compact IDs and it doesn't make much sense to change it
now. I keep the SMT and Core sub-leaves even if there is just one
thread/core; it makes the code simpler and there should be no harm.

Backports commit 5232d00a041c8f3628b3532ef35d703a1f0dac19 from qemu
2018-02-24 18:31:14 -05:00
Eduardo Habkost 8991e8bf0b
target-i386: add Skylake-Client cpu model
Introduce Skylake-Client cpu model which inherits the features from
Broadwell and supports some additional features that are: MPX,
XSAVEC, and XGETBV1.

Backports commit f6f949e9295889fb272698aea763dcea77d616ce from qemu
2018-02-24 18:25:50 -05:00
Peter Maydell 9bdf310d49
target-arm: Don't permit ARMv8-only Neon insns on ARMv7
The Neon instructions VCVTA, VCVTM, VCVTN, VCVTP, VRINTA, VRINTM,
VRINTN, VRINTP, VRINTX, and VRINTZ were only introduced with ARMv8,
so they need a guard to make them UNDEF if the CPU only supports ARMv7.
(We got this right for all the other new-in-v8 insns, but forgot
it for these Neon 2-reg-misc ops.)

Backports commit fe8fcf3d642b4de1369841bf6acac13e0ec8770d from qemu
2018-02-24 18:20:00 -05:00
Peter Maydell a9fb399490
target-arm: Fix reset and migration of TTBCR(S)
Commit 6459b94c26dd666badb3 broke reset and migration of the AArch32
TTBCR(S) register if the guest used non-LPAE page tables. This is
because the AArch32 TTBCR register definition is marked as ARM_CP_ALIAS,
meaning that the AArch64 variant has to handle migration and reset.
Although AArch64 TCR_EL3 doesn't need to care about the mask and
base_mask fields, AArch32 may do so, and so we must use the special
TTBCR reset and raw write functions to ensure they are set correctly.

This doesn't affect TCR_EL2, because the AArch32 equivalent of that
is HTCR, which never uses the non-LPAE page table variant.

Backports commit 811595a2d4ab8c6354857a50ffd29fafce52a892 from qemu
2018-02-24 18:18:24 -05:00
Shannon Zhao 51c9e12605
target-arm: kvm64: set guest PMUv3 feature bit if supported
Check if kvm supports guest PMUv3. If so, set the corresponding feature
bit for vcpu.

Backports commit 5c0a3819f009639f67ce0453dff6ec7211bfee54 from qemu
2018-02-24 18:17:11 -05:00
Emilio G. Cota ae3e22a689
tb hash: hash phys_pc, pc, and flags with xxhash
For some workloads such as arm bootup, tb_phys_hash is performance-critical.
This is due to the high frequency of accesses to the hash table, originated
by (frequent) TLB flushes that wipe out the cpu-private tb_jmp_cache's.
More info:
https://lists.nongnu.org/archive/html/qemu-devel/2016-03/msg05098.html

To dig further into this I modified an arm image booting debian jessie to
immediately shut down after boot. Analysis revealed that quite a bit of time
is unnecessarily spent in tb_phys_hash: the cause is poor hashing that
results in very uneven loading of chains in the hash table's buckets;
the longest observed chain had ~550 elements.

The appended patch addresses this with two changes:

1) Use xxhash as the hash table's hash function. xxhash is a fast,
high-quality hashing function.

2) Feed the hashing function with not just tb_phys, but also pc and flags.

This improves performance over using just tb_phys for hashing, since that
resulted in some hash buckets having many TB's, while others getting very few;
with these changes, the longest observed chain on a single hash bucket is
brought down from ~550 to ~40.
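
In other words, the lookup side now computes roughly (a sketch;
tb_hash_func5() is the xxhash-derived mixer added by the next patch in
this series, and hash_size stands for the number of buckets):

uint32_t h = tb_hash_func5(phys_pc, pc, flags);
uint32_t bucket = h & (hash_size - 1);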

Tests show that the other element checked for in tb_find_physical,
cs_base, is always a match when tb_phys+pc+flags are a match,
so hashing cs_base is wasteful. It could be that this is an ARM-only
thing, though. UPDATE:
On Tue, Apr 05, 2016 at 08:41:43 -0700, Richard Henderson wrote:
> The cs_base field is only used by i386 (in 16-bit modes), and sparc (for a TB
> consisting of only a delay slot).
> It may well still turn out to be reasonable to ignore cs_base for hashing.

BTW, after this change the hash table should not be called "tb_hash_phys"
anymore; this is addressed later in this series.

This change gives consistent bootup time improvements. I tested two
host machines:
- Intel Xeon E5-2690: 11.6% less time
- Intel i7-4790K: 19.2% less time

Increasing the number of hash buckets yields further improvements. However,
using a larger, fixed number of buckets can degrade performance for other
workloads that do not translate as many blocks (600K+ for debian-jessie arm
bootup). This is dealt with later in this series.

Backports commit 42bd32287f3a18d823f2258b813824a39ed7c6d9 from qemu
2018-02-24 18:00:14 -05:00
Emilio G. Cota 9ef9de9cf8
exec: add tb_hash_func5, derived from xxhash
This will be used by upcoming changes for hashing the tb hash.

Add this into a separate file to include the copyright notice from
xxhash.

Backports commit dc8b295d05ec35a8c032f9abca421772347ba5d4 from qemu
2018-02-24 17:36:35 -05:00
Emilio G. Cota 8518f55df7
compiler.h: add QEMU_ALIGNED() to enforce struct alignment
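For reference, the macro is a thin wrapper over the compiler's aligned
attribute and is used like this (a sketch):

#define QEMU_ALIGNED(X) __attribute__((aligned(X)))

/* Example: force a structure onto its own 64-byte cache line. */
struct QEMU_ALIGNED(64) CacheLineCounter {
    uint64_t count;
};
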
Backports commit 911a4d2215b05267b16925503218f49d607c6b29 from qemu
2018-02-24 17:32:43 -05:00
Peter Maydell 48539e54da
target-i386: Move user-mode exception actions out of user-exec.c
The exception_action() function in user-exec.c is just a call to
cpu_loop_exit() for every target CPU except i386. Since this
function is only called if the target's handle_mmu_fault() hook has
indicated an MMU fault, and that hook is only called from the
handle_cpu_signal() code path, we can simply move the x86-specific
setup into that hook, which allows us to remove the TARGET_I386
ifdef from user-exec.c.

Of the actions that were done by the call to raise_interrupt_err():
* cpu_svm_check_intercept_param() is a no-op in user mode
* check_exception() is a no-op since double faults are impossible
for user-mode
* assignments to cs->exception_index and env->error_code are no-ops
* assigning to env->exception_next_eip is unnecessary because it
is not used unless env->exception_is_int is true
* cpu_loop_exit_restore() is equivalent to cpu_loop_exit() since
pc is 0
which leaves just setting env->exception_is_int as the action that
needs to be added to x86_cpu_handle_mmu_fault().

Backports commit 0c33682d5f29b0a4ae53bdec4c8e52e4fae37b34 from qemu
2018-02-24 17:27:08 -05:00
Peter Maydell fa2679ba96
target-i386: Add comment about do_interrupt_user() next_eip argument
Add a comment to do_interrupt_user() along the same lines as the
existing one for do_interrupt_all() noting that the next_eip
argument is not used unless is_int is true or intno is EXCP_SYSCALL.

Backports commit 33271823323483b4ede1ae99de83d33b25875402 from qemu
2018-02-24 17:26:18 -05:00
Peter Maydell d7dccff836
cpu-exec: Rename cpu_resume_from_signal() to cpu_loop_exit_noexc()
The function cpu_resume_from_signal() is now always called with a
NULL puc argument, and is rather misnamed since it is never called
from a signal handler. It is essentially forcing an exit to the
top level cpu loop but without raising any exception, so rename
it to cpu_loop_exit_noexc() and drop the useless unused argument.

Backports commit 6886b98036a8f8f5bce8b10756ce080084cef11b from qemu
2018-02-24 17:25:28 -05:00
Peter Maydell b2013255aa
user-exec: Push resume-from-signal code out to handle_cpu_signal()
Since the only caller of page_unprotect() which might cause it to
need to call cpu_resume_from_signal() is handle_cpu_signal() in
the user-mode code, push the longjump handling out to that function.

Since this is the only caller of cpu_resume_from_signal() which
passes a non-NULL puc argument, split the non-NULL handling into
a new cpu_exit_tb_from_sighandler() function. This allows us
to merge the softmmu and usermode implementations of the
cpu_resume_from_signal() function, which are now identical.

Backports commit f213e72f2356b77768b9cb73814a3b26ad5a0099 from qemu
2018-02-24 17:21:06 -05:00
Peter Maydell 37b7538d85
translate-all.c: Don't pass puc, locked to tb_invalidate_phys_page()
The user-mode-only function tb_invalidate_phys_page() is only
called from two places:
* page_unprotect(), which passes in a non-zero pc, a puc pointer
and the value 'true' for the locked argument
* page_set_flags(), which passes in a zero pc, a NULL puc pointer
and a 'false' locked argument

If the pc is non-zero then we may call cpu_resume_from_signal(),
which does a longjmp out of the calling code (and out of the
signal handler); this is to cover the case of a target CPU with
"precise self-modifying code" (currently only x86) executing
a store instruction which modifies code in the same TB as the
store itself. Rather than doing the longjump directly here,
return a flag to the caller which indicates whether the current
TB was modified, and move the longjump to page_unprotect.

Backports commit 75809229bbf28b371afce14921ff5be98ddc5faa from qemu
2018-02-24 17:11:30 -05:00
Paolo Bonzini 1db22b5889
Makefile: add dependency on scripts/create_config
Make sure that config-host.h and config-target.h are rebuilt whenever
there is a change in the scripts that generates them; add the dependency
to the pattern rule as suggested by Peter.

Backports commit 553350156d80c18d0127c742f47b7adbd642f3ef from qemu
2018-02-24 17:05:03 -05:00
Fam Zheng c17a3070ea
Makefile: Add a FORCE target
Backports commit d41d4da3c5d702b505d74265900a13fae2c8d0e0 from qemu
2018-02-24 17:03:51 -05:00
Peter Maydell 8d0faac1dc
qemu-common.h: Drop WORDS_ALIGNED define
The WORDS_ALIGNED #define is not used anywhere, and hasn't been since
2013 when commit 612d590ebc6cef rewrote the various ld<type>_<endian>_p
functions to not use it. Remove the #define and the comment describing it.
Also remove the line in the comment about TARGET_WORDS_ALIGNED, since
it has never actually existed.

Backports commit 0d5c21f2b3bf1e0b562a2c74e353d2e03f2f50ef from qemu
2018-02-24 17:01:55 -05:00
Stefan Weil 4470900f3b
configure: Use $(...) instead of deprecated `...`
This fixes these warnings from shellcheck:

^-- SC2006: Use $(..) instead of deprecated `..`

Backports commit 89138857619b2a023c32200e9af780792ccaa8c3 from qemu
2018-02-24 16:59:40 -05:00
Sergey Sorokin c05902eddd
target-arm: Fix TTBR selecting logic on AArch32 Stage 2 translation
Address size is 40-bit for the AArch32 stage 2 translation,
and t0sz can be negative (from -8 to 7),
so we need to adjust it to use the existing TTBR selecting logic.

Backports commit 6e99f762612827afeff54add2e4fc2c3b2657fed from qemu
2018-02-24 16:54:32 -05:00
Peter Maydell 806d72035e
target-arm: Don't try to set ESR IL bit in arm_cpu_do_interrupt_aarch64()
Remove some incorrect code from arm_cpu_do_interrupt_aarch64()
which attempts to set the IL bit in the syndrome register based
on the value of env->thumb. This is wrong in several ways:
* IL doesn't indicate Thumb-vs-ARM, it indicates instruction
length (which may be 16 or 32 for Thumb and is always 32 for ARM)
* not every syndrome format uses IL like this -- for some IL is
always set, and for some it is always clear
* the code is changing esr_el[new_el] even for interrupt entry,
which is not supposed to modify ESR_ELx at all

Delete the code, and instead rely on the syndrome value in
env->exception.syndrome having already been set up with the
correct value of IL.

Backports commit 78f1edb19fe11fa0c5d0bf484db59a384f455d3c from qemu
2018-02-24 16:49:53 -05:00
Peter Maydell dc8bf22d88
target-arm: Set IL bit in syndromes for insn abort, watchpoint, swstep
For some exception syndrome types, the IL bit should always be set.
This includes the instruction abort, watchpoint and software step
syndrome types; add the missing ARM_EL_IL bit to the syndrome
values returned by syn_insn_abort(), syn_swstep() and syn_watchpoint().

Backports commit 04ce861ea545477425ad9e045eec3f61c8a27df9 from qemu
2018-02-24 16:48:59 -05:00
Edgar E. Iglesias 8aee797956
target-arm: A64: Create Instruction Syndromes for Data Aborts
Add support for generating the ISS (Instruction Specific Syndrome) for
Data Abort exceptions taken from AArch64.
These syndromes are used by hypervisors for example to trap and emulate
memory accesses.

We save the decoded data out-of-band with the TBs at translation time.
When exceptions hit, the extra data attached to the TB is used to
recreate the state needed to encode instruction syndromes.
This avoids the need to emit moves with every load/store.

Based on a suggestion from Peter Maydell.

Backports commit aaa1f954d4cab243e3d5337a72bc6d104e1c4808 from qemu
2018-02-24 16:46:44 -05:00
Alistair Francis 25daa5363e
target-arm: Add the HSTR_EL2 register
Add the Hypervisor System Trap Register for EL2.

This register is used early in the Linux boot and without it the kernel
aborts with a "Synchronous Abort" error.

Backports commit 2a5a9abd4bc45e2f4c62c77e07aebe53608c6915 from qemu
2018-02-24 16:24:57 -05:00
Fam Zheng 495c39300c
rules.mak: Add COMMA constant
Using "," literal in $(call quiet-command, ...) arguments is awkward.
Add this constant to make it at least doable.

Backports commit 2f4e4dc237261c76734d8ae1d8e09d2983d2f1ca from qemu
2018-02-24 16:20:31 -05:00
Paolo Bonzini 8df5ad80b1
exec: hide mr->ram_addr from qemu_get_ram_ptr users
Let users of qemu_get_ram_ptr and qemu_ram_ptr_length pass in an
address that is relative to the MemoryRegion. This basically means
what address_space_translate returns.

Because the semantics of the second parameter change, rename the
function to qemu_map_ram_ptr.

Backports commit 0878d0e11ba8013dd759c6921cbf05ba6a41bd71 from qemu
2018-02-24 16:17:49 -05:00
Paolo Bonzini b2e1b34bcc
memory: split memory_region_from_host from qemu_ram_addr_from_host
Move the old qemu_ram_addr_from_host to memory_region_from_host and
make it return an offset within the region. For qemu_ram_addr_from_host
return the ram_addr_t directly, similar to what it was before
commit 1b5ec23 ("memory: return MemoryRegion from qemu_ram_addr_from_host",
2013-07-04).

Backports commit 07bdaa4196b51bc7ffa7c3f74e9e4a9dc8a7966a from qemu
2018-02-24 16:06:49 -05:00
Paolo Bonzini 918c626847
exec: remove ram_addr argument from qemu_ram_block_from_host
Of the two callers, one does not use it, and the other can compute
it itself based on the other output argument (offset) and the RAMBlock.

Backports commit f615f39616c4fd1a3a3b078af8d75bb4be6390de from qemu
2018-02-24 03:37:40 -05:00
Paolo Bonzini f26f1f123c
memory: remove qemu_get_ram_fd, qemu_set_ram_fd, qemu_ram_block_host_ptr
Remove direct uses of ram_addr_t and optimize memory_region_{get,set}_fd
now that a MemoryRegion knows its RAMBlock directly.

Backports commit 4ff87573df3606856a92c14eef3393a63d736d11 from qemu
2018-02-24 03:34:44 -05:00
Emilio G. Cota ab569f5cde
atomics: do not emit consume barrier for atomic_rcu_read
Currently we emit a consume-load in atomic_rcu_read. Because of
limitations in current compilers, this is overkill for non-Alpha hosts
and it is only useful to make Thread Sanitizer work.

This patch leaves the consume-load in atomic_rcu_read when
compiling with Thread Sanitizer enabled, and resorts to a
relaxed load + smp_read_barrier_depends otherwise.
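
In rough terms the dispatch looks like this (a sketch of the shape
only; the real macros in qemu/atomic.h are more involved, and
__SANITIZE_THREAD__ is GCC's Thread Sanitizer define):

#ifdef __SANITIZE_THREAD__
#define atomic_rcu_read(ptr)  __atomic_load_n(ptr, __ATOMIC_CONSUME)
#else
#define atomic_rcu_read(ptr)  ({                                    \
    typeof(*(ptr)) _val = __atomic_load_n(ptr, __ATOMIC_RELAXED);   \
    smp_read_barrier_depends();                                     \
    _val; })
#endif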

On an RMO host architecture, such as aarch64, the performance
improvement of this change is easily measurable. For instance,
qht-bench performs an atomic_rcu_read on every lookup. Performance
before and after applying this patch:

$ tests/qht-bench -d 5 -n 1
Before: 9.78 MT/s
After: 10.96 MT/s

Backports commit 15487aa132109891482f79d78a30d6cfd465a391 from qemu
2018-02-24 03:28:11 -05:00
Emilio G. Cota 87ef2a2c5f
atomics: emit an smp_read_barrier_depends() barrier only for Alpha and Thread Sanitizer
For correctness, smp_read_barrier_depends() is only required to
emit a barrier on Alpha hosts. However, we are currently emitting
a consume fence unconditionally, and most compilers currently treat
consume and acquire fences as equivalent.

Fix it by keeping the consume fence if we're compiling with Thread
Sanitizer, since this might help prevent false warnings. Otherwise,
only emit the barrier for Alpha hosts. Note that we still guarantee
that smp_read_barrier_depends() is a compiler barrier.

Backports commit c983895258a771f8a5e4a53950bfb7fd2216651c from qemu
2018-02-24 03:26:52 -05:00
Sergey Fedorov 3a9c5e7509
cpu-exec: Fix direct jump to TB spanning page
It is not safe to make a direct jump to a TB spanning two pages in
system emulation because the mapping for the second page can get changed
but we don't take care of direct jumps in this case.

However in user mode emulation, this is not the case because there's
only static address translation and TBs are always invalidated properly.

Backports commit c88c67e58b61618a904d2333ceebefc3c852d32e from qemu
2018-02-24 03:24:53 -05:00
Eduardo Habkost 9c04a28bd2
target-i386: Move TCG initialization to realize time
QOM instance_init functions are not supposed to have any side-effects,
as new objects may be created at any moment for querying property
information (see qmp_device_list_properties()).

Move TCG initialization to realize time so it won't be called when just
doing object_new() on a X86CPU subclass.

Backports commit 57f2453ab48a771b30aeced01b329ee85853bb7b from qemu
2018-02-24 03:23:09 -05:00
Eduardo Habkost fee2c27f2b
cpu: Eliminate cpudef_init(), cpudef_setup()
x86_cpudef_init() doesn't do anything anymore, so cpudef_init(),
cpudef_setup(), and x86_cpudef_init() can finally be removed.

Backports commit 3e2c0e062f0963a6b73b0cd1990fad79495463d9 from qemu
2018-02-24 03:20:46 -05:00
Eduardo Habkost 956e20ea6b
target-i386: Set constant model_id for qemu64/qemu32/athlon
Newer PC machines don't set hw_version, and older machines set
model-id on compat_props explicitly, so we don't need the
x86_cpudef_setup() code that sets model_id using
qemu_hw_version() anymore.

Backports commit 9cf2cc3d8237732946720d78bf9aec0064026ed8 from qemu
2018-02-24 03:18:11 -05:00
Eduardo Habkost aa3d46ef83
osdep: Move default qemu_hw_version() value to a macro
The macro will be used by code that will stop calling
qemu_hw_version() at runtime and just need a constant value.

Backports commit d494352c2f7818aeba184a8ef757569083740bb2 from qemu
2018-02-24 03:16:34 -05:00
Eduardo Habkost 923dcf1cb8
target-i386: Use xsave structs for ext_save_area
This doesn't introduce any change in the code, as the offsets and
struct sizes match what was present in the table. This can be
validated by the QEMU_BUILD_BUG_ON lines on target-i386/cpu.h,
which ensures the struct sizes and offsets match the existing
values in ext_save_area.

Backports commit ee1b09f695dcd8532f470e53297473bd3bc88718 from qemu
2018-02-24 03:13:16 -05:00
Lioncash 05963470a2
target-i386: Include log.h in smm_helper
Fixes a compilation error
2018-02-24 03:06:07 -05:00
Eduardo Habkost 128f7c078a
target-i386: Define structs for layout of xsave area
Add structs that define the layout of the xsave areas used by
Intel processors. Add some QEMU_BUILD_BUG_ON lines to ensure the
structs match the XSAVE_* macros in target-i386/kvm.c and the
offsets and sizes at target-i386/cpu.c:ext_save_areas.

Backports commit b503717d28e8f7eff39bf38624e6cf42687d951a from qemu
2018-02-24 03:04:31 -05:00
Paolo Bonzini 77305ce4ee
memory: remove unnecessary masking of MemoryRegion ram_addr
mr->ram_block->offset is already aligned to both host and target size
(see qemu_ram_alloc_internal). Remove further masking as it is
unnecessary.

Backports commit e4e697940dff612b789b0858270c20a8b680f78d from qemu
2018-02-24 03:01:34 -05:00
Fam Zheng 74962feee1
memory: Drop FlatRange.romd_mode
Its value is always set to mr->romd_mode, so the removed comparisons are
fully superseded by "a->mr == b->mr".

Backports commit 5b5660adf1fdb61db14ec681b10463b8cba633f1 from qemu
2018-02-24 02:57:29 -05:00
Fam Zheng fb8135cd0d
memory: Remove code for mr->may_overlap
The collision check does nothing and hasn't been used. Remove the
variable together with related code.

Backports commit b61359781958759317ee6fd1a45b59be0b7dbbe1 from qemu
2018-02-24 02:55:25 -05:00
Gonglei feff56cc11
memory: drop find_ram_block()
On the one hand, we already have qemu_get_ram_block(), whose function
is similar. On the other hand, we can use mr->ram_block directly, so
searching for the RAMBlock by ram_addr is just a waste.

Backports commit fa53a0e53efdc7002497ea4a76aacf6cceb170ef from qemu
2018-02-24 02:52:20 -05:00
Paolo Bonzini 9bb67a3f58
hw: clean up hw/hw.h includes
Include qom/object.h and exec/memory.h instead of exec/ioport.h;
exec/ioport.h was almost everywhere required only for those two
includes, not for the content of the header itself.

Remove block/aio.h, everybody is already including it through
another path.

With this change, include/hw/hw.h is freed from qemu-common.h.

Backports commit df43d49cb8708b9c88a20afe0d1a3089b550a5b8 from qemu
2018-02-24 02:46:41 -05:00
Paolo Bonzini d0d3712417
hw: remove pio_addr_t
pio_addr_t is almost unused, because these days I/O ports are simply
accessed through the address space. cpu_{in,out}[bwl] themselves are
almost unused; monitor.c and xen-hvm.c could use address_space_read/write
directly, since they have an integer size at hand. This leaves qtest as
the only user of those functions.

On the other hand even portio_* functions use this type; the only
interesting use of pio_addr_t thus is include/hw/sysbus.h. I guess I
could move it there, but I don't see much benefit in that either. Using
uint32_t is enough and avoids the need to include ioport.h everywhere.

Backports commit 89a80e7400f7225d9401b35ef32454b4ab29dc67 from qemu
2018-02-24 02:43:16 -05:00
Paolo Bonzini 9485b7c2e1
cpu: move exec-all.h inclusion out of cpu.h
exec-all.h contains TCG-specific definitions. It is not needed outside
TCG-specific files such as translate.c, exec.c or *helper.c.

One generic function had snuck into include/exec/exec-all.h; move it to
include/qom/cpu.h.

Backports commit 63c915526d6a54a95919ebece83fa9ca631b2508 from qemu
2018-02-24 02:39:08 -05:00
Paolo Bonzini 58693409ea
exec: extract exec/tb-context.h
TCG backends do not need most of exec-all.h; extract what they actually
need to a separate file or move it directly to tcg.h. The next patch
will stop including exec-all.h from everywhere.

Backports commit 00f6da6a1a5d1ce085334eccbb50ec899ceed513 from qemu
2018-02-24 02:09:58 -05:00
Paolo Bonzini f9b9d0ba0f
hw: explicitly include qemu/log.h
Move the inclusion out of hw/hw.h, most files do not need it.

Backports commit 03dd024ff57733a55cd2e455f361d053c81b1b29 from qemu
2018-02-24 02:00:45 -05:00
Paolo Bonzini adf97a4d59
mips: move CP0 functions out of cpu.h
These are here for historical reasons: they are needed from both gdbstub.c
and op_helper.c, and the latter was compiled with fixed AREG0. It is
not needed anymore, so uninline them.

Backports commit e6623d88f44aae9e9c78276c0cb7bd352283d50a from qemu
2018-02-24 01:57:30 -05:00
Paolo Bonzini 058624b9e4
arm: move arm_log_exception into .c file
Avoid need for qemu/log.h inclusion, and make the function static too.

Backports commit 27a7ea8a1f351578ce869b41ba1ba662c063fd62 from qemu
2018-02-24 01:52:55 -05:00
Paolo Bonzini 37f26922dd
qemu-common: push cpu.h inclusion out of qemu-common.h
Backports commit 33c11879fd422b759483ed25fef133ea900ea8d7 from qemu
2018-02-24 01:50:56 -05:00
Paolo Bonzini e84da64a2b
qemu-common: stop including qemu/bswap.h from qemu-common.h
Move it to the actual users. There are still a few includes of
qemu/bswap.h in headers; removing them is left for future work.

Backports commit 58369e22cf971448411bfbc8c894b2addebe2111 from qemu
2018-02-24 01:06:03 -05:00
Paolo Bonzini 78fd1aab94
cpu: move endian-dependent load/store functions to cpu-all.h
Disentangle cpu-common.h and memory.h from NEED_CPU_H. Prototypes are
not defined for !NEED_CPU_H, so remove them from poison.h too. Only
macros need poisoning.

Backports commit a7d6039cb35592683ecc56d2b37817da2d2f8b00 from qemu
2018-02-24 01:04:26 -05:00
Paolo Bonzini 9c0e31ed3a
target-sparc: make cpu-qom.h not target specific
Make SPARCCPU an opaque type within cpu-qom.h, and move all definitions
of private methods, as well as all type definitions that require knowledge
of the layout to cpu.h. This helps making files independent of NEED_CPU_H
if they only need to pass around CPU pointers.

Backports commit d61d1b20610e4655d7846e4cb43d22188e935f5f from qemu
2018-02-24 01:00:56 -05:00
Paolo Bonzini 01bd1c1a73
target-mips: make cpu-qom.h not target specific
Make MIPSCPU an opaque type within cpu-qom.h, and move all definitions of
private methods, as well as all type definitions that require knowledge
of the layout to cpu.h. This helps making files independent of NEED_CPU_H
if they only need to pass around CPU pointers.

Backports commit 416bf936864f16caad6993b9ebd452fb34f801bd from qemu
2018-02-24 00:59:03 -05:00
Paolo Bonzini 27ebc27beb
target-m68k: make cpu-qom.h not target specific
Make M68KCPU an opaque type within cpu-qom.h, and move all definitions of
private methods, as well as all type definitions that require knowledge
of the layout to cpu.h. This helps making files independent of NEED_CPU_H
if they only need to pass around CPU pointers.

Backports commit a836b8fa00fa1032ccd234a71b33943627d211ea from qemu
2018-02-24 00:56:58 -05:00
Paolo Bonzini 2f4ae94b5c
target-i386: make cpu-qom.h not target specific
Make X86CPU an opaque type within cpu-qom.h, and move all definitions of
private methods, as well as all type definitions that require knowledge
of the layout to cpu.h. This helps making files independent of NEED_CPU_H
if they only need to pass around CPU pointers.

Backports commit 4da6f8d954429c0cd1471d25cb9dbe909607374e from qemu
2018-02-24 00:55:22 -05:00
Lioncash 791413630e
target-arm: make cpu-qom.h not target specific
Make ARMCPU an opaque type within cpu-qom.h, and move all definitions of
private methods, as well as all type definitions that require knowledge
of the layout to cpu.h. This helps making files independent of NEED_CPU_H
if they only need to pass around CPU pointers.

Backports commit 74e755647c1598a6845df1ee4f8b96d01afd96e7 from qemu
2018-02-24 00:48:59 -05:00
Paolo Bonzini fee6dcb22a
include: move CPU-related definitions out of qemu-common.h
Backports commit 4b4629d9d26fd0e100d9be526367a96aa35b541d from qemu
2018-02-24 00:33:49 -05:00
Wei Jiangang 7cf135457a
accel: make configure_accelerator return void
Returning the negated value of accel_initialised is meaningless,
and the caller vl doesn't check it.

Backports commit bdc3f61dec2f9c227235bb5f677a0272e1184c82 from qemu
2018-02-24 00:31:28 -05:00
Sergey Fedorov eab60b7c77
cpu-exec: Clean up 'interrupt_request' reloading in cpu_handle_interrupt()
Backports commit 8b1fe3f439eaa2f0a6ee7737942bb6c405725867 from qemu
2018-02-24 00:27:05 -05:00
Sergey Fedorov b4b7b88f69
cpu-exec: Remove unused 'x86_cpu' and 'env' from cpu_exec()
Backports commit ba048a4ae15ba0f70c6dcb12ee05db120408de78 from qemu
2018-02-24 00:16:40 -05:00
Sergey Fedorov aefb8935a9
cpu-exec: Move TB execution stuff out of cpu_exec()
Simplify cpu_exec() by extracting TB execution code outside of
cpu_exec() into a new static inline function cpu_loop_exec_tb().

Backports commit 928de9ee14b0b63ee9f9275732ed3e1c8b5f4790 from qemu
2018-02-24 00:15:24 -05:00
Sergey Fedorov d4ef96abf2
cpu-exec: Move interrupt handling out of cpu_exec()
Simplify cpu_exec() by extracting interrupt handling code outside of
cpu_exec() into a new static inline function cpu_handle_interrupt().

Backports commit c385e6e49763c6dd5dbbd90fadde95d986f8bd38 from qemu
2018-02-24 00:09:06 -05:00
Sergey Fedorov c1b52a4387
cpu-exec: Move exception handling out of cpu_exec()
Simplify cpu_exec() by extracting exception handling code out of
cpu_exec() into a new static inline function cpu_handle_exception().
Also make cpu_handle_debug_exception() inline as it is used only once.

Backports commit ea284766ec6b9f1712369249566b4c372f3cec8b from qemu
2018-02-24 00:03:37 -05:00
Sergey Fedorov fc3d135dac
cpu-exec: Move halt handling out of cpu_exec()
Simplify cpu_exec() by extracting CPU halt state handling code out of
cpu_exec() into a new static inline function cpu_handle_halt().

Backports commit 8b2d34e997371c9729a0f41e3cc624d4300bbe78 from qemu
2018-02-23 23:53:20 -05:00
Lioncash 88d00a75ca
cpu-exec: move cpu_exec to the bottom of the file
Remove forward declarations
2018-02-23 23:50:28 -05:00
Sergey Fedorov 0088ca994f
cpu-exec: Remove relic orphaned comment
This comment should have been deleted by commit 0ac087f1f3ae ("removed
unused code") but somehow it is still here. There's no point to keep it.

Backports commit c6f0d9f84c43ae973270df1a77482466558ee487 from qemu
2018-02-23 23:47:05 -05:00
Sergey Fedorov 1a768018c2
tcg: Remove needless CPUState::current_tb
This field was used for telling cpu_interrupt() to unlink a chain of TBs
being executed back when it worked that way. Now cpu_interrupt() doesn't
do this anymore, so we don't need this field anymore.

Backports commit 3213525f8ab48742db09dab18cb9ae6f36a6c921 from qemu
2018-02-23 23:45:42 -05:00
Sergey Fedorov 73c75b4cf7
cpu-exec: Move TB chaining into tb_find_fast()
Move tb_add_jump() call and surrounding code from cpu_exec() into
tb_find_fast(). That simplifies cpu_exec() a little by hiding the direct
chaining optimization details into tb_find_fast(). It also allows to
move tb_lock()/tb_unlock() pair into tb_find_fast(), putting it closer
to tb_find_slow() which also manipulates the lock.

Backports commit a0522c7a55cc8ac76d82884cf8e52f76daa664cc from qemu
2018-02-23 23:38:57 -05:00
Sergey Fedorov ba9a237586
tcg: Rework tb_invalidated_flag
'tb_invalidated_flag' was meant to catch two events:
* some TB has been invalidated by tb_phys_invalidate();
* the whole translation buffer has been flushed by tb_flush().

Then it was checked:
* in cpu_exec() to ensure that the last executed TB can be safely
linked to directly call the next one;
* in cpu_exec_nocache() to decide if the original TB should be provided
for further possible invalidation along with the temporarily
generated TB.

It is always safe to patch an invalidated TB since it is not going to be
used anyway. It is also safe to call tb_phys_invalidate() for an already
invalidated TB. Thus, setting this flag in tb_phys_invalidate() is
simply unnecessary. Moreover, it can prevent perfectly valid linking of
TBs whenever any arbitrary TB has been invalidated. So just don't touch
it in tb_phys_invalidate().

If this flag is only used to catch whether tb_flush() has been called
then rename it to 'tb_flushed'. Declare it as 'bool' and stick to using
only 'true' and 'false' to set its value. Also, instead of setting it in
tb_gen_code(), just after tb_flush() has been called, do it right inside
of tb_flush().

In cpu_exec(), this flag is used to track whether tb_flush() has been
called and has made 'next_tb' (a reference to the last executed TB)
invalid for linking it to directly call the next TB. tb_flush() can be
called during the CPU execution loop from tb_gen_code(), during TB
execution, or by another thread while 'tb_lock' is released. Catch a
translation buffer flush reliably by resetting this flag once before the
first TB lookup and each time we find it set before trying to add a
direct jump. Don't touch it in tb_find_physical().

Each vCPU has its own execution loop in multithreaded mode and thus
should have its own copy of the flag, so it can reset it along with its
own 'next_tb' without affecting any other vCPU execution thread. So make
this flag per-vCPU and move it to CPUState.

In cpu_exec_nocache(), we only need to check whether tb_flush() has been
called from the tb_gen_code() that cpu_exec_nocache() itself invokes. To
do this reliably, preserve the old value of the flag, reset it before
calling tb_gen_code(), check it afterwards, and combine the saved value
back into the flag.

This patch is based on the patch "tcg: move tb_invalidated_flag to
CPUState" from Paolo Bonzini <pbonzini@redhat.com>.

Backports commit 6f789be56d3f38e9214dafcfab3bf9be7191f370 from qemu
2018-02-23 23:34:51 -05:00
Sergey Fedorov c9700af2bd
tcg: Clean up from 'next_tb'
The value returned from tcg_qemu_tb_exec() is the value passed to the
corresponding tcg_gen_exit_tb() at translation time of the last TB
attempted to execute. It is a little confusing to store it in a variable
named 'next_tb'. In fact, it is a combination of 4-byte aligned pointer
and additional information in its two least significant bits. Break it
down right away into two variables named 'last_tb' and 'tb_exit' which
are a pointer to the last TB attempted to execute and the TB exit
reason, correspondingly. This simplifies the code and improves its
readability.

Correct a misleading documentation comment for tcg_qemu_tb_exec() and
fix logging in cpu_tb_exec(). Also rename a misleading 'next_tb' in
another couple of places.

Backports commit 819af24b9c1e95e6576f1cefd32f4d6bf56dfa56 from qemu
2018-02-23 23:29:04 -05:00
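To illustrate the decomposition described above, a minimal sketch (the
names and TB_EXIT_MASK follow the commit text; treat it as illustrative
rather than the exact hunk):

    uintptr_t ret = tcg_qemu_tb_exec(env, tb->tc_ptr);
    /* the two least significant bits encode the exit reason,
       the rest is a pointer to the last TB attempted to execute */
    TranslationBlock *last_tb = (TranslationBlock *)(ret & ~TB_EXIT_MASK);
    int tb_exit = ret & TB_EXIT_MASK;
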
Paolo Bonzini 66faf3b5df
tcg: code_bitmap and code_write_count are not used by user-mode emulation
Backports commit 6fad459c91e8a1dedbb6681d3f57ede5222a225c from qemu
2018-02-23 23:17:37 -05:00
Sergey Fedorov ffdc9d6323
tcg: Allow goto_tb to any target PC in user mode
In user mode, there's only a static address translation, TBs are always
invalidated properly and direct jumps are reset when mapping change.
Thus the destination address is always valid for direct jumps and
there's no need to restrict it to the pages the TB resides in.

Backports commit 90aa39a1cc4837360889f0e033ca25cc82100308 from qemu
2018-02-23 23:12:14 -05:00
Sergey Fedorov 73c59faad5
tcg: Clean up direct block chaining safety checks
We don't take care of direct jumps when address mapping changes. Thus we
must be sure to generate direct jumps so that they always remain valid
even if the address mapping changes. Luckily, we only allow a TB to be
executed if it was generated from the pages which match the current
mapping.

Document tcg_gen_goto_tb() declaration and note the reason for
destination PC limitations.

Some targets with variable-length instructions allow a TB to straddle a
page boundary. However, we make sure that both of the TB's pages match
the current address mapping when looking up TBs, so it is safe to do
direct jumps into both pages. Correct the checks for some of those
targets.

Given that, we can safely patch a TB which spans two pages. Remove the
unnecessary check in cpu_exec() and allow such TBs to be patched.

Backports commit 5b053a4a28278bca606eeff7d1c0730df1b047e9 from qemu
2018-02-23 22:26:00 -05:00
Sergey Fedorov 39d262f0d2
tcg: Clean up tb_jmp_unlink()
Unify the code of this function with tb_jmp_remove_from_list(). Making
these functions similar improves their readability. Also this could be a
step towards making this function thread-safe.

Backports commit f9c5b66f487a04d3747dc6997b1503f9258df945 from qemu
2018-02-23 21:40:07 -05:00
Lioncash 68272af618
translate-all: Remove unused variable in size_code_gen_buffer
Also eliminates the unused parameter
2018-02-23 21:38:34 -05:00
Sergey Fedorov c530eb06a9
tcg: Extract removing of jumps to TB from tb_phys_invalidate()
Move the code for removing jumps to a TB out of tb_phys_invalidate() to
a separate static inline function tb_jmp_unlink(). This simplifies
tb_phys_invalidate() and improves code structure.

Backports commit 89bba496322d4cf996d42cdd4bb0912231656c3d from qemu
2018-02-23 21:36:29 -05:00
Sergey Fedorov 0d2e91518b
tcg: Rename tb_jmp_remove() to tb_remove_from_jmp_list()
tb_jmp_remove() was only used to remove the TB from a list of all TBs
jumping to the same TB, which is the n-th jump destination of the given TB.
Put a comment briefly describing the function behavior and rename it to
better reflect its purpose.

Backports commit 133626783aa5a1bf86332fa3e6f7b8efe005f924 from qemu
2018-02-23 21:34:01 -05:00
Sergey Fedorov d60af028c5
tcg: Clarify thread safety check in tb_add_jump()
The check is to make sure that another thread hasn't already done the
same while we were outside of tb_lock. Mention this in a comment.

Backports commit 9962c478b153a18fe88a6509fe58cd178aff8abc from qemu
2018-02-23 21:32:47 -05:00
Sergey Fedorov e93f68a755
tcg: Init TB's direct jumps before making it visible
Initialize TB's direct jump list data fields and reset the jumps before
tb_link_page() puts it into the physical hash table and the physical
page list. So TB is completely initialized before it becomes visible.

This is pure rearrangement of code to a more suitable place, though it
could be a preparation for relaxing the locking scheme in future.

Backports commit 901bc3deb43bf37c85e43955905d003be7ae5fa5 from qemu
2018-02-23 21:31:36 -05:00
Sergey Fedorov 87f2bb42d4
tcg: Rearrange tb_link_page() to avoid forward declaration
Backports commit e90d96b158665a684ab89b4f002838034b5fafc8 from qemu
2018-02-23 21:28:20 -05:00
Sergey Fedorov fbc0a1105f
tcg: Use uintptr_t type for jmp_list_{next|first} fields of TB
These fields do not contain pure pointers to a TranslationBlock
structure. So uintptr_t is the most appropriate type for them.
Also put some asserts to assure that the two least significant bits of
the pointer are always zero before assigning it to jmp_list_first.

Backports commit c37e6d7e3589ecb96914faa21025ad7ba6654aea from qemu
2018-02-23 21:28:19 -05:00
Sergey Fedorov e60c24cecf
tcg: Clean up direct block chaining data fields
Briefly describe in a comment how direct block chaining is done. It
should help in understanding the following data fields.

Rename some fields in TranslationBlock and TCGContext structures to
better reflect their purpose (dropping excessive 'tb_' prefix in
TranslationBlock but keeping it in TCGContext):
tb_next_offset => jmp_reset_offset
tb_jmp_offset => jmp_insn_offset
tb_next => jmp_target_addr
jmp_next => jmp_list_next
jmp_first => jmp_list_first

Avoid using a magic constant as an invalid offset which is used to
indicate that there's no n-th jump generated.

Backports commit f309101c26b59641fc1aa8fb2a98a5441cdaea03 from qemu
2018-02-23 21:28:19 -05:00
Richard Henderson bb0b055a99
translate-all: Adjust 256mb testing for mips64
Make sure we preserve the high 32-bits when masking for mips64.

Backports commit 7ba6a512ae439c98c0c1f0f4348c079d90f9dd9d from qemu
2018-02-23 21:28:19 -05:00
Emilio G. Cota de17843702
translate-all: add missing munmap of the code_gen guard page for MIPS
Backports commit 8bdf4997823126a39bd4c99e4b2283b02cc7865f from qemu
2018-02-23 21:28:19 -05:00
Emilio G. Cota 9a2b02b241
translate-all: remove redundant setting of tcg_ctx.code_gen_buffer_size
The setting of tcg_ctx.code_gen_buffer_size is done by the only caller of
size_code_gen_buffer(), which is code_gen_alloc():

$ git grep size_code_gen_buffer
translate-all.c:static inline size_t size_code_gen_buffer(size_t tb_size)
translate-all.c: tcg_ctx.code_gen_buffer_size = size_code_gen_buffer(tb_size);

Backports commit 835154b6e2200460f04719d0028716a37c178368 from qemu
2018-02-23 21:28:19 -05:00
Sergey Fedorov c5b234ed1f
tcg: Note requirement on atomic direct jump patching
Backports commit 10b4f4855537dd421e193a7d0416513116370558 from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov 87c3382dc8
tcg/mips: Make direct jump patching thread-safe
Ensure direct jump patching in MIPS is atomic by using
atomic_read()/atomic_set() for code patching.

Backports commit c82460a560176ef69c2f0662bd280612e274db96 from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov 7538001da9
tcg/sparc: Make direct jump patching thread-safe
Ensure direct jump patching in SPARC is atomic by using
atomic_read()/atomic_set() for code patching.

Backports commit 84f79fb7c6e857edc807e4a251338243ce0cbac3 from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov a45f8cb49d
tcg/aarch64: Make direct jump patching thread-safe
Ensure direct jump patching in AArch64 is atomic by using
atomic_read()/atomic_set() for code patching.

Backports commit 9e269112953be4d670cb0d25042bd6546fcf3e45 from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov 52e2972300
tcg/arm: Make direct jump patching thread-safe
Ensure direct jump patching in ARM is atomic by using
atomic_read()/atomic_set() for code patching.

Backports commit 7d14e0e2d661479985197203589c38840e1066df from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov 57359fbe6c
tcg/s390: Make direct jump patching thread-safe
Ensure direct jump patching in s390 is atomic by:
* naturally aligning the location of the direct jump address;
* using atomic_read()/atomic_set() for code patching.

Backports commit ed3d51ecd7fe248d3959e469d53890ac9ffe0cd2 from qemu
2018-02-23 21:28:18 -05:00
Sergey Fedorov 5eb2d6618f
tcg/i386: Make direct jump patching thread-safe
Ensure direct jump patching in i386 is atomic by:
* naturally aligning the location of the direct jump address;
* using atomic_read()/atomic_set() for code patching.

Backports commit 0d07abf05e98903c7faf204a9a90f7d45b7554dc from qemu
2018-02-23 21:28:17 -05:00
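For illustration, a sketch of what atomic patching of the 4-byte relative
jump operand looks like on i386/x86-64 (the helper name and the exact
expression are an assumption based on the commit description):

    static inline void tb_set_jmp_target1(uintptr_t jmp_addr, uintptr_t addr)
    {
        /* patch the 32-bit relative operand of the jump in a single atomic store */
        atomic_set((int32_t *)jmp_addr, addr - (jmp_addr + 4));
        /* no explicit icache flush is needed on x86 */
    }
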
Lioncash fffa27d269
osdep: MSVC-compatible alignment macros 2018-02-23 21:28:17 -05:00
Sergey Fedorov 3456f0879e
include/qemu/osdep.h: Add macros for pointer alignment
These macros provide a convenient way to n-byte align pointers up and
down and check if a pointer is n-byte aligned.

Backports commit 6b587d3cda48e7ba26de8d30bf0d8a7063970715 from qemu
2018-02-23 21:28:17 -05:00
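A sketch of what such pointer-alignment helpers look like (modeled on
QEMU's QEMU_ALIGN_* macros; the exact names and definitions are in the
backported commit):

    /* round n down/up to a multiple of m */
    #define QEMU_ALIGN_DOWN(n, m)     ((n) / (m) * (m))
    #define QEMU_ALIGN_UP(n, m)       QEMU_ALIGN_DOWN((n) + (m) - 1, (m))

    /* n-byte align pointer down/up */
    #define QEMU_ALIGN_PTR_DOWN(p, n) ((typeof(p))QEMU_ALIGN_DOWN((uintptr_t)(p), (n)))
    #define QEMU_ALIGN_PTR_UP(p, n)   ((typeof(p))QEMU_ALIGN_UP((uintptr_t)(p), (n)))

    /* check whether pointer p is n-byte aligned */
    #define QEMU_PTR_IS_ALIGNED(p, n) (((uintptr_t)(p) % (n)) == 0)
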
Sergey Fedorov 47eac70cb9
include/qemu/osdep.h: Add a macro to check for alignment
Backports commit 18a60a76147569ca9e11b0607e50ce4012fe1aaa from qemu
2018-02-23 21:28:17 -05:00
Emilio G. Cota 170f6e0b3b
tb: consistently use uint32_t for tb->flags
We are inconsistent with the type of tb->flags: usage varies loosely
between int and uint64_t. Settle to uint32_t everywhere, which is
superior to both: at least one target (aarch64) uses the most significant
bit in the u32, and uint64_t is wasteful.

Compile-tested for all targets.

Backports commit 89fee74a0f066dfd73830a7b5fa137e87888c870 from qemu
2018-02-23 21:28:11 -05:00
Peter Maydell fe2000aa32
target-arm: Avoid unnecessary TLB flush on TCR_EL2, TCR_EL3 writes
The TCR_EL2 and TCR_EL3 regdefs were incorrectly using the
vmsa_tcr_el1_write function for writes. Since these registers don't
have the A1 bit that TCR_EL1 does, we don't need to do a tlb_flush()
when they are written. Remove the unnecessary .writefn and also the
harmless but unneeded .raw_writefn and .resetfn definitions.

Backports commit 6459b94c26dd666badb3547fef1456992a08e60b from qemu
2018-02-23 20:09:12 -05:00
Edgar E. Iglesias eb79db28d5
target-arm/translate-a64.c: Unify some of the ldst_reg decoding
The various load/store variants under disas_ldst_reg can all reuse the
same decoding for opc, size, rt and is_vector.

This patch unifies the decoding in preparation for generating
instruction syndromes for data aborts.
This will allow us to reduce the number of places to hook in updates
to the load/store state needed to generate the insn syndromes.

No functional change.

Backports commit cd694521ca061a5d0436d5df4ec8c17c8f4dfcdb from qemu
2018-02-23 20:06:31 -05:00
Edgar E. Iglesias 602e9e34b9
target-arm/translate-a64.c: Use extract32 in disas_ldst_reg_imm9
Use extract32 instead of open coding the bit masking when decoding
is_signed and is_extended. This streamlines the decoding with some
of the other ldst variants.

No functional change.

Backports commit 026a19c3128678d4fe301fc36e8ffacdc9ecccb8 from qemu
2018-02-23 20:04:11 -05:00
Peter Maydell 56e9d7c09e
target-arm: Split data abort syndrome generator
Split the data abort syndrome generator into two versions:
One with a valid Instruction Specific Syndrome (ISS) and another without.

The following new flags are supported by the syndrome generator
with ISS:
* isv - Instruction syndrome valid
* sas - Syndrome access size
* sse - Syndrome sign extend
* srt - Syndrome register transfer
* sf - Sixty-Four bit register width
* ar - Acquire/Release

These flags are not yet used, so this patch has no functional change
except that we will now correctly set the IL bit in data abort
syndromes without ISS information.

Backports commit 094d028a7968236cd2b7f7b96394f7a3b8ad97c8 from qemu
2018-02-23 20:03:04 -05:00
Edgar E. Iglesias bfc74c4da2
gen-icount: Use tcg_set_insn_param
Use tcg_set_insn_param() instead of directly accessing internal
tcg data structures to update an insn param.

Backports commit 25caa94c4a26daaab1e65c6d887e2972aeb5749e from qemu
2018-02-23 20:01:17 -05:00
Edgar E. Iglesias a30a478538
tcg: Add tcg_set_insn_param
Add tcg_set_insn_param as a mechanism to modify an insn
parameter after emitting the insn. This is useful for icount
and also for embedding fault information for a specific insn.

Backports commit 1d41478fd428e01f057d3248292e4cdcdb048523 from qemu
2018-02-23 19:58:49 -05:00
Sergey Sorokin 98a6d44c54
target-arm: Fix descriptor address masking in ARM address translation
There is a bug in the ARM address translation regime with the
long-descriptor format. When a descriptor is read, its address is formed
from an index which is part of the input address. On the first iteration
this index is incorrectly masked with the 'grainsize' mask, but according
to the pseudo-code it can be wider.
On iterations other than the first, the descriptor address is formed from
the previous-level descriptor by masking with the 'descaddrmask' value.
This always clears just the 12 lower bits, but according to the
pseudo-code it must clear the 'grainsize' lower bits instead.
The patch fixes both cases.

Backports commit dddb5223413c5425ae6eaeb3b967627efc9675f7 from qemu
2018-02-23 19:56:56 -05:00
Sergey Sorokin 00e751f18e
target-arm: Stage 2 permission fault was fixed in AArch32 state
As described in AArch32.CheckS2Permission, an instruction fetch fails if
the XN bit is set or there is no read permission for the address.

Backports commit dfda68377e20943f474505e75238cb96bc6874bf from qemu
2018-02-23 19:55:11 -05:00
Eric Blake 2f42c2c195
qapi: Change visit_type_FOO() to no longer return partial objects
Returning a partial object on error is an invitation for a careless
caller to leak memory. We already fixed things in an earlier
patch to guarantee NULL if visit_start fails ("qapi: Guarantee
NULL obj on input visitor callback error"), but that does not
help the case where visit_start succeeds but some other failure
happens before visit_end, such that we leak a partially constructed
object outside visit_type_FOO(). As no one outside the testsuite
was actually relying on these semantics, it is cleaner to just
document and guarantee that ALL pointer-based visit_type_FOO()
functions always leave a safe value in *obj during an input visitor
(either the new object on success, or NULL if an error is
encountered), so callers can now unconditionally use
qapi_free_FOO() to clean up regardless of whether an error occurred.

The decision is done by adding visit_is_input(), then updating the
generated code to check if additional cleanup is needed based on
the type of visitor in use.

Note that we still leave *obj unchanged after a scalar-based
visit_type_FOO(); I did not feel like auditing all uses of
visit_type_Enum() to see if the callers would tolerate a specific
sentinel value (not to mention having to decide whether it would
be better to use 0 or ENUM__MAX as that sentinel).

Backports commit 68ab47e4b4ecc1c4649362b8cc1e49794d1a6537 from qemu
2018-02-23 19:53:17 -05:00
Eric Blake 0d52542da2
qapi: Simplify semantics of visit_next_list()
The semantics of the list visit are somewhat baroque, with the
following pseudocode when FooList is used:

start()
for (prev = head; cur = next(prev); prev = &cur) {
    visit(&cur->value)
}

Note that these semantics (advance before visit) requires that
the first call to next() return the list head, while all other
calls return the next element of the list; that is, every visitor
implementation is required to track extra state to decide whether
to return the input as-is, or to advance. It also requires an
argument of 'GenericList **' to next(), solely because the first
iteration might need to modify the caller's GenericList head, so
that all other calls have to do a layer of dereferencing.

Thankfully, we only have two uses of list visits in the entire
code base: one in spapr_drc (which completely avoids
visit_next_list(), feeding in integers from a different source
than uint8List), and one in qapi-visit.py. That is, all other
list visitors are generated in qapi-visit.c, and share the same
paradigm based on a qapi FooList type, so we can refactor how
lists are laid out with minimal churn among clients.

We can greatly simplify things by hoisting the special case
into the start() routine, and flipping the order in the loop
to visit before advance:

start(head)
for (tail = *head; tail; tail = next(tail)) {
    visit(&tail->value)
}

With the simpler semantics, visitors have less state to track,
the argument to next() is reduced to 'GenericList *', and it
also becomes obvious whether an input visitor is allocating a
FooList during visit_start_list() (rather than the old way of
not knowing if an allocation happened until the first
visit_next_list()). As a minor drawback, we now allocate in
two functions instead of one, and have to pass the size to
both functions (unless we were to tweak the input visitors to
cache the size to start_list for reuse during next_list, but
that defeats the goal of less visitor state).

The signature of visit_start_list() is chosen to match
visit_start_struct(), with the new parameters after 'name'.

The spapr_drc case is a virtual visit, done by passing NULL for
list, similarly to how NULL is passed to visit_start_struct()
when a qapi type is not used in those visits. It was easy to
provide these semantics for qmp-output and dealloc visitors,
and a bit harder for qmp-input (several prerequisite patches
refactored things to make this patch straightforward). But it
turned out that the string and opts visitors munge enough other
state during visit_next_list() to make it easier to just
document and require a GenericList visit for now; an assertion
will remind us to adjust things if we need the semantics in the
future.

Several pre-requisite cleanup patches made the reshuffling of
the various visitors easier; particularly the qmp input visitor.

Backports commit d9f62dde1303286b24ac8ce88be27e2b9b9c5f46 from qemu
2018-02-23 19:50:26 -05:00
Lioncash ed72ba0f8b
qapi: Fix string input visitor handling of invalid list
As shown in the previous commit, the string input visitor was
treating bogus input as an empty list rather than an error.
Fix parse_str() to set errp, then the callers to exit early if
an error was reported.

Meanwhile, fix the testsuite to use the generated
qapi_free_int16List() instead of rolling our own, and to
validate the fixed behavior, while at the same time documenting
one more change that we'd like to make in a later patch (a
failed visit_start_list should guarantee a NULL pointer,
regardless of what things were on input).

Backports commit 74f24cb6306d065045d0e2215a7d10533fa59c57 from qemu
2018-02-23 19:25:26 -05:00
Eric Blake 6084be1882
qapi: Split visit_end_struct() into pieces
As mentioned in previous patches, we want to call visit_end_struct()
functions unconditionally, so that visitors can release resources
tied up since the matching visit_start_struct() without also having
to worry about error priority if more than one error occurs.

Even though error_propagate() can be safely used to ignore a second
error during cleanup caused by a first error, it is simpler if the
cleanup cannot set an error. So, split out the error checking
portion (basically, input visitors checking for unvisited keys) into
a new function visit_check_struct(), which can be safely skipped if
any earlier errors are encountered, and leave the cleanup portion
(which never fails, but must be called unconditionally if
visit_start_struct() succeeded) in visit_end_struct().

Generated code in qapi-visit.c has diffs resembling:

|@@ -59,10 +59,12 @@ void visit_type_ACPIOSTInfo(Visitor *v,
| goto out_obj;
| }
| visit_type_ACPIOSTInfo_members(v, obj, &err);
|- error_propagate(errp, err);
|- err = NULL;
|+ if (err) {
|+ goto out_obj;
|+ }
|+ visit_check_struct(v, &err);
| out_obj:
|- visit_end_struct(v, &err);
|+ visit_end_struct(v);
| out:

and in qapi-event.c:

@@ -47,7 +47,10 @@ void qapi_event_send_acpi_device_ost(ACP
| goto out;
| }
| visit_type_q_obj_ACPI_DEVICE_OST_arg_members(v, &param, &err);
|- visit_end_struct(v, err ? NULL : &err);
|+ if (!err) {
|+ visit_check_struct(v, &err);
|+ }
|+ visit_end_struct(v);
| if (err) {
| goto out;

Backports commit 15c2f669e3fb2bc97f7b42d1871f595c0ac24af8 from qemu
2018-02-23 19:13:47 -05:00
Eric Blake ae8d475ae0
qmp: Tighten output visitor rules
Tighten assertions in the QMP output visitor, so that:

- qmp_output_get_qobject() can only be called after pairing a
visit_end_* for every visit_start_* (rather than allowing it on
a partially built object)

- qmp_output_get_qobject() cannot be called unless at least one
visit_type_* or visit_start/visit_end pair has occurred since
creation/reset (the accidental return of NULL fixed by commit
ab8bf1d7 would have been much easier to diagnose)

- ensure that we are encountering the expected object or list
type, to provide protection against mismatched push(struct)/
pop(list) or push(list)/pop(struct), similar to the qmp-input
protection added in commit bdd8e6b5.

- ensure that except for the root, 'name' is non-null inside a
dict, and NULL inside a list (this may need changing later if
we add "name.0" support for better error messages for a list,
but for now it makes sure all users are at least consistent)

Backports commit 56a6f02b8ce1fe41a2a9077593e46eca7d98267d from qemu
2018-02-23 19:04:41 -05:00
Eric Blake e5b2cff2bd
qmp: Support explicit null during visits
Implement the new type_null() callback for the qmp input and
output visitors. While we don't yet have a use for this in QAPI
input (the generator will need some tweaks first), some
potential usages have already been discussed on the list.
Meanwhile, the output visitor could already output explicit null
via type_any, but this gives us finer control.

At any rate, it's easy to test that we can round-trip an explicit
null through manual use of visit_type_null() wrapped by a virtual
visit_start_struct() walk, even if we can't do the visit in a
QAPI type. Repurpose the test_visitor_out_empty test,
particularly since a future patch will tighten semantics to
forbid use of qmp_output_get_qobject() without at least one
intervening visit_type_*.

Backports commit 3df016f185521f8dfa5bd89168722887156405c7 from qemu
2018-02-23 19:02:18 -05:00
Eric Blake ef6b7b50f6
qapi: Add visit_type_null() visitor
Right now, qmp-output-visitor happens to produce a QNull result
if nothing is actually visited between the creation of the visitor
and the request for the resulting QObject. A stronger protocol
would require that a QMP output visit MUST visit something. But
to still be able to produce a JSON 'null' output, we need a new
visitor function that states our intentions. Yes, we could say
that such a visit must go through visit_type_any(), but that
feels clunky.

So this patch introduces the new visit_type_null() interface and
its no-op interface in the dealloc visitor, and stubs in the
qmp visitors (the next patch will finish the implementation).
For the visitors that will not implement the callback, document
the situation. The code in qapi-visit-core unconditionally
dereferences the callback pointer, so that a segfault will inform
a developer if they need to implement the callback for their
choice of visitor.

Note that JSON has a primitive null type, with the single value
null; likewise with the QNull type for QObject; but for QAPI,
we just have the 'null' value without a null type. We may
eventually want to add more support in QAPI for null (most likely,
we'd use it via an alternate type that permits 'null' or an
object); but we'll create that usage when we need it.

Backports commit 3bc97fd5924561d92f32758c67eaffd2e4e25038 from qemu
2018-02-23 15:48:57 -05:00
Eric Blake fafb3e354b
qapi: Document visitor interfaces, add assertions
The visitor interface for mapping between QObject/QemuOpts/string
and QAPI is scandalously under-documented, making changes to visitor
core, individual visitors, and users of visitors difficult to
coordinate. Among other questions: when is it safe to pass NULL,
vs. when a string must be provided; which visitors implement which
callbacks; the difference between concrete and virtual visits.

Correct this by retrofitting proper contracts, and document where some
of the interface warts remain (for example, we may want to modify
visit_end_* to require the same 'obj' as the visit_start counterpart,
so the dealloc visitor can be simplified). Later patches in this
series will tackle some, but not all, of these warts.

Add assertions to (partially) enforce the contract. Some of these
were only made possible by recent cleanup commits.

Backports commit adfb264c9ed04bfc694921b72173be8e29e90024 from qemu
2018-02-23 15:45:31 -05:00
Eric Blake 9e999acc83
qapi: Change visit_start_implicit_struct to visit_start_alternate
After recent changes, the only remaining use of
visit_start_implicit_struct() is for allocating the space needed
when visiting an alternate. Since the term 'implicit struct' is
hard to explain, rename the function to its current usage. While
at it, we can merge the functionality of visit_get_next_type()
into the same function, making it more like visit_start_struct().

Generated code is now slightly smaller:

| {
| Error *err = NULL;
|
|- visit_start_implicit_struct(v, (void**) obj, sizeof(BlockdevRef), &err);
|+ visit_start_alternate(v, name, (GenericAlternate **)obj, sizeof(**obj),
|+ true, &err);
| if (err) {
| goto out;
| }
|- visit_get_next_type(v, name, &(*obj)->type, true, &err);
|- if (err) {
|- goto out_obj;
|- }
| switch ((*obj)->type) {
| case QTYPE_QDICT:
| visit_start_struct(v, name, NULL, 0, &err);
...
| }
|-out_obj:
|- visit_end_implicit_struct(v);
|+ visit_end_alternate(v);
| out:
| error_propagate(errp, err);
| }

Backports commit dbf11922622685934bfb41e7cf2be9bd4a0405c0 from qemu
2018-02-23 15:33:25 -05:00
Eric Blake 5389c1cd5f
qmp-input: Refactor when list is advanced
In the QMP input visitor, visiting a list traverses two objects:
the QAPI GenericList of the caller (which gets advanced in
visit_next_list() regardless of this patch), and the QList input
that we are converting to QAPI. For consistency with QDict
visits, we want to consume elements from the input QList during
the visit_type_FOO() for the list element; that is, we want ALL
the code for consuming an input to live in qmp_input_get_object(),
rather than having it split according to whether we are visiting
a dict or a list. Making qmp_input_get_object() the common point
of consumption will make it easier for a later patch to refactor
visit_start_list() to cover the GenericList * head of a QAPI list,
and in turn will get rid of the 'first' flag (which lived in
qmp_input_next_list() pre-patch, and is hoisted to StackObject
by this patch).

This patch is therefore altering the post-condition use of 'entry',
while keeping what gets visited unchanged, from:

start_list  next_list  type_ELT ...  next_list  type_ELT  next_list  end_list
 visits                 1st elt                  last elt
 entry      NULL        1st elt       1st elt    last elt  last elt   NULL     gone

where type_ELT() returns (entry ? entry : 1st elt) and next_list() steps
entry

to this usage:

start_list  next_list  type_ELT ...  next_list  type_ELT  next_list  end_list
 visits                 1st elt                  last elt
 entry      1st elt     1st elt       2nd elt    last elt  NULL       NULL     gone

where type_ELT() steps entry and returns the old entry, and next_list()
leaves entry alone.

Backports commit fcf3cb21783b2dae3358fdbe7001cb2f74e0cedf from qemu
2018-02-23 15:19:40 -05:00
Eric Blake 68cf25fafa
qmp-input: Require struct push to visit members of top dict
Don't embed the root of the visit into the stack of current
containers being visited. That way, we no longer get confused
on whether the first visit of a dictionary is to the dictionary
itself or to one of the members of the dictionary, based on
whether the caller passed name=NULL; and makes the QMP Input
visitor like other visitors where the value of 'name' is now
ignored on the root visit. (We may someday want to revisit
the rules on what 'name' should be on a top-level visit,
rather than just ignoring it; but that would be the topic of
another patch).

An audit of all qmp_input_visitor_new() call sites shows that
there were only two places where callers had previously been
visiting to a QDict with a non-NULL name to bypass a call to
visit_start_struct(), and those were fixed in prior patches.

Backports commit ce140b176920b5b65184020735a3c65ed3e9aeda from qemu
2018-02-23 15:16:43 -05:00
Eric Blake 1bb4e4c787
qmp-input: Don't consume input when checking has_member
Commit e8316d7 mistakenly passed consume=true within
qmp_input_optional() when checking if an optional member was
present, but the mistake was silently ignored since the code
happily let us extract a member more than once. Fix
qmp_input_optional() to not consume anything, then tighten up
the input visitor to ensure that a member is consumed exactly
once (all generated code follows this pattern; and the new
assert will catch any hand-written code that tries to visit
the same key more than once).

Backports commit e5826a2fd727f0be54a81083f31fe02a275465cd from qemu
2018-02-23 15:12:58 -05:00
Eric Blake cae9c2bd2d
qapi: Use strict QMP input visitor in more places
The following uses of a QMP input visitor should be strict
(that is, excess keys in QDict input should be flagged if not
converted to QAPI):

- Testsuite code unrelated to explicitly testing non-strict
mode (test-qmp-commands, test-visitor-serialization); since
we want more code to be strict by default, having more tests
of strict mode doesn't hurt

- Code used for cloning QAPI objects (replay-input.c,
qemu-sockets.c); we are reparsing a QObject just barely
produced by the qmp output visitor and which therefore should
not have any garbage, so while it is extra work to be strict,
it validates that our clone is correct [note that a later patch
series will simplify these two uses by creating an actual
clone visitor that is much more efficient than a
generate/reparse cycle]

- qmp_object_add(), which calls into user_creatable_add_type().
Since command line parsing for '-object' uses the same
user_creatable_add_type() through the OptsVisitor, and that is
always strict, we want to ensure that any nested dictionaries
would be treated the same in QMP and from the command line (I
don't actually know if such nested dictionaries exist). Note
that on this code change, strictness only matters for nested
dictionaries (if even possible), since we already flag excess
input at the top level during an earlier object_property_set()
on an unknown key, whether from QemuOpts:

$ ./x86_64-softmmu/qemu-system-x86_64 -nographic -nodefaults -qmp stdio -object secret,id=sec0,data=letmein,format=raw,foo=bar
qemu-system-x86_64: -object secret,id=sec0,data=letmein,format=raw,foo=bar: Property '.foo' not found

or from QMP:

$ ./x86_64-softmmu/qemu-system-x86_64 -nographic -nodefaults -qmp stdio
{"QMP": {"version": {"qemu": {"micro": 93, "minor": 5, "major": 2}, "package": ""}, "capabilities": []}}
{"execute":"qmp_capabilities"}
{"return": {}}
{"execute":"object-add","arguments":{"qom-type":"secret","id":"sec0","props":{"format":"raw","data":"letmein","foo":"bar"}}}
{"error": {"class": "GenericError", "desc": "Property '.foo' not found"}}

The only remaining uses of non-strict input visits are:

- QMP 'qom-set' (which eventually executes
object_property_set_qobject()) - mark it as something to revisit
in the future (I didn't want to spend any more time on this patch
auditing if we have any QOM dictionary properties that might be
impacted, and couldn't easily prove whether this code path is
shared with anything else).

- test-qmp-input-visitor: explicit tests of non-strict mode. If
we later get rid of users that don't need strictness, then this
test should be merged with test-qmp-input-strict

Backports relevant parts of commit 240f64b6dc3346d044d7beb7cc3a53668ce47384 from qemu
2018-02-23 15:11:35 -05:00
Eric Blake 559304aed9
qapi: Consolidate QMP input visitor creation
Rather than having two separate ways to create a QMP input
visitor, where the safer approach has the more verbose name,
it is better to consolidate things into a single function
where the caller must explicitly choose whether to be strict
or to ignore excess input. This patch is the strictly
mechanical conversion; the next patch will then audit which
uses can be made stricter.

Backports commit fc471c18d5d2ec713d5a019f9530398675494bc8 from qemu
2018-02-23 15:09:57 -05:00
Eric Blake b1c4558849
qmp-input: Clean up stack handling
Management of the top of stack was a bit verbose; creating a
temporary variable and adding some comments makes the existing
code more legible before the next few patches improve things.
No semantic changes other than asserting that we are always
visiting a QObject, and not a NULL value. In particular, the
check for 'name && qobject_type(qobj) == QTYPE_QDICT)' is a
bit overkill (a dict visit should always have a name); a later
patch revisits that, while this patch is only changing one
layer of indentation due to dropping 'if (qobj)'.

Backports commit b471d012e5d7bec1d2272738141e121b5581fcdf from qemu
2018-02-23 15:08:14 -05:00
Eric Blake 0ec9a5adaf
qapi: Guarantee NULL obj on input visitor callback error
Our existing input visitors were not very consistent on errors in a
function taking 'TYPE **obj'. These are start_struct(),
start_alternate(), type_str(), and type_any(). next_list() is
similar, but can't fail (see commit 08f9541). While all of them set
'*obj' to allocated storage on success, it was not obvious whether
'*obj' was guaranteed safe on failure, or whether it was left
uninitialized. But a future patch wants to guarantee that
visit_type_FOO() does not leak a partially-constructed obj back to
the caller; it is easier to implement this if we can reliably state
that input visitors assign '*obj' regardless of success or failure,
and that on failure *obj is NULL. Add assertions to enforce
consistency in the final setting of err vs. *obj.

The opts-visitor start_struct() doesn't set an error, but it
also was doing a weird check for 0 size; all callers pass in
non-zero size if obj is non-NULL.

The testsuite has at least one spot where we no longer need
to pre-initialize a variable prior to a visit; valgrind confirms
that the test is still fine with the cleanup.

A later patch will document the design constraint implemented
here.

Backports commit e58d695e6c3a5cfa0aa2fc91b87ade017ef28b05 from qemu
2018-02-23 14:53:23 -05:00
Eric Blake 3cf7b6dd3b
qapi: Adjust layout of FooList types
By sticking the next pointer first, we don't need a union with
64-bit padding for smaller types.  On 32-bit platforms, this
can reduce the size of uint8List from 16 bytes (or 12, depending
on whether 64-bit ints can tolerate 4-byte alignment) down to 8.
It has no effect on 64-bit platforms (where alignment still
dictates a 16-byte struct); but fewer anonymous unions is still
a win in my book.

It requires visit_next_list() to gain a size parameter, to know
what size element to allocate; comparable to the size parameter
of visit_start_struct().

I debated about going one step further, to allow for fewer casts,
by doing:
    typedef GenericList GenericList;
    struct GenericList {
        GenericList *next;
    };
    struct FooList {
        GenericList base;
        Foo *value;
    };
so that you convert to 'GenericList *' by '&foolist->base', and
back by 'container_of(generic, GenericList, base)' (as opposed to
the existing '(GenericList *)foolist' and '(FooList *)generic').
But doing that would require hoisting the declaration of
GenericList prior to inclusion of qapi-types.h, rather than its
current spot in visitor.h; it also makes iteration a bit more
verbose through 'foolist->base.next' instead of 'foolist->next'.

Note that for lists of objects, the 'value' payload is still
hidden behind a boxed pointer.  Someday, it would be nice to do:

struct FooList {
    FooList *next;
    Foo value;
};

for one less level of malloc for each list element.  This patch
is a step in that direction (now that 'next' is no longer at a
fixed non-zero offset within the struct, we can store more than
just a pointer's-worth of data as the value payload), but the
actual conversion would be a task for another series, as it will
touch a lot of code.

Backports commit e65d89bf1a4484e0db0f3dc820a8b209f2fb1e8b from qemu
2018-02-23 14:49:06 -05:00
Eric Blake eef0932471
qapi-visit: Add visitor.type classification
We have three classes of QAPI visitors: input, output, and dealloc.
Currently, all implementations of these visitors have one thing in
common based on their visitor type: the implementation used for the
visit_type_enum() callback. But since we plan to add more such
common behavior, in relation to documenting and further refining
the semantics, it makes more sense to have the visitor
implementations advertise which class they belong to, so the common
qapi-visit-core code can use that information in multiple places.

A later patch will better document the types of visitors directly
in visitor.h.

For this patch, knowing the class of a visitor implementation lets
us make input_type_enum() and output_type_enum() become static
functions, by replacing the callback function Visitor.type_enum()
with the simpler enum member Visitor.type. Share a common
assertion in qapi-visit-core as part of the refactoring.

Move comments in opts-visitor.c to match the refactored layout.

Backports commit 983f52d4b3f86fb9dc9f8b142132feb5a8723016 from qemu
2018-02-23 14:25:41 -05:00
Dave Hansen f50acc467f
target-i386: fix typo in xsetbv implementation
QEMU 2.6 added support for the XSAVE family of instructions, which
includes the XSETBV instruction which allows setting the XCR0
register.

But, when booting Linux kernels with XSAVE support enabled, I was
getting very early crashes where the instruction pointer was set
to 0x3. I tracked it down to a jump instruction generated by this:

gen_jmp_im(s->pc - pc_start);

where s->pc is pointing to the instruction after XSETBV and pc_start
is pointing _at_ XSETBV. Subtract the two and you get 0x3. Whoops.

The fix is to replace this typo with the pattern found everywhere
else in the file when folks want to end the translation buffer.

Richard Henderson confirmed that this is a bug and that this is the
correct fix.

Backports commit 502c8e86ea07294067578292c6d402601c196019 from qemu
2018-02-23 14:15:35 -05:00
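The difference boils down to one operand in the translator; a sketch
following the commit description (the fixed form shown here assumes the
usual end-of-translation pattern in translate.c):

    /* buggy: the operand is the instruction length, so execution
       resumes at an absolute guest address of 0x3 */
    gen_jmp_im(s->pc - pc_start);
    gen_eob(s);

    /* fixed: jump to the address of the next instruction */
    gen_jmp_im(s->pc - s->cs_base);
    gen_eob(s);
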
Fam Zheng 5c739f14f5
util: Fix MIN_NON_ZERO
MIN_NON_ZERO(1, 0) is evaluated to 0. Rewrite the macro to fix it.

Backports commit b6ece2c6f37926a994bc564a9e55ef3be6016d8f from qemu
2018-02-23 14:09:44 -05:00
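An illustrative sketch of the failure mode and the kind of rewrite it
needs (the exact expressions are in the backported commit):

    /* buggy: for a = 1, b = 0 the condition (a != 0 && a <= b) is false,
       so the macro falls through to b and yields 0 */
    #define MIN_NON_ZERO(a, b) (((a) != 0 && (a) <= (b)) ? (a) : (b))

    /* fixed: handle a zero operand on either side explicitly */
    #define MIN_NON_ZERO(a, b) ((a) == 0 ? (b) : ((b) == 0 ? (a) : MIN(a, b)))
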
Artyom Tarasenko 1781d5cfa6
target-sparc: fix register corruption in ldstub if there is no write permission
Backports commit 9566ceeef41ccb5241d340b34776a33450e8f9e5 from qemu
2018-02-23 14:06:38 -05:00
Paolo Bonzini 44c4dd02c9
target-i386: key sfence availability on CPUID_SSE, not CPUID_SSE2
sfence was introduced before lfence and mfence. This fixes Linux
2.4's measurement of checksumming speeds for the pIII_sse
algorithm:

md: linear personality registered as nr 1
md: raid0 personality registered as nr 2
md: raid1 personality registered as nr 3
md: raid5 personality registered as nr 4
raid5: measuring checksumming speed
8regs : 384.400 MB/sec
32regs : 259.200 MB/sec
invalid operand: 0000
CPU: 0
EIP: 0010:[<c0240b2a>] Not tainted
EFLAGS: 00000246
eax: c15d8000 ebx: 00000000 ecx: 00000000 edx: c15d5000
esi: 8005003b edi: 00000004 ebp: 00000000 esp: c15bdf50
ds: 0018 es: 0018 ss: 0018
Process swapper (pid: 1, stackpage=c15bd000)
Stack: 00000000 00000000 00000000 00000000 00000000 00000000 00000000
00000000
00000000 00000000 00000000 00000000 00000000 00000000 00000000
00000000
00000000 00000206 c0241c6c 00001000 c15d4000 c15d7000 c15d4000
c15d4000
Call Trace: [<c0241c6c>] [<c0105000>] [<c0241db4>] [<c010503b>]
[<c0105000>]
[<c0107416>] [<c0105030>]

Code: 0f ae f8 0f 10 04 24 0f 10 4c 24 10 0f 10 54 24 20 0f 10 5c
<0>Kernel panic: Attempted to kill init!

Backports commit bd5d278668f33aa08755a982986cd1159746c037 from qemu
2018-02-23 14:03:19 -05:00
Aurelien Jarno 0c4bebb9bc
target-mips: fix call to memset in soft reset code
Recent versions of GCC report the following error when compiling
target-mips/helper.c:

qemu/target-mips/helper.c:542:9: warning: ‘memset’ used with length
equal to number of elements without multiplication by element size
[-Wmemset-elt-size]

Backports commit a525decfaa3449f1458ea2d7a06320cf46aebf3f from qemu
2018-02-23 14:01:50 -05:00
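A hypothetical example of the warning pattern and its fix (the array here
is made up; the actual line is in target-mips/helper.c):

    int32_t regs[32];

    /* warns: the length is the number of elements, not the size in bytes,
       so only part of the array would be cleared */
    memset(regs, 0, ARRAY_SIZE(regs));

    /* fixed: clear the whole array */
    memset(regs, 0, sizeof(regs));
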
James Hogan e4903fc5f2
target-mips: Fix RDHWR exception host PC
Commit b00c72180c36 ("target-mips: add PC, XNP reg numbers to RDHWR")
changed the rdhwr helpers to use check_hwrena() to check the register
being accessed is enabled in CP0_HWREna when used from user mode. If
that check fails an EXCP_RI exception is raised at the host PC
calculated with GETPC().

However check_hwrena() may not be fully inlined as the
do_raise_exception() part of it is common regardless of the arguments.
This causes GETPC() to calculate the address in the call in the helper
instead of the generated code calling the helper. No TB will be found
and the EPC reported with the resulting guest RI exception points to the
beginning of the TB instead of the RDHWR instruction.

We can't reliably force check_hwrena() to be inlined, and converting it
to a macro would be ugly, so instead pass the host PC in as an argument,
with each rdhwr helper passing GETPC(). This should avoid any dependence
on compiler behaviour, and in practice seems to ensure the full inlining
of check_hwrena() on x86_64.

This issue causes failures when running a MIPS KVM (trap & emulate)
guest in a MIPS QEMU TCG guest, as the inner guest kernel will do a
RDHWR of counter, which is disabled in the outer guest's CP0_HWREna by
KVM so it can emulate the inner guest's counter. The emulation fails and
the RI exception is passed to the inner guest.

Backports commit d96391c1ffeb30a0afa695c86579517c69d9a889 from qemu
2018-02-23 13:59:37 -05:00
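A sketch of the resulting pattern (signatures are an assumption based on
the commit description): GETPC() is evaluated in the top-level helper, so
the captured host PC belongs to the generated code that called it, and is
then passed down.

    static inline void check_hwrena(CPUMIPSState *env, int reg, uintptr_t pc)
    {
        if ((env->hflags & MIPS_HFLAG_CP0) || (env->CP0_HWREna & (1 << reg))) {
            return;
        }
        /* unwind using the host PC of the generated code, not of this function */
        do_raise_exception(env, EXCP_RI, pc);
    }

    target_ulong helper_rdhwr_cc(CPUMIPSState *env)
    {
        check_hwrena(env, 2, GETPC());
        return env->CP0_Count;
    }
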
Christoffer Dall 6180cbd477
util: align memory allocations to 2M on AArch64
For KVM to use Transparent Huge Pages (THP) we have to ensure that the
alignment of the userspace address of the KVM memory slot and the IPA
that the guest sees for a memory region have the same offset from the 2M
huge page size boundary.

One way to achieve this is to always align the IPA region at a 2M
boundary and ensure that the mmap alignment is also at 2M.

Unfortunately, we were only doing this for __arm__, not for __aarch64__,
so add this simple condition.

This fixes a performance regression using KVM/ARM on AArch64 platforms
that showed a performance penalty of more than 50%, introduced by the
following commit:

9fac18f (oslib: allocate PROT_NONE pages on top of RAM, 2015-09-10)

We were only lucky before the above commit, because we were allocating
large regions and naturally getting a 2M alignment on those allocations
then.

Backports commit ee1e0f8e5d3682c561edcdceccff72b9d9b16d8b from qemu
2018-02-23 13:56:59 -05:00
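The change amounts to extending an #ifdef; a sketch (the macro name
follows QEMU's oslib code, and the exact condition is an assumption):

    /* use 2 MiB alignment so transparent huge pages can be used by KVM */
    #if defined(__linux__) && \
        (defined(__x86_64__) || defined(__arm__) || defined(__aarch64__))
    #  define QEMU_VMALLOC_ALIGN (512 * 4096)
    #endif
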
Aurelien Jarno 6060ab6596
tcg: check for CONFIG_DEBUG_TCG instead of NDEBUG
Check for CONFIG_DEBUG_TCG instead of NDEBUG, drop now useless code.

Backports commit 8d8fdbae010aa75a23f0307172e81034125aba6e from qemu
2018-02-23 13:55:21 -05:00
Aurelien Jarno 355ed7cd08
tcg: use tcg_debug_assert instead of assert (fix performance regression)
The TCG code is quite performance sensitive, but at the same time can
also be quite tricky. That is why it has asserts which can be enabled
with the --enable-debug-tcg configure option.

This used to work the following way:

| #include "config.h"
|
| ...
|
| #if !defined(CONFIG_DEBUG_TCG) && !defined(NDEBUG)
| /* define it to suppress various consistency checks (faster) */
| #define NDEBUG
| #endif
|
| ...
|
| #include <assert.h>

Since commit 757e725b (tcg: Clean up includes) "config.h" has been
replaced by "qemu/osdep.h" which itself includes <assert.h>. As a
consequence the assertions are always enabled, even when using
--disable-debug-tcg, causing a performance regression, especially on
targets with many registers. For instance on qemu-system-ppc the
speed difference is about 15%.

tcg_debug_assert is controlled directly by CONFIG_DEBUG_TCG and is
already used in some places. This patch replaces all the calls to assert
with calls to tcg_debug_assert.

Backports commit eabb7b91b36b202b4dac2df2d59d698e3aff197a from qemu
2018-02-23 13:52:13 -05:00
Artyom Tarasenko af0e282ab1
target-sparc: fix Trap Based Address Register behavior for sparc64
According to chapter 7.6 Trap Processing of the SPARC Architecture Manual v9,
the Trap Based Address Register is not modified as a trap is taken.

This fix allows booting FreeBSD-10.3-RELEASE-sparc64.

Backports commit de5f1077446ca455342db149737bdc395a7b9882 from qemu
2018-02-23 13:39:59 -05:00
Artyom Tarasenko 34472bd6fd
target-sparc: fix Nucleus quad LDD 128 bit access for windowed registers
Fix register offset calculation when regwptr is used.

Backports commit 01a780d51a3a0851729e1747f3787a0db4d96722 from qemu
2018-02-23 13:39:34 -05:00
Mark Cave-Ayland 7dcdae9807
target-sparc: fix ldstub sign-extension bug
ldstub [addr], reg incorrectly reads a signed byte from memory which causes
problems in the 32-bit Solaris mutex code. Here the byte value being read is
0xff which is incorrectly sign-extended to 0xffffffff before being written back
to the target register causing lock detection to behave incorrectly.

This fixes the intermittent hangs and MUTEX_HELD warnings issued to the
console when running 32-bit Solaris images under qemu-system-sparc.

With thanks to Joseph Dery for providing a condensed test image to consistently
reproduce the problem on demand, and Martin Husemann for allowing me access to
real hardware for comparison.

Backports commit 4553e10360a0713e31647220ed396942f9a6fca0 from qemu
2018-02-23 13:37:36 -05:00
Emilio G. Cota 4cacbf212f
translate-all: add missing fold of tb_ctx into tcg_ctx
Since 5e5f07e08 "TCG: Move translation block variables
to new context inside tcg_ctx: tb_ctx" on Feb 1 2013, compilation
of usermode + TB_DEBUG_CHECK has been broken. Fix it.

Backports commit 7e6bd36d61129feb7f667cb09ffec1b7b54b971c from qemu
2018-02-23 13:35:42 -05:00
Paolo Bonzini bdcea2bcb0
target-i386: check for PKU even for non-writable pages
Xiao Guangrong ran kvm-unit-tests on an actual machine with PKU and
found that it fails:

test pte.p pte.user pde.p pde.user pde.a pde.pse pkru.wd pkey=1 user write efer.nx cr4.pke: FAIL: error code 27 expected 7
Dump mapping: address: 0x123400000000
------L4: 2ebe007
------L3: 2ebf007
------L2: 8000000020000a5

(All failures are combinations of "pde.user pde.p pkru.wd pkey=1",
plus either "pde.pse" or "pte.p pte.user", plus one of "user cr0.wp",
"cr0.wp" or "user", plus unimportant bits such as accessed/dirty or
efer.nx).

So PFEC.PKEY is set even if the ordinary check failed (which it did
because pde.w is zero). Adjust QEMU to match behavior of silicon.

Backports commit 44d066a2f770ee9d61fd1c2a609bdf2a994dfdf7 from qemu
2018-02-23 13:23:37 -05:00
James Hogan 41c6079823
tcg/mips: Fix type of tcg_target_reg_alloc_order[]
The MIPS TCG backend is the only one to have
tcg_target_reg_alloc_order[] elements of type TCGReg rather than int.
This resulted in commit 91478cefaaf2 ("tcg: Allocate indirect_base
temporaries in a different order") breaking the build on MIPS since the
type differed from indirect_reg_alloc_order[]:

tcg/tcg.c:1725:44: error: pointer type mismatch in conditional expression [-Werror]
order = rev ? indirect_reg_alloc_order : tcg_target_reg_alloc_order;
^

Make it an array of ints to fix the build and match other architectures.

Backports commit 2dc7553d0c0a3915c649e1a91b0f0be70b4674b3 from qemu
2018-02-23 13:21:44 -05:00
Lioncash 88af0b0153
target-arm: Get rid of unused variable warnings 2018-02-23 12:43:09 -05:00
Lioncash 87130fc884
exec-all: Remove externs
These are unused
2018-02-23 12:43:03 -05:00
Chen Fan d0621f1852
cpu: Introduce X86CPUTopoInfo structure for argument simplification
In order to simplify function arguments, introduce a new struct
named X86CPUTopoInfo.

Backports commit ed256144cd6f0ca2ff59fc3fc8dca547506f433b from qemu
2018-02-23 10:58:43 -05:00
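A minimal sketch of such a struct (the field names are an assumption
based on the usual topology parameters):

    typedef struct X86CPUTopoInfo {
        unsigned nr_cores;    /* cores per package */
        unsigned nr_threads;  /* threads per core  */
    } X86CPUTopoInfo;

    /* call sites can then pass a single &topo_info instead of
       separate core/thread count arguments */
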
Peter Crosthwaite 576f1752a6
include/exec: Move cputlb exec.c defs out
Move the architecture agnostic function prototypes for exec.c out of
cputlb.h to exec-all.h. This allows hiding of the arch specific
cputlb.h from exec.c which should be getting close to having no
architecture specifics. Prepares support for multi-arch, which will have
a minimal cpu.h that services exec.c but not cputlb.h.

Backports commit dfccc7602374c9fd3b083208b552d62daa244811 from qemu
2018-02-23 10:52:25 -05:00
Peter Crosthwaite 97c9423ee8
cputlb: move CPU_LOOP() for tlb_reset() to exec.c
To prepare for multi-arch, cputlb.c should only have awareness of one
single architecture. This means it should not have access to the full
CPU lists which may be heterogeneous. Instead, push the CPU_LOOP() up
to the one and only caller in exec.c.

Backports commit 9a13565d52bfd321934fb44ee004bbaf5f5913a8 from qemu
2018-02-23 10:46:31 -05:00
Paolo Bonzini 9479199c6b
memory: fix usage of find_next_bit and find_next_zero_bit
The last two arguments to these functions are the last and first bit to
check relative to the base. The code was incorrectly using the first
bit and the number of bits. Fix this in cpu_physical_memory_get_dirty
and cpu_physical_memory_all_dirty. This requires a few changes in the
iteration; change the code in cpu_physical_memory_set_dirty_range to
match.

Backports commit 88c73d16ad1b6c22a2ab082064d0d521f756296a from qemu
2018-02-22 19:51:43 -05:00
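As a reminder of the API being fixed against (prototype per bitops.h; the
usage lines and variable names are illustrative): the size argument is an
absolute end index, not a count relative to the starting offset.

    /* find the next set bit at or after 'offset', stopping before bit 'size' */
    unsigned long find_next_bit(const unsigned long *addr,
                                unsigned long size,
                                unsigned long offset);

    next = find_next_bit(bitmap, nr, page);   /* wrong: 'nr' treated as a count   */
    next = find_next_bit(bitmap, end, page);  /* right: pass the absolute end bit */
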
Alex Bennée 171d267209
include/qemu/atomic.h: default to __atomic functions
The __atomic primitives have been available since GCC 4.7 and provide
a richer interface for describing memory ordering requirements. As a
bonus, by using the primitives instead of hand-rolled functions we can
use tools such as ThreadSanitizer, which needs well-defined APIs for its
analysis.

If we have __ATOMIC defines we exclusively use the __atomic primitives
for all our atomic access. Otherwise we fall back to the mixture of
__sync and hand-rolled barrier cases.

Backports commit a0aa44b488b3601415d55041e4619aef5f3a4ba8 from qemu
2018-02-22 16:12:59 -05:00
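A simplified sketch of relaxed load/store wrappers built on the __atomic
builtins (QEMU's real macros add size checks and temporaries; this is
only illustrative):

    #if defined(__ATOMIC_RELAXED)
    #define atomic_read(ptr)    __atomic_load_n(ptr, __ATOMIC_RELAXED)
    #define atomic_set(ptr, i)  __atomic_store_n(ptr, i, __ATOMIC_RELAXED)
    #endif
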
Paolo Bonzini 4e7259a49b
atomics: add explicit compiler fence in __atomic memory barriers
__atomic_thread_fence does not include a compiler barrier; in the
C++11 memory model, fences take effect in combination with other
atomic operations.  GCC implements this by making __atomic_load and
__atomic_store access memory as if the pointer was volatile, and
leaves no trace whatsoever of acquire and release fences in the
compiler's intermediate representation.

In QEMU, we want memory barriers to act on all memory, but at the same
time we would like to use __atomic_thread_fence for portability reasons.
Add compiler barriers manually around the __atomic_thread_fence.

Backports commit 3bbf572345c65813f86a8fc434ea1b23beb08e16 from qemu
2018-02-22 15:56:37 -05:00
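A sketch of the pattern the commit describes, with barrier() being the
usual asm-volatile compiler fence:

    #define barrier()  asm volatile("" ::: "memory")

    /* __atomic_thread_fence() alone leaves no trace for non-atomic accesses
       in the compiler's IR, so bracket it with compiler barriers */
    #define smp_mb()   ({ barrier(); __atomic_thread_fence(__ATOMIC_SEQ_CST); barrier(); })
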
Paolo Bonzini 02e3eeff40
atomic: fix position of volatile qualifier
What needs to be volatile is not the pointer, but the pointed-to
value!

Backports commit 2cbcfb281afa041a41f6e4c4da0f5c9314084604 from qemu
2018-02-22 15:52:48 -05:00
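Illustrative before/after (the macro names are invented for the example):

/* wrong: qualifies the pointer itself, so the access is not volatile */
#define READ_ONCE_BAD(ptr)   (*(volatile __typeof__(ptr))(ptr))

/* right: qualifies the pointed-to value */
#define READ_ONCE_GOOD(ptr)  (*(__typeof__(*(ptr)) volatile *)(ptr))
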
Stefan Hajnoczi e79e0881cd
memory: RCU ram_list.dirty_memory[] for safe RAM hotplug
Although accesses to ram_list.dirty_memory[] use atomics so multiple
threads can safely dirty the bitmap, the data structure is not fully
thread-safe yet.

This patch handles the RAM hotplug case where ram_list.dirty_memory[] is
grown.  ram_list.dirty_memory[] is changed from a regular bitmap to an
RCU array of pointers to fixed-size bitmap blocks.  Threads can continue
accessing bitmap blocks while the array is being extended.  See the
comments in the code for an in-depth explanation of struct
DirtyMemoryBlocks.

I have tested that live migration with virtio-blk dataplane works.

Backports commit 5b82b703b69acc67b78b98a5efc897a3912719eb from qemu
2018-02-22 15:38:03 -05:00
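The core of the data structure is roughly the following (simplified sketch; see the real code for the exact constants and helpers):

#define DIRTY_MEMORY_BLOCK_SIZE  ((ram_addr_t)256 * 1024 * 8)

typedef struct DirtyMemoryBlocks {
    struct rcu_head rcu;
    unsigned long *blocks[];   /* fixed-size bitmap chunks */
} DirtyMemoryBlocks;

/* Growing the bitmap allocates a new, larger pointer array, copies the old
 * block pointers, adds fresh blocks for the new range, and publishes the
 * array with an RCU-style atomic pointer update; readers keep using the
 * old array until they leave their RCU read section. */
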
Peter Maydell a632d1b96d
target-arm: Make the 64-bit version of VTCR do the migration
Move the ALIAS tag from VTCR_EL2 to VTCR so that we migrate the
64-bit version, as is usual. (This has no particular effect now
unless the guest wrote to the high RES0 bits of VTCR_EL2.)
Add a comment about why it's OK that we don't have the various
accessor functions that the EL1 TCR regdefs do.

Backports commit bf06c1123a427fefc2cf9cf8019578eafc19eb6f from qemu
2018-02-22 11:53:19 -05:00
Peter Maydell a93e873441
target-arm: Remove incorrect ALIAS tags from ESR_EL2 and ESR_EL3
The regdefs for the ESR_EL2 and ESR_EL3 system registers should not
be marked as ARM_CP_ALIAS, because these are the master copies; the
DFSR regdef in vmsa_pmsa_cp_reginfo[] is marked as an alias.
Remove the ALIAS tags so that these registers are correctly migrated.

Backports commit 094a7d0b9d10812d06be2c5c19288cee4603c693 from qemu
2018-02-22 11:40:20 -05:00
Peter Maydell f1b5b5cea9
target-arm: Correctly reset SCTLR_EL3 for 64-bit CPUs
The regdef for SCTLR_EL3 was incorrectly marked as being an
ARM_CP_ALIAS, with the remark that this was because the 32-bit
definition would take care of reset and migration. However the
intention for banked registers as documented in the comment in
add_cpreg_to_hashtable() is:

* 2) If ARMv8 is enabled then we can count on a 64-bit version
* taking care of the secure bank. This requires that separate
* 32 and 64-bit definitions are provided.

and so it marks the 32-bit secure banked version as an alias.
This results in the sctlr_s/sctlr_el[3] field never being reset
or migrated for a 64-bit CPU with EL3 enabled.

Fix this by removing the ARM_CP_ALIAS annotation from SCTLR_EL3.
Since this means it now needs a real reset value, move the regdef
into the same place that we define the 32-bit SCTLR.

Backports commit e24fdd238a159d830a9a65dd9b08f80fba9b9e06 from qemu
2018-02-22 11:38:16 -05:00
Leon Alrae 224cbb008a
target-mips: indicate presence of IEEE 754-2008 FPU in R6/R5+MSA CPUs
MIPS Release 6 and MIPS SIMD Architecture make it mandatory to have IEEE
754-2008 FPU which is indicated by CP1 FIR.HAS2008, FCSR.ABS2008 and
FCSR.NAN2008 bits set to 1.

In QEMU we still keep these bits cleared as there is no 2008-NaN support.
However, this now causes problems that prevent running R6 Linux with
the v4.5 kernel. The kernel refuses to execute 2008-NaN ELFs on a CPU
whose FPU does not support 2008-NaN encoding:

(...)
VFS: Mounted root (ext4 filesystem) readonly on device 8:0.
devtmpfs: mounted
Freeing unused kernel memory: 256K (ffffffff806f0000 - ffffffff80730000)
request_module: runaway loop modprobe binfmt-464c
Starting init: /sbin/init exists but couldn't execute it (error -8)
request_module: runaway loop modprobe binfmt-464c
Starting init: /bin/sh exists but couldn't execute it (error -8)
Kernel panic - not syncing: No working init found. Try passing init= option to kernel. See Linux Documentation/init.txt for guidance.

Therefore always indicate presence of 2008-NaN support in R6 as well as in
R5+MSA CPUs, even though this feature is not yet supported by MIPS in QEMU.

Backports commit ba5c79f26221c0fd7139c883a34a4e75d993f732 from qemu
2018-02-22 11:30:08 -05:00
Denis V. Lunev eb29ff04ca
log: move qemu_log_close/qemu_log_flush from header to log.c
There is no particular reason to keep these functions in the header.
Suggested by Paolo.

Backports commit 99affd1d5bd4e396ecda50e53dfbc5147fa1313d from qemu
2018-02-22 11:13:17 -05:00
Yongbok Kim 6602163087
target-mips: add MAAR, MAARI register
The MAAR register is a read/write register included in Release 5
of the architecture that defines the accessibility attributes of
physical address regions. In particular, MAAR defines whether an
instruction fetch or data load can speculatively access a memory
region within the physical address bounds specified by MAAR.

As QEMU doesn't do speculative access, this patch only
provides the ability to access the registers.

Backports commit f6d4dd810983fdf3d1c9fb81838167efef63d1c8 from qemu
2018-02-22 11:00:17 -05:00
Yongbok Kim 15e0109162
target-mips: use CP0_CHECK for gen_m{f|t}hc0
Reuse CP0_CHECK macro for gen_m{f|t}hc0.

Backports commit c98d3d79ee387ea6e8fb091299f8562b20022f10 from qemu
2018-02-22 10:49:55 -05:00
Leon Alrae 0c5ebbd096
target-mips: check CP0 enabled for CACHE instruction also in R6
Backports commit 40d48212f934d4deab40ffe84a0f9c4c553d4742 from qemu
2018-02-22 10:47:54 -05:00
Leon Alrae 70306ec586
target-mips: enable CM GCR in MIPS64R6-generic CPU
Indicate that in the MIPS64R6-generic CPU the memory-mapped
Global Configuration Register Space is implemented.

Backports commit a9a95061715ca09abff56a3f239f704c410912c2 from qemu
2018-02-22 10:46:40 -05:00
Yongbok Kim d65583df80
target-mips: add CMGCRBase register
Physical base address for the memory-mapped Coherency Manager Global
Configuration Register space.
The MIPS default location for the GCR_BASE address is 0x1FBF_8.
This register only exists if Config3 CMGCR is set to one.

Backports commit c870e3f52cac0c8a4a1377398327c4ff20d49d41 from qemu
2018-02-22 10:43:26 -05:00
Paolo Bonzini 1435732c0d
target-i386: implement PKE for TCG
Backports commit 0f70ed4759a29ca932af1e9525729f4f455642f8 from qemu
2018-02-22 10:18:55 -05:00
Dr. David Alan Gilbert 58fcb2b57b
config.status: Pass extra parameters
This allows you to do:
./config.status --the-option-you-forgot

Backports commit cf7cc9291bf7f2f6470815db876ed28eb474ea52 from qemu
2018-02-22 10:12:54 -05:00
Alex Bennée d01c318b3e
cputlb: modernise the debug support
To avoid cluttering the code with #ifdef legs we wrap up the print
statements into a tlb_debug() macro. As access to the virtual TLB can
get quite heavy, defining DEBUG_TLB_LOG will ensure all the logs go to
the qemu_log target of CPU_LOG_MMU instead of stderr. This remains
compile time optional as these debug statements haven't been considered
for usefulness for user visible logging.

I've also removed DEBUG_TLB_CHECK which wasn't used.

Backports commit 8526e1f4e418443a4d6ed0714487e47d45ef9c98 from qemu
2018-02-22 10:10:45 -05:00
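The wrapper looks roughly like this (sketch of the approach, not a verbatim copy):

#ifdef DEBUG_TLB
# define DEBUG_TLB_GATE 1
#else
# define DEBUG_TLB_GATE 0
#endif

/* with DEBUG_TLB_LOG defined, the real version routes this through
 * qemu_log_mask(CPU_LOG_MMU, ...) instead of stderr */
#define tlb_debug(fmt, ...) do {                                   \
    if (DEBUG_TLB_GATE) {                                          \
        fprintf(stderr, "%s: " fmt, __func__, ## __VA_ARGS__);     \
    }                                                              \
} while (0)
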
Alex Bennée 3da7d9d9ae
qemu-log: dfilter-ise exec, out_asm, op and opt_op
This ensures the code generation debug code will honour -dfilter if set.
For the "exec" tracing I've added a new inline macro for efficiency's
sake.

Backports commit d977e1c2dbc9e63454b2000f91954d02543bf43b from qemu
2018-02-22 10:06:19 -05:00
Alex Bennée 2d401b6f23
qemu-log: new option -dfilter to limit output
When debugging big programs or system emulation sometimes you want both
the verbosity of cpu,exec et al. but don't want to generate lots of logs
for unneeded stuff. This patch adds a new option -dfilter which allows
you to specify interesting address ranges in the form:

-dfilter 0x8000..0x8fff,0xffffffc000080000+0x200,...

Then logging code can use the new qemu_log_in_addr_range() function to
decide if it will output logging information for the given range.

Backports commit 3514552e04388d8e7686bcf89efd022e892acb5b from qemu
2018-02-22 10:02:26 -05:00
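A typical logging call site then consults the filter like this (illustrative only):

if (qemu_loglevel_mask(CPU_LOG_TB_IN_ASM) &&
    qemu_log_in_addr_range(tb->pc)) {
    qemu_log("IN: %s\n", lookup_symbol(tb->pc));
    /* ... disassembly output ... */
}
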
Lioncash b895ae38a9
fpu: silence warnings 2018-02-22 09:52:28 -05:00
Peter Maydell 3f5e36e15f
qemu-log: Improve the exec TB execution logging
Improve the TB execution logging so that it is easier to identify
what is happening from trace logs:
* move the "Trace" logging of executed TBs into cpu_tb_exec()
so that it is emitted if and only if we actually execute a TB,
and for consistency with the CPU state logging
* log when we link two TBs together via tb_add_jump()
* log when cpu_tb_exec() returns early from a chain of TBs

The new style logging looks like this:

Trace 0x7fb7cc822ca0 [ffffffc0000dce00]
Linking TBs 0x7fb7cc822ca0 [ffffffc0000dce00] index 0 -> 0x7fb7cc823110 [ffffffc0000dce10]
Trace 0x7fb7cc823110 [ffffffc0000dce10]
Trace 0x7fb7cc823420 [ffffffc000302688]
Trace 0x7fb7cc8234a0 [ffffffc000302698]
Trace 0x7fb7cc823520 [ffffffc0003026a4]
Trace 0x7fb7cc823560 [ffffffc0000dce44]
Linking TBs 0x7fb7cc823560 [ffffffc0000dce44] index 1 -> 0x7fb7cc8235d0 [ffffffc0000dce70]
Trace 0x7fb7cc8235d0 [ffffffc0000dce70]
Stopped execution of TB chain before 0x7fb7cc8235d0 [ffffffc0000dce70]
Trace 0x7fb7cc8235d0 [ffffffc0000dce70]
Trace 0x7fb7cc822fd0 [ffffffc0000dd52c]

Backports commit 1a830635229e14c403600167823ea6b3b79d3097 from qemu
2018-02-22 09:40:11 -05:00
Peter Maydell 66e1bacd64
qemu-log: Avoid function call for disabled qemu_log_mask logging
Make qemu_log_mask() a macro which only calls the function to
do the actual work if the logging is enabled. This avoids making
a function call in possible fast paths where logging is disabled.

Backports commit 7ee606230e6b7645d92365d9b39179368e83ac54 from qemu
2018-02-22 09:32:48 -05:00
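Roughly (sketch of the macro shape):

#define qemu_log_mask(MASK, FMT, ...)                  \
    do {                                               \
        if (unlikely(qemu_loglevel_mask(MASK))) {      \
            qemu_log(FMT, ## __VA_ARGS__);             \
        }                                              \
    } while (0)
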
Alex Bennée bc5d7c5e1d
tcg: pass down TranslationBlock to tcg_code_gen
My later debugging patches need access to the origin PC which is held in
the TranslationBlock structure. Pass down the whole structure as it also
holds the information about the code start point.

Backports commit 5bd2ec3d7b47b2252745882795d79aef36380fb7 from qemu
2018-02-22 09:28:06 -05:00
Veronia Bahaa bafc81b1d3
util: move declarations out of qemu-common.h
Move declarations out of qemu-common.h for functions declared in
utils/ files: e.g. include/qemu/path.h for utils/path.c.
Move inline functions out of qemu-common.h and into new files (e.g.
include/qemu/bcd.h)

Backports commit f348b6d1a53e5271cf1c9f9acc4646b4b98c1771 from qemu
2018-02-22 09:25:48 -05:00
Marc-André Lureau fff79ed49b
utils: rename strtosz to use qemu prefix
Not only does it make sense, but it also gets rid of the checkpatch warning:
WARNING: consider using qemu_strtosz in preference to strtosz

Also get rid of tabs to please checkpatch.

Backports commit 4677bb40f809394bef5fa07329dea855c0371697 from qemu
2018-02-22 00:17:52 -05:00
Rutuja Shah d9fdc180d7
Replaced get_ticks_per_sec() by NANOSECONDS_PER_SECOND
This patch replaces get_ticks_per_sec() calls with the macro
NANOSECONDS_PER_SECOND. Also, as there are no callers, get_ticks_per_sec()
is then removed. This replacement improves the readability and
understandability of code.

For example,

timer_mod(fdctrl->result_timer,
qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) + (get_ticks_per_sec() / 50));

NANOSECONDS_PER_SECOND makes it obvious that qemu_clock_get_ns
matches the unit of the expression on the right side of the plus.

Backports commit 73bcb24d932912f8e75e1d88da0fc0ac6d4bce78 from qemu
2018-02-21 23:21:36 -05:00
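After the replacement, the call above reads:

timer_mod(fdctrl->result_timer,
          qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) + (NANOSECONDS_PER_SECOND / 50));
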
Paolo Bonzini c024ca9f49
hw: explicitly include qemu-common.h and cpu.h 2018-02-21 23:15:09 -05:00
Markus Armbruster 6730bd3131
Move QEMU_ALIGN_*() from qemu-common.h to qemu/osdep.h
qemu-common.h should only be included by .c files. Its file comment
explains why: "No header file should depend on qemu-common.h, as this
would easily lead to circular header dependencies."

One of the reasons for headers to include it is QEMU_ALIGN_UP() and
QEMU_ALIGN_DOWN(). Move them next to ROUND_UP() in qemu/osdep.h, to
facilitate removing these ill-advised includes later on.

Backports commit e07e540aaa08718c9ff8213067a3dcef31b3e313 from qemu
2018-02-21 23:12:24 -05:00
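For reference, the two macros boil down to simple integer arithmetic, roughly:

#define QEMU_ALIGN_DOWN(n, m)  ((n) / (m) * (m))
#define QEMU_ALIGN_UP(n, m)    QEMU_ALIGN_DOWN((n) + (m) - 1, (m))
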
Markus Armbruster 6b1ebd16e6
Move HOST_LONG_BITS from qemu-common.h to qemu/osdep.h
qemu-common.h should only be included by .c files. Its file comment
explains why: "No header file should depend on qemu-common.h, as this
would easily lead to circular header dependencies."

One of the reasons for headers to include it is HOST_LONG_BITS. Move
that to its more natural home qemu/osdep.h, to facilitate removing
these ill-advised includes later on.

This also lets us use HOST_LONG_BITS in bswap.h instead of duplicating
its definition there to avoid cyclic inclusion.

Backports commit a8139632161d7546218b696cada0a4f64cc78fb7 from qemu
2018-02-21 23:10:43 -05:00
Markus Armbruster 06668850e3
include/qemu/osdep.h: Don't include qapi/error.h
Commit 57cb38b included qapi/error.h into qemu/osdep.h to get the
Error typedef. Since then, we've moved to include qemu/osdep.h
everywhere. Its file comment explains: "To avoid getting into
possible circular include dependencies, this file should not include
any other QEMU headers, with the exceptions of config-host.h,
compiler.h, os-posix.h and os-win32.h, all of which are doing a
similar job to this file and are under similar constraints."
qapi/error.h doesn't do a similar job, and it doesn't adhere to
similar constraints: it includes qapi-types.h. That's in excess of
100KiB of crap most .c files don't actually need.

Add the typedef to qemu/typedefs.h, and include that instead of
qapi/error.h. Include qapi/error.h in .c files that need it and don't
get it now. Include qapi-types.h in qom/object.h for uint16List.

Update scripts/clean-includes accordingly. Update it further to match
reality: replace config.h by config-target.h, add sysemu/os-posix.h,
sysemu/os-win32.h. Update the list of includes in the qemu/osdep.h
comment quoted above similarly.

This reduces the number of objects depending on qapi/error.h from "all
of them" to less than a third. Unfortunately, the number depending on
qapi-types.h shrinks only a little. More work is needed for that one.

Backports commit da34e65cb4025728566d6504a99916f6e7e1dd6a from qemu
2018-02-21 23:08:18 -05:00
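The declaration that moves into qemu/typedefs.h is essentially just the forward declaration:

typedef struct Error Error;
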
Stefan Weil baa477d324
Remove unneeded include statements for setjmp.h
As soon as setjmp.h is included from qemu/osdep.h, those old include
statements are no longer needed.

Add also setjmp.h to the list in scripts/clean-includes.

Backports commit 8ff98f1ed2f50cd05c3c5027c7efdf69859ec664 from qemu
2018-02-21 22:57:32 -05:00
Stefan Weil 904b3c467e
Include setjmp.h in qemu/osdep.h (bug fix for w64)
setjmp must be declared before sysemu/os-win32.h
because it is redefined there for 64 bit Windows.

Backports commit e89fdafb58038038e3ccb860c5e1068ba063bac8 from qemu
2018-02-21 22:56:46 -05:00
Eric Blake 2f30651c40
qapi: Use anonymous bases in QMP flat unions
Now that the generator supports it, we might as well use an
anonymous base rather than breaking out a single-use Base
structure, for all three of our current QMP flat unions.

Oddly enough, this change does not affect the resulting
introspection output (because we already inline the members of
a base type into an object, and had no independent use of the
base type reachable from a command).

The case_whitelist now has to list the name of an implicit
type, which is not too bad (consider it a feature if it makes
it harder for developers to make the whitelist grow :)

Backports commit 3666a97f78704b941c360dc917acb14c8774eca7 from qemu
2018-02-21 22:55:12 -05:00
Eric Blake e16b731799
qapi: Allow anonymous base for flat union
Rather than requiring all flat unions to explicitly create
a separate base struct, we can allow the qapi schema to specify
the common members via an inline dictionary. This is similar to
how commands can specify an inline anonymous type for its 'data'.
We already have several struct types that only exist to serve as
a single flat union's base; the next commit will clean them up.
In particular, this patch's change to the BlockdevOptions example
in qapi-code-gen.txt will actually be done in the real QAPI schema.

Now that anonymous bases are legal, we need to rework the
flat-union-bad-base negative test (as previously written, it
forms what is now valid QAPI; tweak it to now provide coverage
of a new error message path), and add a positive test in
qapi-schema-test to use an anonymous base (making the integer
argument optional, for even more coverage).

Note that this patch only allows anonymous bases for flat unions;
simple unions are already enough syntactic sugar that we do not
want to burden them further. Meanwhile, while it would be easy
to also allow an anonymous base for structs, that would be quite
redundant, as the members can be put right into the struct
instead.

Backports commit ac4338f8eb783fd421aae492ca262a586918471e from qemu
2018-02-21 22:54:17 -05:00
Eric Blake 8f4a64398a
qapi: Don't special-case simple union wrappers
Simple unions were carrying a special case that hid their 'data'
QMP member from the resulting C struct, via the hack method
QAPISchemaObjectTypeVariant.simple_union_type(). But by using
the work we started by unboxing flat union and alternate
branches, coupled with the ability to visit the members of an
implicit type, we can now expose the simple union's implicit
type in qapi-types.h:

| struct q_obj_ImageInfoSpecificQCow2_wrapper {
| ImageInfoSpecificQCow2 *data;
| };
|
| struct q_obj_ImageInfoSpecificVmdk_wrapper {
| ImageInfoSpecificVmdk *data;
| };
...
| struct ImageInfoSpecific {
| ImageInfoSpecificKind type;
| union { /* union tag is @type */
| void *data;
|- ImageInfoSpecificQCow2 *qcow2;
|- ImageInfoSpecificVmdk *vmdk;
|+ q_obj_ImageInfoSpecificQCow2_wrapper qcow2;
|+ q_obj_ImageInfoSpecificVmdk_wrapper vmdk;
| } u;
| };

Doing this removes asymmetry between QAPI's QMP side and its
C side (both sides now expose 'data'), and means that the
treatment of a simple union as sugar for a flat union is now
equivalent in both languages (previously the two approaches used
a different layer of dereferencing, where the simple union could
be converted to a flat union with equivalent C layout but
different {} on the wire, or to an equivalent QMP wire form
but with different C representation). Using the implicit type
also lets us get rid of the simple_union_type() hack.

Of course, now all clients of simple unions have to adjust from
using su->u.member to using su->u.member.data; while this touches
a number of files in the tree, some earlier cleanup patches
helped minimize the change to the initialization of a temporary
variable rather than every single member access. The generated
qapi-visit.c code is also affected by the layout change:

|@@ -7393,10 +7393,10 @@ void visit_type_ImageInfoSpecific_member
| }
| switch (obj->type) {
| case IMAGE_INFO_SPECIFIC_KIND_QCOW2:
|- visit_type_ImageInfoSpecificQCow2(v, "data", &obj->u.qcow2, &err);
|+ visit_type_q_obj_ImageInfoSpecificQCow2_wrapper_members(v, &obj->u.qcow2, &err);
| break;
| case IMAGE_INFO_SPECIFIC_KIND_VMDK:
|- visit_type_ImageInfoSpecificVmdk(v, "data", &obj->u.vmdk, &err);
|+ visit_type_q_obj_ImageInfoSpecificVmdk_wrapper_members(v, &obj->u.vmdk, &err);
| break;
| default:
| abort();

Backports commit 32bafa8fdd098d52fbf1102d5a5e48d29398c0aa from qemu
2018-02-21 22:51:33 -05:00
Eric Blake 534e37585d
qapi: Drop unused c_null()
Now that we are always bulk-initializing a QAPI C struct to 0
(whether by g_malloc0() or by 'Type arg = {0};'), we no longer
have any clients of c_null() in the generator for per-element
initialization. This patch is easy enough to revert if we find
a use in the future, but in the present, get rid of the dead code.

Backports commit 861877a0dd0a8e1bdbcc9743530f4dc9745a736a from qemu
2018-02-21 22:49:33 -05:00
Eric Blake cd19e75fc2
qapi: Inline gen_visit_members() into lone caller
Commit 82ca8e46 noticed that we had multiple implementations of
visiting every member of a struct, and consolidated it into
gen_visit_fields() (now gen_visit_members()) with enough
parameters to cater to slight differences between the clients.
But recent exposure of implicit types has meant that we are now
down to a single use of that method, so we can clean up the
unused conditionals and just inline it into the remaining
caller: gen_visit_object_members().

Likewise, gen_err_check() no longer needs optional parameters,
as the lone use of non-defaults was via gen_visit_members().

No change to generated code.

Backports commit 12f254fd5f98717d17f079c73500123303b232da from qemu
2018-02-21 22:47:50 -05:00
Eric Blake a86b89f166
qapi-event: Utilize implicit struct visits
Rather than generate inline per-member visits, take advantage
of the 'visit_type_FOO_members()' function for emitting events.
This is possible now that implicit structs can be visited like
any other. Generated code shrinks accordingly; by initializing
a struct based on parameters, through a new gen_param_var()
helper, like:

|@@ -338,6 +250,9 @@ void qapi_event_send_block_job_error(con
| QMPEventFuncEmit emit = qmp_event_get_func_emit();
| QmpOutputVisitor *qov;
| Visitor *v;
|+ q_obj_BLOCK_JOB_ERROR_arg param = {
|+ (char *)device, operation, action
|+ };
|
| if (!emit) {
| return;
@@ -351,19 +266,7 @@ void qapi_event_send_block_job_error(con
| if (err) {
| goto out;
| }
|- visit_type_str(v, "device", (char **)&device, &err);
|- if (err) {
|- goto out_obj;
|- }
|- visit_type_IoOperationType(v, "operation", &operation, &err);
|- if (err) {
|- goto out_obj;
|- }
|- visit_type_BlockErrorAction(v, "action", &action, &err);
|- if (err) {
|- goto out_obj;
|- }
|-out_obj:
|+ visit_type_q_obj_BLOCK_JOB_ERROR_arg_members(v, &param, &err);
| visit_end_struct(v, err ? NULL : &err);

Notice that the initialization of 'param' has to cast away const
(just as the old gen_visit_members() had to do): we can't change
the signature of the user function (which uses 'const char *'), but
have to assign it to a non-const QAPI object (which requires
'char *').

While touching this, document with a FIXME comment that there is
still a potential collision between QMP members and our choice of
local variable names within qapi_event_send_FOO().

This patch also paves the way for some followup simplifications
in the generator, in subsequent patches.

Backports commit 0949e95b48e30715e157cabbc59dcb0ed912d3ff from qemu
2018-02-21 22:45:28 -05:00
Eric Blake eb4b02705a
qapi-event: Drop qmp_output_get_qobject() null check
qmp_output_get_qobject() was changed never to return null some time
ago (in commit 6c2f9a15), but the qapi_event_send_FOO() functions
still check. Clean that up:

|@@ -28,7 +28,6 @@ void qapi_event_send_acpi_device_ost(ACP
| QMPEventFuncEmit emit;
| QmpOutputVisitor *qov;
| Visitor *v;
|- QObject *obj;
|
| emit = qmp_event_get_func_emit();
| if (!emit) {
|@@ -54,10 +53,7 @@ out_obj:
| goto out;
| }
|
|- obj = qmp_output_get_qobject(qov);
|- g_assert(obj);
|-
|- qdict_put_obj(qmp, "data", obj);
|+ qdict_put_obj(qmp, "data", qmp_output_get_qobject(qov));
| emit(QAPI_EVENT_ACPI_DEVICE_OST, qmp, &err);
|
| out:

Backports commit 8df59565d2c27dec8c96a2090f0eb73303efce14 from qemu
2018-02-21 22:43:00 -05:00
Eric Blake 9aa8356bce
qapi: Adjust names of implicit types
The original choice of ':obj-' as the prefix for implicit types
made it obvious that we weren't going to clash with any user-defined
names, which cannot contain ':'. But now we want to create structs
for implicit types, to get rid of special cases in the generators,
and our use of ':' in implicit names needs a tweak to produce valid
C code.

We could transliterate ':' to '_', except that C99 mandates that
"identifiers that begin with an underscore are always reserved for
use as identifiers with file scope in both the ordinary and tag name
spaces". So it's time to change our naming convention: we can
instead use the 'q_' prefix that we reserved for ourselves back in
commit 9fb081e0. Technically, since we aren't planning on exposing
the empty type in generated code, we could keep the name ':empty',
but renaming it to 'q_empty' makes the check for startswith('q_')
cover all implicit types, whether or not code is generated for them.

As long as we don't declare 'empty' or 'obj' ticklish, it shouldn't
clash with c_name() prepending 'q_' to the user's ticklish names.

Backports commit 7599697c66d22ff4c859ba6ccea30e6a9aae6b9b from qemu
2018-02-21 22:41:38 -05:00
Eric Blake d777876e6b
qapi: Emit implicit structs in generated C
We already have several places that want to visit all the members
of an implicit object within a larger context (simple union variant,
event with anonymous data, command with anonymous arguments struct);
and will be adding another one soon (the ability to declare an
anonymous base for a flat union). Having a C struct declared for
these implicit types, along with a visit_type_FOO_members() helper
function, will make for fewer special cases in our generator.

We do not, however, need qapi_free_FOO() or visit_type_FOO()
functions for implicit types, because they should not be used
directly outside of the generated code. This is done by adding a
conditional in visit_object_type() for both qapi-types.py and
qapi-visit.py based on the object name. The comparison of
"name.startswith('q_')" is a bit hacky (it's basically duplicating
what .is_implicit() already uses), but beats changing the signature
of the visit_object_type() callback to pass a new 'implicit' flag.
The hack should be temporary: we are considering adding a future
patch that consolidates the narrow visit_object_type(..., base,
local_members, variants) and visit_object_type_flat(...,
all_members, variants) [where different sets of information are
already broken out, and the QAPISchemaObjectType is no longer
available] into a broader visit_object_type(obj_type) [where the
visitor can query the needed fields from obj_type directly].

Also, now that we WANT to output C code for implicits, we no longer
need the visit_needed() filter, leaving 'q_empty' as the only object
still needing a special case. Remember, 'q_empty' is the only
built-in generated object, which means that without a special case
it would be emitted in multiple files (the main qapi-types.h and in
qga-qapi-types.h) causing compilation failure due to redefinition.
But since it has no members, it's easier to just avoid an attempt to
visit that particular type; since gen_object() is called recursively,
we also prime the objects_seen set to cover any recursion into the
empty type.

The patch relies on the changed naming of implicit types in the
previous patch. It is a bit unfortunate that the generated struct
names and visit_type_FOO_members() don't match normal naming
conventions, but it's not too bad, since they will only be used in
generated code.

The generated code grows substantially in size: the implicit
'-wrapper' types must be emitted in qapi-types.h before any union
can include an unboxed member of that type. Arguably, the '-args'
types could be emitted in a private header for just qapi-visit.c
and qmp-marshal.c, rather than polluting qapi-types.h; but adding
complexity to the generator to split the output location according
to role doesn't seem worth the maintenance costs.

Backports commit 7ce106a96feee4d46bfcdb47127b0935804c9357 from qemu
2018-02-21 22:31:15 -05:00
Eric Blake f1a8fcd7a7
qapi: Drop useless 'data' member of unions
We started moving away from the use of the 'void *data' member
in the C union corresponding to a QAPI union back in commit
544a373; recent commits have gotten rid of other uses. Now
that it is completely unused, we can remove the member itself
as well as the FIXME comment. Update the testsuite to drop the
negative test union-clash-data.

Backports commit 48eb62a74fc2d6b0ae9e5f414304a85cfbf33066 from qemu
2018-02-21 22:27:26 -05:00
Lioncash 8728fea067
qapi-visit: Expose visit_type_FOO_members()
Dan Berrange reported a case where he needs to work with a
QCryptoBlockOptions union type using the OptsVisitor, but only
visit one of the branches of that type (the discriminator is not
visited directly, but learned externally). When things were
boxed, it was easy: just visit the variant directly, which took
care of both allocating the variant and visiting its members, then
store that pointer in the union type. But now that things are
unboxed, we need a way to visit the members without allocation,
done by exposing visit_type_FOO_members() to the user.

Before the patch, we had quite a bit of code associated with
object_members_seen to make sure that a declaration of the helper
was in scope before any use of the function. But now that the
helper is public and declared in the header, the .c file no
longer needs to worry about topological sorting (the helper is
always in scope), which leads to some nice cleanups.

Backports commit 4d91e9115cc6700113e772b19d1f39bbcf345977 from qemu
2018-02-21 22:26:38 -05:00
Eric Blake d28c6244c0
qapi: Rename 'fields' to 'members' in generated C code
C types and JSON objects don't have fields, but members. We
shouldn't gratuitously invent terminology. This patch is a
strict renaming of static generated functions, plus the naming
of the dummy filler member for empty structs, before the next
patch exposes some of that naming to the rest of the code base.

Backports commit c81200b01422783cd29796ef4ccc275d05f9ce67 from qemu
2018-02-21 22:23:07 -05:00
Eric Blake 825fb4835b
qapi: Rename 'fields' to 'members' in generator
C types and JSON objects don't have fields, but members. We
shouldn't gratuitously invent terminology. This patch is a
strict renaming of generator code internals (including testsuite
comments), before later patches rename C interfaces.

No change to generated code with this patch.

Backports commit 14f00c6c492488381a513c3816b15794446231a0 from qemu
2018-02-21 22:20:02 -05:00
Eric Blake b239241e99
qapi: Make c_type() more OO-like
QAPISchemaType.c_type() is a bit awkward: it takes two optional
boolean flags is_param and is_unboxed, and they should never both
be True.

Add a new method for each of the flags, and drop the flags from
c_type().

Most callers pass no flags; they remain unchanged.

One caller passes is_param=True; call the new .c_param_type()
instead.

One caller passes is_unboxed=True, except for simple union types.
This is actually an ugly special case that will go away soon, so
until then, we now have to call either .c_type() or the new
.c_unboxed_type(). Tolerable in the interim.

It requires slightly more Python, but is arguably easier to read.

Backports commit 4040d995e49c5b818be79e50a18c1bf8d2354d12 from qemu
2018-02-21 22:01:09 -05:00
Eric Blake a7713451d9
qapi: Assert in places where variants are not handled
We are getting closer to the point where we could use one union
as the base or variant type within another union type (as long
as there are no collisions between any possible combination of
member names allowed across all discriminator choices). But
until we get to that point, it is worth asserting that variants
are not present in places where we are not prepared to handle
them: when exploding a type into a parameter list, we do not
expect variants. The qapi.py code is already checking this,
via the older check_type() method; but someday we hope to get
rid of that and move checking into QAPISchema*.check(). The
two asserts added here make sure any refactoring still catches
problems, and makes it locally obvious why we can iterate over
only type.members without worrying about type.variants.

Backports commit 29f6bd15eb8a55ed37b2a443f7275b3d134eb2b2 from qemu
2018-02-21 21:58:29 -05:00
Max Reitz 1cfdf802a9
qapi: Drop QERR_UNKNOWN_BLOCK_FORMAT_FEATURE
Just specifying a custom string is simpler in basically all places that
used it, and in addition, specifying the BB or node name is something we
generally do not do in other error messages when opening a BDS, so we
should not do it here.

This changes the output for iotest 036 (for the better, in my opinion),
so the reference output needs to be changed accordingly.

Backports commit a55448b3681a880b77eaefe8b2c42912000cb481 from qemu
2018-02-21 21:55:15 -05:00
Sergey Sorokin da6a9f331b
target-arm: Fix translation level on early translation faults
QEMU reports a translation fault on the 1st level instead of the 0th level in
case of AArch64 address translation if the translation table walk is disabled or
the address is in the gap between the two regions.

Backports commit 1b4093ea6678ff79d3006db3d3abbf6990b4a59b from qemu
2018-02-21 21:53:15 -05:00
Peter Maydell 8309945dcc
target-arm: Implement MRS (banked) and MSR (banked) instructions
Starting with the ARMv7 Virtualization Extensions, the A32 and T32
instruction sets provide instructions "MSR (banked)" and "MRS
(banked)" which can be used to access registers for a mode other
than the current one:
* R<m>_<mode>
* ELR_hyp
* SPSR_<mode>

Implement the missing instructions.

Backports commit 8bfd0550be821cf27d71444e2af350de3c3d2ee3 from qemu
2018-02-21 21:50:42 -05:00
Daniel P. Berrange d561d28827
error: ensure errno detail is printed with error_abort
When &error_abort is passed in, the error reporting code
will print the current error message and then abort() the
process. Unfortunately at the time it aborts, we've not
yet appended the errno detail. This makes debugging certain
problems significantly harder as the log is incomplete.

Backports commit 20e2dec14954568848ad74e73aee9b3aeedd6584 from qemu
2018-02-21 21:40:24 -05:00
Paolo Bonzini 9cf056404a
exec: fix early return from ram_block_add
After reporting an error, ram_block_add was going on with the registration
of the RAMBlock. The visible effect is that it unlocked the ramlist
mutex twice.

Backports commit 39c350ee12e733070e63d64a21bd42607366ea99 from qemu
2018-02-21 21:37:58 -05:00
Richard Henderson 7775b05fb8
target-i386: Dump unknown opcodes with -d unimp
We discriminate here between opcodes that are illegal in the current
cpu mode or with illegal arguments (such as modrm.mod == 3) and
encodings that are unknown (such as an unimplemented isa extension).

Backports commit b9f9c5b41aab06479cb1695990b7cca98ef84fc7 from qemu
2018-02-21 21:37:16 -05:00
Richard Henderson 1c096b8fa2
target-i386: Fix inhibit irq mask handling
The patch in 7f0b714 was too simplistic, in that we wound up setting
the flag and then resetting it immediately in gen_eob.

Fixes the reported boot problem with Windows XP.

Backports commit f083d92c03e7a0741d2a9eba774a60d5a3ca772f from qemu
2018-02-21 21:24:50 -05:00
Richard Henderson c7d5d85979
target-i386: Use gen_nop_modrm for prefetch instructions
Backports commit 26317698ef3be5942c5ee5630997dbc98431c5f6 from qemu
2018-02-21 21:22:52 -05:00
Paolo Bonzini 55c2a21fe8
target-i386: Fix addr16 prefix
While ADDSEG will only be false in 16-bit mode for LEA, it can be
false even in other cases when 16-bit addresses are obtained via
the 67h prefix in 32-bit mode. In this case, gen_lea_v_seg forgets
to add a nonzero FS or GS base if CS/DS/ES/SS are all zero. This
case is pretty rare but happens when booting Windows 95/98, and
this patch fixes it.

The bug is visible since commit d6a291498, but it was introduced
together with gen_lea_v_seg and it probably could be reproduced
with a "addr16 gs movsb" instruction as early as in commit
ca2f29f555805d07fb0b9ebfbbfc4e3656530977.

Backports commit e2e02a820741ec4d96b8f313b06a2a7ed5e94fbd from qemu
2018-02-21 21:21:26 -05:00
Richard Henderson 085a3c9aab
target-i386: Fix SMSW for 64-bit mode
In non-64-bit modes, the instruction always stores 16 bits.
But in 64-bit mode, when the destination is a register, the
instruction can write 32 or 64 bits.

Backports commit a657f79e32422634415c09f3f15c73d610297af5 from qemu
2018-02-21 21:19:33 -05:00
Paolo Bonzini a233d7b13e
target-i386: Fix SMSW and LMSW from/to register
SMSW and LMSW accept register operands, but commit 1906b2a ("target-i386:
Rearrange processing of 0F 01", 2016-02-13) did not account for that.

Backports commit 880f8486503b32a29b653a3c0b3cfc5432012f38 from qemu
2018-02-21 21:17:45 -05:00
Paolo Bonzini bdf1189046
target-i386: Avoid repeated calls to the bnd_jmp helper
Two flags were tested the wrong way.

Backports commit 8b33e82b863d1c6fce7e69a41f6c96a8e15b73fb from qemu
2018-02-21 21:13:37 -05:00
Fam Zheng f7bff04b7b
exec: Introduce AddressSpaceDispatch.mru_section
Under heavy workloads the lookup will likely end up with the same
MemoryRegionSection from last time. Using a pointer to cache the result,
like ram_list.mru_block, significantly reduces cost of
address_space_translate.

During address space topology update, as->dispatch will be reallocated
so the pointer is invalidated automatically.

Perf reports a visible drop on the cpu usage, because phys_page_find is
not called. Before:

2.35% qemu-system-x86_64 [.] phys_page_find
0.97% qemu-system-x86_64 [.] address_space_translate_internal
0.95% qemu-system-x86_64 [.] address_space_translate
0.55% qemu-system-x86_64 [.] address_space_lookup_region

After:

0.97% qemu-system-x86_64 [.] address_space_translate_internal
0.97% qemu-system-x86_64 [.] address_space_lookup_region
0.84% qemu-system-x86_64 [.] address_space_translate

Backports commit 729633c2bc30496073431584eb6e304776b4ebd4 from qemu
2018-02-21 21:10:16 -05:00
Fam Zheng f642465dc4
exec: Factor out section_covers_addr
This will be shared by the next patch.

Also add a comment explaining the unobvious condition on "size.hi".

Backports commit 29cb533d8cbff1330717619780c2f1dfe764e003 from qemu
2018-02-21 21:08:08 -05:00
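A sketch of the factored-out helper (close to, but not necessarily identical with, the backported code):

static inline bool section_covers_addr(const MemoryRegionSection *section,
                                       hwaddr addr)
{
    /* Memory topology clips a section to [0, 2^64); a size with a
     * non-zero high word therefore means "covers the whole space". */
    return int128_gethi(section->size) ||
           range_covers_byte(section->offset_within_address_space,
                             int128_getlo(section->size), addr);
}
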
Daniel P. Berrange eddfb13c2c
qom: Change object property iterator API contract
Currently the ObjectProperty iterator API works as follows:

ObjectPropertyIterator *iter;

iter = object_property_iter_init(obj);
while ((prop = object_property_iter_next(iter))) {
...
}
object_property_iter_free(iter);

This has the benefit that the ObjectPropertyIterator struct
can be opaque, but has the downside that callers need to
explicitly call a free function. It is also not in keeping
with the iterator style used elsewhere in QEMU/GLib2.

This patch changes the API to use stack allocation instead:

ObjectPropertyIterator iter;

object_property_iter_init(&iter, obj);
while ((prop = object_property_iter_next(&iter))) {
...
}

Backports commit 7746abd8e9ee9db20c0b0fdb19504f163ba3cbea from qemu
2018-02-21 21:03:58 -05:00
Daniel P. Berrange b97ab59f08
qom: Allow properties to be registered against classes
When there are many instances of a given class, registering
properties against the instance is wasteful of resources. The
majority of objects have a statically defined list of possible
properties, so most of the properties are easily registerable
against the class. Only those properties which are conditionally
registered at runtime need be recorded against the klass.

Registering properties against classes also makes it possible
to provide static introspection of QOM - currently introspection
is only possible after creating an instance of a class, which
severely limits its usefulness.

This impl only supports simple scalar properties. It does not
attempt to allow child object / link object properties against
the class. There are ways to support those too, but it would
make this patch more complicated, so it is left as an exercise
for the future.

There is no equivalent to object_property_del() provided, since
classes must be immutable once they are defined.

Backports commit 16bf7f522a2ff68993f80631ed86254c71eaf5d4 from qemu
2018-02-21 21:00:56 -05:00
Pavel Fedin 825bc2fb04
qom: Replace object property list with GHashTable
ARM GICv3 systems with a large number of CPUs create lots of IRQ pins. Since
every pin is represented as a property, the number of these properties becomes
very large. Every property add first makes sure there are no duplicates.
Traversing the list becomes very slow, so QEMU initialization takes
significant time (several seconds for e.g. 16 CPUs).

This patch replaces the list with a GHashTable, making lookup very fast. The only
drawback is that object_child_foreach() and object_child_foreach_recursive()
cannot add or remove properties during traversal, since GHashTableIter does
not have a modify-safe version. However, the code seems not to modify objects
via these functions.

Backports commit b604a854e843505007c59d68112c654556102a20 from qemu
2018-02-21 13:35:10 -05:00
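Illustrative effect on the lookup path (signature simplified for the example; the real function also reports errors):

ObjectProperty *object_property_find(Object *obj, const char *name)
{
    /* obj->properties is now a GHashTable keyed by property name, so
     * duplicate checks and lookups are O(1) instead of O(n) */
    return g_hash_table_lookup(obj->properties, name);
}
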
Lioncash 4b79ff71b4
glib_compat: backport hashtable iterator interfaces 2018-02-21 13:18:44 -05:00
Markus Armbruster 9bc51db79d
memory: Fix bad error handling in memory_region_init_ram_ptr()
Commit ef701d7 screwed up handling of out-of-memory conditions.
Before the commit, we report the error and exit(1), in one place.  The
commit lifts the error handling up the call chain some, to three
places.  Fine.  Except it uses &error_abort in these places, changing
the behavior from exit(1) to abort(), and thus undoing the work of
commit 3922825 "exec: Don't abort when we can't allocate guest
memory".

The previous two commits fixed one of the three places, another one
was fixed in commit 33e0eb5.  This commit fixes the third one.

Backports commit 0bdaa3a429c6d07cd437b442a1f15f70be1addaa from qemu
2018-02-21 11:24:38 -05:00
Pavel Fedin 0201c71145
Merge memory_region_init_reservation() into memory_region_init_io()
Just specifying ops = NULL in some cases can be more convenient than having
two functions.

Backports commit 6d6d2abf2c2e52c0f404d0a31a963e945b0cc7ad from qemu
2018-02-21 11:23:00 -05:00
Paolo Bonzini 7a1ce36785
memory: fix refcount leak in memory_region_present
memory_region_present() leaks a reference to a MemoryRegion in the
case "mr == container".  While fixing it, avoid reference counting
altogether for memory_region_present(), by using RCU only.

The return value could in principle be already invalid immediately
after memory_region_present returns, but presumably the caller knows
that and it's using memory_region_present to probe for devices that
are unpluggable, or something like that.  The RCU critical section
is needed anyway, because it protects as->current_map.

Backports commit c6742b14fe7352059cd4954a356a8105757af31b from qemu
2018-02-21 11:17:19 -05:00
Paolo Bonzini f9315cde1c
memory: do not add a reference to the owner of aliased regions
Very often the owner of the aliased region is the same as the owner of the alias
region itself.  When this happens, the reference count can never go back to 0 and
the owner is leaked.  This is for example breaking hot-unplug of virtio-pci
devices (the device cannot be plugged back again with the same id).

Another common use for alias is to transform the system I/O address space
into an MMIO regions; in this case the aliased region never dies, so there
is no problem.  Otherwise the owner is always the same for aliasing
and aliased region.

I checked all calls to memory_region_init_alias introduced after commit
dfde4e6 (memory: add ref/unref calls, 2013-05-06) and they do not need the
reference in order to keep the owner of the aliased region alive.

Backports commit 52c91dac6bd891656f297dab76da51fc8bc61309 from qemu
2018-02-21 11:10:49 -05:00
Lioncash 10bf76861b
exec: Remove unnecessary return in qemu_ram_remap 2018-02-21 09:51:23 -05:00
Fam Zheng fa7d3e6cdb
memory: Drop MemoryRegion.ram_addr
All references to mr->ram_addr are replaced by
memory_region_get_ram_addr(mr) (except for a few assertions that are
replaced with mr->ram_block).

Backports commit 8e41fb63c5bf29ecabe0cee1239bf6230f19978a from qemu
2018-02-21 08:53:08 -05:00
Fam Zheng 2c1a72635d
memory: Implement memory_region_get_ram_addr with mr->ram_block
Backports commit 7ebb2745acbb8d910eab07dc5f0aa01a4457703c from qemu
2018-02-21 08:53:08 -05:00
Fam Zheng 4b8c428494
memory: Move assignment to ram_block to memory_region_init_*
We don't force "const" qualifiers with pointers in QEMU, but it's still
good to keep a clean function interface. Assigning to mr->ram_block is
in this sense ugly - one initializer mutating its owning object's state.

Move it to memory_region_init_*, where mr->ram_addr is assigned.

Backports commit 0a75601853c00f3729fa62c49ec0d4bb1e3d9bc1 from qemu
2018-02-21 08:53:08 -05:00
Gonglei aa80edbef0
exec: Return RAMBlock pointer from allocating functions
Previously we returned RAMBlock.offset; now we return a pointer to the
whole structure.

ram_block_add now returns void; errors are passed entirely through errp.

Backports commit 528f46af6ecd1e300db18684969104d4067b867b from qemu
2018-02-21 08:52:57 -05:00
Ralf-Philipp Weinmann 893b9f7f96
target-arm: Only trap SRS from S-EL1 if specified mode is MON
Commit cbc0326b6fb9 caused SRS instructions executed from Secure
EL1 to trap to EL3 even if the specified mode was not monitor mode.

According to the ARMv8 Architecture reference manual [F6.1.203], ALL
of the following conditions need to be met for SRS to trap to EL3:
* It is executed at Secure PL1.
* The specified mode is monitor mode.
* EL3 is using AArch64.

Correct the condition governing the trap to EL3 to check the
specified mode.

Backports commit ba63cf47a93041137a94e86b7d0cd87fc896949b from qemu
2018-02-21 02:49:28 -05:00
Paolo Bonzini 7f23f7004d
target-arm: implement BE32 mode in system emulation
System emulation only has a little-endian target; BE32 mode
is implemented by adjusting the low bits of the address
for every byte and halfword load and store. 64-bit accesses
flip the low and high words.

Backports commit e334bd3190f6c4ca12f1d40d316dc471c70009ab from qemu
2018-02-21 02:47:22 -05:00
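The address adjustment amounts to XOR-ing the low address bits (illustrative helper; the name and exact placement are not taken from the backported patch):

static inline target_ulong be32_adjust_addr(target_ulong addr, int size)
{
    if (size == 1) {
        return addr ^ 3;   /* byte access: flip the low two address bits */
    }
    if (size == 2) {
        return addr ^ 2;   /* halfword access: flip bit 1 */
    }
    return addr;           /* word access unchanged; 64-bit swaps the halves */
}
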
Paolo Bonzini aa5be4d6ca
target-arm: implement setend
Since this is not a high-performance path, just use a helper to
flip the E bit and force a lookup in the hash table since the
flags have changed.

Backports commit 9886ecdf31165de2d4b8bccc1a220bd6ac8bc192 from qemu
2018-02-21 02:39:13 -05:00
Peter Crosthwaite 902170741a
target-arm: introduce tbflag for endianness
Introduce a tbflag for endianness, set based upon the CPU's current
endianness. This in turn propagates through to the disas endianness
flag.

Backports commit 91cca2cda9823b1e7a049cb308a05104b5076cba from qemu
2018-02-21 02:35:34 -05:00
Peter Crosthwaite 50a3c7f2ee
target-arm: a64: Add endianness support
Set the dc->mo_endianness flag for AA64 and use it in all ldst ops.

Backports commit aa6489da4e297fb3ffcbc09b50afd700395b6386 from qemu
2018-02-21 02:31:50 -05:00
Paolo Bonzini 9ab3d105fd
target-arm: introduce disas flag for endianness
Introduce a disas flag for setting the CPU data endianness. This allows
control of the endianness from the CPU state rather than hard-coding it
to TARGET_WORDS_BIGENDIAN.

Backports commit dacf0a2ff7d39ab12bd90f2f5496a3889facd54a from qemu
2018-02-21 02:20:50 -05:00
Peter Crosthwaite e5cfcc3221
target-arm: implement SCTLR.EE
Implement the SCTLR.EE bit, which controls data endianness for exceptions
and page table translations. SCTLR.EE is mirrored to the CPSR.E bit
on exception entry.

Backports commit 73462dddf670c32c45c8ea359658092b0365b2d4 from qemu
2018-02-21 02:14:56 -05:00
Peter Crosthwaite 38f4a833a4
arm: cpu: handle BE32 user-mode as BE
In system mode, BE32 is implemented as the opposite endian with address
manipulations on subword accesses (to give the illusion of BE). But
user-mode cannot tell the difference and is already implemented as
straight BE. So handle the difference in the endianness query, where
USER mode is BE and system is not.

Backports commit b2e62d9a7b9a2eb10e451a57813bad168376e122 from qemu
2018-02-21 02:12:39 -05:00
Peter Crosthwaite 1457b73a13
target-arm: cpu: Move cpu_is_big_endian to header
There is a CPU data endianness test that is used to drive the
virtio_big_endian test.

Move this up to the header so it can be more generally used for endian
tests. The KVM specific cpu_syncronize_state call is left behind in the
virtio specific function.

Rename it to arm_cpu_data_is_big_endian() to more accurately capture that
this is for data accesses only.

Backports commit ed50ff7875d61a75517c92deb0444d73fbbca878 from qemu
2018-02-21 02:10:48 -05:00
Paolo Bonzini ec15ee10d0
target-arm: implement SCTLR.B, drop bswap_code
bswap_code is a CPU property of sorts ("is the iside endianness the
opposite way round to TARGET_WORDS_BIGENDIAN?") but it is not the
actual CPU state involved here which is SCTLR.B (set for BE32
binaries, clear for BE8).

Replace bswap_code with SCTLR.B, and pass that to arm_ld*_code.
The next patches will make data fetches honor both SCTLR.B and
CPSR.E appropriately.

Backports commit f9fd40ebe4f55e0048e002925b8d65e66d56e7a7 from qemu
2018-02-21 02:08:05 -05:00
Peter Maydell 64a9bec68a
target-arm: Correct handling of writes to CPSR mode bits from gdb in usermode
In helper.c the expression
(env->uncached_cpsr & CPSR_M) != CPSR_USER
is always true; the right hand side was supposed to be ARM_CPU_MODE_USR
(an error in commit cb01d391).

Since the incorrect expression was always true, this just meant that
commit cb01d391 had no effect.

However simply changing the RHS here would reveal a logic error: if
the mode is USR we wish to completely ignore the attempt to set the
mode bits, which means that we must clear the CPSR_M bits from mask
to avoid the uncached_cpsr bits being updated at the end of the
function.

Move the condition into the correct place in the code, fix its RHS
constant, and add a comment about the fact that we must be doing a
gdbstub write if we're in user mode.

Backports commit 8c4f0eb94cc65ee32a12feba88d0b32e3665d5ea from qemu
2018-02-21 01:57:34 -05:00
Lluís Vilanova d111e2df2d
typedefs: Add CPUState
Backports commit b23197f9cf2f221a6cc6272d36852f4f70cf9c1b from qemu
2018-02-21 01:55:22 -05:00
Lioncash 1c04024688
tcg: Make cpu_regs_sparc a TCGv array 2018-02-21 01:50:28 -05:00
Lioncash db114261e3
target-sparc: Clean up casts and unnecessary alloc/dealloc 2018-02-21 01:44:51 -05:00
Lioncash c0210ac8a6
tcg: Make cpu_wim a TCGv 2018-02-21 01:41:53 -05:00
Lioncash 58c5a28893
tcg: Make cpu_ver a TCGv 2018-02-21 01:40:30 -05:00
Lioncash 2beea0db0d
tcg: Make cpu_ssr a TCGv 2018-02-21 01:39:15 -05:00
Lioncash b09a8626f0
tcg: Make cpu_hver a TCGv 2018-02-21 01:38:07 -05:00
Lioncash e161e9dcb4
tcg: Make cpu_htba a TCGv 2018-02-21 01:35:40 -05:00
Lioncash 577386b246
tcg: Make cpu_hintp a TCGv 2018-02-21 01:34:13 -05:00
Lioncash 2df9744bdb
tcg: Make cpu_stick_cmpr and cpu_hstick_cmpr TCGv 2018-02-21 01:32:59 -05:00
Lioncash 2d9d8c5e01
tcg: Make cpu_tick_cmpr a TCGv 2018-02-21 01:30:00 -05:00
Lioncash e5401deb09
tcg: Make cpu_npc a TCGv 2018-02-21 01:25:40 -05:00
Lioncash 6ccd4479d7
tcg: Make sparc_cpu_pc a TCGv 2018-02-21 01:23:58 -05:00
Lioncash e5a776b495
tcg: Make cpu_fsr a TCGv 2018-02-21 01:22:16 -05:00
Lioncash b51f920404
tcg: Make cpu_gsr a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete. Also fixes a leak with sparc
2018-02-21 01:17:01 -05:00
Lioncash 4da2fd6407
tcg: Make cpu_cond a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 01:12:15 -05:00
Lioncash 2f785b11d2
tcg: Make cpu_tbr a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 01:12:11 -05:00
Lioncash bbc8517cd2
tcg: Make cpu_y a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 01:06:36 -05:00
Lioncash a913b3e468
tcg: Make cpu_gpr a TCGv array
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 01:02:46 -05:00
Lioncash 1defc70341
tcg: Make cpu_PC a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 00:47:13 -05:00
Lioncash 372e3307c5
tcg: Make bcond, btarget and cpu_dspctrl TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 00:45:59 -05:00
Lioncash baf25644dd
tcg: Make cpu_HI and cpu_LO a TCGv array
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 00:34:49 -05:00
Lioncash 50b871f523
tcg: Make store_dummy a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 00:24:40 -05:00
Lioncash 53f66f4762
tcg: Make QREG member variables TCGv instances
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete
2018-02-21 00:23:22 -05:00
Lioncash 04b743a26c
tcg: Make cpu_dreg and cpu_areg TCGv arrays
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4
allows making the type concrete.
2018-02-21 00:23:17 -05:00
Lioncash 6b19f43925
tcg: Make cpu_tmp1 and cpu_tmp4 a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4 allows
making the type concrete.
2018-02-21 00:07:23 -05:00
Lioncash 7caca36070
tcg: Make cpu_cc_dst, cpu_cc_src, cpu_cc_src2, and cpu_cc_srcT a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4 allows us to make the types concrete
2018-02-21 00:00:08 -05:00
Lioncash 4062dcc9bc
tcg: Make cpu_T0 and cpu_T1 TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4 allows us
to make the type concrete
2018-02-20 23:51:44 -05:00
Lioncash 72170ae5c0
tcg: Make cpu_A0 a TCGv
Commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4 allows us to make the type concrete.
2018-02-20 23:43:58 -05:00
Lioncash ccbf1ed6ed
tcg: Make cpu_regs a TCGv array
Commit eae07f4767 allows us
to make the type concrete as opposed to using void* and malloc
2018-02-20 23:41:21 -05:00
Lioncash 02b2d3c873
tcg: Make cpu_seg_base a TCGv array
Commit eae07f4767 allows us to
use the type directly instead of casting to void and using malloc
(yay).
2018-02-20 23:34:38 -05:00
Lluís Vilanova eae07f4767
tcg: Move definition of type TCGv
The target-dependant type TCGv must be defined in "tcg/tcg.h" before
including the tracing helper wrappers in "tcg/tcg-op.h".

It also makes more sense to define it here, where other TCG types are
defined too.

Backports commit 5d4e1a1081d3f1ec2908ff0eaebe312389971ab4 from qemu
2018-02-20 23:09:12 -05:00
Lluís Vilanova 7db1bffdee
tcg: Add type for vCPU pointers
Adds the 'TCGv_env' type for pointers to 'CPUArchState' objects. The
tracing infrastructure later needs to differentiate between regular
pointers and pointers to vCPUs.

Also changes all targets to use the new 'TCGv_env' type instead of the
generic 'TCGv_ptr'. As of now, the change is merely cosmetic ('TCGv_env'
translates into 'TCGv_ptr'), but that could change in the future to
enforce the difference.

Note that a 'TCGv_env' type (for 'CPUState') is not added, since all
helpers currently receive the architecture-specific
pointer ('CPUArchState').

Backports commit 1bcea73e13b2b059d0cb3301aeaca43e5656ef57 from qemu
2018-02-20 22:53:58 -05:00
Peter Maydell 20712bcb9a
target-arm: Make reserved ranges in ID_AA64* spaces RAZ, not UNDEF
The v8 ARM ARM defines that unused spaces in the ID_AA64* system
register ranges are Reserved and must RAZ, rather than being UNDEF.
Implement this.

In particular, ARM v8.2 adds a new feature register ID_AA64MMFR2,
and newer versions of the Linux kernel will attempt to read this,
which causes them not to boot up on versions of QEMU missing this fix.

Since the encoding .opc0 = 3, .opc1 = 0, .crn = 0, .crm = 2, .opc2 = 6
is actually defined in ARMv8 (as ID_MMFR4), we give it an entry in
the ARMCPU struct so CPUs can override it, though since none do
this too will just RAZ.

Backports commit e20d84c1407d43d5a2e2ac95dbb46db3b0af8f9f from qemu
2018-02-20 22:49:43 -05:00
Edgar E. Iglesias 66c4bd02eb
target-arm: Mark CNTHP_TVAL_EL2 as ARM_CP_NO_RAW
Mark CNTHP_TVAL_EL2 as ARM_CP_NO_RAW due to the register not
having any underlying state. This fixes an issue with booting
KVM enabled kernels when EL2 is on.

Backports commit d44ec156300a149b386a14d3ab349d3b83b66b8c from qemu
2018-02-20 22:30:44 -05:00
Peter Maydell eb02f0e818
target-arm: Implement MDCR_EL3.TPM and MDCR_EL2.TPM traps
Implement the performance monitor register traps controlled
by MDCR_EL3.TPM and MDCR_EL2.TPM. Most of the performance
registers already have an access function to deal with the
user-enable bit, and the TPM checks can be added there. We
also need a new access function which only implements the
TPM checks for use by the few not-EL0-accessible registers
and by PMUSERENR_EL0 (which is always EL0-readable).

Backports commit 1fce1ba985d9c5c96e5b9709e1356d1814b8fa9e from qemu
2018-02-20 22:29:26 -05:00
Peter Maydell ece364e7cc
target-arm: Fix handling of SDCR for 32-bit code
Fix two issues with our implementation of the SDCR:
* it is only present from ARMv8 onwards
* it does not contain several of the trap bits present in its 64-bit
counterpart the MDCR_EL3

Put the register description in the right place so that it does not
get enabled for ARMv7 and earlier, and give it a write function so that
we can mask out the bits which should not be allowed to have an effect
if EL3 is 32-bit.

Backports commit a8d64e735182cbbb5dcc98f41656b118c45e57cc from qemu
2018-02-20 22:26:58 -05:00
Peter Maydell 8477ed6389
target-arm: Make Monitor->NS PL1 mode changes illegal if HCR.TGE is 1
If HCR.TGE is 1 then mode changes via CPS and MSR from Monitor to
NonSecure PL1 modes are illegal mode changes. Implement this check
in bad_mode_switch().

(We don't currently implement HCR.TGE, but this is the only missing
check from the v8 ARM ARM G1.9.3 and so it's worth adding now; the
rest of the HCR.TGE checks can be added later as necessary.)

Backports commit 10eacda787ac9990dc22d4437b289200c819712c from qemu
2018-02-20 22:24:19 -05:00
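
A heavily simplified sketch of such a check; the state struct is an invented stand-in, the HCR_TGE bit position is assumed, and Secure/Non-secure tracking is omitted entirely, so this is only the shape of the logic, not the real bad_mode_switch().

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    /* AArch32 mode encodings (architectural values). */
    #define ARM_CPU_MODE_USR 0x10
    #define ARM_CPU_MODE_SVC 0x13
    #define ARM_CPU_MODE_MON 0x16

    #define HCR_TGE (1u << 27)

    typedef struct {
        uint32_t cur_mode;
        uint64_t hcr_el2;
    } CPUStateSketch;

    /* Returns true if a CPS/MSR change from cur_mode to new_mode is illegal.
     * A real check must also confirm the destination mode is Non-secure PL1. */
    static bool bad_mode_switch(const CPUStateSketch *env, uint32_t new_mode)
    {
        if (env->cur_mode == ARM_CPU_MODE_MON &&
            (env->hcr_el2 & HCR_TGE) &&
            new_mode != ARM_CPU_MODE_MON &&
            new_mode != ARM_CPU_MODE_USR) {
            return true;
        }
        return false;
    }

    int main(void)
    {
        CPUStateSketch env = { .cur_mode = ARM_CPU_MODE_MON, .hcr_el2 = HCR_TGE };
        printf("Mon->SVC with HCR.TGE set illegal? %d\n",
               bad_mode_switch(&env, ARM_CPU_MODE_SVC));
        return 0;
    }
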
Peter Maydell 8bfdc63424
target-arm: Make mode switches from Hyp via CPS and MSR illegal
Mode switches from Hyp to any other mode via the CPS and MSR
instructions are illegal mode switches (though obviously switching
via exception return is valid). Add this check to bad_mode_switch().

Backports commit af393ffc6da116b9dd4c70901bad1f4cafb1773d from qemu
2018-02-20 22:23:23 -05:00
Peter Maydell 00d06bf20e
target-arm: In v8, make illegal AArch32 mode changes set PSTATE.IL
In v8, the illegal mode changes which are UNPREDICTABLE in v7 are
given architected behaviour:
* the mode field is unchanged
* PSTATE.IL is set (so any subsequent instructions will UNDEF)
* any other CPSR fields are written to as normal

This is pretty much the same behaviour we picked for our
UNPREDICTABLE handling, with the exception that for v8 we
need to set the IL bit.

Backports commit 81907a582901671c15be36a63b5063f88f3487e2 from qemu
2018-02-20 22:22:01 -05:00
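
The v8 behaviour described above can be sketched as follows; the mode and IL bit positions are the architectural ones, but the function is a simplified stand-in, not the real cpsr_write() logic.

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define CPSR_M  0x1fu         /* mode field */
    #define CPSR_IL (1u << 20)    /* CPSR.IL / PSTATE.IL bit */

    /* v8 rule for an illegal AArch32 mode change: keep the old mode bits,
     * set IL, and let every other written field take effect as usual. */
    static uint32_t cpsr_write_v8(uint32_t cpsr, uint32_t val, bool bad_switch)
    {
        if (bad_switch) {
            val = (val & ~CPSR_M) | (cpsr & CPSR_M);   /* mode unchanged */
            val |= CPSR_IL;                            /* later insns UNDEF */
        }
        return val;
    }

    int main(void)
    {
        uint32_t cpsr = 0x13;                          /* currently in SVC */
        uint32_t next = cpsr_write_v8(cpsr, 0x1a /* Hyp */, true);
        printf("resulting CPSR: 0x%08x (IL set: %d)\n",
               next, !!(next & CPSR_IL));
        return 0;
    }
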
Peter Maydell 2296fb5915
target-arm: Forbid mode switch to Mon from Secure EL1
In v8 trying to switch mode to Mon from Secure EL1 is an
illegal mode switch. (In v7 this is impossible as all secure
modes except User are at EL3.) We can handle this case by
making a switch to Mon valid only if the current EL is 3,
which then gives the correct answer whether EL3 is AArch32
or AArch64.

Backports commit 58ae2d1f037fae1d90eed4522053a85d79edfbec from qemu
2018-02-20 22:21:10 -05:00
Peter Maydell 4919c7287c
target-arm: Add Hyp mode checks to bad_mode_switch()
We don't actually support Hyp mode yet, but add the correct
checks for it to the bad_mode_switch() function for completeness.

Backports commit e6c8fc07b4fce0729bb747770756835f4b0ca7f4 from qemu
2018-02-20 22:20:19 -05:00
Peter Maydell 339e3e340e
target-arm: Add comment about not implementing NSACR.RFR
QEMU doesn't implement the NSACR.RFR bit, which is a permitted
IMPDEF choice in ARMv7 and the only permitted choice in ARMv8.
Add a comment to bad_mode_switch() to note that this is why
FIQ is always a valid mode regardless of the CPU's Secure state.

Backports commit 52ff951b4f63a29593650a15efdf82f63d6d962d from qemu
2018-02-20 22:19:38 -05:00
Peter Maydell a468baff61
target-arm: In cpsr_write() ignore mode switches from User mode
The only case where we can attempt a cpsr_write() mode switch from
User is from the gdbstub; all other cases are handled in the
calling code (notably translate.c). Architecturally attempts to
alter the mode bits from user mode are simply ignored (and not
treated as a bad mode switch, which in v8 sets CPSR.IL). Make
mode switches from User ignored in cpsr_write() as well, for
consistency.

Backports commit cb01d3912c8b000ed26d5fe95f6c194b3e3ba7a6 from qemu
2018-02-20 22:18:48 -05:00
Peter Maydell 553e230088
target-arm: Raw CPSR writes should skip checks and bank switching
Raw CPSR writes should skip the architectural checks for whether
we're allowed to set the A or F bits and should also not do
the switching of register banks if the mode changes. Handle
this inside cpsr_write(), which allows us to drop the "manually
set the mode bits to avoid the bank switch" code from all the
callsites which are using CPSRWriteRaw.

This fixes a bug in 32-bit KVM handling where we had forgotten
the "manually set the mode bits" part and could thus potentially
trash the register state if the mode from the last exit to userspace
differed from the mode on this exit.

Backports commit f8c88bbcda76d5674e4bb125471371b41d330df8 from qemu
2018-02-20 22:17:48 -05:00
Peter Maydell 611d4dad4b
target-arm: Add write_type argument to cpsr_write()
Add an argument to cpsr_write() to indicate what kind of CPSR
write is being requested, since the exact behaviour should
differ for the different cases.

Backports commit 50866ba5a2cfe922aaf3edb79f6eac5b0653477a from qemu
2018-02-20 22:15:53 -05:00
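
The resulting interface shape from this group of CPSR commits can be sketched as below; the enumerator names follow the commit messages, but the function body is a heavily simplified stand-in for the real cpsr_write().

    #include <stdint.h>
    #include <stdio.h>

    #define CPSR_M 0x1fu   /* mode field */

    /* One writer kind per situation whose rules differ. */
    typedef enum {
        CPSRWriteByInstr,           /* MSR/CPS: apply architectural checks */
        CPSRWriteExceptionReturn,   /* exception return: its own rules     */
        CPSRWriteRaw                /* gdbstub/KVM sync: no checks, no bank switch */
    } CPSRWriteType;

    typedef struct { uint32_t uncached_cpsr; } CPUStateSketch;

    static void cpsr_write(CPUStateSketch *env, uint32_t val, uint32_t mask,
                           CPSRWriteType write_type)
    {
        if (write_type != CPSRWriteRaw) {
            /* Here the real code validates mode changes (bad_mode_switch()),
             * ignores mode changes requested from User, applies A/F write
             * permissions and switches banked registers. */
        }
        env->uncached_cpsr = (env->uncached_cpsr & ~mask) | (val & mask);
    }

    int main(void)
    {
        CPUStateSketch env = { .uncached_cpsr = 0x13 };
        cpsr_write(&env, 0x1d3, 0xffffffffu, CPSRWriteRaw);
        printf("CPSR after raw write: 0x%08x\n", env.uncached_cpsr);
        return 0;
    }
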
Peter Maydell 6ae2357be6
target-arm: Give CPSR setting on 32-bit exception return its own helper
The rules for setting the CPSR on a 32-bit exception return are
subtly different from those for setting the CPSR via an instruction
like MSR or CPS. (In particular, in Hyp mode changing the mode bits
is not valid via MSR or CPS.) Split the exception-return case into
its own helper for setting CPSR, so we can eventually handle them
differently in the helper function.

Backports commit 235ea1f5c89abf30e452539b973b0dbe43d3fe2b from qemu
2018-02-20 22:08:35 -05:00
Yongbok Kim 46284f1a41
target-mips: implement R6 multi-threading
MIPS Release 6 provides multi-threading features which replace
pre-R6 MT Module. CP0.Config3.MT is always 0 in R6, instead there is new
CP0.Config5.VP (Virtual Processor) bit which indicates presence of
multi-threading support which includes CP0.GlobalNumber register and
DVP/EVP instructions.

Backports commit 01bc435b44b8802cc4697faa07d908684afbce4e from qemu
2018-02-20 22:02:40 -05:00
Paolo Bonzini abb0408274
target-i386: fix confusion in xcr0 bit position vs. mask
The xsave and xrstor helpers are accessing the x86_ext_save_areas array
using a bit mask instead of a bit position. Provide two sets of XSTATE_*
definitions and use XSTATE_*_BIT when a bit position is requested.

Backports commit cfc3b074de4b4ccee2540edbf8cfdb026dc19943 from qemu
2018-02-20 21:00:41 -05:00
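
A standalone sketch of the bit-position vs. bit-mask distinction the commit above is about; the component numbering follows the x86 XSAVE layout, but the table contents are illustrative.

    #include <stdint.h>
    #include <stdio.h>

    /* Providing both _BIT (position) and _MASK (1 << position) names makes
     * it hard to index an array with a mask by mistake. */
    #define XSTATE_FP_BIT    0
    #define XSTATE_SSE_BIT   1
    #define XSTATE_YMM_BIT   2

    #define XSTATE_FP_MASK   (1ULL << XSTATE_FP_BIT)
    #define XSTATE_SSE_MASK  (1ULL << XSTATE_SSE_BIT)
    #define XSTATE_YMM_MASK  (1ULL << XSTATE_YMM_BIT)

    typedef struct { const char *name; unsigned size; } ExtSaveArea;

    static const ExtSaveArea x86_ext_save_areas[] = {
        [XSTATE_FP_BIT]  = { "x87", 160 },   /* sizes illustrative */
        [XSTATE_SSE_BIT] = { "SSE", 256 },
        [XSTATE_YMM_BIT] = { "AVX", 256 },
    };

    int main(void)
    {
        uint64_t xcr0 = XSTATE_FP_MASK | XSTATE_SSE_MASK | XSTATE_YMM_MASK;

        /* Arrays are indexed by bit *position*; xcr0 is tested with *masks*. */
        for (unsigned i = 0; i < 3; i++) {
            if (xcr0 & (1ULL << i)) {
                printf("component %u (%s) enabled, size %u\n",
                       i, x86_ext_save_areas[i].name,
                       x86_ext_save_areas[i].size);
            }
        }
        return 0;
    }
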
Gonglei 26951bf754
memory: Remove unreachable return statement
Backports commit d61524486c6e503e502241a2ea834f930f98a6a1 from qemu
2018-02-20 20:54:24 -05:00
Gonglei d25285bc78
memory: optimize qemu_get_ram_ptr and qemu_ram_ptr_length
These two functions spend too much CPU time looking up the
RAMBlock by RAM address.

After this patch, we can pass the RAMBlock pointer to them so
that most of the time they no longer need to search for the
RAMBlock, giving better performance in address translation.

Backports commit 3655cb9c7375a595a8051ec677c515b24d5c1fe6 from qemu
2018-02-20 20:53:31 -05:00
Gonglei 39e4d63e68
exec: store RAMBlock pointer into memory region
Each RAM memory region has a unique corresponding RAMBlock.
In the current implementation, the memory region only stores
the ram_addr, i.e. the offset within the RAM address space.
To get the RAMBlock we need to query the global ram_list for
the block matching that ram_addr, which is very expensive.

Now we store the RAMBlock pointer in the memory region
structure, so if we know the mr we can get the RAMBlock
directly.

Backports commit 58eaa2174e99d9a05172d03fd2799ab8fd9e6f60 from qemu
2018-02-20 20:43:32 -05:00
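
A cut-down sketch of the idea behind the two RAMBlock commits above; the struct layouts and field names are simplified stand-ins for QEMU's RAMBlock and MemoryRegion.

    #include <stdint.h>
    #include <stdio.h>

    typedef struct RAMBlock {
        uint8_t *host;          /* host address of the block's memory        */
        uint64_t offset;        /* offset within the guest RAM address space */
        uint64_t used_length;
    } RAMBlock;

    typedef struct MemoryRegion {
        uint64_t ram_addr;      /* what used to be the only link to RAM      */
        RAMBlock *ram_block;    /* new: direct pointer, no list walk needed  */
    } MemoryRegion;

    /* With the block cached in the region (and optionally passed in by the
     * caller), translation can go straight to the host pointer instead of
     * searching the global ram_list on every access. */
    static void *ram_ptr(RAMBlock *block, uint64_t addr)
    {
        return block->host + (addr - block->offset);
    }

    int main(void)
    {
        static uint8_t backing[4096];
        RAMBlock blk = { .host = backing, .offset = 0x1000, .used_length = 4096 };
        MemoryRegion mr = { .ram_addr = 0x1000, .ram_block = &blk };

        printf("host pointer for guest ram addr 0x1800: %p\n",
               ram_ptr(mr.ram_block, 0x1800));
        return 0;
    }
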
Peter Maydell 764c2d09e5
tcg: Remove unnecessary osdep.h includes from tcg-target.inc.c
Commit 757e725b58c57d added a number of #include "qemu/osdep.h"
files to the tcg-target.c files (as they were named at the time).
These are unnecessary because these files are not standalone C
files, and the tcg/tcg.c file which includes them will have
already included osdep.h on their behalf. Remove the unneeded
include directives.

Backports commit c3b7f66800fbf9f47fddbcf2e2cd30ea932e0aae from qemu
2018-02-20 20:41:00 -05:00
Peter Maydell 7784a25470
tcg: Rename tcg-target.c to tcg-target.inc.c
Rename the per-architecture tcg-target.c files to tcg-target.inc.c.
This makes it clearer that they are not intended to be standalone
C files, but are instead #included into another source file.

Backports commit ce151109813e2770fd3cee2f37bfa2cdd01a12b9 from qemu
2018-02-20 20:39:57 -05:00
Richard Henderson d609ab30c2
target-sparc: Use global registers for the register window
Via indirection off cpu_regwptr.

Backports commit d2dc4069e046deeccc4dca0f73c3077ac22ba43f from qemu
2018-02-20 20:34:42 -05:00
Richard Henderson 3653771265
tcg: Allocate indirect_base temporaries in a different order
Since we've not got liveness analysis for indirect bases,
placing them at the end of the call-saved registers makes
it more likely that it'll stay live.

Backports commit 91478cefaaf2fa678e56df8635b34957f4d5d565 from qemu
2018-02-20 19:46:59 -05:00
Richard Henderson bf385eba3c
tcg: Implement indirect memory registers
That is, global_mem registers whose base is another global_mem
register, rather than a fixed register.

Backports commit b3915dbbdcdb2e04753f3d34a1b0865eea005069 from qemu
2018-02-20 19:20:01 -05:00
Richard Henderson 9299329349
tcg: Work around clang bug wrt enum ranges, part 2
A previous patch changed the type of REG from int
to enum TCGReg, which provokes the following bug in clang:

https://llvm.org/bugs/show_bug.cgi?id=16154

Backports commit 869938ae2a284fe730cb6f807ea0f9e324e0f87c from qemu
2018-02-20 19:12:49 -05:00
Peter Maydell 547fabd58e
osdep.h: Include config-target.h if NEED_CPU_H is defined
NEED_CPU_H is the define we use to distinguish per-target object
compilation from common object compilation. For the former, we must
also include config-target.h so that the .c files see the necessary
CONFIG_ constants.

Backports commit b1e34d1c3a9059e87719634bfc4db53174d63e14 from qemu
2018-02-20 19:11:07 -05:00
Peter Maydell c41bb9a772
osdep.h: Define macros for the benefit of C++ before C++11
For C++ before C++11, <stdint.h> requires definition of the macros
__STDC_CONSTANT_MACROS, __STDC_LIMIT_MACROS and __STDC_FORMAT_MACROS
in order to enable definition of various macros by the header file.
Define these in osdep.h, so that we get the right header file
definitions whether osdep.h is being used by plain C, C++11 or
older C++.

In particular libvixl's header files depend on this and won't
compile if osdep.h is included before them otherwise.

Backports commit 79f56d82f805b170fa2be8c04b682117be56483f from qemu
2018-02-20 19:09:58 -05:00
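
A small self-contained example of why the defines matter: pre-C++11 <stdint.h>/<inttypes.h> only expose UINT64_C, UINT64_MAX, PRIu64 and friends when these macros are defined before the first include, so a common header has to define them up front. In plain C and C++11 the defines are harmless.

    #ifndef __STDC_CONSTANT_MACROS
    #define __STDC_CONSTANT_MACROS
    #endif
    #ifndef __STDC_LIMIT_MACROS
    #define __STDC_LIMIT_MACROS
    #endif
    #ifndef __STDC_FORMAT_MACROS
    #define __STDC_FORMAT_MACROS
    #endif

    #include <inttypes.h>
    #include <stdio.h>

    int main(void)
    {
        uint64_t big = UINT64_C(0x123456789abcdef0);
        printf("big = %" PRIu64 ", max = %" PRIu64 "\n", big, UINT64_MAX);
        return 0;
    }
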
Lioncash 6094a0373c
softfloat: Remove lingering fast casts 2018-02-20 19:04:22 -05:00
Lioncash c17fa2cad3
osdep.h: Remove int_fast*_t Solaris compatibility code
We now do not use the int_fast*_t types anywhere in QEMU, so we can
remove the compatibility definitions we were providing for the
benefit of ancient Solaris versions.

Backports commit 50fe4df8ee6aba63ae51457bad40ba26e3c9746f from qemu
2018-02-20 18:58:53 -05:00
Peter Maydell 36551f59bb
fpu: Use plain 'int' rather than 'int_fast16_t' for exponents
Use the plain 'int' type rather than 'int_fast16_t' for handling
exponents. Exponents don't need to be exactly 16 bits, so using int16_t
for them would confuse more than it clarified.

This should be a safe change because int_fast16_t semantics
permit use of 'int' (and on 32-bit glibc that is what you get).

Backports commit 0c48262d4772d40677364199372fb6ffcf487558 from qemu
2018-02-20 18:57:32 -05:00
Peter Maydell 9d0463feed
fpu: Use plain 'int' rather than 'int_fast16_t' for shift counts
Use the plain 'int' type rather than 'int_fast16_t' for shift counts
in the various shift related functions, since we don't actually care
about the size of the integer at all here, and using int16_t would
be confusing.

This should be a safe change because int_fast16_t semantics
permit use of 'int' (and on 32-bit glibc that is what you get).

Backports commit 07d792d2b08669bf6a97cbf590496078c4621068 from qemu
2018-02-20 17:01:17 -05:00
Peter Maydell 68cbe1b2ce
fpu: Remove use of int_fast16_t in conversions to int16
Make the functions which convert floating point to 16 bit integer
return int16_t rather than int_fast16_t, and correspondingly use
int_fast16_t in their internal implementations where appropriate.

(These functions are used only by the ARM target.)

Backports commit 0bb721d7217ed4a1abb44f521c5c7ec185062d58 from qemu
2018-02-20 16:54:04 -05:00
Peter Maydell 69d31fbbab
target-mips: Stop using uint_fast*_t types in r4k_tlb_t struct
The r4k_tlb_t structure uses the uint_fast*_t types. Most of these
uses are in bitfields and are thus pointless, because the bitfield
itself specifies the width of the type; just use 'unsigned int'
instead. (On glibc uint_fast16_t is defined as either 32 or 64 bits,
so we know the code is not reliant on it being exactly 16 bits.)
There is also one use of uint_fast8_t, which we replace with uint8_t,
because both are exactly 8 bits on glibc and this is the only
place outside the softfloat code which uses an int_fast*_t type.

Backports commit d783f78933b212537ece77c7ec66866cc2bc0f4d from qemu
2018-02-20 16:48:03 -05:00
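
A minimal illustration of the point about bitfields: the declared width, not the base type, decides how many bits are stored, so a fast integer type buys nothing. The field names and widths below are illustrative, not the full r4k_tlb_t.

    #include <stdio.h>

    struct tlb_sketch {
        unsigned int ASID:8;
        unsigned int G:1;
        unsigned int C0:3;
        unsigned int V0:1;
        unsigned int D0:1;
    };

    int main(void)
    {
        struct tlb_sketch e = { .ASID = 0x2a, .G = 1, .V0 = 1 };
        printf("sizeof(struct tlb_sketch) = %zu, ASID = 0x%x\n",
               sizeof(struct tlb_sketch), e.ASID);
        return 0;
    }
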
Eric Blake e096e62127
qapi: Don't box branches of flat unions
There's no reason to do two malloc's for a flat union; let's just
inline the branch struct directly into the C union branch of the
flat union.

Surprisingly, fewer clients were actually using explicit references
to the branch types in comparison to the number of flat unions
thus modified.

This lets us reduce the hack in qapi-types:gen_variants() added in
the previous patch; we no longer need to distinguish between
alternates and flat unions.

The change to unboxed structs means that u.data (added in commit
cee2dedb) is now coincident with random fields of each branch of
the flat union, whereas beforehand it was only coincident with
pointers (since all branches of a flat union have to be objects).
Note that this was already the case for simple unions - but there
we got lucky. Remember, visit_start_union() blindly returns true
for all visitors except for the dealloc visitor, where it returns
the value !!obj->u.data, and that this result then controls
whether to proceed with the visit to the variant. Pre-patch,
this meant that flat unions were testing whether the boxed pointer
was still NULL, and thereby skipping visit_end_implicit_struct()
and avoiding a NULL dereference if the pointer had not been
allocated. The same was true for simple unions where the current
branch had pointer type, except there we bypassed visit_type_FOO().
But for simple unions where the current branch had scalar type, the
contents of that scalar meant that the decision to call
visit_type_FOO() was data-dependent - the reason we got lucky there
is that visit_type_FOO() for all scalar types in the dealloc visitor
is a no-op (only the pointer variants had anything to free), so it
did not matter whether the dealloc visit was skipped. But with this
patch, we would risk leaking memory if we could skip a call to
visit_type_FOO_fields() based solely on a data-dependent decision.

But notice: in the dealloc visitor, visit_type_FOO() already handles
a NULL obj - it was only the visit_type_implicit_FOO() that was
failing to check for NULL. And now that we have refactored things to
have the branch be part of the parent struct, we no longer have a
separate pointer that can be NULL in the first place. So we can just
delete the call to visit_start_union() altogether, and blindly visit
the branch type; there is no change in behavior except to the dealloc
visitor, where we now unconditionally visit the branch, but where that
visit is now always safe (for a flat union, we can no longer
dereference NULL, and for a simple union, visit_type_FOO() was already
safely handling NULL on pointer types).

Unfortunately, simple unions are not as easy to switch to unboxed
layout; because we are special-casing the hidden implicit type with
a single 'data' member, we really DO need to keep calling another
layer of visit_start_struct(), with a second malloc; although there
are some cleanups planned for simple unions in later patches.

visit_start_union() and gen_visit_implicit_struct() are now unused.
Drop them.

Note that after this patch, the only remaining use of
visit_start_implicit_struct() is for alternate types; the next patch
will do further cleanup based on that fact.

Backports commit 544a3731591f5d53e15f22de00ce5ac758d490b3 from qemu
2018-02-20 16:44:55 -05:00
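
A standalone sketch of the boxed-vs-unboxed layout described above; the type names are hypothetical QAPI-style examples invented for illustration, not generated QEMU code.

    #include <stdio.h>
    #include <stdlib.h>

    typedef struct { long cylinders; } BlockdevOptionsDisk;
    typedef struct { char host[32]; } BlockdevOptionsNet;

    typedef enum { KIND_DISK, KIND_NET } BlockdevKind;

    /* Before: each branch was a separately malloc'd pointer.
     * After (shown here): the branch structs live inline in the C union, so
     * one allocation covers the whole object and the dealloc visitor can
     * always visit the active branch safely. */
    typedef struct {
        BlockdevKind kind;              /* discriminator                     */
        union {
            BlockdevOptionsDisk disk;   /* unboxed: no second malloc         */
            BlockdevOptionsNet net;
            void *data;                 /* kept around, overlaps the above   */
        } u;
    } BlockdevOptions;

    int main(void)
    {
        BlockdevOptions *opts = calloc(1, sizeof(*opts));  /* one allocation */
        opts->kind = KIND_DISK;
        opts->u.disk.cylinders = 1024;
        printf("kind=%d cylinders=%ld\n", opts->kind, opts->u.disk.cylinders);
        free(opts);
        return 0;
    }
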