unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-24 03:15:33 +00:00

Author	SHA1	Message	Date
Andrew Baumann	5250db33b5	arm: implement cache/shareability attribute bits for PAR registers On a successful address translation instruction, PAR is supposed to contain cacheability and shareability attributes determined by the translation. We previously returned 0 for these bits (in line with the general strategy of ignoring caches and memory attributes), but some guest OSes may depend on them. This patch collects the attribute bits in the page-table walk, and updates PAR with the correct attributes for all LPAE translations. Short descriptor formats still return 0 for these bits, as in the prior implementation. Backports commit 5b2d261d60caf9d988d91ca1e02392d6fc8ea104 from qemu	2018-03-05 11:35:28 -05:00
Stefano Stabellini	1212c9b73c	fix WFI/WFE length in syndrome register WFI/E are often, but not always, 4 bytes long. When they are, we need to set ARM_EL_IL_SHIFT in the syndrome register. Pass the instruction length to HELPER(wfi), use it to decrement pc appropriately and to pass an is_16bit flag to syn_wfx, which sets ARM_EL_IL_SHIFT if needed. Set dc->insn in both arm_tr_translate_insn and thumb_tr_translate_insn. Backports commit 58803318e5a546b2eb0efd7a053ed36b6c29ae6f from qemu	2018-03-05 11:21:51 -05:00
Richard Henderson	28061c2e59	qom: Introduce CPUClass.tcg_initialize Move target cpu tcg initialization to common code, called from cpu_exec_realizefn. Backports commit 55c3ceef61fcf06fc98ddc752b7cce788ce7680b from qemu	2018-03-05 09:49:26 -05:00
Richard Henderson	4d9c8583fa	tcg: Remove TCGV_EQUAL* When we used structures for TCGv_*, we needed a macro in order to perform a comparison. Now that we use pointers, this is just clutter Backports commit 11f4e8f8bfaa2caaab24bef6bbbb8a0205015119 from qemu	2018-03-05 09:16:07 -05:00
Richard Henderson	d450156414	tcg: Remove GET_TCGV_* and MAKE_TCGV_* The GET and MAKE functions weren't really specific enough. We now have a full complement of functions that convert exactly between temporaries, arguments, tcgv pointers, and indices. The target/sparc change is also a bug fix, which would have affected a host that defines TCG_TARGET_HAS_extr[lh]_i64_i32, i.e. MIPS64. Backports commit dc41aa7d34989b552efe712ffe184236216f960b from qemu	2018-03-05 09:12:26 -05:00
Richard Henderson	eb488f5bd6	tcg: Merge opcode arguments into TCGOp Rather than have a separate buffer of 10*max_ops entries, give each opcode 10 entries. The result is actually a bit smaller and should have slightly more cache locality. Backports commit 75e8b9b7aa0b95a761b9add7e2f09248b101a392 from qemu	2018-03-05 04:45:20 -05:00
Paolo Bonzini	7dd4afd8d9	target/i386: trap on instructions longer than >15 bytes Besides being more correct, arbitrarily long instruction allow the generation of a translation block that spans three pages. This confuses the generator and even allows ring 3 code to poison the translation block cache and inject code into other processes that are in guest ring 3. This is an improved (and more invasive) fix for commit 30663fd ("tcg/i386: Check the size of instruction being translated", 2017-03-24). In addition to being more precise (and generating the right exception, which is #GP rather than #UD), it distinguishes better between page faults and too long instructions, as shown by this test case: int main() { char x = mmap(NULL, 8192, PROT_READ\|PROT_WRITE\|PROT_EXEC, MAP_PRIVATE\|MAP_ANON, -1, 0); memset(x, 0x66, 4096); x[4096] = 0x90; x[4097] = 0xc3; char i = x + 4096 - 15; mprotect(x + 4096, 4096, PROT_READ\|PROT_WRITE); ((void(*)(void)) i) (); } ... which produces a #GP without the mprotect, and a #PF with it. Backports commit b066c5375737ad0d630196dab2a2b329515a1d00 from qemu	2018-03-05 04:12:28 -05:00
Paolo Bonzini	44f87a8fbf	target/i386: introduce x86_ld*_code These take care of advancing s->pc, and will provide a unified point where to check for the 15-byte instruction length limit. Backports commit e3af7c788b73a6495eb9d94992ef11f6ad6f3c56 from qemu	2018-03-05 04:11:02 -05:00
Peter Maydell	0c06666800	target/arm: Implement SG instruction corner cases The common situation of the SG instruction is that it is executed from S&NSC memory by a CPU in NS state. That case is handled by v7m_handle_execute_nsc(). However the instruction also has defined behaviour in a couple of other cases: * SG instruction in NS memory (behaves as a NOP) * SG in S memory but CPU already secure (clears IT bits and does nothing else) * SG instruction in v8M without Security Extension (NOP) These can be implemented in translate.c. Backports commit 76eff04d166b8fe747adbe82de8b7e060e668ff9 from qemu	2018-03-05 03:47:20 -05:00
Peter Maydell	272427b4a0	target/arm: Support some Thumb insns being always unconditional A few Thumb instructions are always unconditional even inside an IT block (as opposed to being UNPREDICTABLE if used inside an IT block): BKPT, the v8M SG instruction, and the A profile HLT (debug halt) instruction. This means we need to suppress the jump-over-instruction-on-condfail code generation (though the IT state still advances as usual and subsequent insns in the IT block may be conditional). Backports commit dcf14dfb704519846f396a376339ebdb93eaf049 from qemu	2018-03-05 03:46:10 -05:00
Peter Maydell	7a293cd7cc	target-arm: Simplify insn_crosses_page() Recent changes have left insn_crosses_page() more complicated than it needed to be: * it's only called from thumb_tr_translate_insn() so we know for certain that we're looking at a Thumb insn * the caller's check for dc->pc >= dc->next_page_start - 3 means that dc->pc can't possibly be 4 aligned, so there's no need to check that (the check was partly there to ensure that we didn't treat an ARM insn as Thumb, I think) * we now have thumb_insn_is_16bit() which lets us do a precise check of the length of the next insn, rather than opencoding an inaccurate check Simplify it down to just loading the first half of the insn and calling thumb_insn_is_16bit() on it. Backports commit 5b8d7289e9e92a0d7bcecb93cd189e245fef10cd from qemu	2018-03-05 03:44:54 -05:00
Peter Maydell	96f86f472a	target/arm: Pull Thumb insn word loads up to top level Refactor the Thumb decode to do the loads of the instruction words at the top level rather than only loading the second half of a 32-bit Thumb insn in the middle of the decode. This is simple apart from the awkward case of Thumb1, where the BL/BLX prefix and suffix instructions live in what in Thumb2 is the 32-bit insn space. To handle these we decode enough to identify whether we're looking at a prefix/suffix that we handle as a 16 bit insn, or a prefix that we're going to merge with the following suffix to consider as a 32 bit insn. The translation of the 16 bit cases then moves from disas_thumb2_insn() to disas_thumb_insn(). The refactoring has the benefit that we don't need to pass the CPUARMState* down into the decoder code any more, but the major reason for doing this is that some Thumb instructions must be always unconditional regardless of the IT state bits, so we need to know the whole insn before we emit the "skip this insn if the IT bits and cond state tell us to" code. (The always unconditional insns are BKPT, HLT and SG; the last of these is 32 bits.) Backports commit 296e5a0a6c393553079a641c50521ae33ff89324 from qemu	2018-03-05 03:43:38 -05:00
Peter Maydell	b85d617bda	target-arm: Don't check for "Thumb2 or M profile" for not-Thumb1 The code which implements the Thumb1 split BL/BLX instructions is guarded by a check on "not M or THUMB2". All we really need to check here is "not THUMB2" (and we assume that elsewhere too, eg in the ARCH(6T2) test that UNDEFs the Thumb2 insns). This doesn't change behaviour because all M profile cores have Thumb2 and so ARM_FEATURE_M implies ARM_FEATURE_THUMB2. (v6M implements a very restricted subset of Thumb2, but we can cross that bridge when we get to it with appropriate feature bits.) Backports commit 6b8acf256df09c8a8dd7dcaa79b06eaff4ad63f7 from qemu	2018-03-05 03:34:48 -05:00
Peter Maydell	ee9b8a20c9	target/arm: Implement secure function return Secure function return happens when a non-secure function has been called using BLXNS and so has a particular magic LR value (either 0xfefffffe or 0xfeffffff). The function return via BX behaves specially when the new PC value is this magic value, in the same way that exception returns are handled. Adjust our BX excret guards so that they recognize the function return magic number as well, and perform the function-return unstacking in do_v7m_exception_exit(). Backports commit d02a8698d7ae2bfed3b11fe5b064cb0aa406863b from qemu	2018-03-05 03:33:42 -05:00
Peter Maydell	e312993f1f	target/arm: Implement BLXNS Implement the BLXNS instruction, which allows secure code to call non-secure code. Backports commit 3e3fa230e3b8ffe119f14ba57a6bc677a411be57 from qemu	2018-03-05 03:31:59 -05:00
Peter Maydell	2c4578f46e	target/arm: Implement SG instruction Implement the SG instruction, which we emulate 'by hand' in the exception handling code path. Backports commit 333e10c51ef5876ced26f77b61b69ce0f83161a9 from qemu	2018-03-05 03:28:28 -05:00
Peter Maydell	19ecd4f732	target/arm: Add M profile secure MMU index values to get_a32_user_mem_index() Add the M profile secure MMU index values to the switch in get_a32_user_mem_index() so that LDRT/STRT work correctly rather than asserting at translate time. Backports commit b9f587d62cebed427206539750ebf59bde4df422 from qemu	2018-03-05 03:25:54 -05:00
Emilio G. Cota	5fae6dd433	tcg: remove addr argument from lookup_tb_ptr It is unlikely that we will ever want to call this helper passing an argument other than the current PC. So just remove the argument, and use the pc we already get from cpu_get_tb_cpu_state. This change paves the way to having a common "tb_lookup" function. Backports commit 7f11636dbee89b0e4d03e9e2b96e14649a7db778 from qemu	2018-03-05 02:16:34 -05:00
Todd Eisenberger	75bdfd85a7	x86: Correct translation of some rdgsbase and wrgsbase encodings It looks like there was a transcription error when writing this code initially. The code previously only decoded src or dst of rax. This resolves https://bugs.launchpad.net/qemu/+bug/1719984. Backports commit e0dd5fd41a1a38766009f442967fab700d2d0550 from qemu	2018-03-05 02:05:26 -05:00
Peter Maydell	059f238f11	target/arm: Factor out "get mmuidx for specified security state" For the SG instruction and secure function return we are going to want to do memory accesses using the MMU index of the CPU in secure state, even though the CPU is currently in non-secure state. Write arm_v7m_mmu_idx_for_secstate() to do this job, and use it in cpu_mmu_index(). Backports commit b81ac0eb6315e602b18439961e0538538e4aed4f from qemu	2018-03-05 02:00:23 -05:00
Peter Maydell	6958a4763d	target/arm: Fix calculation of secure mm_idx values In cpu_mmu_index() we try to do this: if (env->v7m.secure) { mmu_idx += ARMMMUIdx_MSUser; } but it will give the wrong answer, because ARMMMUIdx_MSUser includes the 0x40 ARM_MMU_IDX_M field, and so does the mmu_idx we're adding to, and we'll end up with 0x8n rather than 0x4n. This error is then nullified by the call to arm_to_core_mmu_idx() which masks out the high part, but we're about to factor out the code that calculates the ARMMMUIdx values so it can be used without passing it through arm_to_core_mmu_idx(), so fix this bug first. Backports commit fe768788d29597ee56fc11ba2279d502c2617457 from qemu	2018-03-05 01:58:42 -05:00
Peter Maydell	7988aec017	target/arm: Implement security attribute lookups for memory accesses Implement the security attribute lookups for memory accesses in the get_phys_addr() functions, causing these to generate various kinds of SecureFault for bad accesses. The major subtlety in this code relates to handling of the case when the security attributes the SAU assigns to the address don't match the current security state of the CPU. In the ARM ARM pseudocode for validating instruction accesses, the security attributes of the address determine whether the Secure or NonSecure MPU state is used. At face value, handling this would require us to encode the relevant bits of state into mmu_idx for both S and NS at once, which would result in our needing 16 mmu indexes. Fortunately we don't actually need to do this because a mismatch between address attributes and CPU state means either: * some kind of fault (usually a SecureFault, but in theory perhaps a UserFault for unaligned access to Device memory) * execution of the SG instruction in NS state from a Secure & NonSecure code region The purpose of SG is simply to flip the CPU into Secure state, so we can handle it by emulating execution of that instruction directly in arm_v7m_cpu_do_interrupt(), which means we can treat all the mismatch cases as "throw an exception" and we don't need to encode the state of the other MPU bank into our mmu_idx values. This commit doesn't include the actual emulation of SG; it also doesn't include implementation of the IDAU, which is a per-board way to specify hard-coded memory attributes for addresses, which override the CPU-internal SAU if they specify a more secure setting than the SAU is programmed to. Backports commit 35337cc391245f251bfb9134f181c33e6375d6c1 from qemu	2018-03-05 01:57:07 -05:00
Peter Maydell	f9b4381ce0	nvic: Implement Security Attribution Unit registers Implement the register interface for the SAU: SAU_CTRL, SAU_TYPE, SAU_RNR, SAU_RBAR and SAU_RLAR. None of the actual behaviour is implemented here; registers just read back as written. When the CPU definition for Cortex-M33 is eventually added, its initfn will set cpu->sau_sregion, in the same way that we currently set cpu->pmsav7_dregion for the M3 and M4. Number of SAU regions is typically a configurable CPU parameter, but this patch doesn't provide a QEMU CPU property for it. We can easily add one when we have a board that requires it. Backports commit 9901c576f6c02d43206e5faaf6e362ab7ea83246 from qemu	2018-03-05 01:55:11 -05:00
Peter Maydell	3da3a3fb41	target/arm: Add v8M support to exception entry code Add support for v8M and in particular the security extension to the exception entry code. This requires changes to: * calculation of the exception-return magic LR value * push the callee-saves registers in certain cases * clear registers when taking non-secure exceptions to avoid leaking information from the interrupted secure code * switch to the correct security state on entry * use the vector table for the security state we're targeting Backports commit d3392718e1fcf0859fb7c0774a8e946bacb8419c from qemu	2018-03-05 01:51:22 -05:00
Peter Maydell	39466771d6	target/arm: Add support for restoring v8M additional state context For v8M, exceptions from Secure to Non-Secure state will save callee-saved registers to the exception frame as well as the caller-saved registers. Add support for unstacking these registers in exception exit when necessary. Backports commit 907bedb3f3ce134c149599bd9cb61856d811b8ca from qemu	2018-03-05 01:47:25 -05:00
Peter Maydell	2feecbac0d	target/arm: Update excret sanity checks for v8M In v8M, more bits are defined in the exception-return magic values; update the code that checks these so we accept the v8M values when the CPU permits them. Backports commit bfb2eb52788b9605ef2fc9bc72683d4299117fde from qemu	2018-03-05 01:44:33 -05:00
Peter Maydell	33d2358c91	target/arm: Add new-in-v8M SFSR and SFAR Add the new M profile Secure Fault Status Register and Secure Fault Address Register. Backports commit bed079da04dd9e0e249b9bc22bca8dce58b67f40 from qemu	2018-03-05 01:42:52 -05:00
Peter Maydell	7af730ed3e	target/arm: Don't warn about exception return with PC low bit set for v8M In the v8M architecture, return from an exception to a PC which has bit 0 set is not UNPREDICTABLE; it is defined that bit 0 is discarded [R_HRJH]. Restrict our complaint about this to v7M. Backports commit 4e4259d3c574a8e89c3af27bcb84bc19a442efb1 from qemu	2018-03-05 01:41:51 -05:00
Peter Maydell	2aea283c4f	target/arm: Warn about restoring to unaligned stack Attempting to do an exception return with an exception frame that is not 8-aligned is UNPREDICTABLE in v8M; warn about this. (It is not UNPREDICTABLE in v7M, and our implementation can handle the merely-4-aligned case fine, so we don't need to do anything except warn.) Backports commit cb484f9a6e790205e69d9a444c3e353a3a1cfd84 from qemu	2018-03-05 01:40:40 -05:00
Peter Maydell	5063ca11ab	target/arm: Check for xPSR mismatch usage faults earlier for v8M ARM v8M specifies that the INVPC usage fault for mismatched xPSR exception field and handler mode bit should be checked before updating the PSR and SP, so that the fault is taken with the existing stack frame rather than by pushing a new one. Perform this check in the right place for v8M. Since v7M specifies in its pseudocode that this usage fault check should happen later, we have to retain the original code for that check rather than being able to merge the two. (The distinction is architecturally visible but only in very obscure corner cases like attempting an invalid exception return with an exception frame in read only memory.) Backports commit 224e0c300a0098fb577a03bd29d774d0769f632a from qemu	2018-03-05 01:39:39 -05:00
Peter Maydell	6f08acdcfe	target/arm: Restore SPSEL to correct CONTROL register on exception return On exception return for v8M, the SPSEL bit in the EXC_RETURN magic value should be restored to the SPSEL bit in the CONTROL register banked specified by the EXC_RETURN.ES bit. Add write_v7m_control_spsel_for_secstate() which behaves like write_v7m_control_spsel() but allows the caller to specify which CONTROL bank to use, reimplement write_v7m_control_spsel() in terms of it, and use it in exception return. Backports commit 3f0cddeee1f266d43c956581f3050058360a810d from qemu	2018-03-05 01:35:17 -05:00
Peter Maydell	0bb50b9a7e	target/arm: Restore security state on exception return Now that we can handle the CONTROL.SPSEL bit not necessarily being in sync with the current stack pointer, we can restore the correct security state on exception return. This happens before we start to read registers off the stack frame, but after we have taken possible usage faults for bad exception return magic values and updated CONTROL.SPSEL. Backports commit 3919e60b6efd9a86a0e6ba637aa584222855ac3a from qemu	2018-03-05 01:31:58 -05:00
Peter Maydell	c7b5fccfb8	target/arm: Prepare for CONTROL.SPSEL being nonzero in Handler mode In the v7M architecture, there is an invariant that if the CPU is in Handler mode then the CONTROL.SPSEL bit cannot be nonzero. This in turn means that the current stack pointer is always indicated by CONTROL.SPSEL, even though Handler mode always uses the Main stack pointer. In v8M, this invariant is removed, and CONTROL.SPSEL may now be nonzero in Handler mode (though Handler mode still always uses the Main stack pointer). In preparation for this change, change how we handle this bit: rename switch_v7m_sp() to the now more accurate write_v7m_control_spsel(), and make it check both the handler mode state and the SPSEL bit. Note that this implicitly changes the point at which we switch active SP on exception exit from before we pop the exception frame to after it. Backports commit de2db7ec894f11931932ca78cd14a8d2b1389d5b from qemu	2018-03-05 01:29:54 -05:00
Peter Maydell	8036c5b3de	target/arm: Don't switch to target stack early in v7M exception return Currently our M profile exception return code switches to the target stack pointer relatively early in the process, before it tries to pop the exception frame off the stack. This is awkward for v8M for two reasons: * in v8M the process vs main stack pointer is not selected purely by the value of CONTROL.SPSEL, so updating SPSEL and relying on that to switch to the right stack pointer won't work * the stack we should be reading the stack frame from and the stack we will eventually switch to might not be the same if the guest is doing strange things Change our exception return code to use a 'frame pointer' to read the exception frame rather than assuming that we can switch the live stack pointer this early. Backports commit 5b5223997c04b769bb362767cecb5f7ec382c5f0 from qemu	2018-03-05 01:26:05 -05:00
Jan Kiszka	ae16a26c20	arm: Fix SMC reporting to EL2 when QEMU provides PSCI This properly forwards SMC events to EL2 when PSCI is provided by QEMU itself and, thus, ARM_FEATURE_EL3 is off. Found and tested with the Jailhouse hypervisor. Solution based on suggestions by Peter Maydell. Backports commit 77077a83006c3c9bdca496727f1735a3c5c5355d from qemu	2018-03-05 01:19:22 -05:00
Peter Maydell	f0569ba11a	target/arm: Remove out of date ARM ARM section references in A64 decoder In the A64 decoder, we have a lot of references to section numbers from version A.a of the v8A ARM ARM (DDI0487). This version of the document is now long obsolete (we are currently on revision B.a), and various intervening versions renumbered all the sections. The most recent B.a version of the document doesn't assign section numbers at all to the individual instruction classes in the way that the various A.x versions did. The simplest thing to do is just to delete all the out of date C.x.x references. Backports commit 4ce31af4aeb8471f6a913de7c59d3bde1fc4f03d from qemu	2018-03-05 01:05:53 -05:00
Peter Maydell	72dadc6518	target/arm: Handle banking in negative-execution-priority check in cpu_mmu_index() Now that we have a banked FAULTMASK register and banked exceptions, we can implement the correct check in cpu_mmu_index() for whether the MPU_CTRL.HFNMIENA bit's effect should apply. This bit causes handlers which have requested a negative execution priority to run with the MPU disabled. In v8M the test has to check this for the current security state and so takes account of banking. Backports relevant part of commit 5d4791991d4de12e83d44738417c9e964167b6e8 from qemu	2018-03-05 00:54:28 -05:00
Peter Maydell	4b8bdda695	target/arm: Implement MSR/MRS access to NS banked registers In v8M the MSR and MRS instructions have extra register value encodings to allow secure code to access the non-secure banked version of various special registers. (We don't implement the MSPLIM_NS or PSPLIM_NS aliases, because we don't currently implement the stack limit registers at all.) Backports commit 50f11062d4c896408731d6a286bcd116d1e08465 from qemu	2018-03-05 00:53:13 -05:00
Eric Blake	f31c3b32fb	mips: Improve macro parenthesization Although none of the existing macro call-sites were broken, it's always better to write macros that properly parenthesize arguments that can be complex expressions, so that the intended order of operations is not broken. Backports commit 2a2be359c4335607c7f746cf27c412c08ab89aff from qemu	2018-03-05 00:51:51 -05:00
Igor Mammedov	00d52414c1	mips: replace cpu_mips_init() with cpu_generic_init() now cpu_mips_init() reimplements subset of cpu_generic_init() tasks, so just drop it and use cpu_generic_init() directly. Backports commit c4c8146cfd0fc3f95418fbc82a2eded594675022 from qemu	2018-03-05 00:49:10 -05:00
Igor Mammedov	97b525a794	mips: MIPSCPU model subclasses Register separate QOM types for each mips cpu model, so it would be possible to reuse generic CPU creation routines. Backports commit 41da212c9ce9482fcfd490170c2611470254f8dc from qemu	2018-03-05 00:42:29 -05:00
Philippe Mathieu-Daudé	4729b633f1	mips: call cpu_mips_realize_env() from mips_cpu_realizefn() This changes the order between cpu_mips_realize_env() and cpu_exec_initfn(), but cpu_exec_initfn() don't have anything that depends on cpu_mips_realize_env() being called first. Backports commit df4dc10284e1d871db8adb512816a561473ffe3e from qemu	2018-03-05 00:29:54 -05:00
Philippe Mathieu-Daudé	3257a8f8c3	mips: split cpu_mips_realize_env() out of cpu_mips_init() so it can be used in mips_cpu_realizefn() in the next commit Backports commit 27e38392ca07f97edfb2257b6a1394a04d84e8d5 from qemu	2018-03-05 00:28:17 -05:00
Philippe Mathieu-Daudé	c4f351394f	mips: introduce internal.h and cleanup cpu.h no logical change, only code movement (and fix a comment typo). Backports commit 26aa3d9aecbb6fe9bce808a1d127191bdf3cc3d2 from qemu Also backports commit 5502b66fc7d0bebd08b9b7017cb7e8b5261c3a2d	2018-03-05 00:25:56 -05:00
Igor Mammedov	607bc396c3	arm: drop intermediate cpu_model -> cpu type parsing and use cpu type directly Backports defines from commit ba1ba5cca3962a9cc400c713c736b4fb8db1f38e from qemu	2018-03-05 00:10:21 -05:00
Gonglei	ce71be5c05	i386/cpu/hyperv: support over 64 vcpus for windows guests Starting with Windows Server 2012 and Windows 8, if CPUID.40000005.EAX contains a value of -1, Windows assumes specific limit to the number of VPs. In this case, Windows Server 2012 guest VMs may use more than 64 VPs, up to the maximum supported number of processors applicable to the specific Windows version being used. https://docs.microsoft.com/en-us/virtualization/hyper-v-on-windows/reference/tlfs For compatibility, Let's introduce a new property for X86CPU, named "x-hv-max-vps" as Eduardo's suggestion, and set it to 0x40 before machine 2.10. (The "x-" prefix indicates that the property is not supposed to be a stable user interface.) Backports relevant parts of commit 6c69dfb67e84747cf071958594d939e845dfcc0c from qemu	2018-03-05 00:00:53 -05:00
Joseph Myers	a237d9dbca	target/i386: fix phminposuw in-place operation The SSE4.1 phminposuw instruction finds the minimum 16-bit element in the source vector, putting the value of that element in the low 16 bits of the destination vector, the index of that element in the next three bits and zeroing the rest of the destination. The helper for this operation fills the destination from high to low, meaning that when the source and destination are the same register, the minimum source element can be overwritten before it is copied to the destination. This patch fixes it to fill the destination from low to high instead, so the minimum source element is always copied first. This fixes one gcc test failure in my GCC 6-based testing (and so concludes the present sequence of patches, as I don't have any further gcc test failures left in that testing that I attribute to QEMU bugs). Backports commit aa406feadfc5b095ca147ec56d6187c64be015a7 from qemu	2018-03-04 23:59:26 -05:00
Joseph Myers	85b647e486	target/i386: fix pcmpxstrx substring search One of the cases of the SSE4.2 pcmpestri / pcmpestrm / pcmpistri / pcmpistrm instructions does a substring search. The implementation of this case in the pcmpxstrx helper is incorrect. The operation in this case is a search for a string (argument d to the helper) in another string (argument s to the helper); if a copy of d at a particular position would run off the end of s, the resulting output bit should be 0 whether or not the strings match in the region where they overlap, but the QEMU implementation was wrongly comparing only up to the point where s ends and counting it as a match if an initial segment of d matched a terminal segment of s. Here, "run off the end of s" means that some byte of d would overlap some byte outside of s; thus, if d has zero length, it is considered to match everywhere, including after the end of s. This patch fixes the implementation to correspond with the proper instruction semantics. This fixes four gcc test failures in my GCC 6-based testing. Backports commit ae35eea7e4a9f21dd147406dfbcd0c4c6aaf2a60 from qemu	2018-03-04 23:58:45 -05:00
Joseph Myers	59df0bae12	target/i386: fix packusdw in-place operation The SSE4.1 packusdw instruction combines source and destination vectors of signed 32-bit integers into a single vector of unsigned 16-bit integers, with unsigned saturation. When the source and destination are the same register, this means each 32-bit element of that register is used twice as an input, to produce two of the 16-bit output elements, and so if the operation is carried out element-by-element in-place, no matter what the order in which it is applied to the elements, the first element's operation will overwrite some future input. The helper for packssdw avoids this issue by computing the result in a local temporary and copying it to the destination at the end; this patch fixes the packusdw helper to do likewise. This fixes three gcc test failures in my GCC 6-based testing. Backports commit 80e19606215d4df370dfe8fe21c558a129f00f0b from qemu	2018-03-04 23:57:54 -05:00
Joseph Myers	84b3c54b18	target/i386: set rip_offset for further SSE instructions It turns out that my recent fix to set rip_offset when emulating some SSE4.1 instructions needs generalizing to cover a wider class of instructions. Specifically, every instruction in the sse_op_table7 table, coming from various instruction set extensions, has an 8-bit immediate operand that comes after any memory operand, and so needs rip_offset set for correctness if there is a memory operand that is rip-relative, and my patch only set it for a subset of those instructions. This patch moves the rip_offset setting to cover the wider class of instructions, so fixing 9 further gcc testsuite failures in my GCC 6-based testing. (I do not know whether there might be still further classes of instructions missing this setting.) Backports commit c6a8242915328cda0df0fbc0803da3448137e614 from qemu	2018-03-04 23:57:12 -05:00

1 2 3 4 5 ...

308 commits