unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-25 07:45:26 +00:00

Author	SHA1	Message	Date
Richard Henderson	927fb85b51	target/arm: Convert T16 add, compare, move (two high registers) Backports commit 90aa042115a0fe39fe4cb3bcae4c4f728e2f3fdb from qemu	2019-11-28 02:51:46 -05:00
Richard Henderson	7ca1b3f817	target/arm: Convert T16 branch and exchange Backports commit a0ef07740425b679d010fac7d9954ae003c1b191 from qemu	2019-11-28 02:51:39 -05:00
Richard Henderson	c690f562c4	target/arm: Convert T16 one low register and immediate Backports commit 6c6d237a865041972ec5b226657398f3b3018561 from qemu	2019-11-28 02:51:31 -05:00
Richard Henderson	d9184b16a9	target/arm: Convert T16 add/sub (3 low, 2 low and imm) Backports commit c4d3095bb62bdac0b4f9cb180bd7aa0b40c2c270 from qemu	2019-11-28 02:51:24 -05:00
Richard Henderson	c52cfb9aa6	target/arm: Convert T16 load/store multiple Backports commit 6e8514ba408f3cc758cd47e2da5475d8684507ec from qemu	2019-11-28 02:51:16 -05:00
Richard Henderson	10c8008266	target/arm: Convert T16 add pc/sp (immediate) Backports commit 1cb1323433dca657a42483b2291c1ae923a91726 from qemu	2019-11-28 02:51:08 -05:00
Richard Henderson	7f79c0f2a1	target/arm: Convert T16 load/store (immediate offset) Backports commit 07afd747f9fdd79fabf3a51416c7d795f873d297 from qemu	2019-11-28 02:51:01 -05:00
Richard Henderson	0ed7d59e35	target/arm: Convert T16 load/store (register offset) Backports commit d1d229179c6b011cc3fa124d4b6b649866470530 from qemu	2019-11-28 02:50:48 -05:00
Richard Henderson	e30f636e8a	target/arm: Convert T16 data-processing (two low regs) Backports commit 080c4eadcbbaf95a6fcc4668cf16e4580f2bfe11 from qemu	2019-11-28 02:50:40 -05:00
Richard Henderson	0b2f37afaf	target/arm: Add skeleton for T16 decodetree Backports commit f97b454e9e7f5d018d34b5ea85a66cff016bd3b7 from qemu	2019-11-28 02:50:27 -05:00
Richard Henderson	19f1da260f	target/arm: Simplify disas_arm_insn Fold away all of the cases that now just goto illegal_op, because all of their internal bits are now in decodetree. Backports commit 590057d969a54de5d97261701c5702b3bebc9c07 from qemu	2019-11-28 02:50:03 -05:00
Richard Henderson	e402eef2f0	target/arm: Simplify disas_thumb2_insn Fold away all of the cases that now just goto illegal_op, because all of their internal bits are now in decodetree. Backports commit f843e77144c9334e244a422848177f2fbef5eb05 from qemu	2019-11-28 02:48:20 -05:00
Richard Henderson	1b5c72935d	target/arm: Convert TT Backports commit d449f174e820b15ca1a1f5f3ec19999eeb7da14c from qemu	2019-11-28 02:48:06 -05:00
Richard Henderson	0e03a1f59c	target/arm: Convert SG Backports commit 35d240acf1b6a89558e74b490feb13267415b236 from qemu	2019-11-28 02:47:57 -05:00
Richard Henderson	0afb6ab7af	target/arm: Convert Table Branch Backports commit 808092bbe356eef0897476be50193d0778596877 from qemu	2019-11-28 02:47:48 -05:00
Richard Henderson	e6b4480ca9	target/arm: Convert Unallocated memory hint Backports commit 610f4e1764aa2049fa1711893ff62faf777813f3 from qemu	2019-11-28 02:47:41 -05:00
Richard Henderson	4593c67444	target/arm: Convert PLI, PLD, PLDW Backports commit beb595f657d615856fee904f1e0f74f5e1e299a3 from qemu	2019-11-28 02:47:33 -05:00
Richard Henderson	5a23a1f020	target/arm: Convert SETEND Backports commit 48c04a5dfaf2c08e00b659a22c502ec098999cf1 from qemu	2019-11-28 02:47:26 -05:00
Richard Henderson	55390520d6	target/arm: Convert CPS (privileged) Backports commit 52f83b9c68bdde8d3da5f49292d3561bd474651d from qemu	2019-11-28 02:47:19 -05:00
Richard Henderson	15558cfa7e	target/arm: Convert Clear-Exclusive, Barriers Backports commit 519b84711ea36bb8a339096e207b6b9b65cd2051 from qemu	2019-11-28 02:47:11 -05:00
Richard Henderson	eff475c9a9	target/arm: Convert RFE and SRS Backports commit 885782a78c6d05932c5d8f463dc50fc8312e3eb9 from qemu	2019-11-28 02:47:04 -05:00
Richard Henderson	87ff6a8bdf	target/arm: Convert SVC Backports commit 542f5188a14758d64f7504580a9bd3cae973f546 from qemu	2019-11-28 02:46:55 -05:00
Richard Henderson	a119870e57	target/arm: Convert B, BL, BLX (immediate) Backports commit 360144f3b99f9a626ffcc6b9d76537e3a3e0e708 from qemu	2019-11-28 02:46:47 -05:00
Richard Henderson	ed9b8ad2ea	target/arm: Diagnose base == pc for LDM/STM We have been using store_reg and not store_reg_for_load when writing back a loaded value into the base register. At first glance this is incorrect when base == pc, however that case is UNPREDICTABLE. Backports commit b0e382b8cf365fed8b8c43482029ac7655961a85 from qemu	2019-11-28 02:46:40 -05:00
Richard Henderson	1a0986ee25	target/arm: Diagnose too few registers in list for LDM/STM This has been a TODO item for quite a while. The minimum bit count for A32 and T16 is 1, and for T32 is 2. Backports commit 4b222545dbf30b60c033e1cd6eddda612575fd8c from qemu	2019-11-28 02:46:33 -05:00
Richard Henderson	fc81b12631	target/arm: Diagnose writeback register in list for LDM for v7 Prior to v7, for the A32 encoding, this operation wrote an UNKNOWN value back to the base register. Starting in v7 this is UNPREDICTABLE. Backports commit 3949f4675d13c587078f8f423845a3a537a22595 from qemu	2019-11-28 02:46:24 -05:00
Richard Henderson	a501800ba6	target/arm: Convert LDM, STM This includes a minor bug fix to LDM (user), which requires bit 21 to be 0, which means no writeback. Backports commit c5c426d4c680f908a1e262091a17b088b5709200 from qemu	2019-11-28 02:46:04 -05:00
Richard Henderson	e4ca88f9d6	target/arm: Convert MOVW, MOVT Backports commit 8f4451274b7010c1f50e0baa5bb608f19f02b90f from qemu	2019-11-28 02:46:04 -05:00
Richard Henderson	b35749e239	target/arm: Convert Signed multiply, signed and unsigned divide Backports commit 2c7c4e090409189488149869797da4acf895bad0 from qemu	2019-11-28 02:45:33 -05:00
Richard Henderson	987641cf10	target/arm: Convert packing, unpacking, saturation, and reversal Backports commit 46497f6af73bb33c1064d43a28a48cbb4d233a23 from qemu	2019-11-28 02:44:55 -05:00
Richard Henderson	83cced6170	target/arm: Convert Parallel addition and subtraction Backports commit adf1a5662a47d5b5b96f4f1e440e34c26b14a154 from qemu	2019-11-28 02:44:20 -05:00
Richard Henderson	21df423e47	target/arm: Convert USAD8, USADA8, SBFX, UBFX, BFC, BFI, UDF In op_bfx, note that tcg_gen_{,s}extract_i32 already checks for width == 32, so we don't need to special case that here. Backports commit 86d21e4b509a2835ed79f234f476a4c5191d435b from qemu	2019-11-28 02:44:20 -05:00
Richard Henderson	dbcc67ab20	target/arm: Diagnose UNPREDICTABLE ldrex/strex cases Backports commit af2882289951e58363d714afd16f80050685fa29 from qemu	2019-11-28 02:44:20 -05:00
Richard Henderson	3ac019eb98	target/arm: Convert Synchronization primitives Backports commit 1efdd407a25f617129e2e0d5c009c07cbe847990 from qemu	2019-11-28 02:44:18 -05:00
Richard Henderson	c794962c42	target/arm: Convert load/store (register, immediate, literal) Backports commit 5e291fe16846d216d5a69569b1c59f497dff96e4 from qemu	2019-11-28 02:42:01 -05:00
Richard Henderson	d5d98450f3	target/arm: Convert T32 ADDW/SUBW Backports commit 145952e87fb86aaa9434d768c31eedbd323f7157 from qemu	2019-11-28 02:42:01 -05:00
Richard Henderson	7b9025910d	target/arm: Convert the rest of A32 Miscelaneous instructions Backports commit 2cde9ea57dbc4cdee3677a1a335574537810fe2e from qemu	2019-11-28 02:42:01 -05:00
Richard Henderson	be2a259d3c	target/arm: Convert ERET Pass the T5 encoding of SUBS PC, LR, #IMM through the normal SUBS path to make it clear exactly what's happening -- we hit ALUExceptionReturn along that path. Backports commit ef11bc3c461e2c650e8bef552146a4b08f81884e from qemu	2019-11-28 02:42:00 -05:00
Richard Henderson	74040da34c	target/arm: Convert CLZ Document our choice about the T32 CONSTRAINED UNPREDICTABLE behaviour. This matches the undocumented choice made by the legacy decoder. Backports commit 4c97f5b2f0fa9b37f9ff497f15411d809e6fd098 from qemu	2019-11-28 02:42:00 -05:00
Richard Henderson	94968602b8	target/arm: Convert BX, BXJ, BLX (register) Backports commit 4ed95abd700e43dee8e032f754b53bec2b047f75 from qemu	2019-11-28 02:42:00 -05:00
Richard Henderson	831e17d970	target/arm: Convert Cyclic Redundancy Check Backports commit 6c35d53f1bde7fe327c074473c3048d6e6f15e95 from qemu	2019-11-28 02:42:00 -05:00
Richard Henderson	fdd135c7d2	target/arm: Convert MRS/MSR (banked, register) The m-profile and a-profile decodings overlap. Only return false for the case of wrong profile; handle UNDEFINED for permission failure directly. This ensures that we don't accidentally pass an insn that applies to the wrong profile. Backports commit d0b26644502103ca97093ef67749812dc1df7eea from qemu	2019-11-28 02:42:00 -05:00
Richard Henderson	571d879c49	target/arm: Convert MSR (immediate) and hints Backports commit 6313059623dc512308681ba160ed862ac387e2fb from qemu	2019-11-28 02:41:59 -05:00
Richard Henderson	a011318794	target/arm: Simplify op_smlawx for SMLAW* By shifting the 16-bit input left by 16, we can align the desired portion of the 48-bit product and use tcg_gen_muls2_i32. Backports commit 485b607d4f393e0de92c922806a68aef22340c98 from qemu	2019-11-28 02:40:01 -05:00
Richard Henderson	201be7b8b1	target/arm: Simplify op_smlaxxx for SMLAL* Since all of the inputs and outputs are i32, dispense with the intermediate promotion to i64 and use tcg_gen_add2_i32. Backports commit ea96b374641bc429269096d88d4e91ee544273e9 from qemu	2019-11-28 02:40:00 -05:00
Richard Henderson	543b598d45	target/arm: Convert Halfword multiply and multiply accumulate Backports commit 26c6923de7131fa1cf223ab67131d1992dc17001 from qemu	2019-11-28 02:40:00 -05:00
Richard Henderson	44416a6794	target/arm: Convert Saturating addition and subtraction Backports commit 6d0730a82417e3a4a1911eb8e0246f3ba996f932 from qemu	2019-11-28 02:40:00 -05:00
Richard Henderson	45566b2780	target/arm: Simplify UMAAL Since all of the inputs and outputs are i32, dispense with the intermediate promotion to i64 and use tcg_gen_mulu2_i32 and tcg_gen_add2_i32. Backports commit 2409d56454f0d028619fb1002eda86bf240906dd from qemu	2019-11-28 02:40:00 -05:00
Richard Henderson	5e5ae4c0d0	target/arm: Convert multiply and multiply accumulate Backports commit bd92fe353bda4412ffc46c0f7415207a684b45f2 from qemu	2019-11-28 02:40:00 -05:00
Richard Henderson	677cf191d2	target/arm: Convert Data Processing (immediate) Convert the modified immediate form of the data processing insns. For A32, we can finally remove any code that was intertwined with the register and register-shifted-register forms. Backports commit 581c6ebd17c8f56ad52772216e6c6d8cc8997e8b from qemu	2019-11-28 02:39:16 -05:00
Richard Henderson	1b21ced6a1	target/arm: Convert Data Processing (reg-shifted-reg) Convert the register shifted by register form of the data processing insns. For A32, we cannot yet remove any code because the legacy decoder intertwines the immediate form. Backports commit 5be2c12337f4cbdbda4efe6ab485350f730faaad from qemu	2019-11-28 02:39:16 -05:00
Richard Henderson	e151696a65	target/arm: Convert Data Processing (register) Convert the register shifted by immediate form of the data processing insns. For A32, we cannot yet remove any code because the legacy decoder intertwines the reg-shifted-reg and immediate forms. Backports commit 25ae32c558182c07fc6ad01b936e9151cbf00c44 from qemu	2019-11-28 02:38:58 -05:00
Richard Henderson	9fc793b566	target/arm: Add stubs for aa32 decodetree Add the infrastructure that will become the new decoder. No instructions adjusted so far. Backports commit 51409b9e8cfe997b1ac3365df7400e0c6e844437 from qemu	2019-11-28 02:38:49 -05:00
Richard Henderson	6ec6c71d50	target/arm: Use store_reg_from_load in thumb2 code This function already includes the test for an interworking write to PC from a load. Change the T32 LDM implementation to match the A32 LDM implementation. For LDM, the reordering of the tests does not change valid behaviour because the only case that differs is has rn == 15, which is UNPREDICTABLE. Backports commit 69be3e13764111737e1a7a13bb0c231e4d5be756 from qemu	2019-11-28 02:38:42 -05:00
Richard Henderson	46a8dfff59	target/arm: Fix SMMLS argument order The previous simplification got the order of operands to the subtraction wrong. Since the 64-bit product is the subtrahend, we must use a 64-bit subtract to properly compute the borrow from the low-part of the product. Fixes: 5f8cd06ebcf5 ("target/arm: Simplify SMMLA, SMMLAR, SMMLS, SMMLSR") Backports commit e0a0c8322b8ebcdad674f443a3e86db8708d6738 from qemu	2019-11-20 17:24:44 -05:00
Peter Maydell	9fb54a7f72	target/arm: Take exceptions on ATS instructions when needed The translation table walk for an ATS instruction can result in various faults. In general these are just reported back via the PAR_EL1 fault status fields, but in some cases the architecture requires that the fault is turned into an exception: * synchronous stage 2 faults of any kind during AT S1E0* and AT S1E1* instructions executed from NS EL1 fault to EL2 or EL3 * synchronous external aborts are taken as Data Abort exceptions (This is documented in the v8A Arm ARM DDI0487A.e D5.2.11 and G5.13.4.) Backports commit 0710b2fa84a4aeb925422e1e88edac49ed407c79 from qemu	2019-11-20 17:24:44 -05:00
Peter Maydell	56b54f361e	target/arm: Allow ARMCPRegInfo read/write functions to throw exceptions Currently the only part of an ARMCPRegInfo which is allowed to cause a CPU exception is the access function, which returns a value indicating that some flavour of UNDEF should be generated. For the ATS system instructions, we would like to conditionally generate exceptions as part of the writefn, because some faults during the page table walk (like external aborts) should cause an exception to be raised rather than returning a value. There are several ways we could do this: * plumb the GETPC() value from the top level set_cp_reg/get_cp_reg helper functions through into the readfn and writefn hooks * add extra readfn_with_ra/writefn_with_ra hooks that take the GETPC() value * require the ATS instructions to provide a dummy accessfn, which serves no purpose except to cause the code generation to emit TCG ops to sync the CPU state * add an ARM_CP_ flag to mark the ARMCPRegInfo as possibly throwing an exception in its read/write hooks, and make the codegen sync the CPU state before calling the hooks if the flag is set This patch opts for the last of these, as it is fairly simple to implement and doesn't require invasive changes like updating the readfn/writefn hook function prototype signature. Backports commit 37ff584c15bc3e1dd2c26b1998f00ff87189538c from qemu	2019-11-20 17:24:37 -05:00
Richard Henderson	87c06b7fae	target/arm: Factor out unallocated_encoding for aarch32 Make this a static function private to translate.c. Thus we can use the same idiom between aarch64 and aarch32 without actually sharing function implementations. Backports commit 1ce21ba1eaf08b22da5925f3e37fc0b4322da858 from qemu	2019-11-18 23:51:45 -05:00
Richard Henderson	1f59a43544	Revert "target/arm: Use unallocated_encoding for aarch32" Despite the fact that the text for the call to gen_exception_insn is identical for aarch64 and aarch32, the implementation inside gen_exception_insn is totally different. This fixes exceptions raised from aarch64. This reverts commit `fb2d3c9a9a`.	2019-11-18 23:49:47 -05:00
Richard Henderson	9d2a3064af	target/arm: Use tcg_gen_extrh_i64_i32 to extract the high word Separate shift + extract low will result in one extra insn for hosts like RISC-V, MIPS, and Sparc. Backports commit 664b7e3b97d6376f3329986c465b3782458b0f8b from qemu	2019-11-18 20:36:19 -05:00
Richard Henderson	93c016a3e7	target/arm: Simplify SMMLA, SMMLAR, SMMLS, SMMLSR All of the inputs to these instructions are 32-bits. Rather than extend each input to 64-bits and then extract the high 32-bits of the output, use tcg_gen_muls2_i32 and other 32-bit generator functions. Backports commit 5f8cd06ebcf57420be8fea4574de2e074de46709 from qemu	2019-11-18 20:31:12 -05:00
Richard Henderson	4a1cc16eef	target/arm: Use tcg_gen_rotri_i32 for gen_swap_half Rotate is the more compact and obvious way to swap 16-bit elements of a 32-bit word. Backports commit adefba76e8bf10dfb342094d2f5debfeedb1a74d from qemu	2019-11-18 20:27:12 -05:00
Richard Henderson	751ab7b24b	target/arm: Use ror32 instead of open-coding the operation The helper function is more documentary, and also already handles the case of rotate by zero. Backports commit dd861b3f29be97a9e3cdb9769dcbc0c7d7825185 from qemu	2019-11-18 20:25:51 -05:00
Richard Henderson	df4c773ed2	target/arm: Remove redundant shift tests The immediate shift generator functions already test for, and eliminate, the case of a shift by zero. Backports commit 464eaa9571fae5867d9aea7d7209c091c8a50223 from qemu	2019-11-18 20:24:39 -05:00
Richard Henderson	4dd30ebfbd	target/arm: Use tcg_gen_deposit_i32 for PKHBT, PKHTB Use deposit as the composit operation to merge the bits from the two inputs. Backports commit d1f8755fc93911f5b27246b1da794542d222fa1b from qemu	2019-11-18 20:22:00 -05:00
Richard Henderson	25ccd28e78	target/arm: Use tcg_gen_extract_i32 for shifter_out_im Extract is a compact combination of shift + and. Backports commit 191f4bfe8d6cf0c7d5cd7f84cd7076e32e3745dd from qemu	2019-11-18 20:19:40 -05:00
Andrew Jones	ad63ee7509	target/arm/cpu: Use div-round-up to determine predicate register array size Unless we're guaranteed to always increase ARM_MAX_VQ by a multiple of four, then we should use DIV_ROUND_UP to ensure we get an appropriate array size. Backports commit 46417784d21c89446763f2047228977bdc267895 from qemu	2019-11-18 20:16:46 -05:00
Andrew Jones	bb8b3bc42b	target/arm/helper: zcr: Add build bug next to value range assumption The current implementation of ZCR_ELx matches the architecture, only implementing the lower four bits, with the rest RAZ/WI. This puts a strict limit on ARM_MAX_VQ of 16. Make sure we don't let ARM_MAX_VQ grow without a corresponding update here. Backports commit 7b351d98709d3f77d6bb18562e1bf228862b0d57 from qemu	2019-11-18 20:14:42 -05:00
Richard Henderson	3d3d56056b	target/arm: Remove helper_double_saturate Replace x = double_saturate(y) with x = add_saturate(y, y). There is no need for a separate more specialized helper. Backports commit 640581a06d14e2d0d3c3ba79b916de6bc43578b0 from qemu	2019-11-18 20:13:21 -05:00
Richard Henderson	fb2d3c9a9a	target/arm: Use unallocated_encoding for aarch32 Promote this function from aarch64 to fully general use. Use it to unify the code sequences for generating illegal opcode exceptions. Backports commit 3cb36637157088892e9e33ddb1034bffd1251d3b from qemu	2019-11-18 20:10:50 -05:00
Richard Henderson	d562bea784	target/arm: Remove offset argument to gen_exception_bkpt_insn Unlike the other more generic gen_exception{,_internal}_insn interfaces, breakpoints always refer to the current instruction. Backports commit 06bcbda3f64d464b6ecac789bce4bd69f199cd68 from qemu	2019-11-18 20:05:45 -05:00
Richard Henderson	f19b4df20d	target/arm: Replace offset with pc in gen_exception_internal_insn The offset is variable depending on the instruction set. Passing in the actual value is clearer in intent. Backpors commit aee828e7541a5895669ade3a4b6978382b6b094a from qemu	2019-11-18 20:05:23 -05:00
Richard Henderson	00fbadf637	target/arm: Replace s->pc with s->base.pc_next We must update s->base.pc_next when we return from the translate_insn hook to the main translator loop. By incrementing s->base.pc_next immediately after reading the insn word, "pc_next" contains the address of the next instruction throughout translation. All remaining uses of s->pc are referencing the address of the next insn, so this is now a simple global replacement. Remove the "s->pc" field. Backports commit a04159166b880b505ccadc16f2fe84169806883d from qemu	2019-11-18 17:32:53 -05:00
Richard Henderson	7d1fcef722	target/arm: Remove redundant s->pc & ~1 The thumb bit has already been removed from s->pc, and is always even. Backports commit 4818c3743b0e0095fdcecd24457da9b3443730ab from qemu	2019-11-18 17:32:53 -05:00
Richard Henderson	a2e60445de	target/arm: Introduce add_reg_for_lit Provide a common routine for the places that require ALIGN(PC, 4) as the base address as opposed to plain PC. The two are always the same for A32, but the difference is meaningful for thumb mode. Backports commit 16e0d8234ef9291747332d2c431e46808a060472 from qemu	2019-11-18 17:32:49 -05:00
Richard Henderson	1c0914e58c	target/arm: Introduce read_pc We currently have 3 different ways of computing the architectural value of "PC" as seen in the ARM ARM. The value of s->pc has been incremented past the current insn, but that is all. Thus for a32, PC = s->pc + 4; for t32, PC = s->pc; for t16, PC = s->pc + 2. These differing computations make it impossible at present to unify the various code paths. With the newly introduced s->pc_curr, we can compute the correct value for all cases, using the formula given in the ARM ARM. This changes the behaviour for load_reg() and load_reg_var() when called with reg==15 from a 32-bit Thumb instruction: previously they would have returned the incorrect value of pc_curr + 6, and now they will return the architecturally correct value of PC, which is pc_curr + 4. This will not affect well-behaved guest software, because all of the places we call these functions from T32 code are instructions where using r15 is UNPREDICTABLE. Using the architectural PC value here is more consistent with the T16 and A32 behaviour. Backports commit fdbcf6329d0c2984c55d7019419a72bf8e583c36 from qemu	2019-11-18 17:04:50 -05:00
Richard Henderson	0048f3e887	target/arm: Introduce pc_curr Add a new field to retain the address of the instruction currently being translated. The 32-bit uses are all within subroutines used by a32 and t32. This will become less obvious when t16 support is merged with a32+t32, and having a clear definition will help. Convert aarch64 as well for consistency. Note that there is one instance of a pre-assert fprintf that used the wrong value for the address of the current instruction. Backports commit 43722a6d4f0c92f7e7e1e291580039b0f9789df1 from qemu	2019-11-18 16:58:40 -05:00
Richard Henderson	1aa3c685a8	target/arm: Pass in pc to thumb_insn_is_16bit This function is used in two different contexts, and it will be clearer if the function is given the address to which it applies. Backports commit 331b1ca616cb708db30dab68e3262d286e687f24 from qemu	2019-11-18 16:52:35 -05:00
Peter Maydell	c61e22627d	target/arm: Fix routing of singlestep exceptions When generating an architectural single-step exception we were routing it to the "default exception level", which is to say the same exception level we execute at except that EL0 exceptions go to EL1. This is incorrect because the debug exception level can be configured by the guest for situations such as single stepping of EL0 and EL1 code by EL2. We have to track the target debug exception level in the TB flags, because it is dependent on CPU state like HCR_EL2.TGE and MDCR_EL2.TDE. (That we were previously calling the arm_debug_target_el() function to determine dc->ss_same_el is itself a bug, though one that would only have manifested as incorrect syndrome information.) Since we are out of TB flag bits unless we want to expand into the cs_base field, we share some bits with the M-profile only HANDLER and STACKCHECK bits, since only A-profile has this singlestep. Fixes: https://bugs.launchpad.net/qemu/+bug/1838913 Backports commit 8bd587c1066f4456ddfe611b571d9439a947d74c from qemu	2019-11-18 16:50:15 -05:00
Peter Maydell	3f531fac61	target/arm: Factor out 'generate singlestep exception' function Factor out code to 'generate a singlestep exception', which is currently repeated in four places. To do this we need to also pull the identical copies of the gen-exception() function out of translate-a64.c and translate.c into translate.h. (There is a bug in the code: we're taking the exception to the wrong target EL. This will be simpler to fix if there's only one place to do it.) Backports commit c1d5f50f094ab204accfacc2ee6aafc9601dd5c4 from qemu	2019-11-18 16:47:08 -05:00
Alex Bennée	0d6ed39333	target/arm: generate a custom MIDR for -cpu max While most features are now detected by probing the ID_* registers kernels can (and do) use MIDR_EL1 for working out of they have to apply errata. This can trip up warnings in the kernel as it tries to work out if it should apply workarounds to features that don't actually exist in the reported CPU type. Avoid this problem by synthesising our own MIDR value. Backports commit 2bd5f41c00686a1f847a60824d0375f3df2c26bf from qemu	2019-11-18 16:42:51 -05:00
Christophe Lyon	8264cb84fe	target/arm: Allow reading flags from FPSCR for M-profile rt==15 is a special case when reading the flags: it means the destination is APSR. This patch avoids rejecting vmrs apsr_nzcv, fpscr as illegal instruction. Backports commit cdc6896659b85f7ed8f7552850312e55170de0c5 from qemu	2019-11-18 16:32:06 -05:00
Peter Maydell	3fc86e1901	target/arm: Don't abort on M-profile exception return in linux-user mode An attempt to do an exception-return (branch to one of the magic addresses) in linux-user mode for M-profile should behave like a normal branch, because linux-user mode is always going to be in 'handler' mode. This used to work, but we broke it when we added support for the M-profile security extension in commit d02a8698d7ae2bfed. In that commit we allowed even handler-mode calls to magic return values to be checked for and dealt with by causing an EXCP_EXCEPTION_EXIT exception to be taken, because this is needed for the FNC_RETURN return-from-non-secure-function-call handling. For system mode we added a check in do_v7m_exception_exit() to make any spurious calls from Handler mode behave correctly, but forgot that linux-user mode would also be affected. How an attempted return-from-non-secure-function-call in linux-user mode should be handled is not clear -- on real hardware it would result in return to secure code (not to the Linux kernel) which could then handle the error in any way it chose. For QEMU we take the simple approach of treating this erroneous return the same way it would be handled on a CPU without the security extensions -- treat it as a normal branch. The upshot of all this is that for linux-user mode we should never do any of the bx_excret magic, so the code change is simple. This ought to be a weird corner case that only affects broken guest code (because Linux user processes should never be attempting to do exception returns or NS function returns), except that the code that assigns addresses in RAM for the process and stack in our linux-user code does not attempt to avoid this magic address range, so legitimate code attempting to return to a trampoline routine on the stack can fall into this case. This change fixes those programs, but we should also look at restricting the range of memory we use for M-profile linux-user guests to the area that would be real RAM in hardware. Backports commit 9027d3fba605d8f6093342ebe4a1da450d374630 from qemu	2019-11-18 16:30:43 -05:00
Peter Maydell	8f7f19ce43	target/arm: Free TCG temps in trans_VMOV_64_sp() The function neon_store_reg32() doesn't free the TCG temp that it is passed, so the caller must do that. We got this right in most places but forgot to free the TCG temps in trans_VMOV_64_sp(). Backports commit 38fb634853ac6547326d9f88b9a068d9fc6b4ad4 from qemu	2019-11-18 16:27:21 -05:00
Peter Maydell	c6041bf94b	target/arm: Avoid bogus NSACR traps on M-profile without Security Extension In Arm v8.0 M-profile CPUs without the Security Extension and also in v7M CPUs, there is no NSACR register. However, the code we have to handle the FPU does not always check whether the ARM_FEATURE_M_SECURITY bit is set before testing whether env->v7m.nsacr permits access to the FPU. This means that for a CPU with an FPU but without the Security Extension we would always take a bogus fault when trying to stack the FPU registers on an exception entry. We could fix this by adding extra feature bit checks for all uses, but it is simpler to just make the internal value of nsacr 0xcff ("all non-secure accesses allowed"), since this is not guest visible when the Security Extension is not present. This allows us to continue to follow the Arm ARM pseudocode which takes a similar approach. (In particular, in the v8.1 Arm ARM the register is documented as reading as 0xcff in this configuration.) Fixes: https://bugs.launchpad.net/qemu/+bug/1838475 Backports commit 02ac2f7f613b47f6a5b397b20ab0e6b2e7fb89fa from qemu	2019-08-08 19:56:56 -04:00
Lioncash	59d808cf21	target/arm: Supply uc_struct instance to tcg_enabled()	2019-08-08 19:55:12 -04:00
Peter Maydell	ecd3f0a5df	target/arm: Deliver BKPT/BRK exceptions to correct exception level Most Arm architectural debug exceptions (eg watchpoints) are ignored if the configured "debug exception level" is below the current exception level (so for example EL1 can't arrange to get debug exceptions for EL2 execution). Exceptions generated by the BRK or BPKT instructions are a special case -- they must always cause an exception, so if we're executing above the debug exception level then we must take them to the current exception level. This fixes a bug where executing BRK at EL2 could result in an exception being taken at EL1 (which is strictly forbidden by the architecture). Fixes: https://bugs.launchpad.net/qemu/+bug/1838277 Backports commit 987a23224218fa3bb3aa0024ad236dcf29ebde9e from qemu	2019-08-08 19:53:30 -04:00
Peter Maydell	fbbd582fb9	target/arm: Limit ID register assertions to TCG In arm_cpu_realizefn() we make several assertions about the values of guest ID registers: * if the CPU provides AArch32 v7VE or better it must advertise the ARM_DIV feature * if the CPU provides AArch32 A-profile v6 or better it must advertise the Jazelle feature These are essentially consistency checks that our ID register specifications in cpu.c didn't accidentally miss out a feature, because increasingly the TCG emulation gates features on the values in ID registers rather than using old-style checks of ARM_FEATURE_FOO bits. Unfortunately, these asserts can cause problems if we're running KVM, because in that case we don't control the values of the ID registers -- we read them from the host kernel. In particular, if the host kernel is older than 4.15 then it doesn't expose the ID registers via the KVM_GET_ONE_REG ioctl, and we set up dummy values for some registers and leave the rest at zero. (See the comment in target/arm/kvm64.c kvm_arm_get_host_cpu_features().) This set of dummy values is not sufficient to pass our assertions, and so on those kernels running an AArch32 guest on AArch64 will assert. We could provide a more sophisticated set of dummy ID registers in this case, but that still leaves the possibility of a host CPU which reports bogus ID register values that would cause us to assert. It's more robust to only do these ID register checks if we're using TCG, as that is the only case where this is truly a QEMU code bug. Backports commit 8f4821d77e465bc2ef77302d47640d5a43d92b30 from qemu	2019-08-08 19:44:16 -04:00
Philippe Mathieu-Daudé	9bd010263a	target/arm: Add missing break statement for Hypervisor Trap Exception Reported by GCC9 when building with -Wimplicit-fallthrough=2: target/arm/helper.c: In function ‘arm_cpu_do_interrupt_aarch32_hyp’: target/arm/helper.c:7958:14: error: this statement may fall through [-Werror=implicit-fallthrough=] 7958 \| addr = 0x14; \| ~~~~~^~~~~~ target/arm/helper.c:7959:5: note: here 7959 \| default: \| ^~~~~~~ cc1: all warnings being treated as errors Backports commit 9bbb4ef991fa93323f87769a6e217c2b9273a128 from qemu	2019-08-08 19:43:01 -04:00
Peter Maydell	cdb9422f3a	target/arm: NS BusFault on vector table fetch escalates to NS HardFault In the M-profile architecture, when we do a vector table fetch and it fails, we need to report a HardFault. Whether this is a Secure HF or a NonSecure HF depends on several things. If AIRCR.BFHFNMINS is 0 then HF is always Secure, because there is no NonSecure HardFault. Otherwise, the answer depends on whether the 'underlying exception' (MemManage, BusFault, SecureFault) targets Secure or NonSecure. (In the pseudocode, this is handled in the Vector() function: the final exc.isSecure is calculated by looking at the exc.isSecure from the exception returned from the memory access, not the isSecure input argument.) We weren't doing this correctly, because we were looking at the target security domain of the exception we were trying to load the vector table entry for. This produces errors of two kinds: * a load from the NS vector table which hits the "NS access to S memory" SecureFault should end up as a Secure HardFault, but we were raising an NS HardFault * a load from the S vector table which causes a BusFault should raise an NS HardFault if BFHFNMINS == 1 (because in that case all BusFaults are NonSecure), but we were raising a Secure HardFault Correct the logic. We also fix a comment error where we claimed that we might be escalating MemManage to HardFault, and forgot about SecureFault. (Vector loads can never hit MPU access faults, because they're always aligned and always use the default address map.) Backports commit 51c9122e92b776a3f16af0b9282f1dc5012e2a19 from qemu	2019-08-08 19:32:53 -04:00
Peter Maydell	8ec683b874	target/arm: Set VFP-related MVFR0 fields for arm926 and arm1026 The ARMv5 architecture didn't specify detailed per-feature ID registers. Now that we're using the MVFR0 register fields to gate the existence of VFP instructions, we need to set up the correct values in the cpu->isar structure so that we still provide an FPU to the guest. This fixes a regression in the arm926 and arm1026 CPUs, which are the only ones that both have VFP and are ARMv5 or earlier. This regression was introduced by the VFP refactoring, and more specifically by commits 1120827fa182f0e76 and 266bd25c485597c, which accidentally disabled VFP short-vector support and double-precision support on these CPUs. Backports commit cb7cef8b32033f6284a47d797edd5c19c5491698 from qemu	2019-08-08 19:29:56 -04:00
Alex Bennée	f893ff0b89	target/arm: report ARMv8-A FP support for AArch32 -cpu max When we converted to using feature bits in 602f6e42cfbf we missed out the fact (dp && arm_dc_feature(s, ARM_FEATURE_V8)) was supported for -cpu max configurations. This caused a regression in the GCC test suite. Fix this by setting the appropriate bits in mvfr1.FPHP to report ARMv8-A with FP support (but not ARMv8.2-FP16). Fixes: https://bugs.launchpad.net/qemu/+bug/1836078 Backports commit 45b1a243b81a7c9ae56235937280711dd9914ca7 from qemu	2019-08-08 19:28:39 -04:00
Philippe Mathieu-Daudé	6f8c8046d8	target/arm/vfp_helper: Call set_fpscr_to_host before updating to FPSCR In commit e9d652824b0 we extracted the vfp_set_fpscr_to_host() function but failed at calling it in the correct place, we call it after xregs[ARM_VFP_FPSCR] is modified. Fix by calling this function before we update FPSCR. Backports commit 85795187f416326f87177cabc39fae1911f04c50 from qemu	2019-08-08 19:21:28 -04:00
Richard Henderson	c687259bf6	target/arm: Fix sve_zcr_len_for_el Off by one error in the EL2 and EL3 tests. Remove the test against EL3 entirely, since it must always be true. Backports commit 6a02a73211c5bc634fccd652777230954b83ccba from qemu	2019-08-08 19:20:35 -04:00
Peter Maydell	1f4c3d6bcc	target/arm: Correct VMOV_imm_dp handling of short vectors Coverity points out (CID 1402195) that the loop in trans_VMOV_imm_dp() that iterates over the destination registers in a short-vector VMOV accidentally throws away the returned updated register number from vfp_advance_dreg(). Add the missing assignment. (We got this correct in trans_VMOV_imm_sp().) Backports commit 89a11ff756410aecb87d2c774df6e45dbf4105c1 from qemu	2019-08-08 18:08:55 -04:00
Peter Maydell	0d89bce217	target/arm: Execute Thumb instructions when their condbits are 0xf Thumb instructions in an IT block are set up to be conditionally executed depending on a set of condition bits encoded into the IT bits of the CPSR/XPSR. The architecture specifies that if the condition bits are 0b1111 this means "always execute" (like 0b1110), not "never execute"; we were treating it as "never execute". (See the ConditionHolds() pseudocode in both the A-profile and M-profile Arm ARM.) This is a bit of an obscure corner case, because the only legal way to get to an 0b1111 set of condbits is to do an exception return which sets the XPSR/CPSR up that way. An IT instruction which encodes a condition sequence that would include an 0b1111 is UNPREDICTABLE, and for v8A the CONSTRAINED UNPREDICTABLE choices for such an IT insn are to NOP, UNDEF, or treat 0b1111 like 0b1110. Add a comment noting that we take the latter option. Backports commit 5529de1e5512c05276825fa8b922147663fd6eac from qemu	2019-08-08 18:07:57 -04:00
Peter Maydell	9d01d50db8	target/arm: Use _ra versions of cpu_stl_data() in v7M helpers In the various helper functions for v7M/v8M instructions, use the _ra versions of cpu_stl_data() and friends. Otherwise we may get wrong behaviour or an assert() due to not being able to locate the TB if there is an exception on the memory access or if it performs an IO operation when in icount mode Backports commit 2884fbb60412049ec92389039ae716b32057382e from qemu	2019-08-08 18:06:23 -04:00
Philippe Mathieu-Daudé	bde186433d	target/arm/helper: Move M profile routines to m_helper.c In preparation for supporting TCG disablement on ARM, we move most of TCG related v7m/v8m helpers and APIs into their own file. Note: It is easier to review this commit using the 'histogram' diff algorithm: $ git diff --diff-algorithm=histogram ... or $ git diff --histogram ... Backports commit 7aab5a8c8bb525ea390b4ebc17ab82c0835cfdb6 from qemu	2019-08-08 18:04:08 -04:00
Philippe Mathieu-Daudé	199e2f8a7d	target/arm: Restrict semi-hosting to TCG Semihosting hooks either SVC or HLT instructions, and inside KVM both of those go to EL1, ie to the guest, and can't be trapped to KVM. Let check_for_semihosting() return False when not running on TCG. backports commit 91f78c58da9ba78c8ed00f5d822b701765be8499 from qemu	2019-08-08 17:48:34 -04:00
Philippe Mathieu-Daudé	6295fd7156	target/arm: Move debug routines to debug_helper.c These routines are TCG specific. Backports commit 9dd5cca42448770a940fa2145f1ff18cdc7b01a9 from qemu	2019-08-08 17:46:56 -04:00
Philippe Mathieu-Daudé	fa2a772c7b	target/arm: Declare some M-profile functions publicly In the next commit we will split the M-profile functions from this file. Some function will be called out of helper.c. Declare them in the "internals.h" header. Backports commit 787a7e76c2e93a48c47b324fea592c9910a70483 from qemu	2019-08-08 15:37:01 -04:00
Philippe Mathieu-Daudé	4eacf0ce98	target/arm: Restrict PSCI to TCG Under KVM, the kernel gets the HVC call and handle the PSCI requests. Backports commit 21fbea8c8af72817d8e21570fa4edbfae417341b from qemu	2019-08-08 15:32:19 -04:00
Philippe Mathieu-Daudé	5aaf004bb2	target/arm/vfp_helper: Restrict the SoftFloat use to TCG This code is specific to the SoftFloat floating-point implementation, which is only used by TCG. Backports commit 4a15527c9feecfd2fa2807d5e698abbc19feb35f from qemu	2019-08-08 15:30:47 -04:00
Philippe Mathieu-Daudé	4e214e5d64	target/arm/vfp_helper: Extract vfp_set_fpscr_from_host() The vfp_set_fpscr() helper contains code specific to the host floating point implementation (here the SoftFloat library). Extract this code to vfp_set_fpscr_from_host(). Backports commit 0c6ad94809b37a1f0f1f75d3cd0e4a24fb77e65c from qemu	2019-08-08 15:28:38 -04:00
Philippe Mathieu-Daudé	2b5d731f56	target/arm/vfp_helper: Extract vfp_set_fpscr_to_host() The vfp_set_fpscr() helper contains code specific to the host floating point implementation (here the SoftFloat library). Extract this code to vfp_set_fpscr_to_host(). Backports commit e9d652824b05845f143ef4797d707fae47d4b3ed from qemu	2019-08-08 15:27:27 -04:00
Philippe Mathieu-Daudé	76bae63014	target/arm/vfp_helper: Move code around To ease the review of the next commit, move the vfp_exceptbits_to_host() function directly after vfp_exceptbits_from_host(). Amusingly the diff shows we are moving vfp_get_fpscr(). Backports commit 20e62dd8c831c9065ed4a8e64813c93ad61c50d7 from qemu.	2019-08-08 15:26:03 -04:00
Philippe Mathieu-Daudé	91e264823e	target/arm: Move TLB related routines to tlb_helper.c These routines are TCG specific. The arm_deliver_fault() function is only used within the new helper. Make it static. Backports commit e21b551cb652663f2f2405a64d63ef6b4a1042b7 from qemu	2019-08-08 15:24:26 -04:00
Philippe Mathieu-Daudé	1af5deaf52	target/arm: Declare get_phys_addr() function publicly In the next commit we will split the TLB related routines of this file, and this function will also be called in the new file. Declare it in the "internals.h" header. Backports commit ebae861fc6c385a7bcac72dde4716be06e6776f1 from qemu	2019-08-08 15:14:45 -04:00
Samuel Ortiz	9296465289	target/arm: Move the DC ZVA helper into op_helper Those helpers are a software implementation of the ARM v8 memory zeroing op code. They should be moved to the op helper file, which is going to eventually be built only when TCG is enabled. Backports commit 6cdca173ef81a9dbcee9e142f1a5a34ad9c44b75 from qemu	2019-08-08 15:10:46 -04:00
Philippe Mathieu-Daudé	f77b60d7e9	target/arm: Fix coding style issues Since we'll move this code around, fix its style first. Backports commit 9798ac7162c8a720c5d28f4d1fc9e03c7ab4f015 from qemu	2019-08-08 15:05:57 -04:00
Philippe Mathieu-Daudé	4f71266524	target/arm: Fix multiline comment syntax Since commit 8c06fbdf36b checkpatch.pl enforce a new multiline comment syntax. Since we'll move this code around, fix its style first. Backports commit 9a223097e44d5320f5e0546710263f22d11f12fc from qemu	2019-08-08 15:05:37 -04:00
Philippe Mathieu-Daudé	0a5152caf8	target/arm/helper: Remove unused include Backports commit 2c8ec397f8438eea5e52be4898dfcf12a1f88267 from qemu	2019-08-08 14:43:57 -04:00
Philippe Mathieu-Daudé	8a0bae93ff	target/arm: Add copyright boilerplate Backports commit ed3baad15b0b0edc480373f9c1d805d6b8e7e78c from qemu	2019-08-08 14:43:32 -04:00
Philippe Mathieu-Daudé	b028a37fdb	target/arm: Makefile cleanup (softmmu) Group SOFTMMU objects together. Since PSCI is TCG specific, keep it separate. Backports commit b601e0cd78c2c57548a468c6fdb566d514c05b89 from qemu	2019-08-08 14:42:00 -04:00
Philippe Mathieu-Daudé	348b813c03	target/arm: Makefile cleanup (ARM) Group ARM objects together, TCG related ones at the bottom. This will help when restricting TCG-only objects. Backports commit 07774d584267488c8c2f104ae5b552791961908a from qemu	2019-08-08 14:39:52 -04:00
Lioncash	76d33b34e1	target/arm: Fix bad patch merge in arm_tr_init_disas_context	2019-08-08 14:37:38 -04:00
Philippe Mathieu-Daudé	5c0cff59f2	target/arm: Makefile cleanup (Aarch64) Group Aarch64 rules together, TCG related ones at the bottom. This will help when restricting TCG-only objects. Backports commit 87f4f183484dba7a460c59e99dac0dbb9f42ed87 from qemu	2019-08-08 14:31:24 -04:00
Peter Maydell	fa19f96e8c	target/arm: Check for dp support for dp VFM, not sp In commit 1120827fa182f0e7622 we accidentally put the "UNDEF unless FPU has double-precision support" check in the single-precision VFM function. Put it in the dp function where it belongs. Backports commit 34bea4edb9bbe8edf4b8606276482acdff5ca58b from qemu	2019-06-25 18:56:34 -05:00
Peter Maydell	dc1f2247ec	target/arm: Only implement doubles if the FPU supports them The architecture permits FPUs which have only single-precision support, not double-precision; Cortex-M4 and Cortex-M33 are both like that. Add the necessary checks on the MVFR0 FPDP field so that we UNDEF any double-precision instructions on CPUs like this. Note that even if FPDP==0 the insns like VMOV-to/from-gpreg, VLDM/VSTM, VLDR/VSTR which take double precision registers still exist. Backports commit 1120827fa182f0e76226df7ffe7a86598d1df54f from qemu	2019-06-25 18:55:25 -05:00
Peter Maydell	cfac686c95	target/arm: Fix typos in trans function prototypes In several places cut and paste errors meant we were using the wrong type for the 'arg' struct in trans_ functions called by the decodetree decoder, because we were using the _sp version of the struct in the _dp function. These were harmless, because the two structs were identical and so decodetree made them typedefs of the same underlying structure (and we'd have had a compile error if they were not harmless), but we should clean them up anyway. Backports commit 83655223ac6143a563e981906ce13fd6f2cfbefd from qemu	2019-06-25 18:48:34 -05:00
Peter Maydell	318a1ddf39	target/arm: Remove unused cpu_F0s, cpu_F0d, cpu_F1s, cpu_F1d Remove the now unused TCG globals cpu_F0s, cpu_F0d, cpu_F1s, cpu_F1d. cpu_M0 is still used by the iwmmxt code, and cpu_V0 and cpu_V1 are used by both iwmmxt and Neon. Backports commit d9eea52c67c04c58ecceba6ffe5a93d1d02051fa from qemu	2019-06-25 18:45:53 -05:00
Peter Maydell	74168c20f2	target/arm: Stop using deprecated functions in NEON_2RM_VCVT_F32_F16 Remove some old constructns from NEON_2RM_VCVT_F16_F32 code: * don't use CPU_F0s * don't use tcg_gen_st_f32 Backports commit b66f6b9981004bbf120b8d17c20f92785179bdf2 from qemu	2019-06-25 18:43:40 -05:00
Peter Maydell	8ae25f6e4c	target/arm: stop using deprecated functions in NEON_2RM_VCVT_F16_F32 Remove some old constructs from NEON_2RM_VCVT_F16_F32 code: * don't use cpu_F0s * don't use tcg_gen_ld_f32 Backports commit 58f2682eee738e8890f9cfe858e0f4f68b00d45d from qemu	2019-06-25 18:39:43 -05:00
Peter Maydell	d419fbc270	target/arm: Stop using cpu_F0s in Neon VCVT fixed-point ops Stop using cpu_F0s in the Neon VCVT fixed-point operations. Backports commit c253dd7832bc6b4e140a0da56410a9336cce05bc from qemu	2019-06-25 18:35:33 -05:00
Peter Maydell	46216ae382	target/arm: Stop using cpu_F0s for Neon f32/s32 VCVT Stop using cpu_F0s for the Neon f32/s32 VCVT operations. Since this is the last user of cpu_F0s in the Neon 2rm-op loop, we can remove the handling code for it too. Backports commit 60737ed5785b9c1c6f1c85575dfdd1e9eec91878 from qemu	2019-06-25 18:32:32 -05:00
Peter Maydell	2fbe9c1d1d	target/arm: Stop using cpu_F0s for NEON_2RM_VRECPE_F and NEON_2RM_VRSQRTE_F Stop using cpu_F0s for NEON_2RM_VRECPE_F and NEON_2RM_VRSQRTE_F. Backports commit 9a011fece7201f8e268c982df8c7836f3335bbe6 from qemu	2019-06-25 18:29:22 -05:00
Peter Maydell	f82ea34369	target/arm: Stop using cpu_F0s for NEON_2RM_VCVT[ANPM][US] Stop using cpu_F0s for the NEON_2RM_VCVT[ANPM][US] ops. Backports commit 30bf0a018f6c706913c8c0ea57b386907f4229be from qemu	2019-06-25 18:28:03 -05:00
Peter Maydell	0d4535bf16	target/arm: Stop using cpu_F0s for NEON_2RM_VRINT* Switch NEON_2RM_VRINT* away from using cpu_F0s. Backports commit 3b52ad1fae804acdc2fdc41b418a65249beae430 from qemu	2019-06-25 18:26:24 -05:00
Peter Maydell	a62cbc7ac5	target/arm: Stop using cpu_F0s for NEON_2RM_VNEG_F Switch NEON_2RM_VABS_F away from using cpu_F0s. Backports commit cedcc96fc7c8e520a190a010ac97dbb53e57d7d2 from qemu	2019-06-25 18:24:01 -05:00
Peter Maydell	63d7f92eba	target/arm: Stop using cpu_F0s for NEON_2RM_VABS_F Where Neon instructions are floating point operations, we mostly use the old VFP utility functions like gen_vfp_abs() which work on the TCG globals cpu_F0s and cpu_F1s. The Neon for-each-element loop conditionally loads the inputs into either a plain old TCG temporary for most operations or into cpu_F0s for float operations, and similarly stores back either cpu_F0s or the temporary. Switch NEON_2RM_VABS_F away from using cpu_F0s, and update neon_2rm_is_float_op() accordingly. Backports commit fd8a68cdcf81d70eebf866a132e9780d4108da9c from qemu	2019-06-25 18:22:05 -05:00
Peter Maydell	ba0ddd3459	target/arm: Use vfp_expand_imm() for AArch32 VFP VMOV_imm The AArch32 VMOV (immediate) instruction uses the same VFP encoded immediate format we already handle in vfp_expand_imm(). Use that function rather than hand-decoding it. Backports commit 9bee50b498410ed6466018b26464d7384c7879e9 from qemu	2019-06-25 18:20:19 -05:00
Peter Maydell	b2dc290454	target/arm: Move vfp_expand_imm() to translate.[ch] We want to use vfp_expand_imm() in the AArch32 VFP decode; move it from the a64-only header/source file to the AArch32 one (which is always compiled even for AArch64). Backports commit d6a092d479333b5f20a647a912a31b0102d37335 from qemu	2019-06-25 18:17:49 -05:00
Peter Maydell	021da28bfd	target/arm: Fix short-vector increment behaviour For VFP short vectors, the VFP registers are divided into a series of banks: for single-precision these are s0-s7, s8-s15, s16-s23 and s24-s31; for double-precision they are d0-d3, d4-d7, ... d28-d31. Some banks are "scalar" meaning that use of a register within them triggers a pure-scalar or mixed vector-scalar operation rather than a full vector operation. The scalar banks are s0-s7, d0-d3 and d16-d19. When using a bank as part of a vector operation, we iterate through it, increasing the register number by the specified stride each time, and wrapping around to the beginning of the bank. Unfortunately our calculation of the "increment" part of this was incorrect: vd = ((vd + delta_d) & (bank_mask - 1)) \| (vd & bank_mask) will only do the intended thing if bank_mask has exactly one set high bit. For instance for doubles (bank_mask = 0xc), if we start with vd = 6 and delta_d = 2 then vd is updated to 12 rather than the intended 4. This only causes problems in the unlikely case that the starting register is not the first in its bank: if the register number doesn't have to wrap around then the expression happens to give the right answer. Fix this bug by abstracting out the "check whether register is in a scalar bank" and "advance register within bank" operations to utility functions which use the right bit masking operations Backports commit 18cf951af9a27ae573a6fa17f9d0c103f7b7679b from qemu	2019-06-13 19:44:27 -04:00
Peter Maydell	1a0d31c05e	target/arm: Convert float-to-integer VCVT insns to decodetree Convert the float-to-integer VCVT instructions to decodetree. Since these are the last unconverted instructions, we can delete the old decoder structure entirely now. Backports commit 3111bfc2da6ba0c8396dc97ca479942d711c6146 from qemu	2019-06-13 19:40:02 -04:00
Peter Maydell	f6c67559d4	target/arm: Convert VCVT fp/fixed-point conversion insns to decodetree Convert the VCVT (between floating-point and fixed-point) instructions to decodetree. Backports commit e3d6f4290c788e850c64815f0b3e331600a4bcc0 from qemu	2019-06-13 19:35:51 -04:00
Peter Maydell	c66d477359	target/arm: Convert VJCVT to decodetree Convert the VJCVT instruction to decodetree. Backports commit 92073e947487e2109f3dfebfeaa48d6323cbd981 from qemu	2019-06-13 19:31:35 -04:00
Peter Maydell	7be9e6f9b4	target/arm: Convert integer-to-float insns to decodetree Convert the VCVT integer-to-float instructions to decodetree. Backports commit 8fc9d8918cde342c71923e361b9f2193e36ed18b from qemu	2019-06-13 19:20:41 -04:00
Peter Maydell	e0e4f99103	target/arm: Convert double-single precision conversion insns to decodetree Convert the VCVT double/single precision conversion insns to decodetree. Backports commit 6ed7e49c3693ed8411773c4880f42b2932beb12d from qemu	2019-06-13 19:18:01 -04:00
Peter Maydell	ab9d0235ed	target/arm: Convert VFP round insns to decodetree Convert the VFP round-to-integer instructions VRINTR, VRINTZ and VRINTX to decodetree. These instructions were only introduced as part of the "VFP misc" additions in v8A, so we check this. The old decoder's implementation was incorrectly providing them even for v7A CPUs. Backports commit e25155f55dc4abb427a88dfe58bbbc550fe7d643 from qemu	2019-06-13 19:15:05 -04:00
Peter Maydell	9e842a0f2a	target/arm: Convert the VCVT-to-f16 insns to decodetree Convert the VCVTT and VCVTB instructions which convert from f32 and f64 to f16 to decodetree. Since we're no longer constrained to the old decoder's style using cpu_F0s and cpu_F0d we can perform a direct 16 bit store of the right half of the input single-precision register rather than doing a load/modify/store sequence on the full 32 bits. Backports commit cdfd14e86ab0b1ca29a702d13a8e4af2e902a9bf from qemu	2019-06-13 19:03:59 -04:00
Peter Maydell	7d927b2d0e	target/arm: Convert the VCVT-from-f16 insns to decodetree Convert the VCVTT, VCVTB instructions that deal with conversion from half-precision floats to f32 or 64 to decodetree. Since we're no longer constrained to the old decoder's style using cpu_F0s and cpu_F0d we can perform a direct 16 bit load of the right half of the input single-precision register rather than loading the full 32 bits and then doing a separate shift or sign-extension. Backports commit b623d803dda805f07aadcbf098961fde27315c19 from qemu	2019-06-13 19:00:23 -04:00
Peter Maydell	e6cc2616d2	target/arm: Convert VFP comparison insns to decodetree Convert the VFP comparison instructions to decodetree. Note that comparison instructions should not honour the VFP short-vector length and stride information: they are scalar-only operations. This applies to all the 2-operand instructions except for VMOV, VABS, VNEG and VSQRT. (In the old decoder this is implemented via the "if (op == 15 && rn > 3) { veclen = 0; }" check.) Backports commit 386bba2368842fc74388a3c1651c6c0c0c70adbd from qemu	2019-06-13 18:55:53 -04:00
Peter Maydell	a75a3e321f	target/arm: Convert VMOV (register) to decodetree Backports commit 17552b979ebb9848a534c25ebed18a1072710058 from qemu	2019-06-13 18:49:49 -04:00
Peter Maydell	ee30962891	target/arm: Convert VSQRT to decodetree Convert the VSQRT instruction to decodetree. Backports commit b8474540cbce4e2fa45010416375d1bcbe86dc15 from qemu	2019-06-13 18:47:32 -04:00
Peter Maydell	7aea3da6b7	target/arm: Convert VNEG to decodetree Convert the VNEG instruction to decodetree. Backports commit 1882651afdb0ca44f0631192fbe65a71c660d809 from qemu	2019-06-13 18:43:50 -04:00
Peter Maydell	1032d86ad3	target/arm: Convert VABS to decodetree Convert the VFP VABS instruction to decodetree. Unlike the 3-op versions, we don't pass fpst to the VFPGen2OpSPFn or VFPGen2OpDPFn because none of the operations which use this format and support short vectors will need it. Backports commit 90287e22c987e9840704345ed33d237cbe759dd9 from qemu	2019-06-13 18:41:43 -04:00
Peter Maydell	7a16bc6876	target/arm: Convert VMOV (imm) to decodetree Convert the VFP VMOV (immediate) instruction to decodetree. Backports commit b518c753f0b94e14e01e97b4ec42c100dafc0cc2 from qemu	2019-06-13 18:37:58 -04:00
Peter Maydell	0ebb6b8b90	target/arm: Convert VFP fused multiply-add insns to decodetree Convert the VFP fused multiply-add instructions (VFNMA, VFNMS, VFMA, VFMS) to decodetree. Note that in the old decode structure we were implementing these to honour the VFP vector stride/length. These instructions were introduced in VFPv4, and in the v7A architecture they are UNPREDICTABLE if the vector stride or length are non-zero. In v8A they must UNDEF if stride or length are non-zero, like all VFP instructions; we choose to UNDEF always. Backports commit d4893b01d23060845ee3855bc96626e16aad9ab5 from qemu	2019-06-13 18:24:36 -04:00
Peter Maydell	321bcc822b	target/arm: Convert VDIV to decodetree Convert the VDIV instruction to decodetree. Backports commit 519ee7ae31e050eb0ff9ad35c213f0bd7ab1c03e from qemu	2019-06-13 18:19:47 -04:00
Peter Maydell	76c74bc657	target/arm: Convert VSUB to decodetree Convert the VSUB instruction to decodetree. Backports commit 8fec9a119264b7936503abce3c106fad7e3ccb76 from qemu.	2019-06-13 18:18:00 -04:00
Peter Maydell	f56f0342ad	target/arm: Convert VADD to decodetree Convert the VADD instruction to decodetree. Backports commit ce28b303716e7eca3f3765bf6776d722ebbe1122 from qemu	2019-06-13 18:15:52 -04:00
Peter Maydell	06584edf61	target/arm: Convert VNMUL to decodetree Convert the VNMUL instruction to decodetree. Backports commit 43c4be1236c105090d134540da1036073d157cd4 from qemu	2019-06-13 18:14:16 -04:00
Peter Maydell	2c5e102017	target/arm: Convert VMUL to decodetree Convert the VMUL instruction to decodetree. Backports commit 88c5188ced60e9f2b8cc3af3b9bc4a8031c8c996 from qemu	2019-06-13 18:12:03 -04:00
Peter Maydell	b26b6a12a2	target/arm: Convert VFP VNMLA to decodetree Convert the VFP VNMLA instruction to decodetree. Backports commit 8a483533adc1bdc2decb8f456dbe930a2d245a8b from qemu	2019-06-13 18:09:57 -04:00
Peter Maydell	638b90de31	target/arm: Convert VFP VNMLS to decodetree Convert the VFP VNMLS instruction to decodetree. Backports commit c54a416cc6d60efbc79dd37aaf0c8918c05b5815 from qemu	2019-06-13 18:06:59 -04:00
Peter Maydell	67ad40ffa4	target/arm: Convert VFP VMLS to decodetree Convert the VFP VMLS instruction to decodetree. Backports commit e7258280d46af4ab6a0cc93ccfe8f6614defb4b7 from qemu	2019-06-13 18:02:37 -04:00
Peter Maydell	edf81eb214	target/arm: Convert VFP VMLA to decodetree Convert the VFP VMLA instruction to decodetree. This is the first of the VFP 3-operand data processing instructions, so we include in this patch the code which loops over the elements for an old-style VFP vector operation. The existing code to do this looping uses the deprecated cpu_F0s/F0d/F1s/F1d TCG globals; since we are going to be converting instructions one at a time anyway we can take the opportunity to make the new loop use TCG temporaries, which means we can do that conversion one operation at a time rather than needing to do it all in one go. We include an UNDEF check which was missing in the old code: short-vector operations (with stride or length non-zero) were deprecated in v7A and must UNDEF in v8A, so if the MVFR0 FPShVec field does not indicate that support for short vectors is present we UNDEF the operations that would use them. (This is a change of behaviour for Cortex-A7, Cortex-A15 and the v8 CPUs, which previously were all incorrectly allowing short-vector operations.) Note that the conversion fixes a bug in the old code for the case of VFP short-vector "mixed scalar/vector operations". These happen where the destination register is in a vector bank but but the second operand is in a scalar bank. For example vmla.f64 d10, d1, d16 with length 2 stride 2 is equivalent to the pair of scalar operations vmla.f64 d10, d1, d16 vmla.f64 d8, d3, d16 where the destination and first input register cycle through their vector but the second input is scalar (d16). In the old decoder the gen_vfp_F1_mul() operation uses cpu_F1{s,d} as a temporary output for the multiply, which trashes the second input operand. For the fully-scalar case (where we never do a second iteration) and the fully-vector case (where the loop loads the new second input operand) this doesn't matter, but for the mixed scalar/vector case we will end up using the wrong value for later loop iterations. In the new code we use TCG temporaries and so avoid the bug. This bug is present for all the multiply-accumulate insns that operate on short vectors: VMLA, VMLS, VNMLA, VNMLS. Note 2: the expression used to calculate the next register number in the vector bank is not in fact correct; we leave this behaviour unchanged from the old decoder and will fix this bug later in the series. Backports commit 266bd25c485597c94209bfdb3891c1d0c573c164 from qemu	2019-06-13 17:59:16 -04:00
Peter Maydell	93fe4cbe9e	target/arm: Remove VLDR/VSTR/VLDM/VSTM use of cpu_F0s and cpu_F0d Expand out the sequences in the new decoder VLDR/VSTR/VLDM/VSTM trans functions which perform the memory accesses by going via the TCG globals cpu_F0s and cpu_F0d, to use local TCG temps instead. Backports commit 3993d0407dff7233e42f2251db971e126a0497e9 from qemu	2019-06-13 17:31:28 -04:00
Peter Maydell	ff7042567e	target/arm: Convert the VFP load/store multiple insns to decodetree Convert the VFP load/store multiple insns to decodetree. This includes tightening up the UNDEF checking for pre-VFPv3 CPUs which only have D0-D15 : they now UNDEF for any access to D16-D31, not merely when the smallest register in the transfer list is in D16-D31. This conversion does not try to share code between the single precision and the double precision versions; this looks a bit duplicative of code, but it leaves the door open for a future refactoring which gets rid of the use of the "F0" registers by inlining the various functions like gen_vfp_ld() and gen_mov_F0_reg() which are hiding "if (dp) { ... } else { ... }" conditionalisation. Backports commit fa288de272c5c8a66d5eb683b123706a52bc7ad6 from qemu	2019-06-13 17:26:52 -04:00
Peter Maydell	6f0633ce80	target/arm: Convert VFP VLDR and VSTR to decodetree Convert the VFP single load/store insns VLDR and VSTR to decodetree. Backports commit 79b02a3b5231c5b8cd31e50cd549968dd0a05c49 from qemu	2019-06-13 17:22:48 -04:00
Peter Maydell	fe98885ff2	target/arm: Convert VFP two-register transfer insns to decodetree Convert the VFP two-register transfer instructions to decodetree (in the v8 Arm ARM these are the "Advanced SIMD and floating-point 64-bit move" encoding group). Again, we expand out the sequences involving gen_vfp_msr() and gen_msr_vfp(). Backports commit 81f681106eabe21c55118a5a41999fb7387fb714 from qemu	2019-06-13 17:20:00 -04:00
Peter Maydell	3fb3403b82	target/arm: Convert single-precision register moves to decodetree Convert the "single-precision" register moves to decodetree: * VMSR * VMRS * VMOV between general purpose register and single precision Note that the VMSR/VMRS conversions make our handling of the "should this UNDEF?" checks consistent between the two instructions: * VMSR to MVFR0, MVFR1, MVFR2 now UNDEF from EL0 (previously was a nop) * VMSR to FPSID now UNDEFs from EL0 or if VFPv3 or better (previously was a nop) * VMSR to FPINST and FPINST2 now UNDEF if VFPv3 or better (previously would write to the register, which had no guest-visible effect because we always UNDEF reads) We also tighten up the decode: we were previously underdecoding some SBZ or SBO bits. The conversion of VMOV_single includes the expansion out of the gen_mov_F0_vreg()/gen_vfp_mrs() and gen_mov_vreg_F0()/gen_vfp_msr() sequences into the simpler direct load/store of the TCG temp via neon_{load,store}_reg32(): we know in the new function that we're always single-precision, we don't need to use the old-and-deprecated cpu_F0* TCG globals, and we don't happen to have the declaration of gen_vfp_msr() and gen_vfp_mrs() at the point in the file where the new function is. Backports commit a9ab50011aeda2dd012da99069e078379315ea18 from qemu	2019-06-13 17:16:38 -04:00
Peter Maydell	694058da94	target/arm: Convert double-precision register moves to decodetree Convert the "double-precision" register moves to decodetree: this covers VMOV scalar-to-gpreg, VMOV gpreg-to-scalar and VDUP. Note that the conversion process has tightened up a few of the UNDEF encoding checks: we now correctly forbid: * VMOV-to-gpr with U:opc1:opc2 == 10x00 or x0x10 * VMOV-from-gpr with opc1:opc2 == 0x10 * VDUP with B:E == 11 * VDUP with Q == 1 and Vn<0> == 1 Signed-off-by: Peter Maydell <peter.maydell@linaro.org> --- The accesses of elements < 32 bits could be improved by doing direct ld/st of the right size rather than 32-bit read-and-shift or read-modify-write, but we leave this for later cleanup, since this series is generally trying to stick to fixing the decode. Backports commit 9851ed9269d214c0c6feba960dd14ff09e6c34b4 from qemu	2019-06-13 17:11:56 -04:00
Peter Maydell	7265161108	target/arm: Add helpers for VFP register loads and stores The current VFP code has two different idioms for loading and storing from the VFP register file: 1 using the gen_mov_F0_vreg() and similar functions, which load and store to a fixed set of TCG globals cpu_F0s, CPU_F0d, etc 2 by direct calls to tcg_gen_ld_f64() and friends We want to phase out idiom 1 (because the use of the fixed globals is a relic of a much older version of TCG), but idiom 2 is quite longwinded: tcg_gen_ld_f64(tmp, cpu_env, vfp_reg_offset(true, reg)) requires us to specify the 64-bitness twice, once in the function name and once by passing 'true' to vfp_reg_offset(). There's no guard against accidentally passing the wrong flag. Instead, let's move to a convention of accessing 64-bit registers via the existing neon_load_reg64() and neon_store_reg64(), and provide new neon_load_reg32() and neon_store_reg32() for the 32-bit equivalents. Implement the new functions and use them in the code in translate-vfp.inc.c. We will convert the rest of the VFP code as we do the decodetree conversion in subsequent commits. Backports commit 160f3b64c5cc4c8a09a1859edc764882ce6ad6bf from qemu	2019-06-13 17:01:59 -04:00
Peter Maydell	033a386ffb	target/arm: Move the VFP trans_* functions to translate-vfp.inc.c Move the trans_*() functions we've just created from translate.c to translate-vfp.inc.c. This is pure code motion with no textual changes (this can be checked with 'git show --color-moved'). Backports commit f7bbb8f31f0761edbf0c64b7ab3c3f49c13612ea from qemu	2019-06-13 16:56:24 -04:00
Peter Maydell	e55d31a5ac	target/arm: Convert VCVTA/VCVTN/VCVTP/VCVTM to decodetree Convert the VCVTA/VCVTN/VCVTP/VCVTM instructions to decodetree. trans_VCVT() is temporarily left in translate.c. Backports commit c2a46a914cd5c38fd0ee57ff0befc1c5bde27bcf from qemu	2019-06-13 16:54:42 -04:00
Peter Maydell	9fb01cb526	target/arm: Convert VRINTA/VRINTN/VRINTP/VRINTM to decodetree Convert the VRINTA/VRINTN/VRINTP/VRINTM instructions to decodetree. Again, trans_VRINT() is temporarily left in translate.c. Backports commit e3bb599d16e4678b228d80194cee328f894b1ceb from qemu	2019-06-13 16:50:36 -04:00
Peter Maydell	4501daf010	target/arm: Convert VMINNM, VMAXNM to decodetree Convert the VMINNM and VMAXNM instructions to decodetree. As with VSEL, we leave the trans_VMINMAXNM() function in translate.c for the moment. Backports commit f65988a1efdb42f9058db44297591491842e697c from qemu	2019-06-13 16:43:50 -04:00
Peter Maydell	3994dfd079	target/arm: Convert the VSEL instructions to decodetree Convert the VSEL instructions to decodetree. We leave trans_VSEL() in translate.c for now as this allows the patch to show just the changes from the old handle_vsel(). In the old code the check for "do D16-D31 exist" was hidden in the VFP_DREG macro, and assumed that VFPv3 always implied that D16-D31 exist. In the new code we do the correct ID register test. This gives identical behaviour for most of our CPUs, and fixes previously incorrect handling for Cortex-R5F, Cortex-M4 and Cortex-M33, which all implement VFPv3 or better with only 16 double-precision registers. Backports commit b3ff4b87b4ae08120a51fe12592725e1dca8a085 from qemu	2019-06-13 16:41:22 -04:00
Peter Maydell	93adaa7de2	target/arm: Explicitly enable VFP short-vectors for aarch32 -cpu max At the moment our -cpu max for AArch32 supports VFP short-vectors because we always implement them, even for CPUs which should not have them. The following commits are going to switch to using the correct ID-register-check to enable or disable short vector support, so we need to turn it on explicitly for -cpu max, because Cortex-A15 doesn't implement it. We don't enable this for the AArch64 -cpu max, because the v8A architecture never supports short-vectors. Backports commit 973751fd798d41402d34f9f705c0c6d1633d0cda from qemu	2019-06-13 16:38:01 -04:00
Peter Maydell	808d929d7c	target/arm: Fix Cortex-R5F MVFR values The Cortex-R5F initfn was not correctly setting up the MVFR ID register values. Fill these in, since some subsequent patches will use ID register checks rather than CPU feature bit checks. Backports commit 3de79d335c9aa7d726865e3933d9b21781032183 from qemu	2019-06-13 16:36:48 -04:00
Lioncash	b3cfede44f	target/arm: Make load_cpu_offset() take a DisasContext* instead of uc_struct* Keeps it consistent with store_cpu_offset	2019-06-13 16:35:31 -04:00
Peter Maydell	78997058e4	target/arm: Factor out VFP access checking code Factor out the VFP access checking code so that we can use it in the leaf functions of the decodetree decoder. We call the function full_vfp_access_check() so we can keep the more natural vfp_access_check() for a version which doesn't have the 'ignore_vfp_enabled' flag -- that way almost all VFP insns will be able to use vfp_access_check(s) and only the special-register access function will have to use full_vfp_access_check(s, ignore_vfp_enabled). Backports commit 06db8196bba34776829020192ed623a0b22e6557 from qemu	2019-06-13 16:33:38 -04:00
Peter Maydell	9732ebba5c	target/arm: Add stubs for AArch32 VFP decodetree Add the infrastructure for building and invoking a decodetree decoder for the AArch32 VFP encodings. At the moment the new decoder covers nothing, so we always fall back to the existing hand-written decode. We need to have one decoder for the unconditional insns and one for the conditional insns, as otherwise the patterns for conditional insns would incorrectly match against the unconditional ones too. Since translate.c is over 14,000 lines long and we're going to be touching pretty much every line of the VFP code as part of the decodetree conversion, we create a new translate-vfp.inc.c to hold the code which deals with VFP in the new scheme. It should be possible to convert this into a standalone translation unit eventually, but the conversion process will be much simpler if we simply #include it midway through translate.c to start with. Backports commit 78e138bc1f672c145ef6ace74617db00eebaa2ba from qemu	2019-06-13 16:24:37 -04:00
Richard Henderson	afaea6a291	target/arm: Fix output of PAuth Auth The ARM pseudocode installs the error_code into the original pointer, not the encrypted pointer. The difference applies within the 7 bits of pac data; the result should be the sign extension of bit 55. Add a testcase to that effect. Backports commit d67ebada159148bfdfde84871338738e4465e985 from qemu	2019-06-13 16:17:00 -04:00
Peter Maydell	230f8a091a	target/arm: Implement NSACR gating of floating point The NSACR register allows secure code to configure the FPU to be inaccessible to non-secure code. If the NSACR.CP10 bit is set then: * NS accesses to the FPU trap as UNDEF (ie to NS EL1 or EL2) * CPACR.{CP10,CP11} behave as if RAZ/WI * HCPTR.{TCP11,TCP10} behave as if RAO/WI Note that we do not implement the NSACR.NSASEDIS bit which gates only access to Advanced SIMD, in the same way that we don't implement the equivalent CPACR.ASEDIS and HCPTR.TASE. Backports commit fc1120a7f5f2d4b601003205c598077d3eb11ad2 from qemu	2019-06-13 16:15:28 -04:00
Richard Henderson	7c32498b7f	target/arm: Use tcg_gen_gvec_bitsel This replaces 3 target-specific implementations for BIT, BIF, and BSL. Backports commit 3a7a2b4e5cf0d49cd8b14e8225af0310068b7d20 from qemu	2019-06-13 16:12:56 -04:00
Richard Henderson	8f53f09a05	cpu: Introduce CPUNegativeOffsetState Nothing in there so far, but all of the plumbing done within the target ArchCPU state. Backports commit 5b146dc716cfd247f99556c04e6e46fbd67565a0 from qemu	2019-06-13 15:08:25 -04:00
Richard Henderson	a672b89e3b	cpu: Introduce cpu_set_cpustate_pointers Consolidate some boilerplate from foo_cpu_initfn. Backports commit 7506ed902eb97fe4e2a1dd16766c621d32ecc40d from qemu	2019-06-12 12:27:16 -04:00
Richard Henderson	ac176ccb38	cpu: Move ENV_OFFSET to exec/gen-icount.h Now that we have ArchCPU, we can define this generically, in the one place that needs it. Backports commit 677c4d69ac21961e76a386f9bfc892a44923acc0 from qemu	2019-06-12 12:20:21 -04:00
Richard Henderson	b8bd543390	target/arm: Use env_cpu, env_archcpu Cleanup in the boilerplate that each target must define. Replace arm_env_get_cpu with env_archcpu. The combination CPU(arm_env_get_cpu) should have used ENV_GET_CPU to begin; use env_cpu now. Backports commit 2fc0cc0e1e034582f4718b1a2d57691474ccb6aa from qemu	2019-06-12 11:34:08 -04:00
Richard Henderson	fbf91a6535	cpu: Replace ENV_GET_CPU with env_cpu Now that we have both ArchCPU and CPUArchState, we can define this generically instead of via macro in each target's cpu.h. Backports commit 29a0af618ddd21f55df5753c3e16b0625f534b3c from qemu	2019-06-12 11:16:16 -04:00
Richard Henderson	ae94fb5992	cpu: Define ArchCPU For all targets, do this just before including exec/cpu-all.h. Backports commit 2161a612b4e1d388046320bc464adefd6bba01a0 from qemu	2019-06-12 11:08:39 -04:00
Richard Henderson	e3f1f25996	cpu: Define CPUArchState with typedef For all targets, do this just before including exec/cpu-all.h. Backports commit 4f7c64b3819d559417615ed2b1d028ebc1a49580 from qemu	2019-06-12 11:06:36 -04:00
Richard Henderson	df2a890bd7	tcg: Split out target/arch/cpu-param.h For all targets, into this new file move TARGET_LONG_BITS, TARGET_PAGE_BITS, TARGET_PHYS_ADDR_SPACE_BITS, TARGET_VIRT_ADDR_SPACE_BITS, and NB_MMU_MODES. Include this new file from exec/cpu-defs.h. This now removes the somewhat odd requirement that target/arch/cpu.h defines TARGET_LONG_BITS before including exec/cpu-defs.h, so push the bulk of the includes within target/arch/cpu.h to the top. Backports commit 74433bf083b0766aba81534f92de13194f23ff3e from qemu	2019-06-10 19:35:46 -04:00
Alistair Francis	f8f3e50372	target/arm: Fix vector operation segfault Commit 89e68b575 "target/arm: Use vector operations for saturation" causes this abort() when booting QEMU ARM with a Cortex-A15: 0 0x00007ffff4c2382f in raise () at /usr/lib/libc.so.6 1 0x00007ffff4c0e672 in abort () at /usr/lib/libc.so.6 2 0x00005555559c1839 in disas_neon_data_insn (insn=<optimized out>, s=<optimized out>) at ./target/arm/translate.c:6673 3 0x00005555559c1839 in disas_neon_data_insn (s=<optimized out>, insn=<optimized out>) at ./target/arm/translate.c:6386 4 0x00005555559cd8a4 in disas_arm_insn (insn=4081107068, s=0x7fffe59a9510) at ./target/arm/translate.c:9289 5 0x00005555559cd8a4 in arm_tr_translate_insn (dcbase=0x7fffe59a9510, cpu=<optimized out>) at ./target/arm/translate.c:13612 6 0x00005555558d1d39 in translator_loop (ops=0x5555561cc580 <arm_translator_ops>, db=0x7fffe59a9510, cpu=0x55555686a2f0, tb=<optimized out>, max_insns=<optimized out>) at ./accel/tcg/translator.c:96 7 0x00005555559d10d4 in gen_intermediate_code (cpu=cpu@entry=0x55555686a2f0, tb=tb@entry=0x7fffd7840080 <code_gen_buffer+126091347>, max_insns=max_insns@entry=512) at ./target/arm/translate.c:13901 8 0x00005555558d06b9 in tb_gen_code (cpu=cpu@entry=0x55555686a2f0, pc=3067096216, cs_base=0, flags=192, cflags=-16252928, cflags@entry=524288) at ./accel/tcg/translate-all.c:1736 9 0x00005555558ce467 in tb_find (cf_mask=524288, tb_exit=1, last_tb=0x7fffd783e640 <code_gen_buffer+126084627>, cpu=0x1) at ./accel/tcg/cpu-exec.c:407 10 0x00005555558ce467 in cpu_exec (cpu=cpu@entry=0x55555686a2f0) at ./accel/tcg/cpu-exec.c:728 11 0x000055555588b0cf in tcg_cpu_exec (cpu=0x55555686a2f0) at ./cpus.c:1431 12 0x000055555588d223 in qemu_tcg_cpu_thread_fn (arg=0x55555686a2f0) at ./cpus.c:1735 13 0x000055555588d223 in qemu_tcg_cpu_thread_fn (arg=arg@entry=0x55555686a2f0) at ./cpus.c:1709 14 0x0000555555d2629a in qemu_thread_start (args=<optimized out>) at ./util/qemu-thread-posix.c:502 15 0x00007ffff4db8a92 in start_thread () at /usr/lib/libpthread. This patch ensures that we don't hit the abort() in the second switch case in disas_neon_data_insn() as we will return from the first case. Backports commit 2f143d3ad1c05e91cf2cdf5de06d59a80a95e6c8 from qemu	2019-05-24 18:02:32 -04:00
Richard Henderson	9287750362	target/arm: Simplify BFXIL expansion The mask implied by the extract is redundant with the one implied by the deposit. Also, fix spelling of BFXIL. Backports commit 87eb65a3c45c788a309986d48170a54a0d1c0705 from qemu	2019-05-24 18:01:26 -04:00
Richard Henderson	1778828644	target/arm: Use extract2 for EXTR This is, after all, how we implement extract2 in tcg/aarch64. Backports commit 80ac954c369e7e61bd1ed00cef07b63e11f9c734 from qemu	2019-05-24 17:58:58 -04:00
Richard Henderson	3dd7358a53	target/arm: Implement ARMv8.5-RNG Use the newly introduced infrastructure for guest random numbers. Backports commit de390645675966cce113bf5394445bc1f8d07c85 from qemu (with the actual RNG portion disabled to preserve determinism for the time being).	2019-05-23 15:03:23 -04:00
Richard Henderson	a8df33c37c	target/arm: Put all PAC keys into a structure This allows us to use a single syscall to initialize them all. Backports commit 108b3ba891408c4dce93df78261ec4aca38c0e2e from qemu	2019-05-23 14:54:06 -04:00
Richard Henderson	2a4a7b9391	tcg: Use tlb_fill probe from tlb_vaddr_to_host Most of the existing users would continue around a loop which would fault the tlb entry in via a normal load/store. But for AArch64 SVE we have an existing emulation bug wherein we would mark the first element of a no-fault vector load as faulted (within the FFR, not via exception) just because we did not have its address in the TLB. Now we can properly only mark it as faulted if there really is no valid, readable translation, while still not raising an exception. (Note that beyond the first element of the vector, the hardware may report a fault for any reason whatsoever; with at least one element loaded, forward progress is guaranteed.) Backports commit 4811e9095c0491bc6f5450e5012c9c4796b9e59d from qemu	2019-05-16 18:27:03 -04:00
Richard Henderson	dab0061a0d	tcg: Use CPUClass::tlb_fill in cputlb.c We can now use the CPUClass hook instead of a named function. Create a static tlb_fill function to avoid other changes within cputlb.c. This also isolates the asserts within. Remove the named tlb_fill function from all of the targets. Backports commit c319dc13579a92937bffe02ad2c9f1a550e73973 from qemu	2019-05-16 17:35:37 -04:00
Richard Henderson	31ecdb5341	target/arm: Convert to CPUClass::tlb_fill Backports commit 7350d553b5066abdc662045d7db5cdb73d0f9d53 from qemu	2019-05-16 16:55:12 -04:00
Richard Henderson	552e48f14e	target/arm: Use tcg_gen_abs_i64 and tcg_gen_gvec_abs Backports commit 4e027a710673f5d4dc6cff88728bcfd32e4c47b0 from qemu	2019-05-16 16:43:02 -04:00
Richard Henderson	6d1730048d	tcg: Add support for integer absolute value Remove a function of the same name from target/arm/. Use a branchless implementation of abs gleaned from gcc. Backports commit ff1f11f7f8710a768f9313f24bd7f509d3db27e5 from qemu	2019-05-16 16:25:15 -04:00
Richard Henderson	c54b2776f6	tcg: Specify optional vector requirements with a list Replace the single opcode in .opc with a null-terminated array in .opt_opc. We still require that all opcodes be used with the same .vece. Validate the contents of this list with CONFIG_DEBUG_TCG. All tcg_gen_*_vec functions will check any list active during .fniv expansion. Swap the active list in and out as we expand other opcodes, or take control away from the front-end function. Convert all existing vector aware front ends. Backports commit 53229a7703eeb2bbe101a19a33ef22aaf960c65b from qemu	2019-05-16 15:05:02 -04:00
Peter Maydell	26cb1b8767	target/arm: Stop using variable length array in dc_zva Currently the dc_zva helper function uses a variable length array. In fact we know (as the comment above remarks) that the length of this array is bounded because the architecture limits the block size and QEMU limits the target page size. Use a fixed array size and assert that we don't run off it. Backports commit 63159601fb3e396b28da14cbb71e50ed3f5a0331 from qemu	2019-05-09 17:48:25 -04:00
Peter Maydell	7861820e94	target/arm: Implement XPSR GE bits In the M-profile architecture, if the CPU implements the DSP extension then the XPSR has GE bits, in the same way as the A-profile CPSR. When we added DSP extension support we forgot to add support for reading and writing the GE bits, which are stored in env->GE. We did put in the code to add XPSR_GE to the mask of bits to update in the v7m_msr helper, but forgot it in v7m_mrs. We also must not allow the XPSR we pull off the stack on exception return to set the nonexistent GE bits. Correct these errors: * read and write env->GE in xpsr_read() and xpsr_write() * only set GE bits on exception return if DSP present * read GE bits for MRS if DSP present Backports commit f1e2598c46d480c9e21213a244bc514200762828 from qemu	2019-05-09 17:46:31 -04:00
Lioncash	a71c027063	decodetree: Add DisasContext argument to !function expanders This does require adjusting all existing users. Backports commit 451e4ffdb0003ab5ed0d98bd37b385c076aba183 from qemu	2019-05-09 17:40:45 -04:00
Emilio G. Cota	1715f382b4	target/arm: check CF_PARALLEL instead of parallel_cpus Thereby decoupling the resulting translated code from the current state of the system. Backports commit 2399d4e7cec22ecf1c51062d2ebfd45220dbaace from qemu	2019-05-04 22:44:32 -04:00
Peter Maydell	d4549fccfb	target/arm: Enable FPU for Cortex-M4 and Cortex-M33 Enable the FPU by default for the Cortex-M4 and Cortex-M33. Backports commit 14fd0c31e26b88a2189b3f459b864d5e1faf302a from qemu	2019-04-30 11:29:26 -04:00
Peter Maydell	77ae3982b4	target/arm: Implement VLLDM for v7M CPUs with an FPU Implement the VLLDM instruction for v7M for the FPU present cas. Backports commit 956fe143b4f254356496a0a1c479fa632376dfec from qemu	2019-04-30 11:27:54 -04:00
Peter Maydell	b483951046	target/arm: Implement VLSTM for v7M CPUs with an FPU Implement the VLSTM instruction for v7M for the FPU present case. Backports commit 019076b036da4444494de38388218040d9d3a26c from qemu	2019-04-30 11:25:44 -04:00
Peter Maydell	a976d7642a	target/arm: Implement M-profile lazy FP state preservation The M-profile architecture floating point system supports lazy FP state preservation, where FP registers are not pushed to the stack when an exception occurs but are instead only saved if and when the first FP instruction in the exception handler is executed. Implement this in QEMU, corresponding to the check of LSPACT in the pseudocode ExecuteFPCheck(). Backports commit e33cf0f8d8c9998a7616684f9d6aa0d181b88803 from qemu	2019-04-30 11:21:50 -04:00
Peter Maydell	72e5ae480d	target/arm: Add lazy-FP-stacking support to v7m_stack_write() Pushing registers to the stack for v7M needs to handle three cases: * the "normal" case where we pend exceptions * an "ignore faults" case where we set FSR bits but do not pend exceptions (this is used when we are handling some kinds of derived exception on exception entry) * a "lazy FP stacking" case, where different FSR bits are set and the exception is pended differently Implement this by changing the existing flag argument that tells us whether to ignore faults or not into an enum that specifies which of the 3 modes we should handle. Backports commit a356dacf647506bccdf8ecd23574246a8bf615ac from qemu	2019-04-30 10:59:53 -04:00
Peter Maydell	b1d6bd2792	target/arm: New function armv7m_nvic_set_pending_lazyfp() In the v7M architecture, if an exception is generated in the process of doing the lazy stacking of FP registers, the handling of possible escalation to HardFault is treated differently to the normal approach: it works based on the saved information about exception readiness that was stored in the FPCCR when the stack frame was created. Provide a new function armv7m_nvic_set_pending_lazyfp() which pends exceptions during lazy stacking, and implements this logic. This corresponds to the pseudocode TakePreserveFPException(). Backports the relevant parts of commit a99ba8ab1601904e0fa20325192fc850362ce80e from qemu	2019-04-30 10:56:54 -04:00
Peter Maydell	3fff653e20	target/arm: New helper function arm_v7m_mmu_idx_all() Add a new helper function which returns the MMU index to use for v7M, where the caller specifies all of the security state, privilege level and whether the execution priority is negative, and reimplement the existing arm_v7m_mmu_idx_for_secstate_and_priv() in terms of it. We are going to need this for the lazy-FP-stacking code. Backports commit fa6252a988dbe440cd6087bf93cbe0887f0c401b from qemu	2019-04-30 10:54:26 -04:00
Peter Maydell	719231b4c0	target/arm: Activate M-profile floating point context when FPCCR.ASPEN is set The M-profile FPCCR.ASPEN bit indicates that automatic floating-point context preservation is enabled. Before executing any floating-point instruction, if FPCCR.ASPEN is set and the CONTROL FPCA/SFPA bits indicate that there is no active floating point context then we must create a new context (by initializing FPSCR and setting FPCA/SFPA to indicate that the context is now active). In the pseudocode this is handled by ExecuteFPCheck(). Implement this with a new TB flag which tracks whether we need to create a new FP context. Backports commit 6000531e19964756673a5f4b694a649ef883605a from qemu	2019-04-30 10:51:31 -04:00
Peter Maydell	87c8c0fde7	target/arm: Set FPCCR.S when executing M-profile floating point insns The M-profile FPCCR.S bit indicates the security status of the floating point context. In the pseudocode ExecuteFPCheck() function it is unconditionally set to match the current security state whenever a floating point instruction is executed. Implement this by adding a new TB flag which tracks whether FPCCR.S is different from the current security state, so that we only need to emit the code to update it in the less-common case when it is not already set correctly. Note that we will add the handling for the other work done by ExecuteFPCheck() in later commits. Backports commit 6d60c67a1a03be32c3342aff6604cdc5095088d1 from qemu	2019-04-30 10:50:17 -04:00
Peter Maydell	8d726490ff	target/arm: Overlap VECSTRIDE and XSCALE_CPAR TB flags We are close to running out of TB flags for AArch32; we could start using the cs_base word, but before we do that we can economise on our usage by sharing the same bits for the VFP VECSTRIDE field and the XScale XSCALE_CPAR field. This works because no XScale CPU ever had VFP. Backports commit ea7ac69d124c94c6e5579145e727adec9ccbefef from qemu	2019-04-30 10:45:14 -04:00
Peter Maydell	3c1f3548c4	target/arm: Move NS TBFLAG from bit 19 to bit 6 Move the NS TBFLAG down from bit 19 to bit 6, which has not been used since commit c1e3781090b9d36c60 in 2015, when we started passing the entire MMU index in the TB flags rather than just a 'privilege level' bit. This rearrangement is not strictly necessary, but means that we can put M-profile-only bits next to each other rather than scattered across the flag word. Backports commit 7fbb535f7aeb22896fedfcf18a1eeff48165f1d7 from qemu	2019-04-30 10:41:04 -04:00
Peter Maydell	86776d451e	target/arm: Handle floating point registers in exception return Handle floating point registers in exception return. This corresponds to pseudocode functions ValidateExceptionReturn(), ExceptionReturn(), PopStack() and ConsumeExcStackFrame(). Backports commit 6808c4d2d2826920087533f517472c09edc7b0d2 from qemu	2019-04-30 10:40:12 -04:00
Peter Maydell	2244bb085a	target/arm: Allow for floating point in callee stack integrity check The magic value pushed onto the callee stack as an integrity check is different if floating point is present. Backports commit 0dc51d66fcfcc4c72011cdafb401fd876ca216e7 from qemu	2019-04-30 10:36:58 -04:00
Peter Maydell	746d377221	target/arm: Clean excReturn bits when tail chaining The TailChain() pseudocode specifies that a tail chaining exception should sanitize the excReturn all-ones bits and (if there is no FPU) the excReturn FType bits; we weren't doing this. Backports commit 60fba59a2f9a092a44b688df5d058cdd6dd9c276 from qemu	2019-04-30 10:35:36 -04:00
Peter Maydell	ca0ac5dca9	target/arm: Clear CONTROL.SFPA in BXNS and BLXNS For v8M floating point support, transitions from Secure to Non-secure state via BLNS and BLXNS must clear the CONTROL.SFPA bit. (This corresponds to the pseudocode BranchToNS() function.) Backports commit 3cd6726f0ba7cc77342ee721bd86094e13b2a42a from qemu	2019-04-30 10:33:25 -04:00
Peter Maydell	c7f5633cfe	target/arm: Implement v7m_update_fpccr() Implement the code which updates the FPCCR register on an exception entry where we are going to use lazy FP stacking. We have to defer to the NVIC to determine whether the various exceptions are currently ready or not. Backports commit b593c2b81287040ab6f452afec6281e2f7ee487b from qemu	2019-04-30 10:32:12 -04:00
Peter Maydell	065e60503f	target/arm: Handle floating point registers in exception entry Handle floating point registers in exception entry. This corresponds to the FP-specific parts of the pseudocode functions ActivateException() and PushStack(). We defer the code corresponding to UpdateFPCCR() to a later patch. Backports commit 0ed377a8013f40653a83f6ad2c9693897522d7dc from qemu	2019-04-30 10:25:23 -04:00
Peter Maydell	c164a9f191	target/arm/helper: don't return early for STKOF faults during stacking Currently the code in v7m_push_stack() which detects a violation of the v8M stack limit simply returns early if it does so. This is OK for the current integer-only code, but won't work for the floating point handling we're about to add. We need to continue executing the rest of the function so that we check for other exceptions like not having permission to use the FPU and so that we correctly set the FPCCR state if we are doing lazy stacking. Refactor to avoid the early return. Backports commit 3432c79a4e7345818d2defcf9e61a1bcb2907f9f from qemu	2019-04-30 10:22:36 -04:00
Peter Maydell	05add081a3	target/arm: Handle SFPA and FPCA bits in reads and writes of CONTROL The M-profile CONTROL register has two bits -- SFPA and FPCA -- which relate to floating-point support, and should be RES0 otherwise. Handle them correctly in the MSR/MRS register access code. Neither is banked between security states, so they are stored in v7m.control[M_REG_S] regardless of current security state. Backports commit 2e1c5bcd32014c9ede1b604ae6c2c653de17fc53 from qemu	2019-04-30 10:21:24 -04:00
Peter Maydell	c0cebeb5b5	target/arm: Clear CONTROL_S.SFPA in SG insn if FPU present If the floating point extension is present, then the SG instruction must clear the CONTROL_S.SFPA bit. Implement this. (On a no-FPU system the bit will always be zero, so we don't need to make the clearing of the bit conditional on ARM_FEATURE_VFP.) Backports commit 1702071302934af77a072b7ee7c5eadc45b37573 from qemu	2019-04-30 10:20:45 -04:00
Peter Maydell	89baa5cffa	target/arm: Decode FP instructions for M profile Correct the decode of the M-profile "coprocessor and floating-point instructions" space: * op0 == 0b11 is always unallocated * if the CPU has an FPU then all insns with op1 == 0b101 are floating point and go to disas_vfp_insn() For the moment we leave VLLDM and VLSTM as NOPs; in a later commit we will fill in the proper implementation for the case where an FPU is present. Backports commit 8859ba3c9625e7ceb5599f457a344bcd7c5e112b from qemu	2019-04-30 10:19:45 -04:00
Peter Maydell	18bb21c035	target/arm: Honour M-profile FP enable bits Like AArch64, M-profile floating point has no FPEXC enable bit to gate floating point; so always set the VFPEN TB flag. M-profile also has CPACR and NSACR similar to A-profile; they behave slightly differently: * the CPACR is banked between Secure and Non-Secure * if the NSACR forces a trap then this is taken to the Secure state, not the Non-Secure state Honour the CPACR and NSACR settings. The NSACR handling requires us to borrow the exception.target_el field (usually meaningless for M profile) to distinguish the NOCP UsageFault taken to Secure state from the more usual fault taken to the current security state. Backports commit d87513c0abcbcd856f8e1dee2f2d18903b2c3ea2 from qemu	2019-04-30 10:18:21 -04:00
Peter Maydell	c6bb8d483d	target/arm: Disable most VFP sysregs for M-profile The only "system register" that M-profile floating point exposes via the VMRS/VMRS instructions is FPSCR, and it does not have the odd special case for rd==15. Add a check to ensure we only expose FPSCR. Backports commit ef9aae2522c22c05df17dd898099dd5c3f20d688 from qemu	2019-04-30 10:15:25 -04:00
Peter Maydell	a4f332f3e9	target/arm: Implement dummy versions of M-profile FP-related registers The M-profile floating point support has three associated config registers: FPCAR, FPCCR and FPDSCR. It also makes the registers CPACR and NSACR have behaviour other than reads-as-zero. Add support for all of these as simple reads-as-written registers. We will hook up actual functionality later. The main complexity here is handling the FPCCR register, which has a mix of banked and unbanked bits. Note that we don't share storage with the A-profile cpu->cp15.nsacr and cpu->cp15.cpacr_el1, though the behaviour is quite similar, for two reasons: * the M profile CPACR is banked between security states * it preserves the invariant that M profile uses no state inside the cp15 substruct Backports commit d33abe82c7c9847284a23e575e1078cccab540b5 from qemu	2019-04-30 10:13:41 -04:00
Peter Maydell	978cd9c524	target/arm: Make sure M-profile FPSCR RES0 bits are not settable Enforce that for M-profile various FPSCR bits which are RES0 there but have defined meanings on A-profile are never settable. This ensures that M-profile code can't enable the A-profile behaviour (notably vector length/stride handling) by accident. Backports commit 5bcf8ed9401e62c73158ba110864ee1375558bf7 from qemu	2019-04-30 10:12:17 -04:00
Richard Henderson	bca82cde84	tcg: Hoist max_insns computation to tb_gen_code In order to handle TB's that translate to too much code, we need to place the control of the length of the translation in the hands of the code gen master loop. Backports commit 8b86d6d25807e13a63ab6ea879f976b9f18cc45a from qemu	2019-04-30 09:49:57 -04:00
Lioncash	f6911ea73d	target/arm: Handle AArch32 CRC instructions	2019-04-27 10:50:25 -04:00
Lioncash	c3df12e534	target/arm/translate: Synchronize with Qemu	2019-04-27 10:13:01 -04:00
Lioncash	6d80445fe1	unicorn_arm: Treat registers as unsigned values in casts It isn't particularly advisable to treat these as signed values, given the registers themselves have no notion of signedness associated with them.	2019-04-26 08:48:31 -04:00
Lioncash	f419015aa3	unicorn_arm: Don't steamroll CPSR bits defined as RAZ/SBZP Prevents bits from being set that should always read as zero according to the ARM architecture reference manual.	2019-04-26 08:47:50 -04:00
Lioncash	d844d7cc9d	exec: Backport tb_cflags accessor	2019-04-22 06:12:59 -04:00
Lioncash	14f1cb03e9	target/arm/unicorn_arm: Get rid of magic constants where applicable	2019-04-19 20:23:41 -04:00
Lioncash	5968b3d96f	target/arm: Synchronize with qemu	2019-04-19 15:31:18 -04:00
Lioncash	bf6dfeb175	target/arm/translate: Synchronize with qemu Backports a few other missing pieces from mainline qemu.	2019-04-18 06:22:36 -04:00
Lioncash	5b062dacf2	target/arm: Simplify and correct thumb instruction tracing This wasn't subtracting the size of the instruction off the PC like how the ARM mode tracing was performing the tracing. This simplifies it and makes the behavior identical.	2019-04-18 06:00:15 -04:00
Lioncash	5d6ddec7fb	target/arm/translate: Subtract PC value properly for thumb tracecode calls	2019-04-18 05:44:48 -04:00
Lioncash	3521e72580	target/arm: Sychronize with qemu Synchronizes with bits and pieces that were missed due to merging incorrectly (sorry :<)	2019-04-18 04:49:11 -04:00
Lioncash	ddcf400955	arm: Always enable access to coprocessors initially Allows non-AArch64 environments to always access coprocessors initially. Removes the need to do avoidable register management when testing floating-point code.	2019-04-13 19:49:43 -04:00
Richard Henderson	f5cb1a5865	target/arm: Set SIMDMISC and FPMISC for 32-bit -cpu max Fixes: https://bugs.launchpad.net/bugs/1821430 Backports commit c8877d0f2f662bf01346a03bc9fd279954b4132d from qemu	2019-03-26 20:41:01 -04:00
Andrew Jones	8719b3edb3	target/arm: make pmccntr_op_start/finish static These functions are not used outside helper.c Backports commit f2b2f53f6429b5abd7cd86bd65747f5f13e195eb from qemu	2019-03-26 20:35:34 -04:00
Andrew Jones	6482182ba5	target/arm: cortex-a7 and cortex-a15 have pmus cortex-a7 and cortex-a15 have pmus (PMUv2) and they advertise them in ID_DFR0. Let's allow them to function. This also enables the pmu cpu property to work with these cpu types, i.e. we can now do '-cpu cortex-a15,pmu=off' to remove the pmu. Backports commit a46118fc16537a593119e5b316052a98514046bb from qemu	2019-03-26 20:34:11 -04:00
Andrew Jones	3c50e72c40	target/arm: fix crash on pmu register access Fix a QEMU NULL derefence that occurs when the guest attempts to enable PMU counters with a non-v8 cpu model or a v8 cpu model which has not configured a PMU. Backports commit cbbb3041fe2f57a475cef5d6b0ef836118aad106 from qemu	2019-03-26 20:32:49 -04:00
Richard Henderson	2427ace0c0	target/arm: Fix non-parallel expansion of CASP The second word has been loaded from the unincremented address since the first commit. Backports commit a036f5302c13634f3d375615b2949fd1fa1657b6 from qemu	2019-03-26 20:31:01 -04:00
Amir Charif	2392d8b8ab	target/arm: Check access permission to ADDVL/ADDPL/RDVL These instructions do not trap when SVE is disabled in EL0, causing them to be executed with wrong size information. Backports commit 5de56742a3c91de3d646326bec43a989bba83ca4 from qemu	2019-03-19 05:42:59 -04:00
Dongjiu Geng	4dc3d59fd3	target/arm: change arch timer registers access permission Some generic arch timer registers are Config-RW in the EL0, which means the EL0 exception level can have write permission if it is appropriately configured. When VM access registers, QEMU firstly checks whether they have RW permission, then check whether it is appropriately configured. If they are defined to read only in EL0, even though they have been appropriately configured, they still do not have write permission. So need to add the write permission according to ARMV8 spec when define it. Backports commit daf1dc5f82cefe2a57f184d5053e8b274ad2ba9a from qemu	2019-03-19 05:40:44 -04:00
yhql	1723cb1015	Add ARM MSP, PSP and CONTROL register access (#1071 ) Necessary for NVIC exception emulation from user. Backports commit 31851280316d37305f412fff42f45bb375999074 from unicorn	2019-03-08 02:24:49 -05:00
Lioncash	5075a0158a	target/arm: Restore Qemu's organization of coprocessor registers These changes were mostly made in upstream unicorn for what I can guess, was to support old versions of MSVC's compiler. This is also a pain to maintain, since everything needs to be done manually and can be a source of errors. It also makes it take more work than it needs to, to backport changes from qemu. Because of that, this change restores Qemu's organization of the coprocessor registers.	2019-03-08 01:32:47 -05:00
Richard Henderson	f116560d2c	target/arm: Implement ARMv8.5-FRINT Backports 6bea25631af92531027d3bf3ef972a4d51d62e7c from qemu.	2019-03-05 23:17:33 -05:00
Richard Henderson	f855ac073d	target/arm: Restructure handle_fp_1src_{single, double} This will allow sharing code that adjusts rmode beyond the existing users. Backports commit 0e4db23d1fdbfed4fc1ec19b6e59820209600358 from qemu	2019-03-05 23:09:48 -05:00
Richard Henderson	94b5aab8f8	target/arm: Implement ARMv8.5-CondM Backports commit 5ef84f111483e3f7b57efc690e22081ca8f99544 from qemu	2019-03-05 23:04:06 -05:00
Richard Henderson	1dfa15a683	target/arm: Implement ARMv8.4-CondM Backports commit b89d9c988a988d5547c73e2bc43f59b0c07420a5 from qemu	2019-03-05 22:59:51 -05:00
Richard Henderson	65a3f3be5b	target/arm: Rearrange disas_data_proc_reg This decoding more closely matches the ARMv8.4 Table C4-6, Encoding table for Data Processing - Register Group. In particular, op2 == 0 is now more than just Add/sub (with carry). Backports commit 2fba34f70d9a81bab56e61bb99a4d6632bdfe531 from qemu	2019-03-05 22:55:27 -05:00
Richard Henderson	45c297c99b	target/arm: Add set/clear_pstate_bits, share gen_ss_advance We do not need an out-of-line helper for manipulating bits in pstate. While changing things, share the implementation of gen_ss_advance. Backports commit 22ac3c49641f6eed93dca5b852030b4d3eacf6c4 from qemu	2019-03-05 22:55:22 -05:00
Richard Henderson	60742608f5	target/arm: Split helper_msr_i_pstate into 3 The EL0+UMA check is unique to DAIF. While SPSel had avoided the check by nature of already checking EL >= 1, the other post v8.0 extensions to MSR (imm) allow EL0 and do not require UMA. Avoid the unconditional write to pc and use raise_exception_ra to unwind. Backports commit ff730e9666a716b669ac4a8ca7c521177d1d2b15 from qemu	2019-03-05 22:45:11 -05:00
Richard Henderson	5d42ca6a65	target/arm: Implement ARMv8.0-PredInv Backports commit cb570bd318beb2ecce83cabf8016dacceb824dce from qemu	2019-03-05 22:37:57 -05:00
Richard Henderson	1721e429c2	target/arm: Implement ARMv8.0-SB Backports commit 9888bd1e20425dfe4dcca5dcd1ca2fac8e90ad19 from qemu	2019-03-05 22:35:16 -05:00
Richard Henderson	a552a7b2e0	target/arm: Split out arm_sctlr Minimize the number of places that will need updating when the virtual host extensions are added. Backports commit 64e40755cd41fbe8cd266cf387e42ddc57a449ef from qemu	2019-03-05 22:29:25 -05:00
Richard Henderson	fa70a2bc69	target/arm: Fix PC test for LDM (exception return) Found by inspection: Rn is the base register against which the load began; I is the register within the mask being processed. The exception return should of course be processed from the loaded PC. Backports commit 9d090d17234058f55c3c439d285db78c94d7d4de from qemu	2019-03-05 22:27:38 -05:00
Lioncash	0868015992	target/arm: Move TCGContext variable within arm_post_translate_insn into a narrower scope This is only used within the scope of the if statement, so we can just move it there.	2019-02-28 18:53:33 -05:00
Lioncash	15440a83c5	target/arm: Fix execution of ARM instructions Previously we'd be checking prior to the actual decoding if we were at the ending address. This worked fine using the old model of the translation process in qemu. However, this causes the wrong behavior to occur in both ARM and Thumb/Thumb-2 modes using the newer translator model. Given the translator itself checks for the end address already, this needs to be placed within arm_post_translate_insn(). This prevents the emulation process being off-by-one as well when it comes to actually executing the instructions.	2019-02-28 18:49:22 -05:00
cfrantz	5ad3a0ea82	Add support for the ARM IPSR register. (#1067 ) 1. Create an enum name for the IPSR register. 2. Implement read and write of the IPSR via the xpsr helper functions. Fixes #1065 Backports commit 6c319941a5462ee3a4af4593c371f5674394d6ce from unicorn.	2019-02-28 16:40:54 -05:00
Richard Henderson	fbe1ee25ff	target/arm: Enable ARMv8.2-FHM for -cpu max Backports commit 991c05995a7bbafbebc1e4d405e947f2edcee063 from qemu	2019-02-28 15:47:03 -05:00
Richard Henderson	4ae3ff8e61	target/arm: Implement VFMAL and VFMSL for aarch32 Backports commit 87732318c5d68a366fc2d6fc394d9c20412099fa from qemu	2019-02-28 15:44:59 -05:00
Richard Henderson	625d3f3cfb	target/arm: Implement FMLAL and FMLSL for aarch64 Backports commit 0caa5af802ff622c854ff4ee2e2b8cdd135b4d73 from qemu	2019-02-28 15:36:41 -05:00
Richard Henderson	5473c3603f	target/arm: Add helpers for FMLAL Note that float16_to_float32 rightly squashes SNaN to QNaN. But of course pickNaNMulAdd, for ARM, selects SNaNs first. So we have to preserve SNaN long enough for the correct NaN to be selected. Thus float16_to_float32_by_bits. Backports commit a4e943a716d5fac923d82df3eabc65d1e3624019 from qemu	2019-02-28 15:31:48 -05:00
Peter Maydell	82b8e97f76	target/arm: Gate "miscellaneous FP" insns by ID register field There is a set of VFP instructions which we implement in disas_vfp_v8_insn() and gate on the ARM_FEATURE_V8 bit. These were all first introduced in v8 for A-profile, but in M-profile they appeared in v7M. Gate them on the MVFR2 FPMisc field instead, and rename the function appropriately. Backports commit c0c760afe800b60b48c80ddf3509fec413594778 from qemu	2019-02-28 15:26:27 -05:00
Peter Maydell	118a2bde5c	target/arm: Use MVFR1 feature bits to gate A32/T32 FP16 instructions Instead of gating the A32/T32 FP16 conversion instructions on the ARM_FEATURE_VFP_FP16 flag, switch to our new approach of looking at ID register bits. In this case MVFR1 fields FPHP and SIMDHP indicate the presence of these insns. This change doesn't alter behaviour for any of our CPUs. Backports commit 602f6e42cfbfe9278be34e9b91d2ceb695837e02 from qemu	2019-02-28 15:23:51 -05:00
Richard Henderson	c9ad233678	target/arm: Implement ARMv8.3-JSConv Backports commit 6c1f6f2733a7692793135ea5ce72b829add99a50 from qemu	2019-02-22 19:08:57 -05:00
Richard Henderson	f16dcbe226	target/arm: Rearrange Floating-point data-processing (2 regs) There are lots of special cases within these insns. Split the major argument decode/loading/saving into no_output (compares), rd_is_dp, and rm_is_dp. We still need to special case argument load for compare (rd as input, rm as zero) and vcvt fixed (rd as input+output), but lots of special cases do disappear. Now that we have a full switch at the beginning, hoist the ISA checks from the code generation. Backports commit e80941bd64cc388554770fd72334e9e7d459a1ef from qemu	2019-02-22 18:57:25 -05:00
Richard Henderson	dbe623dacc	target/arm: Split out vfp_helper.c Move all of the fp helpers out of helper.c into a new file. This is code movement only. Since helper.c has no copyright header, take the one from cpu.h for the new file. Backports commit 37356079fcdb34e13abbed8ea0c00ca880c31247 from qemu	2019-02-22 18:48:44 -05:00
Richard Henderson	d6fbc0f4f3	target/arm: Restructure disas_fp_int_conv For opcodes 0-5, move some if conditions into the structure of a switch statement. For opcodes 6 & 7, decode everything at once with a second switch. Backports commit 3c3ff68492c2d00bd8cb39ed2d02bdaf5caf5cb8 from qemu	2019-02-22 18:39:08 -05:00
Aaron Lindsay OS	5c153537f5	target/arm: Stop unintentional sign extension in pmu_init This was introduced by commit bf8d09694ccc07487cd73d7562081fdaec3370c8 target/arm: Don't clear supported PMU events when initializing PMCEID1 and identified by Coverity (CID 1398645). Backports commit 67da43d668320e1bcb0a0195aaf2de4ff2a001a0 from qemu	2019-02-22 18:32:10 -05:00
Peter Maydell	928f226ed6	target/arm: v8M MPU should use background region as default, not always The "background region" for a v8M MPU is a default which will be used (if enabled, and if the access is privileged) if the access does not match any specific MPU region. We were incorrectly using it always (by putting the condition at the wrong nesting level). This meant that we would always return the default background permissions rather than the correct permissions for a specific region, and also that we would not return the right information in response to a TT instruction. Move the check for the background region to the same place in the logic as the equivalent v8M MPUCheck() pseudocode puts it. This in turn means we must adjust the condition we use to detect matches in multiple regions to avoid false-positives. Backports commit cff21316c666c8053b1f425577e324038d0ca30d from qemu	2019-02-22 18:30:44 -05:00
Richard Henderson	5c34cab41c	target/arm: Add missing clear_tail calls Fortunately, the functions affected are so far only called from SVE, so there is no tail to be cleared. But as we convert more of AdvSIMD to gvec, this will matter. Backports commit d8efe78e8039511b95c23d75bb48eca6873fbb0f from qemu	2019-02-15 18:15:20 -05:00
Richard Henderson	f3cb92c86c	target/arm: Use vector operations for saturation For same-sign saturation, we have tcg vector operations. We can compute the QC bit by comparing the saturated value against the unsaturated value. Backports commit 89e68b575e138d0af1435f11a8ffcd8779c237bd from qemu	2019-02-15 18:14:09 -05:00
Richard Henderson	10d468f601	target/arm: Split out FPSCR.QC to a vector field Change the representation of this field such that it is easy to set from vector code. Backports commit a4d5846245c5e029e5aa3945a9bda1de1c3fedbf from qemu	2019-02-15 18:04:13 -05:00
Richard Henderson	356b70e931	target/arm: Fix set of bits kept in xregs[ARM_VFP_FPSCR] Given that we mask bits properly on set, there is no reason to mask them again on get. We failed to clear the exception status bits, 0x9f, which means that the wrong value would be returned on get. Except in the (probably normal) case in which the set clears all of the bits. Simplify the code in set to also clear the RES0 bits. Backports commit 18aaa59c622208743565307668a2100ab24f7de9 from qemu	2019-02-15 18:00:57 -05:00
Richard Henderson	ca4bb1b4bc	target/arm: Split out flags setting from vfp compares Minimize the code within a macro by splitting out a helper function. Use deposit32 instead of manual bit manipulation. Backports commit 55a889456ef78f3f9b8eae9846c2f1453b1dd77b from qemu	2019-02-15 17:59:34 -05:00
Richard Henderson	4e44043956	target/arm: Fix arm_cpu_dump_state vs FPSCR Backports commit ec527e4eeccc31e3beadf3b61b66c61bbd873811 from qemu	2019-02-15 17:58:25 -05:00
Richard Henderson	ed7c9d0710	target/arm: Remove neon min/max helpers These are now unused. Backports commit a5c5dc53c4688efc149b235361d2d49869e77139 from qemu	2019-02-15 17:57:18 -05:00
Richard Henderson	198befc50e	target/arm: Use tcg integer min/max primitives for neon The 32-bit PMIN/PMAX has been decomposed to scalars, and so can be trivially expanded inline. Backports commit 9ecd3c5c1651fa7f9adbedff4806a2da0b50490c from qemu	2019-02-15 17:55:11 -05:00
Richard Henderson	eee33bd692	target/arm: Use vector minmax expanders for aarch32 Backports commit 6f2782218230bbb33fa22f9a2f73f8a570046007 from qemu	2019-02-15 17:54:05 -05:00
Richard Henderson	96d1df966b	target/arm: Use vector minmax expanders for aarch64 Backports commit 264d2a481a6c34dfda53be3fbea66116bcef9c5a from qemu	2019-02-15 17:52:36 -05:00
Richard Henderson	d147946edc	target/arm: Rely on optimization within tcg_gen_gvec_or Since we're now handling a == b generically, we no longer need to do it by hand within target/arm/. Backports commit 2900847ff4c862887af750935a875059615f509a from qemu	2019-02-15 17:50:28 -05:00
Alex Bennée	bf9c8499ca	target/arm: expose remaining CPUID registers as RAZ There are a whole bunch more registers in the CPUID space which are currently not used but are exposed as RAZ. To avoid too much duplication we expand ARMCPRegUserSpaceInfo to understand glob patterns so we only need one entry to tweak whole ranges of registers. Backports commit d040242effe47850060d2ef1c461ff637d88a84d from qemu	2019-02-15 17:48:37 -05:00
Alex Bennée	890983f186	target/arm: expose MPIDR_EL1 to userspace As this is a single register we could expose it with a simple ifdef but we use the existing modify_arm_cp_regs mechanism for consistency. Backports commit 522641660c3de64ed8322b8636c58625cd564a3f from qemu	2019-02-15 17:29:23 -05:00
Alex Bennée	babf31dfa0	target/arm: expose CPUID registers to userspace A number of CPUID registers are exposed to userspace by modern Linux kernels thanks to the "ARM64 CPU Feature Registers" ABI. For QEMU's user-mode emulation we don't need to emulate the kernels trap but just return the value the trap would have done. To avoid too much #ifdef hackery we process ARMCPRegInfo with a new helper (modify_arm_cp_regs) before defining the registers. The modify routine is driven by a simple data structure which describes which bits are exported and which are fixed. Backports commit 6c5c0fec29bbfe36c64eca1edfd8455be46b77c6 from qemu	2019-02-15 17:27:30 -05:00
Alex Bennée	0a51e5055f	target/arm: relax permission checks for HWCAP_CPUID registers Although technically not visible to userspace the kernel does make them visible via a trap and emulate ABI. We provide a new permission mask (PL0U_R) which maps to PL0_R for CONFIG_USER builds and adjust the minimum permission check accordingly. Backports commit b5bd7440422bb66deaceb812bb9287a6a3cdf10c from qemu	2019-02-15 17:18:06 -05:00
Catherine Ho	841ac2b3bb	target/arm: Fix int128_make128 lo, hi order in paired_cmpxchg64_be The lo,hi order is different from the comments. And in commit 1ec182c33379 ("target/arm: Convert to HAVE_CMPXCHG128"), it changes the original code logic. So just restore the old code logic before this commit: do_paired_cmpxchg64_be(): cmpv = int128_make128(env->exclusive_high, env->exclusive_val); newv = int128_make128(new_hi, new_lo); This fixes a bug that would only be visible for big-endian AArch64 guest code. Fixes: 1ec182c33379 ("target/arm: Convert to HAVE_CMPXCHG128") Backports commit abd5abc58c5d4c9bd23427b0998a44eb87ed47a2 from qemu	2019-02-15 17:16:55 -05:00
Peter Maydell	31813bafe2	target/arm: Implement HACR_EL2 HACR_EL2 is a register with IMPDEF behaviour, which allows implementation specific trapping to EL2. Implement it as RAZ/WI, since QEMU's implementation has no extra traps. This also matches what h/w implementations like Cortex-A53 and A57 do. Backports commit 831a2fca343ebcd6651eab9102bd7a36b77da65d from qemu	2019-02-15 17:15:41 -05:00
Aaron Lindsay OS	af17f7fa59	target/arm: Fix CRn to be 14 for PMEVTYPER/PMEVCNTR This bug was introduced in: commit 5ecdd3e47cadae83a62dc92b472f1fe163b56f59 target/arm: Finish implementation of PM[X]EVCNTR and PM[X]EVTYPER Backports commit 62c7ec3488fe0dcbabffd543f458914e27736115 from qemu	2019-02-15 17:12:04 -05:00
Peter Maydell	04676ed074	target/arm: Make FPSCR/FPCR trapped-exception bits RAZ/WI The {IOE, DZE, OFE, UFE, IXE, IDE} bits in the FPSCR/FPCR are for enabling trapped IEEE floating point exceptions (where IEEE exception conditions cause a CPU exception rather than updating the FPSR status bits). QEMU doesn't implement this (and nor does the hardware we're modelling), but for implementations which don't implement trapped exception handling these control bits are supposed to be RAZ/WI. This allows guest code to test for whether the feature is present by trying to write to the bit and checking whether it sticks. QEMU is incorrectly making these bits read as written. Make them RAZ/WI as the architecture requires. In particular this was causing problems for the NetBSD automatic test suite. Backports commit a15945d98d3a3390c3da344d1b47218e91e49d8b from qemu	2019-02-05 17:45:22 -05:00
Richard Henderson	9b0e04f3ab	target/arm: Enable TBI for user-only This has been enabled in the linux kernel since v3.11 (commit d50240a5f6cea, 2013-09-03, "arm64: mm: permit use of tagged pointers at EL0"). Backports commit f6a148fef63698826e69ca91cc11877ab1ed786f from qemu	2019-02-05 17:44:17 -05:00
Peter Maydell	8124b9f975	target/arm: Compute TB_FLAGS for TBI for user-only Enables, but does not turn on, TBI for CONFIG_USER_ONLY. Backports commit c47eaf9fc2af68cfbdbd9ae31f8e2e5ebb7022b4 from qemu	2019-02-05 17:43:11 -05:00
Richard Henderson	b928902908	target/arm: Clean TBI for data operations in the translator This will allow TBI to be used in user-only mode, as well as avoid ping-ponging the softmmu TLB when TBI is in use. It will also enable other armv8 extensions. Backports commit 3a471103ac1823bafc907962dcaf6bd4fc0942a2 from qemu	2019-02-05 17:39:12 -05:00
Richard Henderson	5c6ffde710	target/arm: Add TBFLAG_A64_TBID, split out gen_top_byte_ignore Split out gen_top_byte_ignore in preparation of handling these data accesses; the new tbflags field is not yet honored. Backports commit 4a9ee99db38ba513bf1e8f43665b79c60accd017 from qemu	2019-02-05 17:20:11 -05:00
Richard Henderson	fbd8992e27	target/arm: Enable BTI for -cpu max Backports commit a15daafa1cba96ff28abdfb6c860e0939655dbd1 from qemu	2019-02-05 17:15:32 -05:00
Richard Henderson	4dc5f80683	target/arm: Set btype for indirect branches Backports commit 001d47b6efbe4795ed77366986b8ef384ab8b127 from qemu	2019-02-05 17:14:16 -05:00
Richard Henderson	11736a659b	target/arm: Reset btype for direct branches This is all of the non-exception cases of DISAS_NORETURN. Backports commit 358622703583d2e2967e0a93da990e747dcc3ac6 from qemu	2019-02-05 17:11:59 -05:00
Richard Henderson	88193cf7c3	target/arm: Default handling of BTYPE during translation The branch target exception for guarded pages has high priority, and only 8 instructions are valid for that case. Perform this check before doing any other decode. Clear BTYPE after all insns that neither set BTYPE nor exit via exception (DISAS_NORETURN). Not yet handled are insns that exit via DISAS_NORETURN for some other reason, like direct branches. Backports commit 51bf0d7aa91a9d4e2563240a42e6cb705cef84aa from qemu	2019-02-05 17:05:31 -05:00
Richard Henderson	d594b2047f	target/arm: Cache the GP bit for a page in MemTxAttrs Caching the bit means that we will not have to re-walk the page tables to look up the bit during translation. Backports commit 1bafc2ba7e6bfe89fff3503fdac8db39c973de48 from qemu	2019-02-05 17:02:19 -05:00
Richard Henderson	cf3ac035bc	target/arm: Add BT and BTYPE to tb->flags Backports commit 08f1434a71ddf2bdfdb034dcd24b24464d1efd72 from qemu	2019-02-05 16:59:53 -05:00
Richard Henderson	a99119ce39	target/arm: Add PSTATE.BTYPE Place this in its own field within ENV, as that will make it easier to reset from within TCG generated code. With the change to pstate_read/write, exception entry and return are automatically handled. Backports commit f6e52eaac13b6947f4406c127e3090c898e439c9 from qemu	2019-02-05 16:57:51 -05:00
Richard Henderson	6b4f7a28b5	target/arm: Introduce isar_feature_aa64_bti Also create field definitions for id_aa64pfr1 from ARMv8.5. Backports commit be53b6f4d7ace2e6a018e45af825069ccb7bab66 from qemu	2019-02-05 16:56:01 -05:00
Remi Denis-Courmont	0b7f1ff086	target/arm: fix decoding of B{,L}RA{A,B} A flawed test lead to the instructions always being treated as unallocated encodings. Fixes: https://bugs.launchpad.net/bugs/1813460 Backports commit 1cf86a8618644beb860951ff4383457ee88a7f4a from qemu	2019-02-03 17:55:31 -05:00
Remi Denis-Courmont	a20bb60f06	target/arm: fix AArch64 virtual address space size Since QEMU does not support the ARMv8.2-LVA, Large Virtual Address, extension (yet), the VA address space is 48-bits plus a sign bit. User mode can only handle the positive half of the address space, so that makes a limit of 48 bits. (With LVA, it would be 53 and 52 bits respectively.) The incorrectly large address space conflicts with PAuth instructions, which use bits 48-54 and 56-63 for the pointer authentication code. This also conflicts with (as yet unsupported by QEMU) data tagging and with the ARMv8.5-MTE extension. Backports commit f6768aa1b4c6a80448eabd22bb9b4123c709caea from qemu	2019-02-03 17:55:30 -05:00
Richard Henderson	932c4e8569	target/arm: Always enable pac keys for user-only Drop the pac properties. This approach cannot work as written because the properties are applied before arm_cpu_reset, which zeros SCTLR_EL1 (amongst everything else). We can re-introduce the properties if they turn out to be useful. But since linux 5.0 enables all of the keys, they may not be. Backports commit 276c6e813719568bdc9743e87ff8f42115006206 from qemu	2019-02-03 17:55:30 -05:00
Julia Suvorova	93acc4dc56	arm: Clarify the logic of set_pc() Until now, the set_pc logic was unclear, which raised questions about whether it should be used directly, applying a value to PC or adding additional checks, for example, set the Thumb bit in Arm cpu. Let's set the set_pc logic for “Configure the PC, as was done in the ELF file” and implement synchronize_with_tb hook for preserving PC to cpu_tb_exec. Backports commit 42f6ed919325413392bea247a1e6f135deb469cd from qemu	2019-02-03 17:55:30 -05:00
Richard Henderson	50cf5634da	target/arm: Enable API, APK bits in SCR, HCR These bits become writable with the ARMv8.3-PAuth extension. Backports commit ef682cdb4aded5c65a018e175482e875de66059d from qemu	2019-02-03 17:55:30 -05:00
Aaron Lindsay OS	0dd5bf84fc	target/arm: Send interrupts on PMU counter overflow Whenever we notice that a counter overflow has occurred, send an interrupt. This is made more reliable with the addition of a timer in a follow-on commit. Backports commit f4efb4b2a17528837cb445f9bdfaef8df4a5acf7 from qemu	2019-02-03 17:55:30 -05:00
Peter Maydell	edfb13f8eb	target/arm/translate-a64: Fix mishandling of size in FCMLA decode In disas_simd_indexed(), for the case of "complex fp", each indexable element is a complex pair, so the total size is twice that indicated in the 'size' field in the encoding. We were trying to do this "double the size" operation with a left shift by 1, but this is incorrect because the 'size' field is a MO_8/MO_16/MO_32/MO_64 value, and doubling the size should be done by a simple increment. This meant we were mishandling FCMLA (by element) of values where the real and imaginary parts are 32-bit floats, and would incorrectly UNDEF this encoding. (No other insns take this code path, and for 16-bit floats it happens that 1 << 1 and 1 + 1 are both the same). Backports commit eaefb97a8b97dbf42c016fe65b68b92f99a346f6 from qemu	2019-02-03 17:55:30 -05:00
Peter Maydell	f999e06c22	target/arm/translate-a64: Fix FCMLA decoding error The FCMLA (by element) instruction exists in the "vector x indexed element" encoding group, but not in the "scalar x indexed element" group. Correctly UNDEF the unallocated encodings. Backports commit 4dfabb6d568e6b315594d7d464dacaf3368aff60 from qemu	2019-02-03 17:55:30 -05:00
Peter Maydell	eaecbe7901	target/arm/translate-a64: Don't underdecode SDOT and UDOT In the AdvSIMD scalar x indexed element and vector x indexed element encoding group, the SDOT and UDOT instructions are vector only, and their opcode is unallocated in the scalar group. Correctly UNDEF this unallocated encoding. Backports commit 4977986ca38fb1d5357532e1a8032b984047a369 from qemu	2019-02-03 17:55:30 -05:00
Peter Maydell	f7d78d9e08	target/arm/translate-a64: Don't underdecode FP insns In the encoding groups * floating-point data-processing (1 source) * floating-point data-processing (2 source) * floating-point data-processing (3 source) * floating-point immediate * floating-point compare * floating-ponit conditional compare * floating-point conditional select bit 31 is M and bit 29 is S (and bit 30 is 0, already checked at this point in the decode). None of these groups allocate any encoding for M=1 or S=1. We checked this in disas_fp_compare(), disas_fp_ccomp() and disas_fp_csel(), but missed it in disas_fp_1src(), disas_fp_2src(), disas_fp_3src() and disas_fp_imm(). We also missed that in the fp immediate encoding the imm5 field must be all zeroes. Correctly UNDEF the unallocated encodings here. Backports commit c1e20801f5ee53472dbf2757df605543f3f4ce0b from qemu	2019-02-03 17:55:29 -05:00
Peter Maydell	1128b4d77d	target/arm/translate-a64: Don't underdecode add/sub extended register In the "add/subtract (extended register)" encoding group, the "opt" field in bits [23:22] must be zero. Correctly UNDEF the unallocated encodings where this field is not zero. Backports commit 4f61106614410945b1d1c93081544ad5b13044fc from qemu	2019-02-03 17:55:29 -05:00
Peter Maydell	decebb5936	target/arm/translate-a64: Don't underdecode SIMD ld/st single In the AdvSIMD load/store single structure encodings, the non-post-indexed case should have zeroes in [20:16] (which is the Rm field for the post-indexed case). Bit 31 must also be zero (a check we got right in ldst_multiple but not here). Correctly UNDEF these unallocated encodings. Backports commit 9c72b68ad746a51f63822cffab4d144b5957823a from qemu	2019-02-03 17:55:29 -05:00
Peter Maydell	60ccaf56ac	target/arm/translate-a64: Don't underdecode SIMD ld/st multiple In the AdvSIMD load/store multiple structures encodings, the non-post-indexed case should have zeroes in [20:16] (which is the Rm field for the post-indexed case). Correctly UNDEF the currently unallocated encodings which have non-zeroes in those bits. Backports commit e1f220811dbd5d85fb02ff286358f9ee6188938f from qemu	2019-02-03 17:55:29 -05:00
Peter Maydell	80248fecb6	target/arm/translate-a64: Don't underdecode PRFM The PRFM prefetch insn in the load/store with imm9 encodings requires idx field 0b00; we were underdecoding this by only checking !is_unpriv (which is equivalent to idx != 2). Correctly UNDEF the unallocated encodings where idx == 0b01 and 0b11 as well as 0b10. Backports commit a80c4256543987ca88407349ee012a673a10a2ae from qemu	2019-02-03 17:55:29 -05:00
Peter Maydell	147269ed81	target/arm/translate-a64: Don't underdecode system instructions The "system instructions" and "system register move" subcategories of "branches, exception generating and system instructions" for A64 only apply if bits [23:22] are zero; other values are currently unallocated. Correctly UNDEF these unallocated encodings. Backports commit 08d5e3bde6b4ad32996bf69d93aa66ae43d3f3ff from qemu	2019-02-03 17:55:29 -05:00
Lioncash	8eaa850287	target/arm/vec_helper: Remove use of void pointer arithmetic This is a GNU-specific extension.	2019-01-30 14:03:26 -05:00
Aaron Lindsay OS	8d7bb2cab3	target/arm: Don't clear supported PMU events when initializing PMCEID1 A bug was introduced during a respin of: commit 57a4a11b2b281bb548b419ca81bfafb214e4c77a target/arm: Add array for supported PMU events, generate PMCEID[01]_EL0 This patch introduced two calls to get_pmceid() during CPU initialization - one each for PMCEID0 and PMCEID1. In addition to building the register values, get_pmceid() clears an internal array mapping event numbers to their implementations (supported_event_map) before rebuilding it. This is an optimization since much of the logic is shared. However, since it was called twice, the contents of supported_event_map reflect only the events in PMCEID1 (the second call to get_pmceid()). Fix this bug by moving the initialization of PMCEID0 and PMCEID1 back into a single function call, and name it more appropriately since it is doing more than simply generating the contents of the PMCEID[01] registers. Backports commit bf8d09694ccc07487cd73d7562081fdaec3370c8 from qemu	2019-01-29 17:12:23 -05:00
Richard Henderson	64f10fa075	target/arm: Fix validation of 32-bit address spaces for aa32 When tsz == 0, aarch32 selects the address space via exclusion, and there are no "top_bits" remaining that require validation. Fixes: ba97be9f4a4 Backports commit 36d820af0eddf4fc6a533579b052d8f0085a9fb8 from qemu	2019-01-29 16:46:36 -05:00
Lioncash	29d84a9296	target: Resolve repeated typedef warnings	2019-01-22 20:27:35 -05:00
Aaron Lindsay	cfd84d84f6	target/arm: Implement PMSWINC Backports commit 0d4bfd7df809863b1f45fad35229fb9419527d06 from qemu	2019-01-22 18:59:26 -05:00
Aaron Lindsay	9727b7c3e0	target/arm: PMU: Set PMCR.N to 4 This both advertises that we support four counters and enables them because the pmu_num_counters() reads this value from PMCR. Backports commit ac689a2e5155d129acaa39603e2a7a29abd90d89 from qemu	2019-01-22 18:55:34 -05:00
Aaron Lindsay	dede23994a	target/arm: PMU: Add instruction and cycle events The instruction event is only enabled when icount is used, cycles are always supported. Always defining get_cycle_count (but altering its behavior depending on CONFIG_USER_ONLY) allows us to remove some CONFIG_USER_ONLY #defines throughout the rest of the code. Backports commit b2e2372511946fae86fbb8709edec7a41c6f3167 from qemu	2019-01-22 18:54:01 -05:00
Aaron Lindsay	1a815a1afc	target/arm: Finish implementation of PM[X]EVCNTR and PM[X]EVTYPER Add arrays to hold the registers, the definitions themselves, access functions, and logic to reset counters when PMCR.P is set. Update filtering code to support counters other than PMCCNTR. Support migration with raw read/write functions. Backports commit 5ecdd3e47cadae83a62dc92b472f1fe163b56f59 from qemu	2019-01-22 18:07:53 -05:00
Aaron Lindsay	c8c3defb18	target/arm: Add array for supported PMU events, generate PMCEID[01]_EL0 This commit doesn't add any supported events, but provides the framework for adding them. We store the pm_event structs in a simple array, and provide the mapping from the event numbers to array indexes in the supported_event_map array. Because the value of PMCEID[01] depends upon which events are supported at runtime, generate it dynamically. Backports commit 57a4a11b2b281bb548b419ca81bfafb214e4c77a from qemu	2019-01-22 17:45:36 -05:00
Aaron Lindsay	67accd4d88	target/arm: Make PMCEID[01]_EL0 64 bit registers, add PMCEID[23] Backports commit cad8673744d0914587cd7380e70df11e8c4a0f50 from qemu	2019-01-22 17:32:29 -05:00
Aaron Lindsay	346e4226ec	target/arm: Define FIELDs for ID_DFR0 This is immediately necessary for the PMUv3 implementation to check ID_DFR0.PerfMon to enable/disable specific features, but defines the full complement of fields for possible future use elsewhere. Backports commit beceb99c0c1218d0b55cc04ce6ef77579d3416cb from qemu	2019-01-22 17:26:25 -05:00
Aaron Lindsay	4ff0389d2d	target/arm: Implement PMOVSSET Add an array for PMOVSSET so we only define it for v7ve+ platforms Backports commit 327dd5108bb3b66bb5102b659e83a4348a005f5e from qemu	2019-01-22 17:25:22 -05:00
Aaron Lindsay	d6d1d49cdd	target/arm: Allow AArch32 access for PMCCFILTR Backports commit 4b8afa1f99c3575f6cfb26770f90e9fd7bc45468 from qemu	2019-01-22 17:19:03 -05:00
Aaron Lindsay	b916fb2cae	target/arm: Filter cycle counter based on PMCCFILTR_EL0 Rename arm_ccnt_enabled to pmu_counter_enabled, and add logic to only return 'true' if the specified counter is enabled and neither prohibited or filtered. Backports commit 033614c47de78409ad3fb39bb7bd1483b71c6789 from qemu	2019-01-22 17:12:46 -05:00
Aaron Lindsay	b6b0c18ce5	target/arm: Swap PMU values before/after migrations Because of the PMU's design, many register accesses have side effects which are inter-related, meaning that the normal method of saving CP registers can result in inconsistent state. These side-effects are largely handled in pmu_op_start/finish functions which can be called before and after the state is saved/restored. By doing this and adding raw read/write functions for the affected registers, we avoid migration-related inconsistencies. Backports relevant parts of commit 980ebe87053792a5bdefaa87777c40914fd4f673 from qemu	2019-01-22 17:01:34 -05:00
Aaron Lindsay	001283c45b	target/arm: Reorganize PMCCNTR accesses pmccntr_read and pmccntr_write contained duplicate code that was already being handled by pmccntr_sync. Consolidate the duplicated code into two functions: pmccntr_op_start and pmccntr_op_finish. Add a companion to c15_ccnt in CPUARMState so that we can simultaneously save both the architectural register value and the last underlying cycle count - this ensures time isn't lost and will also allow us to access the 'old' architectural register value in order to detect overflows in later patches. Backports commit 5d05b9d462666ed21b7fef61aa45dec9aaa9f0ff from qemu	2019-01-22 16:57:29 -05:00
Richard Henderson	8c14f0ea57	target/arm: Tidy TBI handling in gen_a64_set_pc We can perform this with fewer operations. Backports commit 8733d762d5e089a017ac7c5bc71077a8d75d1b46 from qemu	2019-01-22 16:46:11 -05:00
Richard Henderson	1939b28276	target/arm: Enable PAuth for user-only Add 4 attributes that controls the EL1 enable bits, as we may not always want to turn on pointer authentication with -cpu max. However, by default they are enabled. Backports relevant parts of commit 1ae9cfbd470bffb8d9bacd761344e9b5e8adecb6 from qemu.	2019-01-22 16:42:18 -05:00
Richard Henderson	7f03b024d2	target/arm: Enable PAuth for -cpu max Backports commit 1ce32e47db52e3511132c7104770eae65d412144 from qemu	2019-01-22 16:38:07 -05:00
Richard Henderson	6e1c9df11f	target/arm: Add PAuth system registers Backports commit 967aa94f2b0df08d744b6d0bfa21031175e8a0e6 from qemu	2019-01-22 16:36:55 -05:00
Richard Henderson	d633a04e61	target/arm: Implement pauth_computepac This is the main crypto routine, an implementation of QARMA. This matches, as much as possible, ARM pseudocode. Backports commit 990870b205ddfdba3fd3c1321e6083005ef59d1a from qemu	2019-01-22 16:35:07 -05:00
Richard Henderson	e23d1ef408	target/arm: Implement pauth_addpac This is not really functional yet, because the crypto is not yet implemented. This, however follows the AddPAC pseudo function. Backports commit 63ff0ca94cb84764d2aee45b37c5502a54811dab from qemu	2019-01-22 16:33:25 -05:00
Richard Henderson	4281ae11c4	target/arm: Implement pauth_auth This is not really functional yet, because the crypto is not yet implemented. This, however follows the Auth pseudo function. Backports commit a7bfa086c973a51fc18116c9d2e22a0e0069edba from qemu	2019-01-22 16:32:19 -05:00
Richard Henderson	2545f8cf84	target/arm: Implement pauth_strip Stripping out the authentication data does not require any crypto, it merely requires the virtual address parameters. Backports commit 04d13549fa10bb9775a90701e4e6fd0a2cbf83cb from qemu	2019-01-22 16:31:11 -05:00
Richard Henderson	048d8da45c	target/arm: Reuse aa64_va_parameters for setting tbflags The arm_regime_tbi{0,1} functions are replacable with the new function by giving the lowest and highest address. Backports commit 5d8634f5a3a8474525edcfd581a659830e9e97c0 from qemu	2019-01-22 16:29:54 -05:00
Richard Henderson	028aef155a	target/arm: Decode TBID from TCR Use TBID in aa64_va_parameters depending on the data parameter. This automatically updates all existing users of the function. Backports commit 8220af7e4d34c858898fbfe55943aeea8f4e875f from qemu	2019-01-22 16:27:37 -05:00
Richard Henderson	b99e2f920b	target/arm: Add aa64_va_parameters_both We will want to check TBI for I and D simultaneously. Backports commit e737ed2ad8c14b4b82ed241646ffa370d29d0937 from qemu	2019-01-22 16:25:12 -05:00
Richard Henderson	23b162f2fb	target/arm: Export aa64_va_parameters to internals.h We need to reuse this from helper-a64.c. Provide a stub definition for CONFIG_USER_ONLY. This matches the stub definitions that we removed for arm_regime_tbi{0,1} before. Backports commit bf0be433878935e824479e8ae890493e1fb646ed from qemu	2019-01-22 16:22:57 -05:00
Richard Henderson	3fbde0ab73	target/arm: Merge TBFLAG_AA_TB{0, 1} to TBII We will shortly want to talk about TBI as it relates to data. Passing around a pair of variables is less convenient than a single variable. Backports commit 476a4692f06e381117fb7ad0d04d37c9c2612198 from qemu	2019-01-22 16:19:57 -05:00
Richard Henderson	b6415f7a4b	target/arm: Create ARMVAParameters and helpers Split out functions to extract the virtual address parameters. Let the functions choose T0 or T1 address space half, if present. Extract (most of) the control bits that vary between EL or Tx. Backports commit ba97be9f4a4ecaf16a1454dc669e5f3d935d3b63 from qemu	2019-01-22 16:17:16 -05:00
Richard Henderson	377bd123bd	target/arm: Introduce arm_stage1_mmu_idx While we could expose stage_1_mmu_idx, the combination is probably going to be more useful. Backports commit 64be86ab1b5ef10b660a4230ee7f27c0da499043 from qemu	2019-01-22 16:08:37 -05:00
Richard Henderson	9743787d0f	target/arm: Introduce arm_mmu_idx The pattern ARMMMUIdx mmu_idx = core_to_arm_mmu_idx(env, cpu_mmu_index(env, false)); is computing the full ARMMMUIdx, stripping off the ARM bits, and then putting them back. Avoid the extra two steps with the appropriate helper function. Backports commit 50494a279dab22a015aba9501a94fcc3cd52140e from qemu	2019-01-22 16:06:34 -05:00
Richard Henderson	8856234574	target/arm: Move cpu_mmu_index out of line This function is, or will shortly become, too big to inline. Backports commit 65e4655c609a4a2fd428459d3efb62b704488fd6 from qemu	2019-01-22 16:02:39 -05:00
Richard Henderson	5de5903d1c	target/arm: Decode Load/store register (pac) Not that there are any stores involved, but why argue with ARM's naming convention. Backports commit bd889f4810839a2b68e339696ccf7c406cd62879 from qemu	2019-01-22 15:58:22 -05:00
Richard Henderson	8356fc365f	target/arm: Decode PAuth within disas_uncond_b_reg Backports commit 561c0a33080f1058a7671d0223122c98e4c9c1f0 from qemu	2019-01-22 15:55:14 -05:00
Richard Henderson	0c09366d68	target/arm: Rearrange decode in disas_uncond_b_reg This will enable PAuth decode in a subsequent patch. Backports commit f7cf3bfc1b65a8c550b8439374c0060f7311ad50 from qemu	2019-01-22 15:50:45 -05:00
Richard Henderson	4329ded5e2	target/arm: Add new_pc argument to helper_exception_return Backports commit d9f482a0279773c3253092ea149daaf00358c635 from qemu	2019-01-22 15:48:15 -05:00
Richard Henderson	0c6f58ebc6	target/arm: Move helper_exception_return to helper-a64.c This function is only used by AArch64. Code movement only. Backports commit ce02fd99e6d53df6f3cf5eca85bcac403b402510 from qemu	2019-01-22 15:44:53 -05:00
Richard Henderson	febfda4e98	target/arm: Decode PAuth within disas_data_proc_2src Backports commit b6342a9f929b5946beb6726eef2ef2c5f4a11f09 from qemu	2019-01-22 15:41:44 -05:00
Richard Henderson	999222f0a1	target/arm: Decode PAuth within disas_data_proc_1src Backports commit 95ebd99dcd37b8574426c876502bfcc7c299584b from qemu	2019-01-22 15:39:53 -05:00
Richard Henderson	967e6a1e90	target/arm: Rearrange decode in disas_data_proc_1src Now properly signals unallocated for REV64 with SF=0. Allows for the opcode2 field to be decoded shortly. Backports commit 18de2813c35e359621a24a0a2a77570e83cb73b9 from qemu	2019-01-22 15:35:44 -05:00
Richard Henderson	02fd7e2472	target/arm: Decode PAuth within system hint space Backports commit 7c94c8343c6a0eea1633a65ed27987b6a71b9089 from qemu	2019-01-22 15:33:27 -05:00
Richard Henderson	e6196b2040	target/arm: Add PAuth helpers The cryptographic internals are stubbed out for now, but the enable and trap bits are checked. Backports commit 0d43e1a2d29a05f7b0d5629caaff18733cbdf3bb from qemu	2019-01-22 15:27:15 -05:00
Richard Henderson	1f7d228c8a	target/arm: Introduce raise_exception_ra This path uses cpu_loop_exit_restore to unwind current processor state. Backports commit 7469f6c696d74ad3b22b67c08e1e8f79e2b5d3d6 from qemu	2019-01-22 15:20:06 -05:00
Richard Henderson	e6ffbc22c2	target/arm: Add PAuth active bit to tbflags There are 5 bits of state that could be added, but to save space within tbflags, add only a single enable bit. Helpers will determine the rest of the state at runtime. Backports commit 0816ef1bfcd3ac53e7454b62ca436727887f6056 from qemu	2019-01-22 15:15:59 -05:00
Richard Henderson	54721be30e	target/arm: Add SCTLR bits through ARMv8.5 Post v8.4 bits taken from SysReg_v85_xml-00bet8. Backports commit b2af69d0b6e0edcb789ca1fcc1199bbfde6ed986 from qemu	2019-01-22 15:14:02 -05:00
Richard Henderson	8ef39cc2d5	target/arm: Add state for the ARMv8.3-PAuth extension Add storage space for the 5 encryption keys. Backports commit 991ad91b6a1f09a6ad62b6e6da78d83b548daec7 from qemu	2019-01-22 15:11:39 -05:00
Alexander Graf	f2682ff309	target/arm: Allow Aarch32 exception return to switch from Mon->Hyp In U-boot, we switch from S-SVC -> Mon -> Hyp mode when we want to enter Hyp mode. The change into Hyp mode is done by doing an exception return from Mon. This doesn't work with current QEMU. The problem is that in bad_mode_switch() we refuse to allow the change of mode. Note that bad_mode_switch() is used to do validation for two situations: (1) changes to mode by instructions writing to CPSR.M (ie not exception take/return) -- this corresponds to the Armv8 Arm ARM pseudocode Arch32.WriteModeByInstr (2) changes to mode by exception return Attempting to enter or leave Hyp mode via case (1) is forbidden in v8 and UNPREDICTABLE in v7, and QEMU is correct to disallow it there. However, we're already doing that check at the top of the bad_mode_switch() function, so if that passes then we should allow the case (2) exception return mode changes to switch into Hyp mode. We want to test whether we're trying to return to the nonexistent "secure Hyp" mode, so we need to look at arm_is_secure_below_el3() rather than arm_is_secure(), since the latter is always true if we're in Mon (EL3). Backports commit 2d2a4549cc29850aab891495685a7b31f5254b12 from qemu	2019-01-22 15:09:30 -05:00
Peter Maydell	55bc017af4	target/arm: Emit barriers for A32/T32 load-acquire/store-release insns Now that MTTCG is here, the comment in the 32-bit Arm decoder that "Since the emulation does not have barriers, the acquire/release semantics need no special handling" is no longer true. Emit the correct barriers for the load-acquire/store-release insns, as we already do in the A64 decoder. Backports commit 96c552958dbb63453b5f02bea6e704006d50e39a from qemu	2019-01-13 19:48:27 -05:00
Richard Henderson	254f882efc	target/arm: SVE brk[ab] merging does not have s bit While brk[ab] zeroing has a flags setting option, the merging variant does not. Retain the same argument structure, to share expansion but force the flag zero and do not decode bit 22. Backports commit 407e6ce7f1f428cb242d424cd35381a77b5b2071 from qemu	2019-01-13 19:39:34 -05:00
Richard Henderson	4d8b7a9967	target/arm: Convert ARM_TBFLAG_* to FIELDs Use "register" TBFLAG_ANY to indicate shared state between A32 and A64, and "registers" TBFLAG_A32 & TBFLAG_A64 for fields that are specific to the given cpu state. Move ARM_TBFLAG_BE_DATA to shared state, instead of its current placement within "Bit usage when in AArch32 state". Backports commit aad821ac4faad369fad8941d25e59edf2514246b from qemu	2019-01-13 19:21:18 -05:00
Peter Maydell	b9d5e1d927	target/arm: Free name string in ARMCPRegInfo hashtable entries' When we add a new entry to the ARMCPRegInfo hash table in add_cpreg_to_hashtable(), we allocate memory for tehe ARMCPRegInfo struct itself, and we also g_strdup() the name string. So the hashtable's value destructor function must free the name string as well as the struct. Spotted by clang's leak sanitizer. The leak here is a small one-off leak at startup, because we don't support CPU hotplug, and so the only time when we destroy hash table entries is for the case where ARM_CP_OVERRIDE means we register a wildcard entry and then override it later. Backports commit ac87e5072e2cbfcf8e80caac7ef43ceb6914c7af from qemu	2018-12-18 05:09:59 -05:00
Richard Henderson	8816550c10	target/arm: Implement the ARMv8.1-LOR extension Provide a trivial implementation with zero limited ordering regions, which causes the LDLAR and STLLR instructions to devolve into the LDAR and STLR instructions from the base ARMv8.0 instruction set. Backports commit 2d7137c10fafefe40a0a049ff8a7bd78b66e661f from qemu	2018-12-18 04:36:58 -05:00
Richard Henderson	32208e482b	target/arm: Use arm_hcr_el2_eff more places Since arm_hcr_el2_eff includes a check against arm_is_secure_below_el3, we can often remove a nearby check against secure state. In some cases, sort the call to arm_hcr_el2_eff to the end of a short-circuit logical sequence. Backports commit 7c208e0f4171c9e2cc35efc12e1bf264a45c229f from qemu	2018-12-18 04:33:07 -05:00
Richard Henderson	78798d10eb	target/arm: Introduce arm_hcr_el2_eff Replace arm_hcr_el2_{fmo,imo,amo} with a more general routine that also takes SCR_EL3.NS (aka arm_is_secure_below_el3) into account, as documented for the plethora of bits in HCR_EL2. Backports commit f77784446045231f7dfa46c9b872091241fa1557 from qemu	2018-12-18 04:27:34 -05:00
Richard Henderson	4515df5ee7	target/arm: Implement the ARMv8.2-AA32HPD extension The bulk of the work here, beyond base HPD, is defining the TTBCR2 register. In addition we must check TTBCR.T2E, which is not present (RES0) for AArch64. Backports commit ab638a328fd099ba0b23c8c818eb39f2c35414f3 from qemu	2018-12-18 04:23:37 -05:00
Richard Henderson	3cfd660814	target/arm: Implement the ARMv8.1-HPD extension Since the TCR_*.HPD bits were RES0 in ARMv8.0, we can simply interpret the bits as if ARMv8.1-HPD is present without checking. We will need a slightly different check for hpd for aarch32. Backports commit 037c13c5904f5fc67bb0ab7dd91ae07347aedee9 from qemu	2018-12-18 04:20:39 -05:00
Richard Henderson	0e08c37773	target/arm: Tidy scr_write Because EL3 has a fixed execution mode, we can properly decide which of the bits are RES{0,1}. Backports commit ea22747c63c9a894777aa41a7af85c3d08e39f81 from qemu	2018-12-18 04:18:09 -05:00
Richard Henderson	52ac2b2f7c	target/arm: Fix HCR_EL2.TGE check in arm_phys_excp_target_el The enable for TGE has already occurred within arm_hcr_el2_amo and friends. Moreover, when E2H is also set, the sense is supposed to be reversed, which has also already occurred within the helpers. Backports commit 619959c3583dad325c36f09ce670e7d091382cae from qemu	2018-12-18 04:17:08 -05:00
Richard Henderson	a8fb6fc8d5	target/arm: Add SCR_EL3 bits up to ARMv8.5 Post v8.4 bits taken from SysReg_v85_xml-00bet8. Backports commit 99f8f86d365701fad695be606266aa7dac97ca1c from qemu	2018-12-18 04:16:30 -05:00
Richard Henderson	ad27526b58	target/arm: Add HCR_EL2 bits up to ARMv8.5 Post v8.3 bits taken from SysReg_v85_xml-00bet8. Backports commit 099bf53b7a4bf69f4c4007c769895263a9c3f6f4 from qemu	2018-12-18 04:15:50 -05:00
Peter Maydell	8b69824de7	target/arm: Move id_aa64mmfr* to ARMISARegisters At the same time, define the fields for these registers, and use those defines in arm_pamax(). Backports commit 3dc91ddbc68391f934bf6945853e99cf6810fc00 from qemu	2018-12-18 04:03:50 -05:00
Luc Michel	1ae9d988c6	target/arm: fix smc incorrectly trapping to EL3 when secure is off This commit fixes a case where the CPU would try to go to EL3 when executing an smc instruction, even though ARM_FEATURE_EL3 is false. This case is raised when the PSCI conduit is set to smc, but the smc instruction does not lead to a valid PSCI call. QEMU crashes with an assertion failure latter on because of incoherent mmu_idx. This commit refactors the pre_smc helper by enumerating all the possible way of handling an scm instruction, and covering the previously missing case leading to the crash. The following minimal test would crash before this commit: .global _start .text _start: ldr x0, =0xdeadbeef ; invalid PSCI call smc #0 run with the following command line: aarch64-linux-gnu-gcc -nostdinc -nostdlib -Wl,-Ttext=40000000 \ -o test test.s qemu-system-aarch64 -M virt,virtualization=on,secure=off \ -cpu cortex-a57 -kernel test Backports commit 7760da729ac88f112f98f36395ac3b55fc9e4211 from qemu	2018-11-23 18:57:23 -05:00
Peter Maydell	900747236f	target/arm/cpu: Give Cortex-A15 and -A7 the EL2 feature The Cortex-A15 and Cortex-A7 both have EL2; now we've implemented it properly we can enable the feature bit. Backports commit 436c0cbbeb38dd97c02fe921a7cb253a18afdd86 from qemu	2018-11-16 21:59:55 -05:00
Peter Maydell	61c0f40ac3	target/arm: Hyp mode R14 is shared with User and System Hyp mode is an exception to the general rule that each AArch32 mode has its own r13, r14 and SPSR -- it has a banked r13 and SPSR but shares its r14 with User and System mode. We were incorrectly implementing it as banked, which meant that on entry to Hyp mode r14 was 0 rather than the USR/SYS r14. We provide a new function r14_bank_number() which is like the existing bank_number() but provides the index into env->banked_r14[]; bank_number() provides the index to use for env->banked_r13[] and env->banked_cpsr[]. All the points in the code that were using bank_number() to index into env->banked_r14[] are updated for consintency: * switch_mode() -- this is the only place where we fix an actual bug * aarch64_sync_32_to_64() and aarch64_sync_64_to_32(): no behavioural change as we already special-cased Hyp R14 * kvm32.c: no behavioural change since the guest can't ever be in Hyp mode, but conceptually the right thing to do * msr_banked()/mrs_banked(): we can never get to the case that accesses banked_r14[] with tgtmode == ARM_CPU_MODE_HYP, so no behavioural change Backports commit 593cfa2b637b92d37eef949653840dc065cdb960 from qemu	2018-11-16 21:58:29 -05:00
Peter Maydell	92bf8ee620	target/arm: Correctly implement handling of HCR_EL2.{VI, VF} In commit 8a0fc3a29fc2315325400 we tried to implement HCR_EL2.{VI,VF}, but we got it wrong and had to revert it. In that commit we implemented them as simply tracking whether there is a pending virtual IRQ or virtual FIQ. This is not correct -- these bits cause a software-generated VIRQ/VFIQ, which is distinct from whether there is a hardware-generated VIRQ/VFIQ caused by the external interrupt controller. So we need to track separately the HCR_EL2 bit state and the external virq/vfiq line state, and OR the two together to get the actual pending VIRQ/VFIQ state. Fixes: 8a0fc3a29fc2315325400c738f807d0d4ae0ab7f Backports commit 89430fc6f80a5aef1d4cbd6fc26b40c30793786c from qemu	2018-11-16 21:53:53 -05:00
Peter Maydell	a9d07a19c0	target/arm: Track the state of our irq lines from the GIC explicitly Currently we track the state of the four irq lines from the GIC only via the cs->interrupt_request or KVM irq state. That means that we assume that an interrupt is asserted if and only if the external line is set. This assumption is incorrect for VIRQ and VFIQ, because the HCR_EL2.{VI,VF} bits allow assertion of VIRQ and VFIQ separately from the state of the external line. To handle this, start tracking the state of the external lines explicitly in a CPU state struct field, as is common practice for devices. The complicated part of this is dealing with inbound migration from an older QEMU which didn't have this state. We assume in that case that the older QEMU did not implement the HCR_EL2.{VI,VF} bits as generating interrupts, and so the line state matches the current state in cs->interrupt_request. (This is not quite true between commit 8a0fc3a29fc2315325400c7 and its revert, but that commit is broken and never made it into any released QEMU version.) Backports relevant parts of commit ed89f078ff3d6684ce3e538e4777a3bb4ec3e2b1 from qemu	2018-11-16 21:47:59 -05:00
Peter Maydell	730c99bcc2	Revert "target/arm: Implement HCR.VI and VF" This reverts commit 8a0fc3a29fc2315325400c738f807d0d4ae0ab7f. The implementation of HCR.VI and VF in that commit is not correct -- they do not track the overall "is there a pending VIRQ or VFIQ" status, but whether there is a pending interrupt due to "this mechanism", ie the hypervisor having set the VI/VF bits. The overall pending state for VIRQ and VFIQ is effectively the logical OR of the inbound lines from the GIC with the VI and VF bits. Commit 8a0fc3a29fc231 would result in pending VIRQ/VFIQ possibly being lost when the hypervisor wrote to HCR. As a preliminary to implementing the HCR.VI/VF feature properly, revert the broken one entirely. Backports commit c624ea0fa7ffc9e2cc3e2b36c92b5c960954489f from qemu	2018-11-16 21:46:29 -05:00
Alex Bennée	5b4c46eeab	arm: fix aa64_generate_debug_exceptions to work with EL2 The test was incomplete and incorrectly caused debug exceptions to be generated when returning to EL2 after a failed attempt to single-step an EL1 instruction. Fix this while cleaning up the function a little. Backports commit 22af90255ec2100a44cbbb7f0460ba15eed79538 from qemu	2018-11-16 21:43:41 -05:00
Alex Bennée	2a3edfce37	arm: use symbolic MDCR_TDE in arm_debug_target_el We already have this symbol defined so lets use it. Backports commit b281ba4223c302c41ff8104152bfdd44de4fe798 from qemu	2018-11-16 21:42:30 -05:00
Richard Henderson	295dd866b4	target/arm: Fix typo in tlbi_aa64_vmalle1_write This would cause an infinite recursion or loop. Backports commit 09a86dfa3f07904792a85990b0b41939417785cd from qemu	2018-11-16 21:41:28 -05:00
Peter Maydell	3edbb8180a	target/arm: Remove workaround for small SAU regions Before we supported direct execution from MMIO regions, we implemented workarounds in commit 720424359917887c926a33d2 which let us avoid doing so, even if the SAU or MPU region was less than page-sized. Once we implemented execute-from-MMIO, we removed part of those workarounds in commit d4b6275df320cee76; but we forgot the one in get_phys_addr_pmsav8() which suppressed use of small SAU regions in executable regions. Remove that workaround now. Backports commit 521ed6b4015ba39a2e39c65a94643f3e6412edc4 from qemu	2018-11-16 21:40:10 -05:00
Peter Maydell	680c79956d	target/arm: Allow execution from small regions Now that we have full support for small regions, including execution, we can remove the workarounds where we marked all small regions as non-executable for the M-profile MPU and SAU. Backports commit d4b6275df320cee764d56b194b1898547f545857 from qemu	2018-11-16 21:39:33 -05:00
Peter Maydell	f694264212	target/arm: Remove antique TODO comment Remove a TODO comment about implementing the vectored interrupt controller. We have had an implementation of that for a decade; it's in hw/intc/pl190.c. Backports commit e24ad484909e7a00ca4f6332f3698facf0ba3394 from qemu	2018-11-16 18:24:49 -05:00
Peter Maydell	84d5a60c15	target/arm: Fix ATS1Hx instructions ATS1HR and ATS1HW (which allow AArch32 EL2 to do address translations on the EL2 translation regime) were implemented in commit 14db7fe09a2c8. However, we got them wrong: these should do stage 1 address translations as defined for NS-EL2, which is ARMMMUIdx_S1E2. We were incorrectly making them perform stage 2 translations. A few years later in commit 1313e2d7e2cd we forgot entirely that we'd implemented ATS1Hx, and added a comment that ATS1Hx were "not supported yet". Remove the comment; there is no extra code needed to handle these operations in do_ats_write(), because arm_s1_regime_using_lpae_format() returns true for ARMMMUIdx_S1E2, which forces 64-bit PAR format. Backports commit 23463e0e4aeb2f0a9c60549a2c163f4adc0b8512 from qemu	2018-11-11 08:39:19 -05:00
Peter Maydell	a445db5c02	target/arm: Set S and PTW in 64-bit PAR format In do_ats_write() we construct a PAR value based on the result of the translation. A comment says "S2WLK and FSTAGE are always zero, because we don't implement virtualization". Since we do in fact now implement virtualization, add the missing code that sets these bits based on the reported ARMMMUFaultInfo. (These bits are named PTW and S in ARMv8, so we follow that convention in the new comments in this patch.) Backports commit 0f7b791b35f24cb1333f779705a3f6472e6935de from qemu	2018-11-11 08:38:23 -05:00
Peter Maydell	5aa5ebbcc9	target/arm: Remove can't-happen if() from handle_vec_simd_shli() In handle_vec_simd_shli() we have a check: if (size > 3 && !is_q) { unallocated_encoding(s); return; } However this can never be true, because we calculate int size = 32 - clz32(immh) - 1; where immh is a 4 bit field which we know cannot be all-zeroes. So the clz32() return must be in {28,29,30,31} and the resulting size is in {0,1,2,3}, and "size > 3" is never true. This unnecessary code confuses Coverity's analysis: in CID 1396476 it thinks we might later index off the end of an array because the condition implies that we might have a size > 3. Remove the code, and instead assert that the size is in [0..3], since the decode that enforces that is somewhat distant from this function. Backports commit f6c98f91f56031141a47f86225fdc30f0f9f84fb from qemu	2018-11-11 08:37:16 -05:00
Richard Henderson	0557899c8e	target/arm: Conditionalize some asserts on aarch32 support When populating id registers from kvm, on a host that doesn't support aarch32 mode at all, neither arm_div nor jazelle will be supported either. Backports commit 0f8d06f16c9d1041d728d09d464462ebe713c662 from qemu	2018-11-11 08:32:46 -05:00
Richard Henderson	4d49c004e4	decodetree: Remove insn argument from trans_* expanders This allows trans_* expanders to be shared between decoders for 32 and 16-bit insns, by not tying the expander to the size of the insn that produced it. This change requires adjusting the two existing users to match. Backports commit 3a7be5546506be62d5c6c4b804119cedf9e367d6 from qemu	2018-11-11 08:27:01 -05:00
Richard Henderson	e9fe7d3c8d	target/arm: Only flush tlb if ASID changes Since QEMU does not implement ASIDs, changes to the ASID must flush the tlb. However, if the ASID does not change there is no reason to flush. In testing a boot of the Ubuntu installer to the first menu, this reduces the number of flushes by 30%, or nearly 600k instances. Backports commit 93f379b0c43617b1361f742f261479eaed4959cb from qemu	2018-11-10 11:26:24 -05:00
Richard Henderson	15a6b3f771	target/arm: Remove writefn from TTBR0_EL3 The EL3 version of this register does not include an ASID, and so the tlb_flush performed by vmsa_ttbr_write is not needed. Backports commit f478847f1ee0df9397f561025ab2f687fd923571 from qemu	2018-11-10 11:25:39 -05:00

... 6 7 8 9 10 ...

1388 commits