unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-26 02:05:40 +00:00

Author	SHA1	Message	Date
Richard Henderson	533a3f6a6c	tcg: Fix helper function vs host abi for float16 Depending on the host abi, float16, aka uint16_t, values are passed and returned either zero-extended in the host register or with garbage at the top of the host register. The tcg code generator has so far been assuming garbage, as that matches the x86 abi, but this is incorrect for other host abis. Further, target/arm has so far been assuming zero-extended results, so that it may store the 16-bit value into a 32-bit slot with the high 16-bits already clear. Rectify both problems by mapping "f16" in the helper definition to uint32_t instead of (a typedef for) uint16_t. This forces the host compiler to assume garbage in the upper 16 bits on input and to zero-extend the result on output. Backports commit 6c2be133a7478e443c99757b833d0f265c48e0a6 from qemu	2018-06-02 10:10:12 -04:00
Peter Maydell	0f0b2e0bd8	target/arm: Honour FPCR.FZ in FRECPX The FRECPX instructions should (like most other floating point operations) honour the FPCR.FZ bit which specifies whether input denormals should be flushed to zero (or FZ16 for the half-precision version). We forgot to implement this, which doesn't affect the results (since the calculation doesn't actually care about the mantissa bits) but did mean we were failing to set the FPSR.IDC bit. Backports commit 2cfbf36ec07f7cac1aabb3b86f1c95c8a55424ba from qemu	2018-06-02 10:02:57 -04:00
Richard Henderson	1b6cac4e7e	target/arm: Remove floatX_maybe_silence_nan from conversions This is now handled properly by the generic softfloat code. Backports commit a9d173dc603af74102c24c1c92d479ba580bbf07 from qemu	2018-05-19 23:23:09 -04:00
Richard Henderson	5e532f6d20	target/arm: Use floatX_silence_nan when we have already checked for SNaN Backports commit d7ecc062c4e264f716ed239df931f52adb340508 from qemu	2018-05-19 23:21:28 -04:00
Alex Bennée	80074e4745	target/arm: Implement FCMP for fp16 These where missed out from the rest of the half-precision work. Backports commit 7a1929256ea1a03df12625e75ed571c60dca5bfb from qemu	2018-05-15 22:24:39 -04:00
Richard Henderson	688d0fd0ed	target/arm: Implement CAS and CASP Backports commit 44ac14b06fa33f60982923b6b8a3bf8dd2fea61d from qemu	2018-05-14 08:28:45 -04:00
Lioncash	9a0632bfcf	target/arm/helper64: Correct bad merge	2018-03-12 11:37:27 -04:00
Alex Bennée	fdb07713e6	arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16 Backports commit b96a54c7e5576bd35b7d00d37b7929d2892d8cac from qemu	2018-03-08 21:57:35 -05:00
Alex Bennée	6102a61b14	arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16 We go with the localised helper. Backports commit 986950283837f697b35782b9ac3bc99fca614640 from qemu	2018-03-08 19:15:23 -05:00
Alex Bennée	39a68548d1	arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16 This covers all the floating point convert operations. Backports commit 2df581304193d70eaf0d22cf4cb4613f74b6e59b from qemu	2018-03-08 18:25:29 -05:00
Alex Bennée	d5f002b39a	arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16 This adds the full range of half-precision floating point to integral instructions. Backports commit 6109aea2d954891027acba64a13f1f1c7463cfac from qemu	2018-03-08 18:21:58 -05:00
Alex Bennée	82ffaab7de	arm/translate-a64: add FP16 x2 ops for simd_indexed A bunch of the vectorised bitwise operations just operate on larger chunks at a time. We can do the same for the new half-precision operations by introducing some TWOHALFOP helpers which work on each half of a pair of half-precision operations at once. Hopefully all this hoop jumping will get simpler once we have generically vectorised helpers here. Backports commit 6089030c7322d8f96b54fb9904e53b0f464bb8fe from qemu	2018-03-08 18:08:39 -05:00
Alex Bennée	4b2577537b	arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16 As some of the constants here will also be needed elsewhere (specifically for the upcoming SVE support) we move them out to softfloat.h. Backports commit 026e2d6ef74000afb9049f46add4b94f594c8fb3 from qemu	2018-03-08 15:47:34 -05:00
Alex Bennée	a02b9b81a9	arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16 Backports commit 2deb992b767d28035fac3b374c7730494ff0b43d from qemu Also backports the fp16 changes introduced in commit f566c0474a9b9bbd9ed248607e4007e24d3358c0	2018-03-08 15:42:48 -05:00
Alex Bennée	ba8df54753	arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16 These use the generic float16_compare functionality which in turn uses the common float_compare code from the softfloat re-factor. Backports commit d32adeae1a71a8e71374fa48d3d6ab0ad4c23e94 from qemu	2018-03-08 12:59:37 -05:00
Alex Bennée	4a6a41d2c5	arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16 The fprintf is only there for debugging as the skeleton is added to, it will be removed once the skeleton is complete. Backports commit 372087348d561e7f4051d7b32609bda417092ddf from qemu	2018-03-08 12:56:15 -05:00
Alex Bennée	af75074fe7	arm/translate-a64: implement half-precision F(MIN\|MAX)(V\|NMV) This implements the half-precision variants of the across vector reduction operations. This involves a re-factor of the reduction code which more closely matches the ARM ARM order (and handles 8 element reductions). Backports commit 807cdd504283c11addcd7ea95ba594bbddc86fe4 from qemu	2018-03-08 12:49:30 -05:00
Alex Bennée	0eee5afd0e	target/*/cpu.h: remove softfloat.h As cpu.h is another typically widely included file which doesn't need full access to the softfloat API we can remove the includes from here as well. Where they do need types it's typically for float_status and the rounding modes so we move that to softfloat-types.h as well. As a result of not having softfloat in every cpu.h call we now need to add it to various helpers that do need the full softfloat.h definitions. Backports commit 24f91e81b65fcdd0552d1f0fcb0ea7cfe3829c19 from qemu	2018-03-08 09:58:47 -05:00
Michael Weiser	5fabebabee	target/arm: Fix stlxp for aarch64_be ldxp loads two consecutive doublewords from memory regardless of CPU endianness. On store, stlxp currently assumes to work with a 128bit value and consequently switches order in big-endian mode. With this change it packs the doublewords in reverse order in anticipation of the 128bit big-endian store operation interposing them so they end up in memory in the right order. This makes it work for both MTTCG and !MTTCG. It effectively implements the ARM ARM STLXP operation pseudo-code: data = if BigEndian() then el1:el2 else el2:el1; With this change an aarch64_be Linux 4.14.4 kernel succeeds to boot up in system emulation mode. Backports commit 0785557f8811133bd69be02aeccf018d47a26373 from qemu	2018-03-06 08:48:12 -05:00
Richard Henderson	a58eb310eb	target/arm: Use helper_retaddr in stxp helpers We use raw memory primitives along the !parallel_cpus paths in order to simplify the endianness handling. Because of that, we did not benefit from the generic changes to cpu_ldst_user_only_template.h. The simplest fix is to manipulate helper_retaddr here. Backports commit 3bdb5fcc9a08a9a47ce30c4e0c2d64c95190b49d from qemu	2018-03-05 13:48:28 -05:00
Thomas Huth	b2f1326437	Move target-* CPU file into a target/ folder We've currently got 18 architectures in QEMU, and thus 18 target-xxx folders in the root folder of the QEMU source tree. More architectures (e.g. RISC-V, AVR) are likely to be included soon, too, so the main folder of the QEMU sources slowly gets quite overcrowded with the target-xxx folders. To disburden the main folder a little bit, let's move the target-xxx folders into a dedicated target/ folder, so that target-xxx/ simply becomes target/xxx/ instead. Backports commit fcf5ef2ab52c621a4617ebbef36bf43b4003f4c0 from qemu	2018-03-01 22:50:58 -05:00

21 commits