unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-23 20:05:30 +00:00

Author	SHA1	Message	Date
Emilio G. Cota	baa0983ae3	target/aarch64: optimize indirect branches Measurements: [Baseline performance is that before applying this and the previous commit] - NBench, aarch64-softmmu. Host: Intel i7-4790K @ 4.00GHz 1.7x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| cross \| 1.6x +cross+jr.................................................####...................................................+-+ \| #++# \| \| # # \| 1.5x +-+...................................................****..#...................................................+-+ \| +++* # \| \| * * # \| 1.4x +-+........................................................#...................................................+-+ \| * * # \| \| ##### * * # \| 1.3x +-+................................***+++#................#...................................................+-+ \| ++* # * * # \| \| * * # * * # \| 1.2x +-+.....................................#................#...................................................+-+ \| * * # * * # \| \| #### * * # * * # \| 1.1x +-+.......................+++#..#.......#................#...................................................+-+ \| **** # * * # * * # ***#### \| \| * # * * # * * # **### +++#### *### * # \| 1x +-++-++++++-++++***###++-++++#+++++-+#++**++++++++++#++++-+#++**++#++*###-++++-+#+++-+++#+-++-+ \| ***### * # * * # * * # ++### * * # * * # * * # * ++# * # * * # \| \| * ++# * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.9x +-+---***###--###---###--####--###--*###--###--*###--###---###--####---+-+ ASSIGNMENT BITFIELD FOURFP EMULATION HUFFMAN LU DECOMPOSITIONNEURAL NUMERIC SORSTRING SORT hmean png: http://imgur.com/qO9ubtk NB. cross here represents the previous commit. - SPECint06 (test set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz 1.5x +-+--------------------------------------------------------------------------------------------------------------+-+ \| *** \| \| +++ jr \| \| * * \| 1.4x +-+.............................................................................................+++............+-+ \| * * \| \| \| ***** * * \| \| \| * * * * ***** \| 1.3x +-+...........................................................................................\|............+-+ \| +++ * * * * * \| * \| \| ***** * * * * +++ \| \| * * * * * * * * \| 1.2x +-+...............................................................................****..................+-+ \| **** * * * * * * * * * * +++ \| \| * * * * * * * * * * * * ***** \| \| * * * * ***** * * * * * * * * * * \| 1.1x +-+....................................................................+++.......................+-+ \| * * * * * * * * * * ***** * * * * * * \| \| * * * * * * * * ***** * * * * * * * * * * \| \| * * ***** * * * * * * * * ****** * * * * * * * * * * \| 1x +-++-++++-++++++++++-++++-+++++-++++++++++-++++-++****+++++-+++++-++++-++++++++++-++++-++-+ \| * * * * * * * * * * * * * +++ * * * * * * * * * * \| \| * * * * * * * * * * * * * * * * * * * * * * * * * * \| \| * * * * * * * * * * * * * * * * * * * * * * * * * * \| 0.9x +-+---***---*----*---*---*---*---**---*---*---*---*----*---*---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/3Dp4vvq - SPECint06 (train set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz 1.7x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| jr \| 1.6x +-+...............................................................................................+++............+-+ \| *** \| \| +++ \| \| * * \| 1.5x +-+............................................................................................................+-+ \| +++ * * \| \| ***** * * \| 1.4x +-+.....................................................................+++..................................+-+ \| * * * * \| \| ***** * * * * \| \| * * * * ***** * * \| 1.3x +-+......................................................................................................+-+ \| +++ * * * * * * * * \| \| ***** * * * * * * ***** * * \| 1.2x +-+.............................................................................+++..........****...+-+ \| * * * * * * * * * * * +++ \| \| ***** * * ***** * * * * * * * * * * * * \| \| * * * * +++ * * * * * * * * * * * * \| 1.1x +-+............................................................................................+-+ \| * * ***** * * * * * * ***** * * * * * * * * * * \| \| * * * * * * * * * * +++ ****** +++ * * * * * * * * * * \| 1x +-+---***---*----*---*---*---*---**---*---*---*---*----*---***---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/vRrdc9j Backports commit e75449a346bf558296966a44277bfd93412c6da6 from qemu	2018-03-03 14:22:12 -05:00
Emilio G. Cota	83ea5b72f2	target/aarch64: optimize cross-page direct jumps in softmmu Perf numbers in next commit's log. Backports commit e78722368c721f3c5b8109ed525adac1653ae97b from qemu	2018-03-03 14:20:55 -05:00
Aurelien Jarno	0e9d3d1943	tcg/mips: implement goto_ptr Backports commit 5786e0683c4f8170dd05a550814b8809d8ae6d86 from qemu	2018-03-03 14:19:46 -05:00
Richard Henderson	1d6c4f1a42	tcg/arm: Implement goto_ptr Backports commit 085c648bef7301eabe7d4a3301c8d012ae4423b8 from qemu	2018-03-03 14:18:41 -05:00
Richard Henderson	3b02642372	tcg/arm: Clarify tcg_out_bx for arm4 host In theory this would re-enable usage of QEMU on an armv4 host. Whether this is worthwhile is debatable -- we've been unconditionally issuing the armv5t BX instruction in the prologue since 2011 without complaint. Possibly we should simply require an armv6 host. Backports commit 702a947484eb3e615183dafc93de590ab0679f60 from qemu	2018-03-03 14:17:13 -05:00
Richard Henderson	d496bb6150	tcg/s390: Implement goto_ptr Backports commit 46644483cae978c734460131bb1d9071f813b287 from qemu	2018-03-03 14:16:03 -05:00
Richard Henderson	f0420c3427	tcg/sparc: Implement goto_ptr Backports commit 38f81dc5938fb7025531c5ed602afd41fef799a7 from qemu	2018-03-03 14:14:32 -05:00
Richard Henderson	81f1aae572	tcg/aarch64: Implement goto_ptr Measurements: SPECint06 (test set), x86_64-linux-user. Host: APM 64-bit ARMv8 (Atlas/A57) @ 2.4 GHz 1.45x +-+-------------------------------------------------------------------------------------------------------------+-+ \| ***** \| \| +++ * * +goto-ptr \| 1.4x +-+...****...................................................................................................+-+ \| +++* * * +++ \| 1.35x +-+................................................................****....................................+-+ \| * * * +++ \| \| * * * * * * \| 1.3x +-+.......................................................................................................+-+ \| * * * * * * \| \| * * * * * * ***** \| 1.25x +-+.................****.........................................................***.................+-+ \| * * * * * * * +++ * * \| 1.2x +-+.................................................................................................+-+ \| * * * * * * * * * * * * \| \| * * * * * * * * * * * * ***** \| 1.15x +-+...............................................................................................+-+ \| * * * * * * * * +++ * * * * * * \| \| * * * * * * * * ***** * * * * * * \| 1.1x +-+........................****.........***..................................................+-+ \| * * * * * * * * * * * * * * * * * * * \| 1.05x +-+.........................................................................................+-+ \| * * ***** * * * * * * * * * * * * * * * * * * \| \| * * * * * * * * * * * * *** *** * * * * * * * * * * \| 1x +-+---***---*---*----*---*---*---*---*---*---*----*---*---***---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjenxalancbmk hmean png: http://imgur.com/en9HE8L Backports commit b19f0c2e7d344d4d62daf554951acdb6c94a34b0 from qemu	2018-03-03 14:13:09 -05:00
Emilio G. Cota	7d0440dec4	tb-hash: improve tb_jmp_cache hash function in user mode Optimizations to cross-page chaining and indirect branches make performance more sensitive to the hit rate of tb_jmp_cache. The constraint of reserving some bits for the page number lowers the achievable quality of the hashing function. However, user-mode does not have this requirement. Thus, with this change we use for user-mode a hashing function that is both faster and of better quality than the previous one. Measurements: Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0. - SPECint06 (test set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz 2.2x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| jr \| 2x +jr+multhash +....................................................+++++...................................+-+ \| jr+hash \|$$$ \| \| \|$+$ \| \| ### $ \| 1.8x +-+......................................................................#\|#.$...................................+-+ \| ++#+# $ \| \| \|# # $ \| 1.6x +-+....................................................................**.#.$....................++$$$..........+-+ \| $$$ +* # $ \|$+$ \| \| ++$$$ ### $ * * # $ +++\|$ $ \| \| ++###+$ # # $ * * # $ ### **## $ \| 1.4x +-+...................+#.$.........*.#.$............................#.$...........#+#$$.++\|#.$..........+-+ \| +* # $ * * # $ * * # $ # # $ * +# $ \| \| * # $ +++++ * * # $ * * # $ *** # $ * * # $ ###$$ \| 1.2x +-+.....................#.$.**##$$...#.$............................#.$...........#.$....#.$.*+#+$..+-+ \| * # $ + # $ * * # $ +++ * * # $ ++###$$ * * # $ * * # $ * * # $ \| \| **##$$ * # $ * * # $ * * # $ **##$$ ++### * # $ *** #+$ * * # $ * * # $ * * # $ \| \| ++#+$ **##$$$ * # $ * * # $ * * # $ + # $ ++####$$ **+# * # $ * * # $ * * # $ * * # $ * * # $ \| 1x +-++-++#+$+++#-+$++-#+$+++#+$+++#+$+-+#+$+**++#+$+++#$$+++#+$+++#+$++-#+$++-+#+$+++#+$-++-+ \| * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ \| \| * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ * * # $ \| 0.8x +-+--*##$$-##$$$-##$$-##$$-##$$-##$$-###$$-##$$-##$$-##$$-##$$-##$$-##$$--+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/4UXTrEc Here I also tried the hash function suggested by Paolo ("multhash"): return ((uint64_t) (pc 2654435761) >> 32) & (TB_JMP_CACHE_SIZE - 1); As you can see it is just as good as the other new function ("hash"), which is what I ended up going with. - SPECint06 (train set), x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz 2.6x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| jr ### \| 2.4x +jr+hash...........................................................................................#.#...........+-+ \| # # \| \| # # \| 2.2x +-+................................................................................................#.#...........+-+ \| # # \| \| # # \| 2x +-+................................................................................................#.#...........+-+ \| **** # \| \| * * # \| 1.8x +-+................................................................................................#...........+-+ \| +++ * * # \| \| #### #### * * # \| 1.6x +-+......................................####.............................#..#.***..#.............#...........+-+ \| +++ #++# *** # * * # #### * * # \| \| ### # # * * # * * # # # * * # \| 1.4x +-+...................**+#..........*..#..............................#.....#....#..#.....#...........+-+ \| ++* # * * # * * # * * # *** # * * # #### \| \| * * # #### * * # * * # * * # * * # * * # **** # \| 1.2x +-+......................#..***++#.....#..............................#.....#.....#.....#......#..+-+ \| **### * # * * # * * # * * # * * # * * # * * # * * # \| \| * * # **### * # * * # * * # ***## * # * * # * * # * * # * * # \| 1x +-+--**###--###--*##--###-###--###--###--##--###-###--###--*##--###--+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/ArCbHqo - NBench, x86_64-linux-user. Host: Intel i7-6700K @ 4.00GHz 1.12x +-+-------------------------------------------------------------------------------------------------------------+-+ \| \| \| jr +++ \| 1.1x +jr+hash...........................................................####.........................................+-+ \| +++#\| # \| \| \| #++# \| 1.08x +-+................................+++................+++.+++..**..#.........................................+-+ \| \| +++ \| \| \| * # \| \| \| \| \| \| +++ # \| 1.06x +-+................................***###.............\|...\|........#.........................+++.............+-+ \| \| * \|# ***### * # \| \| \| \| ++# \| \|# * * # #### \| 1.04x +-+................................++..#............\|..\|#.......#........................#.\|#.............+-+ \| * * # ++++# * * # +++#++# \| \| * * # * * # * * # \| # # +++#### \| 1.02x +-+....................................#......+++.......#.......#.....................**..#..**++#...+-+ \| +++ * # +++ \| * * # * * # +++ \| # +++ # \| \| +++ \| +++ +++ ++++++ * * # ****### * # * * # \| +++ ++++++ ++ # * * # \| 1x +-++-+++++####++***###++++-+####+-++++#-++++-+#++++++#+++-+++#+-+++####-+***###++++++#+++-+++#+-++-+ \| ***\| # ++* \|# ****\| # * # * ++# * # * * # **** \|# * * # * * # * * # \| \| * \| \| # ++# \| ++# * # * * # * * # * * # \| ++# * * # * * # * * # \| 0.98x +-+....\|.++#......#..+++..#......#.......#......#.......#..++..#.......#......#.......#...+-+ \| +++ # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| \| * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.96x +-+---***###--###--*###--###--*###--###--*###--###--*###--###--*###---+-+ ASSIGNMENT BITFIELD FOURFP EMULATION HUFFMAN LU DECOMPOSITIONEURAL NNUMERIC SOSTRING SORT hmean png: http://imgur.com/ZXFX0hJ - NBench, arm-linux-user. Host: Intel i7-4790K @ 4.00GHz 1.3x +-+-------------------------------------------------------------------------------------------------------------+-+ \| #### \| \| jr # # +++ \| 1.25x +jr+hash.....................#..#...........................................####................................+-+ \| # # # # \| \| # # # # \| 1.2x +-+..........................#..#...........................................#..#................................+-+ \| # # # # \| \| # # # # \| 1.15x +-+..........................#..#...........................................#..#................................+-+ \| # # #### # # \| \| # # # # # # \| 1.1x +-+..........................#..#..................................#..#.....#..#................................+-+ \| # # # # # # +++ \| \| # # #### # # # # #### \| 1.05x +-+..........................#..#...............#..#.....####......#..#.....#..#.........................#..#...+-+ \| # # # # # # # # # # +++ # # \| \| +++ * # #### * # # # +++# # # ### # # \| 1x +-++-+*###++*++++++-+++#+-**++#-++++-+#+++++#++#++***++#+-++++#-+***-++++++++#++***++#+-++-+ \| * # * * \| * * # * * # * * # **** # * * # * * # * ### ++# * # \| \| * * # * ### * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.95x +-+........#.....\|#.......#......#.......#......#.......#......#.......#......#.......#...+-+ \| * * # * * \|# * * # * * # * * # * * # * * # * * # * * # * * # * * # \| \| * * # * * \|# * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.9x +-+---***###--###--*###--###--*###--###--*###--###--*###--###--***###---+-+ ASSIGNMENT BITFIELD FOURFP EMULATION HUFFMAN LU DECOMPOSITIONEURAL NNUMERIC SOSTRING SORT hmean png: http://imgur.com/FfD27ey Backports commit 6f1653180f5701c6a8f1b35b89a80b1e3260928e from qemu	2018-03-03 14:11:29 -05:00
Emilio G. Cota	2d16da435e	target/i386: optimize indirect branches Speed up indirect branches by jumping to the target if it is valid. Softmmu measurements (see later commit for user-mode numbers): Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0. - SPECint06 (test set), x86_64-softmmu (Ubuntu 16.04 guest). Host: Intel i7-4790K @ 4.00GHz 2.4x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| cross \| 2.2x +cross+jr..........................................................................+++...........................+-+ \| \| \| \| +++ \| \| 2x +-+..............................................................................\|..\|............................+-+ \| \| \| \| \| \| \| \| 1.8x +-+..............................................................................\|####...........................+-+ \| \|# \|# \| \| **** \|# \| 1.6x +-+.............................................................................\|.\|#...........................+-+ \| * \|* \|# \| \| * \|* \|# \| 1.4x +-+.......................................................................+++...\|.\|#...........................+-+ \| ++++++ #### * \|++# +++ \| \| +++ \| \| #++# ++* # +++ \| \| 1.2x +-+......................###.....####....+++............\|..\|...........***..#.....#....####...\|.###.....####..+-+ \| +++ * # # #### ### ++* # * * # #++# **\|# +++#++# \| \| *### +++ ++* # ++ # ++# # #### \| \|# +++ * * # * * # *** # \| \|# **** # \| 1x +-++-++++#++**###+++++#+++-++#+*++#++++#+-+++#-+**##++++-+#+++-+#+++++#++-++#++++++#-++-+ \| * # * * # * * # * * # * * # * * # \| \|# ++ # * * # * * # * * # * * # * * # \| \| * * # * * # * * # * * # * * # * * # +++# * * # * * # * * # * * # * * # * * # \| 0.8x +-+--**###--###--*##--###-###--###--###--##--###-###--###--*##--**###--+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/DU36YFU NB. 'cross' represents the previous commit. Backports commit b4aa297781ceddef79deb0e99da7817551fa89f8 from qemu	2018-03-03 14:10:14 -05:00
Emilio G. Cota	3895eea3b4	target/i386: optimize cross-page direct jumps in softmmu Instead of unconditionally exiting to the exec loop, use the gen_jr helper to jump to the target if it is valid. Perf impact: see next commit's log. Backports commit fe62089563ffc6a42f16ff28a6b6be34d2697766 from qemu	2018-03-03 14:08:27 -05:00
Emilio G. Cota	baa017d29b	target/i386: introduce gen_jr helper to generate lookup_and_goto_ptr This helper will be used by subsequent changes. Backports commit 1ebb1af1b8068fca36f48f738eb7146ecdf03625 from qemu	2018-03-03 14:06:05 -05:00
Emilio G. Cota	9aaad9ed27	target/arm: optimize indirect branches Speed up indirect branches by jumping to the target if it is valid. Softmmu measurements (see later commit for user-mode results): Note: baseline (i.e. speedup == 1x) is QEMU v2.9.0. - Impact on Boot time \| setup \| ARM debian jessie boot+shutdown time \| stddev \| \|--------+--------------------------------------+--------\| \| v2.9.0 \| 8.84 \| 0.07 \| \| +cross \| 8.85 \| 0.03 \| \| +jr \| 8.83 \| 0.06 \| - NBench, arm-softmmu (debian jessie guest). Host: Intel i7-4790K @ 4.00GHz 1.3x +-+-------------------------------------------------------------------------------------------------------------+-+ \| \| \| cross #### \| 1.25x +cross+jr..........................................................#++#.........................................+-+ \| #### # # \| \| +++# # # # \| \| +++ **** # # # \| 1.2x +-+...................................####................#......#..#.........................................+-+ \| **** # * * # # # #### \| \| * * # * * # # # # # \| 1.15x +-+....................................#................#......#..#.....#..#................................+-+ \| * * # * * # # # # # \| \| * * # #### * * # # # # # \| \| * * # # # * * # # # # # #### \| 1.1x +-+....................................#......#..#......#......#..#.....#..#.........................#..#...+-+ \| * * # # # * * # # # # # # # \| \| * * # # # * * # # # # # # # \| 1.05x +-+..........................####......#......#..#......#......#..#.....#..#......+++............***..#...+-+ \| *** # * * # # # * * # *** # # # +++ \| *### * # \| \| +++ # * * # # # * * # +++ # ** # **### * # * * # \| \| ****### +++#### * # * * # ***** # * * # * * # * * # * \| ++# * # * * # \| 1x +-++-++++-+#++***++#+++-+++#+-++++#-++++-+#++++++#+++-+++#+-++++#-++++-+#++++++#+++-+++#+-++-+ \| * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| \| * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.95x +-+---***###--###--*###--###--*###--###--*###--###--*###--###--***###---+-+ ASSIGNMENT BITFIELD FOURFP EMULATION HUFFMAN LU DECOMPOSITIONEURAL NNUMERIC SOSTRING SORT hmean png: http://imgur.com/eOLmZNR NB. 'cross' represents the previous commit. Backports commit 8a6b28c7b5104263344508df0f4bce97f22cfcaf from qemu	2018-03-02 21:18:15 -05:00
Emilio G. Cota	5a42602b92	target/arm: optimize cross-page direct jumps in softmmu Instead of unconditionally exiting to the exec loop, use the lookup_and_goto_ptr helper to jump to the target if it is valid. Perf impact: see next commit's log. Backports commit 7ad55b4ffd982c80f26f7f3658138d94cdc678e8 from qemu	2018-03-02 21:09:44 -05:00
Emilio G. Cota	e4dfb7f807	tcg/i386: implement goto_ptr Backports commit 5cb4ef80f65252dd85b86fa7f3c985015423d670 from qemu	2018-03-02 21:08:38 -05:00
Emilio G. Cota	8f4f15e5f5	tcg: Introduce goto_ptr opcode and tcg_gen_lookup_and_goto_ptr Instead of exporting goto_ptr directly to TCG frontends, export tcg_gen_lookup_and_goto_ptr(), which calls goto_ptr with the pointer returned by the lookup_tb_ptr() helper. This is the only use case we have for goto_ptr and lookup_tb_ptr, so having this function is very convenient. Furthermore, it trivially allows us to avoid calling the lookup helper if goto_ptr is not implemented by the backend. Backports commit cedbcb01529cb6cf9a2289cdbebbc63f6149fc18 from qemu	2018-03-02 21:05:18 -05:00
Richard Henderson	23d8f5fba2	qemu/atomic: Loosen restrictions for 64-bit ILP32 hosts We need to coordinate with the TCG_OVERSIZED_GUEST test in cputlb.c, and allow 64-bit atomics even though sizeof(void *) == 4. Backports commit 374aae653499f4d405caf32b7fff0c8639113fe4 from qemu	2018-03-02 20:06:39 -05:00
Luc MICHEL	393019de26	target/arm: add data cache invalidation cp15 instruction to cortex-r5 The cp15, CRn=15, opc1=0, CRm=5, opc2=0 instruction invalidates all the data cache on the cortex-r5. Implementing it as a NOP. Backports commit 95e9a242e2a393c7d4e5cc04340e39c3a9420f03 from qemu	2018-03-02 20:04:20 -05:00
Peter Maydell	565626ca63	armv7m: Raise correct kind of UsageFault for attempts to execute ARM code M profile doesn't implement ARM, and the architecturally required behaviour for attempts to execute with the Thumb bit clear is to generate a UsageFault with the CFSR INVSTATE bit set. We were incorrectly implementing this as generating an UNDEFINSTR UsageFault; fix this. Backports commit e13886e3a790b52f0b2e93cb5e84fdc2ada5471a from qemu	2018-03-02 20:00:58 -05:00
Peter Maydell	fbfeca93b3	armv7m: Check exception return consistency Implement the exception return consistency checks described in the v7M pseudocode ExceptionReturn(). Inspired by a patch from Michael Davidsaver's series, but this is a reimplementation from scratch based on the ARM ARM pseudocode. Backports commit aa488fe3bb5460c6675800ccd80f6dccbbd70159 from qemu	2018-03-02 19:59:18 -05:00
Peter Maydell	0736054d6d	armv7m: Extract "exception taken" code into functions Extract the code from the tail end of arm_v7m_do_interrupt() which enters the exception handler into a pair of utility functions v7m_exception_taken() and v7m_push_stack(), which correspond roughly to the pseudocode PushStack() and ExceptionTaken(). This also requires us to move the arm_v7m_load_vector() utility routine up so we can call it. Handling illegal exception returns has some cases where we want to take a UsageFault either on an existing stack frame or with a new stack frame but with a specific LR value, so we want to be able to call these without having to go via arm_v7m_cpu_do_interrupt(). Backports commit 39ae2474e337247e5930e8be783b689adc9f6215 from qemu	2018-03-02 19:54:46 -05:00
Michael Davidsaver	5b9f53bd27	armv7m: Simpler and faster exception start All the places in armv7m_cpu_do_interrupt() which pend an exception in the NVIC are doing so for synchronous exceptions. We know that we will always take some exception in this case, so we can just acknowledge it immediately, rather than returning and then immediately being called again because the NVIC has raised its outbound IRQ line. Backports commit a25dc805e2e63a55029e787a52335e12dabf07dc from qemu	2018-03-02 19:52:01 -05:00
Peter Maydell	43ba76cb28	armv7m: Fix condition check for taking exceptions The M profile condition for when we can take a pending exception or interrupt is not the same as that for A/R profile. The code originally copied from the A/R profile version of the cpu_exec_interrupt function only worked by chance for the very simple case of exceptions being masked by PRIMASK. Replace it with a call to a function in the NVIC code that correctly compares the priority of the pending exception against the current execution priority of the CPU. Backports commit 7ecdaa4a9635f1ded0dfa9218c25273b6d4dcd44 from qemu	2018-03-02 19:50:05 -05:00
Peter Maydell	5470bd1763	armv7m: Remove unused armv7m_nvic_acknowledge_irq() return value Having armv7m_nvic_acknowledge_irq() return the new value of env->v7m.exception and its one caller assign the return value back to env->v7m.exception is pointless. Just make the return type void instead. Backports commit a5d8235545e98c1ce02560d5f4f57552d937efe9 from qemu	2018-03-02 19:36:07 -05:00
Peter Maydell	50c956db7e	arm: Implement HFNMIENA support for M profile MPU Implement HFNMIENA support for the M profile MPU. This bit controls whether the MPU is treated as enabled when executing at execution priorities of less than zero (in NMI, HardFault or with the FAULTMASK bit set). Doing this requires us to use a different MMU index for "running at execution priority < 0", because we will have different access permissions for that case versus the normal case. Backports commit 3bef7012560a7f0ea27b265105de5090ba117514 from qemu	2018-03-02 19:33:24 -05:00
Michael Davidsaver	611a711f7b	arm: add MPU support to M profile CPUs The M series MPU is almost the same as the already implemented R profile MPU (v7 PMSA). So all we need to implement here is the MPU register interface in the system register space. This implementation has the same restriction as the R profile MPU that it doesn't permit regions to be sized down smaller than 1K. We also do not yet implement support for MPU_CTRL.HFNMIENA; this bit should if zero disable use of the MPU when running HardFault, NMI or with FAULTMASK set to 1 (ie at an execution priority of less than zero) -- if the MPU is enabled we don't treat these cases any differently. Backports commit 29c483a506070e8f554c77d22686f405e30b9114 from qemu	2018-03-02 19:30:20 -05:00
Michael Davidsaver	09d69209a0	armv7m: Classify faults as MemManage or BusFault General logic is that operations stopped by the MPU are MemManage, and those which go through the MPU and are caught by the unassigned handle are BusFault. Distinguish these by looking at the exception.fsr values, and set the CFSR bits and (if appropriate) fill in the BFAR or MMFAR with the exception address. Backports commit 5dd0641d234e355597be62e5279d8a519c831625 from qemu	2018-03-02 19:28:21 -05:00
Peter Maydell	9bc3050c51	arm: All M profile cores are PMSA All M profile CPUs are PMSA, so set the feature bit. (We haven't actually implemented the M profile MPU register interface yet, but setting this feature bit gives us closer to correct behaviour for the MPU-disabled case.) Backports commit 790a11503cfb5e1dcd031ea2212bbebae4ca3cec from qemu	2018-03-02 19:26:41 -05:00
Michael Davidsaver	4d8ae4a2b2	armv7m: Implement M profile default memory map Add support for the M profile default memory map which is used if the MPU is not present or disabled. The main differences in behaviour from implementing this correctly are that we set the PAGE_EXEC attribute on the right regions of memory, such that device regions are not executable. Backports commit 3a00d560bcfca7ad04327062c1986a016c104b1f from qemu	2018-03-02 19:25:02 -05:00
Michael Davidsaver	7c845dabe8	armv7m: Improve "-d mmu" tracing for PMSAv7 MPU Improve the "-d mmu" tracing for the PMSAv7 MPU translation process as an aid in debugging guest MPU configurations: * fix a missing newline for a guest-error log * report the region number with guest-error or unimp logs of bad region register values * add a log message for the overall result of the lookup * print "0x" prefix for hex values Backports commit c9f9f1246d630960bce45881e9c0d27b55be71e2 from qemu	2018-03-02 19:17:05 -05:00
Peter Maydell	bfe99e9a0b	arm: Remove unnecessary check on cpu->pmsav7_dregion Now that we enforce both: * pmsav7_dregion == 0 implies has_mpu == false * PMSA with has_mpu == false means SCTLR.M cannot be set we can remove a check on pmsav7_dregion from get_phys_addr_pmsav7(), because we can only reach this code path if the MPU is enabled (and so region_translation_disabled() returned false). Backports commit e9235c6983b261e04e897e8ff900b2b7a391e644 from qemu	2018-03-02 19:14:50 -05:00
Peter Maydell	349227bb05	arm: Don't let no-MPU PMSA cores write to SCTLR.M If the CPU is a PMSA config with no MPU implemented, then the SCTLR.M bit should be RAZ/WI, so that the guest can never turn on the non-existent MPU. Backports commit 06312febfb2d35367006ef23608ddd6a131214d4 from qemu	2018-03-02 19:13:37 -05:00
Peter Maydell	e564ed6311	arm: Don't clear ARM_FEATURE_PMSA for no-mpu configs Fix the handling of QOM properties for PMSA CPUs with no MPU: Allow no-MPU to be specified by either: * has-mpu = false * pmsav7_dregion = 0 and make setting one imply the other. Don't clear the PMSA feature bit in this situation. Backports commit f50cd31413d8bc9d1eef8edd1f878324543bf65d from qemu	2018-03-02 19:12:20 -05:00
Peter Maydell	6614ba9615	arm: Clean up handling of no-MPU PMSA CPUs ARM CPUs come in two flavours: * proper MMU ("VMSA") * only an MPU ("PMSA") For PMSA, the MPU may be implemented, or not (in which case there is default "always acts the same" behaviour, but it isn't guest programmable). QEMU is a bit confused about how we indicate this: we have an ARM_FEATURE_MPU, but it's not clear whether this indicates "PMSA, not VMSA" or "PMSA and MPU present" , and sometimes we use it for one purpose and sometimes the other. Currently trying to implement a PMSA-without-MPU core won't work correctly because we turn off the ARM_FEATURE_MPU bit and then a lot of things which should still exist get turned off too. As the first step in cleaning this up, rename the feature bit to ARM_FEATURE_PMSA, which indicates a PMSA CPU (with or without MPU). Backports commit 452a095526a0537f16c271516a2200877a272ea8 from qemu	2018-03-02 19:05:31 -05:00
Peter Maydell	b50d2da03c	arm: Use different ARMMMUIdx values for M profile Make M profile use completely separate ARMMMUIdx values from those that A profile CPUs use. This is a prelude to adding support for the MPU and for v8M, which together will require 6 MMU indexes which don't map cleanly onto the A profile uses: non secure User non secure Privileged non secure Privileged, execution priority < 0 secure User secure Privileged secure Privileged, execution priority < 0 Backports commit e7b921c2d9efc249f99b9feb0e7dca82c96aa5c4 from qemu	2018-03-02 19:01:42 -05:00
Michael Davidsaver	f532e80749	armv7m: Escalate exceptions to HardFault if necessary The v7M exception architecture requires that if a synchronous exception cannot be taken immediately (because it is disabled or at too low a priority) then it should be escalated to HardFault (and the HardFault exception is then taken). Implement this escalation logic. Backports commit a73c98e159d18155445d29b6044be6ad49fd802f from qemu	2018-03-02 18:59:13 -05:00
Peter Maydell	b7bf752d3c	arm: Add support for M profile CPUs having different MMU index semantics The M profile CPU's MPU has an awkward corner case which we would like to implement with a different MMU index. We can avoid having to bump the number of MMU modes ARM uses, because some of our existing MMU indexes are only used by non-M-profile CPUs, so we can borrow one. To avoid that getting too confusing, clean up the code to try to keep the two meanings of the index separate. Instead of ARMMMUIdx enum values being identical to core QEMU MMU index values, they are now the core index values with some high bits set. Any particular CPU always uses the same high bits (so eventually A profile cores and M profile cores will use different bits). New functions arm_to_core_mmu_idx() and core_to_arm_mmu_idx() convert between the two. In general core index values are stored in 'int' types, and ARM values are stored in ARMMMUIdx types. Backports commit 8bd5c82030b2cb09d3eef6b444f1620911cc9fc5 from qemu	2018-03-02 18:59:13 -05:00
Wei Huang	19335c32c9	target/arm: clear PMUVER field of AA64DFR0 when vPMU=off The PMUv3 driver of linux kernel (in arch/arm64/kernel/perf_event.c) relies on the PMUVER field of id_aa64dfr0_el1 to decide if PMU support is present or not. This patch clears the PMUVER field under TCG mode when vPMU=off. Without it, PMUv3 will init insider guest VMs even with vPMU=off. This patch also removes a redundant line inside the if-statement. Backports commit 2b3ffa929249b15a75d8bde3e8e57a744f52aff0 from qemu	2018-03-02 18:59:12 -05:00
Peter Maydell	4789e49c4d	arm: Use the mmu_idx we're passed in arm_cpu_do_unaligned_access() When identifying the DFSR format for an alignment fault, use the mmu index that we are passed, rather than calling cpu_mmu_index() to get the mmu index for the current CPU state. This doesn't actually make any difference since the only cases where the current MMU index differs from the index used for the load are the "unprivileged load/store" instructions, and in that case the mmu index may differ but the translation regime is the same (apart from the "use from Hyp mode" case which is UNPREDICTABLE). However it's the more logical thing to do. Backports commit e517d95b63427fae9f03958dbc005c36b4ebf2cf from qemu	2018-03-02 18:59:12 -05:00
Peter Xu	fce1b469e5	memory: tune last param of iommu_ops.translate() This patch converts the old "is_write" bool into IOMMUAccessFlags. The difference is that "is_write" can only express either read/write, but sometimes what we really want is "none" here (neither read nor write). Replay is an good example - during replay, we should not check any RW permission bits since thats not an actual IO at all. Backports commit bf55b7afce53718ef96f4e6616da62c0ccac37dd from qemu	2018-03-02 18:59:12 -05:00
Peter Xu	5621c7e09f	exec: abstract address_space_do_translate() This function is an abstraction helper for address_space_translate() and address_space_get_iotlb_entry(). It does the lookup of address into memory region section, then does proper IOMMU translation if necessary. Refactor the two existing functions to use it. This fixes vhost when IOMMU is disabled by guest. Backports commit a764040cc831cfe5b8bf1c80e8341b9bf2de3ce8 from qemu	2018-03-02 18:59:12 -05:00
Nikunj A Dadhania	d907423bac	cputlb: handle first atomic write to the page In case where the conditional write is the first write to the page, TLB_NOTDIRTY will be set and stop_the_world is triggered. Handle this as a special case and set the dirty bit. After that fall through to the actual atomic instruction below. Backports commit 7f9af1abdcc69fd1d3d8d2be68464329600616d6 from qemu	2018-03-02 18:59:12 -05:00
Aurelien Jarno	00ebbae128	tcg/mips: fix field extraction opcode The "msb" argument should correspond to (len - 1). Backports commit 2f5a5f5774d95baacf86c03aa8a77a2d0390f2b2 from qemu	2018-03-02 18:59:12 -05:00
Richard Henderson	69116abafc	tcg: Initialize return value after exit_atomic Users of tcg_gen_atomic_cmpxchg and do_atomic_op rightfully utilize the output. Even though this code is dead, it gets translated, and without the initialization we encounter a tcg_error. Backports commit 79b1af906245558c30e0a5faf26cb52b63f83cce from qemu	2018-03-02 18:59:11 -05:00
Gerd Hoffmann	108354cc4a	bitmap: add bitmap_copy_and_clear_atomic Backports commit d6eb1413920affb7be3df9982682dd183a805dd7 from qemu	2018-03-02 18:59:11 -05:00
Peter Maydell	b8b70dfcd2	Drop QEMU_GNUC_PREREQ() checks for gcc older than 4.1 We already require gcc 4.1 or newer (for the atomic support), so the fallback codepaths for older gcc versions than that are now dead code and we can just delete them. NB: clang reports itself as gcc 4.2 (regardless of clang version), so clang won't be using the fallbacks either. Backports commit fa54abb8c298f892639ffc4bc2f61448ac3be4a1 from qemu	2018-03-02 18:59:05 -05:00
Peter Maydell	2935a9af7a	arm: Remove workarounds for old M-profile exception return implementation Now that we've rewritten M-profile exception return so that the magic PC values are not visible to other parts of QEMU, we can delete the special casing of them elsewhere. Backports commit f4e8e4edda875cab9df91dc4ae9767f7cb1f50aa from qemu	2018-03-02 15:02:14 -05:00
Peter Maydell	44bf8985e5	arm: Implement M profile exception return properly On M profile, return from exceptions happen when code in Handler mode executes one of the following function call return instructions: * POP or LDM which loads the PC * LDR to PC * BX register and the new PC value is 0xFFxxxxxx. QEMU tries to implement this by not treating the instruction specially but then catching the attempt to execute from the magic address value. This is not ideal, because: * there are guest visible differences from the architecturally specified behaviour (for instance jumping to 0xFFxxxxxx via a different instruction should not cause an exception return but it will in the QEMU implementation) * we have to account for it in various places (like refusing to take an interrupt if the PC is at a magic value, and making sure that the MPU doesn't deny execution at the magic value addresses) Drop these hacks, and instead implement exception return the way the architecture specifies -- by having the relevant instructions check for the magic value and raise the 'do an exception return' QEMU internal exception immediately. The effect on the generated code is minor: bx lr, old code (and new code for Thread mode): TCG: mov_i32 tmp5,r14 movi_i32 tmp6,$0xfffffffffffffffe and_i32 pc,tmp5,tmp6 movi_i32 tmp6,$0x1 and_i32 tmp5,tmp5,tmp6 st_i32 tmp5,env,$0x218 exit_tb $0x0 set_label $L0 exit_tb $0x7f2aabd61993 x86_64 generated code: 0x7f2aabe87019: mov %ebx,%ebp 0x7f2aabe8701b: and $0xfffffffffffffffe,%ebp 0x7f2aabe8701e: mov %ebp,0x3c(%r14) 0x7f2aabe87022: and $0x1,%ebx 0x7f2aabe87025: mov %ebx,0x218(%r14) 0x7f2aabe8702c: xor %eax,%eax 0x7f2aabe8702e: jmpq 0x7f2aabe7c016 bx lr, new code when in Handler mode: TCG: mov_i32 tmp5,r14 movi_i32 tmp6,$0xfffffffffffffffe and_i32 pc,tmp5,tmp6 movi_i32 tmp6,$0x1 and_i32 tmp5,tmp5,tmp6 st_i32 tmp5,env,$0x218 movi_i32 tmp5,$0xffffffffff000000 brcond_i32 pc,tmp5,geu,$L1 exit_tb $0x0 set_label $L1 movi_i32 tmp5,$0x8 call exception_internal,$0x0,$0,env,tmp5 x86_64 generated code: 0x7fe8fa1264e3: mov %ebp,%ebx 0x7fe8fa1264e5: and $0xfffffffffffffffe,%ebx 0x7fe8fa1264e8: mov %ebx,0x3c(%r14) 0x7fe8fa1264ec: and $0x1,%ebp 0x7fe8fa1264ef: mov %ebp,0x218(%r14) 0x7fe8fa1264f6: cmp $0xff000000,%ebx 0x7fe8fa1264fc: jae 0x7fe8fa126509 0x7fe8fa126502: xor %eax,%eax 0x7fe8fa126504: jmpq 0x7fe8fa122016 0x7fe8fa126509: mov %r14,%rdi 0x7fe8fa12650c: mov $0x8,%esi 0x7fe8fa126511: mov $0x56095dbeccf5,%r10 0x7fe8fa12651b: callq *%r10 which is a difference of one cmp/branch-not-taken. This will be lost in the noise of having to exit generated code and look up the next TB anyway. Backports commit 3bb8a96f5348913ee130169504f3642f501b113e from qemu	2018-03-02 14:58:14 -05:00
Peter Maydell	cfc1611d6f	arm: Track M profile handler mode state in TB flags For M profile exception-return handling we'd like to generate different code for some instructions depending on whether we are in Handler mode or Thread mode. This isn't the same as "are we privileged or user", so we need an extra bit in the TB flags to distinguish. Backports commit 064c379c99b835bdcc478d21a3849507ea07d53a from qemu	2018-03-02 14:54:16 -05:00
Peter Maydell	8233756382	arm: Move condition-failed codepath generation out of if() Move the code to generate the "condition failed" instruction codepath out of the if (singlestepping) {} else {}. This will allow adding support for handling a new is_jmp type which can't be neatly split into "singlestepping case" versus "not singlestepping case". Backports commit f021b2c4627890d82fbcc300db3bd782b37b7f8a from qemu arm: Abstract out "are we singlestepping" test to utility function We now test for "are we singlestepping" in several places and it's not a trivial check because we need to care about both architectural singlestep and QEMU gdbstub singlestep. We're also about to add another place that needs to make this check, so pull the condition out into a function. Backports commit b636649f5a2e108413dd171edaf320f781f57942 from qemu	2018-03-02 14:52:30 -05:00
Peter Maydell	43d6e73fea	arm: Move gen_set_condexec() and gen_set_pc_im() up in the file Move the utility routines gen_set_condexec() and gen_set_pc_im() up in the file, as we will want to use them from a function placed earlier in the file than their current location. Backports commit 4d5e8c969a74c86124fc2284ea603cc6dd3c5dfa from qemu	2018-03-02 14:48:36 -05:00
Peter Maydell	23141d7620	arm: Factor out "generate right kind of step exception" We currently have two places that do: if (dc->ss_active) { gen_step_complete_exception(dc); } else { gen_exception_internal(EXCP_DEBUG); } Factor this out into its own function, as we're about to add a third place that needs the same logic. Backports commit 5425415ebba5fa20558e1ef25e1997a6f5ea4c7c from qemu	2018-03-02 14:45:30 -05:00
Peter Maydell	ddfe550411	arm: Thumb shift operations should not permit interworking branches In Thumb mode, the only instructions which can cause an interworking branch by writing the PC are BLX, BX, BXJ, LDR, POP and LDM. Unlike ARM mode, data processing instructions which target the PC do not cause interworking branches. When we added support for doing interworking branches on writes to PC from data processing instructions in commit 21aeb3430ce7ba, we accidentally changed a Thumb instruction to have interworking branch behaviour for writes to PC. (MOV, MOVS register-shifted register, encoding T2; this is the standard encoding for LSL/LSR/ASR/ROR (register).) For this encoding, behaviour with Rd == R15 is specified as UNPREDICTABLE, so allowing an interworking branch is within spec, but it's confusing and differs from our handling of this class of UNPREDICTABLE for other Thumb ALU operations. Make it perform a simple (non-interworking) branch like the others. Backports commit bedb8a6b09c1754c3b9f155750c62dc087706698 from qemu	2018-03-02 14:42:40 -05:00
Peter Maydell	9f938da9e1	arm: Don't implement BXJ on M-profile CPUs For M-profile CPUs, the BXJ instruction does not exist at all, and the encoding should always UNDEF. We were accidentally implementing it to behave like A-profile BXJ; correct the error. Backports commit 9d7c59c84d4530d05e8702b1c3a31e6da00a397e from qemu	2018-03-02 14:42:04 -05:00
Peter Maydell	e9d507a193	target/arm: Add assertion about FSC format for syndrome registers In tlb_fill() we construct a syndrome register value from a fault status register value which is filled in by arm_tlb_fill(). arm_tlb_fill() returns FSR values which might be in the format used with short-format page descriptors, or the format used with long-format (LPAE) descriptors. The syndrome register always uses LPAE-format FSR status codes. It isn't actually possible to end up delivering a syndrome register value to the guest for a fault which is reported with a short-format FSR (that kind of stage 1 fault will only happen for an AArch32 translation regime which doesn't have a syndrome register, and can never be redirected to an AArch64 or Hyp exception level). Add an assertion which checks this, and adjust the code so that we construct a syndrome with an invalid status code, rather than allowing set bits in the FSR input to randomly corrupt other fields in the syndrome. Backports commit 65ed2ed90d9d81fd4b639029be850ea5651f919f from qemu	2018-03-02 14:41:07 -05:00
Peter Maydell	1cf80d7536	arm: Move excnames[] array into arm_log_exceptions() The excnames[] array is defined in internals.h because we used to use it from two different source files for handling logging of AArch32 and AArch64 exception entry. Refactoring means that it's now used only in arm_log_exception() in helper.c, so move the array into that function. Backports commit 2c4a7cc5afb1bfc1728a39abd951ddd7714c476e from qemu	2018-03-02 14:39:37 -05:00
Peter Maydell	1af1944903	target/arm: Add missing entries to excnames[] for log strings Recent changes have added new EXCP_ values to ARM but forgot to update the excnames[] array which is used to provide human-readable strings when printing information about the exception for debug logging. Add the missing entries, and add a comment to the list of #defines to help avoid the mistake being repeated in future. Backports commit 32b81e620ea562d56ab2733421b5da1082b237a2 from qemu	2018-03-02 14:38:23 -05:00
Richard Henderson	13242af398	target/arm: Fix aa64 ldp register writeback For "ldp x0, x1, [x0]", if the second load is on a second page and the second page is unmapped, the exception would be raised with x0 already modified. This means the instruction couldn't be restarted. Backports commit 2d1bbf51c2cb948da4b6fd5f91cf3ecc80b28156 from qemu	2018-03-02 14:35:46 -05:00
Paolo Bonzini	c27870520a	exec: revert MemoryRegionCache MemoryRegionCache did not know about virtio support for IOMMUs (because the two features were developed at the same time). Revert MemoryRegionCache to "normal" address_space_* operations for 2.9, as it is simpler than undoing the virtio patches. Backports commit 90c4fe5fc517a045e7a7cf2f23472e114042ca29 from qemu	2018-03-02 14:30:41 -05:00
Peter Maydell	008a235b5e	tcg/sparc: Zero extend address argument to ld/st helpers The C store helper functions take the address argument as a target_ulong type; if this is 32 bit but the host is 64 bit then the SPARC calling convention requires that the caller must zero extend the value. We weren't doing this, which meant we could pass values to the caller with high bits set and QEMU would crash if it was compiled with optimizations. In particular, the i386 BIOS would not start. Backports commit 5c32be5baf41aec4f4675d2bf24f9948756abf3c from qemu	2018-03-02 14:25:17 -05:00
Peter Maydell	40718df109	tcg/sparc: Zero extend data argument to store helpers The C store helper functions take the data argument as a uint8_t, uint16_t, etc depending on the store size. The SPARC calling convention requires that data types smaller than the register size must be extended by the caller. We weren't doing this, which meant that if QEMU was compiled with optimizations enabled we could end up storing incorrect values to guest memory. (In particular the i386 guest BIOS would crash on startup.) Add code to the trampolines that call the store helpers to do the zero extension as required. Backports commit 709a340d679d95a0c6cbb9b5f654498f04345b50 from qemu	2018-03-02 14:24:24 -05:00
Eduardo Habkost	e71c7b7819	i386: Don't override -cpu options on -cpu host/max The existing code for "host" and "max" CPU models overrides every single feature in the CPU object at realize time, even the ones that were explicitly enabled or disabled by the user using "feat=on" or "feat=off", while features set using +feat/-feat are kept. This means "-cpu host,+invtsc" works as expected, while "-cpu host,invtsc=on" doesn't. This was a known bug, already documented in a comment inside x86_cpu_expand_features(). What makes this bug worse now is that libvirt 3.0.0 and newer now use "feat=on\|off" instead of +feat/-feat when it detects a QEMU version that supports it (see libvirt commit d47db7b16dd5422c7e487c8c8ee5b181a2f9cd66). Change the feature property getter/setter to set a env->user_features field, to keep track of features that were explicitly changed using QOM properties. Then make the max_features code not override user features when handling "-cpu host" and "-cpu max". This will also allow us to remove the plus_features/minus_features hack in the future, but I plan to do that after 2.9.0 is released. Backports commit d4a606b38b5d4b3689b86cc1575908e82179ecfb from qemu	2018-03-02 14:22:45 -05:00
Pranith Kumar	31b977ab3e	tcg/i386: Check the size of instruction being translated This fixes the bug: 'user-to-root privesc inside VM via bad translation caching' reported by Jann Horn here: https://bugs.chromium.org/p/project-zero/issues/detail?id=1122 Backports commit 30663fd26c0307e414622c7a8607fbc04f92ec14 from qemu	2018-03-02 14:19:35 -05:00
Paolo Bonzini	b750ec2363	configure: remove Cygwin The Cygwin target is really compiling for native Win32 with -mno-cygwin. Except, GCC 4.7.0 has finally removed the long deprecated -mno-cygwin option, and that happened about five years ago. Let it rest in peace. Backports commit c8645752ce31cc044ecc5f969a986fdcb6aab590 from qemu	2018-03-02 14:17:41 -05:00
Yongbok Kim	ce3aecf263	target/mips: fix delay slot detection in gen_msa_branch() It is unnecessary to test R6 from delay/forbidden slot check in gen_msa_branch(). https://bugs.launchpad.net/qemu/+bug/1663287 Backports commit 075a1fe788d36b271ec25507466c30b9a90b5d54 from qemu	2018-03-02 14:15:50 -05:00
Philippe Mathieu-Daudé	d17c07b548	target-mips: replace break by goto cp0_unimplemented this fixes many warnings like: target/mips/translate.c:6253:13: warning: Value stored to 'rn' is never read rn = "invalid sel"; ^ ~~~~~~~~~~~~~ Backports commit 3570d7f6672836140f0a1ec9bf95dd5ea50a2aaa from qemu	2018-03-02 14:14:57 -05:00
Philippe Mathieu-Daudé	5f78f3cd80	target-mips: log bad coprocessor0 register accesses with LOG_UNIMP Backports commit 965447eecb6b98d6dfc4dbd97f836093c7e398a0 from qemu	2018-03-02 14:12:29 -05:00
Philippe Mathieu-Daudé	e7176e6c85	target-mips: remove old & unuseful comments Backports commit 989f2aa9af7f05c323761b66c0e299059a19b7b1 from qemu	2018-03-02 14:11:20 -05:00
Philippe Mathieu-Daudé	65c69e6ccb	target-mips: fix compiler warnings (clang 5) static code analyzer complain: target/mips/helper.c:453:5: warning: Function call argument is an uninitialized value qemu_log_mask(CPU_LOG_MMU, ^~~~~~~~~~~~~~~~~~~~~~~~~~ 'physical' and 'prot' are uninitialized if 'ret' is not TLBRET_MATCH. Backports commit def74c0cf05722b2e502d4b4f1219966c5b0cbd3 from qemu	2018-03-02 14:09:55 -05:00
Peter Maydell	78303d4c1b	arm: Fix APSR writes via M profile MSR Our implementation of writes to the APSR for M-profile via the MSR instruction was badly broken. First and worst, we had the sense wrong on the test of bit 2 of the SYSm field -- this is supposed to request an APSR write if bit 2 is 0 but we were doing it if bit 2 was 1. This bug was introduced in commit 58117c9bb429cd, so hasn't been in a QEMU release. Secondly, the choice of exactly which parts of APSR should be written is defined by bits in the 'mask' field. We were not passing these through from instruction decode, making it impossible to check them in the helper. Pass the mask bits through from the instruction decode to the helper function and process them appropriately; fix the wrong sense of the SYSm bit 2 check. Invalid mask values and invalid combinations of mask and register number are UNPREDICTABLE; we choose to treat them as if the mask values were valid. Backports commit b28b3377d7e9ba35611d454d5a63ef50cab1f8c5 from qemu	2018-03-02 14:08:13 -05:00
Peter Maydell	bb5819cbbc	armv7m: R14 should reset to 0xffffffff For M profile (unlike A profile) the reset value of R14 is specified as 0xffffffff. (The rationale is that this is an illegal exception return value, so if guest code tries to return to it it will result in a helpful exception.) Registers r0 to r12 and the flags are architecturally UNKNOWN on reset, so we leave those at zero. Backports commit 056f43df9168413f304500b69c33158d66efb7cf from qemu	2018-03-02 13:56:36 -05:00
Michael Davidsaver	f42f22ec02	armv7m: FAULTMASK should be 0 on reset For M profile CPUs, FAULTMASK should be 0 on reset, like PRIMASK. QEMU stores FAULTMASK in the PSTATE F bit, so (as with PRIMASK in the I bit) we have to clear these to undo the A profile default of 1. Update the comment accordingly and move it so that it's closer to the code it's referring to. Backports commit dc7abe4d65ad39390b2db120f5ad18f8f6576f8b from qemu	2018-03-02 13:55:59 -05:00
Peter Maydell	8a6d746aef	armv7m: Report no-coprocessor faults correctly For v7M attempts to access a nonexistent coprocessor are reported differently from plain undefined instructions (as UsageFaults of type NOCP rather than type UNDEFINSTR). Split them out into a new EXCP_NOCP so we can report the FSR value correctly. Backports commit 7517748e3f71a3099e57915fba95c4c308e6d842 from qemu	2018-03-02 13:54:36 -05:00
Peter Maydell	fce8138187	armv7m: Report no-coprocessor faults correctly For v7M attempts to access a nonexistent coprocessor are reported differently from plain undefined instructions (as UsageFaults of type NOCP rather than type UNDEFINSTR). Split them out into a new EXCP_NOCP so we can report the FSR value correctly. Backports commit 7517748e3f71a3099e57915fba95c4c308e6d842 from qemu	2018-03-02 13:47:14 -05:00
Michael Davidsaver	eaa080e232	armv7m: set CFSR.UNDEFINSTR on undefined instructions When we take an exception for an undefined instruction, set the appropriate CFSR bit. Backports commit 81dd9648c69bb89afdd6f4bb3ed6f3efdac96524 from qemu	2018-03-02 13:45:56 -05:00
Michael Davidsaver	2297b8134b	armv7m: honour CCR.STACKALIGN on exception entry The CCR.STACKALIGN bit controls whether the CPU is supposed to force 8-alignment of the stack pointer on entry to the exception handler. Backports commit dc858c6633a9af8b80c1509cf6f825e4390d3ad1 from qemu	2018-03-02 13:45:06 -05:00
Peter Maydell	7870bcfcb0	armv7m: add state for v7M CCR, CFSR, HFSR, DFSR, MMFAR, BFAR Add the structure fields, VMState fields, reset code and macros for the v7M system control registers CCR, CFSR, HFSR, DFSR, MMFAR and BFAR. Backports commit 2c4da50d9477fb830d778bb5d6a11215aa359b44 from qemu	2018-03-02 13:43:55 -05:00
Michael Davidsaver	7f044bf8cc	armv7m: Clear FAULTMASK on return from non-NMI exceptions FAULTMASK must be cleared on return from all exceptions other than NMI. Backports commit a20ee6005564590d33eabec11ed4dc7c432db36b from qemu	2018-03-02 13:41:16 -05:00
Michael Davidsaver	cc9458cf59	armv7m: Explicit error for bad vector table Give an explicit error and abort when a load from the vector table fails. Architecturally this should HardFault (which will then immediately fail to load the HardFault vector and go into Lockup). Since we don't model Lockup, just report this guest error via cpu_abort(). This is more helpful than the previous behaviour of reading a zero, which is the address of the reset stack pointer and not a sensible location to jump to. Backports commit 1b9ea408fca1ce8caae67b792355b023c69c5ac5 from qemu	2018-03-02 13:38:08 -05:00
Michael Davidsaver	703489071f	armv7m: Replace armv7m.hack with unassigned_access handler For v7m we need to catch attempts to execute from special addresses at 0xfffffff0 and above. Previously we did this with the aid of a hacky special purpose lump of memory in the address space and a check in translate.c for whether we were translating code at those addresses. We can implement this more cleanly using a CPU unassigned access handler which throws the exception if the unassigned access is for one of the special addresses. Backports commit 542b3478a00cb7ef51c259255b3ab1e2a7daada2 from qemu	2018-03-02 13:33:31 -05:00
Michael Davidsaver	8828b4e595	armv7m: MRS/MSR: handle unprivileged access The MRS and MSR instruction handling has a number of flaws: * unprivileged accesses should only be able to read CONTROL and the xPSR subfields, and only write APSR (others RAZ/WI) * privileged access should not be able to write xPSR subfields other than APSR * accesses to unimplemented registers should log as guest errors, not abort QEMU Backports commit 58117c9bb429cd9552d998687aa99088eb1d8528 from qemu	2018-03-02 13:29:59 -05:00
Michael Davidsaver	2769c6ada0	armv7m: Fix reads of CONTROL register bit 1 The v7m CONTROL register bit 1 is SPSEL, which indicates the stack being used. We were storing this information not in v7m.control but in the separate v7m.other_sp structure field. Unfortunately, the code handling reads of the CONTROL register didn't take account of this, and so if SPSEL was updated by an exception entry or exit then a subsequent guest read of CONTROL would get the wrong value. Using a separate structure field doesn't really gain us anything in efficiency, so drop this unnecessary complexity in favour of simply storing all the bits in v7m.control. This is a migration compatibility break for M profile CPUs only. Backports commit abc24d86cc0364f402e438fae3acb14289b40734 from qemu	2018-03-02 13:26:38 -05:00
Peter Maydell	d8eb259032	arm: Enforce should-be-1 bits in MRS decoding The MRS instruction requires that bits [19..16] are all 1s, and for A/R profile also that bits [7..0] are all 0s. At this point in the decode tree we have checked all of the rest of the instruction but were allowing these to be any value. If these bits are not set then the result is architecturally UNPREDICTABLE, but choosing to UNDEF is more helpful to the user and avoids unexpected odd behaviour if the encodings are used for some purpose in future architecture versions. Backports commit 3d54026fb06d1aea7ebb4e9825970b06bebcacac from qemu	2018-03-02 13:09:17 -05:00
Peter Maydell	dc44eded51	arm: Don't decode MRS(banked) or MSR(banked) for M profile M profile doesn't have the MSR(banked) and MRS(banked) instructions and uses the encodings for different kinds of M-profile MRS/MSR. Guard the relevant bits of the decode logic to make sure we don't accidentally fall into them by accident on M-profile. (The bit being checked for this (bit 5) is part of the SYSm field on M-profile, but since no currently allocated system registers have encodings with bit 5 of SYSm set, this hasn't been a problem in practice.) Backports commit 43ac65742319ef5ac4461daf43316b189cd21e89 from qemu	2018-03-02 13:08:20 -05:00
Peter Maydell	cc2a6a2728	arm: HVC and SMC encodings don't exist for M profile M profile doesn't have the HVC or SMC encodings, so make them always UNDEF rather than generating calls to helper functions that assume A/R profile. Backports commit 001b3cab51ebfcb13e8dd03ea25bfa3bd0c517a3 from qemu	2018-03-02 13:07:31 -05:00
Dr. David Alan Gilbert	55d79cf4c0	RAMBlocks: qemu_ram_is_shared Provide a helper to say whether a RAMBlock was created as a shared mapping. Backports commit 463a4ac23bcf0f0b65c850fa66f5ae6e43edd243 from qemu	2018-03-02 13:05:35 -05:00
Dr. David Alan Gilbert	5dfbee8930	memory_region: Fix name comments The 'name' parameter to memory_region_init_* had been marked as debug only, however vmstate_region_ram uses it as a parameter to qemu_ram_set_idstr to set RAMBlock names and these form part of the migration stream. Backports commit e8f5fe2de125a0bfbefbaa6a69af81f4817cb7a0 from qemu	2018-03-02 13:01:23 -05:00
Andrew Jones	0139cbc2cd	target/arm/arm-powerctl: Fix psci info return values The power state spec section 5.1.5 AFFINITY_INFO defines the affinity info return values as 0 ON 1 OFF 2 ON_PENDING I grepped QEMU for power_state to ensure that no assumptions of OFF=0 were being made. Backports commit d5affb0d8677e1a8a8fe03fa25005b669e7cdc02 from qemu	2018-03-02 12:59:49 -05:00
Andrew Baumann	76cd64dd7e	target/arm: implement armv8 PMUSERENR (user-mode enable bits) In armv8, this register implements more than a single bit, with fine-grained enables for read access to event counters, cycles counters, and write access to the software increment. This change implements those checks using custom access functions for the relevant registers. Backports commit 6ecd0b6ba0591ef280ed984103924d4bdca5ac32 from qemu	2018-03-02 12:55:46 -05:00
Eduardo Habkost	9ddce7c01d	i386: Change stepping of Haswell to non-blacklisted value glibc blacklists TSX on Haswell CPUs with model==60 and stepping < 4. To make the Haswell CPU model more useful, make those guests actually use TSX by changing CPU stepping to 4. References: * glibc commit 2702856bf45c82cf8e69f2064f5aa15c0ceb6359 https://sourceware.org/git/?p=glibc.git;a=commit;h=2702856bf45c82cf8e69f2064f5aa15c0ceb6359 Backports commit ec56a4a7b07e2943f49da273a31e3195083b1f2e from qemu	2018-03-02 12:53:11 -05:00
Eduardo Habkost	f865b17639	i386: host_vendor_fms() helper function Helper function for code that needs to check the host CPU vendor/family/model/stepping values. Backports commit 20271d484069f154fb262507e63adc3a37e885d2 from qemu	2018-03-02 12:51:44 -05:00
Alex Bennée	b8caaac110	target/arm/helper: make it clear the EC field is also in hex ..just like the rest of the displayed ESR register. Otherwise people might scratch their heads if a not obviously hex number is displayed for the EC field. Backports commit 6568da459b611845ef55526cd23afc9fa9f4647f from qemu	2018-03-02 12:50:33 -05:00
Paolo Bonzini	bc7a9ccfbd	target-i386: defer VMEXIT to do_interrupt Paths through the softmmu code during code generation now need to be audited to check for double locking of tb_lock. In particular, VMEXIT can take tb_lock through cpu_vmexit -> cpu_x86_update_cr4 -> tlb_flush. To avoid this, split VMEXIT delivery in two parts, similar to what is done with exceptions. cpu_vmexit only records the VMEXIT exit code and information, and cc->do_interrupt can then deliver it when it is safe to take the lock. Backports commit 10cde894b63146139f981857e4eedf756fa53dcb from qemu	2018-03-02 12:49:18 -05:00
Alex Bennée	ad548f8110	translate-all: exit cpu_restore_state early if translating The translation code uses cpu_ld*_code which can trigger a tlb_fill which if it fails will erroneously attempts a fault resolution. This never works during translation as the TB being generated hasn't been added yet. The target should have checked retaddr before calling cpu_restore_state but for those that have yet to be fixed we do it here to avoid a recursive tb_lock() under MTTCG's new locking regime Backports commit d8b2239bcd8872a5c5f7534d1658fc2365caab2d from qemu	2018-03-02 12:46:16 -05:00
Alex Bennée	a01496e6d9	target/i386/cpu.h: declare TCG_GUEST_DEFAULT_MO This suppresses the incorrect warning when forcing MTTCG for x86 guests on x86 hosts. A future patch will still warn when TARGET_SUPPORT_MTTCG hasn't been defined for the guest (which is still pending for x86). Backports commit 72c1701f62e8d44eb24a0583a958edc280105455 from qemu	2018-03-02 12:43:37 -05:00
Markus Armbruster	8a8dc93945	qapi: Improve qobject visitor documentation Backports commit aa3a982e674b09ae32502940f93ba98b3a8ad50e from qemu	2018-03-02 12:24:21 -05:00
Markus Armbruster	67cb4b0900	qapi: Fix object input visit beyond end of list Backports commit 1f41a645b65530859bf5984aa08e103bb452b473 from qemu	2018-03-02 12:22:50 -05:00
Markus Armbruster	ac1a61af47	qapi: Make input visitors detect unvisited list tails Fix the design flaw demonstrated in the previous commit: new method check_list() lets input visitors report that unvisited input remains for a list, exactly like check_struct() lets them report that unvisited input remains for a struct or union. Implement the method for the qobject input visitor (straightforward), and the string input visitor (less so, due to the magic list syntax there). The opts visitor's list magic is even more impenetrable, and all I can do there today is a stub with a FIXME comment. No worse than before. Backports commit a4a1c70dc759e5b81627e96564f344ab43ea86eb from qemu	2018-03-02 12:21:04 -05:00
Markus Armbruster	e0ee098c4a	qapi: Drop unused non-strict qobject input visitor The split between tests/test-qobject-input-visitor.c and tests/test-qobject-input-strict.c now makes less sense than ever. The next commit will take care of that. Backports commit 048abb7b20c9f822ad9d4b730bade73b3311a47a from qemu	2018-03-02 12:14:52 -05:00
Markus Armbruster	3e8b0c66a3	qom: Make object_property_set_qobject()'s input visitor strict Commit 240f64b made all qobject input visitors created outside tests strict, except for the one in object_property_set_qobject(). That one was left behind only because Eric couldn't spare the time to figure out whether making it strict would break anything, with a TODO comment. Time to resolve it. Strict makes a difference only for otherwise successful visits of QAPI structs or unions. Let's examine what the callers of object_property_set_qobject() visit: * object_property_set_str(), object_property_set_bool(), object_property_set_int() visit a QString, QBool, QInt, respectively. Strictness can't matter. * qmp_qom_set visits its @value argument. Comes straight from QMP and can be anything ('any' in the QAPI schema). Strictness matters when the property's set() method visits a struct or union QAPI type. No such methods exist, thus switching to strict can't break anything. If we acquire such methods in the future, we'll want the visitor to be strict, so that unexpected members get rejected as they should be. Switch to strict. Backports commit 05601ed2de60df0e344d6b783a6bc0c1ff2b5d1f from qemu	2018-03-02 12:10:50 -05:00
Markus Armbruster	2b7daee13b	qapi: Make string input and opts visitor require non-null input The string input visitor tries to cope with null input. Null input isn't used anywhere, and isn't covered by tests. Unsurprisingly, it doesn't fully work: start_list() crashes because it passes the input via parse_str() to strtoll() unchecked. Make string_input_visitor_new() assert its argument isn't null, and drop the code trying to deal with null input. The opts visitor crashes when you try to actually visit something with null input. Make opts_visitor_new() assert its argument isn't null, mostly for clarity. qobject_input_visitor_new() already asserts its argument isn't null. Backports commit f332e830e38b3ff3953ef02ac04e409ae53769c5 from qemu	2018-03-02 12:10:07 -05:00
Markus Armbruster	50e3cda49a	qapi: Drop string input visitor method optional() visit_optional() is to be called only between visit_start_struct() and visit_end_struct(). Visitors that don't support struct visits, i.e. don't implement start_struct(), end_struct(), have no use for it. Clarify documentation. The string input visitor doesn't support struct visits. Its parse_optional() is therefore useless. Drop it. Backports commit a8aec6de2ac1a5e36989fdfba29067b361009b75 from qemu	2018-03-02 12:07:55 -05:00
Markus Armbruster	84e5261cdf	qapi: Improve qobject input visitor error reporting Error messages refer to nodes of the QObject being visited by name. Trouble is the names are sometimes less than helpful: * The name of the root QObject is whatever @name argument got passed to the visitor, except NULL gets mapped to "null". We commonly pass NULL. Not good. Avoiding errors "at the root" mitigates. For instance, visit_start_struct() can only fail when the visited object is not a dictionary, and we commonly ensure it is beforehand. * The name of a QDict's member is the member key. Good enough only when this happens to be unique. * The name of a QList's member is "null". Not good. Improve error messages by referring to nodes by path instead, as follows: * The path of the root QObject is whatever @name argument got passed to the visitor, except NULL gets mapped to "<anonymous>". * The path of a root QDict's member is the member key. * The path of a root QList's member is "[%u]", where %u is the list index, starting at zero. * The path of a non-root QDict's member is the path of the QDict concatenated with "." and the member key. * The path of a non-root QList's member is the path of the QList concatenated with "[%u]", where %u is the list index. For example, the incorrect QMP command { "execute": "blockdev-add", "arguments": { "node-name": "foo", "driver": "raw", "file": {"driver": "file" } } } now fails with {"error": {"class": "GenericError", "desc": "Parameter 'file.filename' is missing"}} instead of {"error": {"class": "GenericError", "desc": "Parameter 'filename' is missing"}} and { "execute": "input-send-event", "arguments": { "device": "bar", "events": [ [] ] } } now fails with {"error": {"class": "GenericError", "desc": "Invalid parameter type for 'events[0]', expected: object"}} instead of {"error": {"class": "GenericError", "desc": "Invalid parameter type for 'null', expected: QDict"}} Aside: calling the thing "parameter" is suboptimal for QMP, because the root object is "arguments" there. The qobject output visitor doesn't have this problem because it should not fail. Same for dealloc and clone visitors. The string visitors don't have this problem because they visit just one value, whose name needs to be passed to the visitor as @name. The string output visitor shouldn't fail anyway. The options visitor uses QemuOpts names. Their name space is flat, so the use of QDict member keys as names is fine. NULL names used with roots and lists could conceivably result in bad error messages. Left for another day. Backports commit a9fc37f6bc3f2ab90585cb16493da9f6dcfbfbcf from qemu	2018-03-02 12:05:53 -05:00
Markus Armbruster	a5cf19858d	qapi: Make QObject input visitor set list reliably qobject_input_start_struct() sets list, except when it fails because qobject_input_get_object() fails, i.e. the input object doesn't exist. All the other input visitor start_struct(), start_list(), start_alternate() always set obj / list. Change qobject_input_start_struct() to match. Backports commit 58561c27669ddf1c6d39ff8ce25837c6f2d9d92c from qemu	2018-03-02 11:31:58 -05:00
Markus Armbruster	fdf09c6d12	qapi: Clean up after commit 3d344c2 Drop unused QIV_STACK_SIZE and unused qobject_input_start_struct() parameter errp. Backports commit b8874fbfd329b5084463bcacd1418d493a93c383 from qemu	2018-03-02 11:30:38 -05:00
Markus Armbruster	d7da652d4e	qapi: Improve a QObject input visitor error message The QObject input visitor has three error message formats: * Parameter '%s' is missing * "Invalid parameter type for '%s', expected: %s" * "QMP input object member '%s' is unexpected" The '%s' are member names (or "null", but I'll fix that later). The last error message calls the thing "QMP input object member" instead of "parameter". Misleading when the visitor is used on QObjects that don't come from QMP. Change it to "Parameter '%s' is unexpected". Backports commit 910f738b851a263396fc85b2052e47f884ffead3 from qemu	2018-03-02 11:29:02 -05:00
Markus Armbruster	d07bcef231	qmp: Eliminate silly QERR_QMP_* macros The QERR_ macros are leftovers from the days of "rich" error objects. QERR_QMP_BAD_INPUT_OBJECT, QERR_QMP_BAD_INPUT_OBJECT_MEMBER, QERR_QMP_EXTRA_MEMBER are used in just one place now, except for one use that has crept into qobject-input-visitor.c. Drop these macros, to make the (bad) error messages more visible. Backports commit 99fb0c53c038105bae68b02a3d9f1cbf7951ba10 from qemu	2018-03-02 11:28:17 -05:00
Yongji Xie	23f5b17a08	memory: Introduce DEVICE_HOST_ENDIAN for ram device At the moment ram device's memory regions are DEVICE_NATIVE_ENDIAN. It's incorrect. This memory region is backed by a MMIO area in host, so the uint64_t data that MemoryRegionOps read from/write to this area should be host-endian rather than target-endian. Hence, current code does not work when target and host endianness are different which is the most common case on PPC64. To fix it, this introduces DEVICE_HOST_ENDIAN for the ram device. This has been tested on PPC64 BE/LE host/guest in all possible combinations including TCG. Backports commit c99a29e702528698c0ce2590f06ca7ff239f7c39 from qemu	2018-03-02 11:24:32 -05:00
Paolo Bonzini	11709d0afa	cpu-exec: remove unnecessary check of cpu->exit_request The cpu->exit_request check in cpu_loop_exec_tb is unnecessary, because cpu->tcg_exit_req is always set after cpu->exit_request. So let the TB exit and we will pick up the exit request later in cpu_handle_interrupt. Backports commit 55ac0a9bf4e1b1adfc7d73586a7aa085f58c9851 from qemu	2018-03-02 11:21:35 -05:00
Eduardo Habkost	33ab5f71c9	i386: Reorganize and document CPUID initialization steps CPU runnability checks and CPU model expansion have slightly different requirements. Document the steps involved in loading a CPU model and realizing a CPU, so their requirements and purpose are clearly defined. This patch doesn't change any implementation. It just add comments, rename the x86_cpu_load_features() function for clarity (so it won't be confused with x86_cpu_load_def()), and move x86_cpu_filter_features() closer to it. Backports commit b8d834a00fa3ed4dad7d371e1a00938a126a54a0 from qemu	2018-03-02 10:55:00 -05:00
Eduardo Habkost	be606acff9	i386: Rename X86CPU::host_features to X86CPU::max_features Rename the field and add a small comment to make its purpose clearer. Backports commit 44bd8e530661be1d22ae0f461a5c9bdbcc3847ec from qemu	2018-03-02 10:51:40 -05:00
Pranith Kumar	ee609fa59f	aarch64: Change ext type to TCGType to fix warnings To fix the following warnings: In file included from /users/pranith/qemu/tcg/tcg.c:255: /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:879:24: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_cmp(s, ext, a, b, b_const); ~~~~~~~~~~~ ^~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:893:36: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_insn(s, 3201, CBZ, ext, a, offset); ~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn' glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__) ^ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:895:37: warning: implicit conversion from enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') to different enumeration type 'TCGType' (aka 'enum TCGType') [-Wenum-conversion] tcg_out_insn(s, 3201, CBNZ, ext, a, offset); ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:389:65: note: expanded from macro 'tcg_out_insn' glue(tcg_out_insn_,FMT)(S, glue(glue(glue(I,FMT),_),OP), ## __VA_ARGS__) ^ /users/pranith/qemu/tcg/aarch64/tcg-target.inc.c:1610:27: warning: implicit conversion from enumeration type 'TCGType' (aka 'enum TCGType') to different enumeration type 'TCGMemOp' (aka 'enum TCGMemOp') [-Wenum-conversion] tcg_out_brcond(s, ext, a2, a0, a1, const_args[1], arg_label(args[3])); ~~~~~~~~~~~~~~ ^~~ backports commit dc1eccd661ada3b746ca4438e444993c36a0f04f from qemu	2018-03-02 10:48:56 -05:00
Peter Maydell	e141ea5dd2	softfloat: Use correct type in float64_to_uint64_round_to_zero() In float64_to_uint64_round_to_zero() a typo meant that we were taking the uint64_t return value from float64_to_uint64() and putting it into an int64_t variable before returning it as uint64_t again. Use uint64_t instead of pointlessly casting it back and forth to int64_t. Backports commit d000b477f2693dbca97cd8ea751c2e0b71890662 from qemu	2018-03-02 10:44:10 -05:00
Peter Maydell	0c9ef6f4b3	cputlb: Don't assume do_unassigned_access() never returns In get_page_addr_code(), if the guest PC doesn't correspond to RAM then we currently run the CPU's do_unassigned_access() hook if it has one, and otherwise we give up and exit QEMU with a more-or-less useful message. This code assumes that the do_unassigned_access hook will never return, because if it does then we'll plough on attempting to use a non-RAM TLB entry to get a RAM address and will abort() in qemu_ram_addr_from_host_nofail(). Unfortunately some CPU implementations of this hook do return: Microblaze, SPARC and the ARM v7M. Change the code to call report_bad_exec() if the hook returns, as well as if it didn't have one. This means we can tidy it up to use the cpu_unassigned_access() function which wraps the "get the CPU class and call the hook if it has one" work, since we aren't trying to distinguish "no hook" from "hook existed and returned" any more. This brings the handling of this hook into line with the handling used for data accesses, where "hook returned" is treated the same as "no hook existed" and gets you the default behaviour. Backports commit 44d7ce0ef39cb45e13d384574d79799eb3d39834 from qemu	2018-03-02 10:42:35 -05:00
Nick Reilly	4114fb2c0e	Add missing fp_access_check() to aarch64 crypto instructions The aarch64 crypto instructions for AES and SHA are missing the check for if the FPU is enabled. Backports commit a4f5c5b72380deeccd53a6890ea3782f10ca8054 from qemu	2018-03-02 10:39:16 -05:00
Alex Bennée	caba238b5a	tcg: enable MTTCG by default for ARM on x86 hosts This enables the multi-threaded system emulation by default for ARMv7 and ARMv8 guests using the x86_64 TCG backend. This is because on the guest side: - The ARM translate.c/translate-64.c have been converted to - use MTTCG safe atomic primitives - emit the appropriate barrier ops - The ARM machine has been updated to - hold the BQL when modifying shared cross-vCPU state - defer powerctl changes to async safe work All the host backends support the barrier and atomic primitives but need to provide same-or-better support for normal load/store operations. Backports commit ca759f9e387db87e1719911f019bc60c74be9ed8 from qemu	2018-03-02 10:32:47 -05:00
Alex Bennée	ff0ff28939	target-arm: don't generate WFE/YIELD calls for MTTCG The WFE and YIELD instructions are really only hints and in TCG's case they were useful to move the scheduling on from one vCPU to the next. In the parallel context (MTTCG) this just causes an unnecessary cpu_exit and contention of the BQL. Backports commit c22edfebff29f63d793032e4fbd42a035bb73e27 from qemu	2018-03-02 10:27:36 -05:00
Alex Bennée	157efaa8a9	cputlb: tweak qemu_ram_addr_from_host_nofail reporting This moves the helper function closer to where it is called and updates the error message to report via error_report instead of the deprecated fprintf. Backports commit 857baec1d9e80947f0c1007c3a3d2331d62b4b53 from qemu	2018-03-02 10:24:03 -05:00
Alex Bennée	454932263c	cputlb and arm/sparc targets: convert mmuidx flushes from varg to bitmap While the vargs approach was flexible the original MTTCG ended up having munge the bits to a bitmap so the data could be used in deferred work helpers. Instead of hiding that in cputlb we push the change to the API to make it take a bitmap of MMU indexes instead. For ARM some the resulting flushes end up being quite long so to aid readability I've tended to move the index shifting to a new line so all the bits being or-ed together line up nicely, for example: tlb_flush_page_by_mmuidx(other_cs, pageaddr, (1 << ARMMMUIdx_S1SE1) \| (1 << ARMMMUIdx_S1SE0)); Backports commit 0336cbf8532935d8e23c2aabf3e2ce2c0697b6ac from qemu	2018-03-02 10:12:40 -05:00
Alex Bennée	d56a4b0be4	tcg: handle EXCP_ATOMIC exception for system emulation The patch enables handling atomic code in the guest. This should be preferably done in cpu_handle_exception(), but the current assumptions regarding when we can execute atomic sections cause a deadlock. The current mechanism discards the flags which were set in atomic execution. We ensure they are properly saved by calling the cc->cpu_exec_enter/leave() functions around the loop. As we are running cpu_exec_step_atomic() from the outermost loop we need to avoid an abort() when single stepping over atomic code since debug exception longjmp will point to the the setlongjmp in cpu_exec(). We do this by setting a new jmp_env so that it jumps back here on an exception. Backports relevant parts of commit 08e73c48b053566bfe0c994f154f73991cd0ff0e from qemu	2018-03-02 09:56:43 -05:00
Alex Bennée	6760605e1c	tcg: enable thread-per-vCPU There are a couple of changes that occur at the same time here: - introduce a single vCPU qemu_tcg_cpu_thread_fn One of these is spawned per vCPU with its own Thread and Condition variables. qemu_tcg_rr_cpu_thread_fn is the new name for the old single threaded function. - the TLS current_cpu variable is now live for the lifetime of MTTCG vCPU threads. This is for future work where async jobs need to know the vCPU context they are operating in. The user to switch on multi-thread behaviour and spawn a thread per-vCPU. For a simple test kvm-unit-test like: ./arm/run ./arm/locking-test.flat -smp 4 -accel tcg,thread=multi Will now use 4 vCPU threads and have an expected FAIL (instead of the unexpected PASS) as the default mode of the test has no protection when incrementing a shared variable. We enable the parallel_cpus flag to ensure we generate correct barrier and atomic code if supported by the front and backends. This doesn't automatically enable MTTCG until default_mttcg_enabled() is updated to check the configuration is supported. Backports relevant parts of commit 372579427a5040a26dfee78464b50e2bdf27ef26	2018-03-02 09:43:14 -05:00
Alex Bennée	632b853761	tcg: remove global exit_request There are now only two uses of the global exit_request left. The first ensures we exit the run_loop when we first start to process pending work and in the kick handler. This is just as easily done by setting the first_cpu->exit_request flag. The second use is in the round robin kick routine. The global exit_request ensured every vCPU would set its local exit_request and cause a full exit of the loop. Now the iothread isn't being held while running we can just rely on the kick handler to push us out as intended. We lightly re-factor the main vCPU thread to ensure cpu->exit_requests cause us to exit the main loop and process any IO requests that might come along. As an cpu->exit_request may legitimately get squashed while processing the EXCP_INTERRUPT exception we also check cpu->queued_work_first to ensure queued work is expedited as soon as possible. Backports commit e5143e30fb87fbf179029387f83f98a5a9b27f19 from qemu	2018-03-02 09:38:08 -05:00
Alex Bennée	4d90497d14	tcg: rename tcg_current_cpu to tcg_current_rr_cpu ..and make the definition local to cpus. In preparation for MTTCG the concept of a global tcg_current_cpu will no longer make sense. However we still need to keep track of it in the single-threaded case to be able to exit quickly when required. qemu_cpu_kick_no_halt() moves and becomes qemu_cpu_kick_rr_cpu() to emphasise its use-case. qemu_cpu_kick now kicks the relevant cpu as well as qemu_kick_rr_cpu() which will become a no-op in MTTCG. For the time being the setting of the global exit_request remains. Backports commit 791158d93b27f22a17c2ada06621831d54f09a2c from qemu Also atomically sets the unicorn equivalents	2018-03-02 09:28:51 -05:00
Lioncash	18a229a69f	Resolve symbol errors with softfloat	2018-03-02 09:25:05 -05:00
KONRAD Frederic	c5730ff194	tcg: add options for enabling MTTCG We know there will be cases where MTTCG won't work until additional work is done in the front/back ends to support. It will however be useful to be able to turn it on. As a result MTTCG will default to off unless the combination is supported. However the user can turn it on for the sake of testing. Backports commit 8d4e9146b3568022ea5730d92841345d41275d66 from qemu	2018-03-02 09:25:01 -05:00
Alex Bennée	8c89344517	tcg: move TCG_MO/BAR types into own file We'll be using the memory ordering definitions to define values for both the host and guest. To avoid fighting with circular header dependencies just move these types into their own minimal header. Backports commit 20937143145b8f5a4194e5c407731ba38797864e from qemu	2018-03-02 09:08:44 -05:00
Pranith Kumar	616becc2dc	mttcg: translate-all: Enable locking debug in a debug build Enable tcg lock debug asserts in a debug build by default instead of relying on DEBUG_LOCKING. None of the other DEBUG_* macros have asserts, so this patch removes DEBUG_LOCKING and enable these asserts in a debug build. Backports commit 6ac3d7e845549f08473f020c1c70f14b8911a67e from qemu	2018-03-02 09:00:58 -05:00
Markus Armbruster	89d8e58718	util/cutils: Change qemu_strtosz*() from int64_t to uint64_t This will permit its use in parse_option_size(). Backports commit f46bfdbfc8f95cf65d7818ef68a801e063c40332 from qemu	2018-03-02 08:58:55 -05:00
Markus Armbruster	8650d0213c	util/cutils: Return qemu_strtosz*() error and value separately This makes qemu_strtosz(), qemu_strtosz_mebi() and qemu_strtosz_metric() similar to qemu_strtoi64(), except negative values are rejected. Backports commit f17fd4fdf0df3d2f3444399d04c38d22b9a3e1b7 from qemu	2018-03-02 08:57:16 -05:00
Markus Armbruster	6093e67947	util/cutils: Let qemu_strtosz*() optionally reject trailing crap Change the qemu_strtosz() & friends to return -EINVAL when @endptr is null and the conversion doesn't consume the string completely. Matches how qemu_strtol() & friends work. Only test_qemu_strtosz_simple() passes a null @endptr. No functional change there, because its conversion consumes the string. Simplify callers that use @endptr only to fail when it doesn't point to '\0' to pass a null @endptr instead. Backports commit 4fcdf65ae2c00ae69f7625f26ed41f37d77b403c from qemu	2018-03-02 08:54:53 -05:00
Markus Armbruster	f9c9eb7334	util/cutils: Drop QEMU_STRTOSZ_DEFSUFFIX_* macros Writing QEMU_STRTOSZ_DEFSUFFIX_* instead of '*' gains nothing. Get rid of these eyesores. Backports commit 17f942560e54f8ee72996bc3276c697503606d7b from qemu	2018-03-02 08:53:15 -05:00
Markus Armbruster	858acd4142	util/cutils: New qemu_strtosz() Most callers of qemu_strtosz_suffix() pass QEMU_STRTOSZ_DEFSUFFIX_B. Capture the pattern in new qemu_strtosz(). Inline qemu_strtosz_suffix() into its only remaining caller. Backports commit 466dea14e677555dd24465aca75d00a3537ad062 from qemu	2018-03-02 08:50:56 -05:00
Markus Armbruster	a3358798d6	util/cutils: Rename qemu_strtosz() to qemu_strtosz_MiB() With qemu_strtosz(), no suffix means mebibytes. It's used rarely. I'm going to add a similar function where no suffix means bytes. Rename qemu_strtosz() to qemu_strtosz_MiB() to make the name qemu_strtosz() available for the new function. Backports commit e591591b323772eea733de6027f5e8b50692d0ff from qemu	2018-03-02 08:49:26 -05:00
Markus Armbruster	f656cd91ec	util/cutils: New qemu_strtosz_metric() To parse numbers with metric suffixes, we use qemu_strtosz_suffix_unit(nptr, &eptr, QEMU_STRTOSZ_DEFSUFFIX_B, 1000) Capture this in a new function for legibility: qemu_strtosz_metric(nptr, &eptr) Replace test_qemu_strtosz_suffix_unit() by test_qemu_strtosz_metric(). Rename qemu_strtosz_suffix_unit() to do_strtosz() and give it internal linkage. Backports commit d2734d2629266006b0413433778474d5801c60be from qemu	2018-03-02 08:47:40 -05:00
Markus Armbruster	fb962d2e74	util/cutils: Clean up control flow around qemu_strtol() a bit Reorder check_strtox_error() to make it obvious that we always store through a non-null @endptr. Transform if (some error) { error case ... err = value for error case; } else { normal case ... err = value for normal case; } return err; to if (some error) { error case ... return value for error case; } normal case ... return value for normal case; Backports commit 4baef2679e029c76707be1e2ed54bf3dd21693fe from qemu	2018-03-02 08:45:18 -05:00
Markus Armbruster	9236950e61	util/cutils: Clean up variable names around qemu_strtol() Name same things the same, different things differently. * qemu_strtol()'s parameter @nptr is called @p in check_strtox_error(). Rename the latter. * qemu_strtol()'s parameter @endptr is called @next in check_strtox_error(). Rename the latter. * qemu_strtol()'s variable @p is called @endptr in check_strtox_error(). Rename both to @ep. * qemu_strtol()'s variable @err is negative errno, check_strtox_error()'s parameter @err is positive. Rename the latter to @libc_errno. Same for qemu_strtoul(), qemu_strtoi64(), qemu_strtou64(), of course. Backports commit 717adf960933da0650d995f050d457063d591914 from qemu	2018-03-02 08:41:47 -05:00
Markus Armbruster	41c2e1168f	util/cutils: Rename qemu_strtoll(), qemu_strtoull() The name qemu_strtoll() suggests conversion to long long, but it actually converts to int64_t. Rename to qemu_strtoi64(). The name qemu_strtoull() suggests conversion to unsigned long long, but it actually converts to uint64_t. Rename to qemu_strtou64(). Backports commit b30d188677456b17c1cd68969e08ddc634cef644 from qemu	2018-03-02 08:39:45 -05:00
Markus Armbruster	ac34d92d09	util/cutils: Rewrite documentation of qemu_strtol() & friends Fixes the following documentation bugs: * Fails to document that null @nptr is safe. * Fails to document that we return -EINVAL when no conversion could be performed (commit 47d4be1). * Confuses long long with int64_t, and unsigned long long with uint64_t. * Claims the unsigned conversions can underflow. They can't. While there, mark problematic assumptions that int64_t is long long, and uint64_t is unsigned long long with FIXME comments. Backports commit 4295f879becfbbb9f4330489311586b96915d920 from qemu	2018-03-02 08:37:57 -05:00
Markus Armbruster	9d1937f25d	qdict: Make qdict_get_qlist() safe like qdict_get_qdict() Commit 89cad9f changed qdict_get_qdict() to return NULL instead of crash when the key doesn't exist or its value isn't a QDict. Commit 2d6421a neglected to do the same for qdict_get_qlist(). Correct that, and update the function comments. qdict_get_obj() is now unused, remove. Backports commit b25f23e7dbc6bc0dcda010222a4f178669d1aedc from qemu	2018-03-02 08:35:17 -05:00
Bharata B Rao	7fadaf0bc4	softfloat: Add float128_to_uint32_round_to_zero() float128_to_uint32_round_to_zero() is needed by xscvqpuwz instruction of PowerPC ISA 3.0. Backports commit fd425037d25cecaaffdb3831697e0adc10ca2ba3 from qemu	2018-03-02 08:33:09 -05:00
Bharata B Rao	64d32a2237	softfloat: Add float128_to_uint64_round_to_zero() Implement float128_to_uint64() and use that to implement float128_to_uint64_round_to_zero() This is required by xscvqpudz instruction of PowerPC ISA 3.0. Backports commit 2e6d85683576c970c714c1cc071dca742835b9d4 from qemu	2018-03-02 08:32:02 -05:00
Bharata B Rao	80e522b499	softfloat: Add round-to-odd rounding mode Power ISA 3.0 introduces a few quadruple precision floating point instructions that support round-to-odd rounding mode. The round-to-odd mode is explained as under: Let Z be the intermediate arithmetic result or the operand of a convert operation. If Z can be represented exactly in the target format, the result is Z. Otherwise the result is either Z1 or Z2 whichever is odd. Here Z1 and Z2 are the next larger and smaller numbers representable in the target format respectively. Backports commit 9ee6f678f473007e252934d6acd09c24490d9d42 from qemu	2018-03-02 08:25:00 -05:00
Paul Burton	411ddd16cf	target-mips: Provide function to test if a CPU supports an ISA Provide a new cpu_supports_isa function which allows callers to determine whether a CPU supports one of the ISA_ flags, by testing whether the associated struct mips_def_t sets the ISA flags in its insn_flags field. An example use of this is to allow boards which generate bootloader code to determine the properties of the CPU that will be used, for example whether the CPU is 64 bit or which architecture revision it implements. Backports commit bed9e5ceb158c886d548fe59675a6eba18baeaeb from qemu	2018-03-02 08:20:19 -05:00
Paolo Bonzini	37918ba5b0	exec: make address_space_cache_destroy idempotent Clear cache->mr so that address_space_cache_destroy does nothing the second time it is called. Backports commit 91047df38dffa80222179f63fbb74c1dfefa25ed from qemu	2018-03-02 08:16:17 -05:00
Paolo Bonzini	e66da21a56	cpu-exec: remove outermost infinite loop Reorganize the sigsetjmp so that the restart case falls through to cpu_handle_exception and the execution loop. Backports commit 4515e58d60dc3aac53dbd5e53e4c3bec126967d8 from qemu	2018-03-02 08:13:43 -05:00
Paolo Bonzini	af524401ad	cpu-exec: avoid repeated sigsetjmp on interrupts The sigsetjmp only needs to be prepared once for the whole execution of cpu_exec. This patch takes care of the "== 0" side, using a nested loop so that cpu_handle_interrupt goes straight back to cpu_handle_exception without doing another sigsetjmp. Backports commit a42cf3f3f266a97ceb13e8b99bc7b13f7bf4192a from qemu	2018-03-02 08:09:50 -05:00
Paolo Bonzini	28b615a8b7	cpu-exec: avoid cpu_loop_exit in cpu_handle_interrupt The siglongjmp goes straight back to the beginning of cpu_exec's outermost loop. We do not need a siglongjmp, we can simply leave the inner TB execution loop. Backports commit 209b71b60ef3341246038e1c926c3b704969cdd3 from qemu	2018-03-02 08:03:18 -05:00
Paolo Bonzini	b39acfc3c6	cpu-exec: tighten barrier on TCG_EXIT_REQUESTED This seems to have worked just fine so far on weakly-ordered architectures, but I don't see anything that prevents the reordering from: store 1 to exit_request store 1 to tcg_exit_req load tcg_exit_req store 0 to tcg_exit_req load exit_request store 0 to exit_request store 1 to exit_request store 1 to tcg_exit_req to this: store 1 to exit_request store 1 to tcg_exit_req load tcg_exit_req load exit_request store 1 to exit_request store 1 to tcg_exit_req store 0 to tcg_exit_req store 0 to exit_request therefore losing a request. It's possible that other memory barriers (e.g. in rcu_read_unlock) are hiding it, but better safe than sorry. Backports commit a70fe14b7dddcb944fbd6c9f3739cd3a22089af5 from qemu	2018-03-02 08:01:08 -05:00
Wei Huang	c9bdf5e6c7	target-arm: Enable vPMU support under TCG mode This patch contains several fixes to enable vPMU under TCG mode. It first removes the checking of kvm_enabled() while unsetting ARM_FEATURE_PMU. With it, the .pmu option can be used to turn on/off vPMU under TCG mode. Secondly the PMU node of DT table is now created under TCG. The last fix is to disable the masking of PMUver field of ID_AA64DFR0_EL1. Backports commit d6f02ce3b8a43ddd8f83553fe754a34b26fb273f from qemu	2018-03-02 07:58:48 -05:00
Wei Huang	5e3349a818	target-arm: Add support for PMU register PMINTENSET_EL1 This patch adds access support for PMINTENSET_EL1. Backports commit e6ec54571e424bb1d6e50e32fe317c616cde3e05 from qemu	2018-03-02 07:57:40 -05:00

1 2 3 4 5 ...

2509 commits