unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-12-24 12:35:42 +00:00

Author	SHA1	Message	Date
Richard Henderson	395e00cdc5	target-sparc: Remove softint as a TCG global The global is only ever read for one insn; we can just as well use a load from env instead and generate the same code. This also allows us to indicate the the associated helpers do not touch TCG globals. Backports commit e86ceb0d652baa5738e05a59ee0e7989dafbeaa1 from qemu	2018-02-25 04:49:27 -05:00
Richard Henderson	dcd1d6f8ce	target-sparc: Mark more flags for helpers Quite a few helpers do not modify tcg globals but did not so indicate. Backports commit be72f9fcca742c5e9a949f5eac901ed6cc26a2a0 from qemu	2018-02-25 04:28:54 -05:00
Markus Armbruster	c2ffbc575d	Clean up decorations and whitespace around header guards Cleaned up with scripts/clean-header-guards.pl. Backports commit 175de52487ce0b0c78daa4cdf41a5a465a168a25 from qemu	2018-02-25 04:26:02 -05:00
Markus Armbruster	1275b9b459	Clean up ill-advised or unusual header guards Cleaned up with scripts/clean-header-guards.pl. Backports commit 2a6a4076e117113ebec97b1821071afccfdfbc96 from qemu	2018-02-25 04:22:46 -05:00
Markus Armbruster	9ae2fc4d9e	Clean up header guards that don't match their file name Header guard symbols should match their file name to make guard collisions less likely. Offenders found with scripts/clean-header-guards.pl -vn. Cleaned up with scripts/clean-header-guards.pl, followed by some renaming of new guard symbols picked by the script to better ones. Backports commit 121d07125bb6d7079c7ebafdd3efe8c3a01cc440 from qemu	2018-02-25 04:18:42 -05:00
Markus Armbruster	25ec9ab016	tcg: Clean up tcg-target.h header guards These use guard symbols like TCG_TARGET_$target. scripts/clean-header-guards.pl doesn't like them because they don't match their file name (they should, to make guard collisions less likely). Clean them up: use guard symbol $target_TCG_TARGET_H for tcg/$target/tcg-target.h. Backports commit 14e54f8ecfe9c5e17348f456781344737ed10b3b from qemu	2018-02-25 04:15:08 -05:00
Markus Armbruster	2b65f98538	target-*: Clean up cpu.h header guards Most of them use guard symbols like CPU_$target_H, but we also have __MIPS_CPU_H__ and __TRICORE_CPU_H__. They all upset scripts/clean-header-guards.pl. The script dislikes CPU_$target_H because they don't match their file name (they should, to make guard collisions less likely). The others are reserved identifiers. Clean them all up: use guard symbol $target_CPU_H for target-$target/cpu.h. Backports commit 07f5a258750b3b9a6e10fd5ec3e29c9a943b650e from qemu	2018-02-25 04:12:46 -05:00
Markus Armbruster	60e8836b74	Use #include "..." for our own headers, <...> for others Tracked down with an ugly, brittle and probably buggy Perl script. Also move includes converted to <...> up so they get included before ours where that's obviously okay. Backports commit a9c94277f07d19d3eb14f199c3e93491aa3eae0e from qemu	2018-02-25 04:10:33 -05:00
Peter Maydell	f6f843b4d4	bswap.h: Document cpu_to_* and *_to_cpu conversion functions Add a documentation comment describing the functions for converting between the cpu and little or bigendian formats. Backports commit 7d820b766a2049f33ca7e078aa51018f2335f8c5 from qemu	2018-02-25 04:06:28 -05:00
Peter Maydell	1d7f813942	bswap.h: Remove unused cpu_to_w() and _to_cpup() Now that all uses of cpu_to_w() and _to_cpup() have been replaced with either ld_p()/st_p() or by doing direct dereferences and using the cpu_to_()/_to_cpu() byteswap functions, we can remove the unused implementations. Backports commit f76bde702916d0230bf359d478bcac8d7f3b30ae from qemu	2018-02-25 04:04:46 -05:00
Sergey Sorokin	d1e4ac0451	Fix confusing argument names in some common functions There are functions tlb_fill(), cpu_unaligned_access() and do_unaligned_access() that are called with access type and mmu index arguments. But these arguments are named 'is_write' and 'is_user' in their declarations. The patches fix the arguments to avoid a confusion. Backports commit b35399bb4e9968296a12303b00f9f2066470e987 from qemu	2018-02-25 03:58:27 -05:00
Leon Alrae	a465707a47	target-mips: enable 10-bit ASIDs in I6400 CPU Backports commit cdc46fab07a122dfcc8a1054510a68d936ae3440 from qemu	2018-02-25 03:50:58 -05:00
Paul Burton	002b392a15	target-mips: support CP0.Config4.AE bit The read-only Config4.AE bit set denotes extended 10 bits ASID. Backports commit a0c8060841f2d56fb3504292c18522b957972e4c from qemu	2018-02-25 03:49:36 -05:00
Paul Burton	ba4dcc8c2f	target-mips: change ASID type to hold more than 8 bits ASID currently has uint8_t type which is too small since some processors support more than 8 bits ASID. Therefore change its type to uint16_t. Backports commit 2d72e7b047d800c9f99262466f65a98684ecca14 from qemu	2018-02-25 03:48:10 -05:00
Paul Burton	ac27c881ff	target-mips: add ASID mask field and replace magic values Backports commit 6ec98bd7b64ad75870c8e9d87a90fcd1a64b4942 from qemu	2018-02-25 03:44:26 -05:00
Leon Alrae	7e589c117b	target-mips: replace MIPS64R6-generic with the real I6400 CPU model MIPS64R6-generic gradually gets closer to I6400 CPU, feature-wise. Rename it to make it clear which MIPS processor it is supposed to emulate. Backports commit 8f95ad1c79b4166350b982a6defe0e21faa04dac from qemu	2018-02-25 03:35:55 -05:00
Leon Alrae	c0b3938b88	target-mips: add exception base to MIPS CPU Replace hardcoded 0xbfc00000 with exception_base which is initialized with this default address so there is no functional change here. However, it is now exposed and consequently it will be possible to modify it from outside of the CPU. Backports commit 89777fd10fc3dd573c3b4d1b2efdd10af823c001 from qemu	2018-02-25 03:22:10 -05:00
Stanislav Shmarov	6f20d35cd1	translate-all: Fix user-mode self-modifying code in 2 page long TB In user-mode emulation Translation Block can consist of 2 guest pages. In that case QEMU also mprotects 2 host pages that are dedicated for guest memory, containing instructions. QEMU detects self-modifying code with SEGFAULT signal processing. In case if instruction in 1st page is modifying memory of 2nd page (or vice versa) QEMU will mark 2nd page with PAGE_WRITE, invalidate TB, generate new TB contatining 1 guest instruction and exit to CPU loop. QEMU won't call mprotect, and new TB will cause same SEGFAULT. Page will have both PAGE_WRITE_ORG and PAGE_WRITE flags, so QEMU will handle the signal as guest binary problem, and exit with guest SEGFAULT. Solution is to do following: In case if current TB was invalidated continue to invalidate TBs from remaining guest pages and mark pages as PAGE_WRITE. After that disable host page protection with mprotect. If current tb was invalidated longjmp to main loop. That is more efficient, since we won't get SEGFAULT when executing new TB. Backports commit 7399a337e4126f7c8c8af3336726f001378c4798 from qemu	2018-02-25 03:14:22 -05:00
Samuel Damashek	670d81367b	cputlb: Fix for self-modifying writes across page boundaries As it currently stands, QEMU does not properly handle self-modifying code when the write is unaligned and crosses a page boundary. The procedure for handling a write to the current translation block is to write-protect the current translation block, catch the write, split up the translation block into the current instruction (which remains write-protected so that the current instruction is not modified) and the remaining instructions in the translation block, and then restore the CPU state to before the write occurred so the write will be retried and successfully executed. However, since unaligned writes across pages are split into one-byte writes for simplicity, writes to the second page (which is not the current TB) may succeed before a write to the current TB is attempted, and since these writes are not invalidated before resuming state after splitting the TB, these writes will be performed a second time, thus corrupting the second page. Credit goes to Patrick Hulin for discovering this. In recent 64-bit versions of Windows running in emulated mode, this results in either being very unstable (a BSOD after a couple minutes of uptime), or being entirely unable to boot. Windows performs one or more 8-byte unaligned self-modifying writes (xors) which intersect the end of the current TB and the beginning of the next TB, which runs into the aforementioned issue. This commit fixes that issue by making the unaligned write loop perform the writes in forwards order, instead of reverse order. This way, QEMU immediately tries to write to the current TB, and splits the TB before any write to the second page is executed. The write then proceeds as intended. With this patch applied, I am able to boot and use Windows 7 64-bit and Windows 10 64-bit in QEMU without KVM. Per Richard Henderson's input, this patch also ensures the second page is in the TLB before executing the write loop, to ensure the second page is mapped. The original discussion of the issue is located at http://lists.nongnu.org/archive/html/qemu-devel/2014-08/msg02161.html. Backports commit 81daabaf7a572f138a8b88ba6eea556bdb0cce46 from qemu	2018-02-25 03:12:11 -05:00
Samuel Damashek	04c423b081	cputlb: Add address parameter to VICTIM_TLB_HIT Backports commit a390284b80d2b6581143cdb40666674e60e635ae from qemu	2018-02-25 03:03:36 -05:00
Richard Henderson	9e2422032a	cputlb: Move VICTIM_TLB_HIT out of line There are currently 22 invocations of this function, and we're about to increase that number. Backports commit 7e9a7c50d9a400ef51242d661a261123c2cc9485 from qemu	2018-02-25 02:58:47 -05:00
Haozhong Zhang	2893a1c381	target-i386: Publish advised value of MSR_IA32_FEATURE_CONTROL via fw_cfg It's a prerequisite that certain bits of MSR_IA32_FEATURE_CONTROL should be set before some features (e.g. VMX and LMCE) can be used, which is usually done by the firmware. This patch adds a fw_cfg file "etc/msr_feature_control" which contains the advised value of MSR_IA32_FEATURE_CONTROL and can be used by guest firmware (e.g. SeaBIOS). Backports commit 217f1b4a72153cf8d556e9d45919e9222c38d25e from qemu	2018-02-25 02:49:42 -05:00
Ashok Raj	b58f1fccce	target-i386: kvm: Add basic Intel LMCE support This patch adds the support to inject SRAR and SRAO as LMCE, i.e. they are injected to only one VCPU rather than broadcast to all VCPUs. As KVM reports LMCE support on Intel platforms, this features is only available on Intel platforms. LMCE is disabled by default and can be enabled/disabled by cpu option 'lmce=on/off'. Backports commit 87f8b626041ceaea9adcfdbd549359f0ca7b871d from qemu	2018-02-25 02:48:22 -05:00
Evgeny Yakovlev	49fdd75329	target-i386: Report hyperv feature words through qom This change adds hyperv feature words report through qom rpc. When VM is configured with hyperv features enabled libvirt will check that required feature words are set in cpuid leaf 40000003 through qom request. Currently qemu does not report hyperv feature words which prevents windows guests from starting with libvirt. To avoid conflicting with current hyperv properties all added feature words cannot be set directly with -cpu +feature yet. Backports commit c35bd19a5c9140bce8b913cc5cefe6f071135bdb from qemu	2018-02-25 02:45:20 -05:00
Paolo Bonzini	f39cc9e3b9	target-i386: Avoid using locals outside their scope x86_cpu_parse_featurestr has a "val = num;" assignment just before num goes out of scope. Push num up to fix the issue. Backports commit cf2887c9738451eb989c6c102af070dee2dc172a from qemu	2018-02-25 02:30:06 -05:00
Paolo Bonzini	1be92ac243	target-i386: TCG can support CPUID.07H:EBX.erms ERMS just says "rep movsb" and "rep stosb" are fast. It does not imply any new instruction, so we can support it easily. Backports commit 7eb24386dbfb0b66464c7f856c1074c606efccda from qemu	2018-02-25 02:29:00 -05:00
Igor Mammedov	78f9128dbb	target-sparc: Use sparc_cpu_parse_features() directly Make SPARC target use sparc_cpu_parse_features() directly so it won't get in the way of switching other propertified targets to handling features as global properties. Backports commit fb02d56e96d553088c5b4267a3c954a3e952a50a from qemu	2018-02-25 02:27:23 -05:00
Sergey Sorokin	e4d123caa9	tcg: Improve the alignment check infrastructure Some architectures (e.g. ARMv8) need the address which is aligned to a size more than the size of the memory access. To support such check it's enough the current costless alignment check implementation in QEMU, but we need to support an alignment size specifying. Backports commit 1f00b27f17518a1bcb4cedca49eaec96a4d560bd from qemu	2018-02-25 02:23:28 -05:00
Richard Henderson	23586e2674	tcg: Optimize spills of constants While we can store constants via constrants on INDEX_op_st_i32 et al, we weren't able to spill constants to backing store. Add a new backend interface, tcg_out_sti, which may store the constant (and is allowed to fail). Rearrange the temp_* helpers so that we only attempt to directly store a constant when the temp is becoming dead/free. Backports commit 59d7c14eeff8d2ad7f61aed86ce5a176113bc153 from qemu	2018-02-25 01:45:29 -05:00
Richard Henderson	64fda683b1	tcg: Fix name for high-half register	2018-02-25 01:36:35 -05:00
Lioncash	532f840dc3	qapi: Add new clone visitor We have a couple places in the code base that want to deep-clone one QAPI object into another, and they were resorting to serializing the struct out to QObject then reparsing it. A much more efficient version can be done by adding a new clone visitor. Since cloning is still relatively uncommon, expose the use of the new visitor via a QAPI_CLONE() macro that takes care of type-punning the underlying function pointer, rather than generating lots of unused functions for types that won't be cloned. And yes, we're relying on the compiler treating all pointers equally, even though a strict C program cannot portably do so - but we're not the first one in the qemu code base to expect it to work (hello, glib!). The choice of adding a fourth visitor type deserves some explanation. On the surface, the clone visitor is mostly an input visitor (it takes arbitrary input - in this case, another QAPI object - and creates a new QAPI object during the course of the visit). But ever since commit da72ab0 consolidated enum visits based on the visitor type, using VISITOR_INPUT would cause us to run visit_type_str(), even though for cloning there is nothing to do (we just copy the enum value across, without regards to its mapping to strings). Also, since our input happens to be a QAPI object, we can also satisfy the internal checks for VISITOR_OUTPUT. So in the end, I settled with a new VISITOR_CLONE, and chose its value such that many internal checks can use 'v->type & mask', sticking to 'v->type == value' where the difference matters. Note that we can only clone objects (including alternates) and lists, not built-ins or enums. The visitor core hides integer width from the actual visitor (since commit 04e070d), and as long as that's the case, we can't clone top-level integers. Then again, those can always be cloned by direct copy, since they are not objects with deep pointers, so it's no real loss. And restricting cloning to just objects and lists is cleaner than restricting it to non-integers. As such, I documented that the clone visitor is for direct use only by code internal to QAPI, and should not be used on incomplete objects (other than a hack to work around the fact that we allow NULL in place of "" in visit_type_str() in other output visitors). Note that as written, the clone visitor will never fail on a complete object. Scalars (including enums) not at the root of the clone copy just fine with no additional effort while visiting the scalar, by virtue of a g_memdup() each time we push another struct onto the stack. Cloning a string requires deduplication of a pointer, which means it can also provide the guarantee of an input visitor of never producing NULL even when still accepting NULL in place of "" the way the QMP output visitor does. Cloning an 'any' type could be possible by incrementing the QObject refcnt, but it's not obvious whether that is better than implementing a QObject deep clone. So for now, we document it as unsupported, and intentionally omit the .type_any() callback to let a developer know their usage needs implementation. Add testsuite coverage for several different clone situations, to ensure that the code is working. I also tested that valgrind was happy with the test. Backports commit a15fcc3cf69ee3d408f60d6cc316488d2b0249b4 from qemu	2018-02-25 01:34:12 -05:00
Eric Blake	85af4b2030	qapi: Add new visit_complete() function Making each output visitor provide its own output collection function was the only remaining reason for exposing visitor sub-types to the rest of the code base. Add a polymorphic visit_complete() function which is a no-op for input visitors, and which populates an opaque pointer for output visitors. For maximum type-safety, also add a parameter to the output visitor constructors with a type-correct version of the output pointer, and assert that the two uses match. This approach was considered superior to either passing the output parameter only during construction (action at a distance during visit_free() feels awkward) or only during visit_complete() (defeating type safety makes it easier to use incorrectly). Most callers were function-local, and therefore a mechanical conversion; the testsuite was a bit trickier, but the previous cleanup patch minimized the churn here. The visit_complete() function may be called at most once; doing so lets us use transfer semantics rather than duplication or ref-count semantics to get the just-built output back to the caller, even though it means our behavior is not idempotent. Generated code is simplified as follows for events: \|@@ -26,7 +26,7 @@ void qapi_event_send_acpi_device_ost(ACP \| QDict qmp; \| Error err = NULL; \| QMPEventFuncEmit emit; \|- QmpOutputVisitor qov; \|+ QObject obj; \| Visitor v; \| q_obj_ACPI_DEVICE_OST_arg param = { \| info \|@@ -39,8 +39,7 @@ void qapi_event_send_acpi_device_ost(ACP \| \| qmp = qmp_event_build_dict("ACPI_DEVICE_OST"); \| \|- qov = qmp_output_visitor_new(); \|- v = qmp_output_get_visitor(qov); \|+ v = qmp_output_visitor_new(&obj); \| \| visit_start_struct(v, "ACPI_DEVICE_OST", NULL, 0, &err); \| if (err) { \|@@ -55,7 +54,8 @@ void qapi_event_send_acpi_device_ost(ACP \| goto out; \| } \| \|- qdict_put_obj(qmp, "data", qmp_output_get_qobject(qov)); \|+ visit_complete(v, &obj); \|+ qdict_put_obj(qmp, "data", obj); \| emit(QAPI_EVENT_ACPI_DEVICE_OST, qmp, &err); and for commands: \| { \| Error err = NULL; \|- QmpOutputVisitor qov = qmp_output_visitor_new(); \| Visitor v; \| \|- v = qmp_output_get_visitor(qov); \|+ v = qmp_output_visitor_new(ret_out); \| visit_type_AddfdInfo(v, "unused", &ret_in, &err); \|- if (err) { \|- goto out; \|+ if (!err) { \|+ visit_complete(v, ret_out); \| } \|- *ret_out = qmp_output_get_qobject(qov); \|- \|-out: \| error_propagate(errp, err); Backports commit 3b098d56979d2f7fd707c5be85555d114353a28d from qemu	2018-02-25 01:20:03 -05:00
Eric Blake	ec53301cda	qmp-output-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need qmp_output_visitor_cleanup(); however, we still need to expose the subtype for qmp_output_get_qobject(). Backports commit 1830f22a6777cedaccd67a08f675d30f7a85ebfd from qemu	2018-02-25 01:12:27 -05:00
Eric Blake	f008d93ac0	qmp-input-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need qmp_input_visitor_cleanup(); which in turn means we no longer need to return a subtype from qmp_input_visitor_new() nor a public upcast function. Generated code changes to qmp-marshal.c look like: \|@@ -52,11 +52,10 @@ void qmp_marshal_add_fd(QDict args, QOb \| { \| Error err = NULL; \| AddfdInfo retval; \|- QmpInputVisitor qiv = qmp_input_visitor_new(QOBJECT(args), true); \| Visitor *v; \| q_obj_add_fd_arg arg = {0}; \| \|- v = qmp_input_get_visitor(qiv); \|+ v = qmp_input_visitor_new(QOBJECT(args), true); \| visit_start_struct(v, NULL, NULL, 0, &err); \| if (err) { \| goto out; Backports commit b70ce1018a251c0c33498d9c927a07cade655a5e from qemu	2018-02-25 01:10:53 -05:00
Eric Blake	e88a7e260b	string-input-visitor: Favor new visit_free() function Now that we have a polymorphic visit_free(), we no longer need string_input_visitor_cleanup(); which in turn means we no longer need to return a subtype from string_input_visitor_new() nor a public upcast function. Backports commit 7a0525c7be6b38d32d586e3fd12e7377ded21faa from qemu	2018-02-25 01:08:04 -05:00
Eric Blake	7f741a6c9b	qapi: Add new visit_free() function Making each visitor provide its own (awkwardly-named) FOO_cleanup() is unusual, when we can instead have a polymorphic visit_free() interface. Over the next few patches, we can use the polymorphic functions to eliminate the need for a FOO_get_visitor() function for accessing specific visitor functionality, once everything can be accessed directly through the Visitor* interfaces. The dealloc visitor is the first one converted to completely use the new entry point, since qapi_dealloc_visitor_cleanup() was the only reason that qapi_dealloc_get_visitor() existed, and only generated and testsuite code was even using it. With the new visit_free() entry point in place, we no longer need to expose the QapiDeallocVisitor subtype through qapi_dealloc_visitor_new(), and can get by with less generated code, with diffs that look like: \| void qapi_free_ACPIOSTInfo(ACPIOSTInfo obj) \| { \|- QapiDeallocVisitor qdv; \| Visitor *v; \| \| if (!obj) { \| return; \| } \| \|- qdv = qapi_dealloc_visitor_new(); \|- v = qapi_dealloc_get_visitor(qdv); \|+ v = qapi_dealloc_visitor_new(); \| visit_type_ACPIOSTInfo(v, NULL, &obj, NULL); \|- qapi_dealloc_visitor_cleanup(qdv); \|+ visit_free(v); \|} Backports commit 2c0ef9f411ae6081efa9eca5b3eab2dbeee45a6c from qemu	2018-02-25 01:05:41 -05:00
Eric Blake	37ae4dfdfd	qapi: Add parameter to visit_end_* Rather than making the dealloc visitor track of stack of pointers remembered during visit_start_* in order to free them during visit_end_, it's a lot easier to just make all callers pass the same pointer to visit_end_. The generated code has access to the same pointer, while all other users are doing virtual walks and can pass NULL. The dealloc visitor is then greatly simplified. All three visit_end_() functions intentionally take a void, even though the visit_start_() functions differ between void, GenericList, and GenericAlternate*. This is done for several reasons: when doing a virtual walk, passing NULL doesn't care what the type is, but when doing a generated walk, we already have to cast the caller's specific FOO to call visit_start, while using void** lets us use visit_end without a cast. Also, an upcoming patch will add a clone visitor that wants to use the same implementation for all three visit_end callbacks, which is made easier if all three share the same signature. For visitors with already track per-object state (the QMP visitors via a stack, and the string visitors which do not allow nesting), add an assertion that the caller is indeed passing the same pointer to paired calls. Backports commit 1158bb2a058fcdd0c8fc3e60dc77f7a57ddbb271 from qemu	2018-02-25 00:57:54 -05:00
Changlong Xie	2ca07642f1	qom: Fix comment typo It's qom_unref, not qdef_unref. Backports commit ada03a0e8423ef8950e30d216f56a9661a4070e2 from qemu	2018-02-25 00:46:15 -05:00
Markus Armbruster	eeef227560	range: Replace internal representation of Range Range represents a range as follows. Member @start is the inclusive lower bound, member @end is the exclusive upper bound. Zero @end is special: if @start is also zero, the range is empty, else @end is to be interpreted as 2^64. No other empty ranges may occur. The range [0,2^64-1] cannot be represented. If you try to create it with range_set_bounds1(), you get the empty range instead. If you try to create it with range_set_bounds() or range_extend(), assertions fail. Before range_set_bounds() existed, the open-coded creation usually got you the empty range instead. Open deathtrap. Moreover, the code dealing with the janus-faced @end is too clever by half. Dumb this down to a more pedestrian representation: members @lob and @upb are inclusive lower and upper bounds. The empty range is encoded as @lob = 1, @upb = 0. Backports commit 6dd726a2bf1b800289d90a84d5fcb5ce7b78a8e1 from qemu	2018-02-25 00:44:36 -05:00
Markus Armbruster	8b2a0c4ece	range: Eliminate direct Range member access Users of struct Range mess liberally with its members, which makes refactoring hard. Create a set of methods, and convert all users to call them instead of accessing members. The methods have carefully worded contracts, and use assertions to check them. Backports commit a0efbf16604770b9d805bcf210ec29942321134f from qemu	2018-02-25 00:39:43 -05:00
Alistair Francis	fbb0645fb3	bitops: Add MAKE_64BIT_MASK macro Add a macro that creates a 64bit value which has length number of ones shifted across by the value of shift. Backports commit ae2923b5c20a21c6457680330506a9c13873485c from qemu	2018-02-25 00:30:39 -05:00
Peter Maydell	efc6cc2b83	memory: Assert that memory_region_init_rom_device() ops aren't NULL It doesn't make sense to pass a NULL ops argument to memory_region_init_rom_device(), because the effect will be that if the guest tries to write to the memory region then QEMU will segfault. Catch the bug earlier by sanity checking the arguments to this function, and remove the misleading documentation that suggests that passing NULL might be sensible. Backports commit 39e0b03dec518254fabd2acff29548d3f1d2b754 from qemu	2018-02-25 00:29:52 -05:00
Peter Maydell	334e951ec1	memory: Provide memory_region_init_rom() Provide a new helper function memory_region_init_rom() for memory regions which are read-only (and unlike those created by memory_region_init_rom_device() don't have special behaviour for writes). This has the same behaviour as calling memory_region_init_ram() and then memory_region_set_readonly() (which is what we do today in boards with pure ROMs) but is a more easily discoverable API for the purpose. Backports commit a1777f7f6462c66e1ee6e98f0d5c431bfe988aa5 from qemu	2018-02-25 00:28:17 -05:00
Alexey Kardashevskiy	7187d77cfa	memory: Add MemoryRegionIOMMUOps.notify_started/stopped callbacks The IOMMU driver may change behavior depending on whether a notifier client is present. In the case of POWER, this represents a change in the visibility of the IOTLB, for other drivers such as intel-iommu and future AMD-Vi emulation, notifier support is not yet enabled and this provides the opportunity to flag that incompatibility. Backports commit d22d8956b185c002b50a4d0883aff61f857347ef from qemu	2018-02-25 00:23:00 -05:00
Eric Blake	c14d8226ab	qapi: Fix memleak in string visitors on int lists Commit 7f8f9ef1 introduced the ability to store a list of integers as a sorted list of ranges, but when merging ranges, it leaks one or more ranges. It was also using range_get_last() incorrectly within range_compare() (a range is a start/end pair, but range_get_last() is for start/len pairs), and will also mishandle a range ending in UINT64_MAX (remember, we document that no range covers 2*64 bytes, but that ranges that end on UINT64_MAX have end < begin). The whole merge algorithm was rather complex, and included unnecessary passes over data within glib functions, and enough indirection to make it hard to easily plug the data leaks. Since we are already hard-coding things to a list of ranges, just rewrite the thing to open-code the traversal and comparisons, by making the range_compare() helper function give us an answer that is easier to use, at which point we avoid the need to pass any callbacks to g_list_(). Then by reusing range_extend() instead of duplicating effort with range_merge(), we cover the corner cases correctly. Drop the now-unused range_merge() and ranges_can_merge(). Doing this lets test-string-{input,output}-visitor pass under valgrind without leaks. Backports commit db486cc334aafd3dbdaf107388e37fc3d6d3e171 from qemu	2018-02-25 00:20:34 -05:00
Eric Blake	ef357d06bc	qapi: Simplify use of range.h Calling our function g_list_insert_sorted_merged is a misnomer, since we are NOT writing a glib function. Furthermore, we are making every caller pass the same comparator function of range_merge(): any caller that would try otherwise would break in weird ways since our internal call to ranges_can_merge() is hard-coded to operate only on ranges, rather than paying attention to the caller's comparator. Better is to fix things so that callers don't have to care about our internal comparator, by picking a function name and updating the parameter type away from a gratuitous use of void*, to make it obvious that we are operating specifically on a list of ranges and not a generic list. Plus, refactoring the code here will make it easier to plug a memory leak in the next patch. range_compare() is now internal only, and moves to the .c file. Backports commit 7c47959d0cb05db43014141a156ada0b6d53a750 from qemu	2018-02-25 00:02:42 -05:00
Eric Blake	5e22c7e180	range: Create range.c for code that should not be inline g_list_insert_sorted_merged() is rather large to be an inline function; move it to its own file. range_merge() and ranges_can_merge() can likewise move, as they are only used internally. Also, it becomes obvious that the condition within range_merge() is already satisfied by its caller, and that the return value is not used. The diffstat is misleading, because of the copyright boilerplate. Backports commit fec0fc0a13ac7f1a1130433a6740cd850c3db34a from qemu	2018-02-24 23:59:13 -05:00
Eric Blake	ebeb0e46f8	qapi: Fix crash on missing alternate member of QAPI struct If a QAPI struct has a mandatory alternate member which is not present on input, the input visitor reports an error for the missing alternate without setting the discriminator, but the cleanup code for the struct still tries to use the dealloc visitor to clean up the alternate. Commit dbf11922 changed visit_start_alternate to set obj to NULL when an error occurs, where it was previously left untouched. Thus, before the patch, the dealloc visitor is blindly trying to cleanup whatever branch corresponds to (obj)->type == 0 (that is, QTYPE_NONE, because obj still pointed to zeroed memory), which selects the default branch of the switch and sets an error, but this second error is ignored by the way the dealloc visitor is used; but after the patch, the attempt to switch dereferences NULL. When cleaning up after a partial object parse, we specifically check for !obj after visit_start_struct() (see gen_visit_object()); doing the same for alternates fixes the crash. Enhance the testsuite to give coverage for both missing struct and missing alternate members. Also add an abort - we expect visit_start_alternate() to either set an error or to set (obj)->type to a valid QType that corresponds to actual user input, and QTYPE_NONE should never be reachable from valid input. Had the abort() been in place earlier, we might have noticed the dealloc visitor dereferencing bogus zeroed memory prior to when commit dbf11922 forced our hand by setting obj to NULL and causing a fault. Test case: {'execute':'blockdev-add', 'arguments':{'options':{'driver':'raw'}}} The choice of 'driver':'raw' selects a BlockdevOptionsGenericFormat struct, which has a mandatory 'file':'BlockdevRef' in QAPI. Since 'file' is missing as a sibling of 'driver', this should report a graceful error rather than fault. After this patch, we are back to: {"error": {"class": "GenericError", "desc": "Parameter 'file' is missing"}} Generated code in qapi-visit.c changes as: \|@@ -2444,6 +2444,9 @@ void visit_type_BlockdevRef(Visitor v, \| if (err) { \| goto out; \| } \|+ if (!obj) { \|+ goto out_obj; \|+ } \| switch ((obj)->type) { \| case QTYPE_QDICT: \| visit_start_struct(v, name, NULL, 0, &err); \|@@ -2459,10 +2462,13 @@ void visit_type_BlockdevRef(Visitor v, \| case QTYPE_QSTRING: \| visit_type_str(v, name, &(*obj)->u.reference, &err); \| break; \|+ case QTYPE_NONE: \|+ abort(); \| default: \| error_setg(&err, QERR_INVALID_PARAMETER_TYPE, name ? name : "null", \| "BlockdevRef"); \| } \|+out_obj: \| visit_end_alternate(v); Backports commit 9b4e38fe6a35890bb1d995316d7be08de0b30ee5 from qemu	2018-02-24 23:53:29 -05:00
Aleksandar Markovic	f95e0e9e98	target-mips: Add FCR31's FS bit definition Add preprocessor definition of FCR31's FS bit, and update related code for setting this bit. Backports commit 77be419980114d75605811e1681115d0919cfa1a from qemu	2018-02-24 21:32:10 -05:00
Aleksandar Markovic	4a540f88de	target-mips: Implement FCR31's R/W bitmask and related functionalities This patch implements read and write access rules for Mips floating point control and status register (FCR31). The change can be divided into following parts: - Add fields that will keep FCR31's R/W bitmask in procesor definitions and processor float_status structure. - Add appropriate value for FCR31's R/W bitmask for each supported processor. - Add function for setting snan_bit_is_one, and integrate it in appropriate places. - Modify handling of CTC1 (case 31) instruction to use FCR31's R/W bitmask. - Modify handling user mode executables for Mips, in relation to the bit EF_MIPS_NAN2008 from ELF header, that is in turn related to reading and writing to FCR31. - Modify gdb behavior in relation to FCR31. Backports commit 599bc5e89c46f95f86ccad0d747d041c89a28806 from qemu	2018-02-24 21:30:24 -05:00
Aleksandar Markovic	84b516d9db	target-mips: Add nan2008 flavor of <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D> New set of helpers for handling nan2008-syle versions of instructions <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D>, for Mips R6. All involved instructions have float operand and integer result. Their core functionality is implemented via invocations of appropriate SoftFloat functions. The problematic cases are when the operand is a NaN, and also when the operand (float) is out of the range of the result. Here one can distinguish three cases: CASE MIPS-A: (FCR31.NAN2008 == 1) 1. Operand is a NaN, result should be 0; 2. Operand is larger than INT_MAX, result should be INT_MAX; 3. Operand is smaller than INT_MIN, result should be INT_MIN. CASE MIPS-B: (FCR31.NAN2008 == 0) 1. Operand is a NaN, result should be INT_MAX; 2. Operand is larger than INT_MAX, result should be INT_MAX; 3. Operand is smaller than INT_MIN, result should be INT_MAX. CASE SoftFloat: 1. Operand is a NaN, result is INT_MAX; 2. Operand is larger than INT_MAX, result is INT_MAX; 3. Operand is smaller than INT_MIN, result is INT_MIN. Current implementation of <CEIL\|CVT\|FLOOR\|ROUND\|TRUNC>.<L\|W>.<S\|D> implements case MIPS-B. This patch relates to case MIPS-A. For case MIPS-A, only return value for NaN-operands should be corrected after appropriate SoftFloat library function is called. Related MSA instructions FTRUNC_S and FTINT_S already handle well all cases, in the fashion similar to the code from this patch. Backports commit 87552089b62fa229d2ff86906e4e779177fb5835 from qemu	2018-02-24 21:14:04 -05:00
Aleksandar Markovic	a411a12170	target-mips: Add abs2008 flavor of <ABS\|NEG>.<S\|D> Updated handling of instructions <ABS\|NEG>.<S\|D>. Note that legacy (pre-abs2008) ABS and NEG instructions are arithmetic (and, therefore, any NaN operand causes signaling invalid operation), while abs2008 ones are non-arithmetic, always and only changing the sign bit, even for NaN-like operands. Details on these instructions are documented in [1] p. 35 and 359. Implementation-wise, abs2008 versions are implemented without helpers, for simplicity and performance sake. [1] "MIPS Architecture For Programmers Volume II-A: The MIPS64 Instruction Set Reference Manual", Imagination Technologies LTD, Revision 6.04, November 13, 2015 Backports commit 6be77480052b1a71557081896e7080363a8a2f95 from qemu	2018-02-24 20:45:06 -05:00
Aleksandar Markovic	ef9f33a345	target-mips: Activate IEEE 754-2008 signaling NaN bit meaning for MSA Function msa_reset() is updated so that flag snan_bit_is_one is properly set to 0. By applying this patch, a number of incorrect MSA behaviors that require IEEE 754-2008 compliance will be fixed. Those are behaviors that (up to the moment of applying this patch) did not get the desired functionality from SoftFloat library with respect to distinguishing between quiet and signaling NaN, getting default NaN values (both quiet and signaling), establishing if a floating point number is NaN or not, etc. Two examples: * FMAX, FMIN will now correctly detect and propagate NaNs. * FCLASS.D ans FCLASS.S will now correcty detect NaN flavors Backports commit 40bd6dd456e61a36e454fb9dd2cc739b67c224cf from qemu	2018-02-24 20:41:48 -05:00
Aleksandar Markovic	3e9325f1e9	softfloat: Handle snan_bit_is_one == 0 in MIPS pickNaNMulAdd() Only for Mips platform, and only for cases when snan_bit_is_one is 0, correct the order of argument comparisons in pickNaNMulAdd(). For more info, see [1], page 53, section "3.5.3 NaN Propagation". [1] "MIPS Architecture for Programmers Volume IV-j: The MIPS32 SIMD Architecture Module", Imagination Technologies LTD, Revision 1.12, February 3, 2016 Backports commit c27644f0e9659471e1c9355da5b667960d311937 from qemu	2018-02-24 20:40:11 -05:00
Aleksandar Markovic	33833b6605	softfloat: For Mips only, correct default NaN values Only for Mips platform, and only for cases when snan_bit_is_one is 0, correct default NaN values (in their 16-, 32-, and 64-bit flavors). For more info, see [1], page 84, Table 6.3 "Value Supplied When a New Quiet NaN Is Created", and [2], page 52, Table 3.7 "Default NaN Encodings". [1] "MIPS Architecture For Programmers Volume II-A: The MIPS64 Instruction Set Reference Manual", Imagination Technologies LTD, Revision 6.04, November 13, 2015 [2] "MIPS Architecture for Programmers Volume IV-j: The MIPS32 SIMD Architecture Module", Imagination Technologies LTD, Revision 1.12, February 3, 2016 Backports commit a7c04d545a97126c9df9d96623747d8613aaf7db from qemu	2018-02-24 20:38:23 -05:00
Aleksandar Markovic	33ee9429b2	softfloat: Clean code format in fpu/softfloat-specialize.h fpu/softfloat-specialize.h is the most critical file in SoftFloat library, since it handles numerous differences between platforms in relation to floating point arithmetics. This patch makes the code in this file more consistent format-wise, and hopefully easier to debug and maintain. Backports commit a59eaea64686c8966b7653303660f8c26f285c77 from qemu	2018-02-24 20:35:05 -05:00
Aleksandar Markovic	6eb4fa54f6	softfloat: Implement run-time-configurable meaning of signaling NaN bit This patch modifies SoftFloat library so that it can be configured in run-time in relation to the meaning of signaling NaN bit, while, at the same time, strictly preserving its behavior on all existing platforms. Background: In floating-point calculations, there is a need for denoting undefined or unrepresentable values. This is achieved by defining certain floating-point numerical values to be NaNs (which stands for "not a number"). For additional reasons, virtually all modern floating-point unit implementations use two kinds of NaNs: quiet and signaling. The binary representations of these two kinds of NaNs, as a rule, differ only in one bit (that bit is, traditionally, the first bit of mantissa). Up to 2008, standards for floating-point did not specify all details about binary representation of NaNs. More specifically, the meaning of the bit that is used for distinguishing between signaling and quiet NaNs was not strictly prescribed. (IEEE 754-2008 was the first floating-point standard that defined that meaning clearly, see [1], p. 35) As a result, different platforms took different approaches, and that presented considerable challenge for multi-platform emulators like QEMU. Mips platform represents the most complex case among QEMU-supported platforms regarding signaling NaN bit. Up to the Release 6 of Mips architecture, "1" in signaling NaN bit denoted signaling NaN, which is opposite to IEEE 754-2008 standard. From Release 6 on, Mips architecture adopted IEEE standard prescription, and "0" denotes signaling NaN. On top of that, Mips architecture for SIMD (also known as MSA, or vector instructions) also specifies signaling bit in accordance to IEEE standard. MSA unit can be implemented with both pre-Release 6 and Release 6 main processor units. QEMU uses SoftFloat library to implement various floating-point-related instructions on all platforms. The current QEMU implementation allows for defining meaning of signaling NaN bit during build time, and is implemented via preprocessor macro called SNAN_BIT_IS_ONE. On the other hand, the change in this patch enables SoftFloat library to be configured in run-time. This configuration is meant to occur during CPU initialization, at the moment when it is definitely known what desired behavior for particular CPU (or any additional FPUs) is. The change is implemented so that it is consistent with existing implementation of similar cases. This means that structure float_status is used for passing the information about desired signaling NaN bit on each invocation of SoftFloat functions. The additional field in float_status is called snan_bit_is_one, which supersedes macro SNAN_BIT_IS_ONE. IMPORTANT: This change is not meant to create any change in emulator behavior or functionality on any platform. It just provides the means for SoftFloat library to be used in a more flexible way - in other words, it will just prepare SoftFloat library for usage related to Mips platform and its specifics regarding signaling bit meaning, which is done in some of subsequent patches from this series. Further break down of changes: 1) Added field snan_bit_is_one to the structure float_status, and correspondent setter function set_snan_bit_is_one(). 2) Constants <float16\|float32\|float64\|floatx80\|float128>_default_nan (used both internally and externally) converted to functions <float16\|float32\|float64\|floatx80\|float128>_default_nan(float_status). This is necessary since they are dependent on signaling bit meaning. At the same time, for the sake of code cleanup and simplicity, constants <floatx80\|float128>_default_nan_<low\|high> (used only internally within SoftFloat library) are removed, as not needed. 3) Added a float_status argument to SoftFloat library functions XXX_is_quiet_nan(XXX a_), XXX_is_signaling_nan(XXX a_), XXX_maybe_silence_nan(XXX a_). This argument must be present in order to enable correct invocation of new version of functions XXX_default_nan(). (XXX is <float16\|float32\|float64\|floatx80\|float128> here) 4) Updated code for all platforms to reflect changes in SoftFloat library. This change is twofolds: it includes modifications of SoftFloat library functions invocations, and an addition of invocation of function set_snan_bit_is_one() during CPU initialization, with arguments that are appropriate for each particular platform. It was established that all platforms zero their main CPU data structures, so snan_bit_is_one(0) in appropriate places is not added, as it is not needed. [1] "IEEE Standard for Floating-Point Arithmetic", IEEE Computer Society, August 29, 2008. Backports commit af39bc8c49224771ec0d38f1b693ea78e221d7bc from qemu	2018-02-24 20:27:12 -05:00
Alexey Kardashevskiy	096ca207af	memory: Add reporting of supported page sizes Every IOMMU has some granularity which MemoryRegionIOMMUOps::translate uses when translating, however this information is not available outside the translate context for various checks. This adds a get_min_page_size callback to MemoryRegionIOMMUOps and a wrapper for it so IOMMU users (such as VFIO) can know the minimum actual page size supported by an IOMMU. As IOMMU MR represents a guest IOMMU, this uses TARGET_PAGE_SIZE as fallback. This removes vfio_container_granularity() and uses new helper in memory_region_iommu_replay() when replaying IOMMU mappings on added IOMMU memory region. Backports the relevant parts of commit f682e9c244af7166225f4a50cc18ff296bb9d43e from qemu	2018-02-24 19:23:28 -05:00
Lluís Vilanova	2297527755	exec: [tcg] Track which vCPU is performing translation and execution Information is tracked inside the TCGContext structure, and later used by tracing events with the 'tcg' and 'vcpu' properties. The 'cpu' field is used to check tracing of translation-time events ("_trans"). The 'tcg_env' field is used to pass it to execution-time events ("_exec"). Backports commit 7c2550432abe62f53e6df878ceba6ceaf71f0e7e from qemu	2018-02-24 19:21:39 -05:00
Eduardo Habkost	0f6513ef62	error: Remove unnecessary local_err variables This patch simplifies code that uses a local_err variable just to immediately use it for an error_propagate() call. Coccinelle patch used to perform the changes added to scripts/coccinelle/remove_local_err.cocci. Backports commit 6b62d961373e0327f2af8fb77d6d5d6308864180 from qemu	2018-02-24 19:12:25 -05:00
Peter Maydell	5ae787f895	target-arm: Provide hook to tell GICv3 about changes of security state The GICv3 CPU interface needs to know when the CPU it is attached to makes an exception level or mode transition that changes the security state, because whether it is asserting IRQ or FIQ can change depending on these things. Provide a mechanism for letting the GICv3 device register a hook to be called on such changes. Backports commit bd7d00fc50c9960876dd194ebf0c88889b53e765 from qemu	2018-02-24 19:09:22 -05:00
Peter Maydell	eec3a5f843	target-arm: Define new arm_is_el3_or_mon() function The GICv3 system registers need to know if the CPU is AArch64 in EL3 or AArch32 in Monitor mode. This happens to be the first part of the check for arm_is_secure(), so factor it out into a new arm_is_el3_or_mon() function that the GIC can also use. Backports commit 712058764da29b2908f6fbf56760ca4f15980709 from qemu	2018-02-24 19:04:27 -05:00
Peter Maydell	f893dacef0	bitops.h: Implement half-shuffle and half-unshuffle ops A half-shuffle operation takes a word with zeros in the high half: 0000 0000 0000 0000 ABCD EFGH IJKL MNOP and spreads the bits out so they are in every other bit of the word: 0A0B 0C0D 0E0F 0G0H 0I0J 0K0L 0M0N 0O0P A half-unshuffle performs the reverse operation. Provide functions in bitops.h which implement these operations for 32-bit and 64-bit inputs, and add tests for them. Backports commit b355438de52d0782983bf4bdc47936189a0c988b from qemu	2018-02-24 19:02:36 -05:00
Bharata B Rao	851dec945d	qom: API to get instance_size of a type Add an API object_type_get_size(const char *typename) that returns the instance_size of the give typename. Backports commit 3f97b53a682d2595747c926c00d78b9d406f1be0 from qemu	2018-02-24 19:00:16 -05:00
Thomas Huth	aee5c93f58	configure: Enable -Werror for MinGW builds, too MinGW seems to compile currently without warnings, so it should be safe to enable -Werror now for this environment, too. Backports commit e4650c81b3d15ba67236815defbb475c4bdf8690 from qemu	2018-02-24 18:56:05 -05:00
Eduardo Habkost	b918dd95f3	target-i386: Consolidate calls of object_property_parse() in x86_cpu_parse_featurestr Backports commit f6750e959a397dea988efd4e488e1ff813011065 from qemu	2018-02-24 18:53:55 -05:00
Igor Mammedov	800b28483b	target-i386: Move features logic that requires CPUState to realize time Making x86_cpu_parse_featurestr() a pure convertor of legacy feature string into global properties, needs it to be called before a CPU instance is created so parser shouldn't modify CPUState directly or access it at all. Hence move current hack that directly pokes into CPUState, to set/unset +-feats, from parser to CPU's realize method. Backports commit dc15c0517b010a9444a2c05794dae980f2a2cbd9 from qemu	2018-02-24 18:47:46 -05:00
Eduardo Habkost	b9ca5c4d33	target-i386: Remove xlevel & hv-spinlocks option fixups The "fixup will be removed in future versions" warnings are present since QEMU 1.7.0, at least, so users should have fixed their scripts and configurations, already. In the case of libvirt users, libvirt doesn't use the "xlevel" option, and already rejects HyperV spinlock retry count < 0xFFF. Backports commit c19b85216b5d47d922ac010931d4c7b2d79b2f68 from qemu	2018-02-24 18:33:32 -05:00
Radim Krčmář	610a52e9c7	target-i386: Implement CPUID[0xB] (Extended Topology Enumeration) I looked at a dozen Intel CPU that have this CPUID and all of them always had Core offset as 1 (a wasted bit when hyperthreading is disabled) and Package offset at least 4 (wasted bits at <= 4 cores). QEMU uses more compact IDs and it doesn't make much sense to change it now. I keep the SMT and Core sub-leaves even if there is just one thread/core; it makes the code simpler and there should be no harm. Backports commit 5232d00a041c8f3628b3532ef35d703a1f0dac19 from qemu	2018-02-24 18:31:14 -05:00
Eduardo Habkost	8991e8bf0b	target-i386: add Skylake-Client cpu model Introduce Skylake-Client cpu mode which inherits the features from Broadwell and supports some additional features that are: MPX, XSAVEC, and XGETBV1. Backports commit f6f949e9295889fb272698aea763dcea77d616ce from qemu	2018-02-24 18:25:50 -05:00
Peter Maydell	9bdf310d49	target-arm: Don't permit ARMv8-only Neon insns on ARMv7 The Neon instructions VCVTA, VCVTM, VCVTN, VCVTP, VRINTA, VRINTM, VRINTN, VRINTP, VRINTX, and VRINTZ were only introduced with ARMv8, so they need a guard to make them UNDEF if the CPU only supports ARMv7. (We got this right for all the other new-in-v8 insns, but forgot it for these Neon 2-reg-misc ops.) Backports commit fe8fcf3d642b4de1369841bf6acac13e0ec8770d from qemu	2018-02-24 18:20:00 -05:00
Peter Maydell	a9fb399490	target-arm: Fix reset and migration of TTBCR(S) Commit 6459b94c26dd666badb3 broke reset and migration of the AArch32 TTBCR(S) register if the guest used non-LPAE page tables. This is because the AArch32 TTBCR register definition is marked as ARM_CP_ALIAS, meaning that the AArch64 variant has to handle migration and reset. Although AArch64 TCR_EL3 doesn't need to care about the mask and base_mask fields, AArch32 may do so, and so we must use the special TTBCR reset and raw write functions to ensure they are set correctly. This doesn't affect TCR_EL2, because the AArch32 equivalent of that is HTCR, which never uses the non-LPAE page table variant. Backports commit 811595a2d4ab8c6354857a50ffd29fafce52a892 from qemu	2018-02-24 18:18:24 -05:00
Shannon Zhao	51c9e12605	target-arm: kvm64: set guest PMUv3 feature bit if supported Check if kvm supports guest PMUv3. If so, set the corresponding feature bit for vcpu. Backports commit 5c0a3819f009639f67ce0453dff6ec7211bfee54 from qemu	2018-02-24 18:17:11 -05:00
Emilio G. Cota	ae3e22a689	tb hash: hash phys_pc, pc, and flags with xxhash For some workloads such as arm bootup, tb_phys_hash is performance-critical. The is due to the high frequency of accesses to the hash table, originated by (frequent) TLB flushes that wipe out the cpu-private tb_jmp_cache's. More info: https://lists.nongnu.org/archive/html/qemu-devel/2016-03/msg05098.html To dig further into this I modified an arm image booting debian jessie to immediately shut down after boot. Analysis revealed that quite a bit of time is unnecessarily spent in tb_phys_hash: the cause is poor hashing that results in very uneven loading of chains in the hash table's buckets; the longest observed chain had ~550 elements. The appended addresses this with two changes: 1) Use xxhash as the hash table's hash function. xxhash is a fast, high-quality hashing function. 2) Feed the hashing function with not just tb_phys, but also pc and flags. This improves performance over using just tb_phys for hashing, since that resulted in some hash buckets having many TB's, while others getting very few; with these changes, the longest observed chain on a single hash bucket is brought down from ~550 to ~40. Tests show that the other element checked for in tb_find_physical, cs_base, is always a match when tb_phys+pc+flags are a match, so hashing cs_base is wasteful. It could be that this is an ARM-only thing, though. UPDATE: On Tue, Apr 05, 2016 at 08:41:43 -0700, Richard Henderson wrote: > The cs_base field is only used by i386 (in 16-bit modes), and sparc (for a TB > consisting of only a delay slot). > It may well still turn out to be reasonable to ignore cs_base for hashing. BTW, after this change the hash table should not be called "tb_hash_phys" anymore; this is addressed later in this series. This change gives consistent bootup time improvements. I tested two host machines: - Intel Xeon E5-2690: 11.6% less time - Intel i7-4790K: 19.2% less time Increasing the number of hash buckets yields further improvements. However, using a larger, fixed number of buckets can degrade performance for other workloads that do not translate as many blocks (600K+ for debian-jessie arm bootup). This is dealt with later in this series. Backports commit 42bd32287f3a18d823f2258b813824a39ed7c6d9 from qemu	2018-02-24 18:00:14 -05:00
Emilio G. Cota	9ef9de9cf8	exec: add tb_hash_func5, derived from xxhash This will be used by upcoming changes for hashing the tb hash. Add this into a separate file to include the copyright notice from xxhash. Backports commit dc8b295d05ec35a8c032f9abca421772347ba5d4 from qemu	2018-02-24 17:36:35 -05:00
Emilio G. Cota	8518f55df7	compiler.h: add QEMU_ALIGNED() to enforce struct alignment Backports commit 911a4d2215b05267b16925503218f49d607c6b29 from qemu	2018-02-24 17:32:43 -05:00
Peter Maydell	48539e54da	target-i386: Move user-mode exception actions out of user-exec.c The exception_action() function in user-exec.c is just a call to cpu_loop_exit() for every target CPU except i386. Since this function is only called if the target's handle_mmu_fault() hook has indicated an MMU fault, and that hook is only called from the handle_cpu_signal() code path, we can simply move the x86-specific setup into that hook, which allows us to remove the TARGET_I386 ifdef from user-exec.c. Of the actions that were done by the call to raise_interrupt_err(): * cpu_svm_check_intercept_param() is a no-op in user mode * check_exception() is a no-op since double faults are impossible for user-mode * assignments to cs->exception_index and env->error_code are no-ops * assigning to env->exception_next_eip is unnecessary because it is not used unless env->exception_is_int is true * cpu_loop_exit_restore() is equivalent to cpu_loop_exit() since pc is 0 which leaves just setting env_>exception_is_int as the action that needs to be added to x86_cpu_handle_mmu_fault(). Backports commit 0c33682d5f29b0a4ae53bdec4c8e52e4fae37b34 from qemu	2018-02-24 17:27:08 -05:00
Peter Maydell	fa2679ba96	target-i386: Add comment about do_interrupt_user() next_eip argument Add a comment to do_interrupt_user() along the same lines as the existing one for do_interrupt_all() noting that the next_eip argument is not used unless is_int is true or intno is EXCP_SYSCALL. Backports commit 33271823323483b4ede1ae99de83d33b25875402 from qemu	2018-02-24 17:26:18 -05:00
Peter Maydell	d7dccff836	cpu-exec: Rename cpu_resume_from_signal() to cpu_loop_exit_noexc() The function cpu_resume_from_signal() is now always called with a NULL puc argument, and is rather misnamed since it is never called from a signal handler. It is essentially forcing an exit to the top level cpu loop but without raising any exception, so rename it to cpu_loop_exit_noexc() and drop the useless unused argument. Backports commit 6886b98036a8f8f5bce8b10756ce080084cef11b from qemu	2018-02-24 17:25:28 -05:00
Peter Maydell	b2013255aa	user-exec: Push resume-from-signal code out to handle_cpu_signal() Since the only caller of page_unprotect() which might cause it to need to call cpu_resume_from_signal() is handle_cpu_signal() in the user-mode code, push the longjump handling out to that function. Since this is the only caller of cpu_resume_from_signal() which passes a non-NULL puc argument, split the non-NULL handling into a new cpu_exit_tb_from_sighandler() function. This allows us to merge the softmmu and usermode implementations of the cpu_resume_from_signal() function, which are now identical. Backports commit f213e72f2356b77768b9cb73814a3b26ad5a0099 from qemu	2018-02-24 17:21:06 -05:00
Peter Maydell	37b7538d85	translate-all.c: Don't pass puc, locked to tb_invalidate_phys_page() The user-mode-only function tb_invalidate_phys_page() is only called from two places: * page_unprotect(), which passes in a non-zero pc, a puc pointer and the value 'true' for the locked argument * page_set_flags(), which passes in a zero pc, a NULL puc pointer and a 'false' locked argument If the pc is non-zero then we may call cpu_resume_from_signal(), which does a longjmp out of the calling code (and out of the signal handler); this is to cover the case of a target CPU with "precise self-modifying code" (currently only x86) executing a store instruction which modifies code in the same TB as the store itself. Rather than doing the longjump directly here, return a flag to the caller which indicates whether the current TB was modified, and move the longjump to page_unprotect. Backports commit 75809229bbf28b371afce14921ff5be98ddc5faa from qemu	2018-02-24 17:11:30 -05:00
Paolo Bonzini	1db22b5889	Makefile: add dependency on scripts/create_config Make sure that config-host.h and config-target.h are rebuilt whenever there is a change in the scripts that generates them; add the dependency to the pattern rule as suggested by Peter. Backports commit 553350156d80c18d0127c742f47b7adbd642f3ef from qemu	2018-02-24 17:05:03 -05:00
Fam Zheng	c17a3070ea	Makefile: Add a FORCE target Backports commit d41d4da3c5d702b505d74265900a13fae2c8d0e0 from qemu	2018-02-24 17:03:51 -05:00
Peter Maydell	8d0faac1dc	qemu-common.h: Drop WORDS_ALIGNED define The WORDS_ALIGNED #define is not used anywhere, and hasn't been since 2013 when commit 612d590ebc6cef rewrote the various ld<type>_<endian>_p functions to not use it. Remove the #define and the comment describing it. Also remove the line in the comment about TARGET_WORDS_ALIGNED, since it has never actually existed. Backports commit 0d5c21f2b3bf1e0b562a2c74e353d2e03f2f50ef from qemu	2018-02-24 17:01:55 -05:00
Stefan Weil	4470900f3b	configure: Use instead of deprecated This fixes these warnings from shellcheck: ^-- SC2006: Use $(..) instead of deprecated `..` Backports commit 89138857619b2a023c32200e9af780792ccaa8c3 from qemu	2018-02-24 16:59:40 -05:00
Sergey Sorokin	c05902eddd	target-arm: Fix TTBR selecting logic on AArch32 Stage 2 translation Address size is 40-bit for the AArch32 stage 2 translation, and t0sz can be negative (from -8 to 7), so we need to adjust it to use the existing TTBR selecting logic. Backports commit 6e99f762612827afeff54add2e4fc2c3b2657fed from qemu	2018-02-24 16:54:32 -05:00
Peter Maydell	806d72035e	target-arm: Don't try to set ESR IL bit in arm_cpu_do_interrupt_aarch64() Remove some incorrect code from arm_cpu_do_interrupt_aarch64() which attempts to set the IL bit in the syndrome register based on the value of env->thumb. This is wrong in several ways: * IL doesn't indicate Thumb-vs-ARM, it indicates instruction length (which may be 16 or 32 for Thumb and is always 32 for ARM) * not every syndrome format uses IL like this -- for some IL is always set, and for some it is always clear * the code is changing esr_el[new_el] even for interrupt entry, which is not supposed to modify ESR_ELx at all Delete the code, and instead rely on the syndrome value in env->exception.syndrome having already been set up with the correct value of IL. Backports commit 78f1edb19fe11fa0c5d0bf484db59a384f455d3c from qemu	2018-02-24 16:49:53 -05:00
Peter Maydell	dc8bf22d88	target-arm: Set IL bit in syndromes for insn abort, watchpoint, swstep For some exception syndrome types, the IL bit should always be set. This includes the instruction abort, watchpoint and software step syndrome types; add the missing ARM_EL_IL bit to the syndrome values returned by syn_insn_abort(), syn_swstep() and syn_watchpoint(). Backports commit 04ce861ea545477425ad9e045eec3f61c8a27df9 from qemu	2018-02-24 16:48:59 -05:00
Edgar E. Iglesias	8aee797956	target-arm: A64: Create Instruction Syndromes for Data Aborts Add support for generating the ISS (Instruction Specific Syndrome) for Data Abort exceptions taken from AArch64. These syndromes are used by hypervisors for example to trap and emulate memory accesses. We save the decoded data out-of-band with the TBs at translation time. When exceptions hit, the extra data attached to the TB is used to recreate the state needed to encode instruction syndromes. This avoids the need to emit moves with every load/store. Based on a suggestion from Peter Maydell. Backports commit aaa1f954d4cab243e3d5337a72bc6d104e1c4808 from qemu	2018-02-24 16:46:44 -05:00
Alistair Francis	25daa5363e	target-arm: Add the HSTR_EL2 register Add the Hypervisor System Trap Register for EL2. This register is used early in the Linux boot and without it the kernel aborts with a "Synchronous Abort" error. Backports commit 2a5a9abd4bc45e2f4c62c77e07aebe53608c6915 from qemu	2018-02-24 16:24:57 -05:00
Fam Zheng	495c39300c	rules.mak: Add COMMA constant Using "," literal in $(call quiet-command, ...) arguments is awkward. Add this constant to make it at least doable. Backports commit 2f4e4dc237261c76734d8ae1d8e09d2983d2f1ca from qemu	2018-02-24 16:20:31 -05:00
Paolo Bonzini	8df5ad80b1	exec: hide mr->ram_addr from qemu_get_ram_ptr users Let users of qemu_get_ram_ptr and qemu_ram_ptr_length pass in an address that is relative to the MemoryRegion. This basically means what address_space_translate returns. Because the semantics of the second parameter change, rename the function to qemu_map_ram_ptr. Backports commit 0878d0e11ba8013dd759c6921cbf05ba6a41bd71 from qemu	2018-02-24 16:17:49 -05:00
Paolo Bonzini	b2e1b34bcc	memory: split memory_region_from_host from qemu_ram_addr_from_host Move the old qemu_ram_addr_from_host to memory_region_from_host and make it return an offset within the region. For qemu_ram_addr_from_host return the ram_addr_t directly, similar to what it was before commit 1b5ec23 ("memory: return MemoryRegion from qemu_ram_addr_from_host", 2013-07-04). Backports commit 07bdaa4196b51bc7ffa7c3f74e9e4a9dc8a7966a from qemu	2018-02-24 16:06:49 -05:00
Paolo Bonzini	918c626847	exec: remove ram_addr argument from qemu_ram_block_from_host Of the two callers, one does not use it, and the other can compute it itself based on the other output argument (offset) and the RAMBlock. Backports commit f615f39616c4fd1a3a3b078af8d75bb4be6390de from qemu	2018-02-24 03:37:40 -05:00
Paolo Bonzini	f26f1f123c	memory: remove qemu_get_ram_fd, qemu_set_ram_fd, qemu_ram_block_host_ptr Remove direct uses of ram_addr_t and optimize memory_region_{get,set}_fd now that a MemoryRegion knows its RAMBlock directly. Backports commit 4ff87573df3606856a92c14eef3393a63d736d11 from qemu	2018-02-24 03:34:44 -05:00
Emilio G. Cota	ab569f5cde	atomics: do not emit consume barrier for atomic_rcu_read Currently we emit a consume-load in atomic_rcu_read. Because of limitations in current compilers, this is overkill for non-Alpha hosts and it is only useful to make Thread Sanitizer work. This patch leaves the consume-load in atomic_rcu_read when compiling with Thread Sanitizer enabled, and resorts to a relaxed load + smp_read_barrier_depends otherwise. On an RMO host architecture, such as aarch64, the performance improvement of this change is easily measurable. For instance, qht-bench performs an atomic_rcu_read on every lookup. Performance before and after applying this patch: $ tests/qht-bench -d 5 -n 1 Before: 9.78 MT/s After: 10.96 MT/s Backports commit 15487aa132109891482f79d78a30d6cfd465a391 from qemu	2018-02-24 03:28:11 -05:00
Emilio G. Cota	87ef2a2c5f	atomics: emit an smp_read_barrier_depends() barrier only for Alpha and Thread Sanitizer For correctness, smp_read_barrier_depends() is only required to emit a barrier on Alpha hosts. However, we are currently emitting a consume fence unconditionally, and most compilers currently treat consume and acquire fences as equivalent. Fix it by keeping the consume fence if we're compiling with Thread Sanitizer, since this might help prevent false warnings. Otherwise, only emit the barrier for Alpha hosts. Note that we still guarantee that smp_read_barrier_depends() is a compiler barrier. Backports commit c983895258a771f8a5e4a53950bfb7fd2216651c from qemu	2018-02-24 03:26:52 -05:00
Sergey Fedorov	3a9c5e7509	cpu-exec: Fix direct jump to TB spanning page It is not safe to make a direct jump to a TB spanning two pages in system emulation because the mapping for the second page can get changed but we don't take care of direct jumps in this case. However in user mode emulation, this is not the case because there's only static address translation and TBs are always invalidated properly. Backports commit c88c67e58b61618a904d2333ceebefc3c852d32e from qemu	2018-02-24 03:24:53 -05:00
Eduardo Habkost	9c04a28bd2	target-i386: Move TCG initialization to realize time QOM instance_init functions are not supposed to have any side-effects, as new objects may be created at any moment for querying property information (see qmp_device_list_properties()). Move TCG initialization to realize time so it won't be called when just doing object_new() on a X86CPU subclass. Backports commit 57f2453ab48a771b30aeced01b329ee85853bb7b from qemu	2018-02-24 03:23:09 -05:00
Eduardo Habkost	fee2c27f2b	cpu: Eliminate cpudef_init(), cpudef_setup() x86_cpudef_init() doesn't do anything anymore, cpudef_init(), cpudef_setup(), and x86_cpudef_init() can be finally removed. Backports commit 3e2c0e062f0963a6b73b0cd1990fad79495463d9 from qemu	2018-02-24 03:20:46 -05:00
Eduardo Habkost	956e20ea6b	target-i386: Set constant model_id for qemu64/qemu32/athlon Newer PC machines don't set hw_version, and older machines set model-id on compat_props explicitly, so we don't need the x86_cpudef_setup() code that sets model_id using qemu_hw_version() anymore. Backports commit 9cf2cc3d8237732946720d78bf9aec0064026ed8 from qemu	2018-02-24 03:18:11 -05:00
Eduardo Habkost	aa3d46ef83	osdep: Move default qemu_hw_version() value to a macro The macro will be used by code that will stop calling qemu_hw_version() at runtime and just need a constant value. Backports commit d494352c2f7818aeba184a8ef757569083740bb2 from qemu	2018-02-24 03:16:34 -05:00
Eduardo Habkost	923dcf1cb8	target-i386: Use xsave structs for ext_save_area This doesn't introduce any change in the code, as the offsets and struct sizes match what was present in the table. This can be validated by the QEMU_BUILD_BUG_ON lines on target-i386/cpu.h, which ensures the struct sizes and offsets match the existing values in ext_save_area. Backports commit ee1b09f695dcd8532f470e53297473bd3bc88718 from qemu	2018-02-24 03:13:16 -05:00
Lioncash	05963470a2	target-i386: Include log.h in smm_helper Fixes a compilation error	2018-02-24 03:06:07 -05:00
Eduardo Habkost	128f7c078a	target-i386: Define structs for layout of xsave area Add structs that define the layout of the xsave areas used by Intel processors. Add some QEMU_BUILD_BUG_ON lines to ensure the structs match the XSAVE_* macros in target-i386/kvm.c and the offsets and sizes at target-i386/cpu.c:ext_save_areas. Backports commit b503717d28e8f7eff39bf38624e6cf42687d951a from qemu	2018-02-24 03:04:31 -05:00
Paolo Bonzini	77305ce4ee	memory: remove unnecessary masking of MemoryRegion ram_addr mr->ram_block->offset is already aligned to both host and target size (see qemu_ram_alloc_internal). Remove further masking as it is unnecessary. Backports commit e4e697940dff612b789b0858270c20a8b680f78d from qemu	2018-02-24 03:01:34 -05:00
Fam Zheng	74962feee1	memory: Drop FlatRange.romd_mode Its value is alway set to mr->romd_mode, so the removed comparisons are fully superseded by "a->mr == b->mr". Backports commit 5b5660adf1fdb61db14ec681b10463b8cba633f1 from qemu	2018-02-24 02:57:29 -05:00
Fam Zheng	fb8135cd0d	memory: Remove code for mr->may_overlap The collision check does nothing and hasn't been used. Remove the variable together with related code. Backports commit b61359781958759317ee6fd1a45b59be0b7dbbe1 from qemu	2018-02-24 02:55:25 -05:00
Gonglei	feff56cc11	memory: drop find_ram_block() On the one hand, we have already qemu_get_ram_block() whose function is similar. On the other hand, we can directly use mr->ram_block but searching RAMblock by ram_addr which is a kind of waste. Backports commit fa53a0e53efdc7002497ea4a76aacf6cceb170ef from qemu	2018-02-24 02:52:20 -05:00
Paolo Bonzini	9bb67a3f58	hw: clean up hw/hw.h includes Include qom/object.h and exec/memory.h instead of exec/ioport.h; exec/ioport.h was almost everywhere required only for those two includes, not for the content of the header itself. Remove block/aio.h, everybody is already including it through another path. With this change, include/hw/hw.h is freed from qemu-common.h. Backports commit df43d49cb8708b9c88a20afe0d1a3089b550a5b8 from qemu	2018-02-24 02:46:41 -05:00
Paolo Bonzini	d0d3712417	hw: remove pio_addr_t pio_addr_t is almost unused, because these days I/O ports are simply accessed through the address space. cpu_{in,out}[bwl] themselves are almost unused; monitor.c and xen-hvm.c could use address_space_read/write directly, since they have an integer size at hand. This leaves qtest as the only user of those functions. On the other hand even portio_* functions use this type; the only interesting use of pio_addr_t thus is include/hw/sysbus.h. I guess I could move it there, but I don't see much benefit in that either. Using uint32_t is enough and avoids the need to include ioport.h everywhere. Backports commit 89a80e7400f7225d9401b35ef32454b4ab29dc67 from qemu	2018-02-24 02:43:16 -05:00
Paolo Bonzini	9485b7c2e1	cpu: move exec-all.h inclusion out of cpu.h exec-all.h contains TCG-specific definitions. It is not needed outside TCG-specific files such as translate.c, exec.c or *helper.c. One generic function had snuck into include/exec/exec-all.h; move it to include/qom/cpu.h. Backports commit 63c915526d6a54a95919ebece83fa9ca631b2508 from qemu	2018-02-24 02:39:08 -05:00
Paolo Bonzini	58693409ea	exec: extract exec/tb-context.h TCG backends do not need most of exec-all.h; extract what they actually need to a separate file or move it directly to tcg.h. The next patch will stop including exec-all.h from everywhere. Backports commit 00f6da6a1a5d1ce085334eccbb50ec899ceed513 from qemu	2018-02-24 02:09:58 -05:00
Paolo Bonzini	f9b9d0ba0f	hw: explicitly include qemu/log.h Move the inclusion out of hw/hw.h, most files do not need it. Backports commit 03dd024ff57733a55cd2e455f361d053c81b1b29 from qemu	2018-02-24 02:00:45 -05:00
Paolo Bonzini	adf97a4d59	mips: move CP0 functions out of cpu.h These are here for historical reasons: they are needed from both gdbstub.c and op_helper.c, and the latter was compiled with fixed AREG0. It is not needed anymore, so uninline them. Backports commit e6623d88f44aae9e9c78276c0cb7bd352283d50a from qemu	2018-02-24 01:57:30 -05:00
Paolo Bonzini	058624b9e4	arm: move arm_log_exception into .c file Avoid need for qemu/log.h inclusion, and make the function static too. Backports commit 27a7ea8a1f351578ce869b41ba1ba662c063fd62 from qemu	2018-02-24 01:52:55 -05:00
Paolo Bonzini	37f26922dd	qemu-common: push cpu.h inclusion out of qemu-common.h Backports commit 33c11879fd422b759483ed25fef133ea900ea8d7 from qemu	2018-02-24 01:50:56 -05:00
Paolo Bonzini	e84da64a2b	qemu-common: stop including qemu/bswap.h from qemu-common.h Move it to the actual users. There are still a few includes of qemu/bswap.h in headers; removing them is left for future work. Backports commit 58369e22cf971448411bfbc8c894b2addebe2111 from qemu	2018-02-24 01:06:03 -05:00
Paolo Bonzini	78fd1aab94	cpu: move endian-dependent load/store functions to cpu-all.h Disentangle cpu-common.h and memory.h from NEED_CPU_H. Prototypes are not defined for !NEED_CPU_H, so remove them from poison.h too. Only macros need poisoning. Backports commit a7d6039cb35592683ecc56d2b37817da2d2f8b00 from qemu	2018-02-24 01:04:26 -05:00
Paolo Bonzini	9c0e31ed3a	target-sparc: make cpu-qom.h not target specific Make SPARCCPU an opaque type within cpu-qom.h, and move all definitions of private methods, as well as all type definitions that require knowledge of the layout to cpu.h. This helps making files independent of NEED_CPU_H if they only need to pass around CPU pointers. Backports commit d61d1b20610e4655d7846e4cb43d22188e935f5f from qemu	2018-02-24 01:00:56 -05:00
Paolo Bonzini	01bd1c1a73	target-mips: make cpu-qom.h not target specific Make MIPSCPU an opaque type within cpu-qom.h, and move all definitions of private methods, as well as all type definitions that require knowledge of the layout to cpu.h. This helps making files independent of NEED_CPU_H if they only need to pass around CPU pointers. Backports commit 416bf936864f16caad6993b9ebd452fb34f801bd from qemu	2018-02-24 00:59:03 -05:00
Paolo Bonzini	27ebc27beb	target-m68k: make cpu-qom.h not target specific Make M68KCPU an opaque type within cpu-qom.h, and move all definitions of private methods, as well as all type definitions that require knowledge of the layout to cpu.h. This helps making files independent of NEED_CPU_H if they only need to pass around CPU pointers. Backports commit a836b8fa00fa1032ccd234a71b33943627d211ea from qemu	2018-02-24 00:56:58 -05:00
Paolo Bonzini	2f4ae94b5c	target-i386: make cpu-qom.h not target specific Make X86CPU an opaque type within cpu-qom.h, and move all definitions of private methods, as well as all type definitions that require knowledge of the layout to cpu.h. This helps making files independent of NEED_CPU_H if they only need to pass around CPU pointers. Backports commit 4da6f8d954429c0cd1471d25cb9dbe909607374e from qemu	2018-02-24 00:55:22 -05:00
Lioncash	791413630e	target-arm: make cpu-qom.h not target specific Make ARMCPU an opaque type within cpu-qom.h, and move all definitions of private methods, as well as all type definitions that require knowledge of the layout to cpu.h. This helps making files independent of NEED_CPU_H if they only need to pass around CPU pointers. Backports commit 74e755647c1598a6845df1ee4f8b96d01afd96e7 from qemu	2018-02-24 00:48:59 -05:00
Paolo Bonzini	fee6dcb22a	include: move CPU-related definitions out of qemu-common.h Backports commit 4b4629d9d26fd0e100d9be526367a96aa35b541d from qemu	2018-02-24 00:33:49 -05:00
Wei Jiangang	7cf135457a	accel: make configure_accelerator return void Return the negated value of accel_initialised is meaningless, and the caller vl doesn't check it. Backports commit bdc3f61dec2f9c227235bb5f677a0272e1184c82 from qemu	2018-02-24 00:31:28 -05:00
Sergey Fedorov	eab60b7c77	cpu-exec: Clean up 'interrupt_request' reloading in cpu_handle_interrupt() Backports commit 8b1fe3f439eaa2f0a6ee7737942bb6c405725867 from qemu	2018-02-24 00:27:05 -05:00
Sergey Fedorov	b4b7b88f69	cpu-exec: Remove unused 'x86_cpu' and 'env' from cpu_exec() Backports commit ba048a4ae15ba0f70c6dcb12ee05db120408de78 from qemu	2018-02-24 00:16:40 -05:00
Sergey Fedorov	aefb8935a9	cpu-exec: Move TB execution stuff out of cpu_exec() Simplify cpu_exec() by extracting TB execution code outside of cpu_exec() into a new static inline function cpu_loop_exec_tb(). Backports commit 928de9ee14b0b63ee9f9275732ed3e1c8b5f4790 from qemu	2018-02-24 00:15:24 -05:00
Sergey Fedorov	d4ef96abf2	cpu-exec: Move interrupt handling out of cpu_exec() Simplify cpu_exec() by extracting interrupt handling code outside of cpu_exec() into a new static inline function cpu_handle_interrupt(). Backports commit c385e6e49763c6dd5dbbd90fadde95d986f8bd38 from qemu	2018-02-24 00:09:06 -05:00
Sergey Fedorov	c1b52a4387	cpu-exec: Move exception handling out of cpu_exec() Simplify cpu_exec() by extracting exception handling code out of cpu_exec() into a new static inline function cpu_handle_exception(). Also make cpu_handle_debug_exception() inline as it is used only once. Backports commit ea284766ec6b9f1712369249566b4c372f3cec8b from qemu	2018-02-24 00:03:37 -05:00
Sergey Fedorov	fc3d135dac	cpu-exec: Move halt handling out of cpu_exec() Simplify cpu_exec() by extracting CPU halt state handling code out of cpu_exec() into a new static inline function cpu_handle_halt(). Backports commit 8b2d34e997371c9729a0f41e3cc624d4300bbe78 from qemu	2018-02-23 23:53:20 -05:00
Lioncash	88d00a75ca	cpu-exec: move cpu_exec to the bottom of the file Remove forward declarations	2018-02-23 23:50:28 -05:00
Sergey Fedorov	0088ca994f	cpu-exec: Remove relic orphaned comment This comment should have been deleted by commit 0ac087f1f3ae ("removed unused code") but somehow it is still here. There's no point to keep it. Backports commit c6f0d9f84c43ae973270df1a77482466558ee487 from qemu	2018-02-23 23:47:05 -05:00
Sergey Fedorov	1a768018c2	tcg: Remove needless CPUState::current_tb This field was used for telling cpu_interrupt() to unlink a chain of TBs being executed when it worked that way. Now, cpu_interrupt() don't do this anymore. So we don't need this field anymore. Backports commit 3213525f8ab48742db09dab18cb9ae6f36a6c921 from qemu	2018-02-23 23:45:42 -05:00
Sergey Fedorov	73c75b4cf7	cpu-exec: Move TB chaining into tb_find_fast() Move tb_add_jump() call and surrounding code from cpu_exec() into tb_find_fast(). That simplifies cpu_exec() a little by hiding the direct chaining optimization details into tb_find_fast(). It also allows to move tb_lock()/tb_unlock() pair into tb_find_fast(), putting it closer to tb_find_slow() which also manipulates the lock. Backports commit a0522c7a55cc8ac76d82884cf8e52f76daa664cc from qemu	2018-02-23 23:38:57 -05:00
Sergey Fedorov	ba9a237586	tcg: Rework tb_invalidated_flag 'tb_invalidated_flag' was meant to catch two events: * some TB has been invalidated by tb_phys_invalidate(); * the whole translation buffer has been flushed by tb_flush(). Then it was checked: * in cpu_exec() to ensure that the last executed TB can be safely linked to directly call the next one; * in cpu_exec_nocache() to decide if the original TB should be provided for further possible invalidation along with the temporarily generated TB. It is always safe to patch an invalidated TB since it is not going to be used anyway. It is also safe to call tb_phys_invalidate() for an already invalidated TB. Thus, setting this flag in tb_phys_invalidate() is simply unnecessary. Moreover, it can prevent from pretty proper linking of TBs, if any arbitrary TB has been invalidated. So just don't touch it in tb_phys_invalidate(). If this flag is only used to catch whether tb_flush() has been called then rename it to 'tb_flushed'. Declare it as 'bool' and stick to using only 'true' and 'false' to set its value. Also, instead of setting it in tb_gen_code(), just after tb_flush() has been called, do it right inside of tb_flush(). In cpu_exec(), this flag is used to track if tb_flush() has been called and have made 'next_tb' (a reference to the last executed TB) invalid for linking it to directly call the next TB. tb_flush() can be called during the CPU execution loop from tb_gen_code(), during TB execution or by another thread while 'tb_lock' is released. Catch for translation buffer flush reliably by resetting this flag once before first TB lookup and each time we find it set before trying to add a direct jump. Don't touch in in tb_find_physical(). Each vCPU has its own execution loop in multithreaded mode and thus should have its own copy of the flag to be able to reset it with its own 'next_tb' and don't affect any other vCPU execution thread. So make this flag per-vCPU and move it to CPUState. In cpu_exec_nocache(), we only need to check if tb_flush() has been called from tb_gen_code() called by cpu_exec_nocache() itself. To do this reliably, preserve the old value of the flag, reset it before calling tb_gen_code(), check afterwards, and combine the saved value back to the flag. This patch is based on the patch "tcg: move tb_invalidated_flag to CPUState" from Paolo Bonzini <pbonzini@redhat.com>. Backports commit 6f789be56d3f38e9214dafcfab3bf9be7191f370 from qemu	2018-02-23 23:34:51 -05:00
Sergey Fedorov	c9700af2bd	tcg: Clean up from 'next_tb' The value returned from tcg_qemu_tb_exec() is the value passed to the corresponding tcg_gen_exit_tb() at translation time of the last TB attempted to execute. It is a little confusing to store it in a variable named 'next_tb'. In fact, it is a combination of 4-byte aligned pointer and additional information in its two least significant bits. Break it down right away into two variables named 'last_tb' and 'tb_exit' which are a pointer to the last TB attempted to execute and the TB exit reason, correspondingly. This simplifies the code and improves its readability. Correct a misleading documentation comment for tcg_qemu_tb_exec() and fix logging in cpu_tb_exec(). Also rename a misleading 'next_tb' in another couple of places. Backports commit 819af24b9c1e95e6576f1cefd32f4d6bf56dfa56 from qemu	2018-02-23 23:29:04 -05:00
Paolo Bonzini	66faf3b5df	tcg: code_bitmap and code_write_count are not used by user-mode emulation Backports commit 6fad459c91e8a1dedbb6681d3f57ede5222a225c from qemu	2018-02-23 23:17:37 -05:00
Sergey Fedorov	ffdc9d6323	tcg: Allow goto_tb to any target PC in user mode In user mode, there's only a static address translation, TBs are always invalidated properly and direct jumps are reset when mapping change. Thus the destination address is always valid for direct jumps and there's no need to restrict it to the pages the TB resides in. Backports commit 90aa39a1cc4837360889f0e033ca25cc82100308 from qemu	2018-02-23 23:12:14 -05:00
Sergey Fedorov	73c59faad5	tcg: Clean up direct block chaining safety checks We don't take care of direct jumps when address mapping changes. Thus we must be sure to generate direct jumps so that they always keep valid even if address mapping changes. Luckily, we can only allow to execute a TB if it was generated from the pages which match with current mapping. Document tcg_gen_goto_tb() declaration and note the reason for destination PC limitations. Some targets with variable length instructions allow TB to straddle a page boundary. However, we make sure that both of TB pages match the current address mapping when looking up TBs. So it is safe to do direct jumps into the both pages. Correct the checks for some of those targets. Given that, we can safely patch a TB which spans two pages. Remove the unnecessary check in cpu_exec() and allow such TBs to be patched. Backports commit 5b053a4a28278bca606eeff7d1c0730df1b047e9 from qemu	2018-02-23 22:26:00 -05:00
Sergey Fedorov	39d262f0d2	tcg: Clean up tb_jmp_unlink() Unify the code of this function with tb_jmp_remove_from_list(). Making these functions similar improves their readability. Also this could be a step towards making this function thread-safe. Backports commit f9c5b66f487a04d3747dc6997b1503f9258df945 from qemu	2018-02-23 21:40:07 -05:00
Lioncash	68272af618	translate-all: Remove unused variable in size_code_gen_buffer Also eliminates the unused parameter	2018-02-23 21:38:34 -05:00
Sergey Fedorov	c530eb06a9	tcg: Extract removing of jumps to TB from tb_phys_invalidate() Move the code for removing jumps to a TB out of tb_phys_invalidate() to a separate static inline function tb_jmp_unlink(). This simplifies tb_phys_invalidate() and improves code structure. Backports commit 89bba496322d4cf996d42cdd4bb0912231656c3d from qemu	2018-02-23 21:36:29 -05:00
Sergey Fedorov	0d2e91518b	tcg: Rename tb_jmp_remove() to tb_remove_from_jmp_list() tb_jmp_remove() was only used to remove the TB from a list of all TBs jumping to the same TB which is n-th jump destination of the given TB. Put a comment briefly describing the function behavior and rename it to better reflect its purpose. Backports commit 133626783aa5a1bf86332fa3e6f7b8efe005f924 from qemu	2018-02-23 21:34:01 -05:00
Sergey Fedorov	d60af028c5	tcg: Clarify thread safety check in tb_add_jump() The check is to make sure that another thread hasn't already done the same while we were outside of tb_lock. Mention this in a comment. Backports commit 9962c478b153a18fe88a6509fe58cd178aff8abc from qemu	2018-02-23 21:32:47 -05:00
Sergey Fedorov	e93f68a755	tcg: Init TB's direct jumps before making it visible Initialize TB's direct jump list data fields and reset the jumps before tb_link_page() puts it into the physical hash table and the physical page list. So TB is completely initialized before it becomes visible. This is pure rearrangement of code to a more suitable place, though it could be a preparation for relaxing the locking scheme in future. Backports commit 901bc3deb43bf37c85e43955905d003be7ae5fa5 from qemu	2018-02-23 21:31:36 -05:00
Sergey Fedorov	87f2bb42d4	tcg: Rearrange tb_link_page() to avoid forward declaration Backports commit e90d96b158665a684ab89b4f002838034b5fafc8 from qemu	2018-02-23 21:28:20 -05:00
Sergey Fedorov	fbc0a1105f	tcg: Use uintptr_t type for jmp_list_{next\|first} fields of TB These fields do not contain pure pointers to a TranslationBlock structure. So uintptr_t is the most appropriate type for them. Also put some asserts to assure that the two least significant bits of the pointer are always zero before assigning it to jmp_list_first. Backports commit c37e6d7e3589ecb96914faa21025ad7ba6654aea from qemu	2018-02-23 21:28:19 -05:00
Sergey Fedorov	e60c24cecf	tcg: Clean up direct block chaining data fields Briefly describe in a comment how direct block chaining is done. It should help in understanding of the following data fields. Rename some fields in TranslationBlock and TCGContext structures to better reflect their purpose (dropping excessive 'tb_' prefix in TranslationBlock but keeping it in TCGContext): tb_next_offset => jmp_reset_offset tb_jmp_offset => jmp_insn_offset tb_next => jmp_target_addr jmp_next => jmp_list_next jmp_first => jmp_list_first Avoid using a magic constant as an invalid offset which is used to indicate that there's no n-th jump generated. Backports commit f309101c26b59641fc1aa8fb2a98a5441cdaea03 from qemu	2018-02-23 21:28:19 -05:00
Richard Henderson	bb0b055a99	translate-all: Adjust 256mb testing for mips64 Make sure we preserve the high 32-bits when masking for mips64. Backports commit 7ba6a512ae439c98c0c1f0f4348c079d90f9dd9d from qemu	2018-02-23 21:28:19 -05:00
Emilio G. Cota	de17843702	translate-all: add missing munmap of the code_gen guard page for MIPS Backports commit 8bdf4997823126a39bd4c99e4b2283b02cc7865f from qemu	2018-02-23 21:28:19 -05:00
Emilio G. Cota	9a2b02b241	translate-all: remove redundant setting of tcg_ctx.code_gen_buffer_size The setting of tcg_ctx.code_gen_buffer_size is done by the only caller of size_code_gen_buffer(), which is code_gen_alloc(): $ git grep size_code_gen_buffer translate-all.c:static inline size_t size_code_gen_buffer(size_t tb_size) translate-all.c: tcg_ctx.code_gen_buffer_size = size_code_gen_buffer(tb_size); Backports commit 835154b6e2200460f04719d0028716a37c178368 from qemu	2018-02-23 21:28:19 -05:00
Sergey Fedorov	c5b234ed1f	tcg: Note requirement on atomic direct jump patching Backports commit 10b4f4855537dd421e193a7d0416513116370558 from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	87c3382dc8	tcg/mips: Make direct jump patching thread-safe Ensure direct jump patching in MIPS is atomic by using atomic_read()/atomic_set() for code patching. Backports commit c82460a560176ef69c2f0662bd280612e274db96 from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	7538001da9	tcg/sparc: Make direct jump patching thread-safe Ensure direct jump patching in SPARC is atomic by using atomic_read()/atomic_set() for code patching. Backports commit 84f79fb7c6e857edc807e4a251338243ce0cbac3 from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	a45f8cb49d	tcg/aarch64: Make direct jump patching thread-safe Ensure direct jump patching in AArch64 is atomic by using atomic_read()/atomic_set() for code patching. Backports commit 9e269112953be4d670cb0d25042bd6546fcf3e45 from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	52e2972300	tcg/arm: Make direct jump patching thread-safe Ensure direct jump patching in ARM is atomic by using atomic_read()/atomic_set() for code patching. Backports commit 7d14e0e2d661479985197203589c38840e1066df from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	57359fbe6c	tcg/s390: Make direct jump patching thread-safe Ensure direct jump patching in s390 is atomic by: * naturally aligning a location of direct jump address; * using atomic_read()/atomic_set() for code patching. Backports commit ed3d51ecd7fe248d3959e469d53890ac9ffe0cd2 from qemu	2018-02-23 21:28:18 -05:00
Sergey Fedorov	5eb2d6618f	tcg/i386: Make direct jump patching thread-safe Ensure direct jump patching in i386 is atomic by: * naturally aligning a location of direct jump address; * using atomic_read()/atomic_set() for code patching. Backports commit 0d07abf05e98903c7faf204a9a90f7d45b7554dc from qemu	2018-02-23 21:28:17 -05:00
Lioncash	fffa27d269	osdep: MSVC-compatible alignment macros	2018-02-23 21:28:17 -05:00
Sergey Fedorov	3456f0879e	include/qemu/osdep.h: Add macros for pointer alignment These macros provide a convenient way to n-byte align pointers up and down and check if a pointer is n-byte aligned. Backports commit 6b587d3cda48e7ba26de8d30bf0d8a7063970715 from qemu	2018-02-23 21:28:17 -05:00
Sergey Fedorov	47eac70cb9	include/qemu/osdep.h: Add a macro to check for alignment Backports commit 18a60a76147569ca9e11b0607e50ce4012fe1aaa from qemu	2018-02-23 21:28:17 -05:00
Emilio G. Cota	170f6e0b3b	tb: consistently use uint32_t for tb->flags We are inconsistent with the type of tb->flags: usage varies loosely between int and uint64_t. Settle to uint32_t everywhere, which is superior to both: at least one target (aarch64) uses the most significant bit in the u32, and uint64_t is wasteful. Compile-tested for all targets. Backports commit 89fee74a0f066dfd73830a7b5fa137e87888c870 from qemu	2018-02-23 21:28:11 -05:00
Peter Maydell	fe2000aa32	target-arm: Avoid unnecessary TLB flush on TCR_EL2, TCR_EL3 writes The TCR_EL2 and TCR_EL3 regdefs were incorrectly using the vmsa_tcr_el1_write function for writes. Since these registers don't have the A1 bit that TCR_EL1 does, we don't need to do a tlb_flush() when they are written. Remove the unnecessary .writefn and also the harmless but unneeded .raw_writefn and .resetfn definitions. Backports commit 6459b94c26dd666badb3547fef1456992a08e60b from qemu	2018-02-23 20:09:12 -05:00
Edgar E. Iglesias	eb79db28d5	target-arm/translate-a64.c: Unify some of the ldst_reg decoding The various load/store variants under disas_ldst_reg can all reuse the same decoding for opc, size, rt and is_vector. This patch unifies the decoding in preparation for generating instruction syndromes for data aborts. This will allow us to reduce the number of places to hook in updates to the load/store state needed to generate the insn syndromes. No functional change. Backports commit cd694521ca061a5d0436d5df4ec8c17c8f4dfcdb from qemu	2018-02-23 20:06:31 -05:00
Edgar E. Iglesias	602e9e34b9	target-arm/translate-a64.c: Use extract32 in disas_ldst_reg_imm9 Use extract32 instead of open coding the bit masking when decoding is_signed and is_extended. This streamlines the decoding with some of the other ldst variants. No functional change. Backports commit 026a19c3128678d4fe301fc36e8ffacdc9ecccb8 from qemu	2018-02-23 20:04:11 -05:00
Peter Maydell	56e9d7c09e	target-arm: Split data abort syndrome generator Split the data abort syndrome generator into two versions: One with a valid Instruction Specific Syndrome (ISS) and another without. The following new flags are supported by the syndrome generator with ISS: * isv - Instruction syndrome valid * sas - Syndrome access size * sse - Syndrome sign extend * srt - Syndrome register transfer * sf - Sixty-Four bit register width * ar - Acquire/Release These flags are not yet used, so this patch has no functional change except that we will now correctly set the IL bit in data abort syndromes without ISS information. Backports commit 094d028a7968236cd2b7f7b96394f7a3b8ad97c8 from qemu	2018-02-23 20:03:04 -05:00
Edgar E. Iglesias	bfc74c4da2	gen-icount: Use tcg_set_insn_param Use tcg_set_insn_param() instead of directly accessing internal tcg data structures to update an insn param. Backports commit 25caa94c4a26daaab1e65c6d887e2972aeb5749e from qemu	2018-02-23 20:01:17 -05:00
Edgar E. Iglesias	a30a478538	tcg: Add tcg_set_insn_param Add tcg_set_insn_param as a mechanism to modify an insn parameter after emiting the insn. This is useful for icount and also for embedding fault information for a specific insn. Backports commit 1d41478fd428e01f057d3248292e4cdcdb048523 from qemu	2018-02-23 19:58:49 -05:00
Sergey Sorokin	98a6d44c54	target-arm: Fix descriptor address masking in ARM address translation There is a bug in ARM address translation regime with a long-descriptor format. On the descriptor reading its address is formed from an index which is a part of the input address. And on the first iteration this index is incorrectly masked with 'grainsize' mask. But it can be wider according to pseudo-code. On the other hand on the iterations other than first the descriptor address is formed from the previous level descriptor by masking with 'descaddrmask' value. It always clears just 12 lower bits, but it must clear 'grainsize' lower bits instead according to pseudo-code. The patch fixes both cases. Backports commit dddb5223413c5425ae6eaeb3b967627efc9675f7 from qemu	2018-02-23 19:56:56 -05:00
Sergey Sorokin	00e751f18e	target-arm: Stage 2 permission fault was fixed in AArch32 state As described in AArch32.CheckS2Permission an instruction fetch fails if XN bit is set or there is no read permission for the address. Backports commit dfda68377e20943f474505e75238cb96bc6874bf from qemu	2018-02-23 19:55:11 -05:00
Eric Blake	2f42c2c195	qapi: Change visit_type_FOO() to no longer return partial objects Returning a partial object on error is an invitation for a careless caller to leak memory. We already fixed things in an earlier patch to guarantee NULL if visit_start fails ("qapi: Guarantee NULL obj on input visitor callback error"), but that does not help the case where visit_start succeeds but some other failure happens before visit_end, such that we leak a partially constructed object outside visit_type_FOO(). As no one outside the testsuite was actually relying on these semantics, it is cleaner to just document and guarantee that ALL pointer-based visit_type_FOO() functions always leave a safe value in obj during an input visitor (either the new object on success, or NULL if an error is encountered), so callers can now unconditionally use qapi_free_FOO() to clean up regardless of whether an error occurred. The decision is done by adding visit_is_input(), then updating the generated code to check if additional cleanup is needed based on the type of visitor in use. Note that we still leave obj unchanged after a scalar-based visit_type_FOO(); I did not feel like auditing all uses of visit_type_Enum() to see if the callers would tolerate a specific sentinel value (not to mention having to decide whether it would be better to use 0 or ENUM__MAX as that sentinel). Backports commit 68ab47e4b4ecc1c4649362b8cc1e49794d1a6537 from qemu	2018-02-23 19:53:17 -05:00
Eric Blake	0d52542da2	qapi: Simplify semantics of visit_next_list() The semantics of the list visit are somewhat baroque, with the following pseudocode when FooList is used: start() for (prev = head; cur = next(prev); prev = &cur) { visit(&cur->value) } Note that these semantics (advance before visit) requires that the first call to next() return the list head, while all other calls return the next element of the list; that is, every visitor implementation is required to track extra state to decide whether to return the input as-is, or to advance. It also requires an argument of 'GenericList *' to next(), solely because the first iteration might need to modify the caller's GenericList head, so that all other calls have to do a layer of dereferencing. Thankfully, we only have two uses of list visits in the entire code base: one in spapr_drc (which completely avoids visit_next_list(), feeding in integers from a different source than uint8List), and one in qapi-visit.py. That is, all other list visitors are generated in qapi-visit.c, and share the same paradigm based on a qapi FooList type, so we can refactor how lists are laid out with minimal churn among clients. We can greatly simplify things by hoisting the special case into the start() routine, and flipping the order in the loop to visit before advance: start(head) for (tail = head; tail; tail = next(tail)) { visit(&tail->value) } With the simpler semantics, visitors have less state to track, the argument to next() is reduced to 'GenericList *', and it also becomes obvious whether an input visitor is allocating a FooList during visit_start_list() (rather than the old way of not knowing if an allocation happened until the first visit_next_list()). As a minor drawback, we now allocate in two functions instead of one, and have to pass the size to both functions (unless we were to tweak the input visitors to cache the size to start_list for reuse during next_list, but that defeats the goal of less visitor state). The signature of visit_start_list() is chosen to match visit_start_struct(), with the new parameters after 'name'. The spapr_drc case is a virtual visit, done by passing NULL for list, similarly to how NULL is passed to visit_start_struct() when a qapi type is not used in those visits. It was easy to provide these semantics for qmp-output and dealloc visitors, and a bit harder for qmp-input (several prerequisite patches refactored things to make this patch straightforward). But it turned out that the string and opts visitors munge enough other state during visit_next_list() to make it easier to just document and require a GenericList visit for now; an assertion will remind us to adjust things if we need the semantics in the future. Several pre-requisite cleanup patches made the reshuffling of the various visitors easier; particularly the qmp input visitor. Backports commit d9f62dde1303286b24ac8ce88be27e2b9b9c5f46 from qemu	2018-02-23 19:50:26 -05:00
Lioncash	ed72ba0f8b	qapi: Fix string input visitor handling of invalid list As shown in the previous commit, the string input visitor was treating bogus input as an empty list rather than an error. Fix parse_str() to set errp, then the callers to exit early if an error was reported. Meanwhile, fix the testsuite to use the generated qapi_free_int16List() instead of rolling our own, and to validate the fixed behavior, while at the same time documenting one more change that we'd like to make in a later patch (a failed visit_start_list should guarantee a NULL pointer, regardless of what things were on input). Backports commit 74f24cb6306d065045d0e2215a7d10533fa59c57 from qemu	2018-02-23 19:25:26 -05:00
Eric Blake	6084be1882	qapi: Split visit_end_struct() into pieces As mentioned in previous patches, we want to call visit_end_struct() functions unconditionally, so that visitors can release resources tied up since the matching visit_start_struct() without also having to worry about error priority if more than one error occurs. Even though error_propagate() can be safely used to ignore a second error during cleanup caused by a first error, it is simpler if the cleanup cannot set an error. So, split out the error checking portion (basically, input visitors checking for unvisited keys) into a new function visit_check_struct(), which can be safely skipped if any earlier errors are encountered, and leave the cleanup portion (which never fails, but must be called unconditionally if visit_start_struct() succeeded) in visit_end_struct(). Generated code in qapi-visit.c has diffs resembling: \|@@ -59,10 +59,12 @@ void visit_type_ACPIOSTInfo(Visitor *v, \| goto out_obj; \| } \| visit_type_ACPIOSTInfo_members(v, obj, &err); \|- error_propagate(errp, err); \|- err = NULL; \|+ if (err) { \|+ goto out_obj; \|+ } \|+ visit_check_struct(v, &err); \| out_obj: \|- visit_end_struct(v, &err); \|+ visit_end_struct(v); \| out: and in qapi-event.c: @@ -47,7 +47,10 @@ void qapi_event_send_acpi_device_ost(ACP \| goto out; \| } \| visit_type_q_obj_ACPI_DEVICE_OST_arg_members(v, &param, &err); \|- visit_end_struct(v, err ? NULL : &err); \|+ if (!err) { \|+ visit_check_struct(v, &err); \|+ } \|+ visit_end_struct(v); \| if (err) { \| goto out; Backports commit 15c2f669e3fb2bc97f7b42d1871f595c0ac24af8 from qemu	2018-02-23 19:13:47 -05:00
Eric Blake	ae8d475ae0	qmp: Tighten output visitor rules Tighten assertions in the QMP output visitor, so that: - qmp_output_get_qobject() can only be called after pairing a visit_end_* for every visit_start_* (rather than allowing it on a partially built object) - qmp_output_get_qobject() cannot be called unless at least one visit_type_* or visit_start/visit_end pair has occurred since creation/reset (the accidental return of NULL fixed by commit ab8bf1d7 would have been much easier to diagnose) - ensure that we are encountering the expected object or list type, to provide protection against mismatched push(struct)/ pop(list) or push(list)/pop(struct), similar to the qmp-input protection added in commit bdd8e6b5. - ensure that except for the root, 'name' is non-null inside a dict, and NULL inside a list (this may need changing later if we add "name.0" support for better error messages for a list, but for now it makes sure all users are at least consistent) Backports commit 56a6f02b8ce1fe41a2a9077593e46eca7d98267d from qemu	2018-02-23 19:04:41 -05:00
Eric Blake	e5b2cff2bd	qmp: Support explicit null during visits Implement the new type_null() callback for the qmp input and output visitors. While we don't yet have a use for this in QAPI input (the generator will need some tweaks first), some potential usages have already been discussed on the list. Meanwhile, the output visitor could already output explicit null via type_any, but this gives us finer control. At any rate, it's easy to test that we can round-trip an explicit null through manual use of visit_type_null() wrapped by a virtual visit_start_struct() walk, even if we can't do the visit in a QAPI type. Repurpose the test_visitor_out_empty test, particularly since a future patch will tighten semantics to forbid use of qmp_output_get_qobject() without at least one intervening visit_type_*. Backports commit 3df016f185521f8dfa5bd89168722887156405c7 from qemu	2018-02-23 19:02:18 -05:00
Eric Blake	ef6b7b50f6	qapi: Add visit_type_null() visitor Right now, qmp-output-visitor happens to produce a QNull result if nothing is actually visited between the creation of the visitor and the request for the resulting QObject. A stronger protocol would require that a QMP output visit MUST visit something. But to still be able to produce a JSON 'null' output, we need a new visitor function that states our intentions. Yes, we could say that such a visit must go through visit_type_any(), but that feels clunky. So this patch introduces the new visit_type_null() interface and its no-op interface in the dealloc visitor, and stubs in the qmp visitors (the next patch will finish the implementation). For the visitors that will not implement the callback, document the situation. The code in qapi-visit-core unconditionally dereferences the callback pointer, so that a segfault will inform a developer if they need to implement the callback for their choice of visitor. Note that JSON has a primitive null type, with the single value null; likewise with the QNull type for QObject; but for QAPI, we just have the 'null' value without a null type. We may eventually want to add more support in QAPI for null (most likely, we'd use it via an alternate type that permits 'null' or an object); but we'll create that usage when we need it. Backports commit 3bc97fd5924561d92f32758c67eaffd2e4e25038 from qemu	2018-02-23 15:48:57 -05:00
Eric Blake	fafb3e354b	qapi: Document visitor interfaces, add assertions The visitor interface for mapping between QObject/QemuOpts/string and QAPI is scandalously under-documented, making changes to visitor core, individual visitors, and users of visitors difficult to coordinate. Among other questions: when is it safe to pass NULL, vs. when a string must be provided; which visitors implement which callbacks; the difference between concrete and virtual visits. Correct this by retrofitting proper contracts, and document where some of the interface warts remain (for example, we may want to modify visit_end_* to require the same 'obj' as the visit_start counterpart, so the dealloc visitor can be simplified). Later patches in this series will tackle some, but not all, of these warts. Add assertions to (partially) enforce the contract. Some of these were only made possible by recent cleanup commits. Backports commit adfb264c9ed04bfc694921b72173be8e29e90024 from qemu	2018-02-23 15:45:31 -05:00
Eric Blake	9e999acc83	qapi: Change visit_start_implicit_struct to visit_start_alternate After recent changes, the only remaining use of visit_start_implicit_struct() is for allocating the space needed when visiting an alternate. Since the term 'implicit struct' is hard to explain, rename the function to its current usage. While at it, we can merge the functionality of visit_get_next_type() into the same function, making it more like visit_start_struct(). Generated code is now slightly smaller: \| { \| Error err = NULL; \| \|- visit_start_implicit_struct(v, (void) obj, sizeof(BlockdevRef), &err); \|+ visit_start_alternate(v, name, (GenericAlternate )obj, sizeof(obj), \|+ true, &err); \| if (err) { \| goto out; \| } \|- visit_get_next_type(v, name, &(obj)->type, true, &err); \|- if (err) { \|- goto out_obj; \|- } \| switch ((*obj)->type) { \| case QTYPE_QDICT: \| visit_start_struct(v, name, NULL, 0, &err); ... \| } \|-out_obj: \|- visit_end_implicit_struct(v); \|+ visit_end_alternate(v); \| out: \| error_propagate(errp, err); \| } Backports commit dbf11922622685934bfb41e7cf2be9bd4a0405c0 from qemu	2018-02-23 15:33:25 -05:00
Eric Blake	5389c1cd5f	qmp-input: Refactor when list is advanced In the QMP input visitor, visiting a list traverses two objects: the QAPI GenericList of the caller (which gets advanced in visit_next_list() regardless of this patch), and the QList input that we are converting to QAPI. For consistency with QDict visits, we want to consume elements from the input QList during the visit_type_FOO() for the list element; that is, we want ALL the code for consuming an input to live in qmp_input_get_object(), rather than having it split according to whether we are visiting a dict or a list. Making qmp_input_get_object() the common point of consumption will make it easier for a later patch to refactor visit_start_list() to cover the GenericList * head of a QAPI list, and in turn will get rid of the 'first' flag (which lived in qmp_input_next_list() pre-patch, and is hoisted to StackObject by this patch). This patch is therefore altering the post-condition use of 'entry', while keeping what gets visited unchanged, from: start_list next_list type_ELT ... next_list type_ELT next_list end_list visits 1st elt last elt entry NULL 1st elt 1st elt last elt last elt NULL gone where type_ELT() returns (entry ? entry : 1st elt) and next_list() steps entry to this usage: start_list next_list type_ELT ... next_list type_ELT next_list end_list visits 1st elt last elt entry 1st elt 1nd elt 2nd elt last elt NULL NULL gone where type_ELT() steps entry and returns the old entry, and next_list() leaves entry alone. Backports commit fcf3cb21783b2dae3358fdbe7001cb2f74e0cedf from qemu	2018-02-23 15:19:40 -05:00
Eric Blake	68cf25fafa	qmp-input: Require struct push to visit members of top dict Don't embed the root of the visit into the stack of current containers being visited. That way, we no longer get confused on whether the first visit of a dictionary is to the dictionary itself or to one of the members of the dictionary, based on whether the caller passed name=NULL; and makes the QMP Input visitor like other visitors where the value of 'name' is now ignored on the root visit. (We may someday want to revisit the rules on what 'name' should be on a top-level visit, rather than just ignoring it; but that would be the topic of another patch). An audit of all qmp_input_visitor_new() call sites shows that there were only two places where callers had previously been visiting to a QDict with a non-NULL name to bypass a call to visit_start_struct(), and those were fixed in prior patches. Backports commit ce140b176920b5b65184020735a3c65ed3e9aeda from qemu	2018-02-23 15:16:43 -05:00
Eric Blake	1bb4e4c787	qmp-input: Don't consume input when checking has_member Commit e8316d7 mistakenly passed consume=true within qmp_input_optional() when checking if an optional member was present, but the mistake was silently ignored since the code happily let us extract a member more than once. Fix qmp_input_optional() to not consume anything, then tighten up the input visitor to ensure that a member is consumed exactly once (all generated code follows this pattern; and the new assert will catch any hand-written code that tries to visit the same key more than once). Backports commit e5826a2fd727f0be54a81083f31fe02a275465cd from qemu	2018-02-23 15:12:58 -05:00
Eric Blake	cae9c2bd2d	qapi: Use strict QMP input visitor in more places The following uses of a QMP input visitor should be strict (that is, excess keys in QDict input should be flagged if not converted to QAPI): - Testsuite code unrelated to explicitly testing non-strict mode (test-qmp-commands, test-visitor-serialization); since we want more code to be strict by default, having more tests of strict mode doesn't hurt - Code used for cloning QAPI objects (replay-input.c, qemu-sockets.c); we are reparsing a QObject just barely produced by the qmp output visitor and which therefore should not have any garbage, so while it is extra work to be strict, it validates that our clone is correct [note that a later patch series will simplify these two uses by creating an actual clone visitor that is much more efficient than a generate/reparse cycle] - qmp_object_add(), which calls into user_creatable_add_type(). Since command line parsing for '-object' uses the same user_creatable_add_type() through the OptsVisitor, and that is always strict, we want to ensure that any nested dictionaries would be treated the same in QMP and from the command line (I don't actually know if such nested dictionaries exist). Note that on this code change, strictness only matters for nested dictionaries (if even possible), since we already flag excess input at the top level during an earlier object_property_set() on an unknown key, whether from QemuOpts: $ ./x86_64-softmmu/qemu-system-x86_64 -nographic -nodefaults -qmp stdio -object secret,id=sec0,data=letmein,format=raw,foo=bar qemu-system-x86_64: -object secret,id=sec0,data=letmein,format=raw,foo=bar: Property '.foo' not found or from QMP: $ ./x86_64-softmmu/qemu-system-x86_64 -nographic -nodefaults -qmp stdio {"QMP": {"version": {"qemu": {"micro": 93, "minor": 5, "major": 2}, "package": ""}, "capabilities": []}} {"execute":"qmp_capabilities"} {"return": {}} {"execute":"object-add","arguments":{"qom-type":"secret","id":"sec0","props":{"format":"raw","data":"letmein","foo":"bar"}}} {"error": {"class": "GenericError", "desc": "Property '.foo' not found"}} The only remaining uses of non-strict input visits are: - QMP 'qom-set' (which eventually executes object_property_set_qobject()) - mark it as something to revisit in the future (I didn't want to spend any more time on this patch auditing if we have any QOM dictionary properties that might be impacted, and couldn't easily prove whether this code path is shared with anything else). - test-qmp-input-visitor: explicit tests of non-strict mode. If we later get rid of users that don't need strictness, then this test should be merged with test-qmp-input-strict Backports relevant parts of commit 240f64b6dc3346d044d7beb7cc3a53668ce47384 from qemu	2018-02-23 15:11:35 -05:00
Eric Blake	559304aed9	qapi: Consolidate QMP input visitor creation Rather than having two separate ways to create a QMP input visitor, where the safer approach has the more verbose name, it is better to consolidate things into a single function where the caller must explicitly choose whether to be strict or to ignore excess input. This patch is the strictly mechanical conversion; the next patch will then audit which uses can be made stricter. Backports commit fc471c18d5d2ec713d5a019f9530398675494bc8 from qemu	2018-02-23 15:09:57 -05:00
Eric Blake	b1c4558849	qmp-input: Clean up stack handling Management of the top of stack was a bit verbose; creating a temporary variable and adding some comments makes the existing code more legible before the next few patches improve things. No semantic changes other than asserting that we are always visiting a QObject, and not a NULL value. In particular, the check for 'name && qobject_type(qobj) == QTYPE_QDICT)' is a bit overkill (a dict visit should always have a name); a later patch revisits that, while this patch is only changing one layer of indentation due to dropping 'if (qobj)'. Backports commit b471d012e5d7bec1d2272738141e121b5581fcdf from qemu	2018-02-23 15:08:14 -05:00
Eric Blake	0ec9a5adaf	qapi: Guarantee NULL obj on input visitor callback error Our existing input visitors were not very consistent on errors in a function taking 'TYPE *obj'. These are start_struct(), start_alternate(), type_str(), and type_any(). next_list() is similar, but can't fail (see commit 08f9541). While all of them set 'obj' to allocated storage on success, it was not obvious whether 'obj' was guaranteed safe on failure, or whether it was left uninitialized. But a future patch wants to guarantee that visit_type_FOO() does not leak a partially-constructed obj back to the caller; it is easier to implement this if we can reliably state that input visitors assign 'obj' regardless of success or failure, and that on failure obj is NULL. Add assertions to enforce consistency in the final setting of err vs. obj. The opts-visitor start_struct() doesn't set an error, but it also was doing a weird check for 0 size; all callers pass in non-zero size if obj is non-NULL. The testsuite has at least one spot where we no longer need to pre-initialize a variable prior to a visit; valgrind confirms that the test is still fine with the cleanup. A later patch will document the design constraint implemented here. Backports commit e58d695e6c3a5cfa0aa2fc91b87ade017ef28b05 from qemu	2018-02-23 14:53:23 -05:00
Eric Blake	3cf7b6dd3b	qapi: Adjust layout of FooList types By sticking the next pointer first, we don't need a union with 64-bit padding for smaller types. On 32-bit platforms, this can reduce the size of uint8List from 16 bytes (or 12, depending on whether 64-bit ints can tolerate 4-byte alignment) down to 8. It has no effect on 64-bit platforms (where alignment still dictates a 16-byte struct); but fewer anonymous unions is still a win in my book. It requires visit_next_list() to gain a size parameter, to know what size element to allocate; comparable to the size parameter of visit_start_struct(). I debated about going one step further, to allow for fewer casts, by doing: typedef GenericList GenericList; struct GenericList { GenericList next; }; struct FooList { GenericList base; Foo value; }; so that you convert to 'GenericList ' by '&foolist->base', and back by 'container_of(generic, GenericList, base)' (as opposed to the existing '(GenericList )foolist' and '(FooList )generic'). But doing that would require hoisting the declaration of GenericList prior to inclusion of qapi-types.h, rather than its current spot in visitor.h; it also makes iteration a bit more verbose through 'foolist->base.next' instead of 'foolist->next'. Note that for lists of objects, the 'value' payload is still hidden behind a boxed pointer. Someday, it would be nice to do: struct FooList { FooList next; Foo value; }; for one less level of malloc for each list element. This patch is a step in that direction (now that 'next' is no longer at a fixed non-zero offset within the struct, we can store more than just a pointer's-worth of data as the value payload), but the actual conversion would be a task for another series, as it will touch a lot of code. Backports commit e65d89bf1a4484e0db0f3dc820a8b209f2fb1e8b from qemu	2018-02-23 14:49:06 -05:00
Eric Blake	eef0932471	qapi-visit: Add visitor.type classification We have three classes of QAPI visitors: input, output, and dealloc. Currently, all implementations of these visitors have one thing in common based on their visitor type: the implementation used for the visit_type_enum() callback. But since we plan to add more such common behavior, in relation to documenting and further refining the semantics, it makes more sense to have the visitor implementations advertise which class they belong to, so the common qapi-visit-core code can use that information in multiple places. A later patch will better document the types of visitors directly in visitor.h. For this patch, knowing the class of a visitor implementation lets us make input_type_enum() and output_type_enum() become static functions, by replacing the callback function Visitor.type_enum() with the simpler enum member Visitor.type. Share a common assertion in qapi-visit-core as part of the refactoring. Move comments in opts-visitor.c to match the refactored layout. Backports commit 983f52d4b3f86fb9dc9f8b142132feb5a8723016 from qemu	2018-02-23 14:25:41 -05:00
Dave Hansen	f50acc467f	target-i386: fix typo in xsetbv implementation QEMU 2.6 added support for the XSAVE family of instructions, which includes the XSETBV instruction which allows setting the XCR0 register. But, when booting Linux kernels with XSAVE support enabled, I was getting very early crashes where the instruction pointer was set to 0x3. I tracked it down to a jump instruction generated by this: gen_jmp_im(s->pc - pc_start); where s->pc is pointing to the instruction after XSETBV and pc_start is pointing _at_ XSETBV. Subtract the two and you get 0x3. Whoops. The fix is to replace this typo with the pattern found everywhere else in the file when folks want to end the translation buffer. Richard Henderson confirmed that this is a bug and that this is the correct fix. Backports commit 502c8e86ea07294067578292c6d402601c196019 from qemu	2018-02-23 14:15:35 -05:00
Fam Zheng	5c739f14f5	util: Fix MIN_NON_ZERO MIN_NON_ZERO(1, 0) is evaluated to 0. Rewrite the macro to fix it. Backports commit b6ece2c6f37926a994bc564a9e55ef3be6016d8f from qemu	2018-02-23 14:09:44 -05:00
Artyom Tarasenko	1781d5cfa6	target-sparc: fix register corruption in ldstub if there is no write permission Backports commit 9566ceeef41ccb5241d340b34776a33450e8f9e5 from qemu	2018-02-23 14:06:38 -05:00
Paolo Bonzini	44c4dd02c9	target-i386: key sfence availability on CPUID_SSE, not CPUID_SSE2 sfence was introduced before lfence and mfence. This fixes Linux 2.4's measurement of checksumming speeds for the pIII_sse algorithm: md: linear personality registered as nr 1 md: raid0 personality registered as nr 2 md: raid1 personality registered as nr 3 md: raid5 personality registered as nr 4 raid5: measuring checksumming speed 8regs : 384.400 MB/sec 32regs : 259.200 MB/sec invalid operand: 0000 CPU: 0 EIP: 0010:[<c0240b2a>] Not tainted EFLAGS: 00000246 eax: c15d8000 ebx: 00000000 ecx: 00000000 edx: c15d5000 esi: 8005003b edi: 00000004 ebp: 00000000 esp: c15bdf50 ds: 0018 es: 0018 ss: 0018 Process swapper (pid: 1, stackpage=c15bd000) Stack: 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000206 c0241c6c 00001000 c15d4000 c15d7000 c15d4000 c15d4000 Call Trace: [<c0241c6c>] [<c0105000>] [<c0241db4>] [<c010503b>] [<c0105000>] [<c0107416>] [<c0105030>] Code: 0f ae f8 0f 10 04 24 0f 10 4c 24 10 0f 10 54 24 20 0f 10 5c <0>Kernel panic: Attempted to kill init! Backports commit bd5d278668f33aa08755a982986cd1159746c037 from qemu	2018-02-23 14:03:19 -05:00
Aurelien Jarno	0c4bebb9bc	target-mips: fix call to memset in soft reset code Recent versions of GCC report the following error when compiling target-mips/helper.c: qemu/target-mips/helper.c:542:9: warning: ‘memset’ used with length equal to number of elements without multiplication by element size [-Wmemset-elt-size] Backports commit a525decfaa3449f1458ea2d7a06320cf46aebf3f from qemu	2018-02-23 14:01:50 -05:00
James Hogan	e4903fc5f2	target-mips: Fix RDHWR exception host PC Commit b00c72180c36 ("target-mips: add PC, XNP reg numbers to RDHWR") changed the rdhwr helpers to use check_hwrena() to check the register being accessed is enabled in CP0_HWREna when used from user mode. If that check fails an EXCP_RI exception is raised at the host PC calculated with GETPC(). However check_hwrena() may not be fully inlined as the do_raise_exception() part of it is common regardless of the arguments. This causes GETPC() to calculate the address in the call in the helper instead of the generated code calling the helper. No TB will be found and the EPC reported with the resulting guest RI exception points to the beginning of the TB instead of the RDHWR instruction. We can't reliably force check_hwrena() to be inlined, and converting it to a macro would be ugly, so instead pass the host PC in as an argument, with each rdhwr helper passing GETPC(). This should avoid any dependence on compiler behaviour, and in practice seems to ensure the full inlining of check_hwrena() on x86_64. This issue causes failures when running a MIPS KVM (trap & emulate) guest in a MIPS QEMU TCG guest, as the inner guest kernel will do a RDHWR of counter, which is disabled in the outer guest's CP0_HWREna by KVM so it can emulate the inner guest's counter. The emulation fails and the RI exception is passed to the inner guest. Backports commit d96391c1ffeb30a0afa695c86579517c69d9a889 from qemu	2018-02-23 13:59:37 -05:00
Christoffer Dall	6180cbd477	util: align memory allocations to 2M on AArch64 For KVM to use Transparent Huge Pages (THP) we have to ensure that the alignment of the userspace address of the KVM memory slot and the IPA that the guest sees for a memory region have the same offset from the 2M huge page size boundary. One way to achieve this is to always align the IPA region at a 2M boundary and ensure that the mmap alignment is also at 2M. Unfortunately, we were only doing this for __arm__, not for __aarch64__, so add this simple condition. This fixes a performance regression using KVM/ARM on AArch64 platforms that showed a performance penalty of more than 50%, introduced by the following commit: 9fac18f (oslib: allocate PROT_NONE pages on top of RAM, 2015-09-10) We were only lucky before the above commit, because we were allocating large regions and naturally getting a 2M alignment on those allocations then. Backports commit ee1e0f8e5d3682c561edcdceccff72b9d9b16d8b from qemu	2018-02-23 13:56:59 -05:00
Aurelien Jarno	6060ab6596	tcg: check for CONFIG_DEBUG_TCG instead of NDEBUG Check for CONFIG_DEBUG_TCG instead of NDEBUG, drop now useless code. Backports commit 8d8fdbae010aa75a23f0307172e81034125aba6e from qemu	2018-02-23 13:55:21 -05:00
Aurelien Jarno	355ed7cd08	tcg: use tcg_debug_assert instead of assert (fix performance regression) The TCG code is quite performance sensitive, but at the same time can also be quite tricky. That is why asserts that can be enabled with the --enable-debug-tcg configure option. This used to work the following way: \| #include "config.h" \| \| ... \| \| #if !defined(CONFIG_DEBUG_TCG) && !defined(NDEBUG) \| /* define it to suppress various consistency checks (faster) */ \| #define NDEBUG \| #endif \| \| ... \| \| #include <assert.h> Since commit 757e725b (tcg: Clean up includes) "config.h" as been replaced by "qemu/osdep.h" which itself includes <assert.h>. As a consequence the assertions are always enabled, even when using --disable-debug-tcg, causing a performance regression, especially on targets with many registers. For instance on qemu-system-ppc the speed difference is about 15%. tcg_debug_assert is controlled directly by CONFIG_DEBUG_TCG and already uses in some places. This patch replaces all the calls to assert into calss to tcg_debug_assert. Backports commit eabb7b91b36b202b4dac2df2d59d698e3aff197a from qemu	2018-02-23 13:52:13 -05:00
Artyom Tarasenko	af0e282ab1	target-sparc: fix Trap Based Address Register behavior for sparc64 Accoding the chapter 7.6 Trap Processing of the SPARC Architecture Manual v9, the Trap Based Address Register is not modified as a trap is taken. This fix allows booting FreeBSD-10.3-RELEASE-sparc64. Backports commit de5f1077446ca455342db149737bdc395a7b9882 from qemu	2018-02-23 13:39:59 -05:00

... 2 3 4 5 6 ...

1971 commits