mirror of https://github.com/yuzu-emu/unicorn.git synced 2025-12-13 21:31:26 +00:00

Unicorn CPU emulator framework (ARM, AArch64, M68K, Mips, Sparc, X86)

Emilio G. Cota ae3e22a689 2018-02-24 18:00:14 -05:00

tb hash: hash phys_pc, pc, and flags with xxhash

For some workloads such as arm bootup, tb_phys_hash is performance-critical. This is due to the high frequency of accesses to the hash table, originated by (frequent) TLB flushes that wipe out the cpu-private tb_jmp_cache's. More info: https://lists.nongnu.org/archive/html/qemu-devel/2016-03/msg05098.html

To dig further into this I modified an arm image booting debian jessie to immediately shut down after boot. Analysis revealed that quite a bit of time is unnecessarily spent in tb_phys_hash: the cause is poor hashing that results in very uneven loading of chains in the hash table's buckets; the longest observed chain had ~550 elements.

The appended patch addresses this with two changes:

1) Use xxhash as the hash table's hash function. xxhash is a fast, high-quality hashing function.

2) Feed the hashing function with not just tb_phys, but also pc and flags.

This improves performance over using just tb_phys for hashing, since that resulted in some hash buckets having many TB's, while others got very few; with these changes, the longest observed chain on a single hash bucket is brought down from ~550 to ~40.

Tests show that the other element checked for in tb_find_physical, cs_base, is always a match when tb_phys+pc+flags are a match, so hashing cs_base is wasteful. It could be that this is an ARM-only thing, though.

UPDATE: On Tue, Apr 05, 2016 at 08:41:43 -0700, Richard Henderson wrote:
> The cs_base field is only used by i386 (in 16-bit modes), and sparc (for a TB
> consisting of only a delay slot).
> It may well still turn out to be reasonable to ignore cs_base for hashing.

BTW, after this change the hash table should not be called "tb_hash_phys" anymore; this is addressed later in this series.

This change gives consistent bootup time improvements. I tested two host machines:

- Intel Xeon E5-2690: 11.6% less time
- Intel i7-4790K: 19.2% less time

Increasing the number of hash buckets yields further improvements. However, using a larger, fixed number of buckets can degrade performance for other workloads that do not translate as many blocks (600K+ for debian-jessie arm bootup). This is dealt with later in this series.

Backports commit 42bd32287f3a18d823f2258b813824a39ed7c6d9 from qemu
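The two changes described in the commit message can be illustrated with a minimal sketch: mix phys_pc, pc, and flags through xxhash32-style rounds, then reduce the result to a bucket index. This is not the code from the commit. The names `sketch_tb_hash`, `sketch_bucket`, `xx_round`, and `SKETCH_SEED` are hypothetical, and the power-of-two bucket count is an assumption made for illustration; only the prime constants and round structure follow the public xxhash32 algorithm.

```c
#include <assert.h>
#include <stdint.h>

/* Prime constants from the public xxhash32 algorithm. */
#define PRIME32_1 2654435761U
#define PRIME32_2 2246822519U
#define PRIME32_3 3266489917U
#define PRIME32_4  668265263U

#define SKETCH_SEED 1U /* hypothetical seed, chosen for illustration */

/* Rotate a 32-bit value left by r bits (0 < r < 32). */
static inline uint32_t rol32(uint32_t x, unsigned r)
{
    return (x << r) | (x >> (32 - r));
}

/* One xxhash32 accumulator round over a single 32-bit input word. */
static inline uint32_t xx_round(uint32_t acc, uint32_t input)
{
    acc += input * PRIME32_2;
    acc = rol32(acc, 13);
    return acc * PRIME32_1;
}

/* Hash phys_pc, pc, and flags together (hypothetical helper). */
static uint32_t sketch_tb_hash(uint64_t phys_pc, uint64_t pc, uint32_t flags)
{
    /* Split the two 64-bit values into four 32-bit lanes. */
    uint32_t v1 = xx_round(SKETCH_SEED + PRIME32_1 + PRIME32_2,
                           (uint32_t)(phys_pc >> 32));
    uint32_t v2 = xx_round(SKETCH_SEED + PRIME32_2, (uint32_t)phys_pc);
    uint32_t v3 = xx_round(SKETCH_SEED, (uint32_t)(pc >> 32));
    uint32_t v4 = xx_round(SKETCH_SEED - PRIME32_1, (uint32_t)pc);
    uint32_t h = rol32(v1, 1) + rol32(v2, 7) + rol32(v3, 12) + rol32(v4, 18);

    h += 20; /* total input length in bytes: 8 + 8 + 4 */

    /* Fold in the remaining 32-bit word (flags). */
    h += flags * PRIME32_3;
    h = rol32(h, 17) * PRIME32_4;

    /* Final avalanche so every input bit affects the low bits. */
    h ^= h >> 15;
    h *= PRIME32_2;
    h ^= h >> 13;
    h *= PRIME32_3;
    h ^= h >> 16;
    return h;
}

/* Map a hash to a bucket; assumes n_buckets is a power of two. */
static uint32_t sketch_bucket(uint32_t hash, uint32_t n_buckets)
{
    assert(n_buckets != 0 && (n_buckets & (n_buckets - 1)) == 0);
    return hash & (n_buckets - 1);
}
```

A caller would compute `sketch_bucket(sketch_tb_hash(phys_pc, pc, flags), n_buckets)` to pick a chain. Because pc and flags feed the hash, two blocks that share a physical page but differ in pc or flags are no longer forced into the same bucket, which is the effect the commit message credits for the shorter chains.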
bindings	link to Crystal binding	2017-12-23 00:26:40 +08:00
docs	Added note about installing tests dependencies on Mac OS X. Added note about tests failing when required architecture support is disabled in build. (#908 )	2017-10-12 19:56:00 +08:00
include	include: Move RAMList to ramlist.h	2018-02-20 08:47:51 -05:00
msvc	exec: add tb_hash_func5, derived from xxhash	2018-02-24 17:36:35 -05:00
qemu	tb hash: hash phys_pc, pc, and flags with xxhash	2018-02-24 18:00:14 -05:00
samples	Fixed register mistake in comments (#894 )	2017-09-17 16:40:01 +07:00
tests	add 64-bit test demonstrating setting MSRs and FS/GS segments (#901 )	2017-09-29 04:26:23 +08:00
.appveyor.yml	MSYS test (#852 )	2017-06-25 10:11:35 +08:00
.gitignore	arm64eb: add support for ARM64 big endian.	2017-04-24 23:30:01 +08:00
.travis.yml	use new travis osx image and brew (#935 )	2018-01-05 10:29:49 +08:00
AUTHORS.TXT	import	2015-08-21 15:04:50 +08:00
Brewfile	Update Brewfile	2017-09-30 17:36:44 +07:00
ChangeLog	update ChangeLog	2017-04-20 13:28:02 +08:00
config.mk	Fix document file extension	2016-08-08 17:33:49 +09:00
COPYING	import	2015-08-21 15:04:50 +08:00
COPYING.LGPL2	LGPL2 for all header files under include/unicorn/	2017-12-16 10:08:42 +08:00
COPYING_GLIB	glib_compat: add COPYING_GLIB	2016-12-27 10:15:08 +08:00
CREDITS.TXT	update CREDITS.TXT	2017-04-25 12:56:47 +08:00
install-cmocka-linux.sh	Start moving examples in S files (#851 )	2017-06-25 10:14:22 +08:00
list.c	callback to count number of instructions in uc_emu_start() should be executed first. fix #727	2017-06-16 13:22:38 +08:00
make.sh	Added MSVC support for arm64eb.	2017-04-25 14:23:58 +10:00
Makefile	crypto: introduce new module for computing hash digests	2018-02-17 15:23:17 -05:00
msvc.bat	add msvc.bat	2017-04-21 15:35:40 +08:00
pkgconfig.mk	bump extra version to 2	2017-04-21 15:30:40 +08:00
README.md	add Clojure	2017-12-23 00:32:33 +08:00
uc.c	uc: Move hook freeing code to its own function	2018-02-22 20:00:32 -05:00
windows_export.bat	Make the call out to visual studio extremely resilient	2017-01-02 03:32:48 -08:00

README.md

Unicorn Engine

Unicorn is a lightweight, multi-platform, multi-architecture CPU emulator framework based on QEMU.

Unicorn offers some unparalleled features:

Multi-architecture: ARM, ARM64 (ARMv8), M68K, MIPS, SPARC, and X86 (16, 32, 64-bit)
Clean/simple/lightweight/intuitive architecture-neutral API
Implemented in pure C, with bindings for Crystal, Clojure, Visual Basic, Perl, Rust, Ruby, Python, Java, .NET, Go, Delphi/Free Pascal and Haskell.
Native support for Windows & *nix (with Mac OSX, Linux, *BSD & Solaris confirmed)
High performance via Just-In-Time compilation
Support for fine-grained instrumentation at various levels
Thread-safety by design
Distributed under free software license GPLv2

Further information is available at http://www.unicorn-engine.org

License

This project is released under the GPLv2 (see COPYING). The header files under include/unicorn/ are covered by the LGPL2 (see COPYING.LGPL2).

Compilation & Docs

See docs/COMPILE.md file for how to compile and install Unicorn.

More documentation is available in docs/README.md.

Contact

Contribute

If you want to contribute, please pick up something from our GitHub issues.

We also maintain a list of more challenging problems in a TODO list.

CREDITS.TXT records important contributors to our project.