mirror of https://github.com/yuzu-emu/unicorn.git synced 2026-07-17 07:55:27 +00:00

Unicorn CPU emulator framework (ARM, AArch64, M68K, Mips, Sparc, X86)


Emilio G. Cota f772fd986d tcg: introduce regions to split code_gen_buffer

This is groundwork for supporting multiple TCG contexts.

The naive solution here is to split code_gen_buffer statically among the TCG threads; this however results in poor utilization if translation needs are different across TCG threads.

What we do here is to add an extra layer of indirection, assigning regions that act just like pages do in virtual memory allocation. (BTW if you are wondering about the chosen naming, I did not want to use blocks or pages because those are already heavily used in QEMU).

We use a global lock to serialize allocations as well as statistics reporting (we now export the size of the used code_gen_buffer with tcg_code_size()). Note that for the allocator we could just use a counter and atomic_inc; however, that would complicate the gathering of tcg_code_size()-like stats. So given that the region operations are not a fast path, a lock seems the most reasonable choice.

The effectiveness of this approach is clear after seeing some numbers. I used the bootup+shutdown of debian-arm with '-tb-size 80' as a benchmark. Note that I'm evaluating this after enabling per-thread TCG (which is done by a subsequent commit).

* -smp 1, 1 region (entire buffer):
qemu: flush code_size=83885014 nb_tbs=154739 avg_tb_size=357
qemu: flush code_size=83884902 nb_tbs=153136 avg_tb_size=363
qemu: flush code_size=83885014 nb_tbs=152777 avg_tb_size=364
qemu: flush code_size=83884950 nb_tbs=150057 avg_tb_size=373
qemu: flush code_size=83884998 nb_tbs=150234 avg_tb_size=373
qemu: flush code_size=83885014 nb_tbs=154009 avg_tb_size=360
qemu: flush code_size=83885014 nb_tbs=151007 avg_tb_size=370
qemu: flush code_size=83885014 nb_tbs=151816 avg_tb_size=367

That is, 8 flushes.

* -smp 8, 32 regions (80/32 MB per region) [i.e. this patch]:
qemu: flush code_size=76328008 nb_tbs=141040 avg_tb_size=356
qemu: flush code_size=75366534 nb_tbs=138000 avg_tb_size=361
qemu: flush code_size=76864546 nb_tbs=140653 avg_tb_size=361
qemu: flush code_size=76309084 nb_tbs=135945 avg_tb_size=375
qemu: flush code_size=74581856 nb_tbs=132909 avg_tb_size=375
qemu: flush code_size=73927256 nb_tbs=135616 avg_tb_size=360
qemu: flush code_size=78629426 nb_tbs=142896 avg_tb_size=365
qemu: flush code_size=76667052 nb_tbs=138508 avg_tb_size=368

Again, 8 flushes. Note how buffer utilization is not 100%, but it is close. Smaller region sizes would yield higher utilization, but we want region allocation to be rare (it acquires a lock), so we do not want to go too small.

* -smp 8, static partitioning of 8 regions (10 MB per region):
qemu: flush code_size=21936504 nb_tbs=40570 avg_tb_size=354
qemu: flush code_size=11472174 nb_tbs=20633 avg_tb_size=370
qemu: flush code_size=11603976 nb_tbs=21059 avg_tb_size=365
qemu: flush code_size=23254872 nb_tbs=41243 avg_tb_size=377
qemu: flush code_size=28289496 nb_tbs=52057 avg_tb_size=358
qemu: flush code_size=43605160 nb_tbs=78896 avg_tb_size=367
qemu: flush code_size=45166552 nb_tbs=82158 avg_tb_size=364
qemu: flush code_size=63289640 nb_tbs=116494 avg_tb_size=358
qemu: flush code_size=51389960 nb_tbs=93937 avg_tb_size=362
qemu: flush code_size=59665928 nb_tbs=107063 avg_tb_size=372
qemu: flush code_size=38380824 nb_tbs=68597 avg_tb_size=374
qemu: flush code_size=44884568 nb_tbs=79901 avg_tb_size=376
qemu: flush code_size=50782632 nb_tbs=90681 avg_tb_size=374
qemu: flush code_size=39848888 nb_tbs=71433 avg_tb_size=372
qemu: flush code_size=64708840 nb_tbs=119052 avg_tb_size=359
qemu: flush code_size=49830008 nb_tbs=90992 avg_tb_size=362
qemu: flush code_size=68372408 nb_tbs=123442 avg_tb_size=368
qemu: flush code_size=33555560 nb_tbs=59514 avg_tb_size=378
qemu: flush code_size=44748344 nb_tbs=80974 avg_tb_size=367
qemu: flush code_size=37104248 nb_tbs=67609 avg_tb_size=364

That is, 20 flushes. Note how a static partitioning approach uses the code buffer poorly, leading to many unnecessary flushes.

Backports commit e8feb96fcc6c16eab8923332e86ff4ef0e2ac276 from qemu

2018-03-14 12:10:29 -04:00
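
The region scheme described in the commit message above can be illustrated with a minimal sketch. This is not the code from the commit: every name here (RegionPool, ThreadCtx, pool_init, ctx_alloc, pool_retired_bytes) is hypothetical, and the statistics are simplified to count only bytes in regions a context has already moved on from. The sketch assumes a buffer divided into equal regions, a per-thread context that bump-allocates inside its current region without locking, and a global lock taken only when a context needs a fresh region.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

/* Hypothetical types for illustration; not QEMU's or Unicorn's actual API. */
typedef struct {
    unsigned char *start;    /* base of the whole code buffer */
    size_t region_size;      /* bytes per region */
    size_t n_regions;        /* number of regions in the buffer */
    size_t next;             /* index of the next unassigned region */
    size_t retired_bytes;    /* bytes used in regions already left behind */
    pthread_mutex_t lock;    /* serializes region hand-out and statistics */
} RegionPool;

typedef struct {
    unsigned char *base, *ptr, *end;   /* current region and bump pointer */
} ThreadCtx;

static void pool_init(RegionPool *p, unsigned char *buf, size_t size,
                      size_t n_regions)
{
    assert(n_regions > 0 && size >= n_regions);
    p->start = buf;
    p->region_size = size / n_regions;
    p->n_regions = n_regions;
    p->next = 0;
    p->retired_bytes = 0;
    pthread_mutex_init(&p->lock, NULL);
}

/* Slow path: account for the old region, then take the next free one. */
static int region_grab(RegionPool *p, ThreadCtx *c)
{
    int ok = 0;
    pthread_mutex_lock(&p->lock);
    if (c->base) {
        p->retired_bytes += (size_t)(c->ptr - c->base);
    }
    if (p->next < p->n_regions) {
        c->base = c->ptr = p->start + p->next * p->region_size;
        c->end = c->base + p->region_size;
        p->next++;
        ok = 1;
    } else {
        c->base = c->ptr = c->end = NULL;   /* buffer exhausted */
    }
    pthread_mutex_unlock(&p->lock);
    return ok;
}

/* Fast path: lock-free bump allocation inside the context's own region.
 * Returns NULL when no region is left; a real caller would then flush. */
static void *ctx_alloc(RegionPool *p, ThreadCtx *c, size_t size)
{
    if (size > p->region_size) {
        return NULL;
    }
    if (c->ptr == NULL || (size_t)(c->end - c->ptr) < size) {
        if (!region_grab(p, c)) {
            return NULL;
        }
    }
    void *out = c->ptr;
    c->ptr += size;
    return out;
}

/* Statistics share the lock, so the count is consistent with hand-outs. */
static size_t pool_retired_bytes(RegionPool *p)
{
    pthread_mutex_lock(&p->lock);
    size_t n = p->retired_bytes;
    pthread_mutex_unlock(&p->lock);
    return n;
}
```

In this sketch, a context that fills its region simply takes the next free one, so a busy thread can consume many regions while an idle thread holds at most one. That is the property the benchmark numbers contrast with static partitioning. The unused tail of each abandoned region is wasted, which is consistent with the commit's observation that utilization is close to, but not exactly, 100%.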
bindings	link to Crystal binding	2017-12-23 00:26:40 +08:00
docs	Added note about installing tests dependencies on Mac OS X. Added note about tests failing when required architecture support is disabled in build. (#908 )	2017-10-12 19:56:00 +08:00
include	tcg: introduce regions to split code_gen_buffer	2018-03-14 12:10:29 -04:00
msvc	osdep: introduce qemu_mprotect_rwx/none	2018-03-14 12:10:28 -04:00
qemu	tcg: introduce regions to split code_gen_buffer	2018-03-14 12:10:29 -04:00
samples	Fixed register mistake in comments (#894 )	2017-09-17 16:40:01 +07:00
tests	add 64-bit test demonstrating setting MSRs and FS/GS segments (#901 )	2017-09-29 04:26:23 +08:00
.appveyor.yml	MSYS test (#852 )	2017-06-25 10:11:35 +08:00
.gitignore	qapi: Move qapi-schema.json to qapi/, rename generated files	2018-03-09 11:35:11 -05:00
.travis.yml	use new travis osx image and brew (#935 )	2018-01-05 10:29:49 +08:00
AUTHORS.TXT	import	2015-08-21 15:04:50 +08:00
Brewfile	Update Brewfile	2017-09-30 17:36:44 +07:00
ChangeLog	update ChangeLog	2017-04-20 13:28:02 +08:00
config.mk	Fix document file extension	2016-08-08 17:33:49 +09:00
COPYING	import	2015-08-21 15:04:50 +08:00
COPYING.LGPL2	LGPL2 for all header files under include/unicorn/	2017-12-16 10:08:42 +08:00
COPYING_GLIB	glib_compat: add COPYING_GLIB	2016-12-27 10:15:08 +08:00
CREDITS.TXT	update CREDITS.TXT	2017-04-25 12:56:47 +08:00
install-cmocka-linux.sh	Start moving examples in S files (#851 )	2017-06-25 10:14:22 +08:00
list.c	callback to count number of instructions in uc_emu_start() should be executed first. fix #727	2017-06-16 13:22:38 +08:00
make.sh	Added MSVC support for arm64eb.	2017-04-25 14:23:58 +10:00
Makefile	crypto: introduce new module for computing hash digests	2018-02-17 15:23:17 -05:00
msvc.bat	add msvc.bat	2017-04-21 15:35:40 +08:00
pkgconfig.mk	bump extra version to 2	2017-04-21 15:30:40 +08:00
README.md	add Clojure	2017-12-23 00:32:33 +08:00
uc.c	tcg: define tcg_init_ctx and make tcg_ctx a pointer	2018-03-14 09:43:58 -04:00
windows_export.bat	Make the call out to visual studio extremely resilient	2017-01-02 03:32:48 -08:00

README.md

Unicorn Engine

Unicorn is a lightweight, multi-platform, multi-architecture CPU emulator framework based on QEMU.

Unicorn offers some unparalleled features:

Multi-architecture: ARM, ARM64 (ARMv8), M68K, MIPS, SPARC, and X86 (16, 32, 64-bit)
Clean/simple/lightweight/intuitive architecture-neutral API
Implemented in pure C language, with bindings for Crystal, Clojure, Visual Basic, Perl, Rust, Ruby, Python, Java, .NET, Go, Delphi/Free Pascal and Haskell.
Native support for Windows & *nix (with Mac OSX, Linux, *BSD & Solaris confirmed)
High performance via Just-In-Time compilation
Support for fine-grained instrumentation at various levels
Thread-safety by design
Distributed under free software license GPLv2

Further information is available at http://www.unicorn-engine.org

License

This project is released under the GPLv2 license.

Compilation & Docs

See docs/COMPILE.md file for how to compile and install Unicorn.

More documentation is available in docs/README.md.

Contact

Contribute

If you want to contribute, please pick up something from our GitHub issues.

We also maintain a list of more challenging problems in a TODO list.

CREDITS.TXT records important contributors to our project.