Optimizing Memory Translation Emulation in Full System Emulators

The emulation speed of a Full System Emulator (FSE)
determines for the most part how useful this FSE can be.
This work quantitatively measures where time is spent
in QEMU, an industrial strength full system emulator,
and confirms that dynamic address translation as one
of the most heavily exercised components in the emulator.
This is even though QEMU implements a Software
Translation Lookaside Buffer (sTLB) to accelerate dynamic
address translation. Consequently, this work proposes
a series of sTLB optimizations that aim at reducing
the address translation emulation overhead. The proposed
techniques optimize address translations as well
as sTLB refills and provide an average performance improvement
of 24.1% over the baseline on a wide range of
workloads.

By: Xin Tong, Toshihiko Koju, Motohiro Kawahito

Published in: RT0956 in 2014

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

RT0956.pdf

Questions about this service can be mailed to reports@us.ibm.com .