[Qemu-devel] TCG flow vs dyngen

Stefano Bonifazi Fri, 10 Dec 2010 13:27:17 -0800

Hi all!

From the technical documentation(http://www.usenix.org/publications/library/proceedings/usenix05/tech/freenix/bellard.html)I read:

The first step is to split each target CPU instruction into fewersimpler instructions called /micro operations/. Each micro operationis implemented by a small piece of C code. This small C source code iscompiled by GCC to an object file. The micro operations are chosen sothat their number is much smaller (typically a few hundreds) than allthe combinations of instructions and operands of the target CPU. Thetranslation from target CPU instructions to micro operations is doneentirely with hand coded code.A compile time tool called dyngen uses the object file containing themicro operations as input to generate a dynamic code generator. Thisdynamic code generator is invoked at runtime to generate a completehost function which concatenates several micro operations.

instead from wikipedia(http://en.wikipedia.org/wiki/QEMU) and othersources I read:

The Tiny Code Generator (TCG) aims to remove the shortcoming ofrelying on a particular version of GCC<http://en.wikipedia.org/wiki/GNU_Compiler_Collection> or anycompiler, instead incorporating the compiler (code generator) intoother tasks performed by QEMU in run-time. The whole translation taskthus consists of two parts: blocks of target code (/TBs/) beingrewritten in *TCG ops* - a kind of machine-independent intermediatenotation, and subsequently this notation being compiled for the host'sarchitecture by TCG. Optional optimisation passes are performedbetween them.

- So, I think that the technical documentation is now obsolete, isn't it?

- The "old way" used much offline (compile time) work compiling themicro operations into host machine code, while if I understand well, TCGdoes everything in run-time(please correct me if I am wrong!).. so Iwonder, how can it be as fast as the previous method (or even faster)?


- If I understand well, TGC runtime flow is the following:
    - TCG takes the target binary, and splits it into target blocks

- if the TB is not cached, TGC translates it (or better the targetinstructions it is composed by) into TCG micro ops,

    - TGC compiles TGC uops into host object code,
    - TGC caches the TB,
    - TGC tries to chain the block with others,
    - TGC copies the TB into the execution buffer
    - TGC runs it

Am I right? Please correct me, whether I am wrong, as I wanna use thatflow scheme for trying to understand the code..

Thank you very much in advance!
Stefano B.

[Qemu-devel] TCG flow vs dyngen

Reply via email to