Re: [m5-dev] what scons can do

Gabriel Michael Black Thu, 21 Apr 2011 16:56:33 -0700

Quoting nathan binkert <[email protected]>:

That doesn't really fit with how the ISA files work. They get broken into an
AST, but that gets consumed as it goes,

Does it have to be?

Making it not work that way would likely be very painful. The parser part is finicky (like they all are in any language) and we have lots and lots of very intricate code built on top of it in the form of the descriptions themselves.

and it has a lot of anonymous python
in it that just gets executed somehow. I want to move more into the python,
so the AST will be less and less useful.

Does the AST not contain enough information to know what files are
being generated?  The anonymous python itself creates files?  That
sounds crazy.

This may not be quite right, but offhand here is a summary of the isa parser's inputs and outputs. Going in, the parser starts with main.isa. There are ##includes (there are two #s on purpose) which bring in other .isa files. The parser reads in all those files by following the ##includes, stitches them together into one huge string, and then crunches through it all. As that's being processed, the description can read in other files that have, for instance, microcode in them. This already hits basically the problem we're talking about, since the files involved are determined by execution, not static landmarks like ##include. I "solve" this problem by manually listing all microcode files in the SConscript. It's a nasty hack, but it avoids not rebuilding when microcode changes, which is even more annoying.
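To make the static half of that concrete, here is a minimal sketch of following ##includes and stitching the files into one string while recording every file touched. It is not the real parser's code. The directive form (##include "file.isa" on its own line), the function name expand_includes, and resolving paths relative to the including file are all assumptions for illustration.

```python
import os
import re

# Assumed directive form: ##include "file.isa" on a line by itself.
INCLUDE_RE = re.compile(r'^[ \t]*##include[ \t]+"([^"]+)"[ \t]*$', re.MULTILINE)


def expand_includes(path, seen=None, _active=()):
    """Return (text, files): the stitched text and every file visited.

    Hypothetical helper, not the isa parser's own API.
    """
    path = os.path.abspath(path)
    if path in _active:
        raise ValueError("circular ##include of " + path)
    if seen is None:
        seen = []
    if path not in seen:
        seen.append(path)

    with open(path) as f:
        text = f.read()
    base = os.path.dirname(path)

    def replace(match):
        # Assumption: included paths are relative to the including file.
        child = os.path.join(base, match.group(1))
        return expand_includes(child, seen, _active + (path,))[0]

    # Each directive line is replaced by the included file's expanded text.
    return INCLUDE_RE.sub(replace, text), seen
```

The returned file list is the part a build tool could use as a dependency list. Anything the description opens while it executes, such as the microcode files, would not show up in it, which is exactly the gap the manual SConscript list papers over.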

On the output side, the parser generates two files which implement the decoder, decoder.hh and decoder.cc. It also outputs one file for each CPU model involved that implements the exec (and related) functions. These are called something_something_exec.cc I think.

The problem is that for x86 for sure, but also now for ARM and likely for any other ISA with a lot of complexity and/or fidelity, those output files get to be very, very large. It's easy to run out of RAM, especially if scons tries to build more than one at a time or if you're on a smaller machine. Then the build grinds to a halt, as does everything else. Often the only solutions are to wait until it finishes or you die (whichever comes first), or to reboot the machine and try again with more conservative settings.

What this mechanism would do would be to allow you to put different portions of the output into different files which would be compiled independently. Then scons compiling three things at once is equivalent to three normal files at once, not a million lines of code all at once.

To do that, you have to decide how to split things up so they still build. You could try hacking things up in an automated way, but that would likely either be overly restrictive, ineffective, incorrect, or all three. My plan is to expose the idea of different files to the ISA description author so that they can choose to put all the, say, floating point loads and stores together along with their utility functions. These may not all be defined in the same place or even in the same directory since there are ordering constraints in python as well as in the resulting C++. It might be that you define output files in a fixed place (def output floatMem, for instance) and then refer to them later when it comes time to put C++ someplace. That might make the most sense. It could also be that you have batches of similarly named output clusters (these will likely involve more than one file at a time, like a .cc and a .hh) and you'd want to generate them all programmatically. I'm not sure exactly what it would look like to select an output file either. You might want to just put down markers that say, essentially, "henceforth output goes in floatMem". Or pass an output cluster name into the outputting function (whatever that looks like).
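As a rough illustration of the two selection styles just described (a "henceforth" marker versus passing a cluster name to the outputting function), something like the following could work. Everything here is hypothetical: the class OutputClusters, its method names, and the choice of one .hh plus one .cc per cluster are assumptions for the sketch, not anything the parser has today.

```python
class OutputClusters:
    """Hypothetical registry that routes generated C++ to named clusters."""

    def __init__(self):
        self._declared = {}   # cluster name -> (header file, source file)
        self._chunks = {}     # file name -> list of code snippets
        self._current = None  # cluster picked by the last select() call

    def declare(self, name):
        """Declare a cluster up front, e.g. 'floatMem'."""
        if name in self._declared:
            raise ValueError("cluster already declared: " + name)
        # Assumption: each cluster is one header plus one source file.
        files = (name + ".hh", name + ".cc")
        self._declared[name] = files
        for fname in files:
            self._chunks[fname] = []

    def select(self, name):
        """Marker style: henceforth output goes in this cluster."""
        if name not in self._declared:
            raise KeyError("undeclared cluster: " + name)
        self._current = name

    def emit(self, code, header=False, cluster=None):
        """Append code to the named cluster, or to the selected one."""
        name = cluster if cluster is not None else self._current
        if name is None:
            raise RuntimeError("no output cluster selected")
        if name not in self._declared:
            raise KeyError("undeclared cluster: " + name)
        hh, cc = self._declared[name]
        self._chunks[hh if header else cc].append(code)

    def files(self):
        """Map every declared file name to its accumulated text."""
        return {f: "\n".join(c) for f, c in self._chunks.items()}
```

Usage would be along the lines of declare("floatMem"), then select("floatMem") followed by plain emit(...) calls, or emit(..., cluster="floatMem") for a one-off. Because files() lists every declared file even when it is empty, the set of outputs depends only on the declarations, not on what the description happens to emit.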

It might work best in the near to mid term to put in static, ISA language defined declarations of output files which would be feasible to scan. In the mid to long term, though, I want to move away from having a custom language and move towards having the same machinery (extended and parameterized more) exposed as a module or something inside regular python scripts. Maybe something a la scons's SConscripts, which are regular python that run in an armature, sort of.
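For the near-term static option, the scan itself could be very small. This is only a sketch: the declaration syntax is borrowed from the "def output floatMem" example above and does not exist in the ISA language, and scan_output_targets and the .hh/.cc suffixes are likewise made up for illustration.

```python
import re

# Hypothetical syntax: "def output <name>" on its own line, optional ';'.
DECL_RE = re.compile(
    r'^[ \t]*def[ \t]+output[ \t]+([A-Za-z_]\w*)[ \t]*;?[ \t]*$',
    re.MULTILINE)


def scan_output_targets(isa_text, suffixes=(".hh", ".cc")):
    """List the files a description declares, without executing it."""
    names = []
    for match in DECL_RE.finditer(isa_text):
        if match.group(1) not in names:
            names.append(match.group(1))
    # Assumption: every declared cluster yields one file per suffix.
    return [name + suffix for name in names for suffix in suffixes]
```

A scan like this only sees declarations written out literally, so clusters generated programmatically would be invisible to it. That is the trade-off of the static approach.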

I would be hesitant to make the ISA descriptions open and write to files themselves directly, but primarily because that would be cumbersome and error-prone. I don't think we should design it out, though, unless it's just too evil to support.


Gabe
