Re: [Haskell-cafe] Compilers: Why do we need a core language?

wren ng thornton Fri, 23 Nov 2012 20:29:09 -0800

On 11/20/12 6:54 AM, c...@lavabit.com wrote:

Hello,


I know nothing about compilers and interpreters. I checked several
books, but none of them explained why we have to translate a
high-level language into a small (core) language. Is it impossible
(very hard) to directly translate high-level language into machine
code?

It is possible to remove stages in the standard compilation pipeline,and doing so can speed up compilation time. For example, Perl doesn'tbuild an abstract syntax tree (for now-outdated performance reasons),and instead compiles the source language directly into bytecode (whichis then interpreted by the runtime). This is one of the reasons why Perlis (or was?) so much faster than other interpreted languages like Pythonetc. But there are some big problems to beware of:

* Not having a concrete representation for intermediate forms can ruleout performing obvious optimizations. And I do mean *obvious*optimizations; I can talk more about this problem in Perl, if you reallycare.

* Not having a concrete representation for intermediate forms meansmixing together code from many different stages of the compilationprocess. This sort of spaghetti code is hard to maintain, and evenharder to explain to new developers.

* Not having a concrete representation for intermediate forms can leadto code duplication (in the compiler) because there's no convenient wayto abstract over certain patterns. And, of course, repeating code isjust begging for inconsistency bugs due to the maintenance burden ofkeeping all the copies in sync.

All three points are major driving forces in having intermediate forms.Joachim Breitner gave some illustrations for why intermediate forms are"inevitable". But then, once you have intermediate forms, if you'reinterested in ensuring correctness and having a formal(izable)semantics, then it makes sense to try to turn those intermediate formsinto an actual intermediate language. Intermediate forms are just animplementation detail, but intermediate languages can be reasoned aboutin the same ways as other languages. So it's more about shiftingperspective in order to turn systems problems (implementation details)into language problems (semantics of the Core).

Furthermore, if you're a PL person and really are trying to ensurecorrectness of your language (e.g., type safety), you want to try tomake your proof obligation as small as possible. For convenience toprogrammers, source code is full of constructs which are all more orless equivalent. But this is a problem for making proofs because when weperform case analysis on an expression we have to deal with all thosedifferent syntactic forms. Whereas if you first compile everything downinto a small core language, then the proof has far fewer syntactic formsit has to deal with and so the proof is much easier. Bear in mind thatthis isn't just a linear problem. If we have N different syntacticforms, then proving something like confluence will require provingO(N^2) cases since you're comparing two different terms.


--
Live well,
~wren

_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Compilers: Why do we need a core language?

Reply via email to