Re: Explicit TCE

Tyler Jameson Little Fri, 12 Oct 2012 11:20:29 -0700

No idea what you are talking about.

I'm not sure which part wasn't clear, so I'll try to explainmyself. Please don't feel offended if I clarify things youalready understand.

An optimizable tail call must simply be a function call. Thecurrent stack frame would be replaced with the new function, soanything more complex than a simple function call would requiresome stack from the preceding function to stick around in the newfunction, thus requiring the old stack to stick around.

For example, te following is not optimizable the old stack (theone with 3) needs to be maintained until foo() returns, which isnot TCE.


return foo() * 3

Since the old stack won't be around anymore, that leaves us within a sticky situation with regard to scope():


http://dlang.org/statement.html#ScopeGuardStatement

If the current stack is going to be replaced with data fromanother function call, the behavior of scope() is undefined. Thescope that scope() was in has now been repurpose, but the scopeis still kind of there. If scope() is allowed, they must beexecuted just before the tail call, otherwise it will beoverwritten (or it has to stick around until the actual stackframe is cleared. Consider:


void a() {
  become b();
}

void b() {
  // when does this get called?
  scope(exit) writeln("exited");
  become a();
}

If we allow scope(), then the line should be written before thecall to a(). If we don't, then this is a compile time error. Ilike disallowing it personally, because if the scope(exit) callfrees some memory that is passed to a, the programmer may thinkthat it will be called after a exits, which may not be the case.


void a(void* arr) {
  // do something with arr
  become b();
}

void b() {
  void* arr = malloc(sizeof(float) * 16);
  scope(exit) free(arr);
  become a(arr);
}

I just see this as being a problem for those who don't fullyunderstand scoping and TCE.

My mention of overhead was just how complicated it would be toimplement. The general algorithm is (for each become keyword):

* determine max stack size (consider all branches in allrecursive contexts)

* allocate stack size for top-level function
* do normal TCE stuff (use existing stack for new call)

The stack size should be known at compile time for cases like theone above (a calls b, b calls a, infinitely) to avoid infinitelyexpanding stack. A situation like this is a memory optimization,so forcing guaranteed stack size puts an upper-bound on memoryusage, which is the whole point of TCE. If the stack is allowedto grow, there is opportunity for stack overflow.

My use case for this is a simple compiler, but I'm sure thiscould be applied to other use cases as well. I'd like to producecode for some BNF-style grammar where each LHS is a function.Thus, my state machine wouldn't be a huge, unnatural switchstatement that reads in the current state, but a series of codebranches that 'become' other states, like an actual state machine.


For example:

A := B | C | hello
B := bye | see ya
C := go away

void A() {
    char next = getNext();
    if (next == 'b' || next == 's') {
        become B();
    }
    if (next == 'g') {
        become C();
    }
    if (next == 'h') {
        // consume until hello is found, or throw exception
        // then put some token on the stack
    }
}

void B() {
    // consume until 'bye' or 'see ya'
}

void C() {
    // consume until 'go away'
}

This would minimize memory use and allow me to write code thatmore closely matches the grammar. There are plenty of other usecases, but DSLs would be very easy to implement with TCE.

Re: Explicit TCE

Reply via email to