Re: Chapel error messages and semantics questions

Brad Chamberlain Tue, 03 Mar 2015 14:36:38 -0800


Hi Chris -

Sorry for the delayed response. It's feature-freeze week for Chapel 1.11, so everyone's a bit underwater right now. Vassily Litvinov and I tag-teamed on composing this response. When something is written in the first person, it's typically me speaking unless noted otherwise.

I’m asking these questions from the perspective of someone who would write generic parallel libraries similar to Thrust or Threading Building Blocks. The examples I've included below were designed as tests to reveal details of Chapel's semantics. They're meant to help identify ways to write robust generic code that works predictably in all use cases.

I'll mention at the outset that while generic programming has been a motivating theme in Chapel from the start, the support has not yet reached the state of maturity that we'd hoped for. Specifically, we've had a multi-year academic collaboration running with the goal of adding support for constrained generics to Chapel (similar to the proposed C++ 'concepts' feature), but that effort has languished, and we're currently trying to figure out how to staff it internally for the coming year in order to gain some traction. We rely on generics heavily in our own libraries (e.g., our support for domain maps), but in a way that suffers from many of the same issues as C++ templates.

More generally in response to your comments, I want to say that those of us who originally architected the language came at it much more from the perspective of parallel programmers seeking to significantly improve the status quo rather than as Programming Language experts (with a capital "PL"). At times, laxness in our specification can be attributed to this perspective or lack of expertise. We're very open to having those who are more "expert PL" types give us feedback to help improve the language, proposing changes to the specification to make it clearer/more bullet-proof, etc.

1. Higher-order functions

I wrote a test to see if higher-order functions worked.  Compiling this
code gives me internal error CAL0056.  What should happen in this code?
Are higher-order functions supported?


proc times2(x) {
   return x * 2;
}

proc map(f, t) {
   return (f(t[0]), f(t[1]))
}

map(times2, (1, 1+1i))

Chapel's current support for higher-order functions was developed as an intern project that has not received (m)any additional development cycles since then. It continues to be something that we would like to support more fully, but which has not been on the critical path for current/prospective users or for our own internal use cases.

One limitation of the current implementation is that generic functions, such as your times2(), cannot be used as higher-order functions. For details, please see:


  $CHPL_HOME/doc/technotes/README.firstClassFns

which calls out this limitation.

Practically speaking, I think the reason that this hasn't been more of a limiting factor for our own use is that we tend to turn such cases inside-out, e.g., by providing iterators for user-defined data structures and calling the 'times2' function in the body of the loop rather than passing the function into a procedure to execute the loop. Thus, we'd end up with something like:


        for[all] e in myADT do
          times2(e);

And/or, we'd define the data structure to be promotable, in which case, we'd simply write:


        times2(myADT);

However, both of these approaches are currently fairly specific to homogeneous data structures (and will be until there's more of a first-class loop unrolling story for user-defined iterators).

None of this is to say that we wouldn't value higher-order functions as well; I'm simply capturing why I think that their lack hasn't been a major barrier to our own support for rich (homogeneous) data structures to date.
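
As a minimal sketch of the two approaches above: the array A, the result array B, and the int-typed formal on times2() are illustrative assumptions rather than anything taken from your code. The formal is given a concrete scalar type here so that calling the function on an array is a promoted call.

        // Non-generic scalar function (the 'int' formal is an assumption).
        proc times2(x: int): int {
          return x * 2;
        }

        var A = [1, 2, 3];

        // Loop form: call the function in the body of the loop.
        forall a in A do
          a = times2(a);

        // Promoted form: the scalar function is applied elementwise.
        var B = times2(A);

        writeln(A);   // 2 4 6
        writeln(B);   // 4 8 12

In both forms the function is named at the call site rather than passed as a value, which is why the restriction on generic first-class functions doesn't come into play.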

2. Semantics of references and values

I had trouble understanding when Chapel uses value vs. reference
semantics.

Let's start with your inferences first and then explain what's happening in your example:

What I infer from this is that
* Domains are mutable objects

Yes, domain variables are mutable. Whether domains are objects depends on your definition of "objects." They are not objects in the Java sense.

* Domain variables hold references to domains
* Arrays of domains hold references to domains

This isn't quite right. Domain and array variables have value-oriented semantics in most every context. The value of a domain can be thought of as the set of indices that it describes; the value of an array is effectively the collection of elements that it describes and their values.

The identity of a domain also matters in certain contexts -- primarily when it is the domain that defines an array's index set. In declaring an array, the identity (not value) of its domain matters and so is captured and forms an ongoing property of the array throughout its lifetime (we say that the domain's identity is part of the array's type).

* Assignment on domains (lhs = rhs;) does not modify the lhs reference, but
copies the value of the rhs object into the value of the lhs object

That's correct. It essentially makes the lhs domain describe the same indices as the rhs domain, while each retains its individual identity and type (e.g., one could be a distributed domain and the other local).

For instance, in the following situation, resizing an array exposes the semantic difference. At the beginning of the code, A[5] is the domain of B.
// Create array B whose size is given by A[5].
var myDom = {1..5};
var A : [myDom] myDom.type;
var B : [A[5]] int;

// B can be resized by assigning to A[5].
A[5] = {1..3};
writeln(B.domain);

// After resizing A, B can no longer be resized by assigning to A[5].
myDom = {1..4};
myDom = {1..5};
A[5] = {1..2};
writeln(B.domain);


Here's what's going on with your example:

When you shrink A by re-assigning its domain to {1..4}, A[5] is no longer a valid element of the array. Typically, this means that that domain would cease to exist; however, since B was declared using A[5] as its domain and B requires that domain for its definition, the domain that A[5] described is kept around even though A[5] can't be used to refer to it anymore.

When you grow A's domain back to {1..5}, a new domain value is created for that fifth element, but its identity is not in any way associated with the previous A[5] value, nor with B, so subsequent assignments to it don't affect B.

If it helps, Chapel's domains are currently implemented as a record that wraps a reference-counted class. This gives them the mix of value and reference semantics that I'm describing and you're seeing above. B's reference to the original A[5] domain keeps that domain value alive, but when a new A[5] domain is created there is no relationship between the two things.
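
A smaller sketch of the same value-versus-identity distinction, without the array of domains. The names D, E, and A here are hypothetical and not from your example:

        var D = {1..5};
        var A: [D] int;       // A captures the identity of D

        var E = D;            // E copies D's value but is a distinct domain

        D = {1..3};           // assigning to D resizes A
        writeln(A.domain);    // {1..3}

        E = {1..10};          // assigning to E has no effect on A
        writeln(A.domain);    // {1..3}

E starts out describing the same indices as D, but only D's identity is tied to A, so only assignments to D change A's index set.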

I'll mention that there's been discussion of changing Chapel's semantics so that having A[5] go away like that would render B unusable (i.e., "user error to refer to an array whose domain no longer exists") rather than the current scheme, to simplify the semantics and implementation. It's not clear to me which way this will go yet, and if you have input, we'd be happy to hear it.

At the end, is it possible to restore the situation that A[5] is the domain of B?


Nope.

Is there a way to modify the reference held in a domain variable, instead
of modifying the domain that it references?

Not within the language, only by mucking with the internals of the implementation.

Incidentally, domain assignment leads to some oddness because domain
expressions are lvalues.  The statement ({0..1}) = {0..2}; compiles, while
(1) = 2; doesn’t compile.  Without the parentheses, both are syntax errors.

I'd call this a bug, and I don't think it's one we're aware of -- thanks for pointing it out. Personally, I don't view this as a bug in domain assignment so much as in const-ness checking and/or how the current compiler implements domain literals like {0..1}, but that's just a guess.

3. Subclassing

It looks like Chapel has subclassing with virtual methods.  I tried it out
with the code shown below, which tests how generics interact with
subclassing.  It gives internal error CHE0496.  What does that error mean?


class Base {
   proc foo(type T) : int;
}

class Derived1 {
   proc foo(type T) : int {
     var x : T;
   }
};

class Derived2 {
   proc foo(type T) : int {
     var x : T;
     var y : T;
   }
};

var x : Base;

if (stdin.read(bool)) {
   x = new Derived1();
}
else {
   x = new Derived2();
}

x.foo((int, int));

Internal errors mean you hit a bug in the compiler. "CHE0496" directs us to where in the compiler it occurred; this has no intended meaning to the end user. Throwing the --developer flag decrypts it slightly, but throwing such cases to us is the right thing to do.


This program should generate a clear user error.

In the current implementation/specification, procedure prototypes (those with no body like your Base.foo()) are supported for interoperability purposes only and are not intended as a means of creating a pure virtual function.


If I change your code as follows:

* add a body to Base.foo()
* add return statements to the foo() overloads
* declare the Derived classes as being derived from Base

then it compiles without errors:

class Base {
  proc foo(type T) : int { return -1; }
}

class Derived1 : Base {
   proc foo(type T) : int {
     var x : T;
     return 2;
   }
};

class Derived2 : Base {
   proc foo(type T) : int {
     var x : T;
     var y : T;
     return 3;
   }
};

var x : Base;

if (stdin.read(bool)) {
   x = new Derived1();
}
else {
   x = new Derived2();
}

x.foo((int, int));

4. Dependent types

One nice feature of Chapel is that function parameters can be used in the
types of other parameters.  This makes it possible to impose constraints on
array domains, such as writing a function where all array parameters have
equal domains.  But why is it sometimes an error for a parameter to
reference an earlier parameter, while other times it’s an error for a
parameter to reference a later parameter?

I believe the intention is that a later argument should always be able to refer to an earlier one -- the only cases I'm aware of where this is not supported are simply cases that we haven't gotten to yet (bugs / unimplemented features), rather than intentional decisions.

I believe it should be an error for earlier arguments to refer to later ones, in order to establish a well-defined order of evaluation. One might argue that cases in which one can currently get away with this should result in errors.

Why does the error message say “’reindex’ used before defined”?


// error: ‘reindex’ used before defined
proc bad_1(const sizes : [{0..1}] int, ref A : [sizes[0]] int) {}

This is another bug in the compiler, sorry. 'reindex' is a variable inserted internally by the compiler, in this case incorrectly.

Note also that sizes[0] is an integer and so cannot be used to specify an array's domain (you may want 0..sizes[0], for example, though this doesn't fix the bug).

// First parameter references second, this works
proc good1(ref A : [sizes[0]] int, const sizes : [{0..1}] int) {}

This works (with a tweak) because under the hood 'sizes' is used after good1() has started. good1() resizes the incoming 'A' to the domain specified by sizes[0]. The tweak is to make 'sizes' be an array of domains rather than integers.

// Error: ‘D’ used before defined
proc bad_2(const n : D.idxType, D : domain(1,int,false)) {}

// Second parameter references first, this works but is the opposite order
// from good1
proc good2(D : domain(1,int,false), const n : D.idxType) {}

Both examples match our intention. We could probably support your bad_2 example; currently we do not. In most cases, we require each variable to be defined before it is used.
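
For illustration, here is a minimal sketch of the supported direction, in which a later formal's type refers to an earlier formal. The procedure name inBounds and its body are hypothetical, and domain(1) is used as shorthand for a 1D, int-indexed, non-strided domain:

        // The type of 'i' refers to the earlier formal 'D'.
        proc inBounds(D: domain(1), i: D.idxType): bool {
          return D.low <= i && i <= D.high;
        }

        writeln(inBounds({1..5}, 3));   // true
        writeln(inBounds({1..5}, 9));   // false

Swapping the two formals would put the use of D before its definition, which is the bad_2 case.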

5. Types

The first sentence of the chapter on types in the language specification
0.96 says, “Chapel is a statically typed language with a rich set of
types.”  This sentence is misleading because the type system described in
the language spec is not a static type system.  From what I’ve seen, I
think Chapel’s type annotations would be best understood as dynamically
checked assertions.

Chapel’s type system is not a static type system because types are
intermingled with evaluation in a way that prevents types from being
checked, once and for all, at compile time.  Verifying type-correctness at
compile time is the point of a static type system.  Chapel doesn’t do
that.  Moreover, types can have side effects and can depend on mutable
values, which pretty much precludes Chapel from doing that in the future.

What are Chapel’s types, really?

Vass writes: I agree, in some cases we rely on types at run time. For example when casting between class types or when performing array operations. In most other cases, our intention is to ensure type safety at compile time, making dynamic type checking unnecessary.



Brad writes:

One closing comment that's also a bit of an open question: You mentioned using an array argument's formal type as a means of specifying a constraint on that variable. I.e., one might read:


        proc foo(X: [?D] int, Y: [D] real) { ... }

as being a constraint that X and Y have the same domain. At present, when a formal argument type names a specific rectangular domain like this, it constrains the array argument to have that size/shape, but it could be a different index set, and the compiler "reindexes" the array such that it can be accessed using D's indices within the function. Thus, the above could be called with:


        var A: [1..3] int;
        var B: [1..10] real;

        foo(A, B[5..7]);

and within foo(), both A and the 5..7 slice of B could be indexed using the indices 1..3.

I mention this both because it's a common misunderstanding (and one that your mail seemed to suggest you'd fallen into), but also because we've been planning on splitting these two cases ("Constrain the actual to have this domain" vs. "reindex the actual to have this domain") into two separate things to avoid confusion and because the second interpretation has a much higher runtime overhead in general than the first (and yet, is not the common case).
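
To make the "reindex" interpretation concrete, here is a rough sketch that requests the reindexing explicitly at the call site instead of relying on the formal's type. It assumes the array reindex() method taking a domain is available in your version; the helper show() is hypothetical:

        // Prints the index set and elements of whatever view it is handed.
        proc show(X: [] real) {
          writeln(X.domain);
          writeln(X);
        }

        var B: [1..10] real;
        forall i in B.domain do
          B[i] = i;

        show(B[5..7]);                    // slice, indexed by 5..7
        show(B[5..7].reindex({1..3}));    // same elements, indexed by 1..3

The second call is the one that carries the extra translation between index sets, which is where the additional runtime overhead comes from.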

Thanks for your questions and bugs (which we'll file). If you have follow-up questions or proposed improvements to the language / specification / implementation based on this, please let us know (if you're interested in contributing those proposals as patches against the repository, all the better!)


Thanks,
-Brad and Vass

_______________________________________________
Chapel-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/chapel-users
