Re: [racket-users] Re: [racket] Unsafe version of require/typed?

Neil Toronto Fri, 01 May 2015 08:58:59 -0700

Your preferences are my command. :)


**** Actual Performance Bottlenecks (and Workarounds) ****

(1) Plot's sending of `snip%` instances from untyped to typed code made them so slow that they were unresponsive. I got around it by making helper functions to create the snips, and inserting their types directly into the base type environment using the `typed-racket/base-env/extra-env-lang` language. (See "plot-gui-lib/plot/private/gui/lazy-snip-typed.rkt".)

(2) Again in Plot, I needed to give types to `dc` (from `pict`), and helper functions to create `pdf-dc%`, etc., instances. To keep rendering plots using those targets from taking $MANY MB memory each (I think it was in the range 7-14 MB), I inserted the helper functions' types directly into the base type environment. (See "plot-lib/plot/private/no-gui/evil.rkt".)

(3) In Pict3D, see "pict3d/private/gui/pict3d-bitmap.rkt" and "pict3d/private/gui/pict3d-canvas.rkt" for more examples of the same. Using `require/typed` would have forced me to favor either typed or untyped use of Pict3D for rendering on canvases and bitmaps. Using `extra-env-lang` is a little dangerous, but favors both.

(4) Again in Pict3D: Racket's FFI doesn't have a typed interface, so I wrote a subset of it using the `extra-env-lang` language. I couldn't have used `require/typed` because `memcpy` and similar functions have cases that can't be distinguished by arity.

(5) I did the same for OpenGL. I could have used `require/typed`, but speed tests showed I would get at most 5000 OpenGL calls per 60Hz frame that way, which is too few to do anything serious. Using `extra-env-lang` to insert the types into the base type environment, the upper bound is around 60000 OpenGL calls per frame, which is plenty.

(6) Anything polymorphic takes a lot of time to cross the boundary, especially if it's higher-order. One higher-order example is using `math/array` from untyped code: operations on newly created arrays take over 50x the time they do in typed code. Jens Axel and Ryan Culpepper have worked around it by wrapping matrices (which are arrays) with something like


  (struct matrix ([value : (Matrix Real)]) #:transparent)

or some concrete type other than `Real`. Then they create wrapper functions for `math/matrix` exports. The `matrix?` contract is O(1), so `matrix` instances cross over cheaply.
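
A minimal sketch of that wrapping pattern, assuming a typed module that is meant to be required from untyped code. The names `real-matrix`, `lists->real-matrix`, `real-matrix->lists` and `real-matrix*` are hypothetical; the struct is not called `matrix` here only because `math/matrix` already exports that name. This is an illustration of the idea, not the code Jens Axel or Ryan actually wrote.

```racket
#lang typed/racket/base
;; Sketch: hide the polymorphic (Matrix Real) behind a monomorphic struct so
;; that only the struct predicate has to be checked at the untyped boundary.
(require math/matrix)
(provide real-matrix? lists->real-matrix real-matrix->lists real-matrix*)

;; Hypothetical wrapper struct (the email's version is named `matrix`).
(struct real-matrix ([value : (Matrix Real)]) #:transparent)

;; Build a wrapped matrix from first-order data (a list of rows).
(: lists->real-matrix (-> (Listof (Listof Real)) real-matrix))
(define (lists->real-matrix rows)
  (real-matrix (list*->matrix rows)))

;; Convert back to first-order data for untyped clients.
(: real-matrix->lists (-> real-matrix (Listof (Listof Real))))
(define (real-matrix->lists m)
  (matrix->list* (real-matrix-value m)))

;; Wrapper for one math/matrix export; the array itself never crosses over.
(: real-matrix* (-> real-matrix real-matrix real-matrix))
(define (real-matrix* a b)
  (real-matrix (matrix* (real-matrix-value a) (real-matrix-value b))))
```

Untyped code would then call, for example, `(real-matrix* a b)` on values it got from `lists->real-matrix`, and unwrap the result only when it needs the entries.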

(7) Untyped Pict3D users wouldn't tolerate a deep check of all shapes in a scene every time it crosses the contract boundary. (Complex scenes have thousands of shapes, and scenes cross over a lot.) But I also want users to eventually be able to extend Pict3D with new kinds of shapes. So scene functions that with polymorphism would look like this:


  (: add-shape (All (A) (-> (Scene A) A (Scene A))))

look like this instead:

  (: add-shape (-> Scene Shape Scene))

where `Shape` is a struct type that new shapes must inherit from.
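
As a rough sketch of what the monomorphic version looks like in a typed module: `Sphere`, `empty-scene` and the list-of-shapes representation below are hypothetical stand-ins chosen to keep the example short, and say nothing about how Pict3D actually stores scenes.

```racket
#lang typed/racket/base
;; Sketch: extensibility through struct inheritance instead of a type parameter.
(provide (struct-out Shape) (struct-out Sphere) (struct-out Scene)
         empty-scene add-shape)

;; Base struct type that every kind of shape must inherit from.
(struct Shape () #:transparent)

;; A hypothetical user-defined shape; it is a Shape by inheritance.
(struct Sphere Shape ([radius : Real]) #:transparent)

;; Hypothetical scene representation: just a list of shapes.
(struct Scene ([shapes : (Listof Shape)]) #:transparent)

(define empty-scene (Scene '()))

;; No (All (A) ...) in the type, so nothing polymorphic appears at the boundary.
(: add-shape (-> Scene Shape Scene))
(define (add-shape scene shape)
  (Scene (cons shape (Scene-shapes scene))))
```

A new kind of shape is added by declaring another substruct of `Shape`; `add-shape` and its type stay the same.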


**** General Observations ****


(A) The contract boundary makes objects very slow and very memory-heavy.

(B) Using `extra-env-lang` to work around (A) doesn't seem to be terribly dangerous.

(C) The contract boundary makes operations on polymorphic data types slow, and for higher-order data, they stack. For polymorphic, first-order data, they're O(n), where n is the size of the data. For polymorphic, higher-order data, operations are O(n*k), where k is the number of crossings.

(D) Tricks (6,7) for getting around performance bottlenecks in (C) don't generalize to full polymorphism.

(E) I've used `extra-env-lang` to insert types into the base type environment a lot. But I've often been able to avoid using it by changing the types of data (e.g. polymorphic to monomorphic), or by moving or shrinking an abstraction boundary. In the examples I gave, changing types or boundaries wouldn't work or would be too invasive (2), or would change an existing user-facing API in a backward-incompatible way (1).

(F) Some applications (5) are ridiculously sensitive to contract overhead, even fast first-order contracts.

(G) There are functions that `require/typed` can't deal with because it can't generate contracts for them (4). This isn't usually a problem when you're writing both the typed and untyped code, but it's a problem when the untyped code you want to use in TR isn't yours and has been in widespread use for over a decade.



**** Comments and Speculation ****


I'll assume speculation is more OK now that we have examples. :)

It's in our nature to want to tackle (G) first because it's well-defined. But it's also the least pressing. (You can always write an untyped wrapper and use `require/typed` to import the wrapper. I know this goes against TR's goals, but the workaround is easy and obvious.) Here's how I would prioritize:


 1. Polymorphic, first-order data
 2. Polymorphic, higher-order data
 3. Class contracts
 4. Contracts for weird functions like `memcpy`
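
For reference, the untyped-wrapper workaround mentioned above can be sketched like this. `copy!` is a hypothetical stand-in for a `memcpy`-like function whose four-argument calls mean two different things, and `copy!/offset` is the hypothetical wrapper that pins down one of them; the sketch uses byte strings rather than the real FFI so that it stays safe and self-contained.

```racket
#lang typed/racket/base
;; Sketch: give an arity-ambiguous untyped function a simple type by
;; importing a fixed-arity untyped wrapper instead of the function itself.

(module untyped racket/base
  (provide copy! copy!/offset)
  ;; Hypothetical memcpy-like function. With four arguments it is either
  ;;   (copy! dst offset src count)  or  (copy! dst src count flag),
  ;; so the two cases cannot be told apart by arity alone.
  (define copy!
    (case-lambda
      [(dst src count) (bytes-copy! dst 0 src 0 count)]
      [(dst a b c)
       (if (exact-nonnegative-integer? a)
           (bytes-copy! dst a b 0 c)      ; a = offset, b = src, c = count
           (bytes-copy! dst 0 a 0 b))]))  ; a = src, b = count, c = ignored flag
  ;; Wrapper that exposes exactly one case, with an unambiguous signature.
  (define (copy!/offset dst offset src count)
    (copy! dst offset src count)))

;; The wrapper has an ordinary first-order function type.
(require/typed 'untyped
  [copy!/offset (-> Bytes Natural Bytes Natural Void)])

(define buf (make-bytes 4 0))
(copy!/offset buf 1 #"ab" 2)  ; copies "ab" into buf starting at index 1
```

The cost is one wrapper per case that typed code needs, plus the usual contract check on each call.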

In the meantime, throw us a bone like `require/typed/unsafe` to replace `extra-env-lang`. I would love to have something like it, if only to avoid creating two new files and remembering all the crazy incantations when I have to use `extra-env-lang`.


Neil ⊥

On 05/01/2015 08:02 AM, Matthias Felleisen wrote:


What I'd much prefer at the moment over speculative solutions are reports of 
actual performance bottlenecks. -- Matthias





On May 1, 2015, at 1:09 AM, michael.ballantyne wrote:

I've started using Typed Racket several times recently only to flip the switch 
to #lang typed/racket/no-check or remove types entirely. Something like Vincent 
suggests with an option to write with types and have them checked but turn off 
the type-driven optimizer and skip contract checking at typed/untyped 
boundaries would be fantastic.

Having the option to flip this switch either from the code doing the requiring 
or at package installation time seems important, regardless of whether the 
author of the typed code wrote with that in mind. When I wanted to use the 
tr-pfds package from untyped code I had to modify it to use #lang 
typed/racket/no-check because the performance hit of contract checks was 
massive.

A raco option to compile and install a package without Typed Racket 
optimizations or contracts might be another piece of the solution.


On Monday, March 23, 2015 at 1:23:12 PM UTC-6, Robby Findler wrote:

Just to be sure we are on the same page, my comments were in the
context of push back that came for reasons that are unclear to me, but
I think had something to do with the thought that we shouldn't
compromise the type system. My comments were meant to be in that
context, trying to point out what the real value of a type system is;
that is, to give a judgment we can use as a basis for design decisions
here.

As for smothering: the cost of contracts is "observable enough" (you
know what I mean) that we cannot just ignore it, as I'm sure you
agree. And since we do not have an acceptable solution we should
explore generalizing our support for unsafe operations that we already
have. It seems to fit very naturally to think of parallel libraries,
the safe and the unsafe version.

I also like Vincent's idea about coupling the optimizer to the
contracts as a point worth pragmatic exploration. We shouldn't kid
ourselves, however, it is still unsafe and programs that turn it on
can behave in arbitrarily weird ways (when an error is skipped over).

Robby


On Mon, Mar 23, 2015 at 2:12 PM, Matthias Felleisen
<matth...@ccs.neu.edu> wrote:


On Mar 20, 2015, at 5:10 PM, Robby Findler <ro...@eecs.northwestern.edu> wrote:

Well, that's already the case if you use the FFI (which lots and lots
of Racket programs do).

Fundamentally the typechecker is a tool that programmers can choose to
use to make their programs better. It should not try to be more than
that.



I think these statements paint an image in broad brushstrokes that
some people appreciate properly and some don't.

Yes, ffi/unsafe makes Racket programs unsafe. They may introduce
causes for segfaults beyond the Racket engine itself. That's not
a good thing but ffi/unsafe suggests this problem by its name and
I think we try to stick this to a layer where we know it's potentially
problematic.

Otherwise the goal is to smother the unsafety of the existing software
infrastructure. If we don't have such a minimal goal, why bother with
Racket (at least from a research perspective)?

;; ----------------------------------------------------------------------

On Mar 23, 2015, at 1:37 PM, Vincent St-Amour <stamo...@ccs.neu.edu> wrote:

Then there's the possibility of turning off the optimizer (makes most
sense for the `#lang` design), which would compromise by avoiding
contracts, but remain as safe as `#lang racket`. There's already a way
to turn off the optimizer (`#:no-optimize`), so that may be redundant.



This is an acceptable and pragmatic alternative. It gives the programmer
some of the advantages of types (checking, documentation, hooks for tools).
And it does not lower the level of safety that Racket aims for.

On a more general note: It also exposes the Reynolds insight that types
are an inherent part of the meaning (static). As such, compiling uses of
`first` in

(: f (-> [List X Y Z] X))

to not check the listness of the argument is intrinsic to __compilation__,
not __optimization__. I think the strict use of this guideline would establish
another point in our language spectrum.


-- Matthias


