On 7/28/2011 8:19 PM, David Barbour wrote:
On Thu, Jul 28, 2011 at 2:16 PM, BGB <cr88...@gmail.com> wrote:

    striving for simplicity can also help, but even simplicity can have costs:
    sometimes, simplicity in one place may lead to much higher complexity
    somewhere else. [...]

    it is better to try to find a simple way to handle issues, rather than
    try to sweep them under the carpet or try to push them somewhere else.

I like to call this the difference between 'simple' and 'simplistic'. It is unfortunate that it is so easy to strive for the former and achieve the latter.

* Simple is better than complex.
* Complex is better than complicated.
* Complicated is better than simplistic.

The key is that 'simple' must still capture the essential difficulty and complexity of a problem. There really is a limit for 'as simple as possible', and if you breach it you get 'simplistic', which shifts uncaptured complexity onto each client of the model.


yeah. "simplistic" IMO would likely describe the JVM architecture at around 1.0. the design seemed clean and simple enough (well, except the class libraries, where Sun seemed to really like their "huge number of classes" / "huge number of API calls", in stark contrast to the minimalism of the Java language and Java ByteCode...).

some time later, and a mountain of crud is built on top. why?... because JBC could not be extended cleanly or gracefully.

later, when I was working on my own implementation of the JVM (seemed like a good idea at the time), I developed a very nifty hack to the ".class" format to allow more readily extending the file format.


some of my own bytecode formats (designed after all this petered out) took the basic idea and ran with it, essentially discarding the whole rest of the file format (the existence of any structures besides the constant pool seemed unnecessary).
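as a rough sketch of the general idea (the names and layout here are made up for illustration, not my actual format): the container can be little more than a header plus a stream of tagged, length-prefixed pool entries, and since a reader can skip any tag it doesn't recognize, new entry kinds can be added without breaking old tools.

    /* hypothetical layout: nothing but a tagged, length-prefixed constant pool */
    typedef struct {
        unsigned char        tag;    /* entry kind: string, number, bytecode blob, ... */
        unsigned int         len;    /* payload size in bytes */
        const unsigned char *data;   /* payload; interpretation depends on tag */
    } PoolEntry;

    /* walk the pool; entries with unknown tags are simply skipped over */
    int ReadPool(const unsigned char *buf, int size, PoolEntry *ents, int max)
    {
        int pos = 0, n = 0;
        while ((pos + 4) <= size && n < max) {
            ents[n].tag  = buf[pos];
            ents[n].len  = (buf[pos+1] << 16) | (buf[pos+2] << 8) | buf[pos+3];
            ents[n].data = buf + pos + 4;
            if (pos + 4 + (int)ents[n].len > size)
                break;                          /* truncated entry */
            pos += 4 + ents[n].len;
            n++;
        }
        return n;
    }

the point being: the self-describing pool carries everything (code included), so there is nothing else in the file to get out of sync when the format grows.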

now, whether this is simple or simplistic, who knows?...

sadly, the contained bytecode itself is not quite so elegant (at the moment it has 533 opcodes). it has also "evolved" over much of a decade now, but has never really had an external file-format. in its abstract sense, it is an RPN and blocks-based format, vaguely like PostScript (stack machine with a dynamic/variant type-model, makes heavy use of 'mark', ...)
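to give a rough flavor of the 'mark' usage (opcode names and the representation here are made up for the example; in the real thing a mark would be its own type in the variant model rather than a magic value): a call site pushes a mark, then its arguments, and the call opcode scans back down to the mark to discover how many arguments it was given, much as in PostScript.

    /* hypothetical sketch of mark-based calls in an RPN-style stack machine */
    enum { OP_MARK, OP_PUSHNUM, OP_CALL };

    #define MARK_TAG ((double)0x7FF8dead)   /* placeholder sentinel, for the example only */

    static double stack[256];
    static int sp = 0;

    void Step(int op, double imm)
    {
        int base; double sum;
        switch (op) {
        case OP_MARK:    stack[sp++] = MARK_TAG; break;   /* open a "frame" of arguments */
        case OP_PUSHNUM: stack[sp++] = imm;      break;
        case OP_CALL:
            /* scan down to the mark; everything above it is an argument */
            for (base = sp; base > 0 && stack[base - 1] != MARK_TAG; base--);
            /* stand-in "function": just sums however many arguments it got */
            for (sum = 0; sp > base; ) sum += stack[--sp];
            sp--;                       /* pop the mark itself */
            stack[sp++] = sum;          /* push the result */
            break;
        }
    }

so Step(OP_MARK,0); Step(OP_PUSHNUM,2); Step(OP_PUSHNUM,3); Step(OP_CALL,0); leaves 5 on the stack, without the call needing a fixed arity.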

a close sibling/fork of this IL was used by the C compiler, but that version made some ill-advised simplifications (damage caused by initially dropping nested blocks, using right-to-left ordering for ordinary function calls, but left-to-right for built-ins, ...).


We can conclude some interesting properties: First, you cannot achieve 'simplicity' without knowing your problem or requirements very precisely. Second, the difference between simple and simplistic only becomes visible for a model, framework, or API when it is shared and scaled to multiple clients and use cases (this allows you to see repetition of the uncaptured complexity).

I first made these observations in early 2004, and developed a methodological approach to achieving simplicity:
(1) Take a set of requirements.
(2) Generate a model that /barely/ covers them, as precisely as possible. (This will be simplistic.)
(3) Explore the model with multiple use-cases, especially at large scales. (Developer stories. Pseudocode.)
(4) Identify repetitions, boiler-plate, any stumbling blocks.
(5) Distill a new set of requirements. (Not monotonic.)
(6) Rinse, wash, repeat until I fail to make discernible progress for a long while.
(7) At the end, generate a model that /barely/ overshoots the requirements.


I generally start with a more "complex" description and tend to see what can be shaved off without compromising its essential properties. this doesn't always work though, such as when requirements change or a dropped feature comes back to bite one sometime later.


This methodology works on the simple principle: it's easier to recognize 'simplistic' than 'not quite as simple as possible'. All you need to do is scale the problem (in as many dimensions as possible) and simplistic hops right out of the picture and slaps you in the face.


possible, but at times problems don't fully appear until one tries to implement or test the idea, and one is left to discover that something has gone terribly wrong.


By comparison, unnecessary complexity or power will lurk, invisible to our preconceptions and perspectives - sometimes as a glass ceiling, sometimes as an eroding force, sometimes as brittleness - but always causing scalability issues that don't seem obvious. Most people who live in the model won't even see there is a problem, just 'the way things are', just Blub. The only way to recognize unnecessary power or complexity is to find a simpler way.


IMO, glass-ceilings and brittleness are far more often a result of cruft than of complexity in itself.

plain complexity will often lead to a straightforward solution, albeit often a similarly complex one.

OTOH, cruft (or kludges) will often crumble if poked too hard, or will lead to otherwise unpredictable behavior or bugs.

IMO, the main form of cruft results from trying to side-step a usual source of complexity (properly following APIs and maintaining abstractions) by instead resorting to cheap hacks which work, but which depend on code/behaviors/implementation details/... that they really shouldn't.
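a trivial, made-up example of the kind of hack I mean: reaching directly into another module's struct layout rather than going through its accessor. it works right up until the other module's internals change.

    /* made-up module with an opaque struct and a public accessor */
    typedef struct Foo Foo;
    int Foo_GetCount(Foo *obj);

    int Example(Foo *obj)
    {
        /* the cheap hack: assume the count happens to live 8 bytes into the
           struct. works today, breaks silently the day the layout changes. */
        int hacked = *(int *)((char *)obj + 8);

        /* the boring version: goes through the API and survives layout changes */
        int proper = Foo_GetCount(obj);

        return proper - hacked;   /* 0 for as long as the hack still happens to work */
    }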

sadly, abstraction and modularity can be itself a big source of overall system complexity.


So, when initially developing a model, it's better to start simplistic and work towards simple. When you're done, at the very edges, add just enough power, with constrained access, to cover the cases you did not foresee (e.g. Turing-complete only at the toplevel, or only with a special object capability). After all, complicated but sufficient /is/ better than simplistic or insufficient.


fair enough, though I prefer to operate in the other direction.


I've been repeating this for 7 years now. My model was seeded in 2003 October with the question: /"What would it take to build the cyber-world envisioned in Neal Stephenson's Snow Crash?" /At that time, I had no interest in language design, but that quickly changed after distilling some requirements. I took a bunch of post-grad courses related to language design, compilers, distributed systems, and survivable networking. Of course, my original refinement model didn't really account for inspiration on the way. I've since become interested in command-and-control and data-fusion, which now has a major influence on my model. A requirement discovered in 2010 March led to my current programming model, Reactive Demand Programming, which has been further refined: temporal semantics were added initially to support precise multimedia synchronization in a distributed system, my temporal semantics have been refined to support anticipation (which is useful for ad-hoc coordination, smooth animation, event detection), and my state model was refined twice to support anticipation (via the temporal semantics) and live programming.


Snow Crash: "dot pattern from space -> brain-damage -> glitching avatar -> biological virus" => "how does this work, exactly?..."

but, this book does hold some special status for me: I actually bothered to read all of it.


I started out with language design and VM implementation some time around 2000. at the time I was using Guile, but was frustrated some with it. for whatever reason, I skimmed over the source of it and several other Scheme implementations, and threw together my own.

IIRC, my initial exposure to Scheme was by reading Euclid's Window (I think this was it), because it mentioned Lisp and Scheme at one point in the book, and at the time this seemed interesting (yes, in middle and high school I was probably a bit different than now, as I actually liked math back then, ... this being before being repeatedly "owned" by classes and Q-like teachers helped ruin my general opinion of the topic).


my first BGBScript implementation was in 2004, prompted by several factors:
my Scheme implementation had turned into an unmaintainable mess by this point, so I dropped it; I had tried using a PostScript-derived language as my primary scripting language, and found it to be a terrible language to work in directly; and I had recently encountered JavaScript, thought it was cool, and wanted something similar for my own projects.

never mind that this first implementation sucked (it was itself a big pile of kludges), and from about 2007-2010 I had largely forgotten it (it was "sort of on life support"). in 2010, interest in it was renewed some due to initially unrelated developments leading to a nifty new FFI.


I /had/ to develop a methodological approach to simplicity, because the problem I so gleefully attacked is much, much bigger than I am. (Still is. Besides developing RDP, I've also studied interactive fiction, modular simulations, the accessibility issues for blind access to the virtual world, the possibility of CSS-like transforms on 3D structures, and so on. I have a potentially powerful idea involving multi-agent generative grammars for intelligent controlled creativity and stability in a shared, federated world. I doubt I'll finish /any/ of that on my own, except maybe the generative grammars bit.)

Most developers are clever, but lack perspective. Their eyes and noses are close to a problem, focused on a local problem and code-smells. When they build atop a /powerful/ substrate - such as OOP or monads - composition and integration issues are opaque to them. A common consequence is that their algorithms or DSLs are /simplistic/: they work okay for /a specific/ program or example, but they are difficult to integrate in a new context.

/Too much power with too little perspective - that is the problem./

I doubt "power" is the cause of this problem.
being simplistic, or being built from cruft and hacks, is far more likely to be their downfall.


Developers have recognized this problem, at least implicitly, and so they build frameworks. A framework is an ad-hoc, slow, informally specified, bug-ridden language that is, critically, more constrained than full OOP. The idea is that we write code in the context of a framework, and the constraints imposed on us help with integration. Unfortunately, most frameworks are simplistic, built too close to a problem, and do not integrate nicely with other systems and frameworks. A common consequence is that developers spend more time working around a framework, or adapting between frameworks, than working within them.


above:
I regard this as "ivory tower VM design"; the likely biggest example of this strategy is the JVM, however it is not unique to the JVM (most VMs have it to a greater or lesser extent).

major properties:
typically a language-centric design ("language X will rule the world");
VM facilities are typically only really usable from within "language X";
typically a set of APIs which try to largely or completely wrap the underlying system;
typically a poor FFI (and poor cross-language integration);
...


Still, the goal of frameworks is noble. Just, to make them work, we must start with a large set of diverse problems across many domains, and at enormous scales... millions of developers, integrating thousands of DSLs or frameworks. We need a general framework programming language, one that ensures a large number of compositional properties - most critically, those useful for system integration. This is, in a sense, what my RDP attempts to be. My pursuit of /reactive/ programming stems from this requirement directly: if we have reactivity, then abstracting or composing a framework is little different than abstracting or composing any other object.


I don't think the problem actually requires anywhere near this level of resources; rather, it requires FFIs capable of doing the bit of "heavy lifting" needed to integrate disparate languages and technologies in a relatively seamless and automatic manner.

a good portion of the problem then boils down to data-mining and heuristics.
source code/headers/... actually provide a good deal of information about how to interface with the code in question.

granted, practical limits still exist.
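as a very rough sketch of what I mean by mining headers (the function names and parsing here are hypothetical and drastically simplified; a real tool needs an actual C parser to deal with typedefs, pointers, varargs, ...): the FFI can scan a prototype and record enough metadata to marshal calls automatically, instead of requiring hand-written glue for every function.

    /* hypothetical, heavily simplified: pull the return type, name, and argument
       list out of a prototype like "int foo(int x, double y);" for the FFI */
    #include <stdio.h>

    typedef struct { char ret[32]; char name[64]; char args[128]; } FfiSig;

    int MineProto(const char *proto, FfiSig *sig)
    {
        /* only handles the trivial "type name(args);" shape */
        if (sscanf(proto, "%31s %63[^( ] ( %127[^)] )",
                   sig->ret, sig->name, sig->args) != 3)
            return 0;
        return 1;
    }

    int main(void)
    {
        FfiSig sig;
        if (MineProto("int foo(int x, double y);", &sig))
            printf("ret=%s name=%s args=%s\n", sig.ret, sig.name, sig.args);
        return 0;
    }

from metadata like this, the FFI can generate the marshalling thunks itself, which is most of the "heavy lifting" mentioned above.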


Some people question the value of simplicity. They think complicated is 'good enough'. But these people don't understand that simplicity opens doors that we never realized were closed, raises glass ceilings we didn't realize we were struggling against, and removes performance barriers that we wrongly assumed to be a natural part of our environment (wow! who knew there was a racetrack under all this debris?). A subtle qualitative difference in our abstractions can make huge quantitative and qualitative differences in our emergent systems. Pursuit of simplicity, at scale, tells us just how ineffective and unscalable our systems are today... and offers great hope and optimism for tomorrow.


possibly, but there is still a lot that can be done without requiring fundamental changes.


typically though, one may parse code and then compile it to a stack-machine-based IL; personally, I have had the most luck working with stack machines. IME, stack machines are fairly easy to reason about from software, so things like temporaries/register-allocation/type-analysis/value-flow/... have worked best for me when expressed in terms of the stack.

in this sense, rather than seeing the stack as a 2D graph (stack-location and time), it can be seen as a 1D array of immutable temporaries (every value that ever was on the stack within a given region of code, which can be mapped out statically regardless of any internal control-flow/...).

note that stack items are immutable, because one can't actually modify a value on the stack. one can only pop it off (which discards the immediate reference to the value), or replace it with a new value (which has a new identity, type, and storage assignment, despite overlapping in terms of its stack-index). also, one doesn't need a forward scan to determine whether or not any future references to a given value may exist (once it is popped, its effective lifetime has ended, and any assigned physical storage can be released for reuse).
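a minimal, made-up illustration of the "1D array of temporaries" view: every push in the IL gets the next temporary index, and the "stack" only ever holds temp indices, so a bytecode sequence maps straight onto SSA-ish names without the stack ever acting as mutable storage.

    #include <stdio.h>

    /* hypothetical mini-IL: push-constant, load-variable, add, store-variable */
    enum { IL_PUSHCONST, IL_LOAD, IL_ADD, IL_STORE };

    int main(void)
    {
        int ops[][2] = {   /* roughly: y = x + 3; */
            {IL_PUSHCONST, 3}, {IL_LOAD, 'x'}, {IL_ADD, 0}, {IL_STORE, 'y'} };
        int stk[64], sp = 0, ntemp = 0, i, a, b;

        for (i = 0; i < 4; i++) {
            switch (ops[i][0]) {
            case IL_PUSHCONST:                        /* new value => new temporary */
                printf("t%d = %d\n", ntemp, ops[i][1]);
                stk[sp++] = ntemp++;
                break;
            case IL_LOAD:
                printf("t%d = load %c\n", ntemp, ops[i][1]);
                stk[sp++] = ntemp++;
                break;
            case IL_ADD:
                b = stk[--sp]; a = stk[--sp];         /* popping ends both lifetimes */
                printf("t%d = t%d + t%d\n", ntemp, a, b);
                stk[sp++] = ntemp++;
                break;
            case IL_STORE:
                a = stk[--sp];
                printf("store %c, t%d\n", ops[i][1], a);
                break;
            }
        }
        return 0;
    }

this prints "t0 = 3", "t1 = load x", "t2 = t0 + t1", "store y, t2"; register allocation, type analysis, and storage release then become per-temporary problems.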

...

but, sadly, RPN is not "in vogue" in the same way as TAC + SSA...

so, one can try to argue that it is an "inferior" technology in this sense, but oh well...



