Re: [fonc] x86_64...

BGB Fri, 05 Dec 2008 14:46:33 -0800

----- Original Message -----
From: "Aaron Gray" <[EMAIL PROTECTED]>

To: "Fundamentals of New Computing" <[email protected]>
Sent: Saturday, December 06, 2008 2:53 AM
Subject: Re: [fonc] x86_64...

apparently both this, and my effort, had independently discovered the idea of having 2 different this/self values (in my case, this was due to the issue of mixing delegation with class/instance, where I have delegation methods which accept the self which received the original method call, and normal virtual methods get the self from the object containing the method). however beyond this I suspect the object systems differ notably (my system is based mostly on the use of a class/instance system and interfaces).
There is more about IDC's Lieberman prototype implementation here :-

   http://piumarta.com/pepsi/prototypes.html

Aaron


yeah...

I have looked over the project a bit more.

it provides a few interesting ideas, and even points out a few things in the Win32 API I was not aware of (namely, it is possible to introspect the app's symbol table and similar without having to load and process the app's binary, ...).



but, yes, it is a very different sort of project it seems.

most of it is written in Smalltalk, and it seems to be a mostly centralized and integrated project; it seems that the implementation written in itself is actually meaningful, in that the C version of the implementation seems to be largely the compiler output from an ST->C compiler, which I suspect is also located in the project.

more so, it has some things merged together which in my project are several different subsystems: apparently it deals with machine code generation, low-level code generation issues, ... all in a single place.

it is also not so clear what separates one thing from another (there does not appear to be any clear separation between subsystems, ...).



now, in my case, I have different libraries:

BGBASM, which provides the assembler and dynamic linker (also disassembler, code for managing symbol lookup, ...), accepts data in a textual form similar to NASM;
VRM, which provides low-level codegen (register allocation, largely abstracts over processor level register and type handling, ...);
RPNIL, accepts RPN based language for describing the code to be compiled (granted, the existing RPNIL compiler also includes many things since moved to VRM);

..

I have not run a line counter, but I suspect this project is around an order of magnitude smaller than my project (maybe a 10x or more code-size difference), or, at least, this project is a little smaller than 250 kloc...

I would suspect there are both advantages and disadvantages to having things integrated or isolated.


integrated, pros:

there is much less code to work with, and so drastic changes can be done more readily;
the same task can be accomplished with far less code, and via a potentially faster process.


integrated, cons:
there is much less abstraction;
can become a horrible and unworkable mess in non-OO languages (such as C);

it is not really possible to reuse components in different contexts, since the context is integrated with the component;
in this case, it is not possible to utilize alternate capabilities or implications of a component, because as it so happens many of these capabilities are not implemented (for example, writing the assembler does not give one a disassembler almost for free, ...);

..

modular, pros:

components can be replaced with others to give new and different functionality;

components can be used in a wider variety of contexts;

it may be possible to make use of far more elaborate and complex transformation processes;

it provides an alternative to code duplication and modification;
it keeps one thing isolated from the inner workings of other things;

it is easy to provide a good number of possible "routes" and thus drastically different behavior and results, as well as making it more possible to accommodate alternate components with merged functionality;

..

modular, cons:
much more code may be needed;
often, lots of code is duplicated between modules;

general structures from one subsystem may be mirrored in another, even though there is no direct interaction between these structures;
a task involving numerous modules and stages may run much slower than one implemented as a single integrated component;
although the internals of accomplishing a task are kept flexible, and the whole structure becomes flexible, the way in which the general process is approached becomes fairly rigid;

the APIs become much like impassable walls;
..

as a result, with a modular system one has to be fairly careful with how they design their APIs and what they expose and where. this is because, while things hidden well behind the wall can be changed as needed, any functionality which touches or crosses the wall may become "set in stone"...

so, the design of specific APIs, subsystems, and rules of interaction, becomes almost as much a central part of the project as the code itself.

one may end up largely writing their own code for the primary reason that most existing code does not do the right things in the right way (many pieces of code don't like seeing themselves as a tiny and rigidly defined piece of machinery embedded inside a much larger system, or they might do the right thing in the wrong way, or in some rare cases the wrong thing in the right way).

..

psychology may relate to all this as well, since apparently I am an ESTJ (yes... people here can recoil in horror...), and this may all relate to how I approach coding...

BTW: I am considering the implications of, in my case, doing something similar to 'coke'...

the big hairy issue though is that I would be compiling it to the JVM, which is slightly different from the normal way languages of this sort operate.

as well, I am considering ideas for representing a wider variety of data types in S-Exps (at present, S-Exps force a fairly narrow type model). new syntax will probably be related to serializing classes and instances, inline XML, ...


#X<foo bar="text">baz...<br/>again...</foo>

in the past, I had not done this, but more because I had usually been using S-Exps as a convenient way of dumping internal data in a readable form (but not as much for actual/useful data serialization).



as is, I lack any "capable" data serialization format.

S-Exps work, but only represent a narrow range of types (lists, arrays, atomic values, symbols, keywords, ...).

XML can be used to implement a data serialization format, but is not in itself such a format, and it is a pain to make it do so (and to do so efficiently, since I don't currently have support for SAX, and the use of DOM for data serialization is slow and expensive).

I have not maintained any of my binary serialization formats (most are narrow and only serialize the particular kind of data I am looking to serialize at the time).

..

best bet is still almost to try to hack a much larger syntactic type model onto S-Exps (as noted above).

as is, it can serialize objects from my older prototype system, but I could lower these objects from their privileged place in the syntax (especially since for many uses my newer class/instance system is likely to absorb many of the use cases of this older system; however, the systems are sufficiently different that each likely has meaningful use cases).


in particular:
{x: 3 y: 4}
may be downgraded to:
#{x: 3 y: 4}


#C and #O may be added and intended for serialized classes and instances.

#C<classname>{ stuff I have yet to decide on... }
#O<classname>{ key: value}

#O"app/Foo"{x: 3 y: 4}

most likely, I could use special serialization/deserialization handling for classes and instances, given classes and instances are statically typed in my framework...
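
as a rough sketch of what such instance handling could look like (in Python rather than my C framework, so everything here is an assumption: the `qname` attribute, `write_value`, `write_instance`, and the use of dataclasses to stand in for statically typed classes are all hypothetical names picked for illustration):

```python
from dataclasses import dataclass, fields, is_dataclass
from typing import ClassVar


@dataclass
class Foo:
    # hypothetical: the qualified class name written after '#O'
    qname: ClassVar[str] = "app/Foo"
    x: int
    y: int


def write_value(v):
    """Write one value; only a few atom types are handled in this sketch."""
    if is_dataclass(v) and not isinstance(v, type):
        return write_instance(v)
    if isinstance(v, str):
        return '"' + v.replace("\\", "\\\\").replace('"', '\\"') + '"'
    if isinstance(v, (int, float)):
        return repr(v)
    if isinstance(v, (list, tuple)):
        return "(" + " ".join(write_value(x) for x in v) + ")"
    raise TypeError(f"no serializer for {type(v).__name__}")


def write_instance(obj):
    """Write an instance as #O"<classname>"{key: value ...}.

    The field list comes from the class declaration, not from whatever
    happens to be stored on the object, which is the point of having
    statically typed classes here.
    """
    body = " ".join(
        f"{f.name}: {write_value(getattr(obj, f.name))}" for f in fields(obj)
    )
    return f'#O"{type(obj).qname}"{{{body}}}'


print(write_instance(Foo(3, 4)))  # prints: #O"app/Foo"{x: 3 y: 4}
```

since the writer walks the declared fields, a matching reader could look the class up by name and check each key against the same declaration.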

note: classes and instances may also meaningfully absorb many uses of ad-hoc structural types (which posed a sufficient pain for data serialization and deserialization that in the past I had generally not bothered with any kind of generalized data serialization...).



may use the "traditional" syntax for inline references:
'#<num>=' and '#<num>#', even though this syntax is horrid IMO.

in either case, it is uncertain how to efficiently encode references apart from causing a potentially serious cost to serialization performance (recursively checking from each compound element if it is used elsewhere could quickly become an O(n^2) cost).

however, it could be possible by using a separate pass which would keep track of every non-atom in an array, and transferring any duplicated element to a separate array (making this essentially a linear complexity).

in this case, for the output pass, only the first occurrence is serialized (checking if it has occurred in the table), and for all following occurrences it is referenced.
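
a minimal sketch of the two passes, in Python, assuming objects are plain dicts, lists stand in for arrays, and atoms are just numbers. `find_shared`, `write`, and `dump` are hypothetical names, and a hash set keyed on object identity is swapped in for the arrays described above (which is what keeps the first pass roughly linear):

```python
def find_shared(roots):
    """Pass 1: visit every compound node once; note those reached again."""
    seen = set()     # identities of every dict/list visited so far
    shared = set()   # identities of nodes reachable by more than one path
    stack = list(reversed(roots))
    while stack:
        node = stack.pop()
        if not isinstance(node, (dict, list)):
            continue                  # atoms are never labeled
        if id(node) in seen:
            shared.add(id(node))      # second sighting: needs a label
            continue                  # children were already queued once
        seen.add(id(node))
        stack.extend(node.values() if isinstance(node, dict) else node)
    return shared


def write(node, shared, labels):
    """Pass 2: emit '#n=' on the first occurrence, '#n#' afterwards."""
    if not isinstance(node, (dict, list)):
        return str(node)
    key = id(node)
    if key in labels:
        return f"#{labels[key]}#"
    prefix = ""
    if key in shared:
        labels[key] = len(labels) + 1   # numbered in output order
        prefix = f"#{labels[key]}="
    if isinstance(node, dict):
        inner = " ".join(f"{k}: {write(v, shared, labels)}" for k, v in node.items())
        return prefix + "{" + inner + "}"
    return prefix + "(" + " ".join(write(v, shared, labels) for v in node) + ")"


def dump(roots):
    shared = find_shared(roots)
    labels = {}
    return [write(r, shared, labels) for r in roots]


parent = {"x": 3, "y": 4}
for line in dump([{"z": 5, "_parent": parent}, {"w": 7, "_parent": parent}]):
    print(line)
# prints:
# {z: 5 _parent: #1={x: 3 y: 4}}
# {w: 7 _parent: #1#}
```

the label is recorded before the body is written, so a node which contains itself comes out as a back reference rather than recursing forever.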

I am likely to require a strictly linear encoding process (AKA: no forward references), although forward references could be allowed if a 2-pass parsing scheme were used (references are initially given "placeholders", and the correct values are substituted in place via a secondary "un-flattening" pass).
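
the 2-pass idea could be sketched like this, in Python, assuming the parser has already produced a tree in which '#<num>=' became a `Label` node and '#<num>#' became a `Ref` placeholder. both node types and `unflatten` are hypothetical names for illustration, and objects are again plain dicts and lists:

```python
from dataclasses import dataclass


@dataclass
class Label:
    """Placeholder left by the parser for '#<num>=' followed by a value."""
    num: int
    value: object


@dataclass
class Ref:
    """Placeholder left by the parser for '#<num>#'."""
    num: int


def strip(node):
    # Drop any Label wrappers to get at the labeled value itself.
    while isinstance(node, Label):
        node = node.value
    return node


def unflatten(forms):
    table = {}

    def collect(node):
        # First walk: record every labeled value, wherever it appears.
        if isinstance(node, Label):
            table[node.num] = strip(node)
            collect(node.value)
        elif isinstance(node, dict):
            for v in node.values():
                collect(v)
        elif isinstance(node, list):
            for v in node:
                collect(v)

    def resolve(node):
        # Second walk: patch containers in place so sharing is preserved.
        if isinstance(node, Ref):
            return table[node.num]    # KeyError if the label never appeared
        node = strip(node)
        if isinstance(node, dict):
            for k in list(node):
                node[k] = resolve(node[k])
        elif isinstance(node, list):
            node[:] = [resolve(v) for v in node]
        return node

    for form in forms:
        collect(form)
    return [resolve(form) for form in forms]


# the forward-reference case: #1# is used before #1= is seen
forms = unflatten([
    {"z": 5, "_parent": Ref(1)},
    {"w": 7, "_parent": Label(1, {"x": 3, "y": 4})},
])
print(forms[0]["_parent"] is forms[1]["_parent"])  # prints: True
```

a reference resolves to the labeled container itself and is not walked again at that point, so cyclic data does not loop.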

potentially, I could also split out merged forms and serialize them beforehand (makes more sense for automatically serialized data, and may improve readability in many cases, avoiding producing a potentially massive partial nested graph of objects followed by lots of smaller fragmentary references).


first form:
{z: 5 _parent: #1={x: 3 y: 4}}
{w: 7 _parent: #1#}

if forward refs were possible:
{z: 5 _parent: #1#}
{w: 7 _parent: #1={x: 3 y: 4}}


if splitting is done, and the table is serialized first:
#;#1={x: 3 y: 4}
{z: 5 _parent: #1#}
{w: 7 _parent: #1#}

this would work because my parser still parses expression comments, but then discards the results.
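
a toy reader showing why this works, in Python. this is not my actual parser: `TOKEN` and `read_all` are hypothetical, and the sketch only handles '{key: value}' objects, integers, bare symbols, '#n=', '#n#', and '#;'. the point is just that the datum after '#;' is fully parsed (so its label lands in the table) before being thrown away:

```python
import re

# '#;' | '#n=' | '#n#' | braces | any other run of non-space, non-brace chars
TOKEN = re.compile(r"#;|#\d+=|#\d+#|[{}]|[^\s{}]+")


def read_all(text):
    tokens = TOKEN.findall(text)
    pos = 0
    labels = {}   # label number -> already-parsed value

    def read():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if re.fullmatch(r"#\d+=", tok):
            value = read()
            labels[int(tok[1:-1])] = value   # recorded once the value is read
            return value
        if re.fullmatch(r"#\d+#", tok):
            return labels[int(tok[1:-1])]    # strictly linear: no forward refs
        if tok == "{":
            obj = {}
            while tokens[pos] != "}":
                key = tokens[pos]
                if not key.endswith(":"):
                    raise SyntaxError(f"expected 'key:', got {key!r}")
                pos += 1
                obj[key[:-1]] = read()
            pos += 1                         # consume '}'
            return obj
        return int(tok) if tok.lstrip("-").isdigit() else tok

    forms = []
    while pos < len(tokens):
        if tokens[pos] == "#;":
            pos += 1
            read()            # parsed, so '#1=' is registered, then discarded
        else:
            forms.append(read())
    return forms


text = """#;#1={x: 3 y: 4}
{z: 5 _parent: #1#}
{w: 7 _parent: #1#}"""
forms = read_all(text)
print(len(forms))                                   # prints: 2
print(forms[0]["_parent"] is forms[1]["_parent"])   # prints: True
```

a reader which skipped comments without parsing them would fail on the first '#1#', so this trick does depend on that parser behavior.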

for now, I will probably not face the issue of method and function serialization (it would be both ugly and unproductive to dump out masses of disassembled functions, and far less certain that any such dumps could be re-assembled into a working form anyways...). so, any methods which have been compiled to machine code would probably be serialized as symbolic references (C-side function name).


or such...


_______________________________________________
fonc mailing list
[email protected]
http://vpri.org/mailman/listinfo/fonc


