Steve Richfield wrote:
Ben, et al,
After ~5 months of delay for theoretical work, here are the basic ideas
as to how really fast and efficient automatic learning could be made
almost trivial. I decided NOT to post the paper (yet), but rather to
just discuss some of the underlying ideas in AGI-friendly terms.
Suppose for a moment that a NN or AGI program (they can be easily mapped
from one form to the other

[... this is not obvious, to say the least. Mapping involves many
compromises that change the functioning of each type ...]

), instead of operating on "objects" (in an object-oriented sense)

[Neither NN nor AGI has any intrinsic relationship to OO.]

, operates on the rates of change of the probabilities of "objects", or
dp/dt. Presuming sufficient bandwidth to generally avoid superstitious
coincidences, fast unsupervised learning then becomes completely
trivial, as like objects cause simultaneous like-patterned changes in
the inputs WITHOUT the overlapping effects of the many other objects
typically present in the input (with numerous minor exceptions).
You have already presumed that something supplies the system with
"objects" that are meaningful. Even before your first mention of dp/dt,
there has to be a mechanism that is so good that it never invents
objects such as:
Object A: "A person who once watched all of Tuesday Weld's movies in the
space of one week" or
Object B: "Something that is a combination of Julius Caesar's pinky toe
and a sour grape that Brutus just spat out" or
Object C: "All of the molecules involved in a swimming gala that happen
to be 17.36 meters from the last drop of water that splashed from the pool".
You have supplied no mechanism that is able to do that, but that
mechanism is 90% of the trouble, if learning is what you are about.
Instead, you waved your hands and said "fast unsupervised learning
then becomes completely trivial" .... this statement is a declaration
that a good mechanism is available.
You then also talk about "like" objects. But the whole concept of
"like" is extraordinarily troublesome. Are Julius Caesar and Brutus
"like" each other? Seen from our distance, maybe yes, but from the
point of view of Julius C., probably not so much. Is a G-type star
"like" a mirror? I don't know any stellar astrophysicists who would say
so, but then again OF COURSE they are, because they can be almost
indistinguishable: if you hold a mirror up in the right way it can
reflect the sun, and the two visual images can be identical.
These questions can be resolved, sure enough, but it is the whole
business of resolving these questions (rather than waving a hand over
them and declaring them to be trivial) that is the point.
But, what would Bayesian equations or NN neuron functionality look like
in dp/dt space? NO DIFFERENCE (math upon request). You could trivially
differentiate the inputs to a vast and complex existing AGI or NN,
integrate the outputs, and it would perform _identically_ (except for
some "little" details discussed below). Of course, while the transforms
would be identical, unsupervised learning would be quite a different
matter, as now the nearly-impossible becomes trivially simple.
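A minimal numerical sketch of the round trip (my own illustration, not
from Steve's paper, with arbitrary weights and inputs): for a linear
transform y = Wx, differentiating the input stream, applying the same
weights, and cumulatively summing the output reproduces the direct
result, with the constant of integration taken as zero here. A nonlinear
stage would not commute this cleanly, which is presumably among the
"little" details.

```python
import numpy as np

# Hypothetical linear "network layer": y = W @ x. Feeding the layer first
# differences of the input and integrating (cumulatively summing) its
# output reproduces the direct output.
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 6))         # arbitrary weights (assumption)
x = rng.standard_normal((100, 6))       # input time series, 100 steps

y_direct = x @ W.T                      # ordinary ("object-value") operation

# Differentiate inputs: prepending zeros makes dx[0] = x[0], i.e. a zero
# initial condition, so the constant of integration is known.
dx = np.diff(x, axis=0, prepend=np.zeros((1, 6)))
dy = dx @ W.T                           # identical transform, in dp/dt space
y_recovered = np.cumsum(dy, axis=0)     # integrate outputs

print(np.allclose(y_direct, y_recovered))  # True for a linear transform
```

For a nonlinear neuron function f, f(W @ dx) followed by a cumulative sum
is not f(W @ x), so the equivalence as sketched holds only for the linear
part of the transform.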
For some things (like short-term memory) you NEED an integrated
object-oriented result. Very simple - just integrate the signal. How
about muscle movements? Note that muscle actuation typically causes
acceleration, which doubly integrates the driving signal; the already
differentiated signal must therefore be differentiated once more so
that, when doubly integrated by the mechanical system, it produces
movement to the desired location.
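The muscle case can be sketched the same way, in a toy discrete-time
form (the trajectory and helper names are my own assumptions): a desired
position signal is differentiated twice to form the drive, and the
"mechanics" recover the position by integrating twice.

```python
import numpy as np

def differentiate(s):
    # First difference; prepending 0 assumes a zero initial condition
    return np.diff(s, prepend=0.0)

def integrate(s):
    # Discrete integration: running sum
    return np.cumsum(s)

t = np.linspace(0.0, 1.0, 200)
position = np.sin(2 * np.pi * t)    # desired limb position (toy trajectory)

drive = differentiate(differentiate(position))  # doubly differentiated drive
reached = integrate(integrate(drive))           # mechanics doubly integrate it

print(np.allclose(reached, position))  # True
```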
Note that once input values are stored in a matrix for processing, the
baby has already been thrown out with the bathwater. You must START with
differentiated input values and NOT static measured values. THIS is what
the PCA folks have been missing in their century-long quest for an
efficient algorithm to identify principal components, as their arrays
had already discarded exactly what they needed. Of course you could
simply subtract successive samples from one another - at some
considerable risk, since you are now sampling at only half the
Nyquist-required speed to make your AGI/NN run at its intended speed. In
short, if inputs are not being electronically differentiated, then
sampling must proceed at least twice as fast as the NN/AGI cycles.
But - how about the countless lost constants of integration? They "all
come out in the wash" - except for where actual integration at the
outputs is needed. Then, clippers and leaky integrators, techniques
common to electrical engineering, will work fine and produce many of the
same artifacts (like visual extinction) seen in natural systems.
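A leaky integrator with a clipper takes only a few lines to sketch (the
decay and clip parameters are arbitrary assumptions): fed the derivative
of a step stimulus, its output jumps at the onset and then decays away
even though the stimulus remains present, an extinction-like artifact of
the kind mentioned.

```python
import numpy as np

# Leaky integrator: y[t] = decay * y[t-1] + x[t], with a clipper on the state.
def leaky_integrate(x, decay=0.95, clip=1.0):
    y = np.zeros_like(x)
    acc = 0.0
    for i, v in enumerate(x):
        acc = decay * acc + v
        acc = max(-clip, min(clip, acc))  # clipper bounds the state
        y[i] = acc
    return y

stimulus = np.concatenate([np.zeros(50), np.ones(150)])  # step: object appears
dstim = np.diff(stimulus, prepend=0.0)                   # differentiated input
response = leaky_integrate(dstim)

print(response[55] > 0.5)   # True: strong response just after onset
print(response[-1] < 0.1)   # True: response has faded despite the stimulus
```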
It all sounds SO simple, but I couldn't find any prior work in this
direction using Google. However, the collective memory of this group is
pretty good, so perhaps someone here knows of some prior effort that did
something like this. I would sure like to put SOMETHING in the
"References" section of my paper.
Loosemore: THIS is what I was talking about when I explained that there
is absolutely NO WAY to understand a complex system through direct
observation, except by its useless anomalies. By shifting an entire AGI
or NN to operate on derivatives instead of object values, it works
*almost* (the operative word in this statement) exactly the same as one
working in object-oriented space, only learning is transformed from the
nearly-impossible to the trivially simple. Do YOU see any
observation-based way to tell how we are operating behind our eyeballs,
object-oriented or dp/dt? While there are certainly other explanations
for visual extinction, this is the only one that I know of that is
absolutely impossible to engineer around. No one has (yet) proposed any
value to visual extinction, and it is a real problem for hunters, so if
it were avoidable, then I suspect that ~200 million years of evolution
would have eliminated it long ago.
Read David Marr's book "Vision", or any other text that discusses the
low level work done by the visual system. There are indeed
differentiation functions in there (IIRC, Marr came up with the
Difference of Gaussians (DoG) idea because it was a way to do the
equivalent of dp/dt). BUT... this is all in the first few wires coming
out of the retina! It is not interesting.
Visual extinction (of the sort you are talking about) is all over and
done with in the first few cells of the visual pathway, whereas you are
talking here about the millions of other processes that occur higher up.
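For what it's worth, the DoG operator is easy to sketch in one dimension
(the kernel widths here are arbitrary assumptions): subtracting a broad
Gaussian blur from a narrow one gives a response that is near zero over
uniform regions and peaks at edges, i.e. a band-limited derivative-like
operator of the sort found early in the visual pathway.

```python
import numpy as np

def gaussian_kernel(sigma, radius):
    # Discrete, normalized 1-D Gaussian
    x = np.arange(-radius, radius + 1, dtype=float)
    g = np.exp(-x**2 / (2 * sigma**2))
    return g / g.sum()

def dog_response(signal, sigma_center=1.0, sigma_surround=2.0, radius=8):
    # Difference of Gaussians: narrow (center) minus broad (surround) blur
    center = np.convolve(signal, gaussian_kernel(sigma_center, radius), mode="same")
    surround = np.convolve(signal, gaussian_kernel(sigma_surround, radius), mode="same")
    return center - surround

edge = np.concatenate([np.zeros(50), np.ones(50)])  # 1-D luminance edge
resp = dog_response(edge)

print(abs(resp[10]) < 1e-6)   # True: silent over the uniform region
print(abs(resp[50]) > 0.05)   # True: responds at the edge
```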
As for your comment about complex systems, it looks like a non sequitur.
It just does not follow, as far as I can see.
Richard Loosemore
From this come numerous interesting corollaries.
Once the dp/dt signals are in array form, it would become simple to
automatically recognize patterns representing complex phenomena at the
level of the neurons/equations in question. Of course, putting it in
this array form is effectively a transformation from AGI equations to NN
construction, a transformation that has been discussed in prior
postings. In short, if you want your AGI to learn at anything
approaching biological speeds, it appears that you absolutely MUST
transform your AGI structure to a NN-like representation, regardless of
the structure of the processor on which it runs.
Unless I am missing something really important here, this should
COMPLETELY transform the AGI field, regardless of the particular
approach taken.
Any thoughts?
Steve Richfield
------------------------------------------------------------------------
agi | Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/
Modify Your Subscription: https://www.listbox.com/member/?&
Powered by Listbox: http://www.listbox.com