Ed Porter wrote:
WHAT PORTION OF CORTICAL PROCESSES ARE BOUND BY "THE BINDING PROBLEM"?
Here is an important practical, conceptual problem I am having trouble
with.
In an article entitled “Are Cortical Models Really Bound by the ‘Binding
Problem’?”, Tomaso Poggio’s group at MIT takes the position that there
is no need for special mechanisms to deal with the famous “binding
problem” --- at least in certain contexts, such as 150 msec feed forward
visual object recognition. This article implies that a properly
designed hierarchy of patterns that has both compositional and
max-pooling layers (I call them “gen/comp hierarchies”) automatically
handles the problem of what sub-elements are connected with which
others, preventing the need for techniques like synchrony to handle this
problem.
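The alternation that does the work in such a gen/comp hierarchy can be sketched in a few lines: a compositional layer that responds to local arrangements of features, followed by a max-pooling layer that keeps only the strongest nearby response. The toy sketch below illustrates only the mechanism; the template sizes, Gaussian similarity, and pooling width are my own placeholders, not the actual HMAX parameters:

```python
import numpy as np

def compose(image, templates):
    # Compositional ("S") layer: respond wherever a local patch
    # resembles one of the stored feature templates.
    th, tw = templates.shape[1:]
    h, w = image.shape
    out = np.zeros((len(templates), h - th + 1, w - tw + 1))
    for k, t in enumerate(templates):
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                patch = image[i:i + th, j:j + tw]
                out[k, i, j] = np.exp(-np.sum((patch - t) ** 2))
    return out

def max_pool(maps, size=2):
    # Max-pooling ("C") layer: keep only the strongest response in each
    # neighbourhood, discarding exact position (tolerance to shift).
    k, h, w = maps.shape
    maps = maps[:, :h - h % size, :w - w % size]
    return maps.reshape(k, maps.shape[1] // size, size,
                        maps.shape[2] // size, size).max(axis=(2, 4))

rng = np.random.default_rng(0)
image = rng.random((8, 8))
templates = rng.random((3, 3, 3))   # three hypothetical 3x3 feature templates
s1 = compose(image, templates)      # "which feature occurs where" maps
c1 = max_pool(s1)                   # "which feature occurs nearby" maps
```

Because pooling throws away exact position while composition keeps local arrangement, "what is connected to what" is carried implicitly by which composite features fire, rather than by a separate binding mechanism.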
Poggio’s group has achieved impressive results without the need for
special mechanisms to deal with binding in this type of visual
recognition, as is indicated by the two papers below by Serre (the later
of which summarizes much of what is in the first, which is an excellent,
detailed PhD thesis.)
The two works by Geoffrey Hinton cited below are descriptions of
Hinton’s hierarchical feed-forward neural net recognition system (which,
when run backwards, generates patterns similar to those it has been
trained on). These two works by Hinton show impressive results in
handwritten digit recognition without any explicit mechanism for
binding. In particular, watch the portion of the Hinton YouTube video
starting at 21:35 - 26:39 where Hinton shows his system alternating
between recognizing a pattern and then generating a similar pattern
stochastically from the higher level activations that have resulted from
the previous recognition. See how amazingly well his system seems to
capture the many varied forms in which the various parts and sub-shapes
of numerical handwritten digits are related.
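The recognize-then-generate alternation in that demo can be illustrated with a toy RBM: the same weight matrix is used bottom-up for recognition and top-down for stochastic generation. The weights below are random and untrained, so this sketch shows only the mechanism, not digit-like output; the layer sizes are chosen for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Toy restricted Boltzmann machine. The same weight matrix W serves
    bottom-up recognition and top-down generation, which is what lets the
    network alternate between the two."""

    def __init__(self, n_visible, n_hidden):
        # Small random, untrained weights -- a placeholder, not Hinton's net.
        self.W = rng.normal(0.0, 0.1, (n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)
        self.b_h = np.zeros(n_hidden)

    def recognize(self, v):
        # Bottom-up: sample binary hidden units given a visible pattern.
        p = sigmoid(v @ self.W + self.b_h)
        return (rng.random(p.shape) < p).astype(float)

    def generate(self, h):
        # Top-down: stochastically regenerate a visible pattern from the
        # hidden activations ("running the net backwards").
        p = sigmoid(h @ self.W.T + self.b_v)
        return (rng.random(p.shape) < p).astype(float)

rbm = RBM(n_visible=784, n_hidden=500)
v = (rng.random(784) < 0.5).astype(float)
for _ in range(3):
    h = rbm.recognize(v)   # infer hidden causes of the current pattern
    v = rbm.generate(h)    # fantasize a similar pattern from those causes
```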
So my question is this: HOW BROADLY DOES THE IMPLICATION THAT THE
BINDING PROBLEM CAN BE AUTOMATICALLY HANDLED BY A GEN/COMP HIERARCHY OR
A HINTON-LIKE HIERARCHY APPLY TO THE MANY TYPES OF PROBLEMS A BRAIN
LEVEL ARTIFICIAL GENERAL INTELLIGENCE WOULD BE EXPECTED TO HANDLE? In
particular HOW APPLICABLE IS IT TO SEMANTIC PATTERN RECOGNITION AND
GENERATION --- WITH ITS COMPLEX AND HIGHLY VARIED RELATIONS --- SUCH AS
IS COMMONLY INVOLVED IN HUMAN LEVEL NATURAL LANGUAGE UNDERSTANDING AND
GENERATION?
The answer lies in the confusion over what the "binding problem"
actually is. There are many studies out there that misunderstand the
problem in such a substantial way that their conclusions are
meaningless. I refer, for example, to the seminal paper by Shastri and
Ajjanagadde, which I remember discussing with a colleague (Janet Vousden)
back in the early 90s. We both went into that paper in great depth, and
independently came to the conclusion that S & A had their causality so
completely screwed up that the paper said nothing at all: they claimed
to be able to explain binding by showing that synchronized firing could
make it happen, but they completely failed to show how the RELEVANT
neurons would become synchronized.
Distressingly, the Shastri and Ajjanagadde paper then went on to become,
as I say, seminal, and there has been a lot of research on something
that these people call "the binding problem", but which seems (from my
limited coverage of that area) to be about getting various things to
connect using synchronized signals, but without any explanation of how
the things that are semantically required to connect actually connect.
So, to be able to answer your question, you have to disentangle that
entire mess and become clear about what the real binding problem is,
what the fake binding problem is, and whether the new idea makes any
difference to one or the other of these.
In my opinion, it sounds like Poggio is correct in making the claim that
he does, but that Janet Vousden and I already understood that general
point back in 1994, just by using general principles. And, most
probably, the solution Poggio refers to DOES apply as well to what you
are calling the semantic level.
The paper “Are Cortical Models Really Bound by the ‘Binding Problem’?”,
suggests in the first full paragraph on its second page that gen/comp
hierarchies avoid the “binding problem” by
“coding an object through a set of intermediate features made up of
local arrangements of simpler features [that] sufficiently constrain the
representation to uniquely code complex objects without retaining global
positional information."
This is exactly the position that I took a couple of decades ago. You
will recall that I am always talking about doing this with CONSTRAINTS,
and using those constraints at many different levels of the hierarchy.
For example, in the context of speech recognition,
"...rather than using individual letters to code words, letter pairs or
higher-order combinations of letters can be used—i.e., although the word
“tomaso” might be confused with the word “somato” if both were coded by
the sets of letters they are made up of, this ambiguity is resolved if
both are represented through letter pairs.”
A strangely trivial point for them to make: this was the basis for all
the "triplet" representations that were in widespread use for NN
simulations back in the late 1980s.
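The tomaso/somato point is easy to verify directly. A minimal sketch, comparing an unordered bag-of-letters code with a bag-of-letter-pairs code:

```python
def letters(word):
    # Unordered "bag of letters" code.
    return sorted(word)

def bigrams(word):
    # "Bag of letter pairs" code: each pair constrains local adjacency.
    return sorted(word[i:i + 2] for i in range(len(word) - 1))

# The two words collide under the bag-of-letters code...
assert letters("tomaso") == letters("somato")
# ...but the letter-pair code distinguishes them, because each pair
# locally records which letters are adjacent.
assert bigrams("tomaso") != bigrams("somato")
```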
The issue then becomes, WHAT SUB-SETS OF THE TYPES OF PROBLEMS THE HUMAN
BRAIN HAS TO PERFORM CAN BE PERFORMED IN A MANNER THAT AVOIDS THE
BINDING PROBLEM JUST BY USING A GEN/COMP HIERARCHY WITH SUCH “A SET OF
SIMPLER FEATURES [THAT] SUFFICIENTLY CONSTRAIN THE REPRESENTATION TO
UNIQUELY CODE” THE TYPE OF PATTERNS SUCH TASKS REQUIRE?
All of them.
There is substantial evidence that the brain does require synchrony for
some of its tasks --- as has been indicated by the work of people like
Wolf Singer --- suggesting that binding may well be a problem that
cannot be handled alone by the specificity of the brain’s gen/comp
hierarchies for all mental tasks.
No. The brain uses synchrony, but what relationship this has to binding
is unclear. I suspect these are processes happening at completely
different levels of description, and that the connection is therefore
nonexistent.
The table at the top of page 75 of Serre’s impressive PhD thesis
suggests that his system --- which performs very quick feedforward object
recognition roughly as well as a human --- has an input of 160 x 160
pixels, and requires 23 million pattern models. Such a large number of
patterns helps provide the “simpler features [that] sufficiently
constrain the representation to uniquely code complex objects without
retaining global positional information.”
But, it should be noted --- as is recognized in Serre’s paper --- that
the very rapid 150 msec feed forward recognition described in that paper
is far from all of human vision. Such rapid recognition --- although
surprisingly accurate given how fast it is --- is normally supplemented
by more top down vision processes to confirm its best guesses. For
example, if a human is shown a photograph of a face, his eyes will
normally saccade over it, with multiple fixation points, often on key
features such as eyes, nose, corners of mouth, points on the outline of
the face, all indicating that the recognition of the face is normally much
more than one rapid feed forward process. It is possible that
synchronies, attention focusing, or other binding processes are involved
in these further steps of visual recognition.
One of my questions is: if such a relatively small (i.e., 160 x 160
pixel), low-dimensional (i.e., 2-dimensional) input space as that in
Serre’s system requires 23 million models so that it can sufficiently
constrain the representation to uniquely recognize high-level visual
objects without the need for any additional mechanism for binding, HOW
MANY MODELS WOULD BE REQUIRED TO PROPERLY CONSTRAIN RECOGNITION OF
PATTERNS IN THE MUCH LARGER, AND MUCH HIGHER-DIMENSIONAL SPACE IN WHICH
SEMANTIC PATTERNS --- SUCH AS THOSE INVOLVED IN HUMAN LEVEL LANGUAGE
UNDERSTANDING --- ARE REPRESENTED?
You may recall in my previous response that I did ask how these models
scaled up. If his "models" are what I think they are, and if they scale
as the square of the linear array size, then this approach would be
useless at these higher levels. That was what I suspected before: the
approach might well be progress in the lower reaches of the visual
system, but other - completely different - mechanisms are probably at
work higher up.
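A back-of-envelope sketch makes the scaling worry concrete. The 160-pixel side and ~23 million models are the figures cited from the thesis; the quadratic growth law is pure speculation on my part, not anything measured:

```python
# Figures cited from the thesis (table on p. 75); the growth law is assumed.
BASE_SIDE = 160      # 160 x 160 pixel input
BASE_MODELS = 23e6   # ~23 million pattern models

def models_needed(side):
    # Hypothetical: model count grows as the square of the linear array size.
    return BASE_MODELS * (side / BASE_SIDE) ** 2

for side in (160, 320, 1280):
    print(f"{side:>5} px side -> {models_needed(side):.2e} models")
```

Even a modest increase in linear input size multiplies the model count under this assumed law, which is why the approach looks unpromising for higher, semantic levels if it scales that way.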
It is my hunch that in such a large, high-dimensional space, it is not
possible for the brain to have enough models to provide the
type of constraint required --- without using additional mechanisms,
such as synchrony, or its equivalent --- to deal with the binding
problem. IS THIS CORRECT?
Not correct. Or rather, you are correct to say that the model does not
apply, but I see no reason to deduce that the binding problem per se has
any relevance to the problem of dealing with higher level processes. As
far as I can see, this is just a non sequitur.
Rather, we need to understand the basic nature of those higher processes.
But it is also my hunch that --- in the more complex representational
spaces required for more abstract levels of human thought --- gen/comp
model networks of the general type described by Poggio and Serre greatly
reduce the amount of binding that has to be handled by additional
methods such as synchrony and/or sequential wide-spread activation of
conscious and near-conscious concepts at gamma-wave frequencies. This
decrease in the amount of binding that has to be dealt with by methods
other than the relatively high resolution of the gen/comp hierarchy,
itself, would greatly increase the amount of parallel processing that
can be performed in the sub-conscious. IS THIS CORRECT?
Cannot answer this question: do not understand it.
I would be grateful for any intelligent answers to any of the questions
I have posed in ALL CAPS or any feedback about any of my other comments
in this email.
These questions appear to be very important for the design of artificial
general intelligence (AGI), because AGI methods for dealing with binding
are considerably more computationally expensive than the relatively
simple feed-forward computations used in systems like the one in Serre's
PhD thesis, or in Hinton's RBMs: the additional binding methods tend to
require spreading activation that stores more complicated state
information at each activated node, and matching between the stored
activation states at those nodes. I have not figured out any way to do
massively-parallel,
complex, context-sensitive semantic pattern recognition and generation
without some form of additional processing to handle binding, unless one
were to use many more models than there are neurons in the human brain.
IS THIS CORRECT?
I would appreciate very much any guidance those who might be more
knowledgeable on this subject could give to me and to the AGI list.
Stepping back from the details, I think that what is happening here is
that if you take an approach to AGI that emphasizes the "Standard Model"
of the sort found in Novamente (a broad class of systems that is hard to
define compactly, but which has to do with the fact that there are
passive symbols being manipulated by supposedly smart mechanisms), then
you tend to get sucked into the idea that binding the right things
together is a crucial problem. To put it crudely, getting the system to
work depends crucially on who decides to talk to whom in your system.
It is very difficult to describe why this happens. Basically, this
style of AGI commits itself to specific architectures very early, and
then later on the researcher wonders why, with so many degrees of
freedom nailed down, the last few pieces of the puzzle do not fit. Then
the researcher needs a name for the main culprit that does not fit. In
this case, you are calling it the binding problem .... getting the right
things to hook up together.
Problem is, you see, that getting the right things to hook up together
is the WHOLE STORY.
Richard Loosemore
Sincerely,
Ed Porter
References
1. Are Cortical Models Really Bound by the “Binding Problem”?, by
Maximilian Riesenhuber and Tomaso Poggio, at
http://cbcl.mit.edu/projects/cbcl/publications/ps/riesenhuber-neuron-1999.pdf
2. Learning a Dictionary of Shape-Components in Visual Cortex:
Comparison with Neurons, Humans and Machines, PhD thesis by Thomas
Serre, at
http://cbcl.mit.edu/projects/cbcl/publications/ps/MIT-CSAIL-TR-2006-028.pdf
3. Robust Object Recognition with Cortex-Like Mechanisms, Thomas Serre,
Lior Wolf, Stanley Bileschi, Maximilian Riesenhuber, and Tomaso Poggio,
at http://web.mit.edu/serre/www/publications/Serre_etal_PAMI07.pdf
4. Learning multiple layers of representation, Geoffrey E. Hinton, at
http://www.csri.utoronto.ca/~hinton/absps/tics.pdf
5. The Next Generation of Neural Networks, Google Tech Talk by Geoffrey
E. Hinton, 11/29/07, on YouTube, at
http://www.youtube.com/watch?v=AyzOUbkUf3M
------------------------------------------------------------------------
*agi* | Archives <http://www.listbox.com/member/archive/303/=now>
<http://www.listbox.com/member/archive/rss/303/> | Modify
<http://www.listbox.com/member/?&>
Your Subscription [Powered by Listbox] <http://www.listbox.com>