Here's what I was reaching for in my last post to Matt - it represents a profound change in vision philosophy (and helps explain why current AI/scientific approaches to vision (& everything else) are so mindblowingly fragmented and simplistic).

Vision is always "visual scene recognition" and never just "visual object recognition".

To realise this, look at how you look at four pictures:

http://media.oregonlive.com/portland_impact/photo/barber-shootingjpg-fca82e981d021b3b.jpg

http://www.forgotmylines.com/wp-content/uploads/2010/07/ReasonstobePrettyFIghtScene.jpg

http://2.bp.blogspot.com/_ASZ1J20yTgQ/TLuZJT-TgXI/AAAAAAAAAA0/ps7cifXlBWo/s1600/DSC01098.JPG

http://simonhalliday.files.wordpress.com/2009/03/library8003.jpg

What you see - what "visual scene recognition" means first of all - is [seeing] **object[s]-in-a- field**

You see - a policeman putting up a tape on a street, kids fighting on a playground[or similar], a guy playing tennis on a tennis court, books and candles on a table in some kind of library.

You don't just see isolated objects - you automatically see objects-doing-things-in-a-field. You aren't just aware vaguely of the "gestalt" or the "context", which is the vague way some vision philosophy has thought about these things. You immediately look to see what the principal objects are doing in that whole field. Your life in the real world depends on quickly working that out - otherwise that car/animal/falling masonry may hit you. Every scene is potentially dangerous.

But current AI and scientific thinking never - characteristically - "sees the big picture". AI and science metacognitively only see isolated objects - they see the world like this:

http://www.freevector.com/site_media/preview_images/FreeVector-People-Vector-Art.jpg

They don't see or look for the scene - the field surrounding those objects, and what those objects are doing in (& how they're relating to) that field.

And this applies to everything AI touches. Its vision is always fragmented. It sees only fragmented words and sentences:

http://img2.etsystatic.com/000/0/5464342/il_fullxfull.130745770.jpg

It never sees the text as a whole. And human readers always read sentences-in-a-text, (and talk sentences-in-a-conversation), never just sentences.

But even "visual scene recognition" is too simplistic!

Actually, it's always "visual movie theatre recognition".

Human vision is never vision of just a scene out there. It's simultaneously :

"vision of an observer (oneself) viewing the scene" [/objects in a field]. The acting observer seeing as well as the scene seen.

We always see ourselves watching the scene - are always aware of the point of view of the scene, and its distance from us (as I have detailed before on this forum). In real life, of course, we are normally acting in that field and moving through it - and the position of objects in relation to us is vital information..

And then "visual movie theatre recognition" entails not just visual-scene, but visual-scene-in-a-MOVIE recognition!! (in a whole story!)

We don't just see a scene in an isolated moment in time, but are aware of it as a scene-in-a-stream-of-scenes - a movie

We don't just see objects in a field in a timeless moment - - we place them as moving in time as well as space . When we look at those kids fighting, we don't just see them occupying isolated postures - we see them as moving in time - having started a fight beforehand, and still to finish it. When we look at the policeman we are aware that he has come up to that tape in the past, and will move away from it in the future.

And similarly, we don't just see objects-in-a-field, but objects-in-a-field-in-a-WORLD. As objects not just in a single "theatre of operations" but a whole world-of-operations. We are aware of the fields that lie beyond the immediate field - what lies beyond the movie theatre - this may be crucial to understanding what is happening We couldn't understand this picture if our vision were confined to the immediate wing/field shown, and not the fields beyond the picture:

http://www.google.co.uk/imgres?um=1&hl=en&biw=1645&bih=767&tbm=isch&tbnid=lBHESwvnTIYWOM:&imgrefurl=http://www.advrider.com/forums/showthread.php%3Ft%3D499369%26page%3D26&docid=Uoxo7hdI1Bj4MM&imgurl=http://rookery2.viary.com/storagev12/1189500/1189682_2b3b_625x625.jpg&w=399&h=300&ei=m6deUbWqHeOm0QWjtYDIBw&zoom=1&ved=1t:3588,r:22,s:0,i:153&iact=rc&dur=2325&page=1&tbnh=184&tbnw=251&start=0&ndsp=28&tx=130&ty=89

And actually - well you knew this was coming, right? - we don't just see "objects" - lifeless things. We see BODIES. We always see bodies. We're always aware of how those bodies move - whether they're alive or dead - and how they're moving.

And "visual body recognition" is always EMBODIED. You can't understand how other bodies - especially other human bodies - move if you haven't got a body yourself - and can't use that body to simulate how other bodies do and will move.

So let's see [!] what we have here.

"Visual recognition" - human vision - is a helluva lot more than "visual object recognition". "Visual scene recognition" is a bare minimum "Visual movie theatre recognition" is more like it.

And that means embodied-vision-of-a-viewer-viewing-objects/bodies-moving- at-points-in-a-stream-of-=movement- in-a-field-within-a-world-of-fields.

And that's the easy-peasy part. Then we get on to understanding language - and general conceptualisation - which deals with whole classes of object -in whole classes of fields etc. in whole classes of world

Vision then entails not just the small , scientific, fragmented, narrow, narrow-minded, totally blinkered picture of objects, but the big, artistic, integrated, broad-minded, panoramic picture of what lies beyond that object in space and time - in a field in a world of fields, in a scene in a stream of scenes.

How many quadrillions you got, Matt?











-------------------------------------------
AGI
Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/21088071-f452e424
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=21088071&id_secret=21088071-58d57657
Powered by Listbox: http://www.listbox.com

Reply via email to