I want to first clarify my earlier proposed definition of AGI, and then
address the concerns posted in response to my claim of the equivalence of
compression and AI. I will propose just one specific application: an
operating system for personal computers. An AGI residing in your PC should
be able to do the same tasks as a human assistant, at least as fast and as
accurately. For example, you could ask your computer to write and submit a
research paper on some topic, to find consulting work and make money, or to
compose and play music; or you could hold up a scrambled Rubik's cube, show
5 of its sides to a camera mounted in your PC, and ask it to display what
is on the hidden side. Unlike current operating systems, there would be no
notion of files, programs, or a GUI. You ask it to do work and it obeys.
The interface would be an Internet connection and standard peripherals such
as a monitor, keyboard, mouse, speakers, and perhaps a microphone and
camera. A microphone (for speech) and camera (to identify the user and
interpret facial expressions) would have negligible effect on requirements,
because the AGI would need a vision system anyway to interpret video on the
Internet. The AGI may be distributed; it need not all reside locally.
An AGI should be complementary to the human mind, not a copy of one. Its
purpose would be to enhance communication between the user and the
computer, and thus with much of the world. The AGI should be able to
generate images that humans can interpret, even though image generation is
not a human capability. The AGI need not have a physical body or the means
to control one, but it must be able to interpret images or descriptions of
human actions such as running, eating, falling in love, etc.
I proposed text compression and video compression as tests. For text, the
AGI must losslessly compress 1 GB of text with no initial training (i.e.
the size of the decompressor is included) well enough to match humans in
tests of text prediction, about 1 bit per character. I chose 1 GB because
it is equivalent to human language exposure from infancy to adulthood.
Using more text is not allowed, because that would make the problem easier
by using brute force to compensate for a slow learning algorithm, e.g.
Google's 5-gram language model trained on 10 TB of text.
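For concreteness, matching 1 bit per character on 1 GB of text implies a
compressed size (decompressor included) of roughly 125 MB. A quick sanity
check of that arithmetic (not part of the benchmark itself):

```python
# Target compressed size for the text benchmark, assuming 10^9 bytes of
# text at about 1 bit per character (the human prediction entropy above).
text_chars = 10**9            # 1 GB of text, one byte per character
bits_per_char = 1.0
target_bytes = text_chars * bits_per_char / 8
print(target_bytes / 10**6)   # -> 125.0 (MB, decompressor size included)
```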
For video, I propose that the AGI be required to compress (lossily) 275 TB
of DVD-quality MPEG-2 video (or 14,000 TB of uncompressed video) to 800 MB.
The input represents 20 years of video at 16 hours per day, or 60,000
DVD-quality movies at 4.7 GB each (2 hours x 640 x 480 x 30 fps x 3 colors
x 8 bits, at 0.65 bits per pixel compression). The output is a rate of 10
bits per second, 16 hours per day, for 20 years. The quality of the
compression must be such that if an audience views a 2-hour movie, then 24
hours later views either the same movie or the movie after compression and
decompression (each with 50% probability), the audience cannot guess with
more than 75% accuracy which version it saw.
The video input size (275,000 GB) will be the maximum allowed, in order to
prevent brute force solutions.
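Since my figures above are round numbers, here is the arithmetic recomputed
from the stated assumptions (2-hour movies, 640 x 480, 30 fps, 3 color
bytes per pixel, 10 bits per second output). The results land in the same
order of magnitude as the figures above, though the exact totals differ
somewhat because of rounding:

```python
# Back-of-envelope recomputation of the video benchmark sizes.
movies = 60_000
dvd_bytes = 4.7e9                          # one DVD-quality MPEG-2 movie
input_bytes = movies * dvd_bytes           # ~2.8e14 bytes (~282 TB)

frames = 2 * 3600 * 30                     # 2 hours at 30 fps
pixels = 640 * 480
raw_bytes = movies * frames * pixels * 3   # uncompressed total, ~1.2e16 bytes
bpp = dvd_bytes * 8 / (frames * pixels)    # ~0.57 bits per pixel on DVD

seconds = 20 * 365 * 16 * 3600             # 20 years at 16 hours/day
output_bits = 10 * seconds                 # ~4.2e9 bits (~525 MB) at 10 bits/s
```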
For both text and video, the decompressor must finish in no more than 20
years (real time). There should be no time limit for compression.
My proposed video output rate of 10 bits per second is a rough guess. It
will depend on psychological tests of the rate of human long-term memory.
Unfortunately, I don't know of any equivalent of Shannon's text guessing
game [1] for measuring this. My estimate is based on:
1. Speech, transcribed and compressed as text, carries about 10 bits per
second. Intuitively, video should be similar or a little higher, both in
terms of what you learn and remember and in the allocation of neurons in
the cortex.

2. Landauer [2] estimated the capacity of human long-term memory at 10^9
bits across a wide range of modalities (words, images, music). Human life
expectancy is 2 x 10^9 seconds, giving 0.5 bits per second.

3. Standing [3] had subjects memorize 10,000 pictures, one every 5.6
seconds, over 5 days. Two days later they could recognize about 80% in
tests. This is about the result you would get if you reduced each picture
to a 16-bit feature vector and checked for matches. It works out to a
memory rate of about 0.3 bits per second.
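The arithmetic behind points 2 and 3, using the round figures above (the
5-day denominator for Standing's experiment is my assumption; counting only
viewing time would give a higher rate):

```python
# Rough long-term memory rates implied by Landauer [2] and Standing [3].
landauer_bits = 1e9                # estimated LTM capacity in bits
lifetime_s = 2e9                   # human life expectancy in seconds
landauer_rate = landauer_bits / lifetime_s              # 0.5 bits/s

pictures = 10_000
bits_per_picture = 16              # hypothetical feature-vector size
study_s = 5 * 24 * 3600            # 5 days, rest time included (assumption)
standing_rate = pictures * bits_per_picture / study_s   # ~0.37 bits/s
```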
Now let me address the responses to my earlier post.
--- "Kingma, D.P." <[EMAIL PROTECTED]> wrote:
I'm not convinced by this reasoning. First, the way individuals store
audiovisual information differs, simply because of slight differences in
brain development (nurture). Also, memory is condensed information about
the actual high-level sensory/experience information. The actual 45 kb
memory of a movie is therefore quite personal to the subject. Recall of a
photo/video is more like an impressionistic painting than an actual photo.
It is true that different people will notice different details, so we must
average over a large audience.
--- Mark Waser <[EMAIL PROTECTED]> wrote:
>> In http://cs.fit.edu/~mmahoney/compression/rationale.html I argue the
equivalence of text compression with AI.
We've had this argument before so I'll summarize . . . . Knowledge
compression may well be mostly equivalent to the "logical view" of AI.
Text, however, can express the same knowledge in a near infinitude of
different forms. Requiring an AI to decompress the same knowledge into a
variety of different forms based upon what was input is a tremendously
more difficult problem than AI without that requirement (and having that
requirement doesn't seem to have any benefit).
This is why I am proposing a lossy test for video compression. A lossless
video test would make no sense, because the vast majority of the
information content is imperceptible noise. That is not the case for text.
I could have used a lossy text test, with human subjects judging the
equivalence of the reproduced output, but it seemed like more trouble than
it is worth. The lossless test is fair because everyone still has to
encode the (incompressible) choice of representation.
--- James Ratcliff <[EMAIL PROTECTED]> wrote:
This is jumping ahead of ourselves as well... we really have to prioritize
and take small steps... We first have to get it to a basic understanding
of the words and the direct interaction of these words... and just from
text stories even, not movies, before we can go to global moral plots and
long-term thinking ahead.
I expect that a good video compressor will have to understand plots,
morals, human emotions, etc. But first we need good models of the lower
levels of visual processing. The nice thing about video compression is
that we can measure this too.
--- David Clark <[EMAIL PROTECTED]> wrote:
If a huge statistical database of valid English information could be
parsed (compressed or otherwise), it might be possible to predict with
some accuracy if a given sentence was likely to be grammatically correct
or not. This capability seems far removed from an AGI IMHO.
Thus my reasoning behind limiting the size of the training set. The AGI
needs to learn as well as a human from the same amount of data.
If a book is put in a computer and then I refer to that book by its title
1M times, what is my percentage compression? If you think it is high, then
show me where the intelligence lies in this reference. By using simple
references, humans compress huge amounts of data that would otherwise
consume storage our brains couldn't physically handle. The problem is that
this kind of compression is accomplished by understanding. If you can
crack the *understanding* part of compression then you might have an AGI,
but I fail to see how just compressing data will result in understanding.
Compression and understanding are not reciprocal concepts. If humans had
unlimited storage and compression of information weren't necessary,
wouldn't a human's *understanding* still confer intelligence on that human?
What is your definition of "understanding"? I know what it means in
people, but what does it mean in a computer? If you accept Turing's
definition of AI, then you have to accept the equivalence of passing the
Turing test with computing a probability distribution.
A common argument against compression as a test for AI is that humans
don't compress like a zip program. Compression requires a *deterministic*
model: a compressor codes a string x using a code of length log 1/p(x)
bits, and the decompressor must compute p(x) exactly to invert the code.
Humans can't do this because their noisy neurons compute a p(x) that
varies a bit each time.
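A minimal sketch of the point, not any particular compressor: the ideal
code length for a string is the sum of log2(1/p) over its symbols, and the
compressor and decompressor must run the same deterministic model update so
that both compute identical probabilities. Here p comes from a toy adaptive
order-0 byte model (my choice, for illustration only):

```python
import math

def ideal_code_length(data: bytes) -> float:
    """Bits needed to code `data` under an adaptive order-0 model."""
    counts = [1] * 256                 # Laplace-smoothed symbol counts
    total = 256
    bits = 0.0
    for b in data:
        bits += math.log2(total / counts[b])   # log2(1/p(symbol))
        counts[b] += 1                 # deterministic update; the decompressor
        total += 1                     # must perform exactly the same step
    return bits

# Skewed data is more predictable, so it codes shorter than uniform data.
print(ideal_code_length(b"aaaa") < ideal_code_length(b"abcd"))  # -> True
```

If either side perturbed its counts (as noisy neurons would), the computed
p would differ between coder and decoder and the code would not invert.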
References

[1] Shannon, Claude E. (1951), "Prediction and Entropy of Printed
English", Bell System Technical Journal 30(1), pp. 50-64.

[2] Landauer, Thomas K. (1986), "How much do people remember? Some
estimates of the quantity of learned information in long-term memory",
Cognitive Science 10, pp. 477-493.

[3] Standing, Lionel (1973), "Learning 10,000 Pictures", Quarterly Journal
of Experimental Psychology 25, pp. 207-222.
-- Matt Mahoney, [EMAIL PROTECTED]
-----
This list is sponsored by AGIRI: http://www.agiri.org/email