Matt,

The complexity of a simulated environment is tricky to estimate, if
the environment contains complex self-organizing dynamics, random
number generation, and complex human interactions ...

ben

On Tue, Jan 13, 2009 at 1:29 PM, Matt Mahoney <matmaho...@yahoo.com> wrote:
> My response to Ben's paper is to be cautious about drawing conclusions from 
> simulated environments. Human level AGI has an algorithmic complexity of 10^9 
> bits (as estimated by Landauer). It is not possible to learn this much 
> information from an environment that is less complex. If a baby AI did 
> perform well in a simplified simulation of the world, it would not imply that 
> the same system would work in the real world. It would be like training a 
> language model on a simple, artificial language and then concluding that the 
> system could be scaled up to learn English.
>
> This is a lesson from my dissertation work in network intrusion anomaly 
> detection. This was a machine learning task in which the system was trained 
> on attack-free network traffic, and then identified anything out of the 
> ordinary as malicious. For development and testing, we used the 1999 
> MIT-DARPA Lincoln Labs data set consisting of 5 weeks of synthetic network 
> traffic with hundreds of labeled attacks. The test set developers took great 
> care to make the data as realistic as possible. They collected statistics 
> from real networks, built an isolated network of 4 real computers running 
> different operating systems, and thousands of simulated computers that 
> generated HTTP requests to public websites and mailing lists, and generated 
> synthetic email using English word bigram frequencies, and other kinds of 
> traffic.
>
> In my work I discovered a simple algorithm that beat the best intrusion 
> detection systems available at the time. I parsed network packets into 
> individual 1-4 byte fields, recorded all the values that ever occurred at 
> least once in training, and flagged any new value in the test data as 
> suspicious, with a score inversely proportional to the size of the set of 
> values observed in training and proportional to the time since the previous 
> anomaly.
>
> Not surprisingly, the simple algorithm failed on real network traffic. There 
> were too many false alarms for it to be even remotely useful. The reason it 
> worked on the synthetic traffic was that it was algorithmically simple 
> compared to real traffic. For example, one of the most effective tests was 
> the TTL value, a counter that decrements with each IP routing hop, intended 
> to prevent routing loops. It turned out that most of the attacks were 
> simulated from a machine that was one hop further away than the machines 
> simulating normal traffic.
>
> A problem like that could have been fixed, but there were a dozen others that 
> I found, and probably many that I didn't find. It's not that the test set 
> developers weren't careful. They spent probably $1 million developing it 
> (several people over 2 years). It's that you can't simulate the high 
> complexity of thousands of computers and human users with anything less than 
> that. Simple problems have simple solutions, but that's not AGI.
>
> -- Matt Mahoney, matmaho...@yahoo.com
>
>
> --- On Fri, 1/9/09, Ben Goertzel <b...@goertzel.org> wrote:
>
>> From: Ben Goertzel <b...@goertzel.org>
>> Subject: [agi] What Must a World Be That a Humanlike Intelligence May 
>> Develop In It?
>> To: agi@v2.listbox.com
>> Date: Friday, January 9, 2009, 5:58 PM
>> Hi all,
>>
>> I intend to submit the following paper to JAGI shortly, but
>> I figured
>> I'd run it past you folks on this list first, and
>> incorporate any
>> useful feedback into the draft I submit
>>
>> This is an attempt to articulate a virtual world
>> infrastructure that
>> will be adequate for the development of human-level AGI
>>
>> http://www.goertzel.org/papers/BlocksNBeadsWorld.pdf
>>
>> Most of the paper is taken up by conceptual and
>> requirements issues,
>> but at the end specific world-design proposals are made.
>>
>> This complements my earlier paper on AGI Preschool.  It
>> attempts to
>> define what kind of underlying virtual world infrastructure
>> an
>> effective AGI preschool would minimally require.
>>
>> thx
>> Ben G
>>
>>
>>
>> --
>> Ben Goertzel, PhD
>> CEO, Novamente LLC and Biomind LLC
>> Director of Research, SIAI
>> b...@goertzel.org
>>
>> "I intend to live forever, or die trying."
>> -- Groucho Marx
>
>
>
> -------------------------------------------
> agi
> Archives: https://www.listbox.com/member/archive/303/=now
> RSS Feed: https://www.listbox.com/member/archive/rss/303/
> Modify Your Subscription: https://www.listbox.com/member/?&;
> Powered by Listbox: http://www.listbox.com
>



-- 
Ben Goertzel, PhD
CEO, Novamente LLC and Biomind LLC
Director of Research, SIAI
b...@goertzel.org

"This is no place to stop -- half way between ape and angel"
-- Benjamin Disraeli


-------------------------------------------
agi
Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=8660244&id_secret=126863270-d7b0b0
Powered by Listbox: http://www.listbox.com

Reply via email to