On 22 March 2016 at 17:20, Álvaro Begué <alvaro.be...@gmail.com> wrote:

> A very simple-minded analysis is that, if the null hypothesis is that
> AlphaGo and Lee Sedol are equally strong, AlphaGo would do as well as we
> observed or better 15.625% of the time. That's a p-value that even social
> scientists don't get excited about. :)
>
>
"For "as well ... or better", I make it 18.75%.

Nick



> Álvaro.
>
>
> On Tue, Mar 22, 2016 at 12:48 PM, Jason House <jason.james.ho...@gmail.com
> > wrote:
>
>> Statistical significance requires a null hypothesis... I think it's
>> probably easiest to ask the question of if I assume an ELO difference of x,
>> how likely it's a 4-1 result?
>> Turns out that 220 to 270 ELO has a 41% chance of that result.
>> >= 10% is -50 to 670 ELO
>> >= 1% is -250 to 1190 ELO
>> My numbers may be slightly off from eyeballing things in a simple excel
>> sheet. The idea and ranges should be clear though
>> On Mar 22, 2016 12:00 PM, "Lucas, Simon M" <s...@essex.ac.uk> wrote:
>>
>>> Hi all,
>>>
>>> I was discussing the results with a colleague outside
>>> of the Game AI area the other day when he raised
>>> the question (which applies to nearly all sporting events,
>>> given the small sample size involved)
>>> of statistical significance - suggesting that on another week
>>> the result might have been 4-1 to Lee Sedol.
>>>
>>> I pointed out that in games of skill there's much more to judge than
>>> just the final
>>> outcome of each game, but wondered if anyone had any better (or worse :)
>>> arguments - or had even engaged in the same type of
>>> conversation.
>>>
>>> With AlphaGo winning 4 games to 1, from a simplistic
>>> stats point of view (with the prior assumption of a fair
>>> coin toss) you'd not be able to claim much statistical
>>> significance, yet most (me included) believe that
>>> AlphaGo is a genuinely better Go player than Lee Sedol.
>>>
>>> From a stats viewpoint you can use this approach:
>>> http://www.inference.phy.cam.ac.uk/itprnn/book.pdf
>>> (see section 3.2 on page 51)
>>>
>>> but given even priors it won't tell you much.
>>>
>>> Anyone know any good references for refuting this
>>> type of argument - the fact is of course that a game of Go
>>> is nothing like a coin toss.  Games of skill tend to base their
>>> outcomes on the result of many (in the case of Go many hundreds of)
>>> individual actions.
>>>
>>> Best wishes,
>>>
>>>   Simon
>>>
>>>
>>> _______________________________________________
>>> Computer-go mailing list
>>> Computer-go@computer-go.org
>>> http://computer-go.org/mailman/listinfo/computer-go
>>
>>
>> _______________________________________________
>> Computer-go mailing list
>> Computer-go@computer-go.org
>> http://computer-go.org/mailman/listinfo/computer-go
>>
>
>
> _______________________________________________
> Computer-go mailing list
> Computer-go@computer-go.org
> http://computer-go.org/mailman/listinfo/computer-go
>



-- 
Nick Wedd      mapr...@gmail.com
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to