A very simple-minded analysis is that, if the null hypothesis is that
AlphaGo and Lee Sedol are equally strong, AlphaGo would do as well as we
observed or better 15.625% of the time. That's a p-value that even social
scientists don't get excited about. :)

Álvaro.


On Tue, Mar 22, 2016 at 12:48 PM, Jason House <jason.james.ho...@gmail.com>
wrote:

> Statistical significance requires a null hypothesis... I think it's
> probably easiest to ask the question of if I assume an ELO difference of x,
> how likely it's a 4-1 result?
> Turns out that 220 to 270 ELO has a 41% chance of that result.
> >= 10% is -50 to 670 ELO
> >= 1% is -250 to 1190 ELO
> My numbers may be slightly off from eyeballing things in a simple excel
> sheet. The idea and ranges should be clear though
> On Mar 22, 2016 12:00 PM, "Lucas, Simon M" <s...@essex.ac.uk> wrote:
>
>> Hi all,
>>
>> I was discussing the results with a colleague outside
>> of the Game AI area the other day when he raised
>> the question (which applies to nearly all sporting events,
>> given the small sample size involved)
>> of statistical significance - suggesting that on another week
>> the result might have been 4-1 to Lee Sedol.
>>
>> I pointed out that in games of skill there's much more to judge than just
>> the final
>> outcome of each game, but wondered if anyone had any better (or worse :)
>> arguments - or had even engaged in the same type of
>> conversation.
>>
>> With AlphaGo winning 4 games to 1, from a simplistic
>> stats point of view (with the prior assumption of a fair
>> coin toss) you'd not be able to claim much statistical
>> significance, yet most (me included) believe that
>> AlphaGo is a genuinely better Go player than Lee Sedol.
>>
>> From a stats viewpoint you can use this approach:
>> http://www.inference.phy.cam.ac.uk/itprnn/book.pdf
>> (see section 3.2 on page 51)
>>
>> but given even priors it won't tell you much.
>>
>> Anyone know any good references for refuting this
>> type of argument - the fact is of course that a game of Go
>> is nothing like a coin toss.  Games of skill tend to base their
>> outcomes on the result of many (in the case of Go many hundreds of)
>> individual actions.
>>
>> Best wishes,
>>
>>   Simon
>>
>>
>> _______________________________________________
>> Computer-go mailing list
>> Computer-go@computer-go.org
>> http://computer-go.org/mailman/listinfo/computer-go
>
>
> _______________________________________________
> Computer-go mailing list
> Computer-go@computer-go.org
> http://computer-go.org/mailman/listinfo/computer-go
>
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to