To answer the original question: yes, the curation of a dataset like this would be hugely beneficial to the community. Look at what ImageNet has done for computer vision. In fact, it might be good to emulate ImageNet further and pre-split the dataset into a publicly-available training set, and a hidden testing set, for truly objective comparisons between move-prediction algorithms.
If you undertake this, many thanks in advance! On Fri, Nov 13, 2015 at 1:20 PM, Dave Dyer <dd...@real-me.net> wrote: > > I was recently working on assigning final scores to completed games, using > the large data set from Badukmovies.com. > > My observation is that the size of the data set (50,000 games) is not > large enough to get good coverage of unusual situations occurring in real > games. > > There's a definite need for a curated collection of atypical but > interesting games, probably manipulated to explore the boundaries > between interesting and normal. > > _______________________________________________ > Computer-go mailing list > Computer-go@computer-go.org > http://computer-go.org/mailman/listinfo/computer-go >
_______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go