Hi Graham, Did you have a look at the tarballs that were distributed last year? http://www.statmt.org/wmt14/translation-task.html
There are three different version: - Test sets (5.2 MB) These are the source sgm files with extra "filler" sentences. They were the actual files released for the campaign. http://www.statmt.org/wmt14/test.tgz - Filtered Test sets (3.2 MB) These are the source and reference sgm files used to evaluate, i.e. the Test sets without the "filler" sentences. If you want to reproduce results from the campaign, use these. http://www.statmt.org/wmt14/test-filtered.tgz - Cleaned Test sets (3.2 MB) These include fixes to minor encoding errors, and reinstate around 10% of the en-de data which was excluded from the evaluation. For further research, use these. http://www.statmt.org/wmt14/test-full.tgz WMT has a Google Group: https://groups.google.com/forum/#!forum/wmt-tasks Cheers, Matthias On Mon, 2015-04-27 at 22:14 +0900, Graham Neubig wrote: > Hi Moses List, > > Sorry about this being a bit off topic, but I have a question about the > files on matrix.statmt.org, and couldn't find any information about who to > contact on the site and assumed that here would be the next-best place to > ask. > > Specifically, I'm looking for the SGM files for newstest2014 in the same > order as the system outputs on matrix.statmt.org. On the "test sets" page, > in the place where there should be a link to newstest2014, it seems like > the link actually points to newstest2013: > http://matrix.statmt.org/test_sets/list > > And the ones downloadable from the WMT 2015 site seem to be in a different > order, and it'd be a bit of a pain (although possible) to match the lines > properly: > http://www.statmt.org/wmt15/translation-task.html > > If possible, could someone help out with this, or tell me who's in charge > of the evaluation matrix so I can contact them directly? > > Graham > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
