Hello, My name is Graham Neubig, and I'm currently a PhD Student at Kyoto University.
I am just writing to inform you of a Japanese-English parallel corpus that I prepared based on some data that was released by NICT in Japan. All data is tokenized and split into training/test sets, and a script that performs the full process of preparing the data and running a Moses baseline system is included. The data can be found here: Kyoto Free Translation Task http://www.phontron.com/kftt/ If you are interested, please feel free to use the data in your research and I'd appreciate any comments/questions/suggestions. Also, if possible I'd appreciate a link on the "Links to Corpora" page. Thanks in advance! Graham _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
