Dear all EAMT members, This is the 2nd call for participation to the 2nd Workshop on Asian Translation (WAT2015).
UPDATES: 1. Korean->Japanese patent translation subtask is added. http://lotus.kuee.kyoto-u.ac.jp/WAT/#task.html 2. The test data for Chinese->Japanese and Korean->Japanese patent translation subtasks will be released on May 8th and 25th respectively. (Those for scientific paper translation subtasks have already been released.) http://lotus.kuee.kyoto-u.ac.jp/WAT/patent/index.html 3. Important dates are fixed. WAT2015 participants must submit their translation results by ***August 16, 2015***. http://lotus.kuee.kyoto-u.ac.jp/WAT/#important-dates.html NOTICE: To join the evaluation campaign, you need to register to WAT2015. See registration page and register now! http://lotus.kuee.kyoto-u.ac.jp/WAT/registration/index.html ******************************************************************** Participants of the previous WAT2014 are also required to register again to WAT2015 ******************************************************************** Best regards, --------------------------------------------------------------------------- WAT 2015 (The 2nd Workshop on Asian Translation) http://lotus.kuee.kyoto-u.ac.jp/WAT/ October 16, 2015, Kyoto, Japan Following the success of the previous Workshop on Asian Translation (WAT2014), WAT2015 brings together machine translation researchers and users to try, evaluate, share and discuss brand-new ideas of machine translation. We are working toward the practical use of machine translation among all Asian countries. For the 2nd WAT, we adopt new translation subtasks "Chinese-to-Japanese and Korean-to-Japanese patent translation" in addition to the subtasks that were conducted in WAT2014. TASK ---- The task is to improve the text translation quality for scientific papers and patent documents. Participants choose any of the subtasks in which they would like to participate and translate the test data using their machine translation systems. The WAT organizers will evaluate the results submitted using automatic evaluation and human evaluation. We will also provide a baseline machine translation. Subtasks: Scientific Paper Subtask: Japanese --> English English --> Japanese Japanese --> Chinese Chinese --> Japanese Patent Subtask: <= NEW! Chinese --> Japanese Korean --> Japanese Dataset: For the scientific paper subtask, WAT uses ASPEC (Asian Scientific Paper Excerpt Corpus) for the dataset including training, development, development test and test data. Participants must get a copy of ASPEC by themselves from http://lotus.kuee.kyoto-u.ac.jp/ASPEC/ For the patent subtask, WAT uses the JPO Patent Corpus, which was constructed by the Japan Patent Office (JPO). This corpus consists of 1 million Chinese-Japanese and Korean-Japanese parallel sentences from patent descriptions with four categories. Participants of the patent subtask are required to obtain the data from the WAT2015 site for the JPO Patent Corpus. http://lotus.kuee.kyoto-u.ac.jp/WAT/patent/index.html Automatic evaluation: We are providing an automatic evaluation server. It is for free for everyone, but you need to create an account for evaluation. Just showing the list of evaluation results does not require an account. Registration: http://lotus.kuee.kyoto-u.ac.jp/WAT/registration/index.html Eval. result: http://lotus.kuee.kyoto-u.ac.jp/WAT/evaluation/index.html Human evaluation: Human evaluation will be carried out using crowdsourcing. Participants can submit translation results a maximum of twice. Sentence-by-sentence pair-wise evaluation compared to the baseline system will be carried out. The crowdsourcing workers will be asked to judge which translation is better than the other in view of adequacy and fluency. All systems will be ranked by the percentage of translations judged to improve upon the baseline system. INVITED TALK ------------ We are planning to have an invited talk as follows: Speaker: Dr. Haizhou Li Research Director of the Institute for Infocomm Research in Singapore Principal Scientist and Department Head of Human Language Technology Title: Adequacy-Fluency Metrics: Evaluating MT in the Continuous Space Model Framework Please visit the homepage of WAT for the biography of the speaker. http://lotus.kuee.kyoto-u.ac.jp/WAT/#invited-talk.html REGISTRATION ------------ The registration for task participants: The registration fee is FREE for all participants. http://lotus.kuee.kyoto-u.ac.jp/WAT/registration/index.html The registration for WAT2015 audiences: There is no need to register in advance. The registration fee is FREE for all audiences include participants. IMPORTANT DATES --------------- Submission due for crowdsourcing evaluation August 16, 2015 System description draft paper due September 20, 2015 Review feedback September 27, 2015 System description camera-ready paper due October 4, 2015 WAT 2015 October 16, 2015 PAPER ----- Participants who submit results for human evaluation should submit description papers of their translation systems and evaluation results. We strongly prefer that papers include a section entitled "Issues for Context-aware Machine Translation" which discusses the importance and usefulness of context. ORGANIZERS ---------- Toshiaki Nakazawa (Japan Science and Technology Agency (JST)) Hideya Mino (National Institute of Information and Communications Technology (NICT)) Isao Goto (Japan Broadcasting Corporation (NHK)) Graham Neubig (Nara Institute of Science and Technology (NAIST)) Eiichiro Sumita (National Institute of Information and Communications Technology (NICT)) Sadao Kurohashi (Kyoto University) CONTACT ------- [email protected] --------------------------------------------------------------------------- -- Toshiaki Nakazawa (Researcher) Japan Science and Technology Agency (JST) (@ Graduate School of Informatics, Kyoto University) Yoshida-honmachi, Sakyo-ku, Kyoto, 606-8501, Japan tel: +81-75-753-5346, fax: +81-75-753-5962 [email protected] / [email protected] _______________________________________________ Mt-list site list [email protected] http://lists.eamt.org/mailman/listinfo/mt-list
