[ https://issues.apache.org/jira/browse/JOSHUA-288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15541397#comment-15541397 ]
John Hewitt commented on JOSHUA-288:
------------------------------------

I'm moving to benchmark the port against the original C implementation in runtime and AER.
The authors of the original papers specify datasets on which they evaluate:
- French: Europarl and News Commentary from WMT 12
- Chinese: (LDC2003E14)
- Arabic: all parallel data made available for the NIST 2012 Open MT

At least for French and Arabic, it is unclear where the manual reference alignments reside. Any thoughts? [~post]?

> Port fast_align to java
> -----------------------
>
>                 Key: JOSHUA-288
>                 URL: https://issues.apache.org/jira/browse/JOSHUA-288
>             Project: Joshua
>          Issue Type: New Feature
>            Reporter: Matt Post
>            Assignee: John Hewitt
>            Priority: Minor
>             Fix For: 6.2
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> It would be great to have a Java port of fast_align, so that we don't have to
> worry about compiling it, and could distribute it via Maven.
> https://github.com/clab/fast_align

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)