Lewis John McGibbney created JOSHUA-304:
-------------------------------------------
Summary: word-align.conf alignment template file not compatable
with berkeley aligner
Key: JOSHUA-304
URL: https://issues.apache.org/jira/browse/JOSHUA-304
Project: Joshua
Issue Type: Bug
Components: templates, berkeley, alignment
Affects Versions: 6.0.5
Reporter: Lewis John McGibbney
Priority: Blocker
Fix For: 6.1
It takes me quite some time to debug what was going on and why pipeline's were
failing when using the berkeley aligner.
It turns out that the word-align.conf template provided at
https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf
is not compatible with the berkeley aligner.
In particular the following lines are non compatible
https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf#L12-L15
Evidence of this is provided below
{code}
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1, HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'JOINT JOINT'; valid choices: FORWARD|REVERSE|BOTH_INDEP|JOINT
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Exception in thread "main" java.lang.NumberFormatException: For input string:
"5 5"
at
java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
at java.lang.Integer.parseInt(Integer.java:580)
at java.lang.Integer.parseInt(Integer.java:615)
at
edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:143)
at
edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:240)
at edu.berkeley.nlp.fig.basic.OptInfo.set(OptionsParser.java:294)
at
edu.berkeley.nlp.fig.basic.OptionsParser.readOptionsFile(OptionsParser.java:555)
at
edu.berkeley.nlp.fig.basic.OptionsParser.doParse(OptionsParser.java:604)
at edu.berkeley.nlp.fig.exec.Execution.init(Execution.java:293)
at edu.berkeley.nlp.wordAlignment.Main.main(Main.java:149)
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g
-jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar
++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Cannot create directory: alignments/0
{code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)