Hi,

I'm having a strange problem that moses crashes when fed with a
lattice that has non-ASCII characters in it. If the input is:

((('damit',1.0,1),),(('ist',1.0,1),('war',1.0,1),('sei',1.0,1),),(('der',1.0,1),('die',1.0,1),('das',1.0,1),),(('arbeitsplan',1.0,1),),)

then moses completes without any problems. However, if I change the
last edge into Chinese,

((('damit',1.0,1),),(('ist',1.0,1),('war',1.0,1),('sei',1.0,1),),(('der',1.0,1),('die',1.0,1),('das',1.0,1),),(('给',1.0,1),),)

.. then moses crashes with the following error:

...
...

Translating: word lattice: 4
0 -- (damit , 0.000, 1)
1 -- (ist , 0.000, 1) (war , 0.000, 1) (sei , 0.000, 1)
2 -- (der , 0.000, 1) (die , 0.000, 1) (das , 0.000, 1)
3 -- (给 , 0.000, 1)

path stats for current CN:
CN (full):     8.000 15.000 18.000 9.000
CN (explored): 1 0 0 0
ERROR: call to ConfusionNet::GetSubString

My command is:

./???/moses-cmd/src/moses \
    -inputtype 2 -weight-i 0 \
    -config `pwd`/moses.ini \
    -input-file `pwd`/src.lattice \
    -verbose 3

And below is my moses.ini, although there's nothing particularly
interesting about it:

# MERT optimized configuration
# decoder /???/moses-cmd/src/moses
# BLEU 0.517812 -> 0.517945 on dev /???/working/dev.zh
# We were before running iteration 7
# finished Tue Dec 22 02:44:42 SGT 2009
### MOSES CONFIG FILE ###
#########################

# input factors
[input-factors]
0

# mapping steps
[mapping]
0 T 0

# translation tables: source-factors, target-factors, number of scores, file
[ttable-file]
0 0 5 /my-working-directory/working/model/phrase-table.gz

# no generation models, no generation-file section

# language models: type(srilm/irstlm), factors, order, file
[lmodel-file]
0 0 5 /???/en.lm


# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20

# distortion (reordering) files
[distortion-file]
0-0 monotonicity-bidirectional-f 4 /???/working/model/reordering-table.gz

# distortion (reordering) weight
[weight-d]
0.008227
-0.085606
0.024868
0.064056
0.074998

# language model weights
[weight-l]
0.218681

# translation model weights
[weight-t]
0.042811
0.084593
0.147177
0.045780
0.073527

# no generation models, no weight-generation section

# word penalty
[weight-w]
-0.129677

[distortion-limit]
6

[drop-unknown]
1

Any help is greatly appreciated.

Regards,
Liu Chang
National University of Singapore

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to