Hi Hiue,
Unfortunately, I use the new scripts to create phrase tables.
The hierarchical model has the new format.
[X][X] 五 百 [X] ||| [X][X] five hundred [X] ||| 0.944295
0.44408 0.595991 0.478547 2.718 ||| 0-0 ||| 12.3925 19.6349
The phrase-based model also has the new format, although it does not
contain the alignment information.
五 百 日元 ||| five hundred yen ||| 0.933333 0.429554 0.7
0.372403 2.718 ||| ||| 15 20
I also checkout the latest svn trunk. Despite a little modification of
the configuration for the EMS (e.g. rename mert-moses-new.pl to
mert-moses.pl), it also fails to decode an input sentence using a
hierarchical model. Still, a phrase-based model has no problem.
I don't find the source of the problem yet.
--
Hwidong Na <[email protected]>
KLE lab, POSTECH, KOREA
2010-08-11 (수), 10:01 +0100, Hieu Hoang:
> Hi Hwidong
>
> Can you send me a few lines of the phrase table for the hierarchical model.
>
> The 1 change that we've done in the last few weeks which is breaking
> previous experiments is the phrase table format has changed from
> source ||| target ||| [alignment] ||| scores ||| [count]
> to
> source ||| target ||| scores ||| [alignment] ||| [count]
> eg.
> [X][DT] [X][NN] [X] ||| [X][DT] [X][NN] [NP] ||| 0-0 1-1 ||| 1.0
> ||| 5442.66 323.246
> to
> [X][DT] [X][NN] [X] ||| [X][DT] [X][NN] [NP] ||| 1.0 ||| 0-0 1-1
> ||| 5442.66 323.246
>
> to make the code easier & more consistent for the phrase-based and
> hierarchical models. If you're using phrase tables created by the old
> code, it may not work with the new release. But hopefully, it's easy to
> change. Sorry about that
>
>
> On 11/08/2010 04:34, Hwidong Na wrote:
> > Hi all,
> >
> > I run the new version with the EMS. I get an error when I try a
> > hierarchical model. The configuration for the hierarchical model works
> > in the previous revision (r3284).
> >
> > $SCRIPTS_ROOTDIR/ems/experiment.perl -config config.hierarchical
> >
> > ...ellipsis...
> > executing /home00/leona/IWSLT10.zh-en/steps/3/TUNING_tune.3 via
> > sh (1)
> > number of steps doable or running: 1
> > step TUNING:tune crashed
> > number of steps doable or running: 0
> >
> > The running script of TUNING:tune is
> >
> >
> > /home/leona/moses-2010-08-09/bin/scripts-20100810-1203//training/mert-moses-new.pl
> > -mertdir=/home/leona/moses-2010-08-09/mert
> > /home00/leona/IWSLT10.zh-en/tuning/input.txt.1
> > /home00/leona/IWSLT10.zh-en/tuning/reference.tok.1.ref
> > /home/leona/moses-2010-08-09/moses-chart-cmd/src/moses_chart
> > /home00/leona/IWSLT10.zh-en/model/moses.ini.2 --nbest 100 --working-dir
> > /home00/leona/IWSLT10.zh-en/tuning/tmp.3 --decoder-flags " -v 0 "
> > --rootdir /home/leona/moses-2010-08-09/bin/scripts-20100810-1203/
> > --no-filter-phrase-table
> >
> > In the STDERR of TUNING:tune,
> >
> > After default: -l mem_free=0.5G -hard
> > Using
> > SCRIPTS_ROOTDIR:
> > /home/leona/moses-2010-08-09/bin/scripts-20100810-1203/
> > Executing: mkdir -p /home00/leona/IWSLT10.zh-en/tuning/tmp.3
> > Saved: ./run1.moses.ini
> > Normalizing lambdas: 0 1 1 0.3 0.2 0.3 0.2 0 0.3
> > DECODER_CFG = -w %.6f -lm %.6f -d %.6f -tm %.6f %.6f %.6f %.6f
> > %.6f %.6f
> > values = 0 0.303030303030303 0.303030303030303
> > 0.0909090909090909 0.0606060606060606 0.0909090909090909
> > 0.0606060606060606 0 0.0909090909090909
> > Executing:
> > /home/leona/moses-2010-08-09/moses-chart-cmd/src/moses_chart -v 0
> > -config /home00/leona/IWSLT10.zh-en/model/moses.ini.2 -inputtype 0 -w
> > 0.000000 -lm 0.303030 -d 0.303030 -tm 0.090909 0.060606 0.090909 0.060606
> > 0.000000 0.090909 -n-best-list run1.best100.out 100 -i
> > /home00/leona/IWSLT10.zh-en/tuning/input.txt.1> run1.out
> > /home00/leona/IWSLT10.zh-en/lm/iwslt10.lm.1: line 23: warning:
> > non-zero probability for<unk> in closed-vocabulary LM
> > Start loading new format pt model : [0.000] seconds
> > Start loading new format pt model : [70.000] seconds
> > [0..0]= [1..1]= [2..2]= [3..3]= [4..4]= [5..5]= [6..6]= [7..7]=
> > [8..8]= [9..9]= [10..10]= [11..11]= [12..12]= [13..13]=
> > [14..14]= [15..15]= [16..16]= [17..17]= [0..1]= [1..2]= [2..3]=
> > [3..4]= [4..5]= [5..6]= [6..7]= [7..8]= [8..9]= [9..10]=
> > [10..11]=
> > ...ellipsis...
> > [10..11]= [11..12]= [12..13]= [13..14]= [0..2]= [1..3]= [2..4]=
> > [3..5]= [4..6]= [5..7]= [6..8]= [7..9]= [8..10]= [9..11]=
> > [10..12]= [11..13]= [12..14]= [0..3]= [1..4]= [2..5]= [3..6]=
> > [4..7]=Segmentation fault
> > Exit code: 139
> >
> > The decoder died. CONFIG WAS -w 0.000000 -lm 0.303030 -d
> > 0.303030 -tm 0.090909 0.060606 0.090909 0.060606 0.000000
> > 0.090909
> >
> > By the way, the new version works with EMS when I try a phrase-based
> > model with a lexicalized reordering model. Here are differences in
> > configurations.
> >
> > $diff config.hierarchical config.moseslex
> > 36c36
> > < decoder = $moses-src-dir/moses-chart-cmd/src/moses_chart
> > ---
> > > decoder = $moses-src-dir/moses-cmd/src/moses
> > 266c266
> > < #lexicalized-reordering = msd-bidirectional-fe
> > ---
> > > lexicalized-reordering = msd-bidirectional-fe
> > 270c270
> > < hierarchical-rule-set = true
> > ---
> > > #hierarchical-rule-set = true
> >
> > What is the problem?
>
>
>
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support