Hieu Hoang schreef op de 17e dag van de lentemaand van het jaar 2014:

> can you send me me the exact command you executed, the moses.ini file used,
> and a few lines of the input file

    Here is the training command:

en_word=0
en_root=1
en_pos=2
en_rel=3
nl_word=0
nl_root=1
nl_pos=2
nl_postag=3
nl_rel=4  
/net/aps/64/opt/moses/mosesdecoder/scripts/training/train-model.perl \
  -cores 12 \
  -root-dir train \
  -corpus /net/aistaff/kleiweg/moses/europarl/factored/corpus/train-small.clean 
\
  -f en -e nl \
  -alignment grow-diag-final-and \
  -reordering msd-bidirectional-fe \
  --lm $nl_word:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-word:8 
\
  --lm $nl_root:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-root:8 
\
  --lm $nl_pos:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-pos:8 \
  --lm 
$nl_postag:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-postag:8 \
  --lm $nl_rel:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-rel:8 \
  --alignment-factors $en_root-$nl_root \
  --translation-factors 
$en_word-$nl_word+$en_root-$nl_root+$en_pos-$nl_pos+$en_pos-$nl_postag+$en_rel-$nl_rel
 \
  --reordering-factors 
$en_word-$nl_word+$en_root-$nl_root+$en_pos-$nl_pos+$en_pos-$nl_postag+$en_rel-$nl_rel
 \
  --generation-factors $nl_root,$nl_pos,$nl_postag,$nl_rel-$nl_word \
  --decoding-steps t0:t1,t2,t3,t4,g0 \
  -external-bin-dir /net/aps/64/opt/moses/mgizapp/bin \
  -mgiza \
  -mgiza-cpus=12 \
   > training.out \
   2> training.err

    Here are three lines of training data, input:

I|I|prp|nsubj declare|declare|vbp|hd resumed|resume|vbd|hd the|the|dt|det 
session|session|nn|hd of|of|in|hd the|the|dt|det European|European|nnp|nn 
Parliament|Parliament|nnp|hd adjourned|adjourn|vbn|hd on|on|in|hd 
Friday|Friday|nnp|hd 17|17|cd|num December|December|nnp|hd 1999|1999|cd|num 
,|,|,|punct and|and|cc|cc I|I|prp|nsubj would|would|md|aux like|like|vb|hd 
once|once|rb|advmod again|again|rb|hd to|to|to|aux wish|wish|vb|hd 
you|you|prp|nsubj a|a|dt|det happy|happy|jj|amod new|new|jj|amod 
year|year|nn|hd in|in|in|hd the|the|dt|det hope|hope|nn|hd that|that|in|mark 
you|you|prp|nsubj enjoyed|enjoy|vbd|hd a|a|dt|det pleasant|pleasant|jj|amod 
festive|festive|jj|amod period|period|nn|hd .|.|.|punct

although|although|in|mark ,|,|,|punct as|as|in|mark you|you|prp|nsubj 
will|will|md|aux have|have|vb|aux seen|see|vbn|hd ,|,|,|punct the|the|dt|det 
dreaded|dreaded|jj|amod '|'|"|punct millennium|millennium|nn|nn 
bug|bug|nn|hd '|'|pos|punct failed|fail|vbn|hd to|to|to|hd 
materialise|materialise|nn|pobj ,|,|,|punct still|still|rb|advmod 
the|the|dt|det people|people|nns|hd in|in|in|hd a|a|dt|det number|number|nn|hd 
of|of|in|hd countries|country|nns|pobj suffered|suffer|vbd|hd a|a|dt|det 
series|series|nn|hd of|of|in|hd natural|natural|jj|amod 
disasters|disaster|nns|hd that|that|wdt|nsubj truly|truly|rb|advmod 
were|be|vbd|cop dreadful|dreadful|jj|hd .|.|.|punct

you|you|prp|nsubj have|have|vbp|aux requested|request|vbn|hd a|a|dt|det 
debate|debate|nn|hd on|on|in|hd this|this|dt|det subject|subject|nn|hd 
in|in|in|hd the|the|dt|det course|course|nn|hd of|of|in|hd the|the|dt|det 
next|next|jj|amod few|few|jj|amod days|day|nns|hd ,|,|,|punct 
during|during|in|hd this|this|dt|det part-session|part-session|nn|hd .|.|.|punct

    And three lines of training data, target:

ik|ik|pron|VNW(pers,pron,nomin,vol,1,ev)|su 
verklaar|verklaar|verb|WW(pv,tgw,ev)|hd de|de|det|LID(bep,stan,rest)|det 
zitting|zitting|noun|N(soort,ev,basis,zijd,stan)|hd van|van|prep|VZ(init)|hd 
het|het|det|LID(bep,stan,evon)|det Europees|Europees|name|SPEC(deeleigen)|mwp 
Parlement|Parlement|name|SPEC(deeleigen)|mwp ,|,|punct|LET()|-- 
die|die|pron|VNW(betr,pron,stan,vol,persoon,getal)|rhd op|op|prep|VZ(init)|hd 
vrijdag|vrijdag|noun|N(eigen,ev,basis,zijd,stan)|hd 
17|17|noun|TW(hoofd,vrij)|mwp december|december|noun|TW(hoofd,vrij)|mwp 
werd|word|verb|WW(pv,verl,ev)|hd 
onderbroken|onderbreek|verb|WW(vd,vrij,zonder)|hd ,|,|punct|LET()|-- 
te|te|comp|VZ(init)|cmp zijn|ben|verb|WW(inf,vrij,zonder)|hd 
hervat|hervat|verb|WW(vd,vrij,zonder)|vc .|.|punct|LET()|-- 
ik|ik|pron|VNW(pers,pron,nomin,vol,1,ev)|su wens|wens|verb|WW(pv,tgw,ev)|hd 
u|u|pron|VNW(pers,pron,nomin,vol,2b,getal)|hd allen|allen|adv|BW()|mod 
een|een|det|LID(onbep,stan,agr)|det 
gelukkig|gelukkig|adj|ADJ(prenom,basis,zonder!
 )|mod nieuwjaar|nieuwjaar|noun|N(soort,ev,basis,onz,stan)|hd 
en|en|vg|VG(neven)|crd hoop|hoop|verb|WW(pv,tgw,ev)|hd 
dat|dat|comp|VG(onder)|cmp u|u|pron|VNW(pers,pron,nomin,vol,2b,getal)|su 
een|een|det|LID(onbep,stan,agr)|det 
goede|goed|adj|ADJ(prenom,basis,met-e,stan)|mod 
vakantie|vakantie|noun|N(soort,ev,basis,zijd,stan)|hd 
heeft|heb|verb|WW(pv,tgw,met-t)|hd gehad|heb|verb|WW(vd,vrij,zonder)|hd 
.|.|punct|LET()|--

zoals|zoals|comp|VG(onder)|cmp u|u|pron|VNW(pers,pron,nomin,vol,2b,getal)|su 
heeft|heb|verb|WW(pv,tgw,met-t)|hd kunnen|kan|verb|WW(inf,vrij,zonder)|hd 
constateren|constateer|verb|WW(inf,vrij,zonder)|hd ,|,|punct|LET()|-- 
is|ben|verb|WW(pv,tgw,ev)|hd de|de|det|LID(bep,stan,rest)|det 
grote|groot|adj|ADJ(prenom,basis,met-e,stan)|mod "|"|punct|LET()|-- 
millenniumbug|millenniumbug|name|N(eigen,ev,basis,zijd,stan)|hd 
"|"|punct|LET()|-- uitgebleven|blijf_uit|verb|WW(vd,vrij,zonder)|hd 
.|.|punct|LET()|-- de|de|det|LID(bep,stan,rest)|det 
burgers|burger|noun|N(soort,mv,basis)|hd van|van|prep|VZ(init)|hd 
een|een|det|LID(onbep,stan,agr)|det 
aantal|aantal|noun|N(soort,ev,basis,onz,stan)|hd van|van|prep|VZ(init)|hd 
onze|ons|det|VNW(bez,det,stan,vol,1,mv,prenom,met-e,rest)|det 
lidstaten|lidstaat|noun|N(soort,mv,basis)|hd zijn|ben|verb|WW(pv,tgw,mv)|hd 
daarentegen|daarentegen|adv|BW()|mod door|door|prep|VZ(init)|hd 
verschrikkelijke|verschrikkelijk|adj|ADJ(prenom,basis,me!
 t-e,stan)|mod natuurrampen|natuur_ramp|noun|N(soort,mv,basis)|hd 
getroffen|tref|verb|WW(vd,vrij,zonder)|hd .|.|punct|LET()|--

u|u|pron|VNW(pers,pron,nomin,vol,2b,getal)|su 
heeft|heb|verb|WW(pv,tgw,met-t)|hd 
aangegeven|geef_aan|verb|WW(vd,vrij,zonder)|hd dat|dat|comp|VG(onder)|cmp 
u|u|pron|VNW(pers,pron,nomin,vol,2b,getal)|su 
deze|deze|det|VNW(aanw,det,stan,prenom,met-e,rest)|det 
vergaderperiode|vergader_periode|noun|N(soort,ev,basis,zijd,stan)|hd 
een|een|det|LID(onbep,stan,agr)|det 
debat|debat|noun|N(soort,ev,basis,onz,stan)|hd 
wilt|wil|verb|WW(pv,tgw,met-t)|hd over|over|prep|VZ(init)|hd 
deze|deze|det|VNW(aanw,det,stan,prenom,met-e,rest)|det 
rampen|ramp|noun|N(soort,mv,basis)|hd .|.|punct|LET()|--

    Here is the moses.ini created by the training:

#########################
### MOSES CONFIG FILE ###
#########################

# input factors
[input-factors]
0
1
2
3

# mapping steps
[mapping]
0 T 0
1 T 1
1 T 2
1 T 3
1 T 4
1 G 0

[distortion-limit]
6

# feature functions
[feature]
UnknownWordPenalty
WordPenalty
PhrasePenalty
PhraseDictionaryMemory name=TranslationModel0 table-limit=20 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.0-0.gz
 input-factor=0 output-factor=0
PhraseDictionaryMemory name=TranslationModel1 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.1-1.gz
 input-factor=1 output-factor=1
PhraseDictionaryMemory name=TranslationModel2 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-2.gz
 input-factor=2 output-factor=2
PhraseDictionaryMemory name=TranslationModel3 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-3.gz
 input-factor=2 output-factor=3
PhraseDictionaryMemory name=TranslationModel4 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.3-4.gz
 input-factor=3 output-factor=4
Generation name=GenerationModel0 num-features=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/generation.1,2,3,4-0.gz
 input-factor=1,2,3,4 output-factor=0
LexicalReordering name=LexicalReordering0 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=0 output-factor=0 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.0-0.wbe-msd-bidirectional-fe.gz
LexicalReordering name=LexicalReordering1 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=1 output-factor=1 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.1-1.wbe-msd-bidirectional-fe.gz
LexicalReordering name=LexicalReordering2 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=2 output-factor=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.2-2.wbe-msd-bidirectional-fe.gz
LexicalReordering name=LexicalReordering3 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=2 output-factor=3 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.2-3.wbe-msd-bidirectional-fe.gz
LexicalReordering name=LexicalReordering4 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=3 output-factor=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.3-4.wbe-msd-bidirectional-fe.gz
Distortion
KENLM lazyken=0 name=LM0 factor=0 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-word order=3
KENLM lazyken=0 name=LM1 factor=1 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-root order=3
KENLM lazyken=0 name=LM2 factor=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-pos order=3
KENLM lazyken=0 name=LM3 factor=3 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-postag order=3
KENLM lazyken=0 name=LM4 factor=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-rel order=3

# dense weights for feature functions
[weight]
UnknownWordPenalty0= 1
WordPenalty0= -1
PhrasePenalty0= 0.2
TranslationModel0= 0.2 0.2 0.2 0.2
TranslationModel1= 0.2 0.2 0.2 0.2
TranslationModel2= 0.2 0.2 0.2 0.2
TranslationModel3= 0.2 0.2 0.2 0.2
TranslationModel4= 0.2 0.2 0.2 0.2
GenerationModel0= 0.3 0
LexicalReordering0= 0.3 0.3 0.3 0.3 0.3 0.3
LexicalReordering1= 0.3 0.3 0.3 0.3 0.3 0.3
LexicalReordering2= 0.3 0.3 0.3 0.3 0.3 0.3
LexicalReordering3= 0.3 0.3 0.3 0.3 0.3 0.3
LexicalReordering4= 0.3 0.3 0.3 0.3 0.3 0.3
Distortion0= 0.3
LM0= 0.5
LM1= 0.5
LM2= 0.5
LM3= 0.5
LM4= 0.5

    This is the command I used:

moses -f train/model/moses.ini

    And this is the output of that command, until it exits,
    before any actual input:

Defined parameters (per moses.ini or switch):
        config: train/model/moses.ini 
        distortion-limit: 6 
        feature: UnknownWordPenalty WordPenalty PhrasePenalty 
PhraseDictionaryMemory name=TranslationModel0 table-limit=20 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.0-0.gz
 input-factor=0 output-factor=0 PhraseDictionaryMemory name=TranslationModel1 
table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.1-1.gz
 input-factor=1 output-factor=1 PhraseDictionaryMemory name=TranslationModel2 
table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-2.gz
 input-factor=2 output-factor=2 PhraseDictionaryMemory name=TranslationModel3 
table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-3.gz
 input-factor=2 output-factor=3 PhraseDictionaryMemory name=TranslationModel4 
table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/trai!
 n/model/phrase-table.3-4.gz input-factor=3 output-factor=4 Generation 
name=GenerationModel0 num-features=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/generation.1,2,3,4-0.gz
 input-factor=1,2,3,4 output-factor=0 LexicalReordering name=LexicalReordering0 
num-features=6 type=wbe-msd-bidirectional-fe-allff input-factor=0 
output-factor=0 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.0-0.wbe-msd-bidirectional-fe.gz
 LexicalReordering name=LexicalReordering1 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=1 output-factor=1 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.1-1.wbe-msd-bidirectional-fe.gz
 LexicalReordering name=LexicalReordering2 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=2 output-factor=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.2-2.wbe-msd-bidirectional-fe.gz
 LexicalReordering!
  name=LexicalReordering3 num-features=6 type=wbe-msd-bidirectional-fe-
allff input-factor=2 output-factor=3 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.2-3.wbe-msd-bidirectional-fe.gz
 LexicalReordering name=LexicalReordering4 num-features=6 
type=wbe-msd-bidirectional-fe-allff input-factor=3 output-factor=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/reordering-table.3-4.wbe-msd-bidirectional-fe.gz
 Distortion KENLM lazyken=0 name=LM0 factor=0 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-word order=3 KENLM 
lazyken=0 name=LM1 factor=1 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-root order=3 KENLM 
lazyken=0 name=LM2 factor=2 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-pos order=3 KENLM 
lazyken=0 name=LM3 factor=3 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-postag order=3 
KENLM lazyken=0 name=LM4 factor=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-rel order=3 
        input-factors: 0 1 2 3 
        mapping: 0 T 0 1 T 1 1 T 2 1 T 3 1 T 4 1 G 0 
        weight: UnknownWordPenalty0= 1 WordPenalty0= -1 PhrasePenalty0= 0.2 
TranslationModel0= 0.2 0.2 0.2 0.2 TranslationModel1= 0.2 0.2 0.2 0.2 
TranslationModel2= 0.2 0.2 0.2 0.2 TranslationModel3= 0.2 0.2 0.2 0.2 
TranslationModel4= 0.2 0.2 0.2 0.2 GenerationModel0= 0.3 0 LexicalReordering0= 
0.3 0.3 0.3 0.3 0.3 0.3 LexicalReordering1= 0.3 0.3 0.3 0.3 0.3 0.3 
LexicalReordering2= 0.3 0.3 0.3 0.3 0.3 0.3 LexicalReordering3= 0.3 0.3 0.3 0.3 
0.3 0.3 LexicalReordering4= 0.3 0.3 0.3 0.3 0.3 0.3 Distortion0= 0.3 LM0= 0.5 
LM1= 0.5 LM2= 0.5 LM3= 0.5 LM4= 0.5 
/net/aps/64/opt/moses/mosesdecoder/bin
line=UnknownWordPenalty
FeatureFunction: UnknownWordPenalty0 start: 0 end: 0
line=WordPenalty
FeatureFunction: WordPenalty0 start: 1 end: 1
line=PhrasePenalty
FeatureFunction: PhrasePenalty0 start: 2 end: 2
line=PhraseDictionaryMemory name=TranslationModel0 table-limit=20 
num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.0-0.gz
 input-factor=0 output-factor=0
FeatureFunction: TranslationModel0 start: 3 end: 6
line=PhraseDictionaryMemory name=TranslationModel1 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.1-1.gz
 input-factor=1 output-factor=1
FeatureFunction: TranslationModel1 start: 7 end: 10
line=PhraseDictionaryMemory name=TranslationModel2 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-2.gz
 input-factor=2 output-factor=2
FeatureFunction: TranslationModel2 start: 11 end: 14
line=PhraseDictionaryMemory name=TranslationModel3 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.2-3.gz
 input-factor=2 output-factor=3
FeatureFunction: TranslationModel3 start: 15 end: 18
line=PhraseDictionaryMemory name=TranslationModel4 table-limit=0 num-features=4 
path=/net/aistaff/kleiweg/moses/europarl/factored/working2/train/model/phrase-table.3-4.gz
 input-factor=3 output-factor=4
FeatureFunction: TranslationModel4 start: 19 end: 22
Exception: bitset::set



 
> 
> On 17 March 2014 17:21, Peter Kleiweg <[email protected]> wrote:
> 
> >
> > I run train-model.perl, and it ends with "(9) create moses.ini"
> > without any error.
> >
> > When I try to start moses with "moses -f moses.ini" it fails
> > with "Exception: bitset::set"
> >
> > Is this a bug? What do I do?
> > What does it even mean, bitset::set?



-- 
Peter Kleiweg
http://pkleiweg.home.xs4all.nl/
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to