Hi, I trained the conventional baseline system with translation model, lexical reordering model (wbe-msd-bidirectional-fe-allff), language model.
I've encountered the following problems: 1) When I *add* another hierarchical reordering model (hier-mslr-bidirectional-fe-allff, with 8 dense features), after tuning with MIRA, some of the weight of lexical reordering model become negative like: ... LexicalReordering0= -0.00398112195593915 0.00177901253382393 -0.00561620995293243 0.0115717603473397 0.00416629648355966 0.0123725554271622 LexicalReordering1= 0.0603959132391883 0.0637905029984998 0.0453474898750001 0.0255712011871197 0.0333243029043151 0.0360465652275341 0.0102744718438895 0.037224797723084 ... 2) When I *add* the bilingual NPLM model, after tuning with MIRA, one of the weight of translation model becomes negative like: .. NNJM0= 0.0425000449522696 TranslationModel0= 0.0528812807140059 0.0459795108631494 0.0322642254177683 -0.00103257479908177 ... I suspect there is a sign of *over-fitting* problem. Do you used to encounter this? May I seek for your advise? Thank you very much! -- Cheers, Vu
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
