Continuing on a similar thread. moses.ini's [lmodel-file] section and the LM packages refer to the n-gram length as "-order", while elsewhere on the Moses web site it is referred to as "size" (below).

web site:
0 <factor> <size> filename.srilm

moses.ini:
# language models: type(srilm/irstlm), factors, order, file
[lmodel-file]
0 0 3 /lm/filename.srilm

Does this third position (<size>, order and 3) in the .ini file refer to the -order or -n(gram) count when training the srilm & irstlm models?

Thanks,
Tom

On Wed, 06 Oct 2010 07:26:01 -0700, <[email protected]> wrote:
> Thanks guys. Your comments are great and fixed a problem I created for
> myself.
>
> Tom
>
>
> On Wed, 6 Oct 2010 11:41:11 +0100, Miles Osborne <[email protected]>
> wrote:
>> the phrase length refers to the number of words in a phrase and the
>> number of scores to the number of feature function, per phrase.
>>
>> they have nothing to do with each other
>>
>> On 6 October 2010 11:31, <[email protected]> wrote:
>>> I found this message below, which mentions the topic, but leaves my
>>> question unanswered.
>>>
>>> The train-model.perl script has an option called "max-phrase-length".
>>> Documentation shows its default is 7.
>>>
>>> The processPhraseTable binarizer has an option called -nscores that
>>> refers to "number of scores". The moses binary's fourth numeric option
>>> in moses.ini's [ttable-file] section is also "number of scores".
>>> Documentation and the message below define a default of 5.
>>>
>>> Are the "max-phrase-length" and "number of scores" values the same? If
>>> not the same, is there a connection and if so, what is it? If there's no
>>> connection, What criteria should one choose when setting "number of
>>> scores" and what the consequence of changing it from the default of 5?
>>>
>>> Thanks,
>>> Tom
>>>
>>>
>>> On Fri, 25 Jun 2010 18:14:07 +0100, Philipp Koehn <[email protected]>
>>> wrote:
>>>> Hi,
>>>>
>>>> something has gone awry in your use of the binarizer.
>>>>
>>>> A typical way to call the binarizer is:
>>>>
>>>> LC_ALL=C sort phrase-table | ~/bin/processPhraseTable -ttable 0 0 -
>>>> -nscores 5 -out phrase-table &
>>>>
>>>> -nscores refers to the number of scores in the phrase translation table
>>>> which are by default 5.
>>>>
>>>> -phi
>>>>
>>>> On Fri, Jun 25, 2010 at 5:45 PM, Cyrine NASRI <[email protected]>
>>>> wrote:
>>>>>
>>>>> Good morning everybody
>>>>> I dont understand the meaning of -nscores 5
>>>>> When i make the command wich Binaryze the Phrase Tables, a message
>>>>> appears to me processing ptree for 5
>>>>> Can't read 5
>>>>>
>>>>> Thank you very much
>>>>>
>>>>> PS : i'm not english so please excuse me for the very bad english
>>>>> wich i write
>>>>> Cyrine
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Moses-support mailing list
>>>>> [email protected]
>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
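
For reference, here is a minimal Python sketch of how the settings discussed in this thread can be read off the files. It reads the third field of an [lmodel-file] line as the n-gram order, following the "type, factors, order, file" comment in moses.ini; whether that value has to match the -order / -n value used at LM training time is an assumption here, not something the thread confirms. It also assumes a plain-text phrase table whose fields are separated by "|||" with the scores in the third field. The function names and the sample phrase-table line are made up for illustration.

```python
from typing import NamedTuple, Tuple


class LModelEntry(NamedTuple):
    lm_type: int  # LM implementation code (srilm/irstlm, per the moses.ini comment)
    factor: int   # factor the LM is applied to
    order: int    # assumed to be the n-gram order of the model
    path: str     # path to the LM file


def parse_lmodel_line(line: str) -> LModelEntry:
    """Split one [lmodel-file] line into type, factor, order and file."""
    lm_type, factor, order, path = line.split(maxsplit=3)
    return LModelEntry(int(lm_type), int(factor), int(order), path)


def phrase_table_stats(line: str) -> Tuple[int, int]:
    """Return (source phrase length in words, number of scores) for one line.

    Assumes the layout 'source ||| target ||| scores' (hypothetical here).
    """
    fields = [field.strip() for field in line.split("|||")]
    source_words = fields[0].split()
    scores = fields[2].split()
    return len(source_words), len(scores)


# The [lmodel-file] line quoted at the top of this message.
entry = parse_lmodel_line("0 0 3 /lm/filename.srilm")
print(entry.order)

# Made-up phrase-table line: a two-word source phrase carrying five scores.
sample = "das haus ||| the house ||| 0.8 0.7 0.9 0.6 2.718"
print(phrase_table_stats(sample))
```

The second function illustrates Miles's point: the phrase length (what max-phrase-length limits) is a word count, while the number of scores (what -nscores describes) is a count of feature values per phrase pair, so the two vary independently.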
