Hi Barry, I'm trying to figure out how to convert the old phrase table into the new format and I have a short question. Here's how my current phrase table looks like: ... - a c o l o ||| a k o l o ||| () (0) (1) (2) (3) (4) ||| (1) (2) (3) (4) (5) ||| 0.0117647 0.0101026 1 0.621085 2.718 - a c o l ||| a k o l ||| () (0) (1) (2) (3) ||| (1) (2) (3) (4) ||| 0.00378788 0.0101027 1 0.654066 2.718 ...
I've read about the new format and I think I should transform this lines into: ... - a c o l o ||| a k o l o ||| 1-0 2-1 3-2 4-3 5-4 ||| 0-1 1-2 2-3 3-4 4-5 ||| 0.0117647 0.0101026 1 0.621085 2.718 - a c o l ||| a k o l ||| 1-0 2-1 3-2 4-3 ||| 0-1 1-2 2-3 3-4 ||| 0.00378788 0.0101027 1 0.654066 2.718 ... Is this right or am I getting something wrong? Thanks, Horia On 16 December 2011 11:39, Barry Haddow <[email protected]> wrote: > Hi Horia > > The phrase table format has changed so you might have to recreate or > convert > it. You should first try changing the phrase-table line in your moses.ini > from > > 0 0 5 <ttable-path> > > to > > 0 0 0 5 <ttable-path> > > The leading '0' signifies that the table is in text format. The new text > format > for phrase tables is given here > http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases > so you should be able to convert your old one with a simple script, > > cheers - Barry > > On Thursday 15 December 2011 14:05:09 Horia Cucu wrote: > > Hi, > > > > I have a phrase table created with a version of moses compiled in 2007. > > I've just installed moses on my computer and tried to decode using this > old > > phrase table and I'm stuck here: > > > > ... > > ... > > Loading lexical distortion models...have 0 models > > Start loading LanguageModel > > /home/cucu/speechRoot/tools/wordsPhonetization/dist/dict.train.ph.lm : > > [0.000] seconds > > Finished loading LanguageModels : [0.000] seconds > > Warning: Phrase table specification in old 4-field format. Assuming > binary > > phrase tables (type 1)! > > Start loading PhraseTable > > > /home/cucu/speechRoot/tools/wordsPhonetization/filteredTable/phrase-table.0 > > -0.1 > > > > : [0.000] seconds > > > > filePath: > > > /home/cucu/speechRoot/tools/wordsPhonetization/filteredTable/phrase-table.0 > > -0.1 Finished loading phrase tables : [0.000] seconds > > IO from STDOUT/STDIN > > Created input-output object : [0.000] seconds > > Translating line 0 in thread id 3074714480 > > Translating: c a s ă > > > > bin ttable does not exist -> create it > > distinct source phrases: 19 distinct first words of source phrases: 8 > > number of phrase pairs (line count): 41 > > reading bin ttable > > size of OFF_T 8 > > binary phrasefile loaded, default OFF_T: -1 > > Check weightT.size() == scoreVector.size() failed in > > moses/src/TargetPhrase.cpp:155 > > ./translateWordList.sh: line 16: 21578 Aborted moses -f > > filteredTable/moses.ini < $WORDLIST.gr > $WORDLIST.ph > > > > any ideas? > > horia > > > > -- > Barry Haddow > University of Edinburgh > +44 (0) 131 651 3173 > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > >
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
