Dear Marwa, Try cutting the bad data in half and then in half again, etc. to get a very small input that still suffers from the error. Then you'll probably realize what is the problem or you can at least send it to the mailing list.
Cheers, O. On October 9, 2014 2:10:12 AM CEST, Marwa Refaie <[email protected]> wrote: >How I should fix this error ?? Tokenizing didn't differ !! how to >normalize data or set sentence boundaries ??? > >Start loading text SCFG phrase table. Moses format : [1.000] >secondsReading >/cygdrive/c/mosesdecoder-master/try/ai/sep/fsmt/work/model/phrase-table. >0,1-0,1.gz----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80 >---85---90---95--100Either your data contains <s> in a position other >than the first word or your la >nguage model is missing <s>. Did you build your ARPA using IRSTLM and >forget to > run add-start-end.sh? > >Marwa N. Refaie > > > >------------------------------------------------------------------------ > >_______________________________________________ >Moses-support mailing list >[email protected] >http://mailman.mit.edu/mailman/listinfo/moses-support -- Ondrej Bojar (mailto:[email protected] / [email protected]) http://www.cuni.cz/~obo _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
