I'm not sure whether this is a decoder question or something else, but here is my issue.
Say I have a text string containing two sentences. The first ends with a period, but there is no CRLF after it, only a space before the second sentence.

When I pass the full string through the pipeline (tokenizer + truecaser + moses + detruecase + detokenizer), the output is a single sentence: the period at the end of the first sentence has been dropped, and the result is nonsense (or at least not good at all).

If I insert a CRLF right after the period of the first sentence and send the whole thing through the pipeline, the output is correct.

Am I missing something? Should we only send strings to moses segment by segment?

Thanks,
Vincent
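For illustration, the workaround described above (putting each sentence on its own line before it enters the pipeline) could be sketched roughly as follows. This is only a naive example and not part of Moses: the function names `split_sentences` and `to_one_sentence_per_line` are hypothetical, and the boundary rule (sentence-ending punctuation, then whitespace, then an uppercase letter, digit, or quote) is an assumption that will mis-split abbreviations such as "Dr. Smith".

```python
import re

# Assumed boundary rule, for illustration only: ., ! or ? followed by
# whitespace and then an uppercase letter, a digit, or a quote character.
_BOUNDARY = re.compile(r'(?<=[.!?])\s+(?=[A-Z0-9"\'])')


def split_sentences(text):
    """Return the sentences found in text, one per list element."""
    return [part.strip() for part in _BOUNDARY.split(text) if part.strip()]


def to_one_sentence_per_line(text):
    """Join the sentences with newlines so each one sits on its own line."""
    return "\n".join(split_sentences(text)) + "\n"


if __name__ == "__main__":
    raw = "This is the first sentence. This is the second one."
    # Each sentence is written on its own line, ready to be piped onward.
    print(to_one_sentence_per_line(raw), end="")
```

The output of such a script could then be fed to the tokenizer as the first stage of the pipeline, so that every later stage sees one segment per line.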
