Hello all, I have a rather strange request. Does anyone know of any papers (or impementations) on bag-of-words language models ? That is, a language model which does not take into account the order in which the words appear in an ngram, so if you have the string 'police chief of' in your model, you will get a result for both 'chief of police' and 'police chief of'. I have thought of using IRSTLM or some generic model and scoring all the permutations, but wondered if there was a more efficient implementation already in existence. I have searched without much luck in Google, but perhaps I am searching with the wrong words.
Best regards, Fran _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
