Hi, All, Many thanks for your replies and for sharing the information. It's been very helpful :) .
Paul, So far, I only have about 70,000 word data to train the Chinese language model. I also have the problem of lack of bitext since I am doing domain-specific SMT. Huiling -- For MT-List info, see http://www.eamt.org/mt-list.html
