Hello Mailing List,

1. How should I go about building a Hindi-to-English parallel corpus?
2. How should I do tokenization and sentence alignment?
3. Do we need a dictionary/lexicon?
4. Do we also need to do parsing?
5. My understanding was that a bilingual corpus is just a text file with each sentence in the source language alongside the corresponding sentence in the target language.
6. I don't want to spend a lot of time on this. Is there a faster way of building a parallel corpus?
7. How big should the corpus be to get relatively accurate results?

Thanks a lot in advance.

Regards,
Vineet
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
