Hi all,

I am a Master's student and I have a problem with sentence alignment. I extracted some Persian-English data that is comparable rather than parallel. Each Persian sentence is paired with its 50 most similar sentences on the English side. For each of these English sentences, I now want to find the log probability of its alignment with the Persian sentence, and the number of aligned and unaligned words.

Example:

Persian sentence A    English sentence 1
Persian sentence A    English sentence 2
Persian sentence A    English sentence 3
...
Persian sentence A    English sentence 50

I know GIZA++ is suitable for word alignment, but I have only used it on a parallel corpus. How can I get the alignment log probability and the number of aligned/unaligned words for data like this?

Best regards.
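P.S. To make the question concrete, here is a minimal Python sketch of the two quantities I am after. It is only an illustration under assumptions, not something GIZA++ produces directly: it assumes the word alignment of a pair is available as 0-based "i-j" links (Persian index, then English index), and that a lexical translation table t(e|f) is available as a dictionary. The function names, the `t_table` layout, and the `floor` value for unseen word pairs are all hypothetical. The score is the IBM Model 1 log probability of the English sentence given the Persian one, with the constant length term left out.

```python
import math


def count_aligned(links, src_len, tgt_len):
    """Count aligned/unaligned words on both sides of one sentence pair.

    links: string of 0-based "i-j" pairs (source index, target index),
           assumed format, e.g. "0-0 1-2 1-1".
    """
    src_aligned, tgt_aligned = set(), set()
    for link in links.split():
        i, j = link.split("-")
        src_aligned.add(int(i))
        tgt_aligned.add(int(j))
    return {
        "src_aligned": len(src_aligned),
        "src_unaligned": src_len - len(src_aligned),
        "tgt_aligned": len(tgt_aligned),
        "tgt_unaligned": tgt_len - len(tgt_aligned),
    }


def model1_logprob(src_words, tgt_words, t_table, floor=1e-10):
    """IBM Model 1 log P(target | source), up to the constant length term.

    t_table: hypothetical dict mapping (target_word, source_word) to t(e|f).
    floor:   assumed smoothing value for target words with no table entry.
    """
    src = ["NULL"] + list(src_words)  # Model 1 adds an empty source word
    logp = -len(tgt_words) * math.log(len(src))
    for e in tgt_words:
        total = sum(t_table.get((e, f), 0.0) for f in src)
        logp += math.log(max(total, floor))
    return logp
```

For one Persian sentence, the idea would be to call both functions once for each of its 50 English candidates and compare the resulting scores and counts.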
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
