Hi, it is not entirely clear to me what you are asking here.
Moses uses plain text with one sentence per line as the data format for both the parallel corpus and the input. So you would have to convert your XML files into this format.

If there are XML tags in the plain text, such as <b>bold tags</b>, then there are a number of ways to deal with that. The easiest is to just strip them out, remember their positions, and re-insert them based on the word alignment.

-phi

On Sun, Dec 1, 2013 at 2:37 AM, Kalyani Baruah <[email protected]> wrote:
> Good day...
> My question is regarding XML. As I have said, I was using a text file as
> an input to Moses, but I was to give an XML input. In the text file I am
> having my parallel corpus, not a file with probability ratio mentioned,
> just a parallel corpus collection in two languages. So how to convert them
> to an XML file, so that I can take those two XML files as input for Moses?
>
> Regards,
>
> Kalyanee Kanchan Baruah
> Department of Information Technology,
> Institute of Science and Technology,
> Gauhati University, Guwahati, India
> Phone- +91-9706242124
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
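
P.S. To make the strip-and-re-insert idea above a bit more concrete, here is a minimal Python sketch. It is not part of Moses; the function names (strip_tags, parse_alignment, reinsert_tags) are made up for illustration, and it assumes whitespace-tokenized text, simple inline tags, and a word alignment given as "source-target" index pairs such as "0-0 1-2". Tags whose words end up unaligned are simply dropped, and self-closing tags are ignored.

```python
import re

# Matches simple opening and closing inline tags such as <b> or </b>.
TAG = re.compile(r"</?([A-Za-z][\w.-]*)[^<>]*>")

def strip_tags(line):
    """Return (tokens, spans); each span is (name, opening_tag, first, end)
    with token indices into the tag-free sentence, end exclusive."""
    tokens, spans, open_tags = [], [], []
    pos = 0
    for m in TAG.finditer(line):
        tokens.extend(line[pos:m.start()].split())
        pos = m.end()
        if m.group(0).startswith("</"):
            # Close the most recently opened tag with the same name.
            for i in range(len(open_tags) - 1, -1, -1):
                if open_tags[i][0] == m.group(1):
                    name, opening, first = open_tags.pop(i)
                    spans.append((name, opening, first, len(tokens)))
                    break
        else:
            open_tags.append((m.group(1), m.group(0), len(tokens)))
    tokens.extend(line[pos:].split())
    return tokens, spans

def parse_alignment(text):
    """Parse 'source-target' index pairs, e.g. '0-0 1-2' (assumed format)."""
    return [tuple(int(i) for i in pair.split("-")) for pair in text.split()]

def reinsert_tags(target_tokens, spans, alignment):
    """Wrap the target words aligned to each source span in the same tag."""
    opens, closes = {}, {}
    # Outer spans first, so nested tags come out in a sensible order.
    for name, opening, first, end in sorted(spans, key=lambda s: (s[2], -s[3])):
        covered = [t for s, t in alignment if first <= s < end]
        if not covered:
            continue  # nothing aligned to this span: the tag is dropped
        opens.setdefault(min(covered), []).append(opening)
        closes.setdefault(max(covered), []).insert(0, f"</{name}>")
    out = []
    for i, tok in enumerate(target_tokens):
        out.append("".join(opens.get(i, [])) + tok + "".join(closes.get(i, [])))
    return " ".join(out)

# The plain tokens are what would go to the decoder, one sentence per line.
tokens, spans = strip_tags("this is <b>bold text</b> here")
plain_line = " ".join(tokens)
```

The target sentence and the alignment would come from the translation step; given those, reinsert_tags(target_tokens, spans, parse_alignment(alignment_text)) puts each tag around the leftmost through rightmost target words aligned to the originally tagged source words. If the aligned words are not contiguous in the target, the tag will also cover whatever lies between them, which is one of the limitations of this simple approach.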
