Hi,

it is not entirely clear to me what you are asking here.

Moses uses plain text, one sentence per line, as the data format for
both the parallel corpus and the input. So you would have to
convert your XML files into this format.
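
A minimal sketch of such a conversion, assuming each sentence sits in
its own element. The element name "seg" and the function name
xml_to_lines are made up for illustration; your files may be laid out
differently. Empty segments are kept as empty lines so that line i of
the source file still corresponds to line i of the target file.

```python
import xml.etree.ElementTree as ET

def xml_to_lines(xml_text, tag="seg"):
    """Return one whitespace-normalised line per <tag> element.

    The tag name "seg" is an assumption about the XML layout.
    """
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter(tag):
        # itertext() also collects text nested inside inline child tags.
        text = "".join(elem.itertext())
        # Collapse newlines and repeated spaces so each sentence is one line.
        lines.append(" ".join(text.split()))
    return lines

sample = "<doc><seg>Hello  world .</seg><seg>How are\nyou ?</seg></doc>"
print("\n".join(xml_to_lines(sample)))
```

You would run this once per language and write the two outputs to two
files with the same number of lines.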

If there are XML tags in the plain text, such as <b>bold tags</b>,
then there are a number of ways to deal with that. The easiest is
to just strip them out, remember their positions, and re-insert
them based on the word alignment.
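
A rough sketch of the strip-and-re-insert idea, not something Moses
does for you. It assumes the tags are whitespace-separated tokens,
that they are balanced open/close pairs, and that you already have the
word alignment as source-target index pairs such as "0-0 1-2 2-1". The
function names strip_tags and reinsert_tags are made up for this
example.

```python
import re

# A token counts as a tag if it looks like <name ...> or </name>.
TAG = re.compile(r"^</?\w+[^>]*>$")

def strip_tags(tokens):
    """Split tokens into plain words and (open, close, start, end) spans.

    start and end are indices into the returned word list (inclusive).
    Assumes balanced, well-formed open/close tag pairs.
    """
    words, spans, open_stack = [], [], []
    for tok in tokens:
        if not TAG.match(tok):
            words.append(tok)
        elif tok.startswith("</"):
            open_tag, start = open_stack.pop()
            spans.append((open_tag, tok, start, len(words) - 1))
        else:
            open_stack.append((tok, len(words)))
    return words, spans

def reinsert_tags(target, spans, alignment):
    """Wrap the target words aligned to each tagged source span."""
    before, after = {}, {}
    # Outer spans first, so nested tags come out in a sensible order.
    for open_tag, close_tag, start, end in sorted(
            spans, key=lambda s: (s[2], -s[3])):
        tgt = [t for s, t in alignment if start <= s <= end]
        if not tgt:
            continue  # no aligned target word: the tag is dropped
        before.setdefault(min(tgt), []).append(open_tag)
        after.setdefault(max(tgt), []).insert(0, close_tag)
    out = []
    for i, word in enumerate(target):
        out.extend(before.get(i, []))
        out.append(word)
        out.extend(after.get(i, []))
    return out

source = "a <b> red car </b>".split()
words, spans = strip_tags(source)          # words go to the decoder
target = "une voiture rouge".split()       # pretend decoder output
alignment = [tuple(map(int, p.split("-"))) for p in "0-0 1-2 2-1".split()]
result = " ".join(reinsert_tags(target, spans, alignment))
print(result)
```

The tag is placed around the smallest target range that covers all
words aligned to the tagged source words, so after reordering it may
also enclose a word that was not tagged in the source.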

-phi

On Sun, Dec 1, 2013 at 2:37 AM, Kalyani Baruah <[email protected]> wrote:
> Good day...
> My question is regarding XML. As I have said, I was using a text file as
> an input to Moses, but I was to give an XML input. In the text file I
> have my parallel corpus: not a file with probability ratios mentioned,
> just a parallel corpus collection in two languages. So how do I convert
> them to XML files, so that I can take those two XML files as input for Moses?
>
> Regards,
>
>
> Kalyanee Kanchan Baruah
> Department of Information Technology,
> Institute of Science and Technology,
> Gauhati University,Guwahati,India
> Phone- +91-9706242124
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>