You could use the files from OPUS but they are in XML: http://opus.lingfil.uu.se/MultiUN/raw/ar/ http://opus.lingfil.uu.se/MultiUN/raw/en/
The sentence alignment is standoff annotation in this file: http://opus.lingfil.uu.se/MultiUN/raw/ar-en.xml.gz You can use this script to convert to Moses format if you like: http://opus.lingfil.uu.se/tools/opus2moses Good luck! Jörg On Dec 7, 2014, at 3:33 PM, emna hkiri wrote: > Dear Friends; > I need the UN corpus only 2000 folder i need 2 file.true for the couple of > language arabic and english > > you sent me before the link of the complete UN corpus 2;4GO for ar and 1,6GO > for english but i just need the 2000 folder > > i hope you will help me again thank you very much in advance > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
