You could use the files from OPUS, but they are in XML:

http://opus.lingfil.uu.se/MultiUN/raw/ar/
http://opus.lingfil.uu.se/MultiUN/raw/en/

The sentence alignment is stored as standoff annotation in this file:
http://opus.lingfil.uu.se/MultiUN/raw/ar-en.xml.gz

You can use this script to convert to Moses format if you like:
http://opus.lingfil.uu.se/tools/opus2moses
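
If only one folder (for example 2000) is needed, the alignment file could be
filtered before any conversion. Below is a minimal Python sketch, not part of
the OPUS tools. It assumes the alignment file uses linkGrp elements with
fromDoc/toDoc attributes and link elements with an xtargets attribute of the
form "source ids;target ids", and that document paths look like
ar/2000/name.xml.gz. Both assumptions should be checked against the actual
file. The function name iter_links and its year parameter are hypothetical.

```python
import xml.etree.ElementTree as ET


def iter_links(source, year=None):
    """Yield (from_doc, to_doc, src_ids, tgt_ids) for each <link> element.

    `source` is a file name or a binary file object with the alignment XML.
    If `year` is given, only link groups whose fromDoc path contains that
    folder name are kept (assumed layout: ar/2000/...).
    """
    from_doc = to_doc = None
    keep = False
    # Stream the file so the whole alignment never has to fit in memory.
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start" and elem.tag == "linkGrp":
            from_doc = elem.get("fromDoc", "")
            to_doc = elem.get("toDoc", "")
            keep = year is None or f"/{year}/" in f"/{from_doc}"
        elif event == "end" and elem.tag == "link":
            if keep:
                # Assumed format: "2 3;2" means source 2+3 aligns to target 2.
                src, _, tgt = elem.get("xtargets", "").partition(";")
                yield from_doc, to_doc, src.split(), tgt.split()
            elem.clear()
        elif event == "end" and elem.tag == "linkGrp":
            elem.clear()  # drop processed links to free memory


# Possible usage, assuming ar-en.xml.gz has been downloaded locally:
#
#   import gzip
#   with gzip.open("ar-en.xml.gz", "rb") as f:
#       for from_doc, to_doc, src_ids, tgt_ids in iter_links(f, year="2000"):
#           print(from_doc, to_doc, src_ids, tgt_ids)
```

The sentence ids returned here still have to be looked up in the
corresponding raw XML documents to get the text itself.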

Good luck!
Jörg



On Dec 7, 2014, at 3:33 PM, emna hkiri wrote:

> Dear friends,
> I need only the 2000 folder of the UN corpus. I need the 2 file.true files
> for the Arabic-English language pair.
> 
> You previously sent me the link to the complete UN corpus (2.4 GB for Arabic
> and 1.6 GB for English), but I just need the 2000 folder.
> 
> I hope you can help me again. Thank you very much in advance.
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support

