Thanks Mikel, I'll do that.
Another question, in the script, I need to split the corpus into sentences. Is
there any tool in apertium which does this task (like NLTK tokenizer for
Python)? I can simply split the corpus on . or ? or !. But in that case,
sentences where '.' is not the last letter will create problem. For example
"Washington D.C. is a beautiful place." will be spitted into three parts:
"Washington D", "C", and "is a beautiful place".
It's not a big issue though :)
On Saturday, April 12, 2014 11:08 PM, Mikel Forcada <[email protected]> wrote:
Al 04/12/2014 05:09 PM, En/na Rafi Kamal ha escrit:
But how can I instruct apertium to run this script? I can modify the modes
file, but these files will be regenerated every time after executing the
makefile.
If you cannot solve this at structural transfer level, you can always add your
command to the corresponding modes in the modes.xml file. The makefiles will
generate the right scripts in the modes directory, I guess.
--
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone: +34 96 590 9776
Fax: +34 96 590 9326
------------------------------------------------------------------------------
Put Bad Developers to Shame
Dominate Development with Jenkins Continuous Integration
Continuously Automate Build, Test & Deployment
Start a new project now. Try Jenkins in the cloud.
http://p.sf.net/sfu/13600_Cloudbees
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff