On 03/26/2013 08:40 AM, Riccardo Tasso wrote:
> Is the Sentence Detector able to split also on non dot characters? In my
> case there should be also other characters delimiting the end of a segment,
> such as: colon (:), dash (-), various kind of quotation marks (", `, ',
> ...).

The Sentence Detector can only split on end-of-sentence characters. By
default these are . ! ? but with 1.5.3 you can set your own custom set
during training. There is a command line argument for it on the
Sentence Detector Trainer, have a look at the help.
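
To illustrate what a custom end-of-sentence set changes, here is a rough sketch. It is not OpenNLP code and not how the Sentence Detector is implemented: the class EosSplitSketch and its split method are made up for this example, and it naively splits after every character in the set, whereas the real detector only treats those characters as candidates and lets the trained model decide.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only, not part of OpenNLP.
class EosSplitSketch {

    // Splits text after every character contained in eosChars.
    // A trained detector would instead classify each candidate position.
    static List<String> split(String text, String eosChars) {
        List<String> segments = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < text.length(); i++) {
            if (eosChars.indexOf(text.charAt(i)) >= 0) {
                String segment = text.substring(start, i + 1).trim();
                if (!segment.isEmpty()) {
                    segments.add(segment);
                }
                start = i + 1;
            }
        }
        // Keep any trailing text that has no closing EOS character.
        String rest = text.substring(start).trim();
        if (!rest.isEmpty()) {
            segments.add(rest);
        }
        return segments;
    }

    public static void main(String[] args) {
        String text = "Note: this works. Does it?";
        System.out.println(split(text, ".!?"));   // default-style set
        System.out.println(split(text, ".!?:"));  // custom set with colon
    }
}
```

With the default-style set the colon is never a split point, so "Note: this works." stays in one piece. Once the colon is in the set, "Note:" becomes its own segment.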
If you don't want to compile it yourself, use the 1.5.3 RC2, which we are
currently testing.
Jörn