-------- Original Message --------
Subject: Re: Asian Sentence Detector Models
Date: Thu, 22 Mar 2012 20:12:09 +0900
From: wl.gao.tkl@gmail <[email protected]>
To: Jörn Kottmann <[email protected]>
On 03/22/2012 06:50 PM, Jörn Kottmann wrote:
On 03/22/2012 10:01 AM, wl.gao.tkl@gmail wrote:
You can. Almost every time we use this symbol to signal the end of a
sentence.
However, sometimes, it can be missing, especially in a dialogue or
chatroom.
The OpenNLP Sentence Detector actual does sentence boundary
disambiguation,
in English or European languages usually these: !, ?,. are used to
indicate a sentence
boundary, but often they are used for other things as well, e.g. in
abbreviations.
If the sentence boundary is missing it will never split there.
Jörn
Sorry, I have to add that if this mark is missing, then it might be
obvious there are no need to separate this text into sentences. (only
one sentence)
? - are used in Chinese and Japanese as question mark.
! - emphasise/strongly opinionated
. - not used in C&J
, - for segmentation of one sentence. just like English.
Thank you Jörn ,for the reminding.
Gao