Quick note:

If you implement this, ensure NLP analyses square-bracketed text [],
as that often contains whole sentences.

Probably the best solution would be to add in a condition that if the
word begins with a capital and ends in a full-stop, it is not a
sentence.

On Mon, Nov 28, 2011 at 10:26 AM, Yuan Luo <[email protected]> wrote:
> Hi,
> Does the team have plans to deal with bracketed text? For example
> The sentence
> An EGD on 10/24/06 showed mild antral erosions ( mild regeneration ,
> nonspecific; no H. pylori ).
> will be split into two at "H." by the Opennlp 1.5.1 with 1.5.0 models.
> Intuitively, it would be natural to separate bracketed texts from affecting
> sentence breakers.
>
> Thanks,
> Yuan

Reply via email to