Quick note: if you implement this, make sure the NLP still analyses square-bracketed text [], as it often contains whole sentences.
Probably the best solution would be to add a condition: if a word begins with a capital letter and ends in a full stop, that full stop is not treated as the end of a sentence.

On Mon, Nov 28, 2011 at 10:26 AM, Yuan Luo <[email protected]> wrote:
> Hi,
> Does the team have plans to deal with bracketed text? For example
> The sentence
> An EGD on 10/24/06 showed mild antral erosions ( mild regeneration ,
> nonspecific; no H. pylori ).
> will be split into two at "H." by the Opennlp 1.5.1 with 1.5.0 models.
> Intuitively, it would be natural to separate bracketed texts from affecting
> sentence breakers.
>
> Thanks,
> Yuan
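
P.S. For illustration, here is a rough sketch of the capital-plus-full-stop condition suggested at the top of this reply. It is plain Python over whitespace tokens, not OpenNLP code, and the names looks_like_abbreviation, split_sentences and max_letters are all made up for the example. The max_letters cap is my own assumption, not part of the suggestion as stated.

```python
import re
from typing import List


def looks_like_abbreviation(token: str, max_letters: int = 1) -> bool:
    """True if the token is a capital letter, then at most (max_letters - 1)
    further letters, then a full stop, e.g. "H." with the default cap.
    The cap is an assumption added for this sketch; max_letters must be >= 1."""
    pattern = r"[A-Z][A-Za-z]{0,%d}\." % (max_letters - 1)
    return re.fullmatch(pattern, token) is not None


def split_sentences(text: str, max_letters: int = 1) -> List[str]:
    """Naive splitter over whitespace tokens: break after a token ending in
    '.', '!' or '?', unless that token looks like an abbreviation."""
    sentences: List[str] = []
    current: List[str] = []
    for token in text.split():
        current.append(token)
        ends_with_terminator = token[-1] in ".!?"
        if ends_with_terminator and not looks_like_abbreviation(token, max_letters):
            sentences.append(" ".join(current))
            current = []
    if current:
        # Keep any trailing text that had no terminator.
        sentences.append(" ".join(current))
    return sentences


example = (
    "An EGD on 10/24/06 showed mild antral erosions ( mild regeneration , "
    "nonspecific; no H. pylori ). It was benign."
)
for sentence in split_sentences(example):
    print(sentence)
```

With the default cap, "H." no longer ends a sentence, so the bracketed text stays with its sentence. Without some cap, the condition taken literally would also swallow the boundary after any capitalised word that really does end a sentence (e.g. "Paris."), so the real rule probably needs a length limit or an abbreviation list.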
