On 03/19/2012 09:55 PM, [email protected] wrote:
I don't know if it is conclusive, but with the changes (case insensitive, remove non word chars) the sentence detector performed worse at least for my Portuguese corpus.
Maybe it is matching now at places where it should not match (and did not match before) ?
Jörn
