On 8/15/11 2:47 PM, György Chityil wrote:
It seems Dr. and the first double quotes are not tokenized. I guess Dr. should not be tokenized, while the double quotes are missed in this case.
You are getting this as a token: "Dr.It is not a bug in our code, but rather a problem with the statistical model, usually
such mistakes are fixed by adding more training data. Jörn