Hi,I am trying to optimize my sentence detector model by adding an abbreviation dictionary.
Can anybody give some hints on best practices which abbreviations to add here? E.g., only very frequent ones? Problematic ones? Any?
I just experimented with a very big abbreviation dictionary and found that, in german medical patient records, this rather decreases performance.
Any experiences were abbreviation dictionaries improved performance ? Best Katrin